BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 014083
         (431 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|183579871|dbj|BAG28345.1| arabinofuranosidase [Citrus unshiu]
          Length = 769

 Score =  878 bits (2269), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 425/430 (98%), Positives = 425/430 (98%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL GKYAASYVR LQGNTGSRLKVA
Sbjct: 142 MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRRLQGNTGSRLKVA 201

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG
Sbjct: 202 ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 261

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
           KPTCADPDILKNTI GQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC
Sbjct: 262 KPTCADPDILKNTIRGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 321

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFLAIHTEGAVRGGLLREEDVNLA AYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA
Sbjct: 322 GPFLAIHTEGAVRGGLLREEDVNLASAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 381

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP
Sbjct: 382 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 441

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           LQGISRYAKTIHQAGC GVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL
Sbjct: 442 LQGISRYAKTIHQAGCLGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 501

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
           LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA
Sbjct: 502 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 561

Query: 421 DVLFGRANPG 430
           DVLFGRANPG
Sbjct: 562 DVLFGRANPG 571


>gi|224111912|ref|XP_002316021.1| predicted protein [Populus trichocarpa]
 gi|222865061|gb|EEF02192.1| predicted protein [Populus trichocarpa]
          Length = 768

 Score =  778 bits (2008), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 377/431 (87%), Positives = 396/431 (91%), Gaps = 3/431 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGGMAGLTYWSPNVN+FRDPRWGRGQETPGEDPV+ GKYAASYVRGLQGN G RLKVA
Sbjct: 142 MYNGGMAGLTYWSPNVNVFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNNGLRLKVA 201

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFK+CVV GKVASVMCSYNQVNG
Sbjct: 202 ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKSCVVAGKVASVMCSYNQVNG 261

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
           KPTCADP +LKNTI G+W L+GYIVSDCDSVGVL++TQHYT TPEEAAA  I+AGLDLDC
Sbjct: 262 KPTCADPYLLKNTIRGEWGLNGYIVSDCDSVGVLFDTQHYTATPEEAAASTIRAGLDLDC 321

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFLAIHTE AV+GGLL+EEDVN+ALA TITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA
Sbjct: 322 GPFLAIHTENAVKGGLLKEEDVNMALANTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 381

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           HQQLALQAA QGIVLL+N  RTLPLS TL+  TVAVIGPNSDVTVTMIGNYAGVACGYTT
Sbjct: 382 HQQLALQAARQGIVLLQNRGRTLPLSRTLQ--TVAVIGPNSDVTVTMIGNYAGVACGYTT 439

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           PLQGI RYAKT+H  GC  V CNGNQ   AAEVAAR ADAT+LVMGLDQSIEAEF DR G
Sbjct: 440 PLQGIRRYAKTVHHPGCNDVFCNGNQQFNAAEVAARHADATILVMGLDQSIEAEFRDRKG 499

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           LLLPG QQELVS VA+ASRGP +LVLM GGP+DVSFAKNDPRIGAILWVGYPGQAGGAAI
Sbjct: 500 LLLPGYQQELVSIVARASRGPTILVLMSGGPIDVSFAKNDPRIGAILWVGYPGQAGGAAI 559

Query: 420 ADVLFGRANPG 430
           ADVLFG ANPG
Sbjct: 560 ADVLFGTANPG 570


>gi|255556320|ref|XP_002519194.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
 gi|223541509|gb|EEF43058.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
          Length = 782

 Score =  758 bits (1956), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 370/430 (86%), Positives = 394/430 (91%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGG+AGLTYWSPNVN+FRDPRWGRGQETPGEDPVL GKYAASYVRGLQ +TG +LKVA
Sbjct: 156 MYNGGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQSSTGLKLKVA 215

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTY+VPFKACVVEGKVASVMCSYNQVNG
Sbjct: 216 ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYDVPFKACVVEGKVASVMCSYNQVNG 275

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
           KPTCADP +LKNTI GQW L+GYIVSDCDSVGVLY+ QHYT TPEEAAA  IKAGLDLDC
Sbjct: 276 KPTCADPILLKNTIRGQWGLNGYIVSDCDSVGVLYDNQHYTSTPEEAAAATIKAGLDLDC 335

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFLAIHTE AV+ GLL EEDVNLALA TITVQMRLGMFDGEPSA P+GNLGPRDVCTPA
Sbjct: 336 GPFLAIHTENAVKKGLLVEEDVNLALANTITVQMRLGMFDGEPSAHPYGNLGPRDVCTPA 395

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           HQ+LAL+AA QGIVLL+N  + LPLS+ RHHT+AVIGPNSDVTVTMIGNYAG+AC YT+P
Sbjct: 396 HQELALEAARQGIVLLENRGQALPLSSSRHHTIAVIGPNSDVTVTMIGNYAGIACKYTSP 455

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           LQGISRYAKT+HQ GC  VAC+ NQ  GAAE AARQADATVLVMGLDQSIEAEF DR GL
Sbjct: 456 LQGISRYAKTLHQNGCGDVACHSNQQFGAAEAAARQADATVLVMGLDQSIEAEFRDRVGL 515

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
           LLPG QQELVSRVA+ASRGP +LVLM GGP+DVSFAKNDPR+GAILW GYPGQAGGAAIA
Sbjct: 516 LLPGHQQELVSRVARASRGPTILVLMSGGPIDVSFAKNDPRVGAILWAGYPGQAGGAAIA 575

Query: 421 DVLFGRANPG 430
           DVLFG  NPG
Sbjct: 576 DVLFGTTNPG 585


>gi|408354264|gb|AFU54451.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
          Length = 775

 Score =  749 bits (1933), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 368/431 (85%), Positives = 391/431 (90%), Gaps = 1/431 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN-TGSRLKV 59
           MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL  KYAA YV+GLQG+  G+RLKV
Sbjct: 146 MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQGDGAGNRLKV 205

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           AACCKHYTAYDLDNWNGV+R+HFNARVSKQDL DTYNVPFKACVVEG VASVMCSYNQVN
Sbjct: 206 AACCKHYTAYDLDNWNGVNRFHFNARVSKQDLADTYNVPFKACVVEGHVASVMCSYNQVN 265

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           GKPTCADPD+LK TI GQWRL+GYIVSDCDSVGVLY  QHYTRTPEEAAADAIKAGLDLD
Sbjct: 266 GKPTCADPDLLKGTIRGQWRLNGYIVSDCDSVGVLYEEQHYTRTPEEAAADAIKAGLDLD 325

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CGPFLAIHTE AVR GL+ + ++N ALA T+TVQMRLGMFDGEPSA  +GNLGPRDVCTP
Sbjct: 326 CGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQYGNLGPRDVCTP 385

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           AHQQLAL+AA QGIVLL+N  R+LPLS  RH TVAVIGPNSDVTVTMIGNYAGVACGYTT
Sbjct: 386 AHQQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMIGNYAGVACGYTT 445

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           PLQGI RY +TIHQAGC  V CNGNQL GAAE AARQADATVLVMGLDQSIEAEF+DR G
Sbjct: 446 PLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQSIEAEFVDRVG 505

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           LLLPG QQELVSRVA+ASRGP +LVLM GGP+DV+FAKNDPRI AI+WVGYPGQAGG AI
Sbjct: 506 LLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWVGYPGQAGGTAI 565

Query: 420 ADVLFGRANPG 430
           ADVLFG  NPG
Sbjct: 566 ADVLFGTTNPG 576


>gi|408354266|gb|AFU54452.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
          Length = 775

 Score =  749 bits (1933), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 368/431 (85%), Positives = 391/431 (90%), Gaps = 1/431 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN-TGSRLKV 59
           MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL  KYAA YV+GLQG+  G+RLKV
Sbjct: 146 MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQGDGAGNRLKV 205

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           AACCKHYTAYDLDNWNGV+R+HFNARVSKQDL DTYNVPFKACVVEG VASVMCSYNQVN
Sbjct: 206 AACCKHYTAYDLDNWNGVNRFHFNARVSKQDLADTYNVPFKACVVEGHVASVMCSYNQVN 265

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           GKPTCADPD+LK TI GQWRL+GYIVSDCDSVGVLY  QHYTRTPEEAAADAIKAGLDLD
Sbjct: 266 GKPTCADPDLLKGTIRGQWRLNGYIVSDCDSVGVLYEEQHYTRTPEEAAADAIKAGLDLD 325

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CGPFLAIHTE AVR GL+ + ++N ALA T+TVQMRLGMFDGEPSA  +GNLGPRDVCTP
Sbjct: 326 CGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQYGNLGPRDVCTP 385

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           AHQQLAL+AA QGIVLL+N  R+LPLS  RH TVAVIGPNSDVTVTMIGNYAGVACGYTT
Sbjct: 386 AHQQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMIGNYAGVACGYTT 445

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           PLQGI RY +TIHQAGC  V CNGNQL GAAE AARQADATVLVMGLDQSIEAEF+DR G
Sbjct: 446 PLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQSIEAEFVDRVG 505

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           LLLPG QQELVSRVA+ASRGP +LVLM GGP+DV+FAKNDPRI AI+WVGYPGQAGG AI
Sbjct: 506 LLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWVGYPGQAGGTAI 565

Query: 420 ADVLFGRANPG 430
           ADVLFG  NPG
Sbjct: 566 ADVLFGTTNPG 576


>gi|224099193|ref|XP_002311398.1| predicted protein [Populus trichocarpa]
 gi|222851218|gb|EEE88765.1| predicted protein [Populus trichocarpa]
          Length = 755

 Score =  741 bits (1913), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/430 (86%), Positives = 394/430 (91%), Gaps = 1/430 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           M+NGGMAGLTYWSPNVN+FRDPRWGRGQETPGEDPV+ GKYAASYVRGLQGN+G RLKVA
Sbjct: 134 MFNGGMAGLTYWSPNVNVFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNSGFRLKVA 193

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTY+VPFK+CVVEGKVASVMCSYNQVNG
Sbjct: 194 ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYDVPFKSCVVEGKVASVMCSYNQVNG 253

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
           KPTCADP++LKNTI G+WRL+GYIVSDCDSVGVLY  QHYT TPEEAAA  IKAGLDLDC
Sbjct: 254 KPTCADPNLLKNTIRGEWRLNGYIVSDCDSVGVLYENQHYTATPEEAAAATIKAGLDLDC 313

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFLAIHTE AV+GGLL EEDVN+ALA TITVQMRLG+FDGEPSAQPFG LGPRDVCTPA
Sbjct: 314 GPFLAIHTENAVKGGLLNEEDVNMALANTITVQMRLGLFDGEPSAQPFGKLGPRDVCTPA 373

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           HQQLAL AA QGIVLL+NS RTLPLS   + TVAVIGP +DVTVTMIGNYAGVACGYTTP
Sbjct: 374 HQQLALHAAQQGIVLLQNSGRTLPLSR-PNLTVAVIGPIADVTVTMIGNYAGVACGYTTP 432

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           LQGISRYAKTIHQ+GC  VACNGNQ  G AE AA QADATVLVMGLDQSIEAEF DR  L
Sbjct: 433 LQGISRYAKTIHQSGCIDVACNGNQQFGMAEAAASQADATVLVMGLDQSIEAEFRDRKDL 492

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
           LLPG QQEL+SRVA+ASRGP +LVLM GGP+DVSFAKNDPRIGAILW GYPGQAGGAAIA
Sbjct: 493 LLPGYQQELISRVARASRGPTILVLMSGGPIDVSFAKNDPRIGAILWAGYPGQAGGAAIA 552

Query: 421 DVLFGRANPG 430
           DVLFG  NPG
Sbjct: 553 DVLFGTTNPG 562


>gi|298364130|gb|ADI79208.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Malus x domestica]
          Length = 774

 Score =  737 bits (1903), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 351/430 (81%), Positives = 384/430 (89%), Gaps = 1/430 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN-TGSRLKV 59
           MYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP+L  KY A YV+GLQG+  G+RLKV
Sbjct: 145 MYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPILAAKYGARYVKGLQGDGAGNRLKV 204

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           AACCKHYTAYDLDNWNGVDR+HFNARVSKQDLEDTYNVPF+ACVV+G VASVMCSYNQVN
Sbjct: 205 AACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFRACVVDGNVASVMCSYNQVN 264

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           GKPTCADP++LK TI GQW+L+GYIVSDCDSVGV Y+ QHYT+TPEEAAA AIKAGLDLD
Sbjct: 265 GKPTCADPELLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHYTKTPEEAAAYAIKAGLDLD 324

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CGPFL IHTE AVR G + E D+N ALA TITVQMRLGMFDGEPSAQ +GNLG  DVC P
Sbjct: 325 CGPFLGIHTEAAVRFGQVNEIDINYALANTITVQMRLGMFDGEPSAQRYGNLGLADVCKP 384

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           +  +LAL+AA QGIVLL+N   +LPLST+RH TVAVIGPNSDVT TMIGNYAG+ACGYTT
Sbjct: 385 SSNELALEAARQGIVLLENRGNSLPLSTMRHRTVAVIGPNSDVTETMIGNYAGIACGYTT 444

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           PLQGI+RY +TIHQAGC  V CNGNQLIGAAEVAARQADATVLV+GLDQSIEAEF DR  
Sbjct: 445 PLQGIARYTRTIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEAEFRDRTD 504

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           LLLPG QQELVSRVA+ASRGP +LV+M GGP+DV+FAKNDPRIGAI+WVGYPGQAGG AI
Sbjct: 505 LLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPRIGAIIWVGYPGQAGGTAI 564

Query: 420 ADVLFGRANP 429
           ADVLFG  NP
Sbjct: 565 ADVLFGTTNP 574


>gi|32481073|gb|AAP83934.1| auxin-induced beta-glucosidase [Chenopodium rubrum]
          Length = 767

 Score =  734 bits (1895), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 355/431 (82%), Positives = 382/431 (88%), Gaps = 1/431 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKV 59
           MYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDP L  +YAASYVRGLQG    +RLKV
Sbjct: 140 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLASQYAASYVRGLQGIYNKNRLKV 199

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           AACCKHYTAYDLDNWN VDR+HFNA+VSKQDLEDTYNVPFK CV EG+VASVMCSYNQVN
Sbjct: 200 AACCKHYTAYDLDNWNAVDRFHFNAKVSKQDLEDTYNVPFKGCVQEGRVASVMCSYNQVN 259

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           GKPTCADPD+L+NTI GQWRL+GYIVSDCDSVGVLY+ QHYTRTPEEAAAD IKAGLDLD
Sbjct: 260 GKPTCADPDLLRNTIRGQWRLNGYIVSDCDSVGVLYDDQHYTRTPEEAAADTIKAGLDLD 319

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CGPFLA+HTE AV+ GLL E DVN AL  T TVQMRLGMFDGE +AQPFG+LGP+DVC+P
Sbjct: 320 CGPFLAVHTEAAVKRGLLTEADVNQALTNTFTVQMRLGMFDGEAAAQPFGHLGPKDVCSP 379

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           AHQ LALQAA QGIVLL+N  R+LPLST RH  +AVIGPN+D TVTMIGNYAGVACGYT+
Sbjct: 380 AHQDLALQAARQGIVLLQNRGRSLPLSTARHRNIAVIGPNADATVTMIGNYAGVACGYTS 439

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           PLQGI+RYAKT+HQAGC GVAC  NQ  GAA  AA  ADATVLVMGLDQSIEAEF DRA 
Sbjct: 440 PLQGIARYAKTVHQAGCIGVACTSNQQFGAATAAAAHADATVLVMGLDQSIEAEFRDRAS 499

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           +LLPG QQELVS+VA ASRGP +LVLMCGGPVDV+FAKNDP+I AILWVGYPGQAGG AI
Sbjct: 500 VLLPGHQQELVSKVALASRGPTILVLMCGGPVDVTFAKNDPKISAILWVGYPGQAGGTAI 559

Query: 420 ADVLFGRANPG 430
           ADVLFG  NPG
Sbjct: 560 ADVLFGTTNPG 570


>gi|357442285|ref|XP_003591420.1| Beta xylosidase [Medicago truncatula]
 gi|355480468|gb|AES61671.1| Beta xylosidase [Medicago truncatula]
          Length = 765

 Score =  731 bits (1887), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 343/430 (79%), Positives = 376/430 (87%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDP+L GKYAASYVRGLQG   SRLKVA
Sbjct: 137 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQGTDSSRLKVA 196

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A CKH+TAYDLDNWNGVDR+HFNA+VSKQD+EDT+NVPF+ CV EG VASVMCSYNQVNG
Sbjct: 197 ASCKHFTAYDLDNWNGVDRFHFNAKVSKQDMEDTFNVPFRMCVKEGNVASVMCSYNQVNG 256

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
            PTCADP++LK TI GQW LDGYIVSDCDSVGV Y  QHYT TPEEAAADAIKAGLDLDC
Sbjct: 257 VPTCADPNLLKRTIRGQWHLDGYIVSDCDSVGVFYTNQHYTSTPEEAAADAIKAGLDLDC 316

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFLA HT+ AV+ GLL E DVN ALA T+TVQMRLGMFDGEPSAQP+GNLGP DVCTP 
Sbjct: 317 GPFLAQHTQNAVKKGLLTETDVNGALANTLTVQMRLGMFDGEPSAQPYGNLGPTDVCTPT 376

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           HQ+LAL AA QGIVLLKN+  +LPLST  H TVAVIGPNS+ TVTMIGNYAG+ACGYT+P
Sbjct: 377 HQELALDAARQGIVLLKNTGPSLPLSTKNHQTVAVIGPNSNATVTMIGNYAGIACGYTSP 436

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           LQGI +YA+TIH+ GC  VACN ++  G+A  AARQADATVLVMGLDQSIEAE +DR GL
Sbjct: 437 LQGIGKYARTIHEPGCANVACNDDKQFGSALNAARQADATVLVMGLDQSIEAEMVDRTGL 496

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
           LLPG QQ+LVS+VA ASRGP +LVLM GGP+D++FAKNDPRI  ILW GYPGQAGGAAIA
Sbjct: 497 LLPGHQQDLVSKVAAASRGPTILVLMSGGPIDITFAKNDPRIMGILWAGYPGQAGGAAIA 556

Query: 421 DVLFGRANPG 430
           D+LFG  NPG
Sbjct: 557 DILFGTTNPG 566


>gi|356529243|ref|XP_003533205.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
          Length = 774

 Score =  728 bits (1878), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 349/431 (80%), Positives = 385/431 (89%), Gaps = 1/431 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN-TGSRLKV 59
           MYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDP L  KYAASYV+GLQG+  G+RLKV
Sbjct: 150 MYNGGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKGLQGDGAGNRLKV 209

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           AACCKHYTAYDLDNWNGVDR+HFNA+VSKQDLEDTY+VPFKACV+EG+VASVMCSYNQVN
Sbjct: 210 AACCKHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPFKACVLEGQVASVMCSYNQVN 269

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           GKPTCADPD+L+NTI GQW L+GYIVSDCDSVGV ++ QHYTRTPEEAAA+AIKAGLDLD
Sbjct: 270 GKPTCADPDLLRNTIRGQWGLNGYIVSDCDSVGVFFDNQHYTRTPEEAAAEAIKAGLDLD 329

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CGPFLAIHT+ A+R GL+ E D+NLALA  ITVQMRLGMFDGEPS QPFGNLGPRDVCTP
Sbjct: 330 CGPFLAIHTDSAIRKGLISENDLNLALANLITVQMRLGMFDGEPSTQPFGNLGPRDVCTP 389

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           AHQQLAL+AA + IVLL+N   +LPLS  R   V VIGPN+D TVTMIGNYAGVACGYTT
Sbjct: 390 AHQQLALEAARESIVLLQNKGNSLPLSPSRLRIVGVIGPNTDATVTMIGNYAGVACGYTT 449

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           PLQGI+RY KT HQ GC GVAC GN+L GAAE+ ARQ DATVLVMGLDQ+IEAE  DR G
Sbjct: 450 PLQGIARYVKTAHQVGCRGVACRGNELFGAAEIIARQVDATVLVMGLDQTIEAETRDRVG 509

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           LLLPG QQELV+RVA+A++GPV+LV+M GGPVDVSFAKN+P+I AILWVGYPGQAGG AI
Sbjct: 510 LLLPGLQQELVTRVARAAKGPVILVIMSGGPVDVSFAKNNPKISAILWVGYPGQAGGTAI 569

Query: 420 ADVLFGRANPG 430
           ADV+FG  NPG
Sbjct: 570 ADVIFGATNPG 580


>gi|86553064|gb|AAS17751.2| beta xylosidase [Fragaria x ananassa]
          Length = 772

 Score =  726 bits (1875), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 359/430 (83%), Positives = 392/430 (91%), Gaps = 1/430 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN-TGSRLKV 59
           MYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPVL+ KYAASYV+GLQG+  G+RLKV
Sbjct: 146 MYNGGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSAKYAASYVKGLQGDGAGNRLKV 205

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           AACCKHYTAYDLDNWNGVDR+HFNARVSKQDL DTY+VPF+ CV+EGKVASVMCSYNQVN
Sbjct: 206 AACCKHYTAYDLDNWNGVDRFHFNARVSKQDLADTYDVPFRGCVLEGKVASVMCSYNQVN 265

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           GKPTCADPD+LKNTI G+W+L+GYIVSDCDSVGV Y+ QHYTRTPEEAAA+AIKAGLDLD
Sbjct: 266 GKPTCADPDLLKNTIRGEWKLNGYIVSDCDSVGVFYDQQHYTRTPEEAAAEAIKAGLDLD 325

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CGPFLAIHTEGA++ GLL E DV+ ALA T+TVQMRLGMFDGEPSAQ +GNLGPRDVCTP
Sbjct: 326 CGPFLAIHTEGAIKAGLLPEIDVDYALANTLTVQMRLGMFDGEPSAQQYGNLGPRDVCTP 385

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           AHQ+LAL+A+ QGIVLL+N+  TLPLST+RH TVAV+GPNSDVT TMIGNYAGVACGYTT
Sbjct: 386 AHQELALEASRQGIVLLQNNGHTLPLSTVRHRTVAVVGPNSDVTETMIGNYAGVACGYTT 445

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           PLQGI RY KTIHQ GC  VAC  NQL GAAE AARQADATVLVMGLDQSIEAEF DR  
Sbjct: 446 PLQGIGRYTKTIHQQGCTNVACTTNQLFGAAEAAARQADATVLVMGLDQSIEAEFRDRTD 505

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           L++PG QQELVSRVA+ASRGP VLVLM GGP+DVSFAKNDP+IGAI+WVGYPGQAGG A+
Sbjct: 506 LVMPGHQQELVSRVARASRGPTVLVLMSGGPIDVSFAKNDPKIGAIIWVGYPGQAGGTAM 565

Query: 420 ADVLFGRANP 429
           ADVLFG  NP
Sbjct: 566 ADVLFGTTNP 575


>gi|157041199|dbj|BAF79669.1| beta-D-xylosidase [Pyrus pyrifolia]
          Length = 774

 Score =  726 bits (1875), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 352/430 (81%), Positives = 383/430 (89%), Gaps = 1/430 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN-TGSRLKV 59
           MYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDPVL  KY A YV+GLQG+  G+RLKV
Sbjct: 145 MYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQGDGAGNRLKV 204

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           AACCKHYTAYDLDNWNGVDR+HFNARVSKQDLEDTYNVPFKACVV+G VASVMCSYNQVN
Sbjct: 205 AACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFKACVVDGNVASVMCSYNQVN 264

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           GKPTCADPD+LK TI GQW+L+GYIVSDCDSVGV Y+ QHYT+TPE AAA AIKAGLDLD
Sbjct: 265 GKPTCADPDLLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHYTKTPEAAAAYAIKAGLDLD 324

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CGPFL IHTE A+R G + E D+N ALA TITVQMRLGMFDGEPS Q +GNLG  DVC P
Sbjct: 325 CGPFLGIHTEAAIRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLGLADVCKP 384

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           +  +LAL+AA QGIVLL+N   +LPLST+RH TVAVIGPNSDVT TMIGNYAG+ACGYTT
Sbjct: 385 SSNELALEAARQGIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAGIACGYTT 444

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           PLQGI+RY +TIHQAGC  V CNGNQLIGAAEVAARQADATVLV+GLDQSIEAEF DR G
Sbjct: 445 PLQGIARYTRTIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEAEFRDRTG 504

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           LLLPG QQELVSRVA+ASRGP +LV+M GGP+DV+FAKNDPRIGAI+WVGYPGQAGG AI
Sbjct: 505 LLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPRIGAIIWVGYPGQAGGTAI 564

Query: 420 ADVLFGRANP 429
           ADVLFG  NP
Sbjct: 565 ADVLFGTTNP 574


>gi|225437531|ref|XP_002270249.1| PREDICTED: probable beta-D-xylosidase 2 [Vitis vinifera]
 gi|297743965|emb|CBI36935.3| unnamed protein product [Vitis vinifera]
          Length = 768

 Score =  725 bits (1871), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 338/430 (78%), Positives = 379/430 (88%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDPVL GKYAA YVRGLQGN G RLKVA
Sbjct: 143 MYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAARYVRGLQGNAGDRLKVA 202

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKH+TAYDLDNWNGVDR+HF+ARVSKQ++EDT++VPF++CVVEGKVASVMCSYNQVNG
Sbjct: 203 ACCKHFTAYDLDNWNGVDRFHFDARVSKQEMEDTFDVPFRSCVVEGKVASVMCSYNQVNG 262

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
            PTCADP++L+NT+  QW L+GY+VSDCDSVGV Y+ QHYT TPEEAAADAIKAGLDLDC
Sbjct: 263 VPTCADPNLLRNTVRKQWHLNGYVVSDCDSVGVFYDNQHYTNTPEEAAADAIKAGLDLDC 322

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFLA+HT+ A++ GL+ E DV+ AL  T+TVQMRLGMFDGEPSAQPFG+LGP+DVC+PA
Sbjct: 323 GPFLAVHTQDAIKKGLVSEADVDSALVNTVTVQMRLGMFDGEPSAQPFGDLGPKDVCSPA 382

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           HQ+LA++AA QGIVLLKN   +LPLST  H ++AVIGPNSD  VTMIGNYAG+ C YTTP
Sbjct: 383 HQELAIEAARQGIVLLKNHGHSLPLSTRSHRSIAVIGPNSDANVTMIGNYAGIPCEYTTP 442

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           LQGI RY++TIHQ GC  VAC+ +QL   A  AA QADATVLVMGLDQSIEAE  DRA L
Sbjct: 443 LQGIGRYSRTIHQKGCADVACSEDQLFAGAIDAASQADATVLVMGLDQSIEAEAKDRADL 502

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
           LLPGRQQELVS+VA ASRGP VLVLM GGPVDVSFAK DPRI AI+W GYPGQAGGAAIA
Sbjct: 503 LLPGRQQELVSKVAMASRGPTVLVLMSGGPVDVSFAKKDPRIAAIVWAGYPGQAGGAAIA 562

Query: 421 DVLFGRANPG 430
           D+LFG ANPG
Sbjct: 563 DILFGVANPG 572


>gi|255548487|ref|XP_002515300.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223545780|gb|EEF47284.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 768

 Score =  724 bits (1869), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 342/430 (79%), Positives = 376/430 (87%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGG AGLTYWSPNVNI RDPRWGRGQETPGEDP+L GKYAASYV+GLQGN G RLKVA
Sbjct: 142 MYNGGAAGLTYWSPNVNILRDPRWGRGQETPGEDPLLVGKYAASYVKGLQGNDGERLKVA 201

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKH+TAYDLDNWNGVDR+HFNA+VSKQD++DT++VPF+ CV EGKVASVMCSYNQVNG
Sbjct: 202 ACCKHFTAYDLDNWNGVDRFHFNAKVSKQDMKDTFDVPFRMCVKEGKVASVMCSYNQVNG 261

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
            PTCADP++L+ T+  QW L+GYIVSDCDSVGV Y+ QHYT TPEEAAADAIKAGLDLDC
Sbjct: 262 IPTCADPNLLRKTVRTQWGLNGYIVSDCDSVGVFYDKQHYTSTPEEAAADAIKAGLDLDC 321

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFLA+HT+ AV+ GL+ E DVN AL  T+TVQMRLGMFDGEPSAQP+GNLGP+DVCTPA
Sbjct: 322 GPFLAVHTQDAVKRGLISEADVNGALFNTLTVQMRLGMFDGEPSAQPYGNLGPKDVCTPA 381

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           HQ+LAL+A  QGIVLLKN   +LPLS  RH TVA+IGPNS+VTVTMIGNYAGVAC YTTP
Sbjct: 382 HQELALEAGRQGIVLLKNHGPSLPLSPRRHRTVAIIGPNSNVTVTMIGNYAGVACQYTTP 441

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           LQGI  YAKTIHQ GC  V C  +QL   A  AARQADATVLVMGLDQSIEAEF DR GL
Sbjct: 442 LQGIGSYAKTIHQQGCADVGCVTDQLFSGAIDAARQADATVLVMGLDQSIEAEFRDRTGL 501

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
           LLPGRQQELVS+VA AS+GP +LVLM GGP+DVSFAK DP+I AILW GYPGQAGGAAIA
Sbjct: 502 LLPGRQQELVSKVAMASKGPTILVLMSGGPIDVSFAKKDPKIAAILWAGYPGQAGGAAIA 561

Query: 421 DVLFGRANPG 430
           DVLFG  NPG
Sbjct: 562 DVLFGTINPG 571


>gi|318136853|gb|ADV41671.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Actinidia deliciosa
           var. deliciosa]
          Length = 634

 Score =  723 bits (1866), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 351/430 (81%), Positives = 380/430 (88%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGGMAGLT+WSPNVNIFRDPRWGRGQETPGEDP+L G YAASYVRGLQGN G RLKVA
Sbjct: 6   MYNGGMAGLTFWSPNVNIFRDPRWGRGQETPGEDPMLAGNYAASYVRGLQGNDGERLKVA 65

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKHYTAYDLDNW GVDR+HFNARVSKQD++DT+ +PF+ CV+ GKVASVMCSYNQVNG
Sbjct: 66  ACCKHYTAYDLDNWRGVDRFHFNARVSKQDIKDTFEIPFRECVLGGKVASVMCSYNQVNG 125

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
            PTCA+P +LK TI G WRL+GYIVSDCDSVGV +  QHYT  PEEA A AIKAGLDLDC
Sbjct: 126 IPTCANPKLLKGTIRGSWRLNGYIVSDCDSVGVFFENQHYTSKPEEAVAAAIKAGLDLDC 185

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFLAIHTE AVR GL+ + ++N ALA T+T QMRLGMFDGEPSA  +GNLGPRDVCTPA
Sbjct: 186 GPFLAIHTEAAVRRGLVSQLEINWALANTMTAQMRLGMFDGEPSAHQYGNLGPRDVCTPA 245

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           HQQLAL+AA QGIVLL+N  R+LPLS  RH TVAVIGPNSDVTVTMIGNYAGVACGYTTP
Sbjct: 246 HQQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMIGNYAGVACGYTTP 305

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           LQGI RY +TIHQAGC  V CNGNQL GAAE AARQADATVLVMGLDQSIEAEF+DRAG 
Sbjct: 306 LQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQSIEAEFVDRAGP 365

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
           LLPG QQELVSRVA+ASRGP +LVLM GGP+DV+FAKNDPRI AI+WVGYPGQAGG AIA
Sbjct: 366 LLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWVGYPGQAGGTAIA 425

Query: 421 DVLFGRANPG 430
           DVLFG  NPG
Sbjct: 426 DVLFGTTNPG 435


>gi|65736613|dbj|BAD98523.1| alpha-L-arabinofuranosidase / beta-D-xylosidase [Pyrus pyrifolia]
          Length = 774

 Score =  723 bits (1865), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 351/430 (81%), Positives = 382/430 (88%), Gaps = 1/430 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN-TGSRLKV 59
           MYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDPVL  KY A YV+GLQG+  G+RLKV
Sbjct: 145 MYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQGDGAGNRLKV 204

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           AACCKHYTAYDLDNWNGVDR+HFNARVSKQDLEDTYNVPFKACVV+G VASVMCSYNQVN
Sbjct: 205 AACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFKACVVDGNVASVMCSYNQVN 264

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           GKPTCADPD+LK TI GQW+L+GYIVSDCDSVGV Y+ QHYT+TPE AAA AIKAGLDLD
Sbjct: 265 GKPTCADPDLLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHYTKTPEAAAAYAIKAGLDLD 324

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CGPFL IHTE A+R G + E D+N ALA TITVQMRLGMFDGEPS Q +GNLG  DVC P
Sbjct: 325 CGPFLGIHTEAAIRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLGLADVCKP 384

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           +  +LAL+AA QGIVLL+N   +LPLST+RH TVAVIGPNSDVT TMIGNYAG+ACGYTT
Sbjct: 385 SSNELALEAARQGIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAGIACGYTT 444

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           PLQGI+RY +TIHQAGC  V CNGNQLIGAAEVAARQADATVLV+GLDQSIEAEF DR G
Sbjct: 445 PLQGIARYTRTIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEAEFRDRTG 504

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           LLLPG QQELVSRVA+ASRGP +LV+M GGP+DV+FAKNDP IGAI+WVGYPGQAGG AI
Sbjct: 505 LLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPCIGAIIWVGYPGQAGGTAI 564

Query: 420 ADVLFGRANP 429
           ADVLFG  NP
Sbjct: 565 ADVLFGTTNP 574


>gi|356556038|ref|XP_003546334.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
          Length = 775

 Score =  721 bits (1862), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 342/431 (79%), Positives = 385/431 (89%), Gaps = 1/431 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKV 59
           MYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDP L  KYAASYV+GLQG++ G+ LKV
Sbjct: 151 MYNGGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKGLQGDSAGNHLKV 210

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           AACCKHYTAYDLDNWNGVDR+HFNA+VSKQDLEDTY+VPFKACV+EG+VASVMCSYNQVN
Sbjct: 211 AACCKHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPFKACVLEGQVASVMCSYNQVN 270

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           GKPTCADPD+L+NTI GQWRL+GYIVSDCDSVGV ++ QHYT+TPEEAAA+AIKAGLDLD
Sbjct: 271 GKPTCADPDLLRNTIRGQWRLNGYIVSDCDSVGVFFDNQHYTKTPEEAAAEAIKAGLDLD 330

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CGPFLAIHT+ A+R GL+ E D+NLALA  I+VQMRLGMFDGEPS QP+GNLGPRDVCT 
Sbjct: 331 CGPFLAIHTDSAIRKGLISENDLNLALANLISVQMRLGMFDGEPSTQPYGNLGPRDVCTS 390

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           AHQQLAL+AA + IVLL+N   +LPLS  R  T+ V+GPN+D TVTMIGNYAGVACGYTT
Sbjct: 391 AHQQLALEAARESIVLLQNKGNSLPLSPSRLRTIGVVGPNADATVTMIGNYAGVACGYTT 450

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           PLQGI+RY KT HQ GC GVAC GN+L GAAE  ARQADA VLVMGLDQ++EAE  DR G
Sbjct: 451 PLQGIARYVKTAHQVGCRGVACRGNELFGAAETIARQADAIVLVMGLDQTVEAETRDRVG 510

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           LLLPG QQELV+RVA+A++GPV+L++M GGPVD+SFAKNDP+I AILWVGYPGQAGG AI
Sbjct: 511 LLLPGLQQELVTRVARAAKGPVILLIMSGGPVDISFAKNDPKISAILWVGYPGQAGGTAI 570

Query: 420 ADVLFGRANPG 430
           ADV+FG  NPG
Sbjct: 571 ADVIFGTTNPG 581


>gi|356503923|ref|XP_003520749.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 775

 Score =  720 bits (1858), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 339/430 (78%), Positives = 375/430 (87%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPVL G YAASYVRGLQG  G+RLKVA
Sbjct: 145 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGTYAASYVRGLQGTDGNRLKVA 204

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKH+TAYDLDNWNG+DR+HFNA+VSKQD+E+T++VPF+ CV EGKVASVMCSYNQVNG
Sbjct: 205 ACCKHFTAYDLDNWNGMDRFHFNAQVSKQDIEETFDVPFRMCVSEGKVASVMCSYNQVNG 264

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
            PTCADP++LK T+ G W+LDGYIVSDCDSVGV Y+ QHYT TPEEAAADAIKAGLDLDC
Sbjct: 265 VPTCADPNLLKKTVRGLWQLDGYIVSDCDSVGVFYDNQHYTPTPEEAAADAIKAGLDLDC 324

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFLA+HT+ AV  GLL E DVN AL  T+TVQMRLGMFDGEPSA  +G LGP+DVC PA
Sbjct: 325 GPFLAVHTQNAVEKGLLSEADVNGALVNTLTVQMRLGMFDGEPSAHAYGKLGPKDVCKPA 384

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           HQ+LAL+AA QGIVLLKN+   LPLS  RHHTVAVIGPNS  TVTMIGNYAGVACGYT P
Sbjct: 385 HQELALEAARQGIVLLKNTGPVLPLSPQRHHTVAVIGPNSKATVTMIGNYAGVACGYTNP 444

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           LQGI RYAKTIHQ GC  VAC  ++L G+A  AARQADATVLVMGLDQSIEAE +DR GL
Sbjct: 445 LQGIGRYAKTIHQLGCENVACKNDKLFGSAINAARQADATVLVMGLDQSIEAETVDRTGL 504

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
           LLPGRQQ+LVS+VA AS+GP +LV+M GG VD++FAKN+PRI  ILW GYPGQAGGAAIA
Sbjct: 505 LLPGRQQDLVSKVAAASKGPTILVIMSGGSVDITFAKNNPRIVGILWAGYPGQAGGAAIA 564

Query: 421 DVLFGRANPG 430
           D+LFG  NPG
Sbjct: 565 DILFGTTNPG 574


>gi|357449039|ref|XP_003594795.1| Beta xylosidase [Medicago truncatula]
 gi|355483843|gb|AES65046.1| Beta xylosidase [Medicago truncatula]
          Length = 762

 Score =  719 bits (1856), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 344/430 (80%), Positives = 383/430 (89%), Gaps = 3/430 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN-TGSRLKV 59
           MYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP + GKYAASYV+GLQGN  G+RLKV
Sbjct: 141 MYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPTVAGKYAASYVQGLQGNGAGNRLKV 200

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           AACCKHYTAYDLDNWNGVDR+HFNA+VSKQDL DTY+VPFKACV +GKVASVMCSYNQVN
Sbjct: 201 AACCKHYTAYDLDNWNGVDRFHFNAKVSKQDLADTYDVPFKACVRDGKVASVMCSYNQVN 260

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           GKPTCADP++L+NTI G+W L+GYIVSDCDSVGVLY+ QHYTRTPE+AAA AIKAGLDLD
Sbjct: 261 GKPTCADPELLRNTIRGEWGLNGYIVSDCDSVGVLYDNQHYTRTPEQAAAAAIKAGLDLD 320

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CGPFLA+HT+GA++ GL+ E D+NLALA  ITVQMRLGMFDG+  AQP+GNLG RDVC P
Sbjct: 321 CGPFLALHTDGAIKQGLISENDLNLALANLITVQMRLGMFDGD--AQPYGNLGTRDVCLP 378

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           +H  +AL+AA QGIVLL+N    LPLS  R+ TV VIGPNSDVTVTMIGNYAG+ACGYTT
Sbjct: 379 SHNDVALEAARQGIVLLQNKGNALPLSPTRYRTVGVIGPNSDVTVTMIGNYAGIACGYTT 438

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           PLQGI+RY KTIHQAGC  V C GNQL G +E  ARQADATVLVMGLDQSIEAEF DR G
Sbjct: 439 PLQGIARYVKTIHQAGCKDVGCGGNQLFGLSEQVARQADATVLVMGLDQSIEAEFRDRTG 498

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           LLLPG QQELVSRVA+A+RGPV+LVLM GGP+DV+FAKNDP+I AILWVGYPGQ+GG AI
Sbjct: 499 LLLPGHQQELVSRVARAARGPVILVLMSGGPIDVTFAKNDPKISAILWVGYPGQSGGTAI 558

Query: 420 ADVLFGRANP 429
           ADV+FGR NP
Sbjct: 559 ADVIFGRTNP 568


>gi|356572781|ref|XP_003554544.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 771

 Score =  717 bits (1852), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 336/430 (78%), Positives = 376/430 (87%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPVL G YAA+YVRGLQG   +RLKVA
Sbjct: 141 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGTYAATYVRGLQGTHANRLKVA 200

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKH+TAYDLDNWNG+DR+HFNA+VSKQD+EDT++VPFK CV EGKVASVMCSYNQVNG
Sbjct: 201 ACCKHFTAYDLDNWNGMDRFHFNAQVSKQDIEDTFDVPFKMCVSEGKVASVMCSYNQVNG 260

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
            PTCADP++LK T+ G W+LDGYIVSDCDSVGV Y+ QHYT TPEEAAADAIKAGLDLDC
Sbjct: 261 VPTCADPNLLKKTVRGLWQLDGYIVSDCDSVGVFYDNQHYTPTPEEAAADAIKAGLDLDC 320

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFLA+HT+ AV+ GLL E DVN AL  T+TVQMRLGMFDGEP+A P+G+LGP+DVC PA
Sbjct: 321 GPFLAVHTQNAVKKGLLSEADVNGALVNTLTVQMRLGMFDGEPTAHPYGHLGPKDVCKPA 380

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           HQ+LAL+AA QGIVLLKN+   LPLS+  H TVAVIGPNS  T+TMIGNYAGVACGYT P
Sbjct: 381 HQELALEAARQGIVLLKNTGPVLPLSSQLHRTVAVIGPNSKATITMIGNYAGVACGYTNP 440

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           LQGI RYA+T+HQ GC  VAC  ++L G A  AARQADATVLVMGLDQSIEAE +DR GL
Sbjct: 441 LQGIGRYARTVHQLGCQNVACKNDKLFGPAINAARQADATVLVMGLDQSIEAETVDRTGL 500

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
           LLPGRQ +LVS+VA AS+GP +LVLM GGPVD++FAKN+PRI  ILW GYPGQAGGAAIA
Sbjct: 501 LLPGRQPDLVSKVAAASKGPTILVLMSGGPVDITFAKNNPRIVGILWAGYPGQAGGAAIA 560

Query: 421 DVLFGRANPG 430
           D+LFG ANPG
Sbjct: 561 DILFGTANPG 570


>gi|225431898|ref|XP_002276351.1| PREDICTED: beta-D-xylosidase 1-like [Vitis vinifera]
          Length = 770

 Score =  717 bits (1850), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 358/430 (83%), Positives = 384/430 (89%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDP +  KYAA+YVRGLQGN   RLKVA
Sbjct: 141 MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQGNARDRLKVA 200

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKHYTAYDLD+W G+DR+HFNARVSKQDLEDTY+VPFKACVVEG VASVMCSYNQVNG
Sbjct: 201 ACCKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPFKACVVEGNVASVMCSYNQVNG 260

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
           KPTCADP +L++TI G+W+L+GYIVSDCDSVGV Y+ QHYT TPEEAAA AIKAGLDLDC
Sbjct: 261 KPTCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQHYTATPEEAAAVAIKAGLDLDC 320

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFLAIHTE A+RGG L E DVN AL  TI+VQMRLGMFDGEPSAQP+GNLGPRDVCTPA
Sbjct: 321 GPFLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMFDGEPSAQPYGNLGPRDVCTPA 380

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           HQQLAL+AA QGIVL++N    LPLST RH T+AVIGPNSDVT TMIGNYAGVACGYTTP
Sbjct: 381 HQQLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPNSDVTETMIGNYAGVACGYTTP 440

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           LQGI RYA+TIHQAGC GVAC  +Q  GAA  AARQADATVLVMGLDQSIEAEF DR  +
Sbjct: 441 LQGIGRYARTIHQAGCSGVACRDDQQFGAAVAAARQADATVLVMGLDQSIEAEFRDRVDI 500

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
           LLPGRQQELVS+VA ASRGP VLVLM GGP+DVSFAKNDPRI AI+WVGYPGQAGG AIA
Sbjct: 501 LLPGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKNDPRIAAIIWVGYPGQAGGTAIA 560

Query: 421 DVLFGRANPG 430
           DVLFGR NPG
Sbjct: 561 DVLFGRTNPG 570


>gi|356534827|ref|XP_003535953.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 771

 Score =  717 bits (1850), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 336/430 (78%), Positives = 378/430 (87%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDP+L GKYAASYVRGLQ   G+RLKVA
Sbjct: 141 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQETDGNRLKVA 200

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A CKH+TAYDLDNWNGVDR+HFNA+VSKQD+EDT+NVPF+ CV EGKVASVMCSYNQVNG
Sbjct: 201 ASCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPFRMCVKEGKVASVMCSYNQVNG 260

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
            PTCADP +LK T+ GQW L+GYIVSDCDSVGV YN+QHYT TPEEAAADAIKAGLDLDC
Sbjct: 261 VPTCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQHYTSTPEEAAADAIKAGLDLDC 320

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFL  HT+ AV+ GL+ E DVN AL  T+TVQMRLGM+DGEPS+ P+G LGPRDVCTP+
Sbjct: 321 GPFLGQHTQNAVKKGLISETDVNGALLNTLTVQMRLGMYDGEPSSHPYGKLGPRDVCTPS 380

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           HQ+LAL+AA QGIVLLKN   +LPLST RH TVAVIGPNS+VTVTMIGNYAG+ACGYT+P
Sbjct: 381 HQELALEAARQGIVLLKNKGPSLPLSTRRHPTVAVIGPNSNVTVTMIGNYAGIACGYTSP 440

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           L+GI RY KTIH+ GC  VAC  ++  G A   A+QADATVLVMGLDQSIEAE +DRAGL
Sbjct: 441 LEGIGRYTKTIHELGCANVACTNDKQFGRAINVAQQADATVLVMGLDQSIEAETVDRAGL 500

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
           LLPGRQQ+LVS+VA AS+GP +LV+M GGPVD++FAKN+PRI AILW GYPGQAGGAAIA
Sbjct: 501 LLPGRQQDLVSKVAAASKGPTILVIMSGGPVDITFAKNNPRIQAILWAGYPGQAGGAAIA 560

Query: 421 DVLFGRANPG 430
           D+LFG +NPG
Sbjct: 561 DILFGTSNPG 570


>gi|357511337|ref|XP_003625957.1| Beta-xylosidase [Medicago truncatula]
 gi|355500972|gb|AES82175.1| Beta-xylosidase [Medicago truncatula]
          Length = 771

 Score =  716 bits (1849), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 335/430 (77%), Positives = 376/430 (87%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPVL G+YAASYV+GLQG  G++LKVA
Sbjct: 142 MYNGGAAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGRYAASYVKGLQGTDGNKLKVA 201

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKH+TAYD+DNWNGVDR+HFNA VSKQD+EDT++VPF+ CV EGKVASVMCSYNQVNG
Sbjct: 202 ACCKHFTAYDVDNWNGVDRFHFNALVSKQDIEDTFDVPFRMCVKEGKVASVMCSYNQVNG 261

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
            PTCADP++LK T+ G W LDGYIVSDCDSVGVLYN+QHYT TPEEAAADAIKAGLDLDC
Sbjct: 262 VPTCADPNLLKKTVRGVWGLDGYIVSDCDSVGVLYNSQHYTSTPEEAAADAIKAGLDLDC 321

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFL +HT+ AV+ GLL E DVN AL  T+ VQMRLGMFDGEPSAQ +G LGP+DVC PA
Sbjct: 322 GPFLGVHTQDAVKKGLLTEADVNNALVNTLKVQMRLGMFDGEPSAQAYGRLGPKDVCKPA 381

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           HQ+LAL+AA QGIVLLKN+  TLPLS  RH TVAVIGPNSDVTVTMIGNYAG+ACGYT+P
Sbjct: 382 HQELALEAARQGIVLLKNTGPTLPLSPQRHRTVAVIGPNSDVTVTMIGNYAGIACGYTSP 441

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           LQGI RYAKTIHQ GC  VAC  ++  G A  AAR ADAT+LV+GLDQSIEAE +DR  L
Sbjct: 442 LQGIGRYAKTIHQQGCSNVACRDDKQFGPALDAARHADATILVIGLDQSIEAETVDRTSL 501

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
           LLPG QQ+LVS+VA AS+GP +LVLM GGPVD++FAKNDP++  ILW GYPGQAGGAAIA
Sbjct: 502 LLPGHQQDLVSKVAAASKGPTILVLMSGGPVDITFAKNDPKVAGILWAGYPGQAGGAAIA 561

Query: 421 DVLFGRANPG 430
           D+LFG A+PG
Sbjct: 562 DILFGTASPG 571


>gi|15239867|ref|NP_199747.1| beta-xylosidase 1 [Arabidopsis thaliana]
 gi|75262458|sp|Q9FGY1.1|BXL1_ARATH RecName: Full=Beta-D-xylosidase 1; Short=AtBXL1; AltName:
           Full=Alpha-L-arabinofuranosidase; Flags: Precursor
 gi|9759419|dbj|BAB09906.1| xylosidase [Arabidopsis thaliana]
 gi|21539545|gb|AAM53325.1| xylosidase [Arabidopsis thaliana]
 gi|332008419|gb|AED95802.1| beta-xylosidase 1 [Arabidopsis thaliana]
          Length = 774

 Score =  716 bits (1849), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 350/431 (81%), Positives = 387/431 (89%), Gaps = 3/431 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKV 59
           MYNGG+AGLTYWSPNVNI RDPRWGRGQETPGEDP++  KYAASYVRGLQG   G+RLKV
Sbjct: 149 MYNGGVAGLTYWSPNVNILRDPRWGRGQETPGEDPIVAAKYAASYVRGLQGTAAGNRLKV 208

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           AACCKHYTAYDLDNWNGVDR+HFNA+V++QDLEDTYNVPFK+CV EGKVASVMCSYNQVN
Sbjct: 209 AACCKHYTAYDLDNWNGVDRFHFNAKVTQQDLEDTYNVPFKSCVYEGKVASVMCSYNQVN 268

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           GKPTCAD ++LKNTI GQWRL+GYIVSDCDSV V +N QHYT TPEEAAA +IKAGLDLD
Sbjct: 269 GKPTCADENLLKNTIRGQWRLNGYIVSDCDSVDVFFNQQHYTSTPEEAAARSIKAGLDLD 328

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CGPFLAI TEGAV+ GLL E D+NLALA T+TVQMRLGMFDG  +  P+ NLGPRDVCTP
Sbjct: 329 CGPFLAIFTEGAVKKGLLTENDINLALANTLTVQMRLGMFDG--NLGPYANLGPRDVCTP 386

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           AH+ LAL+AAHQGIVLLKNSAR+LPLS  RH TVAVIGPNSDVT TMIGNYAG AC YT+
Sbjct: 387 AHKHLALEAAHQGIVLLKNSARSLPLSPRRHRTVAVIGPNSDVTETMIGNYAGKACAYTS 446

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           PLQGISRYA+T+HQAGC GVAC GNQ  GAAE AAR+ADATVLVMGLDQSIEAE  DR G
Sbjct: 447 PLQGISRYARTLHQAGCAGVACKGNQGFGAAEAAAREADATVLVMGLDQSIEAETRDRTG 506

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           LLLPG QQ+LV+RVA+ASRGPV+LVLM GGP+DV+FAKNDPR+ AI+W GYPGQAGGAAI
Sbjct: 507 LLLPGYQQDLVTRVAQASRGPVILVLMSGGPIDVTFAKNDPRVAAIIWAGYPGQAGGAAI 566

Query: 420 ADVLFGRANPG 430
           A+++FG ANPG
Sbjct: 567 ANIIFGAANPG 577


>gi|296083274|emb|CBI22910.3| unnamed protein product [Vitis vinifera]
          Length = 738

 Score =  715 bits (1846), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 358/430 (83%), Positives = 384/430 (89%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDP +  KYAA+YVRGLQGN   RLKVA
Sbjct: 141 MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQGNARDRLKVA 200

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKHYTAYDLD+W G+DR+HFNARVSKQDLEDTY+VPFKACVVEG VASVMCSYNQVNG
Sbjct: 201 ACCKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPFKACVVEGNVASVMCSYNQVNG 260

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
           KPTCADP +L++TI G+W+L+GYIVSDCDSVGV Y+ QHYT TPEEAAA AIKAGLDLDC
Sbjct: 261 KPTCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQHYTATPEEAAAVAIKAGLDLDC 320

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFLAIHTE A+RGG L E DVN AL  TI+VQMRLGMFDGEPSAQP+GNLGPRDVCTPA
Sbjct: 321 GPFLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMFDGEPSAQPYGNLGPRDVCTPA 380

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           HQQLAL+AA QGIVL++N    LPLST RH T+AVIGPNSDVT TMIGNYAGVACGYTTP
Sbjct: 381 HQQLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPNSDVTETMIGNYAGVACGYTTP 440

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           LQGI RYA+TIHQAGC GVAC  +Q  GAA  AARQADATVLVMGLDQSIEAEF DR  +
Sbjct: 441 LQGIGRYARTIHQAGCSGVACRDDQQFGAAVAAARQADATVLVMGLDQSIEAEFRDRVDI 500

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
           LLPGRQQELVS+VA ASRGP VLVLM GGP+DVSFAKNDPRI AI+WVGYPGQAGG AIA
Sbjct: 501 LLPGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKNDPRIAAIIWVGYPGQAGGTAIA 560

Query: 421 DVLFGRANPG 430
           DVLFGR NPG
Sbjct: 561 DVLFGRTNPG 570


>gi|297795695|ref|XP_002865732.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
 gi|297311567|gb|EFH41991.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
          Length = 774

 Score =  709 bits (1830), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 347/431 (80%), Positives = 385/431 (89%), Gaps = 3/431 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKV 59
           MYNGG+AGLTYWSPNVNI RDPRWGRGQETPGEDP++  KYAASYVRGLQG   G+RLKV
Sbjct: 149 MYNGGVAGLTYWSPNVNILRDPRWGRGQETPGEDPIVAAKYAASYVRGLQGTAAGNRLKV 208

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           AACCKHYTAYDLDNWNGVDR+HFNA+V++QDLEDTYNVPFK+CV EGKVASVMCSYNQVN
Sbjct: 209 AACCKHYTAYDLDNWNGVDRFHFNAKVTQQDLEDTYNVPFKSCVYEGKVASVMCSYNQVN 268

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           GKPTCAD ++LKNTI G+WRL+GYIVSDCDSV V +N QHYT TPEEAAA +IKAGLDLD
Sbjct: 269 GKPTCADENLLKNTIRGKWRLNGYIVSDCDSVDVFFNQQHYTSTPEEAAAASIKAGLDLD 328

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CGPFLAI TEGAV+ GLL E D+NLALA T+TVQMRLGMFDG  +  P+ NLGPRDVC+ 
Sbjct: 329 CGPFLAIFTEGAVKKGLLTENDINLALANTLTVQMRLGMFDG--NLGPYANLGPRDVCSL 386

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           AH+ LAL+AAHQGIVLLKNS R+LPLS  RH TVAVIGPNSDVT TMIGNYAG AC YTT
Sbjct: 387 AHKHLALEAAHQGIVLLKNSGRSLPLSPRRHRTVAVIGPNSDVTETMIGNYAGKACAYTT 446

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           PLQGISRYA+T+HQAGC GVAC GNQ  GAAE AAR+ADATVLVMGLDQSIEAE  DR G
Sbjct: 447 PLQGISRYARTLHQAGCAGVACKGNQGFGAAEAAAREADATVLVMGLDQSIEAETRDRTG 506

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           LLLPG QQ+LV+RVA+ASRGPV+LVLM GGP+DV+FAKNDPR+ AI+W GYPGQAGGAAI
Sbjct: 507 LLLPGYQQDLVTRVAQASRGPVILVLMSGGPIDVTFAKNDPRVAAIIWAGYPGQAGGAAI 566

Query: 420 ADVLFGRANPG 430
           A+++FG ANPG
Sbjct: 567 ANIIFGAANPG 577


>gi|356501877|ref|XP_003519750.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 772

 Score =  704 bits (1818), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 332/430 (77%), Positives = 373/430 (86%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDP+L GKYAASYVRGLQG  G+RLKVA
Sbjct: 142 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQGTDGNRLKVA 201

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A CKH+TAYDLDNWNGVDR+HFNA+VSKQD+EDT+NVPF+ CV EGKVASVMCSYNQVNG
Sbjct: 202 ASCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPFRMCVKEGKVASVMCSYNQVNG 261

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
            PTCADP +LK T+ GQW L+GYIVSDCDSVGV YN+QHYT TPEEAAADAIKAGLDLDC
Sbjct: 262 VPTCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQHYTSTPEEAAADAIKAGLDLDC 321

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFL  HT+ AV+ GL+ E DVN AL  T+TVQMRLGM+DGEPS+ P+ NLGPRDVCT +
Sbjct: 322 GPFLGQHTQNAVKKGLISEADVNGALLNTLTVQMRLGMYDGEPSSHPYNNLGPRDVCTQS 381

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           HQ+LAL+AA QGIVLLKN   +LPLST R  TVAVIGPNS+VT TMIGNYAG+ACGYT+P
Sbjct: 382 HQELALEAARQGIVLLKNKGPSLPLSTRRGRTVAVIGPNSNVTFTMIGNYAGIACGYTSP 441

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           LQGI  Y KTI++ GC  VAC  ++  G A  AA+QADATVLVMGLDQSIEAE +DRA L
Sbjct: 442 LQGIGTYTKTIYEHGCANVACTDDKQFGRAINAAQQADATVLVMGLDQSIEAETVDRASL 501

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
           LLPG QQ+LVS+VA AS+GP +LV+M GGPVD++FAKNDPRI  ILW GYPGQAGGAAIA
Sbjct: 502 LLPGHQQDLVSKVAAASKGPTILVIMSGGPVDITFAKNDPRIQGILWAGYPGQAGGAAIA 561

Query: 421 DVLFGRANPG 430
           D+LFG +NPG
Sbjct: 562 DILFGTSNPG 571


>gi|449466797|ref|XP_004151112.1| PREDICTED: beta-D-xylosidase 1-like [Cucumis sativus]
          Length = 770

 Score =  695 bits (1793), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 345/431 (80%), Positives = 377/431 (87%), Gaps = 1/431 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS-RLKV 59
           MYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDP+L  KYAA+YV+GLQGN G  RLKV
Sbjct: 146 MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPILAAKYAANYVQGLQGNDGKKRLKV 205

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           AACCKHYTAYDLDNWNGVDRYHFNA+VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN
Sbjct: 206 AACCKHYTAYDLDNWNGVDRYHFNAKVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 265

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           GKPTCADPD+LKNTI G W LDGYIVSDCDSVGVLY++QH+T TPEEAAA  IKAGLDLD
Sbjct: 266 GKPTCADPDLLKNTIRGAWGLDGYIVSDCDSVGVLYDSQHFTPTPEEAAASTIKAGLDLD 325

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CGPFLA+HT  AV  GLL+E D+N ALA  ++VQMRLGMFDGEP+AQP+GNLGP+DVCTP
Sbjct: 326 CGPFLAVHTATAVGRGLLKEVDLNNALANLLSVQMRLGMFDGEPAAQPYGNLGPKDVCTP 385

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           AH+ LAL+AA QGIVLL+N A  LPLS  RH TVAVIGPNSD TVTMIGNYAGVAC YTT
Sbjct: 386 AHKHLALEAARQGIVLLQNRAGALPLSPTRHRTVAVIGPNSDATVTMIGNYAGVACEYTT 445

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           P+QGIS+Y KTIH  GC  VAC G+QLIG AE AAR ADA V+V+GLDQSIEAE  DR G
Sbjct: 446 PVQGISKYVKTIHAKGCANVACVGDQLIGEAEAAARVADAAVVVVGLDQSIEAESRDRNG 505

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           +LLPG+Q+ELV R+  A +GP V+VLM GGP+DVSFAKND +I  ILWVGYPGQAGGAAI
Sbjct: 506 VLLPGKQEELVRRIGLACKGPTVVVLMSGGPIDVSFAKNDGKISGILWVGYPGQAGGAAI 565

Query: 420 ADVLFGRANPG 430
           ADVLFG  NPG
Sbjct: 566 ADVLFGATNPG 576


>gi|18378991|ref|NP_563659.1| beta-glucosidase [Arabidopsis thaliana]
 gi|75250279|sp|Q94KD8.1|BXL2_ARATH RecName: Full=Probable beta-D-xylosidase 2; Short=AtBXL2; Flags:
           Precursor
 gi|14194121|gb|AAK56255.1|AF367266_1 At1g02640/T14P4_11 [Arabidopsis thaliana]
 gi|23506063|gb|AAN28891.1| At1g02640/T14P4_11 [Arabidopsis thaliana]
 gi|332189332|gb|AEE27453.1| beta-glucosidase [Arabidopsis thaliana]
          Length = 768

 Score =  694 bits (1791), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 329/430 (76%), Positives = 371/430 (86%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGG+ GLTYWSPNVNI RDPRWGRGQETPGEDPV+ GKYAASYVRGLQGN  SRLKVA
Sbjct: 144 MYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNDRSRLKVA 203

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKH+TAYDLDNWNGVDR+HFNA+VSKQD+EDT++VPF+ CV EG VAS+MCSYNQVNG
Sbjct: 204 ACCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVKEGNVASIMCSYNQVNG 263

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
            PTCADP++LK TI  QW L+GYIVSDCDSVGVLY+TQHYT TPEEAAAD+IKAGLDLDC
Sbjct: 264 VPTCADPNLLKKTIRNQWGLNGYIVSDCDSVGVLYDTQHYTGTPEEAAADSIKAGLDLDC 323

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFL  HT  AV+  LLRE DV+ AL  T+TVQMRLGMFDG+ +AQP+G+LGP  VCTP 
Sbjct: 324 GPFLGAHTIDAVKKNLLRESDVDNALINTLTVQMRLGMFDGDIAAQPYGHLGPAHVCTPV 383

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H+ LAL+AA QGIVLLKN   +LPLS+ RH TVAVIGPNSD TVTMIGNYAGVACGYT+P
Sbjct: 384 HKGLALEAAQQGIVLLKNHGSSLPLSSQRHRTVAVIGPNSDATVTMIGNYAGVACGYTSP 443

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           +QGI+ YA+TIHQ GC  V C  ++L  AA  AAR ADATVLVMGLDQSIEAEF DR  L
Sbjct: 444 VQGITGYARTIHQKGCVDVHCMDDRLFDAAVEAARGADATVLVMGLDQSIEAEFKDRNSL 503

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
           LLPG+QQELVSRVAKA++GPV+LVLM GGP+D+SFA+ D +I AI+W GYPGQ GG AIA
Sbjct: 504 LLPGKQQELVSRVAKAAKGPVILVLMSGGPIDISFAEKDRKIPAIVWAGYPGQEGGTAIA 563

Query: 421 DVLFGRANPG 430
           D+LFG ANPG
Sbjct: 564 DILFGSANPG 573


>gi|9972374|gb|AAG10624.1|AC022521_2 Similar to xylosidase [Arabidopsis thaliana]
          Length = 763

 Score =  694 bits (1790), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 329/430 (76%), Positives = 371/430 (86%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGG+ GLTYWSPNVNI RDPRWGRGQETPGEDPV+ GKYAASYVRGLQGN  SRLKVA
Sbjct: 139 MYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNDRSRLKVA 198

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKH+TAYDLDNWNGVDR+HFNA+VSKQD+EDT++VPF+ CV EG VAS+MCSYNQVNG
Sbjct: 199 ACCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVKEGNVASIMCSYNQVNG 258

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
            PTCADP++LK TI  QW L+GYIVSDCDSVGVLY+TQHYT TPEEAAAD+IKAGLDLDC
Sbjct: 259 VPTCADPNLLKKTIRNQWGLNGYIVSDCDSVGVLYDTQHYTGTPEEAAADSIKAGLDLDC 318

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFL  HT  AV+  LLRE DV+ AL  T+TVQMRLGMFDG+ +AQP+G+LGP  VCTP 
Sbjct: 319 GPFLGAHTIDAVKKNLLRESDVDNALINTLTVQMRLGMFDGDIAAQPYGHLGPAHVCTPV 378

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H+ LAL+AA QGIVLLKN   +LPLS+ RH TVAVIGPNSD TVTMIGNYAGVACGYT+P
Sbjct: 379 HKGLALEAAQQGIVLLKNHGSSLPLSSQRHRTVAVIGPNSDATVTMIGNYAGVACGYTSP 438

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           +QGI+ YA+TIHQ GC  V C  ++L  AA  AAR ADATVLVMGLDQSIEAEF DR  L
Sbjct: 439 VQGITGYARTIHQKGCVDVHCMDDRLFDAAVEAARGADATVLVMGLDQSIEAEFKDRNSL 498

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
           LLPG+QQELVSRVAKA++GPV+LVLM GGP+D+SFA+ D +I AI+W GYPGQ GG AIA
Sbjct: 499 LLPGKQQELVSRVAKAAKGPVILVLMSGGPIDISFAEKDRKIPAIVWAGYPGQEGGTAIA 558

Query: 421 DVLFGRANPG 430
           D+LFG ANPG
Sbjct: 559 DILFGSANPG 568


>gi|297843058|ref|XP_002889410.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335252|gb|EFH65669.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 763

 Score =  689 bits (1777), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 323/430 (75%), Positives = 370/430 (86%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGG+ GLTYWSPNVNI RDPRWGRGQETPGEDPV+ GKYAASYVRGLQGN  SRLKVA
Sbjct: 139 MYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNDRSRLKVA 198

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKH+TAYDLDNWNGVDR+HFNA+VSKQD+EDT++VPF+ CV EG VAS+MCSYN+VNG
Sbjct: 199 ACCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVKEGNVASIMCSYNEVNG 258

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
            PTCADP++LK TI  +W L+GYIVSDCDSVGVLY+TQHYT TPEEAAAD+IKAGLDLDC
Sbjct: 259 VPTCADPNLLKKTIRNEWGLNGYIVSDCDSVGVLYDTQHYTGTPEEAAADSIKAGLDLDC 318

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFL  HT  AV+  LLRE DV+ AL  T+TVQMRLGMFDG+ +AQP+G+LGP  VCTP 
Sbjct: 319 GPFLGAHTIDAVKKNLLRESDVDNALINTLTVQMRLGMFDGDIAAQPYGHLGPAHVCTPV 378

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H+ LAL+AA QGIVLLKN   +LPLS+ RH TVAVIGPNSD TV MIGNYAG+ACGYT+P
Sbjct: 379 HKGLALEAAQQGIVLLKNHGSSLPLSSQRHRTVAVIGPNSDATVAMIGNYAGIACGYTSP 438

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           +QGI+ YA+T+HQ GC  V C  ++L  AA  AAR ADATVLVMGLDQSIEAEF DR  L
Sbjct: 439 VQGITGYARTVHQKGCVDVHCMDDRLFDAAVEAARGADATVLVMGLDQSIEAEFKDRNSL 498

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
           LLPG+QQEL+SRVAKA++GPV+LVLM GGP+D+SFA+ D +I AI+W GYPGQ GG AIA
Sbjct: 499 LLPGKQQELISRVAKAAKGPVILVLMSGGPIDISFAEKDRKIPAIVWAGYPGQEGGTAIA 558

Query: 421 DVLFGRANPG 430
           D+LFG ANPG
Sbjct: 559 DILFGSANPG 568


>gi|371917282|dbj|BAL44717.1| SlArf/Xyl2 [Solanum lycopersicum]
          Length = 774

 Score =  687 bits (1774), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 334/431 (77%), Positives = 371/431 (86%), Gaps = 1/431 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKV 59
           MYNG M GLTYWSPNVNIFRDPRWGRGQETPGEDPV+   YA  YVRGLQGN  G  LKV
Sbjct: 145 MYNGEMGGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAALYAERYVRGLQGNEDGDSLKV 204

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           AACCKHYTAYDLDNW GVDR+HFNA+V+KQD+EDT++VPF++CV +GKVAS+MCSYNQVN
Sbjct: 205 AACCKHYTAYDLDNWGGVDRFHFNAKVTKQDIEDTFDVPFRSCVKQGKVASIMCSYNQVN 264

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           G PTCADP +L+ TI G W L+GYIVSDCDSVGV Y+TQHYT TPEEAAA AIKAGLDLD
Sbjct: 265 GIPTCADPQLLRKTIRGGWGLNGYIVSDCDSVGVFYDTQHYTSTPEEAAAAAIKAGLDLD 324

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CGPFL+ HTE AV  G+L+E  ++  LA T+ VQMRLGMFDGEPSAQ +G+LGPRDVC+P
Sbjct: 325 CGPFLSQHTENAVHIGILKEAAIDTNLANTVAVQMRLGMFDGEPSAQQYGHLGPRDVCSP 384

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           AHQ+LA++AA QGIVLLKN    LPLS  RH TVAVIGPNSDVTVTMIGNYAGVACGYT+
Sbjct: 385 AHQELAVEAARQGIVLLKNHGPALPLSPRRHRTVAVIGPNSDVTVTMIGNYAGVACGYTS 444

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           PLQGIS+YAKTIH+ GC  VAC+ ++L   A  AARQADATVLVMGLDQSIEAEF DR G
Sbjct: 445 PLQGISKYAKTIHEKGCGDVACSDDKLFAGAVNAARQADATVLVMGLDQSIEAEFRDRTG 504

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           LLLPG QQEL+S V+KASRGPVVLVLM GGPVDV+FA NDPRIGAI+W GYPGQ GGAAI
Sbjct: 505 LLLPGFQQELISEVSKASRGPVVLVLMSGGPVDVTFANNDPRIGAIVWAGYPGQGGGAAI 564

Query: 420 ADVLFGRANPG 430
           ADVLFG  NPG
Sbjct: 565 ADVLFGAHNPG 575


>gi|371917280|dbj|BAL44716.1| SlArf/Xyl1 [Solanum lycopersicum]
          Length = 771

 Score =  686 bits (1769), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 333/433 (76%), Positives = 369/433 (85%), Gaps = 5/433 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT---GSRL 57
           MYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP L  +Y  SYV+GLQG      +RL
Sbjct: 139 MYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPHLVAQYGVSYVKGLQGGGGRGNTRL 198

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYDLD+WNG DRYHFNA+VS QDLEDTYN PFKACVVEG VASVMCSYNQ
Sbjct: 199 KVAACCKHYTAYDLDDWNGYDRYHFNAKVSMQDLEDTYNAPFKACVVEGNVASVMCSYNQ 258

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           +NGKP+CADP +L++TI  QW L+GYIVSDCDSVGVL+  QHYTR PE+AAA  IKAGLD
Sbjct: 259 INGKPSCADPTLLRDTIRNQWHLNGYIVSDCDSVGVLFEKQHYTRYPEDAAAITIKAGLD 318

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           LDCGPFLAIHT+ AV  G + + ++N ALA TITVQMRLGMFDG P+  P+ NLGP+DVC
Sbjct: 319 LDCGPFLAIHTDKAVHTGKVSQVEINNALANTITVQMRLGMFDG-PNG-PYANLGPKDVC 376

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           +PAHQQLALQAA +GIVLLKN  + LPLST RH TVAVIGPNSD T+ MIGNYAGV CGY
Sbjct: 377 SPAHQQLALQAAREGIVLLKNIGQALPLSTKRHRTVAVIGPNSDATLAMIGNYAGVPCGY 436

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
            +PLQGISRYA+TIHQ GC GVAC GNQ  G AEVAAR ADATVLVMGLDQSIEAE  DR
Sbjct: 437 ISPLQGISRYARTIHQQGCMGVACPGNQNFGLAEVAARHADATVLVMGLDQSIEAEAKDR 496

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             LLLPG QQ+L+SRVA AS+GPVVLVLM GGP+DV+FAKNDPR+ +I+WVGYPGQAGGA
Sbjct: 497 VTLLLPGHQQDLISRVAMASKGPVVLVLMSGGPIDVTFAKNDPRVSSIVWVGYPGQAGGA 556

Query: 418 AIADVLFGRANPG 430
           AIADVLFG  NPG
Sbjct: 557 AIADVLFGATNPG 569


>gi|449436749|ref|XP_004136155.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
          Length = 772

 Score =  669 bits (1727), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/430 (75%), Positives = 359/430 (83%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPV+ G+YAA Y++GLQGN G RLKVA
Sbjct: 143 MYNGGAAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGEYAARYIKGLQGNDGDRLKVA 202

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKH+TAYDLDNWNG DR+HFNA+V++QD+ DT+ VPF+ CV EGKVASVMCSYNQVNG
Sbjct: 203 ACCKHFTAYDLDNWNGTDRFHFNAKVTRQDMVDTFEVPFRKCVKEGKVASVMCSYNQVNG 262

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
            PTCADP++LK TI  QW L+GYIVSDCDSVGV Y+ QHYT T EEAAADAIKAGLDLDC
Sbjct: 263 VPTCADPNLLKGTIRNQWGLNGYIVSDCDSVGVFYDNQHYTSTAEEAAADAIKAGLDLDC 322

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFLA+HTE AV+ GLL +  +N ALA TITVQMRLGMFDG PS+  +G LGP++VC+P+
Sbjct: 323 GPFLAVHTEDAVKKGLLTQTHINNALANTITVQMRLGMFDGAPSSHAYGKLGPKNVCSPS 382

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           HQQLAL AA QGIVLLKN    LPLS   H TVAVIGPNSDV VTMIGNYAGVACGY TP
Sbjct: 383 HQQLALDAARQGIVLLKNRLPGLPLSADHHRTVAVIGPNSDVNVTMIGNYAGVACGYVTP 442

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           L+GI RY   +H+ GC  VAC  +     A  AA  ADATVLVMGLDQS+EAE  DR GL
Sbjct: 443 LEGIKRYTTVVHRKGCDNVACATDYSFTDALAAASTADATVLVMGLDQSVEAETKDRDGL 502

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
           LLPGRQQELV +VA ASRGP V++LM GGP+DVSFA NDPRI AILWVGYPGQAGGAAIA
Sbjct: 503 LLPGRQQELVLKVAAASRGPTVVILMSGGPIDVSFADNDPRISAILWVGYPGQAGGAAIA 562

Query: 421 DVLFGRANPG 430
           DVLFG  NPG
Sbjct: 563 DVLFGTTNPG 572


>gi|449505346|ref|XP_004162442.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
           [Cucumis sativus]
          Length = 772

 Score =  668 bits (1724), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 323/430 (75%), Positives = 358/430 (83%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDPV+ G+YAA Y++GLQGN G RLKVA
Sbjct: 143 MYNGGAAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGEYAARYIKGLQGNDGDRLKVA 202

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKH+TAYDLDNWNG DR+HFNA+V++QD+ DT+ VPF+ CV EGKVASVMCSYNQVNG
Sbjct: 203 ACCKHFTAYDLDNWNGTDRFHFNAKVTRQDMVDTFEVPFRKCVKEGKVASVMCSYNQVNG 262

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
            PTCADP++LK TI  QW L+GYIVSDCDSVGV Y+ QHYT T EEAAADAIKAGLDLDC
Sbjct: 263 VPTCADPNLLKGTIRNQWGLNGYIVSDCDSVGVFYDNQHYTSTAEEAAADAIKAGLDLDC 322

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFLA+HTE AV+  LL +  +N ALA TITVQMRLGMFDG PS+  +G LGP++VC+P+
Sbjct: 323 GPFLAVHTEDAVKKXLLTQTHINNALANTITVQMRLGMFDGAPSSHAYGKLGPKNVCSPS 382

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           HQQLAL AA QGIVLLKN    LPLS   H TVAVIGPNSDV VTMIGNYAGVACGY TP
Sbjct: 383 HQQLALDAARQGIVLLKNRLPGLPLSAXHHRTVAVIGPNSDVNVTMIGNYAGVACGYVTP 442

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           L+GI RY   +H+ GC  VAC  +     A  AA  ADATVLVMGLDQS+EAE  DR GL
Sbjct: 443 LEGIKRYTTVVHRKGCDNVACATDYSFTDALAAASTADATVLVMGLDQSVEAETKDRDGL 502

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
           LLPGRQQELV +VA ASRGP V++LM GGP+DVSFA NDPRI AILWVGYPGQAGGAAIA
Sbjct: 503 LLPGRQQELVLKVAAASRGPTVVILMSGGPIDVSFADNDPRISAILWVGYPGQAGGAAIA 562

Query: 421 DVLFGRANPG 430
           DVLFG  NPG
Sbjct: 563 DVLFGTTNPG 572


>gi|449484229|ref|XP_004156823.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
           [Cucumis sativus]
          Length = 769

 Score =  662 bits (1709), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 311/430 (72%), Positives = 358/430 (83%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGG+ GLTYWSPNVNIFRDPRWGRGQETPGEDP+L G YA +YVRGLQG  G+RLKVA
Sbjct: 141 MYNGGVGGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGTYAVNYVRGLQGTEGNRLKVA 200

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKH+TAYDLDNWNGVDR+HFNA+VSKQD+EDT+ VPF+ CV  GKV+SVMCSYNQVNG
Sbjct: 201 ACCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFEVPFRMCVKGGKVSSVMCSYNQVNG 260

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
            PTCADP++L NT+  QW LDGYIVSDCDSVGV YN+QHYT TPEEAAA AIKAGLDLDC
Sbjct: 261 VPTCADPNLLTNTLRSQWHLDGYIVSDCDSVGVFYNSQHYTSTPEEAAAMAIKAGLDLDC 320

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           G FL  HTE AV+ GLL E  +N AL+ T++VQMRLGMFDG+   QP+ +LG + VC+  
Sbjct: 321 GSFLETHTENAVKRGLLNESHINGALSNTLSVQMRLGMFDGDLKTQPYAHLGAKHVCSDH 380

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           ++QLA+ AA QGIVLL+N   +LPLST RH  VAV+GPNS+ T+TMIGNYAG+AC Y TP
Sbjct: 381 NRQLAVDAARQGIVLLENRRGSLPLSTNRHRIVAVVGPNSNATLTMIGNYAGIACEYITP 440

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           LQGIS+Y +TIHQ GC GVAC  N+  G A  AAR ADA VLVMGLDQSIEAEF DRAGL
Sbjct: 441 LQGISKYTRTIHQEGCRGVACRSNKFFGGAIEAARVADAVVLVMGLDQSIEAEFRDRAGL 500

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
           LLPG Q +LV +VA  ++GPV+LVLM GGP+DVSFAK+ P+I  I+W GYPGQAGG AIA
Sbjct: 501 LLPGLQPDLVLKVASVAKGPVILVLMSGGPIDVSFAKDHPKISGIIWGGYPGQAGGLAIA 560

Query: 421 DVLFGRANPG 430
           DVLFG+ NPG
Sbjct: 561 DVLFGQTNPG 570


>gi|449469042|ref|XP_004152230.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
          Length = 769

 Score =  662 bits (1707), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 311/430 (72%), Positives = 358/430 (83%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGG+ GLTYWSPNVNIFRDPRWGRGQETPGEDP+L G YA +YVRGLQG  G+RLKVA
Sbjct: 141 MYNGGVGGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGTYAVNYVRGLQGTEGNRLKVA 200

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKH+TAYDLDNWNGVDR+HFNA+VSKQD+EDT+ VPF+ CV  GKV+SVMCSYNQVNG
Sbjct: 201 ACCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFEVPFRMCVKGGKVSSVMCSYNQVNG 260

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
            PTCADP++L NT+  QW LDGYIVSDCDSVGV YN+QHYT TPEEAAA AIKAGLDLDC
Sbjct: 261 VPTCADPNLLTNTLRSQWHLDGYIVSDCDSVGVFYNSQHYTSTPEEAAAMAIKAGLDLDC 320

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           G FL  HTE AV+ GLL E  +N AL+ T++VQMRLGMFDG+   QP+ +LG + VC+  
Sbjct: 321 GSFLETHTENAVKRGLLNESHINGALSNTLSVQMRLGMFDGDLKTQPYAHLGAKHVCSDH 380

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           ++QLA+ AA QGIVLL+N   +LPLST RH  VAV+GPNS+ T+TMIGNYAG+AC Y TP
Sbjct: 381 NRQLAVDAARQGIVLLENRRGSLPLSTNRHRIVAVVGPNSNATLTMIGNYAGIACEYITP 440

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           LQGIS+Y +TIHQ GC GVAC  N+  G A  AAR ADA VLVMGLDQSIEAEF DRAGL
Sbjct: 441 LQGISKYTRTIHQEGCRGVACRSNKFFGGAIEAARVADAVVLVMGLDQSIEAEFRDRAGL 500

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
           LLPG Q +LV +VA  ++GPV+LVLM GGP+DVSFAK+ P+I  I+W GYPGQAGG AIA
Sbjct: 501 LLPGLQPDLVLKVASVAKGPVILVLMSGGPIDVSFAKDHPKISGIIWGGYPGQAGGLAIA 560

Query: 421 DVLFGRANPG 430
           DVLFG+ NPG
Sbjct: 561 DVLFGQTNPG 570


>gi|189380221|gb|ACD93208.1| beta xylosidase [Camellia sinensis]
          Length = 767

 Score =  652 bits (1683), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 330/436 (75%), Positives = 360/436 (82%), Gaps = 14/436 (3%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVL GKYAASYVRGLQGN+G++LKVA
Sbjct: 142 MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQGNSGNQLKVA 201

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKHYTAYDLDNWN VDRY FNARVSKQDL DTY+VPFKACVVEGK   V C++     
Sbjct: 202 ACCKHYTAYDLDNWNSVDRYRFNARVSKQDLADTYDVPFKACVVEGKY-QVYCAHT---- 256

Query: 121 KPTCADPDILK--NTIHGQWRLDGYI----VSDCDSVGVLYNTQHYTRTPEEAAADAIKA 174
               A+P +L   +  H  W    ++    +  C    + ++T H   TPE+AAA  IKA
Sbjct: 257 IKLMANPLVLTLISPQHHPWSWHSWLHCFRLYRCWGF-ICHSTLH--STPEDAAAATIKA 313

Query: 175 GLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPR 234
           GLDL+CGPFLAIHTE AVR G L E DVN AL  T++VQMRLGMFDGEPS+QP+GNLGPR
Sbjct: 314 GLDLECGPFLAIHTEQAVRQGKLGEADVNGALINTLSVQMRLGMFDGEPSSQPYGNLGPR 373

Query: 235 DVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA 294
           DVCTPAHQQLAL+AA QGIVLL+N  R+LPLST  H TVAVIGPNSDVTVTM+GNYAGVA
Sbjct: 374 DVCTPAHQQLALEAARQGIVLLQNRGRSLPLSTQLHRTVAVIGPNSDVTVTMLGNYAGVA 433

Query: 295 CGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEF 354
           CG+TTPLQGI RY +TIHQ+GC  VAC+ NQL G AE AARQADATVLVMGLDQSIE EF
Sbjct: 434 CGFTTPLQGIERYVRTIHQSGCDSVACSNNQLFGVAETAARQADATVLVMGLDQSIETEF 493

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
            DR GLLLPG QQELVSRVA ASRGPVVLVLM GGP+DVSFAKNDPRIGAILWVGYPGQA
Sbjct: 494 KDRVGLLLPGPQQELVSRVAMASRGPVVLVLMSGGPIDVSFAKNDPRIGAILWVGYPGQA 553

Query: 415 GGAAIADVLFGRANPG 430
           GG AIADVLFGR NPG
Sbjct: 554 GGTAIADVLFGRTNPG 569


>gi|224128360|ref|XP_002320310.1| predicted protein [Populus trichocarpa]
 gi|222861083|gb|EEE98625.1| predicted protein [Populus trichocarpa]
          Length = 635

 Score =  640 bits (1652), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 310/430 (72%), Positives = 355/430 (82%), Gaps = 5/430 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           M+NGG+AGLTYWSPNVNIFRDPRWGRGQETPGEDPV+ GKYAASYVRGLQG+ G+RLKVA
Sbjct: 11  MFNGGVAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVVGKYAASYVRGLQGSDGNRLKVA 70

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKH+TAYDLDNWNGVDR+HFNA VSKQD+EDT++VPF+ CV EGKVASVMCSYNQVNG
Sbjct: 71  ACCKHFTAYDLDNWNGVDRFHFNAEVSKQDMEDTFDVPFRMCVKEGKVASVMCSYNQVNG 130

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
            PTCADP++LK T+ G       ++     +G     Q   + P        +A LDLDC
Sbjct: 131 IPTCADPNLLKKTVRGTLFQTVTLLEFI--MGSNTILQPRRKQPRMLLK---QASLDLDC 185

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFL  HTE AV+ GLL E ++N AL  T+TVQMRLGMFDGEPS+Q +GNLGP DVCTPA
Sbjct: 186 GPFLGQHTEDAVKKGLLNEAEINNALLNTLTVQMRLGMFDGEPSSQLYGNLGPNDVCTPA 245

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           HQ+LAL+AA QGIVLLKN   +LPLST RH +VA++GPNS+VT TMIGNYAG+ACGYTTP
Sbjct: 246 HQELALEAARQGIVLLKNHGPSLPLSTRRHLSVAIVGPNSNVTATMIGNYAGLACGYTTP 305

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           LQGI RYA+TIH+ GC  VAC  +Q   AA  AARQADATVLVMGLDQSIEAEF DR GL
Sbjct: 306 LQGIQRYAQTIHRQGCADVACVSDQQFSAAIDAARQADATVLVMGLDQSIEAEFRDRTGL 365

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
           LLPGRQQELVS+VA AS+GP +LVLM GGP+DVSFA+NDP+IG+I+W GYPGQAGGAAI+
Sbjct: 366 LLPGRQQELVSKVAAASKGPTILVLMSGGPIDVSFAENDPKIGSIVWAGYPGQAGGAAIS 425

Query: 421 DVLFGRANPG 430
           DVLFG  NPG
Sbjct: 426 DVLFGITNPG 435


>gi|357130854|ref|XP_003567059.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
           distachyon]
          Length = 779

 Score =  618 bits (1593), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 301/438 (68%), Positives = 355/438 (81%), Gaps = 10/438 (2%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           +YNG  AGLT+WSPNVNIFRDPRWGRGQETPGEDP ++G+YAA+YVRGLQ     RLK A
Sbjct: 140 IYNGRQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSGRYAAAYVRGLQQQHAGRLKTA 199

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKH+TAYDLD W+G DR+HFNA V+ QDLEDT+N PF+ACVVEG+ A+VMCSYNQVNG
Sbjct: 200 ACCKHFTAYDLDRWSGADRFHFNAIVTPQDLEDTFNAPFRACVVEGRAAAVMCSYNQVNG 259

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
            PTCAD   L+ TI G+W+LDGYIVSDCDSV V Y  QHYTRT E+A A  ++AGLDLDC
Sbjct: 260 VPTCADQGFLRGTIRGKWKLDGYIVSDCDSVDVFYREQHYTRTREDAVAATLRAGLDLDC 319

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFLA +TE AV  G ++E D++ A+  T+TVQMRLGMFDG+ +AQPFG+LGP+ VCTPA
Sbjct: 320 GPFLAQYTEAAVAQGKVKEADIDAAVVNTVTVQMRLGMFDGDVAAQPFGHLGPQHVCTPA 379

Query: 241 HQQLALQAAHQGIVLLKNSA---RTLPLSTLRHH---TVAVIGPNSDVTVTMIGNYAGVA 294
           H++LAL+AA Q IVLLKN       LPLS+  HH   TVAV+GP+S+ TV MIGNYAG  
Sbjct: 380 HRELALEAACQSIVLLKNGGGNNMRLPLSS--HHRRGTVAVVGPHSEATVAMIGNYAGKP 437

Query: 295 CGYTTPLQGISRYAK-TIHQAGCFGVACNGN-QLIGAAEVAARQADATVLVMGLDQSIEA 352
           C YTTPLQG+ RYA+ T+HQAGC  VAC G+ Q I AA  AAR ADATV+V+GLDQS+EA
Sbjct: 438 CAYTTPLQGVGRYARATVHQAGCTDVACQGSGQPIDAAVDAARHADATVVVVGLDQSVEA 497

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E +DR  LLLPGRQ ELVS VA+AS+GPV+LVLM GGPVD++FA+ND  + AILW GYPG
Sbjct: 498 EGLDRTTLLLPGRQAELVSAVARASKGPVILVLMSGGPVDIAFAQNDRNVAAILWAGYPG 557

Query: 413 QAGGAAIADVLFGRANPG 430
           QAGG AIADV+FG  NPG
Sbjct: 558 QAGGQAIADVIFGHHNPG 575


>gi|226531269|ref|NP_001145980.1| uncharacterized protein LOC100279508 precursor [Zea mays]
 gi|219885199|gb|ACL52974.1| unknown [Zea mays]
 gi|413920228|gb|AFW60160.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 794

 Score =  605 bits (1560), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 303/444 (68%), Positives = 349/444 (78%), Gaps = 14/444 (3%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ-------GNT 53
           MYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP ++ +YAA+YVRGLQ       G  
Sbjct: 140 MYNGGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQQPYAAPNGGH 199

Query: 54  GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
            +RLK+AACCKH+TAYDLD W G DR+HFNA V+ QDLEDT+NVPF+ACV +G+ ASVMC
Sbjct: 200 RNRLKLAACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLEDTFNVPFRACVEDGRAASVMC 259

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           SYNQVNG PTCAD   L+ TI G+W LDGYIVSDCDSV V +  QHYTRTPE+AAA  ++
Sbjct: 260 SYNQVNGVPTCADAAFLRGTIRGRWGLDGYIVSDCDSVDVFFRDQHYTRTPEDAAAATLR 319

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AGLDLDCGPFLA++   AV  G + + DV+ AL  T+TVQMRLGMFDG+P+A PFG LGP
Sbjct: 320 AGLDLDCGPFLALYAGSAVAAGKVADADVDAALLNTVTVQMRLGMFDGDPAAGPFGRLGP 379

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNS--AR----TLPLSTLRHHTVAVIGPNSDVTVTMI 287
            DVCT  HQ LAL AA QG+VLLKN   AR     LPL    H  VAV+GP++D TV MI
Sbjct: 380 ADVCTREHQDLALDAARQGVVLLKNRRGARHNRDVLPLRPAAHRVVAVVGPHADATVAMI 439

Query: 288 GNYAGVACGYTTPLQGISRY-AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGL 346
           GNYAG  C YTTPLQG++ Y A+  HQAGC  VAC GNQ I AA  AARQADATV+V GL
Sbjct: 440 GNYAGKPCRYTTPLQGVAAYAARVAHQAGCTDVACRGNQPIAAAVEAARQADATVVVAGL 499

Query: 347 DQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
           DQ +EAE +DR  LLLPGRQ EL+S VAKAS+GPV+LVLM GGP+D++FA+NDPRI  IL
Sbjct: 500 DQRVEAEGLDRTTLLLPGRQAELISAVAKASKGPVILVLMSGGPIDIAFAQNDPRIDGIL 559

Query: 407 WVGYPGQAGGAAIADVLFGRANPG 430
           WVGYPGQAGG AIADV+FG  NPG
Sbjct: 560 WVGYPGQAGGQAIADVIFGHHNPG 583


>gi|195614824|gb|ACG29242.1| auxin-induced beta-glucosidase [Zea mays]
 gi|413920229|gb|AFW60161.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 655

 Score =  605 bits (1559), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 303/444 (68%), Positives = 349/444 (78%), Gaps = 14/444 (3%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ-------GNT 53
           MYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP ++ +YAA+YVRGLQ       G  
Sbjct: 1   MYNGGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQQPYAAPNGGH 60

Query: 54  GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
            +RLK+AACCKH+TAYDLD W G DR+HFNA V+ QDLEDT+NVPF+ACV +G+ ASVMC
Sbjct: 61  RNRLKLAACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLEDTFNVPFRACVEDGRAASVMC 120

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           SYNQVNG PTCAD   L+ TI G+W LDGYIVSDCDSV V +  QHYTRTPE+AAA  ++
Sbjct: 121 SYNQVNGVPTCADAAFLRGTIRGRWGLDGYIVSDCDSVDVFFRDQHYTRTPEDAAAATLR 180

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AGLDLDCGPFLA++   AV  G + + DV+ AL  T+TVQMRLGMFDG+P+A PFG LGP
Sbjct: 181 AGLDLDCGPFLALYAGSAVAAGKVADADVDAALLNTVTVQMRLGMFDGDPAAGPFGRLGP 240

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNS--AR----TLPLSTLRHHTVAVIGPNSDVTVTMI 287
            DVCT  HQ LAL AA QG+VLLKN   AR     LPL    H  VAV+GP++D TV MI
Sbjct: 241 ADVCTREHQDLALDAARQGVVLLKNRRGARHNRDVLPLRPAAHRVVAVVGPHADATVAMI 300

Query: 288 GNYAGVACGYTTPLQGISRY-AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGL 346
           GNYAG  C YTTPLQG++ Y A+  HQAGC  VAC GNQ I AA  AARQADATV+V GL
Sbjct: 301 GNYAGKPCRYTTPLQGVAAYAARVAHQAGCTDVACRGNQPIAAAVEAARQADATVVVAGL 360

Query: 347 DQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
           DQ +EAE +DR  LLLPGRQ EL+S VAKAS+GPV+LVLM GGP+D++FA+NDPRI  IL
Sbjct: 361 DQRVEAEGLDRTTLLLPGRQAELISAVAKASKGPVILVLMSGGPIDIAFAQNDPRIDGIL 420

Query: 407 WVGYPGQAGGAAIADVLFGRANPG 430
           WVGYPGQAGG AIADV+FG  NPG
Sbjct: 421 WVGYPGQAGGQAIADVIFGHHNPG 444


>gi|449438167|ref|XP_004136861.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Cucumis sativus]
          Length = 782

 Score =  594 bits (1532), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 293/432 (67%), Positives = 340/432 (78%), Gaps = 4/432 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG---SRL 57
           MYN G+AGLTYWSPNVNIFRDPRWGRGQETPGEDP+L+ KYAA YVRGLQ        RL
Sbjct: 162 MYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQQRDDGDPDRL 221

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYDLDNW G DRYHFNA VS QDLEDT+  PFK+CV++G VASVMCSYNQ
Sbjct: 222 KVAACCKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVIDGNVASVMCSYNQ 281

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNGKPTCADPD+L   I GQW+L+GYIVSDCDSV VLYN+QHYT++PEEAAA  I AGLD
Sbjct: 282 VNGKPTCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHYTKSPEEAAAKTILAGLD 341

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           LDCG FL  HTE AV GGL+ E  ++ A+   +   MRLG FDG PS Q +G LGP+DVC
Sbjct: 342 LDCGDFLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPSKQLYGKLGPKDVC 401

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           TP HQ+LA +AA QGIVLLKNS ++LPLS+    ++AVIGPN++VT TMIGNY G  C Y
Sbjct: 402 TPEHQELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTKTMIGNYEGTPCKY 461

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TTPLQG+S    T  Q GC  VAC   QL  A ++AA  ADATVLV+G DQSIEAE  DR
Sbjct: 462 TTPLQGLSAVVSTSFQPGCANVACTSAQLDEAKKIAA-SADATVLVVGSDQSIEAESRDR 520

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             L LPG+Q  L++ VAKAS+GPV+LV+M GG +D++FAK D +I +ILWVG+PG+AGGA
Sbjct: 521 VDLNLPGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITSILWVGFPGEAGGA 580

Query: 418 AIADVLFGRANP 429
           AIADV+FG  NP
Sbjct: 581 AIADVIFGSFNP 592


>gi|255545293|ref|XP_002513707.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223547158|gb|EEF48654.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 777

 Score =  592 bits (1527), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 285/432 (65%), Positives = 338/432 (78%), Gaps = 4/432 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ---GNTGSRL 57
           MYN G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KY + YVRGLQ        RL
Sbjct: 156 MYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSCYVRGLQQTDNGDSERL 215

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYDLDNW G DRYHFNA V+KQDL+DT+  PFK+CV++G VASVMCSYNQ
Sbjct: 216 KVAACCKHYTAYDLDNWKGTDRYHFNAVVTKQDLDDTFQPPFKSCVIDGNVASVMCSYNQ 275

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNGKPTCADPD+L   I G+W+L+GYIVSDCDSV V+YN+QHYT+TPEEAAA  I AGLD
Sbjct: 276 VNGKPTCADPDLLAGIIRGEWKLNGYIVSDCDSVDVIYNSQHYTKTPEEAAAITILAGLD 335

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           L+CG FL  HTE AV  GLL    V+ A++      MRLG FDG+PS Q +G LGP+DVC
Sbjct: 336 LNCGSFLGKHTEAAVNAGLLNVSAVDKAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVC 395

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           T  +Q+LA +AA QGIVLLKNS  +LPLS     T+AVIGPN++VT TMIGNY G  C Y
Sbjct: 396 TAVNQELAREAARQGIVLLKNSPGSLPLSPTAIKTLAVIGPNANVTKTMIGNYEGTPCKY 455

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TTPLQG++    T + AGC  VAC   Q+  A ++AA  ADATVLVMG DQSIEAE  DR
Sbjct: 456 TTPLQGLTASVATTYLAGCSNVACAAAQVDDAKKLAA-SADATVLVMGADQSIEAESRDR 514

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +LLPG+QQ L+++VA  S+GPV+LV+M GG +DVSFAK + +I +ILWVGYPG+AGGA
Sbjct: 515 VDVLLPGQQQLLITQVANVSKGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGA 574

Query: 418 AIADVLFGRANP 429
           AIADV+FG  NP
Sbjct: 575 AIADVIFGYYNP 586


>gi|449479116|ref|XP_004155509.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Cucumis sativus]
          Length = 809

 Score =  592 bits (1526), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 293/432 (67%), Positives = 340/432 (78%), Gaps = 4/432 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG---SRL 57
           MYN G+AGLTYWSPNVNIFRDPRWGRGQETPGEDP+L+ KYAA YVRGLQ        RL
Sbjct: 189 MYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQQRDDGDPDRL 248

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYDLDNW G DRYHFNA VS QDLEDT+  PFK+CV++G VASVMCSYNQ
Sbjct: 249 KVAACCKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVIDGNVASVMCSYNQ 308

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNGKPTCADPD+L   I GQW+L+GYIVSDCDSV VLYN+QHYT++PEEAAA  I AGLD
Sbjct: 309 VNGKPTCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHYTKSPEEAAAKTILAGLD 368

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           LDCG FL  HTE AV GGL+ E  ++ A+   +   MRLG FDG PS Q +G LGP+DVC
Sbjct: 369 LDCGDFLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPSKQLYGKLGPKDVC 428

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           TP HQ+LA +AA QGIVLLKNS ++LPLS+    ++AVIGPN++VT TMIGNY G  C Y
Sbjct: 429 TPEHQELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTKTMIGNYEGTPCKY 488

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TTPLQG+S    T  Q GC  VAC   QL  A ++AA  ADATVLV+G DQSIEAE  DR
Sbjct: 489 TTPLQGLSAVVSTSFQPGCANVACTSAQLDEAKKIAA-SADATVLVVGSDQSIEAESRDR 547

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             L LPG+Q  L++ VAKAS+GPV+LV+M GG +D++FAK D +I +ILWVG+PG+AGGA
Sbjct: 548 VDLNLPGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITSILWVGFPGEAGGA 607

Query: 418 AIADVLFGRANP 429
           AIADV+FG  NP
Sbjct: 608 AIADVIFGSFNP 619


>gi|125535275|gb|EAY81823.1| hypothetical protein OsI_36995 [Oryza sativa Indica Group]
          Length = 885

 Score =  585 bits (1508), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 298/435 (68%), Positives = 353/435 (81%), Gaps = 5/435 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS--RLK 58
           MYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP +  +YAA+YVRGLQ    S  RLK
Sbjct: 236 MYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQQQQPSSGRLK 295

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           +AACCKH+TAYDLDNW+G DR+HFNA V++QDLEDT+NVPF++CVV+G+ ASVMCSYNQV
Sbjct: 296 LAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSCVVDGRAASVMCSYNQV 355

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           NG PTCAD   L+ TI  +W L GYIVSDCDSV V Y+ QHYTRT E+A A  ++AGLDL
Sbjct: 356 NGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHYTRTREDAVAATLRAGLDL 415

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCGPFLA +TEGAV  G + + D++ A+  T+TVQMRLGMFDG+P+AQPFG+LGP+ VCT
Sbjct: 416 DCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFDGDPAAQPFGHLGPQHVCT 475

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
            AHQ+LA++AA QGIVLLKN  R LPLS       VAV+GP+++ TV MIGNYAG  C Y
Sbjct: 476 AAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPHAEATVAMIGNYAGKPCRY 535

Query: 298 TTPLQGISRY-AKTIHQAGCFGVACNGN-QLIGAAEVAARQADATVLVMGLDQSIEAEFI 355
           TTPLQG++RY A+  HQ GC  VAC G+ Q I AA  AAR+ADAT++V GLDQ IEAE +
Sbjct: 536 TTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRADATIVVAGLDQKIEAEGL 595

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DRA LLLPGRQ EL+S VAKAS+GPV+LVLM GGP+D+ FA+NDP+I  ILW GYPGQAG
Sbjct: 596 DRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQNDPKIAGILWAGYPGQAG 655

Query: 416 GAAIADVLFGRANPG 430
           G AIADV+FG  NPG
Sbjct: 656 GQAIADVIFGHHNPG 670


>gi|74355968|dbj|BAE44362.1| alpha-L-arabinofuranosidase [Raphanus sativus]
          Length = 780

 Score =  585 bits (1508), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 283/432 (65%), Positives = 339/432 (78%), Gaps = 5/432 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS---RL 57
           MYN G+AGLTYWSPNVNIFRDPRWGRGQETPGEDP+L+ KYA+ YV+GLQ    S   RL
Sbjct: 154 MYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVKGLQETDSSDANRL 213

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYD+DNW GV+RY FNA V++QDL+DTY  PFK+CVV+G VASVMCSYN+
Sbjct: 214 KVAACCKHYTAYDVDNWKGVERYSFNAVVNQQDLDDTYQPPFKSCVVDGNVASVMCSYNK 273

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNGKPTCADPD+L   I G+W+L+GYIVSDCDSV VLY  QHYT+TPEEAAA +I AGLD
Sbjct: 274 VNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQHYTKTPEEAAAISINAGLD 333

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           L+CG FL  HTE AV+ GL++E  ++ A+       MRLG FDG+P  Q +G LGP+DVC
Sbjct: 334 LNCGYFLGDHTEAAVKAGLVKEAAIDKAITNNFLTLMRLGFFDGDPKKQIYGGLGPKDVC 393

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           TPA+Q+LA +AA QGIVLLKN+   LPLS     T+AVIGPN++VT TMIGNY G  C Y
Sbjct: 394 TPANQELAAEAARQGIVLLKNTG-ALPLSPKTIKTLAVIGPNANVTKTMIGNYEGTPCKY 452

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TTPLQG++    T +  GC  VAC    + G+ ++AA  +DATVLV+G DQSIEAE  DR
Sbjct: 453 TTPLQGLAGTVHTTYLPGCSNVACAVADVAGSTKLAA-ASDATVLVIGADQSIEAESRDR 511

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             L LPG+QQELV++VAKA++GPV LV+M GG  D++FAKND +I  ILWVGYPG+AGG 
Sbjct: 512 VDLNLPGQQQELVTQVAKAAKGPVFLVIMSGGGFDITFAKNDAKIAGILWVGYPGEAGGI 571

Query: 418 AIADVLFGRANP 429
           A ADV+FGR NP
Sbjct: 572 ATADVIFGRYNP 583


>gi|77552476|gb|ABA95273.1| Beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 883

 Score =  585 bits (1507), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 298/435 (68%), Positives = 353/435 (81%), Gaps = 5/435 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS--RLK 58
           MYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP +  +YAA+YVRGLQ    S  RLK
Sbjct: 236 MYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQQQQPSSGRLK 295

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           +AACCKH+TAYDLDNW+G DR+HFNA V++QDLEDT+NVPF++CVV+G+ ASVMCSYNQV
Sbjct: 296 LAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSCVVDGRAASVMCSYNQV 355

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           NG PTCAD   L+ TI  +W L GYIVSDCDSV V Y+ QHYTRT E+A A  ++AGLDL
Sbjct: 356 NGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHYTRTREDAVAATLRAGLDL 415

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCGPFLA +TEGAV  G + + D++ A+  T+TVQMRLGMFDG+P+AQPFG+LGP+ VCT
Sbjct: 416 DCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFDGDPAAQPFGHLGPQHVCT 475

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
            AHQ+LA++AA QGIVLLKN  R LPLS       VAV+GP+++ TV MIGNYAG  C Y
Sbjct: 476 AAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPHAEATVAMIGNYAGKPCRY 535

Query: 298 TTPLQGISRY-AKTIHQAGCFGVACNGN-QLIGAAEVAARQADATVLVMGLDQSIEAEFI 355
           TTPLQG++RY A+  HQ GC  VAC G+ Q I AA  AAR+ADAT++V GLDQ IEAE +
Sbjct: 536 TTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRADATIVVAGLDQKIEAEGL 595

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DRA LLLPGRQ EL+S VAKAS+GPV+LVLM GGP+D+ FA+NDP+I  ILW GYPGQAG
Sbjct: 596 DRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQNDPKIAGILWAGYPGQAG 655

Query: 416 GAAIADVLFGRANPG 430
           G AIADV+FG  NPG
Sbjct: 656 GQAIADVIFGHHNPG 670


>gi|359481045|ref|XP_002268626.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Vitis vinifera]
 gi|296089342|emb|CBI39114.3| unnamed protein product [Vitis vinifera]
          Length = 774

 Score =  583 bits (1503), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 284/432 (65%), Positives = 341/432 (78%), Gaps = 4/432 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ-GNTGS--RL 57
           MYN G+AGLT+WSPNVNIFRDPRWGRGQETPGEDP+L+ KYA++YVRGLQ G+ GS  RL
Sbjct: 154 MYNVGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASAYVRGLQQGDDGSPDRL 213

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYDLDNW GVDR HFNA V+KQD++DT+  PFK+CV++G VASVMCS+NQ
Sbjct: 214 KVAACCKHYTAYDLDNWKGVDRLHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSFNQ 273

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNGKPTCADPD+L   + G+W+L+GYIVSDCDSV V YN+QHYT+TPEEAAA AI AGLD
Sbjct: 274 VNGKPTCADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYNSQHYTKTPEEAAAKAILAGLD 333

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           L+CG FL  HTE AV+GGL+ E  V+ A++      MRLG FDG PS   +G LGP+DVC
Sbjct: 334 LNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGPKDVC 393

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           T  HQ++A +AA QGIVLLKNS  +LPLS     T+A+IGPN++VT TMIGNY G  C Y
Sbjct: 394 TSEHQEMAREAARQGIVLLKNSKGSLPLSPTAIKTLAIIGPNANVTKTMIGNYEGTPCKY 453

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TTPLQG++    T +  GC  VAC G   I  A+  A  ADATVL++G+DQSIEAE  DR
Sbjct: 454 TTPLQGLTALVATTYLPGCSNVAC-GTAQIDEAKKIAAAADATVLIVGIDQSIEAEGRDR 512

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             + LPG+Q  L++ VAKAS+G V+LV+M GG  D+SFAKND +I +ILWVGYPG+AGGA
Sbjct: 513 VSIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISFAKNDDKIASILWVGYPGEAGGA 572

Query: 418 AIADVLFGRANP 429
           AIADV+FG  NP
Sbjct: 573 AIADVIFGFYNP 584


>gi|115486595|ref|NP_001068441.1| Os11g0673200 [Oryza sativa Japonica Group]
 gi|113645663|dbj|BAF28804.1| Os11g0673200 [Oryza sativa Japonica Group]
          Length = 822

 Score =  583 bits (1502), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 298/435 (68%), Positives = 353/435 (81%), Gaps = 5/435 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS--RLK 58
           MYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP +  +YAA+YVRGLQ    S  RLK
Sbjct: 175 MYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQQQQPSSGRLK 234

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           +AACCKH+TAYDLDNW+G DR+HFNA V++QDLEDT+NVPF++CVV+G+ ASVMCSYNQV
Sbjct: 235 LAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSCVVDGRAASVMCSYNQV 294

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           NG PTCAD   L+ TI  +W L GYIVSDCDSV V Y+ QHYTRT E+A A  ++AGLDL
Sbjct: 295 NGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHYTRTREDAVAATLRAGLDL 354

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCGPFLA +TEGAV  G + + D++ A+  T+TVQMRLGMFDG+P+AQPFG+LGP+ VCT
Sbjct: 355 DCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFDGDPAAQPFGHLGPQHVCT 414

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
            AHQ+LA++AA QGIVLLKN  R LPLS       VAV+GP+++ TV MIGNYAG  C Y
Sbjct: 415 AAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPHAEATVAMIGNYAGKPCRY 474

Query: 298 TTPLQGISRY-AKTIHQAGCFGVACNGN-QLIGAAEVAARQADATVLVMGLDQSIEAEFI 355
           TTPLQG++RY A+  HQ GC  VAC G+ Q I AA  AAR+ADAT++V GLDQ IEAE +
Sbjct: 475 TTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRADATIVVAGLDQKIEAEGL 534

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DRA LLLPGRQ EL+S VAKAS+GPV+LVLM GGP+D+ FA+NDP+I  ILW GYPGQAG
Sbjct: 535 DRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQNDPKIAGILWAGYPGQAG 594

Query: 416 GAAIADVLFGRANPG 430
           G AIADV+FG  NPG
Sbjct: 595 GQAIADVIFGHHNPG 609


>gi|356524862|ref|XP_003531047.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Glycine max]
          Length = 765

 Score =  583 bits (1502), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 278/432 (64%), Positives = 340/432 (78%), Gaps = 4/432 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG---SRL 57
           MYN G+AGLTYWSPN+NIFRDPRWGRG ETPGEDPVLT KYAA YV+GLQ   G   ++L
Sbjct: 145 MYNVGLAGLTYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQQTDGGDPNKL 204

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYD+DNW G+ RY FNA V+KQD+EDT+  PFK+CV++G VASVMCSYN+
Sbjct: 205 KVAACCKHYTAYDVDNWKGIQRYTFNAVVTKQDMEDTFQPPFKSCVIDGNVASVMCSYNK 264

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNGKPTCADPD+LK  + G+W+L+GYIVSDCDSV VLY  QHYT+TPEEAAA +I AGLD
Sbjct: 265 VNGKPTCADPDLLKGVVRGEWKLNGYIVSDCDSVEVLYKDQHYTKTPEEAAAISILAGLD 324

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           L+CG FL  +TEGAV+ GL+ E  +N A+       MRLG FDG+P  QP+GNLGP+DVC
Sbjct: 325 LNCGRFLGQYTEGAVKQGLIDEASINNAVTNNFATLMRLGFFDGDPRKQPYGNLGPKDVC 384

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           T  +Q+LA +AA QGIVLLKNS  +LPL+     ++AVIGPN++ T  MIGNY G+ C Y
Sbjct: 385 TQENQELAREAARQGIVLLKNSPASLPLNAKAIKSLAVIGPNANATRVMIGNYEGIPCKY 444

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
            +PLQG++ +A T + AGC  V C    L  A ++AA  ADATV+V+G   +IEAE +DR
Sbjct: 445 ISPLQGLTAFAPTSYAAGCLDVRCPNPVLDDAKKIAA-SADATVIVVGASLAIEAESLDR 503

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +LLPG+QQ LVS VA AS+GPV+LV+M GG +DVSFAKN+ +I +ILWVGYPG+AGGA
Sbjct: 504 VNILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAKNNNKITSILWVGYPGEAGGA 563

Query: 418 AIADVLFGRANP 429
           AIADV+FG  NP
Sbjct: 564 AIADVIFGFHNP 575


>gi|359485890|ref|XP_002264183.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Vitis vinifera]
          Length = 774

 Score =  582 bits (1501), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 285/432 (65%), Positives = 339/432 (78%), Gaps = 4/432 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ-GNTGS--RL 57
           MYN G+AGLT+WSPNVNIFRDPRWGRGQETPGEDP+L+ KYA+ YVRGLQ  + GS  RL
Sbjct: 154 MYNVGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVRGLQQSDDGSPDRL 213

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYDLDNW GVDR+HFNA V+KQD++DT+  PFK+CV++G VASVMCSYNQ
Sbjct: 214 KVAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQ 273

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNGKP CADPD+L   + G+W+L+GYIVSDCDSV V YN+QHYT+TPEEAAA AI AGLD
Sbjct: 274 VNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYNSQHYTKTPEEAAAKAILAGLD 333

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           L+CG FL  HTE AV+GGL+ E  V+ A++      MRLG FDG PS   +G LGP+DVC
Sbjct: 334 LNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGPKDVC 393

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           T  HQ+LA +AA QGIVLLKNS  +LPLS     T+AVIGPN++VT TMIGNY G  C Y
Sbjct: 394 TSEHQELAREAARQGIVLLKNSKGSLPLSPTAIKTLAVIGPNANVTKTMIGNYEGTPCKY 453

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TTPLQG++    T +  GC  VAC G   I  A+  A  ADATVL++G+DQSIEAE  DR
Sbjct: 454 TTPLQGLTALVATTYLPGCSNVAC-GTAQIDEAKKIAAAADATVLIVGIDQSIEAEGRDR 512

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             + LPG+Q  L++ VAKAS+G V+LV+M GG  D+SFAKND +I +ILWVGYPG+AGGA
Sbjct: 513 VNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISFAKNDDKITSILWVGYPGEAGGA 572

Query: 418 AIADVLFGRANP 429
           AIADV+FG  NP
Sbjct: 573 AIADVIFGFYNP 584


>gi|297745522|emb|CBI40687.3| unnamed protein product [Vitis vinifera]
          Length = 751

 Score =  582 bits (1499), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 285/432 (65%), Positives = 339/432 (78%), Gaps = 4/432 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ-GNTGS--RL 57
           MYN G+AGLT+WSPNVNIFRDPRWGRGQETPGEDP+L+ KYA+ YVRGLQ  + GS  RL
Sbjct: 154 MYNVGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVRGLQQSDDGSPDRL 213

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYDLDNW GVDR+HFNA V+KQD++DT+  PFK+CV++G VASVMCSYNQ
Sbjct: 214 KVAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQ 273

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNGKP CADPD+L   + G+W+L+GYIVSDCDSV V YN+QHYT+TPEEAAA AI AGLD
Sbjct: 274 VNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYNSQHYTKTPEEAAAKAILAGLD 333

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           L+CG FL  HTE AV+GGL+ E  V+ A++      MRLG FDG PS   +G LGP+DVC
Sbjct: 334 LNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGPKDVC 393

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           T  HQ+LA +AA QGIVLLKNS  +LPLS     T+AVIGPN++VT TMIGNY G  C Y
Sbjct: 394 TSEHQELAREAARQGIVLLKNSKGSLPLSPTAIKTLAVIGPNANVTKTMIGNYEGTPCKY 453

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TTPLQG++    T +  GC  VAC G   I  A+  A  ADATVL++G+DQSIEAE  DR
Sbjct: 454 TTPLQGLTALVATTYLPGCSNVAC-GTAQIDEAKKIAAAADATVLIVGIDQSIEAEGRDR 512

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             + LPG+Q  L++ VAKAS+G V+LV+M GG  D+SFAKND +I +ILWVGYPG+AGGA
Sbjct: 513 VNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISFAKNDDKITSILWVGYPGEAGGA 572

Query: 418 AIADVLFGRANP 429
           AIADV+FG  NP
Sbjct: 573 AIADVIFGFYNP 584


>gi|297797477|ref|XP_002866623.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
 gi|297312458|gb|EFH42882.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
          Length = 784

 Score =  580 bits (1495), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 282/432 (65%), Positives = 335/432 (77%), Gaps = 5/432 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ---GNTGSRL 57
           MYN G+AGLTYWSPNVNIFRDPRWGRGQETPGEDP+L  KYA+ YV+GLQ   G   +RL
Sbjct: 159 MYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLASKYASGYVKGLQETDGGDSNRL 218

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYD+DNW GV+RY FNA V++QD++DTY  PFK+CVV+G VASVMCSYNQ
Sbjct: 219 KVAACCKHYTAYDVDNWKGVERYSFNAVVTQQDMDDTYQPPFKSCVVDGNVASVMCSYNQ 278

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNGKPTCADPD+L   I G+W+L+GYIVSDCDSV VLY  QHYT+TP EAAA +I AGLD
Sbjct: 279 VNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQHYTKTPAEAAAISILAGLD 338

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           L+CG FL  HTE AV+ GL+ E  ++ A++      MRLG FDG P  Q +G LGP DVC
Sbjct: 339 LNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLGFFDGNPKNQIYGGLGPTDVC 398

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           T A+Q+LA  AA QGIVLLKN+   LPLS     T+AVIGPN++VT TMIGNY G  C Y
Sbjct: 399 TSANQELAADAARQGIVLLKNTG-FLPLSPKSIKTLAVIGPNANVTKTMIGNYEGTPCKY 457

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TTPLQG++    T +  GC  VAC    + GA ++AA  AD TVL++G DQSIEAE  DR
Sbjct: 458 TTPLQGLAGAVSTTYLPGCSNVACAVADVAGATKLAA-TADVTVLLIGADQSIEAESRDR 516

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             L LPG+QQELV +VAKA++GPV+LV+M GG  D++FAKNDP+I  ILWVGYPG+AGG 
Sbjct: 517 VDLNLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAKNDPKIAGILWVGYPGEAGGI 576

Query: 418 AIADVLFGRANP 429
           AIAD++FGR NP
Sbjct: 577 AIADIIFGRYNP 588


>gi|15237736|ref|NP_201262.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
 gi|75262663|sp|Q9FLG1.1|BXL4_ARATH RecName: Full=Beta-D-xylosidase 4; Short=AtBXL4; Flags: Precursor
 gi|10178060|dbj|BAB11424.1| beta-xylosidase [Arabidopsis thaliana]
 gi|332010539|gb|AED97922.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
          Length = 784

 Score =  580 bits (1494), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 282/432 (65%), Positives = 335/432 (77%), Gaps = 5/432 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ---GNTGSRL 57
           MYN G+AGLTYWSPNVNIFRDPRWGRGQETPGEDP+L  KYA+ YV+GLQ   G   +RL
Sbjct: 159 MYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLASKYASGYVKGLQETDGGDSNRL 218

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYD+DNW GV+RY FNA V++QD++DTY  PFK+CVV+G VASVMCSYNQ
Sbjct: 219 KVAACCKHYTAYDVDNWKGVERYSFNAVVTQQDMDDTYQPPFKSCVVDGNVASVMCSYNQ 278

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNGKPTCADPD+L   I G+W+L+GYIVSDCDSV VLY  QHYT+TP EAAA +I AGLD
Sbjct: 279 VNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQHYTKTPAEAAAISILAGLD 338

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           L+CG FL  HTE AV+ GL+ E  ++ A++      MRLG FDG P  Q +G LGP DVC
Sbjct: 339 LNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLGFFDGNPKNQIYGGLGPTDVC 398

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           T A+Q+LA  AA QGIVLLKN+   LPLS     T+AVIGPN++VT TMIGNY G  C Y
Sbjct: 399 TSANQELAADAARQGIVLLKNTG-CLPLSPKSIKTLAVIGPNANVTKTMIGNYEGTPCKY 457

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TTPLQG++    T +  GC  VAC    + GA ++AA  AD +VLV+G DQSIEAE  DR
Sbjct: 458 TTPLQGLAGTVSTTYLPGCSNVACAVADVAGATKLAA-TADVSVLVIGADQSIEAESRDR 516

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             L LPG+QQELV +VAKA++GPV+LV+M GG  D++FAKNDP+I  ILWVGYPG+AGG 
Sbjct: 517 VDLHLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAKNDPKIAGILWVGYPGEAGGI 576

Query: 418 AIADVLFGRANP 429
           AIAD++FGR NP
Sbjct: 577 AIADIIFGRYNP 588


>gi|242077366|ref|XP_002448619.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
 gi|241939802|gb|EES12947.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
          Length = 767

 Score =  578 bits (1491), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 286/432 (66%), Positives = 334/432 (77%), Gaps = 4/432 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ--GNTGSRLK 58
           M+N G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP+LT KYA  YV GLQ  G+    LK
Sbjct: 148 MHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQDAGSGSGSLK 207

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           VAACCKHYTAYD+DNW GV+RY FNA VS+QDL+DT+  PFK+CVV+G VASVMCSYNQV
Sbjct: 208 VAACCKHYTAYDVDNWKGVERYTFNAVVSQQDLDDTFQPPFKSCVVDGNVASVMCSYNQV 267

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           NGKPTCAD D+L   I G W+L+GYI SDCDSV VLYN QHYT+TPE+AAA +IKAGLDL
Sbjct: 268 NGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAISIKAGLDL 327

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG FLA HT  AV+ G L E DV+ A+       MRLG FDG+P   PFGNLGP DVCT
Sbjct: 328 NCGNFLAQHTVAAVQAGKLSESDVDRAITNNFITLMRLGFFDGDPRKLPFGNLGPSDVCT 387

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
            ++Q+LA +AA QGIVLLKNS   LPLS     ++AVIGPN++ + TMIGNY G  C YT
Sbjct: 388 SSNQELAREAARQGIVLLKNSG-ALPLSASSIKSLAVIGPNANASFTMIGNYEGTPCKYT 446

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQL-IGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TPLQG+     T++Q GC  V C+GN L + AA  AA  AD TVLV+G DQSIE E +DR
Sbjct: 447 TPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSIERESLDR 506

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             LLLPG+Q +LVS VA ASRGP +LV+M GGP D+SFAK+  +I AILWVGYPG+AGGA
Sbjct: 507 TSLLLPGQQPQLVSAVANASRGPCILVIMSGGPFDISFAKSSDKIAAILWVGYPGEAGGA 566

Query: 418 AIADVLFGRANP 429
           AIADVLFG  NP
Sbjct: 567 AIADVLFGHHNP 578


>gi|413919688|gb|AFW59620.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 773

 Score =  577 bits (1486), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 284/432 (65%), Positives = 336/432 (77%), Gaps = 4/432 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG--NTGSRLK 58
           M+N G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP+LT KYA  YV GLQG  +    LK
Sbjct: 154 MHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQGAVSGAGALK 213

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           VAACCKHYTAYD+DNW GV+RY F+A VS+QDL+DT+  PFK+CVV+G VASVMCSYNQV
Sbjct: 214 VAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVVDGNVASVMCSYNQV 273

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           NGKPTCAD D+L   I G W+L+GYI SDCDSV VLYN QHYT+TPE+AAA +IKAGLDL
Sbjct: 274 NGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAISIKAGLDL 333

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG FLA HT  AV+ G L E DV+ A+   +   MRLG FDG+P   PFGNLGP DVCT
Sbjct: 334 NCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNLGPSDVCT 393

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           P++Q+LA +AA QGIVLLKN+ + LPLS     ++AVIGPN++ + TMIGNY G  C YT
Sbjct: 394 PSNQELAREAARQGIVLLKNTGK-LPLSATSIKSMAVIGPNANASFTMIGNYEGTPCKYT 452

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQL-IGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TPLQG+     T++Q GC  V C+GN L + AA  AA  AD TVLV+G DQSIE E +DR
Sbjct: 453 TPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSIERESLDR 512

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             LLLPG+Q +LVS VA AS GP +LV+M GGP D+SFAK+  +I AILWVGYPG+AGGA
Sbjct: 513 TSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVGYPGEAGGA 572

Query: 418 AIADVLFGRANP 429
           AIADVLFG  NP
Sbjct: 573 AIADVLFGYHNP 584


>gi|356558612|ref|XP_003547598.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Glycine max]
          Length = 776

 Score =  574 bits (1479), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 271/432 (62%), Positives = 338/432 (78%), Gaps = 4/432 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT---GSRL 57
           MYN G+AGLTYWSPN+NIFRDPRWGRGQETPGEDP+L+ KYA  YV+GLQ       ++L
Sbjct: 156 MYNVGLAGLTYWSPNINIFRDPRWGRGQETPGEDPLLSSKYATGYVKGLQQTDDGDSNKL 215

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYDLDNW G+ RY FNA V++QD++DT+  PFK+CV++G VASVMCSYNQ
Sbjct: 216 KVAACCKHYTAYDLDNWKGIQRYTFNAVVTQQDMDDTFQPPFKSCVIDGNVASVMCSYNQ 275

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNGKPTCADPD+LK  I G+W+L+GYIVSDCDSV VL+  QHYT+TPEEAAA  I AGLD
Sbjct: 276 VNGKPTCADPDLLKGIIRGEWKLNGYIVSDCDSVEVLFKDQHYTKTPEEAAAQTILAGLD 335

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           L+CG +L  +TEGAV+ GLL E  +N A++      MRLG FDG+PS QP+GNLGP+DVC
Sbjct: 336 LNCGNYLGQYTEGAVKQGLLDEASINNAVSNNFATLMRLGFFDGDPSKQPYGNLGPKDVC 395

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           T  +++LA +AA QGIVLLKNS  +LPL+     ++AVIGPN++ T  MIGNY G+ C Y
Sbjct: 396 TSENRELAREAARQGIVLLKNSPGSLPLNAKTIKSLAVIGPNANATRVMIGNYEGIPCNY 455

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
            +PLQ ++    T + AGC  V C   +L  A ++AA  ADATV+++G   +IEAE +DR
Sbjct: 456 ISPLQTLTALVPTSYAAGCPNVQCANAELDDATQIAA-SADATVIIVGASLAIEAESLDR 514

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +LLPG+QQ LVS VA AS+GPV+LV+M GG +DVSFAK++ +I +ILWVGYPG+AGGA
Sbjct: 515 INILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAKSNDKITSILWVGYPGEAGGA 574

Query: 418 AIADVLFGRANP 429
           AIADV+FG  NP
Sbjct: 575 AIADVIFGFYNP 586


>gi|292630923|sp|A5JTQ3.1|XYL2_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 2;
           AltName: Full=Xylan
           1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 2;
           Short=MsXyl2; Includes: RecName: Full=Beta-xylosidase;
           AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
           Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
           Full=Alpha-N-arabinofuranosidase; AltName:
           Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
           Flags: Precursor
 gi|146762263|gb|ABQ45228.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
           varia]
          Length = 774

 Score =  573 bits (1476), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 273/432 (63%), Positives = 337/432 (78%), Gaps = 4/432 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT---GSRL 57
           M+N G+AGLTYWSPN+NIFRDPRWGRGQETPGEDP+L  KYAA YV+GLQ       ++L
Sbjct: 154 MHNVGLAGLTYWSPNINIFRDPRWGRGQETPGEDPLLASKYAAGYVKGLQQTDDGDSNKL 213

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYD+D+W GV RY FNA V++QDL+DTY  PFK+CV++G VASVMCSYNQ
Sbjct: 214 KVAACCKHYTAYDVDDWKGVQRYTFNAVVTQQDLDDTYQPPFKSCVIDGNVASVMCSYNQ 273

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNGKPTCADPD+LK  I G+W+L+GYIVSDCDSV VL+  QHYT+TPEEAAA +I AGLD
Sbjct: 274 VNGKPTCADPDLLKGVIRGKWKLNGYIVSDCDSVDVLFKNQHYTKTPEEAAAKSILAGLD 333

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           L+CG FL  +TEGAV+ GL+ E  +N A+       MRLG FDG+PS QP+GNLGP+DVC
Sbjct: 334 LNCGSFLGRYTEGAVKQGLIGEASINNAVYNNFATLMRLGFFDGDPSKQPYGNLGPKDVC 393

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           T A+Q+LA +AA QGIVLLKN A +LPL+     ++AVIGPN++ T  MIGNY G+ C Y
Sbjct: 394 TSANQELAREAARQGIVLLKNCAGSLPLNAKAIKSLAVIGPNANATRAMIGNYEGIPCKY 453

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           T+PLQG++    T   AGC  V C    L  A ++AA  ADATV+V+G + +IEAE  DR
Sbjct: 454 TSPLQGLTALVPTSFAAGCPDVQCTNAALDDAKKIAA-SADATVIVVGANLAIEAESHDR 512

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +LLPG+QQ+LV+ VA  ++GPV+L +M GG +DVSFAK + +I +ILWVGYPG+AGGA
Sbjct: 513 INILLPGQQQQLVTEVANVAKGPVILAIMSGGGMDVSFAKTNKKITSILWVGYPGEAGGA 572

Query: 418 AIADVLFGRANP 429
           AIADV+FG  NP
Sbjct: 573 AIADVIFGYHNP 584


>gi|115460876|ref|NP_001054038.1| Os04g0640700 [Oryza sativa Japonica Group]
 gi|38344900|emb|CAE02971.2| OSJNBb0079B02.3 [Oryza sativa Japonica Group]
 gi|113565609|dbj|BAF15952.1| Os04g0640700 [Oryza sativa Japonica Group]
 gi|116310882|emb|CAH67823.1| OSIGBa0138H21-OSIGBa0138E01.14 [Oryza sativa Indica Group]
 gi|218195682|gb|EEC78109.1| hypothetical protein OsI_17615 [Oryza sativa Indica Group]
          Length = 765

 Score =  572 bits (1475), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 279/433 (64%), Positives = 335/433 (77%), Gaps = 4/433 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ--GNTGSRLK 58
           M+N G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L  KYA  YV GLQ  G     LK
Sbjct: 145 MHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGGGSDALK 204

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           VAACCKHYTAYD+DNW GV+RY F+A VS+QDL+DT+  PFK+CV++G VASVMCSYN+V
Sbjct: 205 VAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKV 264

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           NGKPTCAD D+L   I G W+L+GYIVSDCDSV VLYN QHYT+ PE+AAA  IK+GLDL
Sbjct: 265 NGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKNPEDAAAITIKSGLDL 324

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG FLA HT  AV+ G L E DV+ A+     V MRLG FDG+P   PFG+LGP+DVCT
Sbjct: 325 NCGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPRKLPFGSLGPKDVCT 384

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
            ++Q+LA +AA QGIVLLKN+   LPLS     ++AVIGPN++ + TMIGNY G  C YT
Sbjct: 385 SSNQELAREAARQGIVLLKNTG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYT 443

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQL-IGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TPLQG+     T++Q GC  V C+GN L + AA  AA  AD TVLV+G DQS+E E +DR
Sbjct: 444 TPLQGLGANVATVYQPGCTNVGCSGNSLQLSAATQAAASADVTVLVVGADQSVERESLDR 503

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             LLLPG+Q +LVS VA ASRGPV+LV+M GGP D+SFAK+  +I AILWVGYPG+AGGA
Sbjct: 504 TSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSDKISAILWVGYPGEAGGA 563

Query: 418 AIADVLFGRANPG 430
           A+AD+LFG  NPG
Sbjct: 564 ALADILFGYHNPG 576


>gi|32488698|emb|CAE03635.1| OSJNBb0003B01.27 [Oryza sativa Japonica Group]
          Length = 839

 Score =  572 bits (1473), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 279/433 (64%), Positives = 335/433 (77%), Gaps = 4/433 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ--GNTGSRLK 58
           M+N G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L  KYA  YV GLQ  G     LK
Sbjct: 219 MHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGGGSDALK 278

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           VAACCKHYTAYD+DNW GV+RY F+A VS+QDL+DT+  PFK+CV++G VASVMCSYN+V
Sbjct: 279 VAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKV 338

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           NGKPTCAD D+L   I G W+L+GYIVSDCDSV VLYN QHYT+ PE+AAA  IK+GLDL
Sbjct: 339 NGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKNPEDAAAITIKSGLDL 398

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG FLA HT  AV+ G L E DV+ A+     V MRLG FDG+P   PFG+LGP+DVCT
Sbjct: 399 NCGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPRKLPFGSLGPKDVCT 458

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
            ++Q+LA +AA QGIVLLKN+   LPLS     ++AVIGPN++ + TMIGNY G  C YT
Sbjct: 459 SSNQELAREAARQGIVLLKNTG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYT 517

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQL-IGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TPLQG+     T++Q GC  V C+GN L + AA  AA  AD TVLV+G DQS+E E +DR
Sbjct: 518 TPLQGLGANVATVYQPGCTNVGCSGNSLQLSAATQAAASADVTVLVVGADQSVERESLDR 577

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             LLLPG+Q +LVS VA ASRGPV+LV+M GGP D+SFAK+  +I AILWVGYPG+AGGA
Sbjct: 578 TSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSDKISAILWVGYPGEAGGA 637

Query: 418 AIADVLFGRANPG 430
           A+AD+LFG  NPG
Sbjct: 638 ALADILFGYHNPG 650


>gi|356510699|ref|XP_003524073.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Glycine max]
          Length = 613

 Score =  572 bits (1473), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 273/434 (62%), Positives = 341/434 (78%), Gaps = 5/434 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG---SRL 57
           MYN G+AGLTYWSPN+NIFRDPRWGRG ETPGEDPVLT KYAA YV+GLQ   G   ++L
Sbjct: 145 MYNVGLAGLTYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQQTDGGDPNKL 204

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYD+D W G+ RY FNA ++KQDLEDT+  PFK+CV++G VASVMCSYN+
Sbjct: 205 KVAACCKHYTAYDVDKWKGIQRYTFNAVLTKQDLEDTFQPPFKSCVIDGNVASVMCSYNK 264

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNGKPTCADPD+LK  + G+W+L+GY+VSDCDSV VLY  QHYT+TPEEAAA +I AGLD
Sbjct: 265 VNGKPTCADPDLLKGVVRGEWKLNGYMVSDCDSVEVLYKYQHYTKTPEEAAAISILAGLD 324

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           L+CG FL  +TEGAV+ GL+ +E +N A++      MRLG FDG+P  QP+GNLGP+DVC
Sbjct: 325 LNCGRFLGQYTEGAVKQGLI-DESINNAVSNNFATLMRLGFFDGDPRKQPYGNLGPKDVC 383

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           TPA+Q+LA +AA QGIV LKNS  +LPL+     ++AVIGPN++ T  MIGNY G+ C Y
Sbjct: 384 TPANQELAREAARQGIVSLKNSPASLPLNAKAIKSLAVIGPNANATRVMIGNYEGIPCKY 443

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
            +PLQG++ +  T + AGC  V C    L  A +++A   DATV+V+G   +IEAE +DR
Sbjct: 444 ISPLQGLTAFVPTSYAAGCLDVRCPNPVLDDAKKISA-SGDATVIVVGASLAIEAESLDR 502

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +LLPG+QQ LV+ VA AS+GPV+LV+M GG +DVSFAK++ +I +ILWVGYPG+AGGA
Sbjct: 503 VNILLPGQQQLLVTEVANASKGPVILVIMSGGGMDVSFAKDNNKITSILWVGYPGEAGGA 562

Query: 418 AIADVLFGRANPGT 431
           AIADV+FG  NP T
Sbjct: 563 AIADVIFGFHNPHT 576


>gi|356525896|ref|XP_003531557.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Glycine max]
          Length = 776

 Score =  571 bits (1472), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 271/432 (62%), Positives = 337/432 (78%), Gaps = 4/432 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT---GSRL 57
           MYN G+AGLTYWSPN+NIFRDPRWGRGQETPGEDP+L+ KYA  YV+GLQ       ++L
Sbjct: 156 MYNVGLAGLTYWSPNINIFRDPRWGRGQETPGEDPLLSSKYATGYVKGLQQTDDGDSNKL 215

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYDLDNW G+ RY FNA V++QD++DT+  PFK+CV++G VASVMCSYNQ
Sbjct: 216 KVAACCKHYTAYDLDNWKGIQRYTFNAVVTQQDMDDTFQPPFKSCVIDGNVASVMCSYNQ 275

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNGKPTCADPD+LK  I G+W+L+GYIVSDCDSV VL+  QHYT+TPEEAAA+ I AGLD
Sbjct: 276 VNGKPTCADPDLLKGVIRGEWKLNGYIVSDCDSVEVLFKDQHYTKTPEEAAAETILAGLD 335

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           L+CG +L  +TEGAV+ GLL E  +N A++      MRLG FDG+PS Q +GNLGP DVC
Sbjct: 336 LNCGNYLGQYTEGAVKQGLLDEASINNAVSNNFATLMRLGFFDGDPSKQTYGNLGPNDVC 395

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           T  +++LA +AA QGIVLLKNS  +LPL+     ++AVIGPN++ T  MIGNY G+ C Y
Sbjct: 396 TSENRELAREAARQGIVLLKNSLGSLPLNAKAIKSLAVIGPNANATRVMIGNYEGIPCNY 455

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
            +PLQ ++    T + AGC  V C   +L  A ++AA  ADATV+V+G   +IEAE +DR
Sbjct: 456 ISPLQALTALVPTSYAAGCPNVQCANAELDDATQIAA-SADATVIVVGASLAIEAESLDR 514

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +LLPG+QQ LVS VA AS+GPV+LV+M GG +DVSFAK++ +I +ILWVGYPG+AGGA
Sbjct: 515 INILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAKSNDKITSILWVGYPGEAGGA 574

Query: 418 AIADVLFGRANP 429
           AIADV+FG  NP
Sbjct: 575 AIADVIFGFYNP 586


>gi|292630922|sp|A5JTQ2.1|XYL1_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 1;
           AltName: Full=Xylan
           1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 1;
           Short=MsXyl1; Includes: RecName: Full=Beta-xylosidase;
           AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
           Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
           Full=Alpha-N-arabinofuranosidase; AltName:
           Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
           Flags: Precursor
 gi|146762261|gb|ABQ45227.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
           varia]
          Length = 774

 Score =  571 bits (1472), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 273/432 (63%), Positives = 338/432 (78%), Gaps = 4/432 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT---GSRL 57
           MYN G+AGLTYWSPN+NIFRDPRWGRGQETPGEDP+L+ KYAA YV+GLQ        +L
Sbjct: 154 MYNVGLAGLTYWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAGYVKGLQQTDDGDSDKL 213

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYD+DNW GV RY F+A VS+QDL+DT+  PFK+CV++G VASVMCSYN+
Sbjct: 214 KVAACCKHYTAYDVDNWKGVQRYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNK 273

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNGKPTCADPD+LK  I G+W+L+GYIVSDCDSV VLY  QHYT+TPEEAAA  I +GLD
Sbjct: 274 VNGKPTCADPDLLKGVIRGKWKLNGYIVSDCDSVEVLYKDQHYTKTPEEAAAKTILSGLD 333

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           LDCG +L  +T GAV+ GL+ E  +  A++      MRLG FDG+PS QP+GNLGP+DVC
Sbjct: 334 LDCGSYLGQYTGGAVKQGLVDEASITNAVSNNFATLMRLGFFDGDPSKQPYGNLGPKDVC 393

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           TP +Q+LA +AA QGIVLLKNS R+LPLS+    ++AVIGPN++ T  MIGNY G+ C Y
Sbjct: 394 TPENQELAREAARQGIVLLKNSPRSLPLSSKAIKSLAVIGPNANATRVMIGNYEGIPCKY 453

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           T+PLQG++ +  T +  GC  V C  N  I  A   A  ADAT++V+G + +IEAE +DR
Sbjct: 454 TSPLQGLTAFVPTSYAPGCPDVQC-ANAQIDDAAKIAASADATIIVVGANLAIEAESLDR 512

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +LLPG+QQ+LV+ VA  S+GPV+LV+M GG +DVSFAK + +I +ILWVGYPG+AGGA
Sbjct: 513 VNILLPGQQQQLVNEVANVSKGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGA 572

Query: 418 AIADVLFGRANP 429
           AIADV+FG  NP
Sbjct: 573 AIADVIFGSYNP 584


>gi|357445735|ref|XP_003593145.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
 gi|355482193|gb|AES63396.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
          Length = 775

 Score =  568 bits (1464), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 272/432 (62%), Positives = 338/432 (78%), Gaps = 4/432 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT---GSRL 57
           MYN G+AGLTYWSPN+NIFRDPRWGRGQETPGEDP+L+ KYAA YV+GLQ        +L
Sbjct: 155 MYNVGLAGLTYWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAGYVKGLQQTDDGDSDKL 214

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYD+DNW GV RY F+A VS+QDL+DT+  PFK+CV++G VASVMCSYN+
Sbjct: 215 KVAACCKHYTAYDVDNWKGVQRYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNK 274

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNGKPTCADPD+LK  I G+W+L+GYIVSDCDSV VL+  QHYT+TPEEAAA  I +GLD
Sbjct: 275 VNGKPTCADPDLLKGVIRGKWKLNGYIVSDCDSVEVLFKDQHYTKTPEEAAAKTILSGLD 334

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           LDCG +L  +T GAV+ GL+ E  +N A++      MRLG FDG+PS QP+GNLGP+DVC
Sbjct: 335 LDCGSYLGQYTGGAVKQGLVDEASINNAVSNNFATLMRLGFFDGDPSKQPYGNLGPKDVC 394

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           TP +Q+LA +AA QGIVLLKNS  +LPLS+    ++AVIGPN++ T  MIGNY G+ C Y
Sbjct: 395 TPENQELAREAARQGIVLLKNSPGSLPLSSKAIKSLAVIGPNANATRVMIGNYEGIPCKY 454

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           T+PLQG++ +  T +  GC  V C  N  I  A   A  ADAT++V+G + +IEAE +DR
Sbjct: 455 TSPLQGLTAFVPTSYAPGCPDVQC-ANAQIDDAAKIAASADATIIVVGANLAIEAESLDR 513

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +LLPG+QQ+LV+ VA  S+GPV+LV+M GG +DVSFAK + +I +ILWVGYPG+AGGA
Sbjct: 514 VNILLPGQQQQLVNEVANVSKGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGA 573

Query: 418 AIADVLFGRANP 429
           AIADV+FG  NP
Sbjct: 574 AIADVIFGSYNP 585


>gi|343172466|gb|AEL98937.1| beta-xylosidase, partial [Silene latifolia]
 gi|343172468|gb|AEL98938.1| beta-xylosidase, partial [Silene latifolia]
          Length = 374

 Score =  567 bits (1460), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 263/310 (84%), Positives = 284/310 (91%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYNGG AGLTYWSPNVNIFRDPRWGRGQETPGEDP L+ +YAASYV GLQGN G+RLKVA
Sbjct: 65  MYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLSAQYAASYVTGLQGNYGNRLKVA 124

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKHYTAYDLDNWNG+DR+HFNA+VSKQDLEDTYNVPFKACV+EGKVASVMCSYNQVNG
Sbjct: 125 ACCKHYTAYDLDNWNGMDRFHFNAKVSKQDLEDTYNVPFKACVLEGKVASVMCSYNQVNG 184

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
           KPTCADPDIL+NTI GQW L+GYIVSDCDSVGVLY+ QHYTRTPEEAAAD I AGLDLDC
Sbjct: 185 KPTCADPDILRNTIRGQWHLNGYIVSDCDSVGVLYDDQHYTRTPEEAAADTINAGLDLDC 244

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFLA+HTEGA+R GL+ E  VN ALA TITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA
Sbjct: 245 GPFLAVHTEGAIRQGLVTEAAVNQALANTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 304

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           HQ LALQAA +GIVLLKN   +LPLST+RH  +AVIGPN+  T TMIGNYAG+ACGYT+P
Sbjct: 305 HQDLALQAAREGIVLLKNQVGSLPLSTVRHRNIAVIGPNAQATTTMIGNYAGIACGYTSP 364

Query: 301 LQGISRYAKT 310
           LQGISRYA+T
Sbjct: 365 LQGISRYART 374


>gi|242071935|ref|XP_002451244.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
 gi|241937087|gb|EES10232.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
          Length = 790

 Score =  565 bits (1457), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 283/440 (64%), Positives = 331/440 (75%), Gaps = 20/440 (4%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN-TGSRLKV 59
           MYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP ++ +YAA+YVRGLQ     +RLK+
Sbjct: 147 MYNGGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPAISSRYAAAYVRGLQQPYDHNRLKL 206

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           AACCKH+TAYDLD+W G DR+HFNA VS QDLEDT+NVPF+ACV  G+ ASVMCSYNQVN
Sbjct: 207 AACCKHFTAYDLDSWGGTDRFHFNAVVSPQDLEDTFNVPFRACVAGGRAASVMCSYNQVN 266

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           G PTCAD   L+ TI   W LDGYIVSDCDSV V +  QHYTRT E+A A  ++AGLDLD
Sbjct: 267 GVPTCADQGFLRGTIRKAWGLDGYIVSDCDSVDVFFRDQHYTRTAEDAVAATLRAGLDLD 326

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CGPFLA++TE AV    + + DV+ AL  T+TVQMRLGMFDG+P++ PFG+LG  DVCT 
Sbjct: 327 CGPFLALYTENAVARKKVSDADVDAALLNTVTVQMRLGMFDGDPASGPFGHLGAADVCTK 386

Query: 240 AHQQLALQAAHQGIVLLKNS-------ARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAG 292
           AHQ LAL AA Q +VLLKN           LPL    H  VAV+GP++D TV MIGNYAG
Sbjct: 387 AHQDLALDAARQSVVLLKNQRGRKHRDRDVLPLRPAAHRVVAVVGPHADATVAMIGNYAG 446

Query: 293 VACGYTTPLQGISRY-AKTIHQAGCFGVACNG-NQLIGAAEVAARQADATVLVMGLDQSI 350
             C YTTPLQG++ Y A+ +HQAGC  VAC G NQ I AA  AAR+        GL +S 
Sbjct: 447 KPCRYTTPLQGVAAYAARVVHQAGCADVACQGKNQPIAAAVDAARRLTPPSSSPGLTRS- 505

Query: 351 EAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGY 410
                    LLLPGRQ EL+S VAKA++GPV+LVLM GGP+D++FA+NDPRI  ILWVGY
Sbjct: 506 ---------LLLPGRQAELISAVAKAAKGPVILVLMSGGPIDIAFAQNDPRIDGILWVGY 556

Query: 411 PGQAGGAAIADVLFGRANPG 430
           PGQAGG AIADV+FG+ NPG
Sbjct: 557 PGQAGGQAIADVIFGQHNPG 576


>gi|224054312|ref|XP_002298197.1| predicted protein [Populus trichocarpa]
 gi|222845455|gb|EEE83002.1| predicted protein [Populus trichocarpa]
          Length = 741

 Score =  564 bits (1454), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 278/435 (63%), Positives = 336/435 (77%), Gaps = 8/435 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ----GNTGSR 56
           MYN G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KY + YV+GLQ    GN    
Sbjct: 119 MYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSGYVKGLQQRDDGNPDG- 177

Query: 57  LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYN 116
           LKVAACCKHYTAYDLDNW GVDRYHFNA V+KQD++DT+  PFK+CVV+G VASVMCSYN
Sbjct: 178 LKVAACCKHYTAYDLDNWKGVDRYHFNAVVTKQDMDDTFQPPFKSCVVDGNVASVMCSYN 237

Query: 117 QVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAG- 175
           +VNG PTCADPD+L   I G+W+L+GYIV+DCDS+ V YN+QHYT+TPEEAAA AI AG 
Sbjct: 238 KVNGIPTCADPDLLSGVIRGEWKLNGYIVTDCDSIDVFYNSQHYTKTPEEAAAKAILAGI 297

Query: 176 -LDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPR 234
            LDL+CG FL  HTE AV  GL+ E  ++ A++      MRLG FDG+PS Q +G LGP+
Sbjct: 298 RLDLNCGSFLGKHTEAAVTAGLVNESAIDRAVSNNFATLMRLGFFDGDPSKQLYGKLGPK 357

Query: 235 DVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA 294
           DVCT  +Q+LA +AA QGIVLLKN+A +LPLS      +AVIGPN++VT TMIGNY G  
Sbjct: 358 DVCTAENQELAREAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTP 417

Query: 295 CGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEF 354
           C YTTPLQG++    T +  GC  VAC+  Q +  A+  A  ADATVLVMG D SIEAE 
Sbjct: 418 CKYTTPLQGLAALVATTYLPGCSNVACSTAQ-VDDAKKIAAAADATVLVMGADLSIEAES 476

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
            DR  +LLPG+QQ L++ VA AS GPV+LV+M GG +DVSFAK + +I +ILWVGYPG+A
Sbjct: 477 RDRVDILLPGQQQLLITAVANASTGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEA 536

Query: 415 GGAAIADVLFGRANP 429
           GGAAIAD++FG  NP
Sbjct: 537 GGAAIADIIFGSYNP 551


>gi|15242492|ref|NP_196535.1| beta-xylosidase 3 [Arabidopsis thaliana]
 gi|75264323|sp|Q9LXD6.1|BXL3_ARATH RecName: Full=Beta-D-xylosidase 3; Short=AtBXL3; AltName:
           Full=Alpha-L-arabinofuranosidase; Flags: Precursor
 gi|7671416|emb|CAB89357.1| beta-xylosidase-like protein [Arabidopsis thaliana]
 gi|9759004|dbj|BAB09531.1| beta-xylosidase [Arabidopsis thaliana]
 gi|15450735|gb|AAK96639.1| AT5g09730/F17I14_80 [Arabidopsis thaliana]
 gi|332004056|gb|AED91439.1| beta-xylosidase 3 [Arabidopsis thaliana]
          Length = 773

 Score =  561 bits (1445), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 274/432 (63%), Positives = 328/432 (75%), Gaps = 5/432 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG---SRL 57
           MYN G AGLT+WSPNVNIFRDPRWGRGQETPGEDP L+ KYA +YV+GLQ   G   +RL
Sbjct: 149 MYNVGSAGLTFWSPNVNIFRDPRWGRGQETPGEDPTLSSKYAVAYVKGLQETDGGDPNRL 208

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYD+DNW  V+R  FNA V++QDL DT+  PFK+CVV+G VASVMCSYNQ
Sbjct: 209 KVAACCKHYTAYDIDNWRNVNRLTFNAVVNQQDLADTFQPPFKSCVVDGHVASVMCSYNQ 268

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNGKPTCADPD+L   I GQW+L+GYIVSDCDSV VL+  QHY +TPEEA A ++ AGLD
Sbjct: 269 VNGKPTCADPDLLSGVIRGQWQLNGYIVSDCDSVDVLFRKQHYAKTPEEAVAKSLLAGLD 328

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           L+C  F   H  GAV+ GL+ E  ++ A++      MRLG FDG+P  Q +G LGP+DVC
Sbjct: 329 LNCDHFNGQHAMGAVKAGLVNETAIDKAISNNFATLMRLGFFDGDPKKQLYGGLGPKDVC 388

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           T  +Q+LA   A QGIVLLKNSA +LPLS     T+AVIGPN++ T TMIGNY GV C Y
Sbjct: 389 TADNQELARDGARQGIVLLKNSAGSLPLSPSAIKTLAVIGPNANATETMIGNYHGVPCKY 448

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TTPLQG++    + +Q GC  VAC    +  A ++AA  ADA VLV+G DQSIE E  DR
Sbjct: 449 TTPLQGLAETVSSTYQLGC-NVACVDADIGSAVDLAA-SADAVVLVVGADQSIEREGHDR 506

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             L LPG+QQELV+RVA A+RGPVVLV+M GG  D++FAKND +I +I+WVGYPG+AGG 
Sbjct: 507 VDLYLPGKQQELVTRVAMAARGPVVLVIMSGGGFDITFAKNDKKITSIMWVGYPGEAGGL 566

Query: 418 AIADVLFGRANP 429
           AIADV+FGR NP
Sbjct: 567 AIADVIFGRHNP 578


>gi|326492918|dbj|BAJ90315.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 775

 Score =  560 bits (1442), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 277/435 (63%), Positives = 333/435 (76%), Gaps = 7/435 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ-----GNTGS 55
           M+N G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L  KYA  YV GLQ     G T  
Sbjct: 151 MHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGAGGVTDG 210

Query: 56  RLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSY 115
            LKVAACCKHYTAYD+DNW GV+RY F+A+VS+QDL+DT+  PFK+CV++G VASVMCSY
Sbjct: 211 ALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSY 270

Query: 116 NQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAG 175
           N+VNGKPTCAD D+L+  I G W+L+GYIVSDCDSV VLY  QHYT+TPEEAAA  IK+G
Sbjct: 271 NKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIKSG 330

Query: 176 LDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD 235
           LDL+CG FLA HT  AV+ G L EEDV+ A+     + MRLG FDG+P    FG+LGP+D
Sbjct: 331 LDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKD 390

Query: 236 VCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC 295
           VCT ++++LA + A QGIVLLKNS   LPLS     ++AVIGPN++ + TMIGNY G  C
Sbjct: 391 VCTSSNRELARETARQGIVLLKNSG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPC 449

Query: 296 GYTTPLQGISRYAKTIHQAGCFGVACNGNQL-IGAAEVAARQADATVLVMGLDQSIEAEF 354
            YTTPLQG+     T++Q GC  V C+GN L +  A  AA  AD TVLV+G DQSIE E 
Sbjct: 450 KYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERES 509

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
           +DR  LLLPG+Q +LVS VA AS GPV+LV+M GGP D+SFAK   +I AILWVGYPG+A
Sbjct: 510 LDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAILWVGYPGEA 569

Query: 415 GGAAIADVLFGRANP 429
           GGAA+AD+LFG  NP
Sbjct: 570 GGAALADILFGSHNP 584


>gi|326494302|dbj|BAJ90420.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326521150|dbj|BAJ96778.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326527851|dbj|BAK08165.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 775

 Score =  560 bits (1442), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 277/435 (63%), Positives = 333/435 (76%), Gaps = 7/435 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ-----GNTGS 55
           M+N G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L  KYA  YV GLQ     G T  
Sbjct: 151 MHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGAGGVTDG 210

Query: 56  RLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSY 115
            LKVAACCKHYTAYD+DNW GV+RY F+A+VS+QDL+DT+  PFK+CV++G VASVMCSY
Sbjct: 211 ALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSY 270

Query: 116 NQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAG 175
           N+VNGKPTCAD D+L+  I G W+L+GYIVSDCDSV VLY  QHYT+TPEEAAA  IK+G
Sbjct: 271 NKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIKSG 330

Query: 176 LDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD 235
           LDL+CG FLA HT  AV+ G L EEDV+ A+     + MRLG FDG+P    FG+LGP+D
Sbjct: 331 LDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKD 390

Query: 236 VCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC 295
           VCT ++++LA + A QGIVLLKNS   LPLS     ++AVIGPN++ + TMIGNY G  C
Sbjct: 391 VCTSSNRELARETARQGIVLLKNSG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPC 449

Query: 296 GYTTPLQGISRYAKTIHQAGCFGVACNGNQL-IGAAEVAARQADATVLVMGLDQSIEAEF 354
            YTTPLQG+     T++Q GC  V C+GN L +  A  AA  AD TVLV+G DQSIE E 
Sbjct: 450 KYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERES 509

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
           +DR  LLLPG+Q +LVS VA AS GPV+LV+M GGP D+SFAK   +I AILWVGYPG+A
Sbjct: 510 LDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAILWVGYPGEA 569

Query: 415 GGAAIADVLFGRANP 429
           GGAA+AD+LFG  NP
Sbjct: 570 GGAALADILFGSHNP 584


>gi|350534908|ref|NP_001233910.1| beta-D-xylosidase 1 precursor [Solanum lycopersicum]
 gi|37359706|dbj|BAC98298.1| LEXYL1 [Solanum lycopersicum]
          Length = 770

 Score =  559 bits (1440), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 258/432 (59%), Positives = 338/432 (78%), Gaps = 4/432 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN---TGSRL 57
           MYN G+AGLTYWSPNVNIFRDPRWGRGQETPGEDP LT KY  +YV GLQ     + ++L
Sbjct: 150 MYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLTSKYGVAYVEGLQQTDDGSTNKL 209

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYD+DNW G++RY FNA V +QDL+DT+  PF++CV+EG VASVMCSYNQ
Sbjct: 210 KVAACCKHYTAYDVDNWKGIERYSFNAVVRQQDLDDTFQPPFRSCVLEGAVASVMCSYNQ 269

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNGKPTC DP++L   + G+W+L+GYIV+DCDS+ V++ +Q+YT+TPEEAAA  + +G+D
Sbjct: 270 VNGKPTCGDPNLLAGIVRGEWKLNGYIVTDCDSLQVIFKSQNYTKTPEEAAALGLNSGVD 329

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           L+CG +L+ +T+GAV   L+ E  ++ A++      MRLG FDG P ++ +GNLGP+DVC
Sbjct: 330 LNCGSWLSTYTQGAVNQKLVNESVIDRAISNNFATLMRLGFFDGNPKSRIYGNLGPKDVC 389

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           TP +Q+LA +AA QGIVLLKN+A +LPL+     ++AVIGPN++VT TMIGNY G+ C Y
Sbjct: 390 TPENQELAREAARQGIVLLKNTAGSLPLTPTAIKSLAVIGPNANVTKTMIGNYEGIPCKY 449

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TTPLQG++    TI++ GC  V+CN  Q+  A ++A   ADA VLVMG DQSIE E +DR
Sbjct: 450 TTPLQGLTASVATIYKPGCADVSCNTAQIDDAKQIAT-TADAVVLVMGSDQSIEKESLDR 508

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             + LPG+Q  LV+ VAK ++GPV+LV+M GG +DV FA ++P+I +ILWVG+PG+AGGA
Sbjct: 509 TSITLPGQQSILVAEVAKVAKGPVILVIMSGGGMDVQFAVDNPKITSILWVGFPGEAGGA 568

Query: 418 AIADVLFGRANP 429
           A+ADV+FG  NP
Sbjct: 569 ALADVIFGYYNP 580


>gi|326489197|dbj|BAK01582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 709

 Score =  559 bits (1440), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 277/435 (63%), Positives = 333/435 (76%), Gaps = 7/435 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ-----GNTGS 55
           M+N G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L  KYA  YV GLQ     G T  
Sbjct: 85  MHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGAGGVTDG 144

Query: 56  RLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSY 115
            LKVAACCKHYTAYD+DNW GV+RY F+A+VS+QDL+DT+  PFK+CV++G VASVMCSY
Sbjct: 145 ALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSY 204

Query: 116 NQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAG 175
           N+VNGKPTCAD D+L+  I G W+L+GYIVSDCDSV VLY  QHYT+TPEEAAA  IK+G
Sbjct: 205 NKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIKSG 264

Query: 176 LDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD 235
           LDL+CG FLA HT  AV+ G L EEDV+ A+     + MRLG FDG+P    FG+LGP+D
Sbjct: 265 LDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKD 324

Query: 236 VCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC 295
           VCT ++++LA + A QGIVLLKNS   LPLS     ++AVIGPN++ + TMIGNY G  C
Sbjct: 325 VCTSSNRELARETARQGIVLLKNSG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPC 383

Query: 296 GYTTPLQGISRYAKTIHQAGCFGVACNGNQL-IGAAEVAARQADATVLVMGLDQSIEAEF 354
            YTTPLQG+     T++Q GC  V C+GN L +  A  AA  AD TVLV+G DQSIE E 
Sbjct: 384 KYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERES 443

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
           +DR  LLLPG+Q +LVS VA AS GPV+LV+M GGP D+SFAK   +I AILWVGYPG+A
Sbjct: 444 LDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAILWVGYPGEA 503

Query: 415 GGAAIADVLFGRANP 429
           GGAA+AD+LFG  NP
Sbjct: 504 GGAALADILFGSHNP 518


>gi|357166259|ref|XP_003580652.1| PREDICTED: beta-D-xylosidase 4-like [Brachypodium distachyon]
          Length = 774

 Score =  558 bits (1439), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 271/433 (62%), Positives = 332/433 (76%), Gaps = 5/433 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ---GNTGSRL 57
           M+N G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L  +YA  YV GLQ    +    L
Sbjct: 152 MHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASRYAVGYVSGLQDAGADADGPL 211

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYD+DNW GV+RY F+A+VS+QDL+DT+  PFK+CV++GKVASVMCSYN+
Sbjct: 212 KVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVIDGKVASVMCSYNK 271

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNGKPTCAD D+L   I G W+L+GYIVSDCDSV VLY+ QHYT+TPEEAAA  IK+GLD
Sbjct: 272 VNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYSQQHYTKTPEEAAAITIKSGLD 331

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           L+CG FLA HT  AV+ G L E DV+ A+     + MRLG FDG+P    +G+LGP+DVC
Sbjct: 332 LNCGDFLAKHTVAAVQAGNLSESDVDRAITNNFIMLMRLGFFDGDPRKLAYGSLGPKDVC 391

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           T ++Q+LA + A QGIVLLKN    LPLS     ++AVIGPN++ + TMIGNY G  C Y
Sbjct: 392 TSSNQELARETARQGIVLLKNDG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKY 450

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQL-IGAAEVAARQADATVLVMGLDQSIEAEFID 356
           TTPL G+     T++Q GC  V C+GN L + AA  AA  AD TVLV+G DQSIE E +D
Sbjct: 451 TTPLHGLGNNVATVYQPGCSNVGCSGNSLQLSAATAAAASADVTVLVVGADQSIEREALD 510

Query: 357 RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
           R  LLLPG+Q +L+S VA AS+G V+LV+M GGP D+SFAK   +I AILWVGYPG+AGG
Sbjct: 511 RTSLLLPGQQPDLISAVANASKGHVILVVMSGGPFDISFAKASDKISAILWVGYPGEAGG 570

Query: 417 AAIADVLFGRANP 429
           AAIAD++FG+ NP
Sbjct: 571 AAIADIIFGKYNP 583


>gi|9294427|dbj|BAB02547.1| beta-1,4-xylosidase [Arabidopsis thaliana]
          Length = 876

 Score =  558 bits (1439), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 265/435 (60%), Positives = 326/435 (74%), Gaps = 7/435 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ----GNTGSR 56
           M+N G+AGLTYWSPNVN+FRDPRWGRGQETPGEDP++  KYA +YV+GLQ         R
Sbjct: 141 MHNVGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSKYAVNYVKGLQDVHDAGKSRR 200

Query: 57  LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYN 116
           LKV++CCKHYTAYDLDNW G+DR+HF+A+V+KQDLEDTY  PFK+CV EG V+SVMCSYN
Sbjct: 201 LKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQTPFKSCVEEGDVSSVMCSYN 260

Query: 117 QVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGL 176
           +VNG PTCADP++L+  I GQWRLDGYIVSDCDS+ V +N  HYT+T E+A A A+KAGL
Sbjct: 261 RVNGIPTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFNDIHYTKTREDAVALALKAGL 320

Query: 177 DLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDV 236
           +++CG FL  +TE AV+   L   DV+ AL Y   V MRLG FDG+P + PFGNLGP DV
Sbjct: 321 NMNCGDFLGKYTENAVKLKKLNGSDVDEALIYNYIVLMRLGFFDGDPKSLPFGNLGPSDV 380

Query: 237 CTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
           C+  HQ LAL+AA QGIVLL+N    LPL       +AVIGPN++ T  MI NYAGV C 
Sbjct: 381 CSKDHQMLALEAAKQGIVLLENRG-DLPLPKTTVKKLAVIGPNANATKVMISNYAGVPCK 439

Query: 297 YTTPLQGISRYA--KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEF 354
           YT+P+QG+ +Y   K +++ GC  V C    LI AA  A  +AD TVLV+GLDQ++EAE 
Sbjct: 440 YTSPIQGLQKYVPEKIVYEPGCKDVKCGDQTLISAAVKAVSEADVTVLVVGLDQTVEAEG 499

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
           +DR  L LPG Q++LV  VA A++  VVLV+M  GP+D+SFAKN   I A+LWVGYPG+A
Sbjct: 500 LDRVNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKNLSTIRAVLWVGYPGEA 559

Query: 415 GGAAIADVLFGRANP 429
           GG AIA V+FG  NP
Sbjct: 560 GGDAIAQVIFGDYNP 574


>gi|15230897|ref|NP_188596.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
 gi|259585724|sp|Q9LJN4.2|BXL5_ARATH RecName: Full=Probable beta-D-xylosidase 5; Short=AtBXL5; Flags:
           Precursor
 gi|332642747|gb|AEE76268.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
          Length = 781

 Score =  557 bits (1436), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 265/435 (60%), Positives = 326/435 (74%), Gaps = 7/435 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ----GNTGSR 56
           M+N G+AGLTYWSPNVN+FRDPRWGRGQETPGEDP++  KYA +YV+GLQ         R
Sbjct: 141 MHNVGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSKYAVNYVKGLQDVHDAGKSRR 200

Query: 57  LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYN 116
           LKV++CCKHYTAYDLDNW G+DR+HF+A+V+KQDLEDTY  PFK+CV EG V+SVMCSYN
Sbjct: 201 LKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQTPFKSCVEEGDVSSVMCSYN 260

Query: 117 QVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGL 176
           +VNG PTCADP++L+  I GQWRLDGYIVSDCDS+ V +N  HYT+T E+A A A+KAGL
Sbjct: 261 RVNGIPTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFNDIHYTKTREDAVALALKAGL 320

Query: 177 DLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDV 236
           +++CG FL  +TE AV+   L   DV+ AL Y   V MRLG FDG+P + PFGNLGP DV
Sbjct: 321 NMNCGDFLGKYTENAVKLKKLNGSDVDEALIYNYIVLMRLGFFDGDPKSLPFGNLGPSDV 380

Query: 237 CTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
           C+  HQ LAL+AA QGIVLL+N    LPL       +AVIGPN++ T  MI NYAGV C 
Sbjct: 381 CSKDHQMLALEAAKQGIVLLENRG-DLPLPKTTVKKLAVIGPNANATKVMISNYAGVPCK 439

Query: 297 YTTPLQGISRYA--KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEF 354
           YT+P+QG+ +Y   K +++ GC  V C    LI AA  A  +AD TVLV+GLDQ++EAE 
Sbjct: 440 YTSPIQGLQKYVPEKIVYEPGCKDVKCGDQTLISAAVKAVSEADVTVLVVGLDQTVEAEG 499

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
           +DR  L LPG Q++LV  VA A++  VVLV+M  GP+D+SFAKN   I A+LWVGYPG+A
Sbjct: 500 LDRVNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKNLSTIRAVLWVGYPGEA 559

Query: 415 GGAAIADVLFGRANP 429
           GG AIA V+FG  NP
Sbjct: 560 GGDAIAQVIFGDYNP 574


>gi|18025340|gb|AAK38481.1| alpha-L-arabinofuranosidase/beta-D-xylosidase isoenzyme ARA-I
           [Hordeum vulgare]
          Length = 777

 Score =  553 bits (1425), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 274/435 (62%), Positives = 330/435 (75%), Gaps = 7/435 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ-----GNTGS 55
           M+N G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L  KYA  YV GLQ     G T  
Sbjct: 153 MHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGAGGVTDG 212

Query: 56  RLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSY 115
            LKVAACCKHYTAYD+DNW GV+RY F+A+VS+QDL+DT+  PFK+CV++G VASVMCSY
Sbjct: 213 ALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSY 272

Query: 116 NQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAG 175
           N+VNGKPTCAD D+L+  I G W+L+GYIVSDCDSV VLY  QHYT+TPEEAAA  IK+G
Sbjct: 273 NKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIKSG 332

Query: 176 LDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD 235
           +DL+CG FLA HT  AV+ G L EEDV+ A+     + MRLG FDG+P    FG+LGP+D
Sbjct: 333 VDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKD 392

Query: 236 VCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC 295
           VCT ++++LA + A QGIVLLKNS   LPLS     ++AVIGPN++ + TMIGNY G  C
Sbjct: 393 VCTSSNRELARETARQGIVLLKNSG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPC 451

Query: 296 GYTTPLQGISRYAKTIHQAGCFGVACNGNQL-IGAAEVAARQADATVLVMGLDQSIEAEF 354
            YTTPLQG+     T++Q GC  V C+GN L +  A  AA  AD TVLV+G DQSIE E 
Sbjct: 452 KYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERES 511

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
           +DR  LLLPG+Q +LVS VA AS GPV+LV+M GGP D+SFAK   +I A LWVGYPG+A
Sbjct: 512 LDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAATLWVGYPGEA 571

Query: 415 GGAAIADVLFGRANP 429
           GGAA+ D LFG  NP
Sbjct: 572 GGAALDDTLFGSHNP 586


>gi|224070626|ref|XP_002303181.1| predicted protein [Populus trichocarpa]
 gi|222840613|gb|EEE78160.1| predicted protein [Populus trichocarpa]
          Length = 773

 Score =  551 bits (1421), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 272/432 (62%), Positives = 335/432 (77%), Gaps = 4/432 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG---SRL 57
           MYN G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KY + YV+GLQ        +L
Sbjct: 153 MYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSCYVKGLQQRDDGDPDKL 212

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYDLDNW G DRYHFNA V+KQD++DT+  PFK+CV++G VASVMCSYNQ
Sbjct: 213 KVAACCKHYTAYDLDNWKGSDRYHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQ 272

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNGKPTCADPD+L   I G+W L+GYIV+DCDS+ V Y +Q+YT+TPEEAAA AI AG+D
Sbjct: 273 VNGKPTCADPDLLSGVIRGEWNLNGYIVTDCDSLDVFYKSQNYTKTPEEAAAAAILAGVD 332

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           L+CG FL  HTE AV+GGL+ E  +++A++      MRLG FDG+PS Q +G LGP+DVC
Sbjct: 333 LNCGSFLGQHTEAAVKGGLVNEHAIDIAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVC 392

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           T  +Q+LA +AA QGIVLLKN+A +LPLS      +AVIGPN++VT TMIGNY G  C Y
Sbjct: 393 TAENQELAREAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKY 452

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TTPLQG++    T +  GC  VAC+  Q +  A+  A  ADATVLVMG D SIEAE  DR
Sbjct: 453 TTPLQGLAASVATTYLPGCSNVACSTAQ-VDDAKKLAAAADATVLVMGADLSIEAESRDR 511

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +LLPG+QQ L++ VA  S GPV+LV+M GG +DVSFA+ + +I +ILWVGYPG+AGGA
Sbjct: 512 VDVLLPGQQQLLITAVANVSCGPVILVIMSGGGMDVSFARTNDKITSILWVGYPGEAGGA 571

Query: 418 AIADVLFGRANP 429
           AIAD++FG  NP
Sbjct: 572 AIADIIFGYYNP 583


>gi|297811069|ref|XP_002873418.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
 gi|297319255|gb|EFH49677.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
          Length = 780

 Score =  545 bits (1404), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 273/433 (63%), Positives = 327/433 (75%), Gaps = 6/433 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG---SRL 57
           MYN G AGLT+WSPNVNIFRDPRWGRGQETPGEDP L+ KYA +YVRGLQ   G   +RL
Sbjct: 154 MYNVGSAGLTFWSPNVNIFRDPRWGRGQETPGEDPELSSKYAVAYVRGLQETDGGDPNRL 213

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYD+DNW  V R+ FNA V++QD+ DT+  PFK+CVV+G VASVMCSYNQ
Sbjct: 214 KVAACCKHYTAYDVDNWKDVHRFTFNAVVNQQDMADTFQPPFKSCVVDGNVASVMCSYNQ 273

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNGKPTCADPD+L   I GQW+L+GYIVSDCDSV VLY  QHYT+TPEEA A +I AGLD
Sbjct: 274 VNGKPTCADPDLLSGVIRGQWKLNGYIVSDCDSVDVLYTKQHYTKTPEEAVAKSILAGLD 333

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQP-FGNLGPRDV 236
           L+C  F   +   AV+ GL+ E  ++ A++      MRLG FDG+P  Q  +G LGP DV
Sbjct: 334 LNCDHFTGQYAMKAVKVGLVNETAIDKAISNNFATLMRLGFFDGDPKKQQLYGGLGPNDV 393

Query: 237 CTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
           CT  +Q+LA  AA QGIVLLKNSA +LPLS     T+AVIGPN++ T TMIGNY G+ C 
Sbjct: 394 CTANNQELARDAARQGIVLLKNSAGSLPLSPSAIKTLAVIGPNANATETMIGNYNGIPCK 453

Query: 297 YTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFID 356
           YTTPLQG++    + +Q GC  VAC     +G+A   A  ADA VLVMG DQSIE E +D
Sbjct: 454 YTTPLQGLAETVSSTYQLGC-NVAC-AEPDLGSAAALAASADAVVLVMGADQSIEQENLD 511

Query: 357 RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
           R  L LPG+QQELV++VAK ++GPVVLV+M GG  D++FAKN+ +I  I+WVGYPG+AGG
Sbjct: 512 RLDLYLPGKQQELVTQVAKVAKGPVVLVIMSGGAFDITFAKNEEKITGIMWVGYPGEAGG 571

Query: 417 AAIADVLFGRANP 429
            AIADV+FGR NP
Sbjct: 572 LAIADVIFGRHNP 584


>gi|302796585|ref|XP_002980054.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
 gi|300152281|gb|EFJ18924.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
          Length = 779

 Score =  544 bits (1401), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 264/448 (58%), Positives = 322/448 (71%), Gaps = 18/448 (4%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT--GSRLK 58
           M+N   AGLTYWSPNVNI+RDPRWGRGQETPGEDP+L+ KYA  YVRGLQ     G +LK
Sbjct: 136 MHNYQRAGLTYWSPNVNIYRDPRWGRGQETPGEDPLLSSKYATFYVRGLQDTNLGGDKLK 195

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           V+ACCKH TAYD+DNW G  R+ FNA V++QDL DTYN PF++CV + KV+SVMCSYN+V
Sbjct: 196 VSACCKHMTAYDVDNWKGTTRFKFNAIVTQQDLSDTYNPPFQSCVEDAKVSSVMCSYNRV 255

Query: 119 NGKPTCADPDILKNTIHGQWRLDG----------------YIVSDCDSVGVLYNTQHYTR 162
           NG PTCAD ++L  T+   W L+G                YIVSDCDS+   ++  +Y +
Sbjct: 256 NGVPTCADYNLLSATVRSSWNLNGSILLTCEVLLLYLPCSYIVSDCDSLQTFFDNTNYAK 315

Query: 163 TPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGE 222
           T E+  ADA+ AGL+LDCGPFLAIHT+ A+  G + E +VN AL Y   VQMRLG++DG 
Sbjct: 316 TAEDVVADALLAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMRLGLYDGN 375

Query: 223 PSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDV 282
           P +QP+GNLGP+ VCT  +QQLAL AA +GIVLLKN+   LP S     TVA IGP++  
Sbjct: 376 PRSQPYGNLGPQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAAIGPHAKA 435

Query: 283 TVTMIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVL 342
           T  MIGNY G+ C YTTP  G+S YA+ ++ AGC  VAC  N LIG+A   A QADA VL
Sbjct: 436 TRAMIGNYQGIPCKYTTPHDGLSAYARVVYSAGCSDVACYSNSLIGSAASTASQADAVVL 495

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
            +GLD + EAE  DR  LLLPG+QQELV+ V KA++GPVVLV+  GG VDVSFAK D ++
Sbjct: 496 FVGLDLNQEAEGKDRTSLLLPGKQQELVTEVTKAAKGPVVLVIFSGGSVDVSFAKYDKKV 555

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
             +LW GYPG+AGGAAIA VLFG  NPG
Sbjct: 556 QGMLWAGYPGEAGGAAIAQVLFGDHNPG 583


>gi|225428983|ref|XP_002264114.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
          Length = 818

 Score =  544 bits (1401), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 267/437 (61%), Positives = 324/437 (74%), Gaps = 8/437 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ--GNTGS--- 55
           MYN G AGLTYWSPNVNIFRDPRWGRGQETPGEDP++  +YA +YVRGLQ  G  G+   
Sbjct: 165 MYNVGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEVGKEGNFAA 224

Query: 56  -RLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCS 114
            RLKV++CCKHYTAYD+D W GVDR+HF+A+V+ QDLEDTY  PFK+CV EG V+SVMCS
Sbjct: 225 DRLKVSSCCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKSCVEEGHVSSVMCS 284

Query: 115 YNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKA 174
           YN+VNG PTCA+P++LK  I  QW LDGYIVSDCDS+ V +   +YT TPE+A A A+KA
Sbjct: 285 YNRVNGVPTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNYTETPEDAVALALKA 344

Query: 175 GLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPR 234
           GL+L+CG +L  +T+ AV  G ++E  VN AL Y   V MRLG FDG+P+  PFG +GP 
Sbjct: 345 GLNLNCGSYLGDYTKNAVNLGKVKESIVNQALIYNYIVLMRLGFFDGDPTMLPFGKMGPS 404

Query: 235 DVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA 294
           DVCT  HQ LAL AA QGIVLL N+   LPLS     T+AVIGPN+D T TM+ NYAGV 
Sbjct: 405 DVCTVDHQLLALDAAKQGIVLLHNNG-ALPLSPNTTKTLAVIGPNADATNTMLSNYAGVP 463

Query: 295 CGYTTPLQGISRYAKTI-HQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
           C YT+PLQG+ +Y   + ++ GC  V+C+   LI  A   A  ADATV+V+GLD  IEAE
Sbjct: 464 CRYTSPLQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVVGLDLFIEAE 523

Query: 354 FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 413
            +DR  L LPG Q++LV   AKA+ G V+LV+M  GPVD+SF KN  +IG ILWVGYPGQ
Sbjct: 524 DLDRVNLTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGGILWVGYPGQ 583

Query: 414 AGGAAIADVLFGRANPG 430
           AGG AI+ V+FG  NPG
Sbjct: 584 AGGDAISQVIFGDYNPG 600


>gi|296083056|emb|CBI22460.3| unnamed protein product [Vitis vinifera]
          Length = 896

 Score =  543 bits (1399), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 267/437 (61%), Positives = 324/437 (74%), Gaps = 8/437 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ--GNTGS--- 55
           MYN G AGLTYWSPNVNIFRDPRWGRGQETPGEDP++  +YA +YVRGLQ  G  G+   
Sbjct: 165 MYNVGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEVGKEGNFAA 224

Query: 56  -RLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCS 114
            RLKV++CCKHYTAYD+D W GVDR+HF+A+V+ QDLEDTY  PFK+CV EG V+SVMCS
Sbjct: 225 DRLKVSSCCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKSCVEEGHVSSVMCS 284

Query: 115 YNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKA 174
           YN+VNG PTCA+P++LK  I  QW LDGYIVSDCDS+ V +   +YT TPE+A A A+KA
Sbjct: 285 YNRVNGVPTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNYTETPEDAVALALKA 344

Query: 175 GLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPR 234
           GL+L+CG +L  +T+ AV  G ++E  VN AL Y   V MRLG FDG+P+  PFG +GP 
Sbjct: 345 GLNLNCGSYLGDYTKNAVNLGKVKESIVNQALIYNYIVLMRLGFFDGDPTMLPFGKMGPS 404

Query: 235 DVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA 294
           DVCT  HQ LAL AA QGIVLL N+   LPLS     T+AVIGPN+D T TM+ NYAGV 
Sbjct: 405 DVCTVDHQLLALDAAKQGIVLLHNNG-ALPLSPNTTKTLAVIGPNADATNTMLSNYAGVP 463

Query: 295 CGYTTPLQGISRYAKTI-HQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
           C YT+PLQG+ +Y   + ++ GC  V+C+   LI  A   A  ADATV+V+GLD  IEAE
Sbjct: 464 CRYTSPLQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVVGLDLFIEAE 523

Query: 354 FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 413
            +DR  L LPG Q++LV   AKA+ G V+LV+M  GPVD+SF KN  +IG ILWVGYPGQ
Sbjct: 524 DLDRVNLTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGGILWVGYPGQ 583

Query: 414 AGGAAIADVLFGRANPG 430
           AGG AI+ V+FG  NPG
Sbjct: 584 AGGDAISQVIFGDYNPG 600


>gi|147844622|emb|CAN82161.1| hypothetical protein VITISV_035506 [Vitis vinifera]
          Length = 925

 Score =  542 bits (1396), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 266/437 (60%), Positives = 323/437 (73%), Gaps = 8/437 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ--GNTGS--- 55
           MYN G AGLTYWSPNVNIFRDPRWGRGQETPGEDP++  +YA +YVRGLQ  G  G+   
Sbjct: 141 MYNVGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEVGKEGNFAA 200

Query: 56  -RLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCS 114
            RLKV++CCKHYTAYD+D W GVDR+HF+A+V+ QDLEDTY  PFK CV EG V+SVMCS
Sbjct: 201 DRLKVSSCCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKXCVEEGHVSSVMCS 260

Query: 115 YNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKA 174
           YN+VNG PTCA+P++LK  I  QW LDGYIVSDCDS+ V +   +YT TPE+A A A+KA
Sbjct: 261 YNRVNGVPTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNYTETPEDAVALALKA 320

Query: 175 GLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPR 234
           GL+L+CG +L  +T+ AV  G ++E  VB AL Y   V MRLG FDG+P+  PFG +GP 
Sbjct: 321 GLNLNCGSYLGDYTKNAVNLGKVKESIVBQALIYNYIVLMRLGFFDGDPTMLPFGKMGPS 380

Query: 235 DVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA 294
           DVCT  HQ LAL AA QGIVLL N+   LPLS     T+AVIGPN+D T TM+ NYAGV 
Sbjct: 381 DVCTVDHQLLALDAAKQGIVLLHNNG-ALPLSPNTTKTLAVIGPNADATNTMLSNYAGVP 439

Query: 295 CGYTTPLQGISRYAKTI-HQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
           C YT+PLQG+ +Y   + ++ GC  V+C+   LI  A   A  ADATV+V+GLD  IEAE
Sbjct: 440 CRYTSPLQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVVGLDLFIEAE 499

Query: 354 FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 413
            +DR  L LPG Q++LV   AKA+ G V+LV+M  GPVD+SF KN  +IG ILWVGYPGQ
Sbjct: 500 DLDRVNLTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGGILWVGYPGQ 559

Query: 414 AGGAAIADVLFGRANPG 430
           AGG AI+ V+FG  NPG
Sbjct: 560 AGGDAISQVIFGDYNPG 576


>gi|222629651|gb|EEE61783.1| hypothetical protein OsJ_16354 [Oryza sativa Japonica Group]
          Length = 771

 Score =  540 bits (1390), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 264/413 (63%), Positives = 317/413 (76%), Gaps = 4/413 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ--GNTGSRLK 58
           M+N G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L  KYA  YV GLQ  G     LK
Sbjct: 153 MHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGGGSDALK 212

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           VAACCKHYTAYD+DNW GV+RY F+A VS+QDL+DT+  PFK+CV++G VASVMCSYN+V
Sbjct: 213 VAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKV 272

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           NGKPTCAD D+L   I G W+L+GYIVSDCDSV VLYN QHYT+ PE+AAA  IK+GLDL
Sbjct: 273 NGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKNPEDAAAITIKSGLDL 332

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG FLA HT  AV+ G L E DV+ A+     V MRLG FDG+P   PFG+LGP+DVCT
Sbjct: 333 NCGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPRKLPFGSLGPKDVCT 392

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
            ++Q+LA +AA QGIVLLKN+   LPLS     ++AVIGPN++ + TMIGNY G  C YT
Sbjct: 393 SSNQELAREAARQGIVLLKNTG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYT 451

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQL-IGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TPLQG+     T++Q GC  V C+GN L + AA  AA  AD TVLV+G DQS+E E +DR
Sbjct: 452 TPLQGLGANVATVYQPGCTNVGCSGNSLQLSAATQAAASADVTVLVVGADQSVERESLDR 511

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGY 410
             LLLPG+Q +LVS VA ASRGPV+LV+M GGP D+SFAK+  +I AILWVGY
Sbjct: 512 TSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSDKISAILWVGY 564


>gi|302811516|ref|XP_002987447.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
 gi|300144853|gb|EFJ11534.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
          Length = 779

 Score =  538 bits (1387), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 262/448 (58%), Positives = 321/448 (71%), Gaps = 18/448 (4%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT--GSRLK 58
           M+N   AGLTYWSPNVNI+RDPRWGRGQETPGEDP+L+ KYA  YVRGLQ     G +LK
Sbjct: 136 MHNYQRAGLTYWSPNVNIYRDPRWGRGQETPGEDPLLSSKYATFYVRGLQDTNLGGDKLK 195

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           V+ACCKH TAYD+DNW G  R+ FNA V++QDL DTYN PF++CV + KV+SVMCSYN+V
Sbjct: 196 VSACCKHMTAYDVDNWKGTTRFKFNAIVTQQDLSDTYNPPFQSCVEDAKVSSVMCSYNRV 255

Query: 119 NGKPTCADPDILKNTIHGQWRLDG----------------YIVSDCDSVGVLYNTQHYTR 162
           NG PTCAD ++L  T+   W L+G                YIVSDCDS+   ++  +Y +
Sbjct: 256 NGVPTCADYNLLSATVRSSWNLNGSILLTCEVLLLYLPCSYIVSDCDSLQTFFDNTNYAK 315

Query: 163 TPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGE 222
           T E+  ADA+ AGL+LDCGPFLAIHT+ A+  G + E +VN AL Y   VQMRLG++DG 
Sbjct: 316 TAEDVVADALLAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMRLGLYDGN 375

Query: 223 PSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDV 282
           P +QP+GNLGP+ VCT  +QQLAL AA +GIVLLKN+   LP S     TVA IGP++  
Sbjct: 376 PRSQPYGNLGPQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAAIGPHAKA 435

Query: 283 TVTMIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVL 342
           T  MIGNY G+ C YTTP  G+S YA+ ++ AGC  VAC  + LIG+A   A QADA VL
Sbjct: 436 TRAMIGNYQGIPCKYTTPHDGLSAYARVVYSAGCSDVACYSDSLIGSAVSTASQADAVVL 495

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
            +GLD + EAE  DR  LLLPG+QQELV+ V KA++GP VLV+  GG VDVSFAK + ++
Sbjct: 496 FVGLDLNQEAEGKDRTSLLLPGKQQELVTEVTKAAKGPAVLVIFSGGSVDVSFAKYNNKV 555

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
             ILW GYPG+AGGAAIA VLFG  NPG
Sbjct: 556 QGILWAGYPGEAGGAAIAQVLFGDHNPG 583


>gi|255573163|ref|XP_002527511.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223533151|gb|EEF34909.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 810

 Score =  536 bits (1381), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 264/436 (60%), Positives = 325/436 (74%), Gaps = 8/436 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ------GNTG 54
           M++ G+AGLTYWSPNVN+FRDPRWGRGQETPGEDP++  +YA +YVRGLQ       +T 
Sbjct: 139 MHSVGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEVGDEGNSTA 198

Query: 55  SRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCS 114
            +LKV++CCKHYTAYDLD W GVDR+HF+A+V+KQDLEDTY  PF++CV E  V+SVMCS
Sbjct: 199 DKLKVSSCCKHYTAYDLDKWKGVDRFHFDAKVTKQDLEDTYQPPFRSCVEEAHVSSVMCS 258

Query: 115 YNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKA 174
           YN+VNG PTCADPD+LK  I G+W LDGYIVSDCDS+ V Y++ +YT TPE+A A A+KA
Sbjct: 259 YNRVNGIPTCADPDLLKGIIRGEWNLDGYIVSDCDSIEVYYDSINYTATPEDAVALALKA 318

Query: 175 GLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPR 234
           GL+++CG FL  +T  AV+   + E  V+ AL Y   V MRLG FDG+P +  FGNLGP 
Sbjct: 319 GLNMNCGEFLGKYTVDAVKLNKVEESVVDQALIYNFIVLMRLGFFDGDPKSLLFGNLGPS 378

Query: 235 DVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA 294
           DVC+  HQ+LAL AA QGIVLL N    LPLS      +AVIGPN++VT TMI NYAG+ 
Sbjct: 379 DVCSDGHQKLALDAARQGIVLLYNKG-ALPLSKNNTRNLAVIGPNANVTTTMISNYAGIP 437

Query: 295 CGYTTPLQGISRYAKTI-HQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
           C YTTPLQG+ +Y  T+ + AGC  V+C+ + LI AA  AA  ADA VL++GLDQSIE E
Sbjct: 438 CKYTTPLQGLQKYVSTVTYAAGCKSVSCSDDTLIDAATQAAAAADAVVLLVGLDQSIERE 497

Query: 354 FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 413
            +DR  L LPG Q++LV  V  A+ G VVLV+M   P+DVSFA N  +I  ILWVGYPGQ
Sbjct: 498 GLDRENLTLPGFQEKLVVDVVNATNGTVVLVVMSSSPIDVSFAVNKSKIKGILWVGYPGQ 557

Query: 414 AGGAAIADVLFGRANP 429
           AGG A+A V+FG  NP
Sbjct: 558 AGGDAVAQVMFGDYNP 573


>gi|297834874|ref|XP_002885319.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
 gi|297331159|gb|EFH61578.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
          Length = 865

 Score =  536 bits (1380), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 258/435 (59%), Positives = 317/435 (72%), Gaps = 18/435 (4%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ----GNTGSR 56
           M+N G+AGLTYWSPNVNIFRDPRWGRGQETPGEDP++  KYA +YV+GLQ         R
Sbjct: 141 MHNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLVVSKYAVNYVKGLQDVQDAGKSRR 200

Query: 57  LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYN 116
           LKV++CCKHYTAYDLDNW G+DR+HF+A+V+KQDLEDTY  PFK+CV EG V+SVMCSYN
Sbjct: 201 LKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQPPFKSCVEEGDVSSVMCSYN 260

Query: 117 QVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGL 176
           +VNG PTCADP++L+  I GQWRLDGYIVSDCDS+ V ++  HYT+T            L
Sbjct: 261 RVNGIPTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFDDIHYTKT-----------RL 309

Query: 177 DLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDV 236
           +++CG FL  +TE AV+   L   +V+ AL Y   V MRLG FDG+P + PFG LGP DV
Sbjct: 310 NMNCGDFLGKYTENAVKLKKLNGSEVDEALIYNYIVLMRLGFFDGDPKSLPFGQLGPSDV 369

Query: 237 CTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
           C+  HQ LAL+AA QGIVLL+N    LPLS      +AVIGPN++ T  MI NYAGV C 
Sbjct: 370 CSKDHQMLALEAAKQGIVLLENRG-DLPLSKTAVKKIAVIGPNANATKVMISNYAGVPCK 428

Query: 297 YTTPLQGISRYA--KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEF 354
           YT+PLQG+ +Y   K +++ GC  V C    LI AA  A  +AD TVLV+GLDQ++EAE 
Sbjct: 429 YTSPLQGLQKYVPEKVVYEPGCKDVNCGEQTLISAAVKAVSEADVTVLVVGLDQTVEAEG 488

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
           +DR  L LPG Q++LV  VA A++  VVLV+M  GP+D+SFAKN   I A+LWVGYPG+A
Sbjct: 489 LDRVNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKNLSTISAVLWVGYPGEA 548

Query: 415 GGAAIADVLFGRANP 429
           GG AIA V+FG  NP
Sbjct: 549 GGDAIAQVIFGDYNP 563


>gi|357444469|ref|XP_003592512.1| Xylosidase [Medicago truncatula]
 gi|355481560|gb|AES62763.1| Xylosidase [Medicago truncatula]
          Length = 781

 Score =  532 bits (1371), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 266/440 (60%), Positives = 329/440 (74%), Gaps = 14/440 (3%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ------GNTG 54
           MYN  +AGLT+WSPNVN+FRDPRWGRGQETPGEDP++  +YA +YVRGLQ         G
Sbjct: 138 MYNVDLAGLTFWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEVGDEASAKG 197

Query: 55  SRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCS 114
            RLKV++CCKHYTAYD+DNW GVDR+HF+A+V+KQDLEDTY  PFK+CV+EG V+SVMCS
Sbjct: 198 DRLKVSSCCKHYTAYDVDNWKGVDRFHFDAKVTKQDLEDTYQPPFKSCVLEGHVSSVMCS 257

Query: 115 YNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKA 174
           YN+VNG PTCADPD+L+  I GQW LDGYIVSDCDSV V YN+ HYT+TPE+A A A+KA
Sbjct: 258 YNRVNGIPTCADPDLLQGVIRGQWGLDGYIVSDCDSVEVYYNSIHYTKTPEDAVALALKA 317

Query: 175 GLDLDCGPFLAIHTEGAVRGGLLREEDVNL---ALAYTITVQMRLGMFDGEPSAQPFGNL 231
           GL+++CG FL  +T  AV    L++ DV++   AL Y   V MRLG F+  P + PF NL
Sbjct: 318 GLNMNCGDFLKKYTANAVN---LKKVDVSIVDQALVYNYIVLMRLGFFE-NPKSLPFANL 373

Query: 232 GPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA 291
           GP DVCT  +QQLAL+AA QGIVLL+N+   LPLS  +   +AVIGPN++ T  MI NYA
Sbjct: 374 GPSDVCTKENQQLALEAAKQGIVLLENNKGALPLSKTKIKNLAVIGPNANATTVMISNYA 433

Query: 292 GVACGYTTPLQGISRYAKTIHQA-GCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSI 350
           G+ C Y++PLQG+ +Y  ++  A GC  V C+   L  AA  AA  ADA VLV+GLDQSI
Sbjct: 434 GIPCRYSSPLQGLQKYISSVTYARGCSDVKCSNQNLFAAAVKAAASADAVVLVVGLDQSI 493

Query: 351 EAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGY 410
           EAE +DR  L LPG Q++LV  VA A++G ++LV+M  GP+D+SF K+   IG ILWVGY
Sbjct: 494 EAEGLDRVNLTLPGFQEKLVKDVAAATKGTLILVIMAAGPIDISFTKSVSNIGGILWVGY 553

Query: 411 PGQAGGAAIADVLFGRANPG 430
           PGQ GG AIA V+FG  NPG
Sbjct: 554 PGQDGGNAIAQVIFGDYNPG 573


>gi|85813770|emb|CAJ65921.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
          Length = 704

 Score =  531 bits (1367), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 269/449 (59%), Positives = 335/449 (74%), Gaps = 19/449 (4%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG---SRL 57
           MYN G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KY + YV+GLQ        +L
Sbjct: 159 MYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSCYVKGLQQRDDGDPDKL 218

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARV-SKQDLEDTYNVPFKACVVEGKVASVMCSYN 116
           KVAACCKHYTAYDLDNW G DRYHFNA V +KQD++DT+  PFK+CV++G VASVMCSYN
Sbjct: 219 KVAACCKHYTAYDLDNWKGSDRYHFNAVVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYN 278

Query: 117 QVNGKPTCADPDILKNTIHGQWRLDGY-------IVSDCDSVGVLYNTQHYTRTPEEA-- 167
           QVNGKPTCADPD+L   I G+W L+GY       IV+DCDS+ V Y +Q+YT+TPEEA  
Sbjct: 279 QVNGKPTCADPDLLSGVIRGEWNLNGYQWGCCRYIVTDCDSLDVFYKSQNYTKTPEEAAA 338

Query: 168 ----AADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEP 223
               A +++  G+DL+CG FL  HTE AV+GGL+ E  +++A++      MRLG FDG+P
Sbjct: 339 AAILAGNSLVTGVDLNCGSFLGQHTEAAVKGGLVNEHAIDIAVSNNFATLMRLGFFDGDP 398

Query: 224 SAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVT 283
           S Q +G LGP+DVCT  +Q+LA +AA QGIVLLKN+A +LPLS      +AVIGPN++VT
Sbjct: 399 SKQLYGKLGPKDVCTAENQELAREAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVT 458

Query: 284 VTMIGNY-AGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVL 342
            TMIGNY  G  C YTTPLQG++    T +  GC  VAC+  Q +  A+  A  ADATVL
Sbjct: 459 KTMIGNYEGGTPCKYTTPLQGLAASVATTYLPGCSNVACSTAQ-VDDAKKLAAAADATVL 517

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           VMG D SIEAE  DR  +LLPG+QQ L++ VA  S GPV+LV+M GG +DVSFA+ + +I
Sbjct: 518 VMGADLSIEAESRDRVDVLLPGQQQLLITAVANVSCGPVILVIMSGGGMDVSFARTNDKI 577

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPGT 431
            +ILWVGYPG+AGGAAIAD++FG  NP T
Sbjct: 578 TSILWVGYPGEAGGAAIADIIFGYYNPST 606


>gi|356574315|ref|XP_003555294.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 5-like
           [Glycine max]
          Length = 901

 Score =  530 bits (1365), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 266/437 (60%), Positives = 321/437 (73%), Gaps = 8/437 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ------GNTG 54
           MYN  +AGLT+WSPNVN+FRDPRWGRGQETPGEDP++  +YA  Y+RGLQ          
Sbjct: 136 MYNVDLAGLTFWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVMYLRGLQEVEDEASAKA 195

Query: 55  SRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCS 114
            RLKV++CCKHYTAYDLDNW G+DR+HF+A+V+KQDLED+Y  PFK+CVVEG V+SVMCS
Sbjct: 196 DRLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDSYQPPFKSCVVEGHVSSVMCS 255

Query: 115 YNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKA 174
           YN+VNG PTCADPD+LK  I GQW LDGYIVSDCDSV V YN  HYT TPE+A A A+KA
Sbjct: 256 YNRVNGIPTCADPDLLKGIIRGQWGLDGYIVSDCDSVEVYYNAIHYTATPEDAVALALKA 315

Query: 175 GLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPR 234
           GL+++CG FL  +T  AV    +    V+ AL Y   V MRLG FD +P + PF NLGP 
Sbjct: 316 GLNMNCGDFLKKYTANAVNLKKVDVATVDQALVYNYIVLMRLGFFD-DPKSLPFANLGPS 374

Query: 235 DVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA 294
           DVCT  +QQLAL AA QGIVLL+N+   LPLS      +AVIGPN++ T  MI NYAG+ 
Sbjct: 375 DVCTKDNQQLALDAAKQGIVLLENNNGALPLSQTNIKKLAVIGPNANATTVMISNYAGIP 434

Query: 295 CGYTTPLQGISRYAKTIHQA-GCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
           C YT+PLQG+ +Y  +++ A GC  V C+   LI AA  AA  ADA VLV+GLDQSIEAE
Sbjct: 435 CRYTSPLQGLQKYISSVNYAPGCSNVKCDNQSLIAAAVKAAASADAVVLVVGLDQSIEAE 494

Query: 354 FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 413
            +DR  L LPG Q++ V  VA A++G V+LV+M  GP+D+S  K+   IG ILWVGYPGQ
Sbjct: 495 GLDRENLTLPGFQEKFVKDVAGATKGKVILVIMAAGPIDISSTKSVSNIGGILWVGYPGQ 554

Query: 414 AGGAAIADVLFGRANPG 430
           AGG AIA V+FG  NPG
Sbjct: 555 AGGDAIAQVIFGDYNPG 571


>gi|302141935|emb|CBI19138.3| unnamed protein product [Vitis vinifera]
          Length = 1411

 Score =  522 bits (1344), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 249/437 (56%), Positives = 323/437 (73%), Gaps = 8/437 (1%)

Query: 1    MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT--GSR- 56
            MYN G A G+T+W+PN+NIFRDPRWGRGQETPGEDPV+ GKYA SYVRGLQG+T  G + 
Sbjct: 811  MYNAGQAMGMTFWAPNINIFRDPRWGRGQETPGEDPVVAGKYAVSYVRGLQGDTFEGGKV 870

Query: 57   --LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCS 114
              L+ +ACCKH+TAYDLDNW  +DRY F+ARV+ QDL DTY  PF++C+ EG+ + +MC+
Sbjct: 871  DVLQASACCKHFTAYDLDNWTSIDRYTFDARVTMQDLADTYQPPFRSCIEEGRASGLMCA 930

Query: 115  YNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKA 174
            YN VNG P CAD ++L  T  GQW  DGYIVSDCD+V ++++ Q Y ++PE+A A  + A
Sbjct: 931  YNLVNGVPNCADFNLLSKTARGQWGFDGYIVSDCDAVSLVHDVQGYAKSPEDAVAIVLTA 990

Query: 175  GLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPR 234
            G+D+ CG +L  H + AV    L E +++ AL    TV+MRLG+F+G P   PFGN+GP 
Sbjct: 991  GMDVACGGYLQKHAKSAVSQKKLTESEIDRALLNLFTVRMRLGLFNGNPRKLPFGNIGPD 1050

Query: 235  DVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA 294
             VC+  HQ LAL+AA  GIVLLKNS R LPLS     ++AVIGPN++ T T++GNYAG  
Sbjct: 1051 QVCSTEHQTLALEAARSGIVLLKNSDRLLPLSKGETLSLAVIGPNANATDTLLGNYAGPP 1110

Query: 295  CGYTTPLQGISRYA-KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
            C + +PLQG+  Y   T++ AGC  VAC+   +  A +V A+QAD  VLVMGLDQ+ E E
Sbjct: 1111 CKFISPLQGLQSYVNNTMYHAGCNDVACSSASIENAVDV-AKQADYVVLVMGLDQTQERE 1169

Query: 354  FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 413
              DR  L+LPG+Q++L++ VAKA++ PVVLVL+CGGPVD+SFAK    IG+ILW GYPG+
Sbjct: 1170 KYDRLDLVLPGKQEQLITGVAKAAKKPVVLVLLCGGPVDISFAKGSSNIGSILWAGYPGE 1229

Query: 414  AGGAAIADVLFGRANPG 430
            AGGAAIA+ +FG  NPG
Sbjct: 1230 AGGAAIAETIFGDHNPG 1246



 Score =  473 bits (1217), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 236/438 (53%), Positives = 314/438 (71%), Gaps = 9/438 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLK- 58
           +YN G   G+T+W+PN+NIFRDPRWGRGQETPGEDP++TG YA SYVRG+QG+    LK 
Sbjct: 143 VYNAGQTKGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGSYAVSYVRGVQGDCLRGLKR 202

Query: 59  -----VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
                 +ACCKH+TAYDLD+W G+DR+ F+ARV+ QDL DTY  PF  C+ EG+ + +MC
Sbjct: 203 CGELQASACCKHFTAYDLDDWKGIDRFKFDARVTMQDLADTYQPPFHRCIEEGRASGIMC 262

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           +YN+VNG P+CAD ++L NT   +W   GYI SDCD+V +++++  + +TPE+A  D +K
Sbjct: 263 AYNRVNGVPSCADFNLLTNTARKRWNFQGYITSDCDAVSLIHDSYGFAKTPEDAVVDVLK 322

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AG+D++CG +L  HT+ AV    L E +++ AL     V+MRLG+F+G P  QP+G++GP
Sbjct: 323 AGMDVNCGTYLLNHTKSAVMQKKLPESELDRALENLFAVRMRLGLFNGNPKGQPYGDIGP 382

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
             VC+  HQ LAL AA  GIVLLKNS R LPL   +  ++AVIGPN++   T+IGNYAG 
Sbjct: 383 NQVCSVEHQTLALDAARDGIVLLKNSQRLLPLPKGKTMSLAVIGPNANSPKTLIGNYAGP 442

Query: 294 ACGYTTPLQGISRYAK-TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
            C + TPLQ +  Y K T++  GC  VAC+   +  A E+ A++AD  VLVMGLDQ+ E 
Sbjct: 443 PCKFITPLQALQSYVKSTMYHPGCDAVACSSPSIEKAVEI-AQKADYVVLVMGLDQTQER 501

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E  DR  L+LPG+QQ+L+  VA A++ PVVLVL+ GGPVD+SFAK    IG+ILW GYPG
Sbjct: 502 EAHDRLDLVLPGKQQQLIICVANAAKKPVVLVLLSGGPVDISFAKYSNNIGSILWAGYPG 561

Query: 413 QAGGAAIADVLFGRANPG 430
            AGGAAIA+ +FG  NPG
Sbjct: 562 GAGGAAIAETIFGDHNPG 579


>gi|449531013|ref|XP_004172482.1| PREDICTED: beta-D-xylosidase 1-like, partial [Cucumis sativus]
          Length = 534

 Score =  520 bits (1339), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 264/340 (77%), Positives = 292/340 (85%)

Query: 91  LEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDS 150
           LEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD+LKNTI G W LDGYIVSDCDS
Sbjct: 1   LEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDLLKNTIRGAWGLDGYIVSDCDS 60

Query: 151 VGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTI 210
           VGVLY++QH+T TPEEAAA  IKAGLDLDCGPFLA+HT  AV  GLL+E D+N ALA  +
Sbjct: 61  VGVLYDSQHFTPTPEEAAASTIKAGLDLDCGPFLAVHTATAVGRGLLKEVDLNNALANLL 120

Query: 211 TVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRH 270
           +VQMRLGMFDGEP+AQP+GNLGP+DVCTPAH+ LAL+AA QGIVLL+N A  LPLS  RH
Sbjct: 121 SVQMRLGMFDGEPAAQPYGNLGPKDVCTPAHKHLALEAARQGIVLLQNRAGALPLSPTRH 180

Query: 271 HTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAA 330
            TVAVIGPNSD TVTMIGNYAGVAC YTTP+QGIS+Y KTIH  GC  VAC G+QLIG A
Sbjct: 181 RTVAVIGPNSDATVTMIGNYAGVACEYTTPVQGISKYVKTIHAKGCANVACVGDQLIGEA 240

Query: 331 EVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGP 390
           E AAR ADA V+V+GLDQSIEAE  DR G+LLPG+Q+ELV R+  A +GP V+VLM GGP
Sbjct: 241 EAAARVADAAVVVVGLDQSIEAESRDRNGVLLPGKQEELVRRIGLACKGPTVVVLMSGGP 300

Query: 391 VDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +DVSFAKND +I  ILWVGYPGQAGGAAIADVLFG  NPG
Sbjct: 301 IDVSFAKNDGKISGILWVGYPGQAGGAAIADVLFGATNPG 340


>gi|302786124|ref|XP_002974833.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
 gi|300157728|gb|EFJ24353.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
          Length = 784

 Score =  519 bits (1336), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 247/437 (56%), Positives = 323/437 (73%), Gaps = 9/437 (2%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN------TG 54
           ++N G AGLT+WSPNVNIFRDPRWGRGQETPGEDP+L  K+A+ YVRGLQG       + 
Sbjct: 150 LHNLGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLASKFASLYVRGLQGGAYEGSASD 209

Query: 55  SRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCS 114
             LKV+ACCKH TAYD+DNW G+DRYHFNA VS+QDL DTYN PF++C+ +G+V+SVMCS
Sbjct: 210 GFLKVSACCKHLTAYDVDNWKGMDRYHFNAEVSEQDLVDTYNPPFQSCIEDGRVSSVMCS 269

Query: 115 YNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKA 174
           YN+VNG PTCAD ++L  T+   W  +GYIVSDCD++ VL+    Y  + E+A AD+I A
Sbjct: 270 YNRVNGVPTCADRNLLTETVRNSWGFNGYIVSDCDALQVLFEDTTYAPSAEDAVADSILA 329

Query: 175 GLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPR 234
           GLDL+CG FL  H + A++ G + E D++ A++  +  +MRLG+FDG+P++QP+ +LG  
Sbjct: 330 GLDLNCGTFLGKHAKSALQAGKITEADLDHAVSNLMRTRMRLGLFDGDPNSQPYSSLGAT 389

Query: 235 DVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA 294
           D+C+  HQQLAL AA QG+VLLKN   +LPLST    TVA+IGPN++ T TM+GNY G+ 
Sbjct: 390 DICSNDHQQLALDAALQGVVLLKNDG-SLPLSTAL-KTVALIGPNANATYTMLGNYEGIP 447

Query: 295 CGYTTPLQGISRYAKTI-HQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
           C Y +PLQG+  Y+  I +  GC  VACN   L+ +A   A +ADA VLV+GLDQS E E
Sbjct: 448 CKYISPLQGMQIYSSNILYSPGCRNVACNEGDLVASAVEVATKADAVVLVVGLDQSQERE 507

Query: 354 FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 413
             DR  LLLPG Q +LVS +A A   P+VLV+M  GPVD+S  K++ RI +++W+GYPGQ
Sbjct: 508 TFDRTSLLLPGMQSQLVSNIANAVTSPIVLVIMSAGPVDISTFKDNSRISSVIWLGYPGQ 567

Query: 414 AGGAAIADVLFGRANPG 430
           +GGAA+A V+FG  NPG
Sbjct: 568 SGGAALAHVVFGAYNPG 584


>gi|255545664|ref|XP_002513892.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
 gi|223546978|gb|EEF48475.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
          Length = 774

 Score =  517 bits (1331), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 240/438 (54%), Positives = 325/438 (74%), Gaps = 9/438 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT--GSRL 57
           +YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++TGKYA SYVRG+QG++  G +L
Sbjct: 143 VYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDSFQGGKL 202

Query: 58  K----VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
           K     +ACCKH+TAYDLDNW GV+R+ F+ARV+ QDL DTY  PF++CV +GK + +MC
Sbjct: 203 KGHLQASACCKHFTAYDLDNWKGVNRFVFDARVTMQDLADTYQPPFQSCVQQGKASGIMC 262

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           +YN+VNG P+CAD ++L  T  GQW   GYI SDCD+V ++Y+ Q Y ++PE+A  D +K
Sbjct: 263 AYNRVNGIPSCADFNLLSRTARGQWDFHGYIASDCDAVSIIYDNQGYAKSPEDAVVDVLK 322

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AG+D++CG +L  HT+ AV    L E  ++ AL    +V+MRLG+F+G P+ QPF N+GP
Sbjct: 323 AGMDVNCGSYLQKHTKAAVEQKKLPEASIDRALHNLFSVRMRLGLFNGNPTEQPFSNIGP 382

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
             VC+  HQ LAL+AA  GIVLLKNSAR LPL   +  ++AVIGPN++   T++GNYAG 
Sbjct: 383 DQVCSQEHQILALEAARNGIVLLKNSARLLPLQKSKTVSLAVIGPNANSVQTLLGNYAGP 442

Query: 294 ACGYTTPLQGISRYAK-TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
            C   TPLQ +  Y K TI+ +GC  V C+   +  A ++ A+  D  V++MGLDQ+ E 
Sbjct: 443 PCKTVTPLQALQYYVKNTIYYSGCDTVKCSSASIDKAVDI-AKGVDRVVMIMGLDQTQER 501

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E +DR  L+LPG+QQEL++ VAK+++ P+VLVL+ GGPVD+SFAK D  IG+ILW GYPG
Sbjct: 502 EELDRLDLVLPGKQQELITNVAKSAKNPIVLVLLSGGPVDISFAKYDENIGSILWAGYPG 561

Query: 413 QAGGAAIADVLFGRANPG 430
           +AGG A+A+++FG  NPG
Sbjct: 562 EAGGIALAEIIFGDHNPG 579


>gi|302796583|ref|XP_002980053.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
 gi|300152280|gb|EFJ18923.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
          Length = 772

 Score =  513 bits (1321), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 255/431 (59%), Positives = 330/431 (76%), Gaps = 3/431 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ-GNTGS-RLK 58
           M+N G +GLT+WSPN+NI+RDPRWGRGQETPGEDP+L+  +A  YVRGLQ    GS +LK
Sbjct: 143 MHNLGQSGLTFWSPNINIYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQESQAGSDKLK 202

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           V+ACCKH TAYD+DNW G DRYHFNA V++QDLEDTYN PFK+CV +G V+SVMCSYN++
Sbjct: 203 VSACCKHMTAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSSVMCSYNRL 262

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           NG PTCAD ++L  T+   W+L+GYIVSDCDS+ V ++  +Y  T E+AAADA+ AGL+L
Sbjct: 263 NGVPTCADHELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAEDAAADALLAGLNL 322

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG FLA HT  A++   + E  +N AL Y +TVQMRLG++DG+P +Q +G+LG  DVCT
Sbjct: 323 NCGTFLAKHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGSLGASDVCT 382

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
             HQ LAL+AA QG+VLLKN    LPLST +  ++AV+GP+++ T  MIGNYAG+ C YT
Sbjct: 383 SEHQTLALEAARQGMVLLKNLG-ALPLSTSKIKSLAVVGPHANATRAMIGNYAGIPCKYT 441

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           +PLQ   +YA+  +  GC  VAC+ + LI  A  AA  ADA V+ +GLD +IEAE +DR 
Sbjct: 442 SPLQAFQKYAQVSYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLTIEAESLDRT 501

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            LLLPG+QQELVS+V +A++GPVV+V++  G +D+ FA +D RI  ILW GYPGQAGGAA
Sbjct: 502 SLLLPGKQQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAGYPGQAGGAA 561

Query: 419 IADVLFGRANP 429
           IA+V+FG  NP
Sbjct: 562 IAEVIFGDHNP 572


>gi|302811514|ref|XP_002987446.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
 gi|300144852|gb|EFJ11533.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
          Length = 772

 Score =  513 bits (1321), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 255/431 (59%), Positives = 330/431 (76%), Gaps = 3/431 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ-GNTGS-RLK 58
           M+N G +GLT+WSPN+NI+RDPRWGRGQETPGEDP+L+  +A  YVRGLQ    GS +LK
Sbjct: 143 MHNLGQSGLTFWSPNINIYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQESQAGSDKLK 202

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           V+ACCKH TAYD+DNW G DRYHFNA V++QDLEDTYN PFK+CV +G V+SVMCSYN++
Sbjct: 203 VSACCKHMTAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSSVMCSYNRL 262

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           NG PTCAD ++L  T+   W+L+GYIVSDCDS+ V ++  +Y  T E+AAADA+ AGL+L
Sbjct: 263 NGVPTCADHELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAEDAAADALLAGLNL 322

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG FLA HT  A++   + E  +N AL Y +TVQMRLG++DG+P +Q +G+LG  DVCT
Sbjct: 323 NCGTFLAKHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGSLGASDVCT 382

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
             HQ LAL+AA QG+VLLKN    LPLST +  ++AV+GP+++ T  MIGNYAG+ C YT
Sbjct: 383 SEHQTLALEAARQGMVLLKNLG-ALPLSTSKIKSLAVVGPHANATRAMIGNYAGIPCKYT 441

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           +PLQ   +YA+  +  GC  VAC+ + LI  A  AA  ADA V+ +GLD +IEAE +DR 
Sbjct: 442 SPLQAFQKYAQVSYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLTIEAESLDRT 501

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            LLLPG+QQELVS+V +A++GPVV+V++  G +D+ FA +D RI  ILW GYPGQAGGAA
Sbjct: 502 SLLLPGKQQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAGYPGQAGGAA 561

Query: 419 IADVLFGRANP 429
           IA+V+FG  NP
Sbjct: 562 IAEVIFGDHNP 572


>gi|37359708|dbj|BAC98299.1| LEXYL2 [Solanum lycopersicum]
          Length = 633

 Score =  512 bits (1318), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 250/433 (57%), Positives = 324/433 (74%), Gaps = 5/433 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR---L 57
           MYN G AGLTYWSPNVNI+RDPRWGRGQET GEDP L+ +Y  +YV+GLQ     +   L
Sbjct: 13  MYNVGQAGLTYWSPNVNIYRDPRWGRGQETAGEDPTLSSRYGVAYVKGLQQRDDGKKDML 72

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVA+CCKHYTAYD+D+W G+ RY+FNA+V++QDL+DT+N PFK+CV++G VASVMCSYNQ
Sbjct: 73  KVASCCKHYTAYDVDDWKGIQRYNFNAKVTQQDLDDTFNPPFKSCVLDGNVASVMCSYNQ 132

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           V+GKPTC D D+L   I GQW+L+GYIV+DCDS+  +Y  QHYT+TPEE AA ++ AGL 
Sbjct: 133 VDGKPTCGDYDLLAGVIRGQWKLNGYIVTDCDSLNEMYWAQHYTKTPEETAALSLNAGLG 192

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           L+CG +L  +T+GAV  GL+ E  ++ A+       MRLG FDG P  Q +GNLGP+D+C
Sbjct: 193 LNCGSWLGKYTQGAVNQGLVNESVIDRAVTNNFATLMRLGFFDGNPKNQLYGNLGPKDIC 252

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           T  HQ+LA +AA QGIVLLKN+A +LPLS     ++AVIGPN+++  TM+G+Y G  C Y
Sbjct: 253 TEDHQELAREAARQGIVLLKNTAGSLPLSPKSIKSLAVIGPNANLAYTMVGSYEGSPCKY 312

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TTPL G+     T++Q GC  +AC   Q +  A+  A  ADA VLVMG DQ+IE E  DR
Sbjct: 313 TTPLDGLGASVSTVYQQGC-DIACATAQ-VDNAKKVAAAADAVVLVMGSDQTIERESKDR 370

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             + LPG+Q  LV+ VA  S+GPV+LV+M GG +DV FA ++P++ +ILWVG+PG+AGGA
Sbjct: 371 FNITLPGQQSLLVTEVASVSKGPVILVIMSGGGMDVKFAVDNPKVTSILWVGFPGEAGGA 430

Query: 418 AIADVLFGRANPG 430
           A+ADV+FG  NPG
Sbjct: 431 ALADVVFGYHNPG 443


>gi|302760655|ref|XP_002963750.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
 gi|300169018|gb|EFJ35621.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
          Length = 785

 Score =  508 bits (1308), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 244/437 (55%), Positives = 319/437 (72%), Gaps = 9/437 (2%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN------TG 54
           ++N G AGLT+WSPNVNIFRDPRWGRGQETPGEDP+L  K+A+ YVRGLQG       + 
Sbjct: 151 LHNLGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLASKFASLYVRGLQGGAYGGSASD 210

Query: 55  SRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCS 114
             LKV+ACCKH TAYD+DNW G+DRYHFNA VS+QDL DTYN PF++C+ +G+V+SVMCS
Sbjct: 211 GFLKVSACCKHLTAYDMDNWKGMDRYHFNAEVSEQDLVDTYNPPFQSCIEDGRVSSVMCS 270

Query: 115 YNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKA 174
           YN+VNG PTCAD  +L  T+   W  +GYIVSDCD++ VL+    Y  + E+A AD+I A
Sbjct: 271 YNRVNGVPTCADRSLLTETVRNSWGFNGYIVSDCDALQVLFEDTTYAPSAEDAVADSILA 330

Query: 175 GLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPR 234
           GLDL+CG FL  H + A++ G + E D++ A++  +  +MRLG+FDG+ + +P+ +LG  
Sbjct: 331 GLDLNCGTFLGKHAKSALQAGKVTEADLDHAISNLMRTRMRLGLFDGDLNTRPYSSLGAT 390

Query: 235 DVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA 294
           D+C+  HQQLAL AA QG+VLLKN   +LPLST    TVA+IGPN++ T TM+GNY G+ 
Sbjct: 391 DICSNDHQQLALDAALQGVVLLKNDG-SLPLSTAL-KTVALIGPNANATYTMLGNYEGIP 448

Query: 295 CGYTTPLQGISRYAKTI-HQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
           C Y +PLQG+  Y   I +  GC  VAC+   L+ +A   A +ADA VLV+GLDQS E E
Sbjct: 449 CKYVSPLQGMQIYNNNILYSPGCRDVACSEGDLVASAVEVATKADAVVLVVGLDQSQERE 508

Query: 354 FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 413
             DR  LLLPG Q +LVS +A A   P+VLV+M  GPVD+S  K++ RI +++W+GYPGQ
Sbjct: 509 TFDRTSLLLPGMQSQLVSNIANAVTCPIVLVIMSAGPVDISTFKDNSRISSVIWIGYPGQ 568

Query: 414 AGGAAIADVLFGRANPG 430
           +GGAA+A V+FG  NPG
Sbjct: 569 SGGAALAHVVFGAYNPG 585


>gi|224082152|ref|XP_002306583.1| predicted protein [Populus trichocarpa]
 gi|222856032|gb|EEE93579.1| predicted protein [Populus trichocarpa]
          Length = 745

 Score =  507 bits (1306), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 241/438 (55%), Positives = 315/438 (71%), Gaps = 9/438 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT------ 53
           +YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++TGKYA SYVRGLQG++      
Sbjct: 142 LYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLMTGKYAVSYVRGLQGDSFKGGEI 201

Query: 54  GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
              L+ +ACCKH+TAYDL+NWNG  RY F+A V+ QDL DTY  PFK+CV EG+ + +MC
Sbjct: 202 KGPLQASACCKHFTAYDLENWNGTSRYVFDAYVTAQDLADTYQPPFKSCVEEGRASGIMC 261

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           +YN+VNG P CAD + L  T   QW  DGYI SDCD+V ++++ Q Y +TPE+A    +K
Sbjct: 262 AYNRVNGIPNCADSNFLSRTARAQWGFDGYIASDCDAVSIIHDAQGYAKTPEDAVVAVLK 321

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AG+D++CG +L  HT+ AV    L   +++ AL    +V+MRLG+F+G P+ Q FGN+GP
Sbjct: 322 AGMDVNCGSYLQQHTKAAVDQKKLTISEIDRALHNLFSVRMRLGLFNGNPTGQQFGNIGP 381

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
             VC+  +Q LAL AA  GIVLLKNSA  LPLS  +  ++AVIGPN++   T++GNYAG 
Sbjct: 382 DQVCSQENQILALDAARNGIVLLKNSAGLLPLSKSKTMSLAVIGPNANSVQTLLGNYAGP 441

Query: 294 ACGYTTPLQGISRYAK-TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
            C   TPLQ +  Y K TI   GC  V C+   ++GA  V A+ AD  VL+MGLD + E 
Sbjct: 442 PCKLVTPLQALQSYIKHTIPYPGCDSVQCSSASIVGAVNV-AKGADHVVLIMGLDDTQEK 500

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E +DR  L+LPG+QQEL+  VAKA++ PVVLVL+ GGPVD+SFAKND  IG+ILW GYPG
Sbjct: 501 EGLDRRDLVLPGKQQELIISVAKAAKNPVVLVLLSGGPVDISFAKNDKNIGSILWAGYPG 560

Query: 413 QAGGAAIADVLFGRANPG 430
           +AG  A+A+++FG  NPG
Sbjct: 561 EAGAIALAEIIFGDHNPG 578


>gi|224066931|ref|XP_002302285.1| predicted protein [Populus trichocarpa]
 gi|222844011|gb|EEE81558.1| predicted protein [Populus trichocarpa]
          Length = 773

 Score =  506 bits (1304), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 237/438 (54%), Positives = 318/438 (72%), Gaps = 9/438 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT------ 53
           +YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++TG YAASYV+G+QG++      
Sbjct: 142 LYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGLYAASYVKGVQGDSFEGGKI 201

Query: 54  GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
              L+ +ACCKH+TAYDLDNW G++R+ F+ARV+ QDL DTY  PFK+CV +G+ + +MC
Sbjct: 202 KGHLQASACCKHFTAYDLDNWKGMNRFVFDARVTMQDLADTYQPPFKSCVEQGRASGIMC 261

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           +YN+VNG P+CAD ++L  T   QW   GYI SDCD+V ++++ Q Y ++PE+A  D +K
Sbjct: 262 AYNKVNGVPSCADSNLLSKTARAQWGFRGYITSDCDAVSIIHDDQGYAKSPEDAVVDVLK 321

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AG+D++CG +L  H + AV    L E D++ AL    +V+MRLG+F+G P  Q FGN+GP
Sbjct: 322 AGMDVNCGSYLLKHAKVAVEQKKLSESDIDKALHNLFSVRMRLGLFNGRPEGQLFGNIGP 381

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
             VC+  HQ LAL+AA  GIVLLKNSAR LPLS  +  ++AVIGPN++    ++GNYAG 
Sbjct: 382 DQVCSQEHQILALEAARNGIVLLKNSARLLPLSKSKTKSLAVIGPNANSGQMLLGNYAGP 441

Query: 294 ACGYTTPLQGISRYAK-TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
            C + TPLQ +  Y K T++   C  V C+   +  A +V A+ AD  VL+MGLDQ+ E 
Sbjct: 442 PCRFVTPLQALQSYIKQTVYHPACDTVQCSSASVDRAVDV-AKGADNVVLMMGLDQTQER 500

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E +DR  LLLPG+QQEL+  VAKA++ PVVLVL  GGPVD+SFAKND  IG+ILW GYPG
Sbjct: 501 EELDRTDLLLPGKQQELIIAVAKAAKNPVVLVLFSGGPVDISFAKNDKNIGSILWAGYPG 560

Query: 413 QAGGAAIADVLFGRANPG 430
           + G  A+A+++FG  NPG
Sbjct: 561 EGGAIALAEIVFGDHNPG 578


>gi|85813772|emb|CAJ65922.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
          Length = 757

 Score =  506 bits (1304), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 260/440 (59%), Positives = 314/440 (71%), Gaps = 43/440 (9%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ----GNTGSR 56
           MYN G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KY + YV+GLQ    GN    
Sbjct: 160 MYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSGYVKGLQQRDDGNPDG- 218

Query: 57  LKVAACCKHYTAYDLDNWNGVDRYHFNARV-SKQDLEDTYNVPFKACVVEGKVASVMCSY 115
           LKVAACCKHYTAYDLDNW GVDRYHFNA V +KQD++DT+  PFK+CVV+G VASVMCSY
Sbjct: 219 LKVAACCKHYTAYDLDNWKGVDRYHFNAVVVTKQDMDDTFQPPFKSCVVDGNVASVMCSY 278

Query: 116 NQVNGKPTCADPDILKNTIHGQWRLDGY--IVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           N+VNG PTCADPD+L   I G+W+L+GY  IV+DCDS+ V YN+QHYT+TPEEAAA AI 
Sbjct: 279 NKVNGIPTCADPDLLSGVIRGEWKLNGYVYIVTDCDSIDVFYNSQHYTKTPEEAAAKAIL 338

Query: 174 AG--LDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNL 231
           AG  LDL+CG FL  HTE AV  GL+ E  ++ A++      MRLG FDG+PS Q +G L
Sbjct: 339 AGIGLDLNCGSFLGKHTEAAVTAGLVNESAIDRAVSNNFATLMRLGFFDGDPSKQLYGKL 398

Query: 232 GPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA 291
           GP+DVCT  +Q+LA +AA QGIVLLKN+                                
Sbjct: 399 GPKDVCTAENQELAREAARQGIVLLKNT-------------------------------- 426

Query: 292 GVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIE 351
           G  C YTTPLQG++    T +  GC  VAC+  Q +  A+  A  ADATVLVMG D SIE
Sbjct: 427 GTPCKYTTPLQGLAALVATTYLPGCSNVACSTAQ-VDDAKKIAAAADATVLVMGADLSIE 485

Query: 352 AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYP 411
           AE  DR  +LLPG+QQ L++ VA AS GPV+LV+M GG +DVSFAK + +I +ILWVGYP
Sbjct: 486 AESRDRVDILLPGQQQLLITAVANASTGPVILVIMSGGGMDVSFAKTNDKITSILWVGYP 545

Query: 412 GQAGGAAIADVLFGRANPGT 431
           G+AGGAAIAD++FG  NP T
Sbjct: 546 GEAGGAAIADIIFGSYNPST 565


>gi|297842585|ref|XP_002889174.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335015|gb|EFH65433.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 766

 Score =  504 bits (1298), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 237/439 (53%), Positives = 321/439 (73%), Gaps = 10/439 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT------ 53
           +YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++TG YA +YVRGLQG++      
Sbjct: 137 VYNAGQAQGMTFWAPNINIFRDPRWGRGQETPGEDPIMTGTYAVAYVRGLQGDSFDGRKT 196

Query: 54  -GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVM 112
               L+ +ACCKH+TAYDLD W G+ RY FNA+VS  DL +TY  PFK C+ EG+ + +M
Sbjct: 197 LSIHLQASACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQPPFKKCIEEGRASGIM 256

Query: 113 CSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAI 172
           C+YN+VNG P+CADP++L  T  G WR  GYI SDCD+V ++++ Q Y +TPE+A AD +
Sbjct: 257 CAYNRVNGIPSCADPNLLTRTARGLWRFRGYITSDCDAVSIIHDAQGYAKTPEDAVADVL 316

Query: 173 KAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLG 232
           KAG+D++CG +L  HT+ A++   + E D++ AL    +V++RLG+F+G+P+  P+GN+ 
Sbjct: 317 KAGMDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGDPTKLPYGNIS 376

Query: 233 PRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAG 292
           P DVC+PAHQ LAL+AA  GIVLLKN+ + LP S     ++AVIGPN+ V  T++GNYAG
Sbjct: 377 PNDVCSPAHQALALEAARNGIVLLKNNLKLLPFSKRSVSSLAVIGPNAHVAKTLLGNYAG 436

Query: 293 VACGYTTPLQGISRYAKT-IHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIE 351
             C   TPL  +  Y K  ++  GC  VAC+ N  I  A   AR AD  VL+MGLDQ+ E
Sbjct: 437 PPCKTVTPLDALRSYVKNAVYHNGCDSVACS-NAAIDQAVAIARNADHVVLIMGLDQTQE 495

Query: 352 AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYP 411
            E +DR  L LPG+QQEL++ VA A++ PVVLVL+CGGPVD+SFA N+ +IG+I+W GYP
Sbjct: 496 KEDMDRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFATNNDKIGSIMWAGYP 555

Query: 412 GQAGGAAIADVLFGRANPG 430
           G+AGG A+A+++FG  NPG
Sbjct: 556 GEAGGIALAEIIFGDHNPG 574


>gi|326513064|dbj|BAK03439.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 694

 Score =  504 bits (1297), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 255/414 (61%), Positives = 308/414 (74%), Gaps = 7/414 (1%)

Query: 22  PRWGRGQETPGEDPVLTGKYAASYVRGLQ-----GNTGSRLKVAACCKHYTAYDLDNWNG 76
           P +    ETPGEDP+L  KYA  YV GLQ     G T   LKVAACCKHYTAYD+DNW G
Sbjct: 91  PAYEWWSETPGEDPLLASKYAVGYVTGLQDAGAGGVTDGALKVAACCKHYTAYDVDNWKG 150

Query: 77  VDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILKNTIHG 136
           V+RY F+A+VS+QDL+DT+  PFK+CV++G VASVMCSYN+VNGKPTCAD D+L+  I G
Sbjct: 151 VERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIRG 210

Query: 137 QWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGL 196
            W+L+GYIVSDCDSV VLY  QHYT+TPEEAAA  IK+GLDL+CG FLA HT  AV+ G 
Sbjct: 211 DWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIKSGLDLNCGNFLAQHTVAAVQAGE 270

Query: 197 LREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLL 256
           L EEDV+ A+     + MRLG FDG+P    FG+LGP+DVCT ++++LA + A QGIVLL
Sbjct: 271 LSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVLL 330

Query: 257 KNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRYAKTIHQAGC 316
           KNS   LPLS     ++AVIGPN++ + TMIGNY G  C YTTPLQG+     T++Q GC
Sbjct: 331 KNSG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPLQGLGAKVNTVYQPGC 389

Query: 317 FGVACNGNQL-IGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAK 375
             V C+GN L +  A  AA  AD TVLV+G DQSIE E +DR  LLLPG+Q +LVS VA 
Sbjct: 390 TNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDRTSLLLPGQQTQLVSAVAN 449

Query: 376 ASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           AS GPV+LV+M GGP D+SFAK   +I AILWVGYPG+AGGAA+AD+LFG  NP
Sbjct: 450 ASSGPVILVVMSGGPFDISFAKASDKIAAILWVGYPGEAGGAALADILFGSHNP 503


>gi|302791321|ref|XP_002977427.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
 gi|300154797|gb|EFJ21431.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
          Length = 772

 Score =  502 bits (1293), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 238/443 (53%), Positives = 314/443 (70%), Gaps = 15/443 (3%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN-------- 52
           MYN G +GLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KYAA +VRGLQ          
Sbjct: 134 MYNAGRSGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAYFVRGLQEGDYDEGTAI 193

Query: 53  -----TGSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGK 107
                + +RLKV++CCKH+TAYD++   G D +HFNA+V+ QDL+DT++ PF++C+V+G+
Sbjct: 194 STMQGSPTRLKVSSCCKHFTAYDMEKSEGTDCFHFNAQVTVQDLQDTFDPPFRSCIVDGQ 253

Query: 108 VASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEA 167
            + +MCSYN+VNG P+CAD   L  T+   W  +GYIVSDCD+V +LY   +YT T E+A
Sbjct: 254 ASGLMCSYNRVNGVPSCADYTFLTETVRNSWGFEGYIVSDCDAVALLYEYINYTTTAEDA 313

Query: 168 AADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQP 227
            AD + AG+DL+CG FL  HT  A+  G + E  V+ AL+  +TV+MRLG+FDG  S + 
Sbjct: 314 VADVLSAGMDLNCGTFLLRHTAAAIEQGKVTEAAVDRALSNVMTVRMRLGLFDGN-SGET 372

Query: 228 FGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMI 287
           + ++GP  VCTP H+QL+L+AA QGIVLLKNS   LP       T+AVIGP+ + T TM+
Sbjct: 373 YNSIGPDAVCTPEHRQLSLEAAEQGIVLLKNSGNVLPFPRNDLMTIAVIGPSGNATETML 432

Query: 288 GNYAGVACGYTTPLQGISRYAK-TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGL 346
           GNYAGV C Y TP QG+  Y K  + + GC  + CN   L  AA  AA  +DA V+V+GL
Sbjct: 433 GNYAGVPCQYITPFQGLQEYTKCVVFEPGCKDIMCNDTTLFLAAVRAAENSDAVVIVVGL 492

Query: 347 DQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
           D+  E E +DR  LLLPG QQ LV  V+K ++GPV+LV+M GGP+DV+FAK + +I  +L
Sbjct: 493 DKDQEREGLDRTSLLLPGNQQGLVLEVSKVAKGPVILVVMSGGPIDVTFAKENCKISNVL 552

Query: 407 WVGYPGQAGGAAIADVLFGRANP 429
           WVGYPG+AGG AIA V+FG  NP
Sbjct: 553 WVGYPGEAGGKAIARVIFGDHNP 575


>gi|15218202|ref|NP_177929.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
 gi|259585708|sp|Q9SGZ5.2|BXL7_ARATH RecName: Full=Probable beta-D-xylosidase 7; Short=AtBXL7; Flags:
           Precursor
 gi|18086336|gb|AAL57631.1| At1g78060/F28K19_32 [Arabidopsis thaliana]
 gi|332197942|gb|AEE36063.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
          Length = 767

 Score =  502 bits (1292), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 235/439 (53%), Positives = 320/439 (72%), Gaps = 10/439 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT------ 53
           +YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++TG YA +YVRGLQG++      
Sbjct: 138 VYNAGQANGMTFWAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDSFDGRKT 197

Query: 54  -GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVM 112
             + L+ +ACCKH+TAYDLD W G+ RY FNA+VS  DL +TY  PFK C+ EG+ + +M
Sbjct: 198 LSNHLQASACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQPPFKKCIEEGRASGIM 257

Query: 113 CSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAI 172
           C+YN+VNG P+CADP++L  T  GQW   GYI SDCD+V ++Y+ Q Y ++PE+A AD +
Sbjct: 258 CAYNRVNGIPSCADPNLLTRTARGQWAFRGYITSDCDAVSIIYDAQGYAKSPEDAVADVL 317

Query: 173 KAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLG 232
           KAG+D++CG +L  HT+ A++   + E D++ AL    +V++RLG+F+G+P+  P+GN+ 
Sbjct: 318 KAGMDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGDPTKLPYGNIS 377

Query: 233 PRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAG 292
           P +VC+PAHQ LAL AA  GIVLLKN+ + LP S     ++AVIGPN+ V  T++GNYAG
Sbjct: 378 PNEVCSPAHQALALDAARNGIVLLKNNLKLLPFSKRSVSSLAVIGPNAHVVKTLLGNYAG 437

Query: 293 VACGYTTPLQGISRYAKT-IHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIE 351
             C   TPL  +  Y K  ++  GC  VAC+ N  I  A   A+ AD  VL+MGLDQ+ E
Sbjct: 438 PPCKTVTPLDALRSYVKNAVYHQGCDSVACS-NAAIDQAVAIAKNADHVVLIMGLDQTQE 496

Query: 352 AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYP 411
            E  DR  L LPG+QQEL++ VA A++ PVVLVL+CGGPVD+SFA N+ +IG+I+W GYP
Sbjct: 497 KEDFDRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFAANNNKIGSIIWAGYP 556

Query: 412 GQAGGAAIADVLFGRANPG 430
           G+AGG AI++++FG  NPG
Sbjct: 557 GEAGGIAISEIIFGDHNPG 575


>gi|302786474|ref|XP_002975008.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
 gi|300157167|gb|EFJ23793.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
          Length = 772

 Score =  500 bits (1287), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 237/443 (53%), Positives = 314/443 (70%), Gaps = 15/443 (3%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG--------- 51
           MYN G +GLT+WSPN+NIFRDPRWGRGQETPGEDP+L+ KYAA +VRGLQ          
Sbjct: 134 MYNAGRSGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAYFVRGLQEGDYDEGTAI 193

Query: 52  ----NTGSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGK 107
                  +RLKV++CCKH+TAYD++   G D +HFNA+V+ QDL+DT++ PF++C+V+G+
Sbjct: 194 STMQRRPTRLKVSSCCKHFTAYDMEKSEGTDCFHFNAQVTVQDLQDTFDPPFRSCIVDGQ 253

Query: 108 VASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEA 167
            + +MCSYN+VNG P+CAD   L  T+   W  +GYIVSDCD+V +LY   +YT T E+A
Sbjct: 254 ASGLMCSYNRVNGVPSCADYTFLTETVRNSWGFEGYIVSDCDAVALLYEYINYTTTAEDA 313

Query: 168 AADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQP 227
            AD + AG+DL+CG FL  HT  A+  G + E  V+ AL+  +TV+MRLG+FDG  S + 
Sbjct: 314 VADVLSAGMDLNCGTFLLRHTAAAIEQGKVTEAAVDRALSNVMTVRMRLGLFDGN-SGET 372

Query: 228 FGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMI 287
           + ++GP  VCT  H+QL+L+AA QGIVLLKNS   LP       T+AVIGP+ + T TM+
Sbjct: 373 YNSIGPDAVCTREHRQLSLEAAEQGIVLLKNSGNVLPFPRNDLMTIAVIGPSGNATETML 432

Query: 288 GNYAGVACGYTTPLQGISRYAK-TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGL 346
           GNYAGV C Y TP QG+  Y K  + + GC  + CN   L  AA  AA  +DA V+V+GL
Sbjct: 433 GNYAGVPCQYITPFQGLQEYTKGVVFEPGCKDIMCNDTTLFLAAVRAAENSDAVVIVVGL 492

Query: 347 DQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
           D+  E E +DR  LLLPG QQ+LV  V+K ++GPV+LV+M GGP+DV+FAK + +I ++L
Sbjct: 493 DKDQEREGLDRTSLLLPGYQQDLVLEVSKVAKGPVILVVMSGGPIDVTFAKGNCKISSVL 552

Query: 407 WVGYPGQAGGAAIADVLFGRANP 429
           WVGYPG+AGG AIA V+FG  NP
Sbjct: 553 WVGYPGEAGGKAIARVIFGDHNP 575


>gi|224066929|ref|XP_002302284.1| predicted protein [Populus trichocarpa]
 gi|222844010|gb|EEE81557.1| predicted protein [Populus trichocarpa]
          Length = 742

 Score =  498 bits (1282), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 232/438 (52%), Positives = 318/438 (72%), Gaps = 13/438 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT------ 53
           +YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++ GKYA SYVRG+QG++      
Sbjct: 149 IYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLVAGKYAVSYVRGVQGDSFGGGTL 208

Query: 54  GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
           G +L+ +ACCKH+TAYDLD W G++R+ F+A    QDL DTY  PF++C+ EGK + +MC
Sbjct: 209 GEQLQASACCKHFTAYDLDKWKGMNRFVFDA----QDLADTYQPPFQSCIQEGKASGIMC 264

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           +YN+VNG P CAD ++L     GQW   GYI SDCD+V ++++ Q Y ++PE+A AD +K
Sbjct: 265 AYNRVNGVPNCADYNLLSKKARGQWGFYGYITSDCDAVAIIHDDQGYAKSPEDAVADVLK 324

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AG+D++CG +L  +T+ AV+   L E +++ AL    +++MRLG+F+G P+ QP+GN+ P
Sbjct: 325 AGMDVNCGDYLKNYTKSAVKKKKLPESEIDRALHNLFSIRMRLGLFNGNPTKQPYGNIAP 384

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
             VC+  HQ LAL+AA  GIVLLKN  + LPLS L   ++AVIGPN++ +  ++GNY G 
Sbjct: 385 DQVCSQEHQALALKAAQDGIVLLKNPDKLLPLSKLETKSLAVIGPNANNSTKLLGNYFGP 444

Query: 294 ACGYTTPLQGISRYAK-TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
            C   TPLQG+  Y K T +  GC  VAC+   +  A ++ A+ AD  +LVMGLDQ+ E 
Sbjct: 445 PCKTVTPLQGLQNYIKNTRYHPGCSRVACSSASINQAVKI-AKGADQVILVMGLDQTQEK 503

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E  DR  L+LPG+Q+EL++ VAKA++ PVVLVL CGGPVDVSFAK D  IG+I+W GYPG
Sbjct: 504 EEQDRVDLVLPGKQRELITAVAKAAKKPVVLVLFCGGPVDVSFAKYDQNIGSIIWAGYPG 563

Query: 413 QAGGAAIADVLFGRANPG 430
           +AGG A+A ++FG  NPG
Sbjct: 564 EAGGTALAQIIFGDHNPG 581


>gi|168065036|ref|XP_001784462.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162663987|gb|EDQ50724.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 726

 Score =  497 bits (1280), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 257/438 (58%), Positives = 318/438 (72%), Gaps = 13/438 (2%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ-GNTGS-RLK 58
            YN G+AGLTYWSP +NI RDPRWGR QET GEDP  T  YA  +V+G+Q G+  S RLK
Sbjct: 99  FYNDGIAGLTYWSPVINIARDPRWGRIQETSGEDPYTTSAYATHFVQGMQEGDANSKRLK 158

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           ++ACCKH+TAYD+DNW G+DRYHF+A   K +L DTYN PF++CV EG+ AS+MCSYN+V
Sbjct: 159 LSACCKHFTAYDVDNWEGIDRYHFDA---KANLADTYNPPFQSCVQEGRSASLMCSYNKV 215

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           NG PTCA+ D L+NT+   W L+GYIVSDCDSV V++ + +Y  T E+AAADA+ AGLDL
Sbjct: 216 NGVPTCANYDFLENTVRRAWGLNGYIVSDCDSVLVMHESTNYAPTTEDAAADALNAGLDL 275

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG +LA +TEGAV  G +    V+ A+     V+MRLGMFDG P+ Q FGN+G  DVCT
Sbjct: 276 NCGDYLASYTEGAVAMGKVNASRVDNAVYNVFLVRMRLGMFDGNPANQEFGNIGVADVCT 335

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PAHQ+LA++AA QGIVLLKN    LPLS  ++   AVIGPN++ T TM+GNY G+ C Y 
Sbjct: 336 PAHQELAVEAARQGIVLLKNDGNILPLS--KNINTAVIGPNANATHTMLGNYEGIPCQYI 393

Query: 299 TPLQGISR-----YAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
           TPLQG+ +     Y K     GC   AC  +  I +A   A  ADA VLV+GL Q  E+E
Sbjct: 394 TPLQGLVKFGSGDYHKVWFSEGCVNTACQQDDQISSAVSTAAVADAVVLVVGLSQVQESE 453

Query: 354 FIDRAGLLLPGRQQELVSRVAKASRG-PVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
            +DR  LLLPG QQ L+  VA A+ G PVVLVLMC GPVD++FAKND RI +ILWVGYPG
Sbjct: 454 ALDRTSLLLPGYQQTLIDEVAGAAAGRPVVLVLMCAGPVDINFAKNDKRIQSILWVGYPG 513

Query: 413 QAGGAAIADVLFGRANPG 430
           Q+GG AIA+V+FG  NPG
Sbjct: 514 QSGGQAIAEVIFGAHNPG 531


>gi|449508468|ref|XP_004163321.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 7-like
           [Cucumis sativus]
          Length = 783

 Score =  496 bits (1277), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 231/438 (52%), Positives = 316/438 (72%), Gaps = 9/438 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT------ 53
           +YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++TGKY+ +YVRG+QG+       
Sbjct: 152 VYNAGQAKGMTFWTPNINIFRDPRWGRGQETPGEDPLMTGKYSVAYVRGIQGDAIEGGKL 211

Query: 54  GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
           G++LK +ACCKH+TAYDLD WNG+ RY F+A+V+ QD+ DTY  PF++CV EGK + +MC
Sbjct: 212 GNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADTYQPPFESCVEEGKASGIMC 271

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           +YN+VNG P+CAD  +L  T   QW+ +GYI SDCD+V ++++ Q Y + PE+A AD ++
Sbjct: 272 AYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVSIIHDAQGYAKIPEDAVADVLR 331

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AG+D++CG +L  HT+ AV    +    ++ AL    +V+MRLG+FDG P+  PFG +G 
Sbjct: 332 AGMDVNCGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSVRMRLGLFDGNPTKLPFGQIGR 391

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
             VC+  HQ LALQAA +GIVLLKNSA+ LPLS    H++AVIG N +   T+ GNYAG+
Sbjct: 392 DQVCSQQHQNLALQAAREGIVLLKNSAKLLPLSKSNTHSLAVIGHNGNDPKTLRGNYAGI 451

Query: 294 ACGYTTPLQGISRYAK-TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
            C   TP QG++ Y K T++  GC    C    +  A ++ A+  D  VLVMGLDQ+ E 
Sbjct: 452 PCKSATPFQGLNNYVKNTVYHRGCNYANCTEATIYQAVKI-AKSVDYVVLVMGLDQTQER 510

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E  DR  L LPG+Q +L++ VAKA++ PV+LV++ GGPVD+S AK + +IG+ILW GYPG
Sbjct: 511 EDFDRTELGLPGKQDKLIAEVAKAAKXPVILVILSGGPVDISSAKYNEKIGSILWAGYPG 570

Query: 413 QAGGAAIADVLFGRANPG 430
           QAGG AIA+++FG  NPG
Sbjct: 571 QAGGTAIAEIIFGDHNPG 588


>gi|449433577|ref|XP_004134574.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
 gi|449530107|ref|XP_004172038.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
          Length = 812

 Score =  496 bits (1277), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 245/440 (55%), Positives = 306/440 (69%), Gaps = 13/440 (2%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR---- 56
           MYN G AGLTYWSP +N+ RDPRWGR  ETPGEDP + GKYA +YVRGLQ   GS     
Sbjct: 175 MYNLGRAGLTYWSPTINVIRDPRWGRTVETPGEDPFVVGKYAKNYVRGLQDVEGSENVTD 234

Query: 57  -----LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASV 111
                LKV++CCKHY AYD+DNW GV+RY F+ARV++QD+ +T+N PF+ CV EG V+SV
Sbjct: 235 LNSRPLKVSSCCKHYAAYDVDNWLGVERYSFDARVTEQDMLETFNKPFEMCVKEGDVSSV 294

Query: 112 MCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTR-TPEEAAAD 170
           MCSYN+VNG PTCADP +LK+TI G W L GYIVSDCDSV V+    HY + T E+A A 
Sbjct: 295 MCSYNRVNGIPTCADPVLLKDTIRGNWGLHGYIVSDCDSVKVMVEDAHYLQDTNEDAVAQ 354

Query: 171 AIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGN 230
            +KAGLDLDCG     +TE  VR G +   +++ AL     V MRLG FDG      F +
Sbjct: 355 TLKAGLDLDCGQIYPNYTESTVRQGKVGMRNIDNALNNLYVVLMRLGYFDGNTG---FES 411

Query: 231 LGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY 290
           LG  D+C+  H +LA +AA QG VLLKN   TLP     + T+AV+GP+++ T  M+GNY
Sbjct: 412 LGKPDICSDEHIELATEAARQGTVLLKNDNDTLPFDPSNYKTLAVVGPHANATSAMLGNY 471

Query: 291 AGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSI 350
           AGV C   +P+ G+S YAK  +Q GC  VAC  +  I  A  AAR +DATV+ +G+D SI
Sbjct: 472 AGVPCRMNSPMDGLSEYAKVKYQMGCDSVACKNDTFIFGAMEAARTSDATVIFVGIDLSI 531

Query: 351 EAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGY 410
           EAE +DR  LLLPG Q +LV +VA  S+GPVVLV++  G +DVSFAKN+  I AI+W GY
Sbjct: 532 EAESLDRVDLLLPGYQTQLVQQVATVSKGPVVLVILSAGGIDVSFAKNNSNIKAIIWAGY 591

Query: 411 PGQAGGAAIADVLFGRANPG 430
           PG+ GG AIADV+FG+ NPG
Sbjct: 592 PGEEGGRAIADVIFGKFNPG 611


>gi|449465962|ref|XP_004150696.1| PREDICTED: probable beta-D-xylosidase 7-like [Cucumis sativus]
          Length = 783

 Score =  496 bits (1277), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 231/438 (52%), Positives = 316/438 (72%), Gaps = 9/438 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT------ 53
           +YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++TGKY+ +YVRG+QG+       
Sbjct: 152 VYNAGQAKGMTFWTPNINIFRDPRWGRGQETPGEDPLMTGKYSVAYVRGIQGDAIEGGKL 211

Query: 54  GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
           G++LK +ACCKH+TAYDLD WNG+ RY F+A+V+ QD+ DTY  PF++CV EGK + +MC
Sbjct: 212 GNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADTYQPPFESCVEEGKASGIMC 271

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           +YN+VNG P+CAD  +L  T   QW+ +GYI SDCD+V ++++ Q Y + PE+A AD ++
Sbjct: 272 AYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVSIIHDAQGYAKIPEDAVADVLR 331

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AG+D++CG +L  HT+ AV    +    ++ AL    +V+MRLG+FDG P+  PFG +G 
Sbjct: 332 AGMDVNCGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSVRMRLGLFDGNPTKLPFGQIGR 391

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
             VC+  HQ LALQAA +GIVLLKNSA+ LPLS    H++AVIG N +   T+ GNYAG+
Sbjct: 392 DQVCSQQHQNLALQAAREGIVLLKNSAKLLPLSKSNTHSLAVIGHNGNDPKTLRGNYAGI 451

Query: 294 ACGYTTPLQGISRYAK-TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
            C   TP QG++ Y K T++  GC    C    +  A ++ A+  D  VLVMGLDQ+ E 
Sbjct: 452 PCKSATPFQGLNNYVKNTVYHRGCNYANCTEATIYQAVKI-AKSVDYVVLVMGLDQTQER 510

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E  DR  L LPG+Q +L++ VAKA++ PV+LV++ GGPVD+S AK + +IG+ILW GYPG
Sbjct: 511 EDFDRTELGLPGKQDKLIAEVAKAAKRPVILVILSGGPVDISSAKYNEKIGSILWAGYPG 570

Query: 413 QAGGAAIADVLFGRANPG 430
           QAGG AIA+++FG  NPG
Sbjct: 571 QAGGTAIAEIIFGDHNPG 588


>gi|115485165|ref|NP_001067726.1| Os11g0297800 [Oryza sativa Japonica Group]
 gi|62734696|gb|AAX96805.1| beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|77549999|gb|ABA92796.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
           expressed [Oryza sativa Japonica Group]
 gi|113644948|dbj|BAF28089.1| Os11g0297800 [Oryza sativa Japonica Group]
 gi|125534139|gb|EAY80687.1| hypothetical protein OsI_35869 [Oryza sativa Indica Group]
 gi|215766717|dbj|BAG98945.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 782

 Score =  493 bits (1269), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 243/433 (56%), Positives = 309/433 (71%), Gaps = 4/433 (0%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKV 59
           +YN G A GLT WSPNVNIFRDPRWGRGQETPGEDP +  KYA ++V+G+QGN+ + L+ 
Sbjct: 155 LYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPTMASKYAVAFVKGMQGNSSAILQT 214

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           +ACCKH TAYDL++WNGV RY+FNA+V+ QDLEDTYN PF++CVV+ K   +MC+Y  +N
Sbjct: 215 SACCKHVTAYDLEDWNGVQRYNFNAKVTAQDLEDTYNPPFRSCVVDAKATCIMCAYTGIN 274

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           G P CA+ D+L  T+ G W LDGYI SDCD+V ++ + Q YT+TPE+A A A+KAGLD++
Sbjct: 275 GVPACANADLLTKTVRGDWGLDGYIASDCDAVAIMRDAQRYTQTPEDAVAVALKAGLDMN 334

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEP-SAQPFGNLGPRDVCT 238
           CG ++  H   A++ G L EED++ AL     ++MRLG FDG+P S   +G LG  D+CT
Sbjct: 335 CGTYMQQHATAAIQQGKLTEEDIDKALKNLFAIRMRLGHFDGDPRSNSVYGGLGAADICT 394

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           P H+ LAL+AA  GIVLLKN A  LPL      + AVIGPN++  + +IGNY G  C  T
Sbjct: 395 PEHRSLALEAAMDGIVLLKNDAGILPLDRTAVASAAVIGPNANDGLALIGNYFGPPCEST 454

Query: 299 TPLQGISRYAKTIH-QAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TPL GI  Y K +   AGC   AC+      AA VA+  +D   L MGL Q  E+E  DR
Sbjct: 455 TPLNGILGYIKNVRFLAGCNSAACDVAATDQAAAVAS-SSDYVFLFMGLSQKQESEGRDR 513

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             LLLPG QQ L++ VA A++ PV+LVL+ GGPVDV+FA+ +P+IGAILW GYPGQAGG 
Sbjct: 514 TSLLLPGEQQSLITAVADAAKRPVILVLLTGGPVDVTFAQTNPKIGAILWAGYPGQAGGL 573

Query: 418 AIADVLFGRANPG 430
           AIA VLFG  NPG
Sbjct: 574 AIARVLFGDHNPG 586


>gi|218191593|gb|EEC74020.1| hypothetical protein OsI_08964 [Oryza sativa Indica Group]
          Length = 774

 Score =  492 bits (1266), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 237/438 (54%), Positives = 311/438 (71%), Gaps = 9/438 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN------T 53
           +YN G A GLT+W+PN+N+FRDPRWGRGQETPGEDP +TGKYAA +VRG+QG        
Sbjct: 140 VYNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQGYALAGAIN 199

Query: 54  GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
            + L+ +ACCKH+TAYDL+NW GV RY F+A+V+ QDL DTYN PF++CV +G  + +MC
Sbjct: 200 STDLEASACCKHFTAYDLENWKGVTRYAFDAKVTAQDLADTYNPPFRSCVEDGGASGIMC 259

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           SYN+VNG PTCAD ++L  T  G WR  GYI SDCD+V ++++ Q Y +T E+A AD +K
Sbjct: 260 SYNRVNGVPTCADYNLLSKTARGDWRFYGYITSDCDAVSIIHDVQGYAKTAEDAVADVLK 319

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AG+D++CG ++  H   A++ G + E+D+N AL     V+MRLG+F+G P    +GN+GP
Sbjct: 320 AGMDVNCGSYVQEHGLSAIQQGKITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGP 379

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
             VCT  HQ LAL+AA  G+VLLKN A  LPLS  +  ++AVIG N++    ++GNY G 
Sbjct: 380 DQVCTQEHQNLALEAAQHGVVLLKNDANALPLSKSQVSSIAVIGHNANDATRLLGNYFGP 439

Query: 294 ACGYTTPLQGISRYAK-TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
            C   TPLQ +  Y K T   AGC   ACN + +  AA++A+   D  VL MGLDQ  E 
Sbjct: 440 PCISVTPLQVLQGYVKDTRFLAGCNSAACNVSSIGEAAQLAS-SVDYVVLFMGLDQDQER 498

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E +DR  L LPG Q+ L++ VA A++ PV+LVL+CGGPVDV+FAK +P+IGAILW GYPG
Sbjct: 499 EEVDRLELSLPGMQENLINTVANAAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPG 558

Query: 413 QAGGAAIADVLFGRANPG 430
           +AGG AIA VLFG  NPG
Sbjct: 559 EAGGIAIAQVLFGEHNPG 576


>gi|115448721|ref|NP_001048140.1| Os02g0752200 [Oryza sativa Japonica Group]
 gi|46390122|dbj|BAD15557.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|46390225|dbj|BAD15656.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|113537671|dbj|BAF10054.1| Os02g0752200 [Oryza sativa Japonica Group]
 gi|125583710|gb|EAZ24641.1| hypothetical protein OsJ_08409 [Oryza sativa Japonica Group]
          Length = 780

 Score =  492 bits (1266), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 237/438 (54%), Positives = 311/438 (71%), Gaps = 9/438 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN------T 53
           +YN G A GLT+W+PN+N+FRDPRWGRGQETPGEDP +TGKYAA +VRG+QG        
Sbjct: 146 VYNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQGYALAGAIN 205

Query: 54  GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
            + L+ +ACCKH+TAYDL+NW GV RY F+A+V+ QDL DTYN PF++CV +G  + +MC
Sbjct: 206 STDLEASACCKHFTAYDLENWKGVTRYAFDAKVTAQDLADTYNPPFRSCVEDGGASGIMC 265

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           SYN+VNG PTCAD ++L  T  G WR  GYI SDCD+V ++++ Q Y +T E+A AD +K
Sbjct: 266 SYNRVNGVPTCADYNLLSKTARGDWRFYGYITSDCDAVSIIHDVQGYAKTAEDAVADVLK 325

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AG+D++CG ++  H   A++ G + E+D+N AL     V+MRLG+F+G P    +GN+GP
Sbjct: 326 AGMDVNCGSYVQEHGLSAIQQGKITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGP 385

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
             VCT  HQ LAL+AA  G+VLLKN A  LPLS  +  ++AVIG N++    ++GNY G 
Sbjct: 386 DQVCTQEHQNLALEAAQHGVVLLKNDANALPLSKSQVSSIAVIGHNANDATRLLGNYFGP 445

Query: 294 ACGYTTPLQGISRYAK-TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
            C   TPLQ +  Y K T   AGC   ACN + +  AA++A+   D  VL MGLDQ  E 
Sbjct: 446 PCISVTPLQVLQGYVKDTRFLAGCNSAACNVSSIGEAAQLAS-SVDYVVLFMGLDQDQER 504

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E +DR  L LPG Q+ L++ VA A++ PV+LVL+CGGPVDV+FAK +P+IGAILW GYPG
Sbjct: 505 EEVDRLELSLPGMQENLINTVANAAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPG 564

Query: 413 QAGGAAIADVLFGRANPG 430
           +AGG AIA VLFG  NPG
Sbjct: 565 EAGGIAIAQVLFGEHNPG 582


>gi|125534112|gb|EAY80660.1| hypothetical protein OsI_35838 [Oryza sativa Indica Group]
          Length = 771

 Score =  491 bits (1263), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 246/432 (56%), Positives = 303/432 (70%), Gaps = 4/432 (0%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKV 59
           +YN G A GLT WSPNVNI+RDPRWGRGQETPGEDP    KYA ++V+GLQG+T   L+ 
Sbjct: 140 LYNLGQAEGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQGSTPGTLQT 199

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           +ACCKH TAYDL+ WNGV RY+FNA+V+ QDL DT+N PFK+CVV+ K + VMC+Y  +N
Sbjct: 200 SACCKHATAYDLEEWNGVARYNFNAKVTAQDLADTFNPPFKSCVVDAKASCVMCAYTDIN 259

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           G P CA  D+L  T  GQW LDGY+ SDCD+V +L + Q Y  TPE+  A AIKAGLDL+
Sbjct: 260 GVPACASSDLLSKTFRGQWGLDGYVSSDCDAVALLRDAQRYAPTPEDTVAVAIKAGLDLN 319

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEP-SAQPFGNLGPRDVCT 238
           CG +  +H   A++ G +RE DV+ AL     V+MRLG FDG+P S   +G+LG  DVCT
Sbjct: 320 CGNYTQVHGMAALQQGKMRESDVDRALTNLFAVRMRLGHFDGDPRSNAAYGHLGAADVCT 379

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
            AH+ LAL+AA  GIVLLKN A  LPL      + AVIGPN++    + GNY G  C  T
Sbjct: 380 QAHRDLALEAAQNGIVLLKNDAGALPLDRATVRSAAVIGPNANDPAALNGNYFGPPCETT 439

Query: 299 TPLQGISRYAKTIH-QAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TPLQG+ RY  ++   AGC   AC G    G A   A  +D  ++ MGL Q  E E +DR
Sbjct: 440 TPLQGVQRYISSVRFLAGCDSPAC-GFAATGQAAALASSSDQVIMFMGLSQDQEKEGLDR 498

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             LLLPG+QQ L++ VA A+R PV+LVL+ GGPVDV+FAKN+P+IGAILW GYPGQAGG 
Sbjct: 499 TSLLLPGKQQSLITAVASAARRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQAGGL 558

Query: 418 AIADVLFGRANP 429
           AIA VLFG  NP
Sbjct: 559 AIAKVLFGDHNP 570


>gi|62701898|gb|AAX92971.1| beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|62733926|gb|AAX96035.1| beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|77550045|gb|ABA92842.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
           expressed [Oryza sativa Japonica Group]
 gi|125576900|gb|EAZ18122.1| hypothetical protein OsJ_33667 [Oryza sativa Japonica Group]
          Length = 771

 Score =  491 bits (1263), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 246/432 (56%), Positives = 303/432 (70%), Gaps = 4/432 (0%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKV 59
           +YN G A GLT WSPNVNI+RDPRWGRGQETPGEDP    KYA ++V+GLQG+T   L+ 
Sbjct: 140 LYNLGQAEGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQGSTPGTLQT 199

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           +ACCKH TAYDL+ WNGV RY+FNA+V+ QDL DT+N PFK+CVV+ K + VMC+Y  +N
Sbjct: 200 SACCKHATAYDLEEWNGVARYNFNAKVTAQDLADTFNPPFKSCVVDAKASCVMCAYTDIN 259

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           G P CA  D+L  T  GQW LDGY+ SDCD+V +L + Q Y  TPE+  A AIKAGLDL+
Sbjct: 260 GVPACASSDLLSKTFRGQWGLDGYVSSDCDAVALLRDAQRYAPTPEDTVAVAIKAGLDLN 319

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEP-SAQPFGNLGPRDVCT 238
           CG +  +H   A++ G +RE DV+ AL     V+MRLG FDG+P S   +G+LG  DVCT
Sbjct: 320 CGNYTQVHGMAALQQGKMRESDVDRALTNLFAVRMRLGHFDGDPRSNAAYGHLGAADVCT 379

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
            AH+ LAL+AA  GIVLLKN A  LPL      + AVIGPN++    + GNY G  C  T
Sbjct: 380 QAHRDLALEAAQDGIVLLKNDAGALPLDRATVRSAAVIGPNANDPAALNGNYFGPPCETT 439

Query: 299 TPLQGISRYAKTIH-QAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TPLQG+ RY  ++   AGC   AC G    G A   A  +D  ++ MGL Q  E E +DR
Sbjct: 440 TPLQGVQRYISSVRFLAGCDSPAC-GFAATGQAAALASSSDQVIMFMGLSQDQEKEGLDR 498

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             LLLPG+QQ L++ VA A+R PV+LVL+ GGPVDV+FAKN+P+IGAILW GYPGQAGG 
Sbjct: 499 TSLLLPGKQQSLITAVASAARRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQAGGL 558

Query: 418 AIADVLFGRANP 429
           AIA VLFG  NP
Sbjct: 559 AIAKVLFGDHNP 570


>gi|125534137|gb|EAY80685.1| hypothetical protein OsI_35867 [Oryza sativa Indica Group]
          Length = 779

 Score =  491 bits (1263), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 232/432 (53%), Positives = 306/432 (70%), Gaps = 4/432 (0%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKV 59
            YN G A GL  WSPNVNIFRDPRWGRGQETPGEDP    KY A++V+GLQG++ + L+ 
Sbjct: 151 FYNLGQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLTNLQT 210

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           +ACCKH TAYD++ W GV RY+FNA+V+ QDL DTYN PF++CVV+GK + +MC+Y  +N
Sbjct: 211 SACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLIN 270

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           G P CA  D+L  T+ G+W+LDGY  SDCD+V +L+ ++H+TRT EEA A A+KAGLD++
Sbjct: 271 GVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAGLDIN 330

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQP-FGNLGPRDVCT 238
           CG ++  +   A++ G + E+DV+ AL     ++MRLG FDG+P     +G LG  DVCT
Sbjct: 331 CGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLGAADVCT 390

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           P H+ LAL+AA +G+VLLKN AR LPL      + AVIG N++  + ++GNY G+ C  T
Sbjct: 391 PVHKALALEAARRGVVLLKNDARLLPLRAPTVSSAAVIGHNANDILALLGNYYGLPCETT 450

Query: 299 TPLQGISRYAKTIH-QAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TP  GI +Y K+     GC   AC+       A   A+ +D   LVMGL Q  E E +DR
Sbjct: 451 TPFGGIQKYVKSAKFLPGCSSAACD-VAATDQATALAKSSDYVFLVMGLSQKQEQEGLDR 509

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             LLLPG+QQ L++ VA AS+ PV+L+L+ GGPVD++FA+ +P+IGAILW GYPGQAGG 
Sbjct: 510 TSLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQ 569

Query: 418 AIADVLFGRANP 429
           AIADVLFG  NP
Sbjct: 570 AIADVLFGEFNP 581


>gi|357156390|ref|XP_003577440.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 755

 Score =  490 bits (1262), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 244/435 (56%), Positives = 313/435 (71%), Gaps = 8/435 (1%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKV 59
           M+N G A GLT WSPNVNI+RDPRWGRGQETPGEDP    KYA ++VRGLQG + + L+ 
Sbjct: 126 MFNLGQADGLTIWSPNVNIYRDPRWGRGQETPGEDPATASKYAVAFVRGLQGTSTTTLQT 185

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           +ACCKH TAYDLD+WN + RY+FNA+V+ QDLE+T+N PFK+CVVEGK   VMC+Y  VN
Sbjct: 186 SACCKHATAYDLDDWNRIGRYNFNAKVTAQDLEETFNPPFKSCVVEGKATCVMCAYTSVN 245

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           G P CAD  +L  TI G+W ++GYI SDCD+V +LY T+ Y+ TPE+A A AIKAGLD++
Sbjct: 246 GIPACADSGLLTKTIKGEWGMNGYISSDCDAVALLYGTR-YSGTPEDAVAAAIKAGLDMN 304

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQP-FGNLGPRDVCT 238
           CG F  +H   A++   + E+DV+ AL     ++MRLG FDG+P   P +G LG +DVC+
Sbjct: 305 CGNFSQVHGMAALQQRKMSEQDVDKALRNLFAIRMRLGHFDGDPLQSPLYGRLGAQDVCS 364

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLS--TLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
           PAH+ LAL+AA  GIVLLKN A TLPLS  T    + AVIGPN++    ++GNY G  C 
Sbjct: 365 PAHKDLALEAAQNGIVLLKNDAATLPLSRPTAASASFAVIGPNANEPGALLGNYFGPPCE 424

Query: 297 YTTPLQGISR-YAKTIHQA-GCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEF 354
            TTPLQ + + Y+K +    GC   ACN      A+ +AA  +D T+L MGL Q  E E 
Sbjct: 425 TTTPLQALQKFYSKNVRFVPGCDSAACNVADTYQASGLAA-TSDYTILFMGLSQKQEQEG 483

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
           +DR  LLLPG+Q+ L++ VA A++ P++LVL+ GGPVD++FAK +P+IGAILW GYPGQA
Sbjct: 484 LDRTSLLLPGKQESLITAVAAAAKRPIILVLLTGGPVDITFAKFNPKIGAILWAGYPGQA 543

Query: 415 GGAAIADVLFGRANP 429
           GG AIA VLFG  NP
Sbjct: 544 GGLAIAKVLFGEHNP 558


>gi|297745533|emb|CBI40698.3| unnamed protein product [Vitis vinifera]
          Length = 461

 Score =  490 bits (1262), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 241/375 (64%), Positives = 291/375 (77%), Gaps = 4/375 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ-GNTGS--RL 57
           MYN G+AGLT+WSPNVNIFRDPRWGRGQETPGEDP+L+ KYA+ YVRGLQ  + GS  RL
Sbjct: 1   MYNVGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVRGLQQSDDGSPDRL 60

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           K+AACCKHYTAYDLDNW GVDR+HFNA V+KQD++DT+  PFK+CV++G VASVMCSYNQ
Sbjct: 61  KIAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQ 120

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNGKP CADPD+L   + G+W+L+GYIVSDCDSV V YN+QHYT+TPEEAAA AI AGLD
Sbjct: 121 VNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYNSQHYTKTPEEAAAKAILAGLD 180

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           L+CG FL  HTE AV+GGL+ E  V+ A++      MRLG FDG PS   +G LGP+DVC
Sbjct: 181 LNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGPKDVC 240

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           T  HQ+LA +AA QGI+LLKNS  +LPLS     T+A+IGPN++VT TMIGNY G  C Y
Sbjct: 241 TLEHQELAREAARQGIMLLKNSKGSLPLSPTAIKTLAIIGPNANVTKTMIGNYEGTPCKY 300

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TTPLQG+     T + +GC  VAC+  Q I  A+  A  ADATVL++G+DQSIEAE  DR
Sbjct: 301 TTPLQGLMALVATTYLSGCSNVACSTAQ-IDEAKKIAAAADATVLIVGIDQSIEAEGRDR 359

Query: 358 AGLLLPGRQQELVSR 372
             + LPG+Q  L++ 
Sbjct: 360 VNIQLPGQQPLLITE 374


>gi|18025342|gb|AAK38482.1| beta-D-xylosidase [Hordeum vulgare]
          Length = 777

 Score =  489 bits (1260), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 238/438 (54%), Positives = 306/438 (69%), Gaps = 9/438 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN------T 53
           +YN G A GLT+W+PN+N+FRDPRWGRGQETPGEDP +TGKYAA +VRG+QG        
Sbjct: 144 VYNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQGYGMSGAIN 203

Query: 54  GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
            S L+ +ACCKH+TAYDL+NW GV R+ F+A+V++QDL DTYN PFK+CV +G  + +MC
Sbjct: 204 SSDLEASACCKHFTAYDLENWKGVTRFAFDAKVTEQDLADTYNPPFKSCVEDGGASGIMC 263

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           SYN+VNG PTCAD ++L  T  G W  +GYI SDCD+V ++++ Q Y + PE+A AD +K
Sbjct: 264 SYNRVNGVPTCADHNLLSKTARGDWSFNGYITSDCDAVAIIHDVQGYAKAPEDAVADVLK 323

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AG+D++CG ++  H   A + G +  ED++ AL     ++MRLG+FDG P    +GN+G 
Sbjct: 324 AGMDVNCGGYIQTHGVSAYQQGKITGEDIDRALRNLFAIRMRLGLFDGNPKYNRYGNIGA 383

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
             VC+  HQ LALQAA  GIVLLKN    LPLS  +  ++AVIGPN +    ++GNY G 
Sbjct: 384 DQVCSKEHQDLALQAARDGIVLLKNDGAALPLSKSKVSSLAVIGPNGNNASLLLGNYFGP 443

Query: 294 ACGYTTPLQGISRYAKTIHQA-GCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
            C   TPLQ +  Y K      GC    CN +  IG A  AA  AD  VL MGLDQ+ E 
Sbjct: 444 PCISVTPLQALQGYVKDARFVQGCNAAVCNVSN-IGEAVHAAGSADYVVLFMGLDQNQER 502

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E +DR  L LPG Q+ LV+ VA A++ PV+LVL+CGGPVDV+FAKN+P+IGAI+W GYPG
Sbjct: 503 EEVDRLELGLPGMQESLVNSVADAAKKPVILVLLCGGPVDVTFAKNNPKIGAIVWAGYPG 562

Query: 413 QAGGAAIADVLFGRANPG 430
           QAGG AIA VLFG  NPG
Sbjct: 563 QAGGIAIAQVLFGDHNPG 580


>gi|242062502|ref|XP_002452540.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
 gi|241932371|gb|EES05516.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
          Length = 784

 Score =  489 bits (1258), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 238/438 (54%), Positives = 311/438 (71%), Gaps = 9/438 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN------T 53
           +YN G A GLT+W+PN+N+FRDPRWGRGQETPGEDP +TGKYAA +VRG+QG        
Sbjct: 152 VYNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQGYGVAGPVN 211

Query: 54  GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
            + L+ +ACCKH+TAYDL+NW G+ RY ++A+V+ QDLEDTYN PFK+CV +G  + +MC
Sbjct: 212 STDLEASACCKHFTAYDLENWKGITRYVYDAKVTAQDLEDTYNPPFKSCVEDGHASGIMC 271

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           SYN+VNG PTCAD ++L  T    W   GYI SDCD+V ++++ Q Y +T E+A AD +K
Sbjct: 272 SYNRVNGVPTCADYNLLSKTARQSWGFYGYITSDCDAVSIIHDAQGYAKTSEDAVADVLK 331

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AG+D++CG ++  +   A++ G + E+D+N AL    TV+MRLG+F+G+P    +GN+GP
Sbjct: 332 AGMDVNCGGYVQKYGASALQQGKITEQDINRALHNLFTVRMRLGLFNGDPRRNRYGNIGP 391

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
             VCT  HQ LAL+AA  GIVLLKN    LPLS     ++AVIG N++   +++GNY G 
Sbjct: 392 DQVCTQEHQDLALEAAQDGIVLLKNDGGALPLSKSGVASLAVIGFNANNATSLLGNYFGP 451

Query: 294 ACGYTTPLQGISRYAK-TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
            C   TPLQ +  Y K T   AGC   ACN   +  A + AA  AD+ VL MGLDQ+ E 
Sbjct: 452 PCVTVTPLQVLQGYVKDTSFVAGCNSAACNVTTIPEAVQ-AASSADSVVLFMGLDQNQER 510

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E +DR  L LPG+QQ L+  VA A++ PV+LVL+CGGPVDVSFAK +P+IGAILW GYPG
Sbjct: 511 EEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPG 570

Query: 413 QAGGAAIADVLFGRANPG 430
           +AGG AIA VLFG  NPG
Sbjct: 571 EAGGIAIAQVLFGEHNPG 588


>gi|62734691|gb|AAX96800.1| Glycosyl hydrolase family 3 C terminal domain, putative [Oryza
           sativa Japonica Group]
 gi|77549994|gb|ABA92791.1| beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 853

 Score =  488 bits (1257), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 231/432 (53%), Positives = 305/432 (70%), Gaps = 4/432 (0%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKV 59
            YN G A GL  WSPNVNIFRDPRWGRGQETPGEDP    KY A++V+GLQG++ + L+ 
Sbjct: 225 FYNLGQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLTNLQT 284

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           +ACCKH TAYD++ W GV RY+FNA+V+ QDL DTYN PF++CVV+GK + +MC+Y  +N
Sbjct: 285 SACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLIN 344

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           G P CA  D+L  T+ G+W+LDGY  SDCD+V +L+ ++H+TRT EEA A A+KAGLD++
Sbjct: 345 GVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAGLDIN 404

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQP-FGNLGPRDVCT 238
           CG ++  +   A++ G + E+DV+ AL     ++MRLG FDG+P     +G L   DVCT
Sbjct: 405 CGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAADVCT 464

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           P H+ LAL+AA +G+VLLKN AR LPL      + AVIG N++  + ++GNY G+ C  T
Sbjct: 465 PVHKALALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLPCETT 524

Query: 299 TPLQGISRYAKTIH-QAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TP  GI +Y K+     GC   AC+       A   A+ +D   LVMGL Q  E E +DR
Sbjct: 525 TPFGGIQKYVKSAKFLPGCSSAACD-VAATDQATALAKSSDYVFLVMGLSQKQEQEGLDR 583

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             LLLPG+QQ L++ VA AS+ PV+L+L+ GGPVD++FA+ +P+IGAILW GYPGQAGG 
Sbjct: 584 TSLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQ 643

Query: 418 AIADVLFGRANP 429
           AIADVLFG  NP
Sbjct: 644 AIADVLFGEFNP 655


>gi|115485163|ref|NP_001067725.1| Os11g0297300 [Oryza sativa Japonica Group]
 gi|113644947|dbj|BAF28088.1| Os11g0297300 [Oryza sativa Japonica Group]
          Length = 779

 Score =  488 bits (1255), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 231/432 (53%), Positives = 305/432 (70%), Gaps = 4/432 (0%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKV 59
            YN G A GL  WSPNVNIFRDPRWGRGQETPGEDP    KY A++V+GLQG++ + L+ 
Sbjct: 151 FYNLGQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLTNLQT 210

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           +ACCKH TAYD++ W GV RY+FNA+V+ QDL DTYN PF++CVV+GK + +MC+Y  +N
Sbjct: 211 SACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLIN 270

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           G P CA  D+L  T+ G+W+LDGY  SDCD+V +L+ ++H+TRT EEA A A+KAGLD++
Sbjct: 271 GVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAGLDIN 330

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQP-FGNLGPRDVCT 238
           CG ++  +   A++ G + E+DV+ AL     ++MRLG FDG+P     +G L   DVCT
Sbjct: 331 CGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAADVCT 390

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           P H+ LAL+AA +G+VLLKN AR LPL      + AVIG N++  + ++GNY G+ C  T
Sbjct: 391 PVHKALALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLPCETT 450

Query: 299 TPLQGISRYAKTIH-QAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TP  GI +Y K+     GC   AC+       A   A+ +D   LVMGL Q  E E +DR
Sbjct: 451 TPFGGIQKYVKSAKFLPGCSSAACD-VAATDQATALAKSSDYVFLVMGLSQKQEQEGLDR 509

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             LLLPG+QQ L++ VA AS+ PV+L+L+ GGPVD++FA+ +P+IGAILW GYPGQAGG 
Sbjct: 510 TSLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQ 569

Query: 418 AIADVLFGRANP 429
           AIADVLFG  NP
Sbjct: 570 AIADVLFGEFNP 581


>gi|224093292|ref|XP_002309869.1| predicted protein [Populus trichocarpa]
 gi|222852772|gb|EEE90319.1| predicted protein [Populus trichocarpa]
          Length = 694

 Score =  486 bits (1251), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 238/439 (54%), Positives = 302/439 (68%), Gaps = 25/439 (5%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR---- 56
           MYN G AGLTYWSPN+N+ RDPRWGR  ETPGEDP L G+YA +YVRGLQ   GS     
Sbjct: 89  MYNLGRAGLTYWSPNINVVRDPRWGRAIETPGEDPYLVGRYAVNYVRGLQDVEGSENYTD 148

Query: 57  -----LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASV 111
                LKV++CCKHY AYD+DNW GV+RY F+ARVS+QD+ +T+  PF+ CV +G V+SV
Sbjct: 149 PNSRPLKVSSCCKHYAAYDVDNWKGVERYTFDARVSEQDMVETFLRPFEMCVKDGDVSSV 208

Query: 112 MCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADA 171
           MCSYN+VNG PTCADP +L  TI G W L GYIVSDCDS+ V+     +           
Sbjct: 209 MCSYNRVNGIPTCADPKLLNQTIRGDWDLHGYIVSDCDSLQVMVENHKWL---------- 258

Query: 172 IKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNL 231
              GLDLDCG +   + E AVR G +RE D++ +L +   V MRLG FDG P    FG  
Sbjct: 259 ---GLDLDCGAYYTENVEAAVRQGKVREADIDKSLNFLYVVLMRLGFFDGIPQYNSFGK- 314

Query: 232 GPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA 291
              DVC+  + +LA +AA +G VLLKN   +LPLS  +  T+AVIGP+S+ T  MIGNYA
Sbjct: 315 --NDVCSKENIELATEAAREGAVLLKNENDSLPLSIEKVKTLAVIGPHSNATSAMIGNYA 372

Query: 292 GVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIE 351
           G+ C   TP++G+S+YAK  +Q GC  +AC     I  A  +A++ADAT+++ G+D SIE
Sbjct: 373 GIPCQIITPIEGLSKYAKVDYQMGCSDIACKDESFIFPAMESAKKADATIILAGIDLSIE 432

Query: 352 AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYP 411
           AE +DR  LLLPG Q +L+++VA  S GPVVLVLM  G VD+SFAK++  I +ILWVGYP
Sbjct: 433 AESLDRDDLLLPGYQTQLINQVASVSNGPVVLVLMSAGGVDISFAKSNGDIKSILWVGYP 492

Query: 412 GQAGGAAIADVLFGRANPG 430
           G+ GG AIADV+FG+ NPG
Sbjct: 493 GEEGGNAIADVIFGKYNPG 511


>gi|357138088|ref|XP_003570630.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 1026

 Score =  485 bits (1248), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 238/438 (54%), Positives = 307/438 (70%), Gaps = 9/438 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN------T 53
           +YN G A GLT+W+PN+N+FRDPRWGRGQETPGEDP +TGKYAA +VRG+QG        
Sbjct: 142 IYNSGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQGYGASGAVN 201

Query: 54  GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
            S L+ +ACCKH+TAYDL+NWNGV R+ FNA+VS+QDL DTYN PF++CV +G  + +MC
Sbjct: 202 SSGLEASACCKHFTAYDLENWNGVTRFAFNAKVSEQDLADTYNPPFRSCVEDGGASGIMC 261

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           SYN+VNG PTCAD ++L  T  G WR +GYI SDCD+V ++++ Q Y + PE+A AD +K
Sbjct: 262 SYNRVNGVPTCADHNLLSKTARGDWRFNGYITSDCDAVAIIHDVQGYAKEPEDAVADVLK 321

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AG+D++CG ++  H   A   G + E+D++ AL     ++MRLG+FDG P    +GN+G 
Sbjct: 322 AGMDVNCGDYVQKHGVSAFHQGKITEQDIDRALQNLFAIRMRLGLFDGNPKYNRYGNIGA 381

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
             VC   HQ LAL+AA  GIVLLKN A TLPL   +  ++AVIG N++    + GNY G 
Sbjct: 382 DQVCKKEHQDLALEAAQDGIVLLKNDAGTLPLPKQKISSLAVIGHNANDAQRLQGNYFGP 441

Query: 294 ACGYTTPLQGISRYAK-TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
            C   +PLQ +  Y + T   AGC    CN +  I  A  AA +A+  VL MGLDQ  E 
Sbjct: 442 PCISVSPLQALQGYVRETKFVAGCNAAVCNVSD-IAGAAKAASEAEYVVLFMGLDQDQER 500

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E +DR  L LPG Q+ LV+ VA A++ PVVLVL+CGGPVDV+FAK +P+IGAI+W GYPG
Sbjct: 501 EDLDRIELGLPGMQESLVNAVADAAKKPVVLVLLCGGPVDVTFAKGNPKIGAIIWAGYPG 560

Query: 413 QAGGAAIADVLFGRANPG 430
           QAGG AIA VLFG  NPG
Sbjct: 561 QAGGIAIAQVLFGEHNPG 578


>gi|253761872|ref|XP_002489310.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
 gi|241946958|gb|EES20103.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
          Length = 772

 Score =  484 bits (1247), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 235/432 (54%), Positives = 300/432 (69%), Gaps = 4/432 (0%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-K 58
           ++N G A GLT WSPNVNIFRDPRWGRGQETPGEDP +  +YA ++VRG+QGN+ S L +
Sbjct: 145 LFNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNSSSSLLQ 204

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
            +ACCKH TAYDL++WNGV RY F ARV+ QDLEDT+N PF++CVVEGK + +MC+Y  +
Sbjct: 205 TSACCKHATAYDLEDWNGVARYSFVARVTAQDLEDTFNPPFRSCVVEGKASCIMCAYTAI 264

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           NG P CA+ D+L  T+ G W LDGY+ SDCD+V ++ + Q Y  TPE+A A ++KAGLD+
Sbjct: 265 NGVPACANTDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVAVSLKAGLDI 324

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG ++  H   A++ G L E D++ AL     V+MRLG FDG+P    +G L   D+CT
Sbjct: 325 DCGSYIQQHATAAIQQGKLTELDIDKALVNLFAVRMRLGHFDGDPRKNMYGALSAADICT 384

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           P H+ LAL+AA  GIVLLKN    LPL      + AVIGPNS+  + +I NY G  C  T
Sbjct: 385 PEHRSLALEAAQDGIVLLKNDGGILPLDRSTVTSAAVIGPNSNDGMALIANYFGPPCEST 444

Query: 299 TPLQGISRYAKTIH-QAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TPLQG+  Y   +   AGC   AC+   +   A V +   D   L MGL Q  E+E  DR
Sbjct: 445 TPLQGLQSYVNNVRFLAGCSSAACD-VAVTDQAVVLSGSEDYVFLFMGLSQQQESEGKDR 503

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             LLLPG QQ L++ VA AS+ PV+LVL+ GGPVD++FA+++P+IGAILW GYPGQAGG 
Sbjct: 504 TSLLLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPKIGAILWAGYPGQAGGL 563

Query: 418 AIADVLFGRANP 429
           AIA VLFG  NP
Sbjct: 564 AIAKVLFGDHNP 575


>gi|125576920|gb|EAZ18142.1| hypothetical protein OsJ_33692 [Oryza sativa Japonica Group]
          Length = 618

 Score =  483 bits (1243), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 225/420 (53%), Positives = 299/420 (71%), Gaps = 3/420 (0%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL 71
           WSPNVNIFRDPRWGRGQETPGEDP    KY A++V+GLQG++ + L+ +ACCKH TAYD+
Sbjct: 2   WSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLTNLQTSACCKHITAYDI 61

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
           + W GV RY+FNA+V+ QDL DTYN PF++CVV+GK + +MC+Y  +NG P CA  D+L 
Sbjct: 62  EEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLINGVPACASSDLLT 121

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEGA 191
            T+ G+W+LDGY  SDCD+V +L+ ++H+TRT EEA A A+KAGLD++CG ++  +   A
Sbjct: 122 KTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAGLDINCGVYMQQNAASA 181

Query: 192 VRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQP-FGNLGPRDVCTPAHQQLALQAAH 250
           ++ G + E+DV+ AL     ++MRLG FDG+P     +G L   DVCTP H+ LAL+AA 
Sbjct: 182 LQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAADVCTPVHKALALEAAR 241

Query: 251 QGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRYAKT 310
           +G+VLLKN AR LPL      + AVIG N++  + ++GNY G+ C  TTP  GI +Y K+
Sbjct: 242 RGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLPCETTTPFGGIQKYVKS 301

Query: 311 IH-QAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQEL 369
                GC   AC+       A   A+ +D   LVMGL Q  E E +DR  LLLPG+QQ L
Sbjct: 302 AKFLPGCSSAACD-VAATDQATALAKSSDYVFLVMGLSQKQEQEGLDRTSLLLPGKQQAL 360

Query: 370 VSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++ VA AS+ PV+L+L+ GGPVD++FA+ +P+IGAILW GYPGQAGG AIADVLFG  NP
Sbjct: 361 ITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQAIADVLFGEFNP 420


>gi|255557375|ref|XP_002519718.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223541135|gb|EEF42691.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 802

 Score =  483 bits (1242), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 239/440 (54%), Positives = 309/440 (70%), Gaps = 15/440 (3%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR---- 56
           MYN G AGLTYWSPNVN+ RDPRWGR  ETPGEDP + G+YA +YVRGLQ   G+     
Sbjct: 167 MYNLGRAGLTYWSPNVNVVRDPRWGRTVETPGEDPYVVGRYAVNYVRGLQDVEGTENYTD 226

Query: 57  -----LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASV 111
                LKV++CCKHY AYD++ W GV+R  F+ARV++QD+ +T+  PF+ CV EG V+SV
Sbjct: 227 LNTRPLKVSSCCKHYAAYDVEKWQGVERLTFDARVTEQDMVETFLRPFEMCVKEGDVSSV 286

Query: 112 MCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYT-RTPEEAAAD 170
           MCS+N+VNG PTCADP +L  TI G W L GYIVSDCDS+ V+ +   +   T E+A A 
Sbjct: 287 MCSFNRVNGIPTCADPKLLNQTIRGDWDLHGYIVSDCDSIEVMVDNHKFLGDTNEDAVAQ 346

Query: 171 AIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGN 230
            +KAGLDLDCG +    TE +V+ G  REE ++ +L Y   V MRLG FDG P  Q    
Sbjct: 347 VLKAGLDLDCGGYYTNFTETSVKQGKAREEYIDRSLKYLYVVLMRLGFFDGTPQYQ---K 403

Query: 231 LGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY 290
           LG +D+CT  + +LA QAA +GIVLLKN+  TLPLS  +   +AV+GP+++ T  MIGNY
Sbjct: 404 LGKKDICTKENVELAKQAAREGIVLLKNN-DTLPLSMDKVKNLAVVGPHANATRVMIGNY 462

Query: 291 AGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSI 350
           AGV C Y +P+ G S Y+   ++ GC  V C    L+  A  AA+ ADAT++V GLD +I
Sbjct: 463 AGVPCRYVSPIDGFSIYSNVTYEIGC-DVPCKNESLVFPAVHAAKNADATIIVAGLDLTI 521

Query: 351 EAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGY 410
           EAE +DR  LLLPG Q +L+++VA A+ GPV+LV+M  G VD+SFA+++ +I AILWVGY
Sbjct: 522 EAEGLDRNDLLLPGYQTQLINQVAGAANGPVILVIMAAGGVDISFARDNEKIKAILWVGY 581

Query: 411 PGQAGGAAIADVLFGRANPG 430
           PGQ GG AIADV+FG+ NPG
Sbjct: 582 PGQEGGHAIADVVFGKYNPG 601


>gi|357485313|ref|XP_003612944.1| Beta-D-xylosidase [Medicago truncatula]
 gi|355514279|gb|AES95902.1| Beta-D-xylosidase [Medicago truncatula]
          Length = 783

 Score =  481 bits (1239), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 227/442 (51%), Positives = 308/442 (69%), Gaps = 15/442 (3%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR---- 56
           M+N G AGL++W+PNVN+FRDPRWGRGQETPGEDP++   YA  +VRG+QG  G +    
Sbjct: 142 MFNVGQAGLSFWAPNVNVFRDPRWGRGQETPGEDPMVGSAYAVEFVRGIQGVDGIKKVLN 201

Query: 57  --------LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKV 108
                   L V+ACCKH+TAYDL+ W    RY+FNA V++QDLEDTY  PF+ CV +GK 
Sbjct: 202 DHDSDDDGLMVSACCKHFTAYDLEKWGEFSRYNFNAVVTQQDLEDTYQPPFRGCVQQGKA 261

Query: 109 ASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAA 168
           + +MCSYN+VNG P CA  D+L   +  +W  +GYI SDCD+V  ++  Q Y ++ E+A 
Sbjct: 262 SCLMCSYNEVNGVPACASKDLL-GLVRNKWGFEGYIASDCDAVATVFEYQKYAKSAEDAV 320

Query: 169 ADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF 228
           AD +KAG+D++CG F+  HTE A+  GL++EED++ AL    +VQMRLG+F+G+P    F
Sbjct: 321 ADVLKAGMDINCGTFMLRHTESAIEQGLVKEEDLDRALFNLFSVQMRLGLFNGDPEKGKF 380

Query: 229 GNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIG 288
           G LGP+DVCTP H++LAL+AA QGIVLLKN  + LPL      ++A+IGP +  T  + G
Sbjct: 381 GKLGPQDVCTPEHKKLALEAARQGIVLLKNDNKFLPLDKKDRVSLAIIGPMA-TTSELGG 439

Query: 289 NYAGVACGYTTPLQGISRYAKTIHQA-GCFGVACNGNQLIGAAEVAARQADATVLVMGLD 347
            Y+G+ C   +   G+  Y KTI  A GC  V C+ +     A   A+QAD  V+V GLD
Sbjct: 440 GYSGIPCSPRSLYDGLKEYVKTISYAFGCSDVKCDSDDGFAVAIDIAKQADFVVIVAGLD 499

Query: 348 QSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILW 407
            ++E E +DR  LLLPG+Q +LVSRVA AS+ PV+LVL  GGP+DVSFA+++  I +ILW
Sbjct: 500 TTLETEDLDRVSLLLPGKQMDLVSRVAAASKRPVILVLTGGGPLDVSFAESNQLITSILW 559

Query: 408 VGYPGQAGGAAIADVLFGRANP 429
           +GYPG+AGG A+A+++FG  NP
Sbjct: 560 IGYPGEAGGKALAEIIFGEFNP 581


>gi|212275712|ref|NP_001130324.1| uncharacterized protein LOC100191418 precursor [Zea mays]
 gi|194688848|gb|ACF78508.1| unknown [Zea mays]
 gi|413938927|gb|AFW73478.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 780

 Score =  481 bits (1239), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 239/440 (54%), Positives = 308/440 (70%), Gaps = 11/440 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN------T 53
           +YN G A GLT+W+PN+N+FRDPRWGRGQETPGEDP +TGKYAA +VRG+QG        
Sbjct: 147 VYNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQGYGLAGPVN 206

Query: 54  GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
            + L+ +ACCKH+TAYDL+NW GV RY F+A+V+ QDL DTYN PFK+CV +G  + +MC
Sbjct: 207 STGLEASACCKHFTAYDLENWKGVTRYVFDAKVTAQDLADTYNPPFKSCVEDGHASGIMC 266

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           SYN+VNG PTCAD ++L  T    W   GYI SDCD+V ++++ Q Y +T E+A AD +K
Sbjct: 267 SYNRVNGVPTCADYNLLSTTARQDWGFYGYITSDCDAVAIIHDAQGYAKTAEDAVADVLK 326

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AG+D++CG ++  H   A++ G + E+D+N AL     V+MRLG+F+G+P    +G++GP
Sbjct: 327 AGMDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGDIGP 386

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNS--ARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA 291
             VCT  HQ LAL+AA  GIVLLKN   A  LPLS     ++AVIG N++  + + GNY 
Sbjct: 387 DQVCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRGNYF 446

Query: 292 GVACGYTTPLQGISRYAK-TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSI 350
           G  C   TPLQ +  Y K T   AGC   ACN   +  A + AA  AD+ VL MGLDQ  
Sbjct: 447 GPPCVTVTPLQVLQGYVKDTSFVAGCNSAACNVTTIPEAVQ-AASSADSVVLFMGLDQDQ 505

Query: 351 EAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGY 410
           E E +DR  L LPG+QQ L+  VA A++ PV+LVL+CGGPVDVSFAK +P+IGAILW GY
Sbjct: 506 EREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGY 565

Query: 411 PGQAGGAAIADVLFGRANPG 430
           PG+AGG AIA VLFG  NPG
Sbjct: 566 PGEAGGIAIAQVLFGEHNPG 585


>gi|371917284|dbj|BAL44718.1| SlArf/Xyl3 [Solanum lycopersicum]
          Length = 777

 Score =  480 bits (1235), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 224/439 (51%), Positives = 312/439 (71%), Gaps = 10/439 (2%)

Query: 1   MYNGG-MAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT--GSRL 57
           +YN G + G+T W+PN+NI RDPRWGRGQETPGEDP++ GKY  +YVRGLQG++  G +L
Sbjct: 146 VYNAGQLKGITLWAPNINILRDPRWGRGQETPGEDPMMVGKYGVAYVRGLQGDSFEGGKL 205

Query: 58  K-----VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVM 112
           K      +ACCKH+ A D+DNW+   RY F+A+V KQDL D+Y  PFK CV +GK +SVM
Sbjct: 206 KDGHLQTSACCKHFIAQDMDNWHNFSRYTFDAQVLKQDLADSYEPPFKDCVEQGKASSVM 265

Query: 113 CSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAI 172
           C+YN VNG P CA+ D+L  T  G+W L GYIVSDCD+V  +Y+ QHY + PE+A A  +
Sbjct: 266 CAYNLVNGIPNCANFDLLTTTARGKWGLQGYIVSDCDAVDKMYSEQHYAKEPEDAVAATL 325

Query: 173 KAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLG 232
           KAG+D++CG  L  +T+ A+    ++E D++ AL    +V+MRLG+F+G+PS   +G++ 
Sbjct: 326 KAGMDVNCGSHLKTYTKSALEKQKVKESDIDRALHNLFSVRMRLGLFNGDPSKLEYGDIS 385

Query: 233 PRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAG 292
             +VC+  H+ LA++AA  G VLLKNS R LPLS ++  ++AVIGP ++ +  ++GNY G
Sbjct: 386 AAEVCSEEHRALAVEAARSGSVLLKNSNRLLPLSKMKTASLAVIGPKANDSEVLLGNYEG 445

Query: 293 VACGYTTPLQGISRY-AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIE 351
            +C   T  QG+  Y A T++  GC  + C    +  A  + A++AD  VLVMGLDQ++E
Sbjct: 446 FSCKNVTLFQGLQGYVANTMYHPGCDFINCTSPAIDEAVNI-AKKADYVVLVMGLDQTLE 504

Query: 352 AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYP 411
            E  DR  L LPG Q++L++ +A+A+  PV+LVLMCGGPVDV+FAK++P+IG ILWVGYP
Sbjct: 505 REKFDRTELGLPGMQEKLITSIAEAASKPVILVLMCGGPVDVTFAKDNPKIGGILWVGYP 564

Query: 412 GQAGGAAIADVLFGRANPG 430
           G+ G AA+A +LFG  NPG
Sbjct: 565 GEGGAAALAQILFGEHNPG 583


>gi|296084630|emb|CBI25718.3| unnamed protein product [Vitis vinifera]
          Length = 768

 Score =  478 bits (1231), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 228/432 (52%), Positives = 299/432 (69%), Gaps = 4/432 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKV 59
           MYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++   YA  +VRG QG++ G  L +
Sbjct: 142 MYNVGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVASAYAVEFVRGFQGDSDGDGLML 201

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           +ACCKH TAYDL+ W    RY F+A VS QDLEDTY  PF++CV +GK + +MCSYN+VN
Sbjct: 202 SACCKHLTAYDLEKWGNFSRYSFDAVVSNQDLEDTYQPPFRSCVQQGKASCLMCSYNRVN 261

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           G P CA  D+ +     +W   GYI SDCD+V  +Y  QHY  +PE+A AD +KAG D++
Sbjct: 262 GVPACARQDLFQKA-KTEWGFKGYITSDCDAVATVYEYQHYANSPEDAVADVLKAGTDIN 320

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG ++  HT+ A+  G ++EED++ AL    +VQMRLG+FDG+P+   +GNLGP+DVCT 
Sbjct: 321 CGSYMLRHTQSAIDQGKVKEEDIDRALFNLFSVQMRLGLFDGDPANGLYGNLGPKDVCTK 380

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
            H+ LAL+AA QGIVLLKN  + LPL   R  ++A+IGP +D    + G Y G+ C   +
Sbjct: 381 EHRTLALEAARQGIVLLKNDKKFLPLDKSRISSLAIIGPQADQPF-LGGGYTGIPCKPES 439

Query: 300 PLQGISRYA-KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
            ++G+  Y  KT   AGC  V C  +     A   AR+AD  V+V GLD S E E  DR 
Sbjct: 440 LVEGLKTYVEKTSFAAGCVDVPCLSDTGFDEAVSIARKADIVVVVAGLDLSQETEDHDRV 499

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            LLLPG+Q  L+S VA A + P+VLVL  GGP+DVSFA+ DPRI +ILW+GYPG+AG  A
Sbjct: 500 SLLLPGKQMALISSVASAIQKPLVLVLTGGGPLDVSFAEQDPRIASILWIGYPGEAGAKA 559

Query: 419 IADVLFGRANPG 430
           +A+++FG  NPG
Sbjct: 560 LAEIIFGDFNPG 571


>gi|357152329|ref|XP_003576084.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 779

 Score =  478 bits (1231), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 237/432 (54%), Positives = 300/432 (69%), Gaps = 4/432 (0%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKV 59
           +YN G A GLT WSPNVNI+RDPRWGRGQETPGEDP    KYA ++V+GLQG + + L+ 
Sbjct: 149 LYNLGQAQGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQGTSATTLQT 208

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           +ACCKH TAYDL++WNGV RY+FNA+V+ QDL DT+N PFK+CV EGK   VMC+Y  +N
Sbjct: 209 SACCKHATAYDLEDWNGVVRYNFNAKVTLQDLADTFNPPFKSCVEEGKATCVMCAYTNIN 268

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           G P CA  D++  T  G W L+GY+ SDCD+V +L + Q Y  TPE+  A A+KAGLDL+
Sbjct: 269 GVPACASSDLITKTFKGDWGLNGYVSSDCDAVALLRDAQRYRATPEDTVAVALKAGLDLN 328

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEP-SAQPFGNLGPRDVCT 238
           CG +  +H   A++ G + E+DV+ AL     V+MRLG FDG+P ++  +G+LG  DVC+
Sbjct: 329 CGNYTQVHGMSALQQGKMTEQDVDNALKNLFAVRMRLGHFDGDPRTSALYGSLGAADVCS 388

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PAH+ LAL+AA  GIVLLKN A  LPL      + A IG N++    + GNY G  C  T
Sbjct: 389 PAHKNLALEAAQSGIVLLKNDAGILPLDPSAVASAAAIGHNANDPAALNGNYFGPPCETT 448

Query: 299 TPLQGISRYAKTIH-QAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TPLQG+  Y K +   AGC   AC G    G A   A  +D  +L MGL Q  E E IDR
Sbjct: 449 TPLQGLQGYVKNVKFLAGCDSAAC-GFAATGQAVTLASSSDYVILFMGLSQKEEQEGIDR 507

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             LLLPG+QQ L++ VA AS+ PV+LVL+ GG VD++FAK++P+IGAILW GYPGQAGG 
Sbjct: 508 TSLLLPGKQQNLITAVASASKRPVILVLLTGGSVDITFAKSNPKIGAILWAGYPGQAGGL 567

Query: 418 AIADVLFGRANP 429
           AIA VLFG  NP
Sbjct: 568 AIARVLFGDHNP 579


>gi|358349509|ref|XP_003638778.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355504713|gb|AES85916.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 776

 Score =  478 bits (1230), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 230/439 (52%), Positives = 310/439 (70%), Gaps = 10/439 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS---- 55
           +YNGG A G+T+W+PN+NIFRDPRWGRGQET GEDP++T  YA SYVRGLQG++      
Sbjct: 144 IYNGGQAMGMTFWAPNINIFRDPRWGRGQETAGEDPMMTSNYAVSYVRGLQGDSFQGGKL 203

Query: 56  --RLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
              L+ +ACCKH+TAYDLDNW GV+R+HF+ARVS QDL DTY  PF++C+ +G+ + +MC
Sbjct: 204 RGHLQASACCKHFTAYDLDNWKGVNRFHFDARVSLQDLADTYQPPFRSCIEQGRASGIMC 263

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           +YN+VNG P+CAD ++L NT+  QW   GYIVSDC +VG++++ Q Y ++ E+A AD + 
Sbjct: 264 AYNRVNGIPSCADFNLLTNTVRKQWEFHGYIVSDCGAVGIIHDEQGYAKSAEDAVADVLH 323

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AG+DL+CG +L  H + AV+   L    ++ AL    ++++RLG FDG P+  PFG +GP
Sbjct: 324 AGMDLECGSYLTDHAKSAVQQKKLPIVRIDRALHNLFSIRIRLGQFDGNPAKLPFGMIGP 383

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVT-VTMIGNYAG 292
             VC+  H  LAL+AA  GIVLLKN+A  LPL      ++AVIGPN++ + +T++GNYAG
Sbjct: 384 NHVCSENHLYLALEAARNGIVLLKNTASLLPLPKTS-ISLAVIGPNANASPLTLLGNYAG 442

Query: 293 VACGYTTPLQGISRYAKT-IHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIE 351
             C   T LQG   Y K  +   GC G     +  I  A   A+ AD  VLVMGLDQS+E
Sbjct: 443 PPCKSITILQGFQHYVKNAVFHPGCDGGPKCASAPIDKAVKVAKNADYVVLVMGLDQSVE 502

Query: 352 AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYP 411
            E  DR  L LPG+Q EL++ VAKAS+ PV+LVL+CGGP+D+S AKN+ +IG I+W GYP
Sbjct: 503 REERDRVHLDLPGKQLELINSVAKASKRPVILVLLCGGPIDISSAKNNDKIGGIIWAGYP 562

Query: 412 GQAGGAAIADVLFGRANPG 430
           G+ GG A+A ++FG  NPG
Sbjct: 563 GELGGIALAQIIFGDHNPG 581


>gi|242076578|ref|XP_002448225.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
 gi|241939408|gb|EES12553.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
          Length = 766

 Score =  478 bits (1229), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 227/435 (52%), Positives = 303/435 (69%), Gaps = 8/435 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS--RLK 58
           M+N G AGLTYW+PN+NIFRDPRWGRGQET GEDP +   Y+  YV+G QG  G   R++
Sbjct: 137 MHNAGQAGLTYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEQGEEGRIR 196

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           ++ACCKHYTAYD++ W G  RY FNA+V+ QDLEDTY  PFK C+ E + + +MC+YNQV
Sbjct: 197 LSACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQV 256

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           NG P CA+ D+L+ T   +W   GYI SDCD+V +++  Q YT++ E++ A  +KAG+D+
Sbjct: 257 NGVPMCANKDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTYTKSDEDSIAIVLKAGMDI 315

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG FL  HT+ AV  G ++E+D++ AL    +VQ+RLG+FD   + Q    LGP +VCT
Sbjct: 316 NCGSFLVRHTKSAVEKGKVQEQDIDRALFNLFSVQLRLGIFDKPNNNQWSTQLGPNNVCT 375

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPL--STLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
             H++LA +A  QG VLLKN    LPL  S +RH  VA+IGP+++    M G+Y GVAC 
Sbjct: 376 KEHRELAAEAVRQGAVLLKNDHSFLPLKRSEVRH--VAIIGPSANDVYAMGGDYTGVACN 433

Query: 297 YTTPLQGISRYA-KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI 355
            TT L+GI  YA +T   AGC  V+CN  +L G A  AA++AD  V+V GL+ + E E  
Sbjct: 434 PTTFLKGIQAYATQTTFAAGCKDVSCNSTELFGEAIAAAKRADIVVVVAGLNLTEEREDF 493

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  LLLPG+Q  L+  VA  ++ P+VLVL+ GGPVDVSFAK DPRI +ILW+GYPG+ G
Sbjct: 494 DRVSLLLPGKQMSLIHAVASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVG 553

Query: 416 GAAIADVLFGRANPG 430
           G  + ++LFG  NPG
Sbjct: 554 GQVLPEILFGEYNPG 568


>gi|326488213|dbj|BAJ89945.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 525

 Score =  477 bits (1228), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 237/376 (63%), Positives = 286/376 (76%), Gaps = 7/376 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ-----GNTGS 55
           M+N G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP+LT KYA  YV GLQ     G T  
Sbjct: 151 MHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQDAGAGGVTDG 210

Query: 56  RLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSY 115
            LKVAACCKHYTAYD+DNW GV+RY F+A+VS+QDL+DT+  PFK+CV++G VASVMCSY
Sbjct: 211 ALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSY 270

Query: 116 NQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAG 175
           N+VNGKPTCAD D+L+  I G W+L+GYIVSDCDSV VLY  QHYT+TPEEAAA  IK+G
Sbjct: 271 NKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIKSG 330

Query: 176 LDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD 235
           LDL+CG FLA HT  AV+ G L EEDV+ A+     + MRLG FDG+P    FG+LGP+D
Sbjct: 331 LDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKD 390

Query: 236 VCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC 295
           VCT ++++LA + A QGIVLLKNS   LPLS     ++AVIGPN++ + TMIGNY G  C
Sbjct: 391 VCTSSNRELARETARQGIVLLKNSG-ALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPC 449

Query: 296 GYTTPLQGISRYAKTIHQAGCFGVACNGNQL-IGAAEVAARQADATVLVMGLDQSIEAEF 354
            YTTPLQG+     T++Q GC  V C+GN L +  A  AA  AD TVLV+G DQSIE E 
Sbjct: 450 KYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERES 509

Query: 355 IDRAGLLLPGRQQELV 370
           +DR  LLLPG+Q +LV
Sbjct: 510 LDRTSLLLPGQQTQLV 525


>gi|357156904|ref|XP_003577615.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 767

 Score =  477 bits (1228), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 235/431 (54%), Positives = 303/431 (70%), Gaps = 3/431 (0%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKV 59
           +YN G A GLT WSPNVNI+RDPRWGRGQETPGEDP    +YA ++VRG+QGN+ S L+ 
Sbjct: 138 LYNLGQAEGLTMWSPNVNIYRDPRWGRGQETPGEDPTTASRYAVAFVRGMQGNSTSLLQA 197

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           +ACCKH TAYDL++WNGV RY+F+A+V+ QDLEDT+N PF++CVV+GK + VMC+Y  +N
Sbjct: 198 SACCKHATAYDLEDWNGVARYNFDAKVTAQDLEDTFNPPFRSCVVDGKASCVMCAYTGIN 257

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           G P CA+ D+L  T+ G W LDGY  SDCD+V ++ + Q Y ++PE+A A A+KAGLD+D
Sbjct: 258 GVPACANADLLTKTVRGDWGLDGYTASDCDAVAIMRDAQRYAQSPEDAVALALKAGLDID 317

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG ++  H   A++ G + EED++ AL     ++MRLG FDG+P    +G LG  D+CT 
Sbjct: 318 CGTYMQQHAAAAIQQGKITEEDIDKALKNLFAIRMRLGHFDGDPRTNMYGGLGAADICTA 377

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
            H+ LAL AA  GIVLLKN A  LPL      + AVIGPN++    +I NY G  C  TT
Sbjct: 378 EHRSLALDAAQDGIVLLKNDAGILPLDRAAVASTAVIGPNANNPGALIANYFGPPCESTT 437

Query: 300 PLQGISRYAKTIH-QAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           PL+GI  Y K     AGC   AC+      AA +A+  +D   L MGL Q  E+E  DR 
Sbjct: 438 PLKGIQGYVKDARFLAGCSSTACDVATTDQAAALAS-TSDYVFLFMGLGQRQESEGRDRT 496

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            LLLPG+QQ L++ VA A++ PV+LVL+ GGPVDV+FA+ +P+IGAILW GYPGQAGG A
Sbjct: 497 SLLLPGKQQSLITAVADAAQRPVILVLLSGGPVDVTFAQTNPKIGAILWAGYPGQAGGLA 556

Query: 419 IADVLFGRANP 429
           IA VLFG  NP
Sbjct: 557 IARVLFGDHNP 567


>gi|371917286|dbj|BAL44719.1| SlArf/Xyl4 [Solanum lycopersicum]
          Length = 775

 Score =  476 bits (1226), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 219/439 (49%), Positives = 310/439 (70%), Gaps = 10/439 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS---- 55
           +YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++TGKYA  YVRG+QG++ +    
Sbjct: 143 VYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPIMTGKYAIRYVRGVQGDSFNGGQL 202

Query: 56  ---RLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVM 112
               L+ +ACCKH+TAYDLD W  +DR+ FNA V+ QD+ DT+  PF+ C+ + + + +M
Sbjct: 203 KKGHLQASACCKHFTAYDLDQWKNLDRFSFNAIVTPQDMADTFQPPFQDCIQKAQASGIM 262

Query: 113 CSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAI 172
           CSYN VNG P+CA+ ++L  T   QW   GYI SDCD+V V+++   Y  TPE++ A A+
Sbjct: 263 CSYNSVNGIPSCANYNLLTKTARQQWGFHGYITSDCDAVQVMHDNHRYGNTPEDSTAFAL 322

Query: 173 KAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLG 232
           KAG+D+DCG +L  +T+ AV    + +  ++ AL    +++MRLG+F+G+P  Q +GN+ 
Sbjct: 323 KAGMDIDCGDYLKKYTKSAVMKKKVSQVHIDRALHNLFSIRMRLGLFNGDPRKQLYGNIS 382

Query: 233 PRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAG 292
           P  VC P HQQLAL+AA  GIVLLKN+ + LPLS  + +++AVIG N++    + GNY G
Sbjct: 383 PSQVCAPQHQQLALEAARNGIVLLKNTGKLLPLSKAKTNSLAVIGHNANNAYILRGNYDG 442

Query: 293 VACGYTTPLQGISRYAKTI-HQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIE 351
             C Y   L+ +  YAK++ +Q GC    C    +  A  + AR AD  VL+MGLDQ+ E
Sbjct: 443 PPCKYIEILKALVGYAKSVQYQQGCNAANCTSANIDQAVNI-ARNADYVVLIMGLDQTQE 501

Query: 352 AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYP 411
            E  DR  L+LPG+Q+ L++ VAKA++ PV+LV++ GGPVD+SFAK +P+IG+ILW GYP
Sbjct: 502 REQFDRDDLVLPGQQENLINSVAKAAKKPVILVILSGGPVDISFAKYNPKIGSILWAGYP 561

Query: 412 GQAGGAAIADVLFGRANPG 430
           G+AGG A+A+++FG  NPG
Sbjct: 562 GEAGGIALAEIIFGEHNPG 580


>gi|326517420|dbj|BAK00077.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 781

 Score =  476 bits (1224), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 236/434 (54%), Positives = 302/434 (69%), Gaps = 5/434 (1%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKV 59
           +YN G A GLT WSPNVNI+RDPRWGRGQETPGEDP    +Y  ++V+GLQGN+ S   +
Sbjct: 153 LYNVGQAEGLTMWSPNVNIYRDPRWGRGQETPGEDPTTASRYGVAFVKGLQGNSTSSSLL 212

Query: 60  A--ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
              ACCKH TAYDL++W GV RY+F+ARV+ QDLEDTYN PF++CVV+GK + VMC+Y  
Sbjct: 213 QTSACCKHATAYDLEDWGGVARYNFDARVTAQDLEDTYNPPFRSCVVDGKASCVMCAYTA 272

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           +NG P CA+  +L NT+   W LDGY+ SDCD+V ++ + Q Y  TPE+A A A+KAGLD
Sbjct: 273 INGVPACANSGLLTNTVRADWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVALALKAGLD 332

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           +DCG ++  H   A++ G + E+DV+ AL     ++MRLG FDG+P A  +G L    +C
Sbjct: 333 IDCGTYMQQHAPAALQQGKITEDDVDKALKNLFAIRMRLGHFDGDPRANIYGGLNAAHIC 392

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           TP H+ LAL+AA  GIVLLKN A  LPL      + AVIGPN++    +IGNY G  C  
Sbjct: 393 TPEHRSLALEAAQDGIVLLKNDAGILPLDRAAIASAAVIGPNANNPGLLIGNYFGPPCES 452

Query: 298 TTPLQGISRYAKTIH-QAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFID 356
            TPL+G+  Y K +   AGC   AC+      AA +A   +D  +L MGL Q  E+E  D
Sbjct: 453 VTPLKGVQGYVKDVRFMAGCGSAACDVADTDQAATLAG-SSDYVLLFMGLSQQQESEGRD 511

Query: 357 RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
           R  LLLPG+QQ L++ VA A++ PV+LVL+ GGPVDV+FAKN+P+IGAILW GYPGQAGG
Sbjct: 512 RTSLLLPGQQQSLITAVADAAKRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQAGG 571

Query: 417 AAIADVLFGRANPG 430
            AIA VLFG  NPG
Sbjct: 572 LAIARVLFGDHNPG 585


>gi|413925162|gb|AFW65094.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 774

 Score =  476 bits (1224), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 234/432 (54%), Positives = 302/432 (69%), Gaps = 4/432 (0%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-K 58
           ++N G A GLT WSPNVNIFRDPRWGRGQETPGEDP +  +YA ++VRG+QGN+ S L +
Sbjct: 146 LFNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNSSSSLLQ 205

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
            +ACCKH TAYDL++WNGV RY F ARV++QDLEDT+N PF++CVVE K + VMC+Y  +
Sbjct: 206 TSACCKHATAYDLEDWNGVARYSFVARVTEQDLEDTFNPPFRSCVVEAKASCVMCAYTAI 265

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           NG P CA+ D+L  T+ G W LDGY+ SDCD+V ++ + Q Y  TPE+A A ++KAGLD+
Sbjct: 266 NGVPACANSDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVAVSLKAGLDI 325

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG ++  H   A++ G L E+D++ AL     V+MRLG FDG+P    +G LG  D+CT
Sbjct: 326 DCGSYVQQHAAAAIQQGKLTEQDIDKALTNLYAVRMRLGHFDGDPRKNMYGVLGAADICT 385

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           P H+ LAL+AA  GIVLLKN    LPL      + AVIGPN++  + +I NY G  C  T
Sbjct: 386 PEHRNLALEAAQDGIVLLKNDGGILPLDRSTVTSAAVIGPNANDGMALIANYFGPPCEST 445

Query: 299 TPLQGISRYAKTIH-QAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TPL+G+  Y   +   AGC   AC+      A  +A  + D   L MGL Q  E+E  DR
Sbjct: 446 TPLKGLQSYVNDVRFLAGCNSAACDVAATDQAVALAGSE-DYVFLFMGLSQKQESEGKDR 504

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             LLLPG QQ L++ VA AS+ PV+LVL+ GGPVD++FA+++P+IGAILW GYPGQAGG 
Sbjct: 505 TSLLLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPKIGAILWAGYPGQAGGL 564

Query: 418 AIADVLFGRANP 429
           AIA VLFG  NP
Sbjct: 565 AIAKVLFGDHNP 576


>gi|413925164|gb|AFW65096.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 829

 Score =  475 bits (1223), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 236/437 (54%), Positives = 303/437 (69%), Gaps = 9/437 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR--- 56
            YN G A GLT WSPNVNIFRDPRWGRGQETPGEDP +  +YAA++VRGLQG++ +    
Sbjct: 199 FYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQGSSSNTKSV 258

Query: 57  ---LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
              L  +ACCKH TAYDL++W GV RY F A V+ QDL DT+N PF++CVV+GK + VMC
Sbjct: 259 PPVLLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMC 318

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           +Y  VNG P+CA+ D+L  T  G W LDGY+ +DCD+V ++ N+Q Y  T E+  A  +K
Sbjct: 319 AYTSVNGVPSCANADLLTKTFRGSWGLDGYVAADCDAVSIMRNSQFYRPTAEDTVATTLK 378

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AGLD+DCGP++  H   A++ G L ++DV+ A+    T +MRLG FDG+P A  +GNLG 
Sbjct: 379 AGLDIDCGPYVQQHAMAAIQKGKLTQQDVDKAVKNLFTTRMRLGHFDGDPKAHVYGNLGA 438

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
             +CT  H+ LAL+AA  GIVLLKNSA  LPL      + AVIG N++  + ++GNY G 
Sbjct: 439 AHICTQEHKNLALEAALDGIVLLKNSAGVLPLKRGSVASAAVIGHNANDVLALLGNYWGP 498

Query: 294 ACGYTTPLQGISRYAKTIH-QAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
            C  TTPLQGI  Y K +   AGC   ACN      AA +A+  +D+ +L MGL Q  E+
Sbjct: 499 PCAPTTPLQGIQGYVKNVRFLAGCHKAACNVAATPQAAALAS-TSDSVILFMGLSQEQES 557

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E  DR  LLLPG QQ L++ VA A++ PV+LVL+ GGPVD++FA+ +P+IGAILW GYPG
Sbjct: 558 EGKDRTTLLLPGNQQSLITAVANAAKRPVILVLLTGGPVDITFAQANPKIGAILWAGYPG 617

Query: 413 QAGGAAIADVLFGRANP 429
           QAGG AIA VLFG  NP
Sbjct: 618 QAGGLAIAKVLFGEKNP 634


>gi|356531391|ref|XP_003534261.1| PREDICTED: probable beta-D-xylosidase 6-like [Glycine max]
          Length = 780

 Score =  475 bits (1223), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 227/440 (51%), Positives = 305/440 (69%), Gaps = 13/440 (2%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR---- 56
           M+N G AGLT+W+PN+N+FRDPRWGRGQETPGEDP+L   YA  YVRGLQG +G +    
Sbjct: 141 MFNVGQAGLTFWAPNINLFRDPRWGRGQETPGEDPMLASAYAVEYVRGLQGLSGIQDAVV 200

Query: 57  ------LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVAS 110
                 L V+ACCKH+TAYDLD W    RY+FNA VS+QDLEDTY  PF++C+ +GK + 
Sbjct: 201 VDDDDTLMVSACCKHFTAYDLDMWGQFSRYNFNAVVSQQDLEDTYQPPFRSCIQQGKASC 260

Query: 111 VMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAAD 170
           +MCSYN+VNG P CA  ++L      +W   GYI SDCD+V  +Y  Q Y ++ E+A AD
Sbjct: 261 LMCSYNEVNGVPACASEELL-GLARDKWGFKGYITSDCDAVATVYEYQKYAKSQEDAVAD 319

Query: 171 AIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGN 230
            +KAG+D++CG F+  HTE A+  G ++EED++ AL    +VQ+RLG+FDG+P    FG 
Sbjct: 320 VLKAGMDINCGTFMLRHTESAIEQGKVKEEDLDRALLNLFSVQLRLGLFDGDPIRGRFGK 379

Query: 231 LGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY 290
           LGP+DVCT  H+ LAL AA QGIVLLKN  + LPL      ++AVIGP +  T  + G Y
Sbjct: 380 LGPKDVCTQEHKTLALDAARQGIVLLKNDKKFLPLDRDIGASLAVIGPLA-TTTKLGGGY 438

Query: 291 AGVACGYTTPLQGISRYAKTIHQA-GCFGVACNGNQLIGAAEVAARQADATVLVMGLDQS 349
           +G+ C  ++  +G+  +A+ I  A GC+ V C+ +     A   A+QAD  V+V GLD +
Sbjct: 439 SGIPCSSSSLYEGLGEFAERISYAFGCYDVPCDSDDGFAEAIDTAKQADFVVIVAGLDAT 498

Query: 350 IEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVG 409
            E E  DR  LLLPG+Q  LVS VA AS+ PV+LVL+ GGP+DVSFA+ +P+I +I+W+G
Sbjct: 499 QETEDHDRVSLLLPGKQMNLVSSVADASKNPVILVLIGGGPLDVSFAEKNPQIASIIWLG 558

Query: 410 YPGQAGGAAIADVLFGRANP 429
           YPG+AGG A+A+++FG  NP
Sbjct: 559 YPGEAGGKALAEIIFGEFNP 578


>gi|253761874|ref|XP_002489311.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
 gi|241946959|gb|EES20104.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
          Length = 791

 Score =  474 bits (1220), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 233/437 (53%), Positives = 303/437 (69%), Gaps = 9/437 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR--- 56
            YN G A GLT WSPNVNIFRDPRWGRGQETPGEDP +  +Y A++VRGLQG++ +    
Sbjct: 162 FYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYGAAFVRGLQGSSSNTKSV 221

Query: 57  ---LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
              L+ +ACCKH TAYDL++W GV RY F A V+ QDL DT+N PF++CVV+GK + VMC
Sbjct: 222 PPVLQTSACCKHATAYDLEDWKGVSRYSFKATVTIQDLADTFNPPFRSCVVDGKASCVMC 281

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           +Y  VNG P+CA+ D+L  T  G W LDGY+ +DCD+V ++ N+Q Y  T E+  A  +K
Sbjct: 282 AYTIVNGVPSCANGDLLTKTFRGSWGLDGYVAADCDAVAIMRNSQFYRPTAEDTVAATLK 341

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AGLD+DCGP++  +   A++ G L ++DV+ A+   +T +MRLG FDG+P    +GNLG 
Sbjct: 342 AGLDIDCGPYIQQYAMAAIQKGKLTQQDVDKAVKNLLTTRMRLGHFDGDPKTNVYGNLGA 401

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
             +CT  H+ LAL+AA  GIVLLKNSA  LPL     ++ AVIG N++  + ++GNY G 
Sbjct: 402 GHICTAEHKNLALEAALDGIVLLKNSAGVLPLKRGTVNSAAVIGHNANDVLALLGNYWGP 461

Query: 294 ACGYTTPLQGISRYAKTIH-QAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
            C  TTPLQGI  Y K +   AGC   ACN      A  +A+  +DA +L MGL Q  E+
Sbjct: 462 PCAPTTPLQGIQGYVKNVKFLAGCNKAACNVAATPQATALAS-SSDAVILFMGLSQEQES 520

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E  DR  LLLPG QQ L++ VA A++ PV+LVL+ GGPVD++FA+ +P+IGAILW GYPG
Sbjct: 521 EGKDRTTLLLPGNQQSLINAVANAAKRPVILVLLTGGPVDITFAQANPKIGAILWAGYPG 580

Query: 413 QAGGAAIADVLFGRANP 429
           QAGG AIA VLFG  NP
Sbjct: 581 QAGGLAIAKVLFGEKNP 597


>gi|225459350|ref|XP_002285805.1| PREDICTED: probable beta-D-xylosidase 7-like [Vitis vinifera]
          Length = 774

 Score =  473 bits (1216), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 236/438 (53%), Positives = 314/438 (71%), Gaps = 9/438 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLK- 58
           +YN G   G+T+W+PN+NIFRDPRWGRGQETPGEDP++TG YA SYVRG+QG+    LK 
Sbjct: 143 VYNAGQTKGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGSYAVSYVRGVQGDCLRGLKR 202

Query: 59  -----VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
                 +ACCKH+TAYDLD+W G+DR+ F+ARV+ QDL DTY  PF  C+ EG+ + +MC
Sbjct: 203 CGELQASACCKHFTAYDLDDWKGIDRFKFDARVTMQDLADTYQPPFHRCIEEGRASGIMC 262

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           +YN+VNG P+CAD ++L NT   +W   GYI SDCD+V +++++  + +TPE+A  D +K
Sbjct: 263 AYNRVNGVPSCADFNLLTNTARKRWNFQGYITSDCDAVSLIHDSYGFAKTPEDAVVDVLK 322

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AG+D++CG +L  HT+ AV    L E +++ AL     V+MRLG+F+G P  QP+G++GP
Sbjct: 323 AGMDVNCGTYLLNHTKSAVMQKKLPESELDRALENLFAVRMRLGLFNGNPKGQPYGDIGP 382

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
             VC+  HQ LAL AA  GIVLLKNS R LPL   +  ++AVIGPN++   T+IGNYAG 
Sbjct: 383 NQVCSVEHQTLALDAARDGIVLLKNSQRLLPLPKGKTMSLAVIGPNANSPKTLIGNYAGP 442

Query: 294 ACGYTTPLQGISRYAK-TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
            C + TPLQ +  Y K T++  GC  VAC+   +  A E+ A++AD  VLVMGLDQ+ E 
Sbjct: 443 PCKFITPLQALQSYVKSTMYHPGCDAVACSSPSIEKAVEI-AQKADYVVLVMGLDQTQER 501

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E  DR  L+LPG+QQ+L+  VA A++ PVVLVL+ GGPVD+SFAK    IG+ILW GYPG
Sbjct: 502 EAHDRLDLVLPGKQQQLIICVANAAKKPVVLVLLSGGPVDISFAKYSNNIGSILWAGYPG 561

Query: 413 QAGGAAIADVLFGRANPG 430
            AGGAAIA+ +FG  NPG
Sbjct: 562 GAGGAAIAETIFGDHNPG 579


>gi|224058158|ref|XP_002299457.1| predicted protein [Populus trichocarpa]
 gi|222846715|gb|EEE84262.1| predicted protein [Populus trichocarpa]
          Length = 780

 Score =  471 bits (1213), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 220/440 (50%), Positives = 303/440 (68%), Gaps = 11/440 (2%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN-------- 52
           MYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++   YA  +V+G QG         
Sbjct: 144 MYNVGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVASAYAIEFVKGFQGGHWKNEDGE 203

Query: 53  -TGSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASV 111
               +L ++ACCKH TAYDL+ W    RY FNA V++QD+EDTY  PF++C+ +GK + +
Sbjct: 204 INDDKLMLSACCKHSTAYDLEKWGNFSRYSFNAVVTEQDMEDTYQPPFRSCIQKGKASCL 263

Query: 112 MCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADA 171
           MCSYN+VNG P CA  D+L+     +W   GYI SDCD+V  ++  Q+Y+++PE+A A A
Sbjct: 264 MCSYNEVNGVPACAREDLLQKP-RTEWGFKGYITSDCDAVATIFEYQNYSKSPEDAVAIA 322

Query: 172 IKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNL 231
           +KAG+D++CG ++  + + AV  G L+EED++ AL    +VQ+RLG+FDG+P    FG L
Sbjct: 323 LKAGMDINCGTYVLRNAQSAVEKGKLQEEDIDRALHNLFSVQLRLGLFDGDPRKGQFGKL 382

Query: 232 GPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA 291
           GP++VCT  H+ LAL+AA QGIVLLKN  + LPL+     ++A+IGP +++  ++ G+Y 
Sbjct: 383 GPKNVCTKEHKTLALEAARQGIVLLKNDKKLLPLNKKAVSSLAIIGPLANMANSLGGDYT 442

Query: 292 GVACGYTTPLQGISRYA-KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSI 350
           G  C   +  +G+  Y  KT +  GC  VAC  +     A + A++AD  ++V GLD S 
Sbjct: 443 GYPCDPQSLFEGLKAYVKKTSYAIGCLDVACVSDTQFHKAIIVAKRADFVIIVAGLDLSQ 502

Query: 351 EAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGY 410
           E E  DR  LLLPG+Q  LVS VA AS+ PV+LVL  GGP+DVSFAK DPRI +ILW+GY
Sbjct: 503 ETEEHDRVSLLLPGKQMSLVSSVAAASKKPVILVLTGGGPLDVSFAKGDPRIASILWIGY 562

Query: 411 PGQAGGAAIADVLFGRANPG 430
           PG+AG  A+A+++FG  NPG
Sbjct: 563 PGEAGAKALAEIIFGEYNPG 582


>gi|413925166|gb|AFW65098.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 830

 Score =  471 bits (1211), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 236/438 (53%), Positives = 303/438 (69%), Gaps = 10/438 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR--- 56
            YN G A GLT WSPNVNIFRDPRWGRGQETPGEDP +  +YAA++VRGLQG++ +    
Sbjct: 199 FYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQGSSSNTKSV 258

Query: 57  ---LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
              L  +ACCKH TAYDL++W GV RY F A V+ QDL DT+N PF++CVV+GK + VMC
Sbjct: 259 PPVLLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMC 318

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDG-YIVSDCDSVGVLYNTQHYTRTPEEAAADAI 172
           +Y  VNG P+CA+ D+L  T  G W LDG Y+ +DCD+V ++ N+Q Y  T E+  A  +
Sbjct: 319 AYTSVNGVPSCANADLLTKTFRGSWGLDGRYVAADCDAVSIMRNSQFYRPTAEDTVATTL 378

Query: 173 KAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLG 232
           KAGLD+DCGP++  H   A++ G L ++DV+ A+    T +MRLG FDG+P A  +GNLG
Sbjct: 379 KAGLDIDCGPYVQQHAMAAIQKGKLTQQDVDKAVKNLFTTRMRLGHFDGDPKAHVYGNLG 438

Query: 233 PRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAG 292
              +CT  H+ LAL+AA  GIVLLKNSA  LPL      + AVIG N++  + ++GNY G
Sbjct: 439 AAHICTQEHKNLALEAALDGIVLLKNSAGVLPLKRGSVASAAVIGHNANDVLALLGNYWG 498

Query: 293 VACGYTTPLQGISRYAKTIH-QAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIE 351
             C  TTPLQGI  Y K +   AGC   ACN      AA +A+  +D+ +L MGL Q  E
Sbjct: 499 PPCAPTTPLQGIQGYVKNVRFLAGCHKAACNVAATPQAAALAS-TSDSVILFMGLSQEQE 557

Query: 352 AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYP 411
           +E  DR  LLLPG QQ L++ VA A++ PV+LVL+ GGPVD++FA+ +P+IGAILW GYP
Sbjct: 558 SEGKDRTTLLLPGNQQSLITAVANAAKRPVILVLLTGGPVDITFAQANPKIGAILWAGYP 617

Query: 412 GQAGGAAIADVLFGRANP 429
           GQAGG AIA VLFG  NP
Sbjct: 618 GQAGGLAIAKVLFGEKNP 635


>gi|357489431|ref|XP_003615003.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355516338|gb|AES97961.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 780

 Score =  470 bits (1209), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 224/441 (50%), Positives = 311/441 (70%), Gaps = 14/441 (3%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT------ 53
           +YN G A G+T+W+PN+NIFRDPRWGRGQET GEDP++  KY  SYVRGLQG++      
Sbjct: 148 VYNAGQAQGMTFWAPNINIFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQGDSFEGGKL 207

Query: 54  -GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVM 112
            G RLK +ACCKH+TAYDL+NW GV+RY F+A+V+ QDL DTY   F +CVV+G+ + +M
Sbjct: 208 IGGRLKASACCKHFTAYDLENWKGVNRYVFDAKVTLQDLADTYQPSFHSCVVQGRSSGIM 267

Query: 113 CSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAI 172
           C+YN+VNG P CAD ++L NT   +W  +GYI SDCD+V  +Y  Q Y +TPE+  AD +
Sbjct: 268 CAYNRVNGVPNCADYNLLTNTARKKWNFNGYIASDCDAVRFIYEKQGYAKTPEDVVADVL 327

Query: 173 KAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLG 232
           +AG+D++CG ++  H + AV    +    ++ AL    T+++RLG+FDG P+   +G +G
Sbjct: 328 RAGMDVECGNYMTKHAKSAVLQKKIPISQIDRALHNLFTIRIRLGLFDGNPTKLQYGRIG 387

Query: 233 PRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSD-VTVTMIGNYA 291
           P  VC+  +  LAL+AA  GIVLLKN+A  LPL   R +T+ VIGPN++  ++ ++GNY 
Sbjct: 388 PNQVCSKENLDLALEAARSGIVLLKNTASILPLP--RVNTLGVIGPNANKSSIVLLGNYF 445

Query: 292 GVACGYTTPLQGISRYAKTIH-QAGCF-GVACNGNQLIGAAEVAARQADATVLVMGLDQS 349
           G  C   + L+G   YA   H ++GC  GV C   ++  A EV A+ +D  +LVMGLDQS
Sbjct: 446 GQPCKQVSILKGFYTYASQTHYRSGCTDGVKCASAEIDRAVEV-AKISDYVILVMGLDQS 504

Query: 350 IEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVG 409
            E E +DR  L LPG+QQ+L++ VAKAS+ PV+LV++CGGPVD++FAKN+ +IG I+W G
Sbjct: 505 QETETLDRDHLELPGKQQKLINSVAKASKKPVILVILCGGPVDITFAKNNDKIGGIIWAG 564

Query: 410 YPGQAGGAAIADVLFGRANPG 430
           YPG+ GG A+A V+FG  NPG
Sbjct: 565 YPGELGGRALAQVVFGDYNPG 585


>gi|225432136|ref|XP_002274651.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
          Length = 809

 Score =  469 bits (1208), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 231/440 (52%), Positives = 303/440 (68%), Gaps = 15/440 (3%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR---- 56
           MYN G AGLT+WSPN+N+ RDPRWGR  ETPGEDP L G YA +YVRGLQ   G+     
Sbjct: 170 MYNSGNAGLTFWSPNINVVRDPRWGRILETPGEDPHLVGLYAVNYVRGLQDVVGAENTTD 229

Query: 57  -----LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASV 111
                LKV++CCKHY AYDLDNW G DR HF+ARVS QD+ +T+ +PF+ CV EG V+SV
Sbjct: 230 LNSRPLKVSSCCKHYAAYDLDNWKGADRVHFDARVSVQDMAETFVLPFEMCVKEGDVSSV 289

Query: 112 MCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP-EEAAAD 170
           MCSYN++NG P+CAD  +LK TI G+W L GYIVSDCDSV V+   Q +  +   ++AA 
Sbjct: 290 MCSYNKINGIPSCADSRLLKQTIRGEWDLHGYIVSDCDSVEVMAVDQKWLDSSFSDSAAQ 349

Query: 171 AIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGN 230
           A+ AG++LDCG F       AV  G   + D++ +L Y   + MR+G FDG P+   F +
Sbjct: 350 ALNAGMNLDCGTFNNRSLTEAVNQGKANQADLDHSLRYLYVLLMRVGFFDGIPA---FAS 406

Query: 231 LGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY 290
           LG  D+C+  H +LA +AA QGIVLLKN   TLPL ++++  +A++GP+++ T  MIGNY
Sbjct: 407 LGKDDICSAEHIELAREAARQGIVLLKNDNATLPLKSVKN--IALVGPHANATDAMIGNY 464

Query: 291 AGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSI 350
           AG+ C Y +PL   S   +  ++ GC  V C     I  A  AA++ADAT++  G D SI
Sbjct: 465 AGIPCYYVSPLDAFSSMGEVRYEKGCADVQCLNETYIFNAMEAAKRADATIIFAGTDLSI 524

Query: 351 EAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGY 410
           EAE +DR  LLLPG Q +L+++VA  S GPVVLV+M GG VD+SFA+++P+I AILW GY
Sbjct: 525 EAEALDRVDLLLPGYQTQLINQVADLSTGPVVLVIMSGGGVDISFARDNPKIAAILWAGY 584

Query: 411 PGQAGGAAIADVLFGRANPG 430
           PG+ GG AIADV+ G+ NPG
Sbjct: 585 PGEQGGNAIADVILGKYNPG 604


>gi|225469218|ref|XP_002264031.1| PREDICTED: probable beta-D-xylosidase 6-like [Vitis vinifera]
          Length = 789

 Score =  468 bits (1205), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 228/453 (50%), Positives = 298/453 (65%), Gaps = 25/453 (5%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG--------- 51
           MYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++   YA  +VRG QG         
Sbjct: 142 MYNVGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVASAYAVEFVRGFQGGNWKGGDEI 201

Query: 52  -------------NTGSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVP 98
                        + G  L ++ACCKH TAYDL+ W    RY F+A VS QDLEDTY  P
Sbjct: 202 RGAVGKKRVLRGDSDGDGLMLSACCKHLTAYDLEKWGNFSRYSFDAVVSNQDLEDTYQPP 261

Query: 99  FKACVVEGKVASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQ 158
           F++CV +GK + +MCSYN+VNG P CA  D+ +     +W   GYI SDCD+V  +Y  Q
Sbjct: 262 FRSCVQQGKASCLMCSYNRVNGVPACARQDLFQKA-KTEWGFKGYITSDCDAVATVYEYQ 320

Query: 159 HYTRTPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGM 218
           HY  +PE+A AD +KAG D++CG ++  HT+ A+  G ++EED++ AL    +VQMRLG+
Sbjct: 321 HYANSPEDAVADVLKAGTDINCGSYMLRHTQSAIDQGKVKEEDIDRALFNLFSVQMRLGL 380

Query: 219 FDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGP 278
           FDG+P+   +GNLGP+DVCT  H+ LAL+AA QGIVLLKN  + LPL   R  ++A+IGP
Sbjct: 381 FDGDPANGLYGNLGPKDVCTKEHRTLALEAARQGIVLLKNDKKFLPLDKSRISSLAIIGP 440

Query: 279 NSDVTVTMIGNYAGVACGYTTPLQGISRYA-KTIHQAGCFGVACNGNQLIGAAEVAARQA 337
            +D    + G Y G+ C   + ++G+  Y  KT   AGC  V C  +     A   AR+A
Sbjct: 441 QADQPF-LGGGYTGIPCKPESLVEGLKTYVEKTSFAAGCVDVPCLSDTGFDEAVSIARKA 499

Query: 338 DATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAK 397
           D  V+V GLD S E E  DR  LLLPG+Q  L+S VA A + P+VLVL  GGP+DVSFA+
Sbjct: 500 DIVVVVAGLDLSQETEDHDRVSLLLPGKQMALISSVASAIQKPLVLVLTGGGPLDVSFAE 559

Query: 398 NDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            DPRI +ILW+GYPG+AG  A+A+++FG  NPG
Sbjct: 560 QDPRIASILWIGYPGEAGAKALAEIIFGDFNPG 592


>gi|357489437|ref|XP_003615006.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355516341|gb|AES97964.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 685

 Score =  468 bits (1204), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 224/441 (50%), Positives = 308/441 (69%), Gaps = 14/441 (3%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT------ 53
           +YN G A G+ +W+PN+NIFRDPRWGRGQET GEDP++  KY  SYVRGLQG++      
Sbjct: 44  VYNAGQAQGMNFWAPNINIFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQGDSFEGGKL 103

Query: 54  -GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVM 112
            G RLK +ACCKH+TAYDL+NW GV+RY F+A+V+ QDL DTY   F +CVV+G+ + +M
Sbjct: 104 IGGRLKASACCKHFTAYDLENWKGVNRYVFDAKVTLQDLADTYQPSFHSCVVQGRSSGIM 163

Query: 113 CSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAI 172
           C+YN+VNG P CAD ++L NT   +W  +GYI SDCD+V  +Y  Q Y +TPE+  AD +
Sbjct: 164 CAYNRVNGVPNCADYNLLTNTARKKWNFNGYIASDCDAVRFIYEKQGYAKTPEDVVADVL 223

Query: 173 KAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLG 232
           +AG+DL+CG ++  H + AV    +    ++ AL    T+++RLG+FDG P+   +G +G
Sbjct: 224 RAGMDLECGNYMTKHAKSAVLQKKIPISQIDRALHNLFTIRIRLGLFDGNPTKLQYGRIG 283

Query: 233 PRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSD-VTVTMIGNYA 291
           P  VC+  +  LAL+AA  GIVLLKN+A  LPL   R +T+ VIGPN++  ++ ++GNY 
Sbjct: 284 PNQVCSKENLDLALEAARSGIVLLKNTASILPLP--RVNTLGVIGPNANKSSIVLLGNYI 341

Query: 292 GVACGYTTPLQGISRYAKTIH-QAGCF-GVACNGNQLIGAAEVAARQADATVLVMGLDQS 349
           G  C   + L+G   YA   H  +GC  G  C   ++  A EV A+ +D  +LVMGLDQS
Sbjct: 342 GPPCKNVSILKGFYTYASQTHYHSGCTDGTKCASAEIDRAVEV-AKISDYVILVMGLDQS 400

Query: 350 IEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVG 409
            E E +DR  L LPG+QQ+L++ VAKAS+ PV+LVL+CGGPVD++FAKN+ +IG I+W G
Sbjct: 401 QETETLDRDHLELPGKQQKLINSVAKASKKPVILVLLCGGPVDITFAKNNDKIGGIIWAG 460

Query: 410 YPGQAGGAAIADVLFGRANPG 430
           YPG+ GG A+A V+FG  NPG
Sbjct: 461 YPGELGGRALAQVVFGDYNPG 481


>gi|356515806|ref|XP_003526589.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
          Length = 772

 Score =  468 bits (1204), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 230/437 (52%), Positives = 308/437 (70%), Gaps = 8/437 (1%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT--GSRL 57
           ++N G A GLT+W+PN+NIFRDPRWGRGQET GEDP+LT +YA S+VRGLQG++  G+ L
Sbjct: 142 IFNAGQANGLTFWAPNINIFRDPRWGRGQETAGEDPLLTSRYAVSFVRGLQGDSFKGAHL 201

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
             +ACCKH+TAYDLDNW GVDR+ F+ARVS QDL DTY  PF++CV +G+ + +MC+YN+
Sbjct: 202 LASACCKHFTAYDLDNWKGVDRFVFDARVSLQDLADTYQPPFQSCVQQGRASGIMCAYNR 261

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
           VNG P CAD  +L  T   QW  +GYI SDC +VG +++ Q Y ++PE+  AD ++AG+D
Sbjct: 262 VNGVPNCADYGLLTQTARNQWDFNGYITSDCGAVGFIHDRQRYAKSPEDVVADVLRAGMD 321

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           L+CG +L  H + AV    L   +++ AL    +++MRLG+FDG P+   FG +G   VC
Sbjct: 322 LECGSYLTYHAKSAVLQKKLGMSEIDRALQNLFSIRMRLGLFDGNPTRLSFGLIGSNHVC 381

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPL-STLRHHTVAVIGPNSDVT-VTMIGNYAGVAC 295
           +  HQ LAL+AA  GIVLLKNS   LPL  T    ++AVIGPN++ + +T++GNYAG  C
Sbjct: 382 SKEHQYLALEAARNGIVLLKNSPTLLPLPKTSPSISLAVIGPNANSSPLTLLGNYAGPPC 441

Query: 296 GYTTPLQGISRYAKT-IHQAGCF-GVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
            Y T LQG   Y K   +  GC  G  C+  Q+  A EV A++ D  VLVMGLDQS E E
Sbjct: 442 KYVTILQGFRHYVKNAFYHPGCDGGPKCSSAQIDQAVEV-AKKVDYVVLVMGLDQSEERE 500

Query: 354 FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 413
             DR  L LPG+Q EL++ VA+AS+ PV+LVL+ GGP+D++ AK + +IG ILW GYPG+
Sbjct: 501 ERDRVHLDLPGKQLELINGVAEASKKPVILVLLSGGPLDITSAKYNHKIGGILWAGYPGE 560

Query: 414 AGGAAIADVLFGRANPG 430
            GG A+A ++FG  NPG
Sbjct: 561 LGGIALAQIIFGDHNPG 577


>gi|222629257|gb|EEE61389.1| hypothetical protein OsJ_15562 [Oryza sativa Japonica Group]
          Length = 771

 Score =  468 bits (1203), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 222/433 (51%), Positives = 293/433 (67%), Gaps = 4/433 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS--RLK 58
           M+N G AGLT+W+PN+N+FRDPRWGRGQETPGEDP +   Y+  YV+G Q + G   R+ 
Sbjct: 142 MHNAGQAGLTFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGEEGRMM 201

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           ++ACCKHY AYDL+ W G  RY FNA+V+ QD+EDTY  PFK+C+ EG+ + +MCSYNQV
Sbjct: 202 LSACCKHYIAYDLEKWRGFTRYTFNAKVNAQDMEDTYQPPFKSCIQEGRASCLMCSYNQV 261

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           NG P CA  DIL+     +W   GYI SDCD+V +++  Q YT + E++ A  +KAG+D+
Sbjct: 262 NGVPACARKDILQRA-RDEWGFQGYITSDCDAVAIIHENQTYTASDEDSIAVVLKAGMDI 320

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG FL  HT+ A+  G ++EED+N AL    +VQ+RLG FD     Q F  LGP +VCT
Sbjct: 321 NCGSFLIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCT 380

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
             H++LA +A  QG VLLKN    LPL       +A+IGP ++    + G+Y GV C  T
Sbjct: 381 TEHRELAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHST 440

Query: 299 TPLQGISRYA-KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           T ++G+  Y  KT   AGC  V CN     G A  AA++AD  VL+ GL+ + E E  DR
Sbjct: 441 TFVKGMQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDR 500

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             LLLPGRQ +L+  VA  ++ PVVLVLM GGPVDVSFAK+DPRI +ILW+GYPG+ GG 
Sbjct: 501 VSLLLPGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGN 560

Query: 418 AIADVLFGRANPG 430
            + ++LFG+ NPG
Sbjct: 561 VLPEILFGKYNPG 573


>gi|115459584|ref|NP_001053392.1| Os04g0530700 [Oryza sativa Japonica Group]
 gi|38346629|emb|CAD41212.2| OSJNBa0074L08.23 [Oryza sativa Japonica Group]
 gi|38346760|emb|CAE03865.2| OSJNBa0081C01.11 [Oryza sativa Japonica Group]
 gi|113564963|dbj|BAF15306.1| Os04g0530700 [Oryza sativa Japonica Group]
 gi|218195263|gb|EEC77690.1| hypothetical protein OsI_16749 [Oryza sativa Indica Group]
          Length = 770

 Score =  467 bits (1202), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 222/433 (51%), Positives = 293/433 (67%), Gaps = 4/433 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS--RLK 58
           M+N G AGLT+W+PN+N+FRDPRWGRGQETPGEDP +   Y+  YV+G Q + G   R+ 
Sbjct: 141 MHNAGQAGLTFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGEEGRMM 200

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           ++ACCKHY AYDL+ W G  RY FNA+V+ QD+EDTY  PFK+C+ EG+ + +MCSYNQV
Sbjct: 201 LSACCKHYIAYDLEKWRGFTRYTFNAKVNAQDMEDTYQPPFKSCIQEGRASCLMCSYNQV 260

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           NG P CA  DIL+     +W   GYI SDCD+V +++  Q YT + E++ A  +KAG+D+
Sbjct: 261 NGVPACARKDILQRA-RDEWGFQGYITSDCDAVAIIHENQTYTASDEDSIAVVLKAGMDI 319

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG FL  HT+ A+  G ++EED+N AL    +VQ+RLG FD     Q F  LGP +VCT
Sbjct: 320 NCGSFLIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCT 379

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
             H++LA +A  QG VLLKN    LPL       +A+IGP ++    + G+Y GV C  T
Sbjct: 380 TEHRELAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHST 439

Query: 299 TPLQGISRYA-KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           T ++G+  Y  KT   AGC  V CN     G A  AA++AD  VL+ GL+ + E E  DR
Sbjct: 440 TFVKGMQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDR 499

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             LLLPGRQ +L+  VA  ++ PVVLVLM GGPVDVSFAK+DPRI +ILW+GYPG+ GG 
Sbjct: 500 VSLLLPGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGN 559

Query: 418 AIADVLFGRANPG 430
            + ++LFG+ NPG
Sbjct: 560 VLPEILFGKYNPG 572


>gi|293336530|ref|NP_001167905.1| uncharacterized protein LOC100381616 [Zea mays]
 gi|223944757|gb|ACN26462.1| unknown [Zea mays]
          Length = 630

 Score =  466 bits (1200), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 223/435 (51%), Positives = 298/435 (68%), Gaps = 8/435 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS--RLK 58
           M+N G AGLTYW+PN+NIFRDPRWGRGQET GEDP +   Y+  YV+G QG  G   R++
Sbjct: 1   MHNAGQAGLTYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEEGEEGRIR 60

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           ++ACCKHYTAYD++ W G  RY FNA+V+ QDLEDTY  PFK C+ E + + +MC+YNQV
Sbjct: 61  LSACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQV 120

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           NG P CA  D+L+ T   +W   GYI SDCD+V +++  Q YT++ E++ A  +KAG+D+
Sbjct: 121 NGVPMCAHKDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTYTKSGEDSIAIVLKAGMDI 179

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG FL  HT+ A+  G ++EED++ AL    +VQ+RLG+FD   + Q F  LGP  VCT
Sbjct: 180 NCGSFLVRHTKSAIEKGKIQEEDIDRALFNLFSVQLRLGIFDKPSNNQWFSQLGPNSVCT 239

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPL--STLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
             H++LA +A  QG VLLKN    LPL  S +RH  VA+IGP+++    M G+Y GV C 
Sbjct: 240 KEHRELAAEAVRQGAVLLKNDHNFLPLKRSEVRH--VAIIGPSANDAYAMGGDYTGVPCN 297

Query: 297 YTTPLQGISRYA-KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI 355
            TT L+GI  YA +T    GC   +CN   L G A  AA++AD  V++ GL+ + E E  
Sbjct: 298 PTTFLKGIQAYATQTSFAPGCKDASCNSTDLFGEAVEAAKRADIVVVIAGLNLTEEREDF 357

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  LLLPG+Q  L+  +A  ++ P+VLVL+ GGPVDVSFAK DPRI +ILW+GYPG+ G
Sbjct: 358 DRVSLLLPGKQMGLIHAIASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVG 417

Query: 416 GAAIADVLFGRANPG 430
           G  + ++LFG  NPG
Sbjct: 418 GQVLPEILFGEYNPG 432


>gi|414586138|tpg|DAA36709.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 769

 Score =  466 bits (1198), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 223/435 (51%), Positives = 298/435 (68%), Gaps = 8/435 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS--RLK 58
           M+N G AGLTYW+PN+NIFRDPRWGRGQET GEDP +   Y+  YV+G QG  G   R++
Sbjct: 140 MHNAGQAGLTYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEEGEEGRIR 199

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           ++ACCKHYTAYD++ W G  RY FNA+V+ QDLEDTY  PFK C+ E + + +MC+YNQV
Sbjct: 200 LSACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQV 259

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           NG P CA  D+L+ T   +W   GYI SDCD+V +++  Q YT++ E++ A  +KAG+D+
Sbjct: 260 NGVPMCAHKDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTYTKSGEDSIAIVLKAGMDI 318

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG FL  HT+ A+  G ++EED++ AL    +VQ+RLG+FD   + Q F  LGP  VCT
Sbjct: 319 NCGSFLVRHTKSAIEKGKIQEEDIDRALFNLFSVQLRLGIFDKPSNNQWFSQLGPNSVCT 378

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPL--STLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
             H++LA +A  QG VLLKN    LPL  S +RH  VA+IGP+++    M G+Y GV C 
Sbjct: 379 KEHRELAAEAVRQGAVLLKNDHNFLPLKRSEVRH--VAIIGPSANDAYAMGGDYTGVPCN 436

Query: 297 YTTPLQGISRYA-KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI 355
            TT L+GI  YA +T    GC   +CN   L G A  AA++AD  V++ GL+ + E E  
Sbjct: 437 PTTFLKGIQAYATQTSFAPGCKDASCNSTDLFGEAVEAAKRADIVVVIAGLNLTEEREDF 496

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  LLLPG+Q  L+  +A  ++ P+VLVL+ GGPVDVSFAK DPRI +ILW+GYPG+ G
Sbjct: 497 DRVSLLLPGKQMGLIHAIASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVG 556

Query: 416 GAAIADVLFGRANPG 430
           G  + ++LFG  NPG
Sbjct: 557 GQVLPEILFGEYNPG 571


>gi|357164885|ref|XP_003580200.1| PREDICTED: probable beta-D-xylosidase 6-like [Brachypodium
           distachyon]
          Length = 771

 Score =  465 bits (1197), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 215/435 (49%), Positives = 295/435 (67%), Gaps = 6/435 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS----R 56
           M+N G AGLTYW+PN+N+FRDPRWGRGQETPGEDP +   Y+  YV+G QG  G     R
Sbjct: 140 MHNAGQAGLTYWAPNINVFRDPRWGRGQETPGEDPAVIAAYSVEYVKGFQGEYGDGKEGR 199

Query: 57  LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYN 116
           + ++ACCKHY AYDL+ W    RY FNA+V++QD EDTY  PFK+C+ EG+ + +MCSYN
Sbjct: 200 MMLSACCKHYVAYDLEKWGNFTRYTFNAKVNEQDFEDTYEPPFKSCIQEGRASCLMCSYN 259

Query: 117 QVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGL 176
           QVNG P CA  D+L+  +  +W   GY+VSDCD+VG++Y  Q+YT + E++ A  +KAG+
Sbjct: 260 QVNGVPACARKDLLQK-VRDEWGFQGYVVSDCDAVGIIYGYQNYTNSDEDSIAIVLKAGM 318

Query: 177 DLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDV 236
           D++CG FL  HT+ A++ G + EED+N AL    +VQ+RLG+FD     Q F  LGP ++
Sbjct: 319 DINCGSFLIRHTKSAIQKGKITEEDINHALFNLFSVQLRLGLFDKTSGNQWFTQLGPSNI 378

Query: 237 CTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
           CT  H++LA +AA QG VLLKN    LPL       +A+IGP ++    M G+Y GV C 
Sbjct: 379 CTKEHRELAAEAARQGTVLLKNDNSFLPLKRSEVSHIAIIGPVANDAYIMGGDYTGVPCN 438

Query: 297 YTTPLQGISRYA-KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI 355
            TT L+G+     +T   AGC  ++CN     G A   A++AD  VL+ GL+ + E E +
Sbjct: 439 PTTFLKGMQAVVPQTTIAAGCKDISCNSTDGFGEAIEVAKRADIVVLIAGLNLTQETEDL 498

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  LLLPG+Q +L++ +A  ++ P+VLV+  GGPVDVSFAK D RI ++LW+GYPG+ G
Sbjct: 499 DRVSLLLPGKQMDLINSIASVTKKPLVLVITGGGPVDVSFAKQDKRIASVLWIGYPGEVG 558

Query: 416 GAAIADVLFGRANPG 430
           G  + ++LFG  NPG
Sbjct: 559 GQVLPEILFGEYNPG 573


>gi|356548162|ref|XP_003542472.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
          Length = 778

 Score =  464 bits (1195), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 221/439 (50%), Positives = 304/439 (69%), Gaps = 11/439 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT------ 53
           +YN G A G+T+W+PN+N+FRDPRWGRGQET GEDP++  KY  +YVRGLQG++      
Sbjct: 144 VYNAGQATGMTFWAPNINVFRDPRWGRGQETAGEDPLMNAKYGVAYVRGLQGDSFEGGKL 203

Query: 54  GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
             RL+ +ACCKH+TAYDLD W G+DR+ F+ARV+ QDL DTY  PF++C+ +G+ + +MC
Sbjct: 204 AERLQASACCKHFTAYDLDQWKGLDRFVFDARVTSQDLADTYQPPFQSCIEQGRASGIMC 263

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           +YN+VNG P CAD ++L  T   QW+ DGYI SDC +V +++  Q Y +T E+A AD  +
Sbjct: 264 AYNRVNGVPNCADFNLLTKTARQQWKFDGYITSDCGAVSIIHEKQGYAKTAEDAIADVFR 323

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AG+D++CG ++  H + AV    L    ++ AL    ++++RLG+FDG P+  PFG +GP
Sbjct: 324 AGMDVECGDYITKHAKSAVFQKKLPISQIDRALQNLFSIRIRLGLFDGNPTKLPFGTIGP 383

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTV-TMIGNYAG 292
            +VC+    QLAL+AA  GIVLLKN+   LPL    + T+A+IGPN++ +    +GNY G
Sbjct: 384 NEVCSKQSLQLALEAARDGIVLLKNTNSLLPLPK-TNPTIALIGPNANASSKVFLGNYYG 442

Query: 293 VACGYTTPLQGISRYAKTIHQAGC-FGVACNGNQLIGAAEVAARQADATVLVMGLDQSIE 351
             C   T LQG   YAKT++  GC  G  C   Q+  A EV A++ D  VLVMGLDQS E
Sbjct: 443 RPCNLVTLLQGFEGYAKTVYHPGCDDGPQCAYAQIEEAVEV-AKKVDYVVLVMGLDQSQE 501

Query: 352 AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYP 411
            E  DR  L LPG+Q+EL+  VA+A++ PVV+VL+CGGPVD++ AK D ++G ILW GYP
Sbjct: 502 RESHDREYLGLPGKQEELIKSVARAAKRPVVVVLLCGGPVDITSAKFDDKVGGILWAGYP 561

Query: 412 GQAGGAAIADVLFGRANPG 430
           G+ GG A+A V+FG  NPG
Sbjct: 562 GELGGVALAQVVFGDHNPG 580


>gi|297611657|ref|NP_001067709.2| Os11g0291000 [Oryza sativa Japonica Group]
 gi|255680005|dbj|BAF28072.2| Os11g0291000 [Oryza sativa Japonica Group]
          Length = 764

 Score =  463 bits (1192), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 235/433 (54%), Positives = 298/433 (68%), Gaps = 8/433 (1%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKV 59
           +YN G A GLT WSPNVNIFRDPRWGRGQETPGEDPV   +YA ++V GLQG  G   + 
Sbjct: 131 LYNLGQANGLTIWSPNVNIFRDPRWGRGQETPGEDPVTASRYAVAFVTGLQGIGG---EA 187

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           +ACCKH TAYDLD WN V RY+++++V+ QDLEDTYN PFK+CV EGK   +MC YN +N
Sbjct: 188 SACCKHATAYDLDYWNNVVRYNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIMCGYNSIN 247

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           G P CA  D+L   +  +W ++GY+ SDCD+V  + +  HYT +PE+  A +IK G+D++
Sbjct: 248 GVPACASSDLLTKKVRQEWGMNGYVASDCDAVATIRDAHHYTLSPEDTVAVSIKVGMDVN 307

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQP-FGNLGPRDVCT 238
           CG +  +H   AV+ G L E+D++ AL     V+MRLG FDG+P +   +G+LG  DVC+
Sbjct: 308 CGNYTQVHAMAAVQKGNLTEKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCS 367

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PAH+ LAL+AA  GIVLLKN A  LPL      ++AVIGPN+D    + GNY G  C  T
Sbjct: 368 PAHKSLALEAAQDGIVLLKNDAGALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETT 427

Query: 299 TPLQGISRYA--KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFID 356
           TPLQGI  Y   +    AGC   AC       AA +A+  +D  VL MGL Q  E + +D
Sbjct: 428 TPLQGIKGYLGDRARFLAGCDSPACAVAATNEAAALAS-SSDHVVLFMGLSQKQEQDGLD 486

Query: 357 RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
           R  LLLPG QQ L++ VA A+R PV+LVL+ GGPVDV+FAK++P+IGAILW GYPGQAGG
Sbjct: 487 RTSLLLPGEQQGLITAVANAARRPVILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAGG 546

Query: 417 AAIADVLFGRANP 429
            AIA VLFG  NP
Sbjct: 547 LAIAKVLFGDHNP 559


>gi|449451581|ref|XP_004143540.1| PREDICTED: probable beta-D-xylosidase 6-like [Cucumis sativus]
          Length = 777

 Score =  462 bits (1188), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 223/448 (49%), Positives = 296/448 (66%), Gaps = 19/448 (4%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG--------- 51
           M+N G  GLT W+PN+NIFRDPRWGRGQETPGEDP++   Y+  +VRGLQ          
Sbjct: 132 MFNVGQCGLTIWAPNINIFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQSGNWMKEHEI 191

Query: 52  --------NTGSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACV 103
                   N    L V+ACCKH+TAYDL+ WN   RY F++ V++QDL DTY  PF++C+
Sbjct: 192 RNEVLEEDNGMGSLMVSACCKHFTAYDLEKWNNFTRYTFDSVVTEQDLGDTYQPPFRSCI 251

Query: 104 VEGKVASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT 163
            +GK + +MCSYN VNG P CA+PD+LK      W L GYI SDCD+V  +Y  Q YT T
Sbjct: 252 QQGKASCLMCSYNAVNGVPACANPDLLKKA-RNDWGLKGYITSDCDAVATVYEYQKYTDT 310

Query: 164 PEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEP 223
           PE+A AD +KAG+D++CG F+   T+ A+  G +REE+++ AL    +VQ RLG FDG P
Sbjct: 311 PEDAIADVLKAGMDINCGTFMLRGTKSAIDQGKVREEELDSALINLFSVQARLGFFDGNP 370

Query: 224 SAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVT 283
               FG LG +DVCT  H+ LAL+AA QGIVLLKN  + LPL      ++ VIG  ++ +
Sbjct: 371 REGKFGELGAQDVCTAQHKTLALEAARQGIVLLKNENKFLPLDKNAISSLTVIGSLANDS 430

Query: 284 VTMIGNYAGVACGYTTPLQGISRYAKTIHQA-GCFGVACNGNQLIGAAEVAARQADATVL 342
             ++G YAGV C   + ++G   YA+TI  A GC  V C  +     A + A++AD  + 
Sbjct: 431 SKLLGGYAGVPCSPMSLVEGFQEYAETIFFASGCLDVPCASDNRFEDAILIAKKADFVIA 490

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           V GLD S E E +DR  LLLPG+Q +LVS VA  S+ P++LVL+ GGP+D+SFAK D R+
Sbjct: 491 VAGLDASQETEDLDRVSLLLPGKQMDLVSSVASVSKKPIILVLIGGGPLDISFAKKDSRV 550

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            +ILW+G PG+AGG A+A+V+FG  NPG
Sbjct: 551 ASILWIGNPGEAGGKALAEVIFGDYNPG 578


>gi|449496501|ref|XP_004160150.1| PREDICTED: probable beta-D-xylosidase 6-like, partial [Cucumis
           sativus]
          Length = 767

 Score =  461 bits (1187), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 223/448 (49%), Positives = 296/448 (66%), Gaps = 19/448 (4%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG--------- 51
           M+N G  GLT W+PN+NIFRDPRWGRGQETPGEDP++   Y+  +VRGLQ          
Sbjct: 122 MFNVGQCGLTIWAPNINIFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQSGNWMKEHEI 181

Query: 52  --------NTGSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACV 103
                   N    L V+ACCKH+TAYDL+ WN   RY F++ V++QDL DTY  PF++C+
Sbjct: 182 RNEVLEEDNGMGSLMVSACCKHFTAYDLEKWNNFTRYTFDSVVTEQDLGDTYQPPFRSCI 241

Query: 104 VEGKVASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT 163
            +GK + +MCSYN VNG P CA+PD+LK      W L GYI SDCD+V  +Y  Q YT T
Sbjct: 242 QQGKASCLMCSYNAVNGVPACANPDLLKKA-RNDWGLKGYITSDCDAVATVYEYQKYTDT 300

Query: 164 PEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEP 223
           PE+A AD +KAG+D++CG F+   T+ A+  G +REE+++ AL    +VQ RLG FDG P
Sbjct: 301 PEDAIADVLKAGMDINCGTFMLRGTKSAIDQGKVREEELDSALINLFSVQARLGFFDGNP 360

Query: 224 SAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVT 283
               FG LG +DVCT  H+ LAL+AA QGIVLLKN  + LPL      ++ VIG  ++ +
Sbjct: 361 REGKFGELGAQDVCTAQHKTLALEAARQGIVLLKNENKFLPLDKNAISSLTVIGSLANDS 420

Query: 284 VTMIGNYAGVACGYTTPLQGISRYAKTIHQA-GCFGVACNGNQLIGAAEVAARQADATVL 342
             ++G YAGV C   + ++G   YA+TI  A GC  V C  +     A + A++AD  + 
Sbjct: 421 SKLLGGYAGVPCSPMSLVEGFQEYAETIFFASGCLDVPCASDNRFEDAILIAKKADFVIA 480

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           V GLD S E E +DR  LLLPG+Q +LVS VA  S+ P++LVL+ GGP+D+SFAK D R+
Sbjct: 481 VAGLDASQETEDLDRVSLLLPGKQMDLVSSVASVSKKPIILVLIGGGPLDISFAKKDSRV 540

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            +ILW+G PG+AGG A+A+V+FG  NPG
Sbjct: 541 ASILWIGNPGEAGGKALAEVIFGDYNPG 568


>gi|357489463|ref|XP_003615019.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
 gi|355516354|gb|AES97977.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
          Length = 785

 Score =  461 bits (1186), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 222/441 (50%), Positives = 305/441 (69%), Gaps = 13/441 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT------ 53
           +YN G A G+T+W+PN+NIFRDPRWGRGQET GEDP+++ KYA S+VRGLQG++      
Sbjct: 152 VYNAGQAEGMTFWAPNINIFRDPRWGRGQETAGEDPLVSAKYAVSFVRGLQGDSFEGGKL 211

Query: 54  -GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVM 112
              RLK +ACCKH+TAYDLDNW GVDR+ F+A V+ QDL DTY  PF +C+V+G+ + +M
Sbjct: 212 NEDRLKASACCKHFTAYDLDNWKGVDRFDFDANVTLQDLADTYQPPFHSCIVQGRSSGIM 271

Query: 113 CSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAI 172
           C+YN+VNG P CAD ++L NT   +W  +GYI SDC +V ++++ Q Y + PE+A AD +
Sbjct: 272 CAYNRVNGIPNCADYNLLTNTARKKWNFNGYITSDCSAVDIIHDRQGYAKAPEDAVADVL 331

Query: 173 KAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLG 232
           +AG+D++CG +   H++ AV    +    ++ AL    ++++RLG+FDG P+   +G +G
Sbjct: 332 QAGMDVECGDYFTSHSKSAVLQKKVPISQIDRALHNLFSIRIRLGLFDGHPTKLKYGKIG 391

Query: 233 PRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMI-GNYA 291
           P  VC+  +  +AL+AA  GIVLLKN+A  LPL      ++ VIGPN++ +  ++ GNY 
Sbjct: 392 PNRVCSKQNLNIALEAARSGIVLLKNAASILPLPK-STDSIVVIGPNANSSSQVVLGNYF 450

Query: 292 GVACGYTTPLQGISRYAKT-IHQAGCF-GVACNGNQLIGAAEVAARQADATVLVMGLDQS 349
           G  C   T LQG   Y+   ++  GC  G  C   ++  A EV A+  D  VLVMGLDQS
Sbjct: 451 GRPCNLVTILQGFENYSDNLLYHPGCSDGTKCVSAEIDRAVEV-AKVVDYVVLVMGLDQS 509

Query: 350 IEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVG 409
            E+E  DR  L LPG+QQEL++ VAKAS+ PV+LVL CGGPVD+SFAK D +IG ILW G
Sbjct: 510 QESEGHDRDDLELPGKQQELINSVAKASKRPVILVLFCGGPVDISFAKVDDKIGGILWAG 569

Query: 410 YPGQAGGAAIADVLFGRANPG 430
           YPG+ GG A+A V+FG  NPG
Sbjct: 570 YPGELGGMALAQVVFGDYNPG 590


>gi|356552866|ref|XP_003544783.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
          Length = 776

 Score =  460 bits (1184), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 222/440 (50%), Positives = 306/440 (69%), Gaps = 12/440 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT------ 53
           +YN G A G+T+W+PN+N+FRDPRWGRGQET GEDP++  KY  +YVRGLQG++      
Sbjct: 143 VYNAGQATGMTFWAPNINVFRDPRWGRGQETAGEDPLMNAKYGVAYVRGLQGDSFEGGKL 202

Query: 54  GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
           G RL+ +ACCKH+TAYDLD+W G+DR+ ++ARV+ QDL DTY  PF++C+ +G+ + +MC
Sbjct: 203 GERLQASACCKHFTAYDLDHWKGLDRFVYDARVTSQDLADTYQPPFQSCIEQGRASGIMC 262

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           +YN+VNG P CA+ ++L  T   QW+ DGYI SDC +V ++++ Q Y +T E+A AD  +
Sbjct: 263 AYNRVNGVPNCANFNLLTKTARQQWKFDGYITSDCGAVSIIHDEQGYAKTAEDAIADVFR 322

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AG+D++CG ++  H + AV    L    ++ AL    ++++RLG+ DG P+  PFG +GP
Sbjct: 323 AGMDVECGDYITKHGKSAVSQKKLPISQIDRALQNLFSIRIRLGLLDGNPTKLPFGTIGP 382

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTV-TMIGNYAG 292
             VC+    QLAL+AA  GIVLLKN+   LPL    + T+A+IGPN++ +    +GNY G
Sbjct: 383 DQVCSKQSLQLALEAARDGIVLLKNTNSLLPLPK-TNPTIALIGPNANASSKVFLGNYYG 441

Query: 293 VACGYTTPLQGISRYAK-TIHQAGC-FGVACNGNQLIGAAEVAARQADATVLVMGLDQSI 350
             C   T LQG   YAK T++  GC  G  C   Q+ GA EV A++ D  VLVMGLDQS 
Sbjct: 442 RPCNLVTLLQGFEGYAKDTVYHPGCDDGPQCAYAQIEGAVEV-AKKVDYVVLVMGLDQSQ 500

Query: 351 EAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGY 410
           E E  DR  L LPG+Q+EL+  VA+AS+ PVVLVL+CGGPVD++ AK D ++G ILW GY
Sbjct: 501 ERESHDREYLGLPGKQEELIKSVARASKRPVVLVLLCGGPVDITSAKFDDKVGGILWAGY 560

Query: 411 PGQAGGAAIADVLFGRANPG 430
           PG+ GG A+A V+FG  NPG
Sbjct: 561 PGELGGVALAQVVFGDHNPG 580


>gi|384872601|gb|AFI25186.1| putative beta-D-xylosidase [Nicotiana tabacum]
          Length = 791

 Score =  460 bits (1184), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 221/455 (48%), Positives = 295/455 (64%), Gaps = 27/455 (5%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG--------- 51
           MYN G AGLT+W+PN+NI RDPRWGRGQETPGEDP++   YA  YV G QG         
Sbjct: 143 MYNLGQAGLTFWAPNINILRDPRWGRGQETPGEDPMVVSAYAIEYVTGFQGLNPKAKKGN 202

Query: 52  ---------------NTGSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYN 96
                          N G RL ++ACCKH+TAYDL+ W    RY FNA V+KQD+EDT+ 
Sbjct: 203 RNGYGKKRRVLKEDDNDGERLMLSACCKHFTAYDLEKWGDATRYDFNAVVTKQDMEDTFQ 262

Query: 97  VPFKACVVEGKVASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYN 156
            PF++C+ +GK + +MCSYN VNG P CAD ++L + +   W  DGYI SDCD+V  +Y 
Sbjct: 263 APFRSCIQQGKASCLMCSYNSVNGVPACADKELL-DKVRTDWGFDGYITSDCDAVATIYE 321

Query: 157 TQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRL 216
            Q YT+TPE+A A A+KAG +++CG ++  H + A + G + EED++ AL Y  +VQ RL
Sbjct: 322 NQKYTKTPEDAVAVALKAGTNINCGTYMLRHMKSAFQQGSVLEEDLDRALQYLFSVQFRL 381

Query: 217 GMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVI 276
           G+FDG P+   F N G +DVCT  H  LAL AA QGIVLLKN  + LPL      T+A++
Sbjct: 382 GLFDGNPADGQFANFGAQDVCTSNHLNLALDAARQGIVLLKNDQKFLPLDKTSVSTLAIV 441

Query: 277 GPNSDVTVTMIGNYAGVACGYTTPLQGISRYA-KTIHQAGCFGVACNGNQLIGAAEVAAR 335
           GP ++V+ +  G Y+GV C   +  +G  R+  +T++ AGC  V CN       A    +
Sbjct: 442 GPMANVS-SPGGTYSGVPCKLKSIREGFHRHINRTLYAAGCLDVGCNSTAGFQDAISIVK 500

Query: 336 QADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSF 395
           +AD  ++V G D S E E  DR  LLLPG+Q  LV+ +A AS+ P++LVL  GGPVDVSF
Sbjct: 501 EADYVIVVAGSDLSEETEDHDRYSLLLPGQQTNLVTTLAAASKKPIILVLTGGGPVDVSF 560

Query: 396 AKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           A+ DPRI +ILWV YPG+ GG A+++++FG  NPG
Sbjct: 561 AEKDPRIASILWVAYPGETGGKALSEIIFGYQNPG 595


>gi|357489441|ref|XP_003615008.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355516343|gb|AES97966.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 798

 Score =  460 bits (1184), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 226/457 (49%), Positives = 310/457 (67%), Gaps = 30/457 (6%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT------ 53
           +YN G A G+T+W+PN+NIFRDPRWGRGQET GEDP++  KY  SYVRGLQG++      
Sbjct: 150 VYNAGQAQGMTFWAPNINIFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQGDSFEGGKL 209

Query: 54  -GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVS----------------KQDLEDTYN 96
            G RLK +ACCKH+TAYDLDNW G+DR+ F+A+VS                 QDL DTY 
Sbjct: 210 IGDRLKASACCKHFTAYDLDNWKGLDRFDFDAKVSFLFSMAYSPWMINYVTLQDLADTYQ 269

Query: 97  VPFKACVVEGKVASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYN 156
            PF +C+V+G+ + +MC+YN+VNG P CAD ++L  T   +W  +GYI SDC++V ++Y+
Sbjct: 270 PPFHSCIVQGRSSGIMCAYNRVNGVPNCADYNLLTKTARQKWNFNGYITSDCEAVRIIYD 329

Query: 157 TQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRL 216
            Q Y +TPE+A AD ++AG+D++CG +L  H + AV    +    ++ AL    T+++RL
Sbjct: 330 NQGYAKTPEDAVADVLQAGMDVECGDYLTKHAKAAVLQKKVPISQIDRALHNLFTIRIRL 389

Query: 217 GMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVI 276
           G+FDG P+   +G +GP  VC+  +  LAL+AA  GIVLLKN+A  LPL   R +T+ VI
Sbjct: 390 GLFDGNPTKLQYGRIGPNQVCSKENLDLALEAARSGIVLLKNTASILPLP--RVNTLGVI 447

Query: 277 GPNSDVTVTMI-GNYAGVACGYTTPLQGISRYAKTIH-QAGCF-GVACNGNQLIGAAEVA 333
           GPN++ +  ++ GNY G  C     L+G   YA   H ++GC  G  C   ++  A EV 
Sbjct: 448 GPNANKSSKVVLGNYFGRPCRLVPILKGFYTYASQTHYRSGCLDGTKCASAEIDRAVEV- 506

Query: 334 ARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDV 393
           A+ +D  +LVMGLDQS E E  DR  L LPG+QQEL++ VAKAS+ PV+LVL+CGGPVD+
Sbjct: 507 AKISDYVILVMGLDQSQERESRDRDDLELPGKQQELINSVAKASKKPVILVLLCGGPVDI 566

Query: 394 SFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +FAKN+ +IG I+W GYPG+ GG A+A V+FG  NPG
Sbjct: 567 TFAKNNDKIGGIIWAGYPGELGGRALAQVVFGDYNPG 603


>gi|414588273|tpg|DAA38844.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 775

 Score =  460 bits (1183), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 237/437 (54%), Positives = 300/437 (68%), Gaps = 9/437 (2%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-----G 54
           +YN G A GLT WSPNVNIFRDPRWGRGQETPGEDP +  KYA ++VRG+QG+       
Sbjct: 143 LYNLGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASKYAVAFVRGIQGSNPAGAAA 202

Query: 55  SRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCS 114
           + L+ +ACCKH TAYDL++WNGV RY+F+ARV+ QDL DT+N PF++CVV+GK + VMC+
Sbjct: 203 APLQASACCKHATAYDLEDWNGVARYNFDARVTLQDLADTFNPPFQSCVVDGKASCVMCA 262

Query: 115 YNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKA 174
           Y  +NG P CA  D+L  T  G W LDGY+ SDCD+V ++ + Q Y  TPE+  A A+KA
Sbjct: 263 YTVINGVPACASSDLLTKTFRGAWGLDGYVSSDCDAVAIMRDAQRYEPTPEDTVAVALKA 322

Query: 175 GLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQP-FGNLGP 233
           GLDL+CG +   H   A++ G + E+DV+ AL     V+MRLG FDG+P     +G LG 
Sbjct: 323 GLDLNCGTYTQQHGMAAIQQGKMTEKDVDKALTNLFAVRMRLGHFDGDPRGNALYGRLGA 382

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
            DVCT  H+ LAL+AA  GIVLLKN A  LPL      + AVIG N++  + + GNY G 
Sbjct: 383 ADVCTADHKNLALEAAQDGIVLLKNDAGILPLDRSAVGSAAVIGHNANDPLVLSGNYFGP 442

Query: 294 ACGYTTPLQGISRYAKTIH-QAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
           AC  TTPL+G+  Y + +   AGC   AC G    G A   A  A+   L MGL Q  E 
Sbjct: 443 ACETTTPLEGLQSYVRNVRFLAGCSSAAC-GYAATGQAAALASSAEYVFLFMGLSQDQEK 501

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E +DR  LLLPG+QQ LV+ VA A++ PVVLVL+ GGPVD++FA+++P+IGAILW GYPG
Sbjct: 502 EGLDRTSLLLPGKQQSLVTAVASAAKRPVVLVLLTGGPVDITFAQSNPKIGAILWAGYPG 561

Query: 413 QAGGAAIADVLFGRANP 429
           QAGG AIA VLFG  NP
Sbjct: 562 QAGGLAIARVLFGDHNP 578


>gi|15238197|ref|NP_196618.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
 gi|75264319|sp|Q9LXA8.1|BXL6_ARATH RecName: Full=Probable beta-D-xylosidase 6; Short=AtBXL6; Flags:
           Precursor
 gi|7671447|emb|CAB89387.1| beta-xylosidase-like protein [Arabidopsis thaliana]
 gi|15982753|gb|AAL09717.1| AT5g10560/F12B17_90 [Arabidopsis thaliana]
 gi|332004180|gb|AED91563.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
          Length = 792

 Score =  459 bits (1182), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 218/453 (48%), Positives = 296/453 (65%), Gaps = 24/453 (5%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS----- 55
           MYNGG AGLT+W+PN+N+FRDPRWGRGQETPGEDP +  +Y   +VRG Q          
Sbjct: 142 MYNGGQAGLTFWAPNINVFRDPRWGRGQETPGEDPKVVSEYGVEFVRGFQEKKKRKVLKR 201

Query: 56  -----------------RLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVP 98
                            +L ++ACCKH+TAYDL+ W    RY FNA V++QD+EDTY  P
Sbjct: 202 RFSDDVDDDRHDDDADGKLMLSACCKHFTAYDLEKWGNFTRYDFNAVVTEQDMEDTYQPP 261

Query: 99  FKACVVEGKVASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQ 158
           F+ C+ +GK + +MCSYN VNG P CA  D+L+     +W  +GYI SDCD+V  ++  Q
Sbjct: 262 FETCIRDGKASCLMCSYNAVNGVPACAQGDLLQKA-RVEWGFEGYITSDCDAVATIFAYQ 320

Query: 159 HYTRTPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGM 218
            YT++PEEA ADAIKAG+D++CG ++  HT+ A+  G + EE V+ AL     VQ+RLG+
Sbjct: 321 GYTKSPEEAVADAIKAGVDINCGTYMLRHTQSAIEQGKVSEELVDRALLNLFAVQLRLGL 380

Query: 219 FDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGP 278
           FDG+P    +G LG  D+C+  H++LAL+A  QGIVLLKN  + LPL+     ++A++GP
Sbjct: 381 FDGDPRRGQYGKLGSNDICSSDHRKLALEATRQGIVLLKNDHKLLPLNKNHVSSLAIVGP 440

Query: 279 NSDVTVTMIGNYAGVACGYTTPLQGISRYA-KTIHQAGCFGVACNGNQLIGAAEVAARQA 337
            ++    M G Y G  C   T    +  Y  KT + +GC  V+C+ +   G A   A+ A
Sbjct: 441 MANNISNMGGTYTGKPCQRKTLFTELLEYVKKTSYASGCSDVSCDSDTGFGEAVAIAKGA 500

Query: 338 DATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAK 397
           D  ++V GLD S E E  DR  L LPG+Q++LVS VA  S+ PV+LVL  GGPVDV+FAK
Sbjct: 501 DFVIVVAGLDLSQETEDKDRVSLSLPGKQKDLVSHVAAVSKKPVILVLTGGGPVDVTFAK 560

Query: 398 NDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           NDPRIG+I+W+GYPG+ GG A+A+++FG  NPG
Sbjct: 561 NDPRIGSIIWIGYPGETGGQALAEIIFGDFNPG 593


>gi|26449574|dbj|BAC41913.1| putative beta-xylosidase [Arabidopsis thaliana]
          Length = 732

 Score =  458 bits (1179), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 218/453 (48%), Positives = 296/453 (65%), Gaps = 24/453 (5%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS----- 55
           MYNGG AGLT+W+PN+N+FRDPRWGRGQETPGEDP +  +Y   +VRG Q          
Sbjct: 82  MYNGGQAGLTFWAPNINVFRDPRWGRGQETPGEDPKVVSEYGVEFVRGFQEKKKRKVLKR 141

Query: 56  -----------------RLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVP 98
                            +L ++ACCKH+TAYDL+ W    RY FNA V++QD+EDTY  P
Sbjct: 142 RFSDDVDDDRHDDDADGKLMLSACCKHFTAYDLEKWGNFTRYDFNAVVTEQDMEDTYQPP 201

Query: 99  FKACVVEGKVASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQ 158
           F+ C+ +GK + +MCSYN VNG P CA  D+L+     +W  +GYI SDCD+V  ++  Q
Sbjct: 202 FETCIRDGKASCLMCSYNAVNGVPACAQGDLLQKA-RVEWGFEGYITSDCDAVATIFAYQ 260

Query: 159 HYTRTPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGM 218
            YT++PEEA ADAIKAG+D++CG ++  HT+ A+  G + EE V+ AL     VQ+RLG+
Sbjct: 261 GYTKSPEEAVADAIKAGVDINCGTYMLRHTQSAIEQGKVSEELVDRALLNLFAVQLRLGL 320

Query: 219 FDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGP 278
           FDG+P    +G LG  D+C+  H++LAL+A  QGIVLLKN  + LPL+     ++A++GP
Sbjct: 321 FDGDPRRGQYGKLGSNDICSSDHRKLALEATRQGIVLLKNDHKLLPLNKNHVSSLAIVGP 380

Query: 279 NSDVTVTMIGNYAGVACGYTTPLQGISRYA-KTIHQAGCFGVACNGNQLIGAAEVAARQA 337
            ++    M G Y G  C   T    +  Y  KT + +GC  V+C+ +   G A   A+ A
Sbjct: 381 MANNISNMGGTYTGKPCQRKTLFTELLEYVKKTSYASGCSDVSCDSDTGFGEAVAIAKGA 440

Query: 338 DATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAK 397
           D  ++V GLD S E E  DR  L LPG+Q++LVS VA  S+ PV+LVL  GGPVDV+FAK
Sbjct: 441 DFVIVVAGLDLSQETEDKDRVSLSLPGKQKDLVSHVAAVSKKPVILVLTGGGPVDVTFAK 500

Query: 398 NDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           NDPRIG+I+W+GYPG+ GG A+A+++FG  NPG
Sbjct: 501 NDPRIGSIIWIGYPGETGGQALAEIIFGDFNPG 533


>gi|326491679|dbj|BAJ94317.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 772

 Score =  458 bits (1179), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 215/435 (49%), Positives = 294/435 (67%), Gaps = 6/435 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS----R 56
           M+N G AGLTYW+PN+N+FRDPRWGRGQETPGEDP +   Y+  YV+G QG  G     R
Sbjct: 141 MHNAGQAGLTYWAPNINVFRDPRWGRGQETPGEDPAMIAAYSVEYVKGFQGEYGDGREGR 200

Query: 57  LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYN 116
           + ++ACCKHY AYDL+ W    RY FNA V+ QD EDTY  PFK+C+ EG+ + +MCSYN
Sbjct: 201 MMLSACCKHYIAYDLEKWGKFARYTFNAEVNAQDFEDTYEPPFKSCIQEGRASCLMCSYN 260

Query: 117 QVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGL 176
           QVNG P CA  D+L+  I  +W   GYIVSDCD+V +++  Q YT + E++ A  +KAG+
Sbjct: 261 QVNGVPACARKDLLQK-IRDEWGFKGYIVSDCDAVAIIHENQTYTSSDEDSVAIVLKAGM 319

Query: 177 DLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDV 236
           D++CG FL  HT+ A+  G ++EED+N AL    +VQ+RLG+F+     Q F  LGP +V
Sbjct: 320 DVNCGSFLIRHTKSAIEKGKIQEEDINHALYNLFSVQLRLGLFEKANENQWFTRLGPSNV 379

Query: 237 CTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
           CT  H++LA +A  QG VLLKN    LPL   +   +A+IG  ++    M G+Y GV C 
Sbjct: 380 CTKEHRELAAEAVRQGTVLLKNDNSFLPLKRSKVSHIALIGAAANDAYIMGGDYTGVPCD 439

Query: 297 YTTPLQGISRYA-KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI 355
             T L+G+  +  +T   AGC  V+C+     G A  AA++AD  V++ GL+ + E+E +
Sbjct: 440 PITFLKGMQAFVPQTTVAAGCKDVSCDSPDGFGEAIEAAKRADIVVVIAGLNLTQESEDL 499

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  LLLPGRQQ+LV+ +A  ++ P+VLV+  GGPVDV+FAK DPRI ++LW+GYPG+ G
Sbjct: 500 DRVTLLLPGRQQDLVNIIASVTKKPIVLVITGGGPVDVAFAKQDPRIASVLWIGYPGEVG 559

Query: 416 GAAIADVLFGRANPG 430
           G  + ++LFG  NPG
Sbjct: 560 GQVLPEILFGEYNPG 574


>gi|297811163|ref|XP_002873465.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319302|gb|EFH49724.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 796

 Score =  455 bits (1171), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 220/455 (48%), Positives = 297/455 (65%), Gaps = 26/455 (5%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT------- 53
           MYNGG AGLT+W+PN+N+FRDPRWGRGQETPGEDP +  +Y   +VRG Q          
Sbjct: 142 MYNGGQAGLTFWAPNINLFRDPRWGRGQETPGEDPKVVSEYGVEFVRGFQEKKKRKVLKT 201

Query: 54  --GS---------------RLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYN 96
             GS               +L ++ACCKH+TAYDL+ W    RY FNA V++QD+EDTY 
Sbjct: 202 RFGSDNVDDDARYDDDADGKLMLSACCKHFTAYDLEKWGNFTRYDFNAVVTEQDMEDTYQ 261

Query: 97  VPFKACVVEGKVASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYN 156
            PF+ C+ +GK + +MCSYN VNG P CA  D+L+     +W  DGYI SDCD+V  ++ 
Sbjct: 262 PPFETCIKDGKASCLMCSYNAVNGVPACAQGDLLQKA-RVEWGFDGYITSDCDAVATIFE 320

Query: 157 TQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRL 216
            Q YT++PEEA ADAIKAG+D++CG ++  +T+ A+  G + EE V+ AL     VQ+RL
Sbjct: 321 YQGYTKSPEEAVADAIKAGVDINCGTYMLRNTQSAIEQGKVSEELVDRALLNLFAVQLRL 380

Query: 217 GMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVI 276
           G+FDG+P    +G LG  D+C+  H++LAL+AA QGIVLLKN  + LPL+     ++A++
Sbjct: 381 GLFDGDPRGGHYGKLGSNDICSSDHRKLALEAARQGIVLLKNDYKLLPLNKNHVSSLAIV 440

Query: 277 GPNSDVTVTMIGNYAGVACGYTTPLQGISRYA-KTIHQAGCFGVACNGNQLIGAAEVAAR 335
           GP ++    M G Y G  C   T    +  Y  KT + +GC  V+C  +   G A   A+
Sbjct: 441 GPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKTSYASGCSDVSCVSDTGFGEAVAIAK 500

Query: 336 QADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSF 395
            AD  ++V GLD S E E  DR  L LPG+Q++LVS VA  S+ PV+LVL  GGPVDV+F
Sbjct: 501 GADFVIVVAGLDLSQETEDKDRFSLSLPGKQKDLVSSVAAVSKKPVILVLTGGGPVDVTF 560

Query: 396 AKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           AK DPRIG+I+W+GYPG+ GG A+A+++FG  NPG
Sbjct: 561 AKTDPRIGSIIWIGYPGETGGQALAEIIFGDFNPG 595


>gi|297736787|emb|CBI25988.3| unnamed protein product [Vitis vinifera]
          Length = 774

 Score =  455 bits (1170), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 226/440 (51%), Positives = 297/440 (67%), Gaps = 15/440 (3%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR---- 56
           MYN G AGLT+WSPN+N+ RDPRWGR  ETPGEDP+  G Y  +YVRGLQ   G+     
Sbjct: 167 MYNLGHAGLTFWSPNINVARDPRWGRILETPGEDPLTVGVYGVNYVRGLQDIEGTENTTD 226

Query: 57  -----LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASV 111
                LK+A+ CKH+ AYDLD W  VDR HF+A+VS+QD+ +T+  PF+ CV EG  +SV
Sbjct: 227 LNSRPLKIASSCKHFAAYDLDQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSSV 286

Query: 112 MCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTR-TPEEAAAD 170
           MCS+N +NG P CADP  LK  I  QW L GYIVSDC ++  +   Q +   T EE  A 
Sbjct: 287 MCSFNNINGIPPCADPRFLKGVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVAL 346

Query: 171 AIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGN 230
           ++KAGLDL+CG +       AVR G + E DV+ +L+Y   V MR+G FDG PS     +
Sbjct: 347 SMKAGLDLECGHYYNDSLATAVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIPS---LAS 403

Query: 231 LGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY 290
           LG +D+C   H +LA +AA QGIVLLKN   TLPL  ++   +A++GP+++ TV MIGNY
Sbjct: 404 LGKKDICNDEHIELAREAARQGIVLLKNDNATLPLKPVKK--LALVGPHANATVAMIGNY 461

Query: 291 AGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSI 350
           AG+ C Y +PL   S      ++ GC  V C+ +  +  A  AA+ ADAT++++G D SI
Sbjct: 462 AGIPCHYVSPLDAFSELGDVTYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDLSI 521

Query: 351 EAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGY 410
           EAE  DR  LLLPG Q E+V++V   S GPV+LV+MCGGP+D+SFAKN+P+I AILW G+
Sbjct: 522 EAEERDREDLLLPGYQTEMVNQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWAGF 581

Query: 411 PGQAGGAAIADVLFGRANPG 430
           PG+ GG AIAD++FG+ NPG
Sbjct: 582 PGEQGGNAIADIVFGKYNPG 601


>gi|225432134|ref|XP_002274619.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
          Length = 805

 Score =  454 bits (1169), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 226/440 (51%), Positives = 297/440 (67%), Gaps = 15/440 (3%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR---- 56
           MYN G AGLT+WSPN+N+ RDPRWGR  ETPGEDP+  G Y  +YVRGLQ   G+     
Sbjct: 167 MYNLGHAGLTFWSPNINVARDPRWGRILETPGEDPLTVGVYGVNYVRGLQDIEGTENTTD 226

Query: 57  -----LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASV 111
                LK+A+ CKH+ AYDLD W  VDR HF+A+VS+QD+ +T+  PF+ CV EG  +SV
Sbjct: 227 LNSRPLKIASSCKHFAAYDLDQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSSV 286

Query: 112 MCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTR-TPEEAAAD 170
           MCS+N +NG P CADP  LK  I  QW L GYIVSDC ++  +   Q +   T EE  A 
Sbjct: 287 MCSFNNINGIPPCADPRFLKGVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVAL 346

Query: 171 AIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGN 230
           ++KAGLDL+CG +       AVR G + E DV+ +L+Y   V MR+G FDG PS     +
Sbjct: 347 SMKAGLDLECGHYYNDSLATAVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIPS---LAS 403

Query: 231 LGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY 290
           LG +D+C   H +LA +AA QGIVLLKN   TLPL  ++   +A++GP+++ TV MIGNY
Sbjct: 404 LGKKDICNDEHIELAREAARQGIVLLKNDNATLPLKPVKK--LALVGPHANATVAMIGNY 461

Query: 291 AGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSI 350
           AG+ C Y +PL   S      ++ GC  V C+ +  +  A  AA+ ADAT++++G D SI
Sbjct: 462 AGIPCHYVSPLDAFSELGDVTYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDLSI 521

Query: 351 EAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGY 410
           EAE  DR  LLLPG Q E+V++V   S GPV+LV+MCGGP+D+SFAKN+P+I AILW G+
Sbjct: 522 EAEERDREDLLLPGYQTEMVNQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWAGF 581

Query: 411 PGQAGGAAIADVLFGRANPG 430
           PG+ GG AIAD++FG+ NPG
Sbjct: 582 PGEQGGNAIADIVFGKYNPG 601


>gi|168046596|ref|XP_001775759.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672911|gb|EDQ59442.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 784

 Score =  450 bits (1157), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 230/448 (51%), Positives = 299/448 (66%), Gaps = 18/448 (4%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ---------- 50
           MYN G +GLTYWSPN+NI RDPRWGR QETPGEDP L+  YA  +V+GLQ          
Sbjct: 133 MYNQGRSGLTYWSPNINIARDPRWGRTQETPGEDPKLSSGYAVHFVKGLQEGDYDQNQPQ 192

Query: 51  --GNTGSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKV 108
                  RLK++ACCKH+TA+DLD W   DR HF+++V++QDLEDTYN  FK+CV EG+ 
Sbjct: 193 AVSRGPRRLKISACCKHFTAHDLDRWKDYDRDHFDSKVTQQDLEDTYNPSFKSCVKEGQS 252

Query: 109 ASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAA 168
           +SVMCSYN++NG P C   ++L  T+  QW  DGYIVSDCD+V ++++  +Y  T E+A 
Sbjct: 253 SSVMCSYNRLNGIPMCTHYELLTLTVRNQWGFDGYIVSDCDAVALIHDYINYAPTSEDAV 312

Query: 169 ADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF 228
           +  + AG+DL+CG    +H   A+   L+ E  +++ L     V+MRLGMFDG PS  P+
Sbjct: 313 SYVMLAGMDLNCGSTTLVHGLAALDKKLIWEGLIDMHLRNLFRVRMRLGMFDGNPSTLPY 372

Query: 229 GNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIG 288
           G+LGP D+CT  +Q LAL+AA Q +VLLKN    LP        +AVIG ++D T  M+G
Sbjct: 373 GSLGPEDMCTEDNQHLALEAARQSLVLLKNEKNALPWKKTHGLKLAVIGHHADATREMLG 432

Query: 289 NYAGVACGYTTPLQGISRY-----AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLV 343
           NY G  C + +PLQG ++       +  H+ GC   AC     I AA+ AA QADA VLV
Sbjct: 433 NYEGYPCKFVSPLQGFAKVLSDHSPRISHERGCSDAACEDQFYIYAAKEAAAQADAVVLV 492

Query: 344 MGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRG-PVVLVLMCGGPVDVSFAKNDPRI 402
           +G+ Q+ E E  DR  LLLPGRQ ELVS V +AS G PVVLVL+ G P+DVSFA +DPRI
Sbjct: 493 LGISQAQEKEGRDRDSLLLPGRQMELVSSVVEASAGRPVVLVLLSGSPLDVSFANDDPRI 552

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            +I+W GYPGQ+GG AIA+ +FG  NPG
Sbjct: 553 QSIIWAGYPGQSGGEAIAEAIFGLVNPG 580


>gi|225432132|ref|XP_002274591.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Vitis vinifera]
          Length = 805

 Score =  448 bits (1153), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 211/440 (47%), Positives = 298/440 (67%), Gaps = 15/440 (3%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR---- 56
           MYN G AGLT+WSPN+N+ RD RWGR QET GEDP + G++A +YVRGLQ   G+     
Sbjct: 166 MYNLGHAGLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGTENVTD 225

Query: 57  -----LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASV 111
                LKV++CCKHY AYD+D+W  VDR+ F+ARVS+QD+++T+  PF+ CV EG V+SV
Sbjct: 226 LNSRPLKVSSCCKHYAAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREGDVSSV 285

Query: 112 MCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE-EAAAD 170
           MCS+N++NG P C+DP +LK  I  +W L GYIVSDC  + V+ + Q+Y    + +A A 
Sbjct: 286 MCSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAK 345

Query: 171 AIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGN 230
            ++AGLDL+CG +       +V  G + + +++ AL     + MR+G FDG P+   + +
Sbjct: 346 TLQAGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPA---YES 402

Query: 231 LGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY 290
           LG +D+C   H +LA +AA QGIVLLKN    LPL   +   +A++GP+++ T  MIGNY
Sbjct: 403 LGLKDICAADHIELAREAARQGIVLLKNDYEVLPLKPGKK--IALVGPHANATEVMIGNY 460

Query: 291 AGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSI 350
           AG+ C Y +PL+  S      +  GC   +C+ +     A+ AA+ A+ T++ +G D SI
Sbjct: 461 AGLPCKYVSPLEAFSAIGNVTYATGCLDASCSNDTYFSEAKEAAKSAEVTIIFVGTDLSI 520

Query: 351 EAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGY 410
           EAEF+DR   LLPG Q EL+ +VA+ S GPV+LV++ G  +D++FAKN+PRI AILWVG+
Sbjct: 521 EAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGF 580

Query: 411 PGQAGGAAIADVLFGRANPG 430
           PG+ GG AIADV+FG+ NPG
Sbjct: 581 PGEQGGHAIADVVFGKYNPG 600


>gi|62701894|gb|AAX92967.1| beta-xylosidase, putative [Oryza sativa Japonica Group]
 gi|77550041|gb|ABA92838.1| Glycosyl hydrolase family 3 C terminal domain containing protein
           [Oryza sativa Japonica Group]
          Length = 793

 Score =  448 bits (1153), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 235/461 (50%), Positives = 298/461 (64%), Gaps = 36/461 (7%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKV 59
           +YN G A GLT WSPNVNIFRDPRWGRGQETPGEDPV   +YA ++V GLQG  G   + 
Sbjct: 132 LYNLGQANGLTIWSPNVNIFRDPRWGRGQETPGEDPVTASRYAVAFVTGLQGIGG---EA 188

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNAR----------------------------VSKQDL 91
           +ACCKH TAYDLD WN V RY+++++                            V+ QDL
Sbjct: 189 SACCKHATAYDLDYWNNVVRYNYDSKDGASTGKSGETSSQVEKKHGPYEKGYFAVTLQDL 248

Query: 92  EDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSV 151
           EDTYN PFK+CV EGK   +MC YN +NG P CA  D+L   +  +W ++GY+ SDCD+V
Sbjct: 249 EDTYNPPFKSCVAEGKATCIMCGYNSINGVPACASSDLLTKKVRQEWGMNGYVASDCDAV 308

Query: 152 GVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTIT 211
             + +  HYT +PE+  A +IK G+D++CG +  +H   AV+ G L E+D++ AL     
Sbjct: 309 ATIRDAHHYTLSPEDTVAVSIKVGMDVNCGNYTQVHAMAAVQKGNLTEKDIDRALVNLFA 368

Query: 212 VQMRLGMFDGEPSAQP-FGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRH 270
           V+MRLG FDG+P +   +G+LG  DVC+PAH+ LAL+AA  GIVLLKN A  LPL     
Sbjct: 369 VRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKNDAGALPLQPSAV 428

Query: 271 HTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRYA--KTIHQAGCFGVACNGNQLIG 328
            ++AVIGPN+D    + GNY G  C  TTPLQGI  Y   +    AGC   AC       
Sbjct: 429 TSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGCDSPACAVAATNE 488

Query: 329 AAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCG 388
           AA +A+  +D  VL MGL Q  E + +DR  LLLPG QQ L++ VA A+R PV+LVL+ G
Sbjct: 489 AAALAS-SSDHVVLFMGLSQKQEQDGLDRTSLLLPGEQQGLITAVANAARRPVILVLLTG 547

Query: 389 GPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           GPVDV+FAK++P+IGAILW GYPGQAGG AIA VLFG  NP
Sbjct: 548 GPVDVTFAKDNPKIGAILWAGYPGQAGGLAIAKVLFGDHNP 588


>gi|297736788|emb|CBI25989.3| unnamed protein product [Vitis vinifera]
          Length = 746

 Score =  440 bits (1132), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 219/431 (50%), Positives = 288/431 (66%), Gaps = 25/431 (5%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYN G AGLT+WSPN+N+ RDPRWGR  ETPGEDP L G YA +Y               
Sbjct: 170 MYNSGNAGLTFWSPNINVVRDPRWGRILETPGEDPHLVGLYAVNY--------------- 214

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
               HY AYDLDNW G DR HF+ARVS QD+ +T+ +PF+ CV EG V+SVMCSYN++NG
Sbjct: 215 ----HYAAYDLDNWKGADRVHFDARVSVQDMAETFVLPFEMCVKEGDVSSVMCSYNKING 270

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP-EEAAADAIKAGLDLD 179
            P+CAD  +LK TI G+W L GYIVSDCDSV V+   Q +  +   ++AA A+ AG++LD
Sbjct: 271 IPSCADSRLLKQTIRGEWDLHGYIVSDCDSVEVMAVDQKWLDSSFSDSAAQALNAGMNLD 330

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG F       AV  G   + D++ +L Y   + MR+G FDG P+   F +LG  D+C+ 
Sbjct: 331 CGTFNNRSLTEAVNQGKANQADLDHSLRYLYVLLMRVGFFDGIPA---FASLGKDDICSA 387

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
            H +LA +AA QGIVLLKN   TLPL ++++  +A++GP+++ T  MIGNYAG+ C Y +
Sbjct: 388 EHIELAREAARQGIVLLKNDNATLPLKSVKN--IALVGPHANATDAMIGNYAGIPCYYVS 445

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           PL   S   +  ++ GC  V C     I  A  AA++ADAT++  G D SIEAE +DR  
Sbjct: 446 PLDAFSSMGEVRYEKGCADVQCLNETYIFNAMEAAKRADATIIFAGTDLSIEAEALDRVD 505

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           LLLPG Q +L+++VA  S GPVVLV+M GG VD+SFA+++P+I AILW GYPG+ GG AI
Sbjct: 506 LLLPGYQTQLINQVADLSTGPVVLVIMSGGGVDISFARDNPKIAAILWAGYPGEQGGNAI 565

Query: 420 ADVLFGRANPG 430
           ADV+ G+ NPG
Sbjct: 566 ADVILGKYNPG 576


>gi|253761860|ref|XP_002489304.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
 gi|241946952|gb|EES20097.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
          Length = 750

 Score =  439 bits (1128), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 227/436 (52%), Positives = 291/436 (66%), Gaps = 20/436 (4%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR--- 56
           +YN G A GLT WSPNVNIFRDPRWGRGQETPGEDP +  KYA ++VRG+QG++ +    
Sbjct: 132 LYNLGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASKYAVAFVRGIQGSSAAGAAA 191

Query: 57  -LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSY 115
            L+ +ACCKH TAYDL++WNGV RY+F+ARV+ QDL DT+N PF++CVV+GK   VMC+Y
Sbjct: 192 PLQASACCKHATAYDLEDWNGVARYNFDARVTAQDLADTFNPPFQSCVVDGKATCVMCAY 251

Query: 116 NQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAG 175
             +NG P CA  D+L  T  G W  DGY+ SDCD+V ++++ Q Y  TPE+  A A+K  
Sbjct: 252 TGINGVPACASSDLLTKTFRGAWGHDGYVSSDCDAVAIMHDAQRYVPTPEDTVAVALKE- 310

Query: 176 LDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQP-FGNLGPR 234
                      H   A++ G + E+DV+ AL     V+MRLG FDG+P     +G+LG  
Sbjct: 311 -----------HGMAAIQQGKMTEKDVDKALTNLFAVRMRLGHFDGDPRGNALYGHLGAA 359

Query: 235 DVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA 294
           DVCT  H+ LAL+AA  GIVLLKN A  LPL      + AVIG N++  + + GNY G A
Sbjct: 360 DVCTADHKNLALEAAQDGIVLLKNDAGILPLDRSAMGSAAVIGHNANDALVLRGNYFGPA 419

Query: 295 CGYTTPLQGISRYAKTIH-QAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
           C  TTPLQG+  Y   +   AGC   AC G    G A   A  ++   L MGL Q  E E
Sbjct: 420 CETTTPLQGVQSYVSNVRFLAGCSSAAC-GYAATGQAAALASSSEYVFLFMGLSQDQEKE 478

Query: 354 FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 413
            +DR  LLLPG+QQ L++ VA A++ PV+LVL+ GGPVD++FA+++P+IGAILW GYPGQ
Sbjct: 479 GLDRTSLLLPGKQQSLITAVASAAKRPVILVLLTGGPVDITFAQSNPKIGAILWAGYPGQ 538

Query: 414 AGGAAIADVLFGRANP 429
           AGG AIA VLFG  NP
Sbjct: 539 AGGLAIARVLFGDHNP 554


>gi|147857580|emb|CAN78858.1| hypothetical protein VITISV_030325 [Vitis vinifera]
          Length = 699

 Score =  437 bits (1123), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 234/432 (54%), Positives = 282/432 (65%), Gaps = 69/432 (15%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ-GNTGS--RL 57
           MYN G+AGLT+WSPNVNIF+DPRWGRGQETPGEDP+L+ KYA+ YVRGLQ  + GS  RL
Sbjct: 144 MYNVGLAGLTFWSPNVNIFQDPRWGRGQETPGEDPLLSSKYASGYVRGLQQSDDGSPDRL 203

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYDLDNW GVD +HFNA V+ QD++DT+  PFK+CV++G VASV+     
Sbjct: 204 KVAACCKHYTAYDLDNWKGVDCFHFNAVVTNQDMDDTFQPPFKSCVIDGNVASVI----- 258

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
                                    YIVSDCDSV V YN+QHYT+TPEEAAA AI AGLD
Sbjct: 259 -------------------------YIVSDCDSVDVFYNSQHYTKTPEEAAAKAILAGLD 293

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           L+CG FL  HTE AV+GGL+ E  V+ A++      MRLG FDG PS   +G LGP+DVC
Sbjct: 294 LNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGPKDVC 353

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           T  HQ+ A +A  QGIV                                   +AG  C Y
Sbjct: 354 TSEHQERAREAPRQGIV-----------------------------------FAGTPCKY 378

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TTPLQG++    T +  GC  VAC G   I  A+  A  ADATVL++G+DQSIEAE  DR
Sbjct: 379 TTPLQGLTALVATTYLPGCSNVAC-GTAQIDEAKKIAAAADATVLIVGIDQSIEAEGRDR 437

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             + LPG+Q  L++ VAK S+G V+LV+M GG  D+SFAKND +I +I WVGYPG+AGGA
Sbjct: 438 VNIQLPGQQPLLITEVAKXSKGNVILVVMSGGGFDISFAKNDDKITSIQWVGYPGEAGGA 497

Query: 418 AIADVLFGRANP 429
           AIADV+FG  NP
Sbjct: 498 AIADVIFGFYNP 509


>gi|359473427|ref|XP_002265788.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Vitis vinifera]
          Length = 464

 Score =  436 bits (1120), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 207/440 (47%), Positives = 294/440 (66%), Gaps = 15/440 (3%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR---- 56
           MYN G AGLT+WSPN+N+ RD RWGR QET  EDP + G++A +YVRGLQ   G+     
Sbjct: 1   MYNLGHAGLTFWSPNINVVRDTRWGRTQETSREDPFMVGEFAVNYVRGLQDVEGTENVTD 60

Query: 57  -----LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASV 111
                LKV++CCKHY AYD+D+W  +DR+ F+ARVS+QD+++T+  PF+ CV EG V+SV
Sbjct: 61  LNSRPLKVSSCCKHYAAYDIDSWLNIDRHTFDARVSEQDMKETFVSPFERCVREGDVSSV 120

Query: 112 MCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE-EAAAD 170
           MCS+N++NG P C+DP +LK  I  +W L GYIVSDC  + V+ + Q+Y    + +A A 
Sbjct: 121 MCSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAK 180

Query: 171 AIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGN 230
            ++AGLDL+CG +        V  G + + +++ AL     + MR+G FDG P+   + +
Sbjct: 181 TLQAGLDLECGHYYTDALNELVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPA---YES 237

Query: 231 LGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY 290
           LG +D+C   H +LA +AA QGIVLLKN     PL   +   +A++GP+++ T  MIGNY
Sbjct: 238 LGLKDICAADHIELAREAARQGIVLLKNDYEVFPLKPGKK--LALVGPHANATEVMIGNY 295

Query: 291 AGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSI 350
           AG+   Y +PL+  S      +  GC   +C+ +     A+ AA+ A+ T++ +G D SI
Sbjct: 296 AGLPRKYVSPLEAFSAIGNVTYTTGCLDASCSNDTYFSEAKEAAKSAEVTIIFVGTDLSI 355

Query: 351 EAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGY 410
           EAEF+DR   LLPG Q EL+ +VA+ S GPV+LV++ G  +D++FAKN+PRI AILWVG+
Sbjct: 356 EAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGF 415

Query: 411 PGQAGGAAIADVLFGRANPG 430
           PG+ GG AIADV+FG+ NPG
Sbjct: 416 PGEQGGHAIADVVFGKYNPG 435


>gi|30316196|sp|P83344.1|XYNB_PRUPE RecName: Full=Putative beta-D-xylosidase; AltName: Full=PpAz152
 gi|19879972|gb|AAM00218.1|AF362990_1 beta-D-xylosidase, partial [Prunus persica]
          Length = 461

 Score =  429 bits (1102), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 219/262 (83%), Positives = 236/262 (90%)

Query: 169 ADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF 228
           ADAIKAGLDLDCGPFLAIHTE AVR GL+ + ++N ALA T+TVQMRLGMFDGEPSA  +
Sbjct: 1   ADAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQY 60

Query: 229 GNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIG 288
           GNLGPRDVCTPAHQQLAL+AA QGIVLL+N  R+LPLST RH TVAVIGPNSDVTVTMIG
Sbjct: 61  GNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSTRRHRTVAVIGPNSDVTVTMIG 120

Query: 289 NYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQ 348
           NYAGVACGYTTPLQGI RY +TIHQAGC  V CNGNQL GAAE AARQADATVLVMGLDQ
Sbjct: 121 NYAGVACGYTTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQ 180

Query: 349 SIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWV 408
           SIEAEF+DRAGLLLPG QQELVSRVA+ASRGP +LVLM GGP+DV+FAKNDPRI AI+WV
Sbjct: 181 SIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWV 240

Query: 409 GYPGQAGGAAIADVLFGRANPG 430
           GYPGQAGG AIA+VLFG ANPG
Sbjct: 241 GYPGQAGGTAIANVLFGTANPG 262


>gi|125576923|gb|EAZ18145.1| hypothetical protein OsJ_33695 [Oryza sativa Japonica Group]
          Length = 591

 Score =  427 bits (1099), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 212/396 (53%), Positives = 277/396 (69%), Gaps = 3/396 (0%)

Query: 37  LTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYN 96
           +  KYA ++V+G+QGN+ + L+ +ACCKH TAYDL++WNGV RY+FNA+V+ QDLEDTYN
Sbjct: 1   MASKYAVAFVKGMQGNSSAILQTSACCKHVTAYDLEDWNGVQRYNFNAKVTAQDLEDTYN 60

Query: 97  VPFKACVVEGKVASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYN 156
            PF++CVV+ K   +MC+Y  +NG P CA+ D+L  T+ G W LDGYI SDCD+V ++ +
Sbjct: 61  PPFRSCVVDAKATCIMCAYTGINGVPACANADLLTKTVRGDWGLDGYIASDCDAVAIMRD 120

Query: 157 TQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRL 216
            Q YT+TPE+A A A+KAGLD++CG ++  H   A++ G L EED++ AL     ++MRL
Sbjct: 121 AQRYTQTPEDAVAVALKAGLDMNCGTYMQQHATAAIQQGKLTEEDIDKALKNLFAIRMRL 180

Query: 217 GMFDGEP-SAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAV 275
           G FDG+P S   +G LG  D+CTP H+ LAL+AA  GIVLLKN A  LPL      + AV
Sbjct: 181 GHFDGDPRSNSVYGGLGAADICTPEHRSLALEAAMDGIVLLKNDAGILPLDRTAVASAAV 240

Query: 276 IGPNSDVTVTMIGNYAGVACGYTTPLQGISRYAKTIH-QAGCFGVACNGNQLIGAAEVAA 334
           IGPN++  + +IGNY G  C  TTPL GI  Y K +   AGC   AC+      AA VA+
Sbjct: 241 IGPNANDGLALIGNYFGPPCESTTPLNGILGYIKNVRFLAGCNSAACDVAATDQAAAVAS 300

Query: 335 RQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVS 394
             +D   L MGL Q  E+E  DR  LLLPG QQ L++ VA A++ PV+LVL+ GGPVDV+
Sbjct: 301 -SSDYVFLFMGLSQKQESEGRDRTSLLLPGEQQSLITAVADAAKRPVILVLLTGGPVDVT 359

Query: 395 FAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           FA+ +P+IGAILW GYPGQAGG AIA VLFG  NPG
Sbjct: 360 FAQTNPKIGAILWAGYPGQAGGLAIARVLFGDHNPG 395


>gi|78482949|emb|CAJ41429.1| beta (1,4)-xylosidase [Populus tremula x Populus alba]
          Length = 732

 Score =  427 bits (1099), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 220/388 (56%), Positives = 274/388 (70%), Gaps = 15/388 (3%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           M+NGG+AGLTYWSPNV     PRWGRGQETPGEDPV+ GKYAASYVRGLQG+ G RLKVA
Sbjct: 141 MFNGGVAGLTYWSPNVTYSVYPRWGRGQETPGEDPVVVGKYAASYVRGLQGSDGIRLKVA 200

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKH+TAYDLDNWNGVDR+HFNA+VSKQD+ DT++VPF+ CV EGKVASVMCSYNQVNG
Sbjct: 201 ACCKHFTAYDLDNWNGVDRFHFNAKVSKQDMVDTFDVPFRMCVKEGKVASVMCSYNQVNG 260

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
            PTCADP++LK T+ GQWRL+GYIVSDCDS GV Y  QH+T +P  ++    KAGLDLDC
Sbjct: 261 IPTCADPNLLKKTVRGQWRLNGYIVSDCDSFGVYYGQQHFT-SPRRSSLGCYKAGLDLDC 319

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           GPFL  H +   +     E ++N A   T+T Q+ LG+FDG P  Q  G++ P  +  P 
Sbjct: 320 GPFLVTHRDAVKKAA--EEAEINNAWLKTLTFQISLGIFDGSP-LQAVGDVVP-TMGPPT 375

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVT--MIGNYAGVACGYT 298
           +Q LA+ A  + + + KN  R   L + RH    + GP +       M+GNY G+ C Y 
Sbjct: 376 NQDLAVNAPKR-LFIFKN--RAFLLYSPRH----IFGPVALFKSLPFMLGNYEGLPCKYL 428

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
            PLQG++ +   ++  GC  V C    +  A ++AA  ADA VLV+G DQSIE E  DR 
Sbjct: 429 FPLQGLAGFVSLLYLPGCSNVICAVADVGSAVDLAA-SADAVVLVVGADQSIEREGHDRV 487

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLM 386
              LPG+QQELV+RVA A++GPV+LV+M
Sbjct: 488 DFYLPGKQQELVTRVAMAAKGPVLLVIM 515


>gi|413919687|gb|AFW59619.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 451

 Score =  424 bits (1090), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 199/297 (67%), Positives = 237/297 (79%), Gaps = 3/297 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG--NTGSRLK 58
           M+N G+AGLT+WSPN+NIFRDPRWGRGQETPGEDP+LT KYA  YV GLQG  +    LK
Sbjct: 154 MHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQGAVSGAGALK 213

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           VAACCKHYTAYD+DNW GV+RY F+A VS+QDL+DT+  PFK+CVV+G VASVMCSYNQV
Sbjct: 214 VAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVVDGNVASVMCSYNQV 273

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           NGKPTCAD D+L   I G W+L+GYI SDCDSV VLYN QHYT+TPE+AAA +IKAGLDL
Sbjct: 274 NGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAISIKAGLDL 333

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG FLA HT  AV+ G L E DV+ A+   +   MRLG FDG+P   PFGNLGP DVCT
Sbjct: 334 NCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNLGPSDVCT 393

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC 295
           P++Q+LA +AA QGIVLLKN+ + LPLS     ++AVIGPN++ + TMIGNY G +C
Sbjct: 394 PSNQELAREAARQGIVLLKNTGK-LPLSATSIKSMAVIGPNANASFTMIGNYEGTSC 449


>gi|297736786|emb|CBI25987.3| unnamed protein product [Vitis vinifera]
          Length = 745

 Score =  419 bits (1077), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 203/431 (47%), Positives = 284/431 (65%), Gaps = 30/431 (6%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           MYN G AGLT+WSPN+N+ RD RWGR QET GEDP + G++A +YVRGLQ   G+   V+
Sbjct: 166 MYNLGHAGLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGTE-NVS 224

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           +CCKHY AYD+D+W  VDR+ F+ARVS+QD+++T+  PF+ CV EG V+SVMCS+N++NG
Sbjct: 225 SCCKHYAAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREGDVSSVMCSFNKING 284

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE-EAAADAIKAGLDLD 179
            P C+DP +LK  I  +W L GYIVSDC  + V+ + Q+Y    + +A A  ++AGLDL+
Sbjct: 285 IPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAKTLQAGLDLE 344

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG +       +V  G + + +++ AL     + MR+G FDG P+   + +LG +D+C  
Sbjct: 345 CGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPA---YESLGLKDICAA 401

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
            H +LA +AA QGIVLLKN    LPL   +   +A++GP+++ T  MIGNYAG+ C Y +
Sbjct: 402 DHIELAREAARQGIVLLKNDYEVLPLKPGKK--IALVGPHANATEVMIGNYAGLPCKYVS 459

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           PL+  S      +  G                        T++ +G D SIEAEF+DR  
Sbjct: 460 PLEAFSAIGNVTYATGF-----------------------TIIFVGTDLSIEAEFVDRVD 496

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
            LLPG Q EL+ +VA+ S GPV+LV++ G  +D++FAKN+PRI AILWVG+PG+ GG AI
Sbjct: 497 FLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGEQGGHAI 556

Query: 420 ADVLFGRANPG 430
           ADV+FG+ NPG
Sbjct: 557 ADVVFGKYNPG 567


>gi|359477633|ref|XP_003632006.1| PREDICTED: LOW QUALITY PROTEIN: beta-D-xylosidase 3-like [Vitis
           vinifera]
          Length = 781

 Score =  415 bits (1067), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 217/443 (48%), Positives = 287/443 (64%), Gaps = 19/443 (4%)

Query: 2   YNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR----- 56
           YN G AGLT+W PNVN+ RDPRWGR QET GEDP     YA SYVRGLQ   G+      
Sbjct: 138 YNLGHAGLTFWCPNVNVARDPRWGRTQETXGEDPFTVSVYAVSYVRGLQDVEGTENTTDL 197

Query: 57  ----LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVM 112
               LKV++  KH+ AYDLDNW  VDR HFNARVS+QD+ +T+  PF+ACV EG V+ VM
Sbjct: 198 NSRPLKVSSSGKHFAAYDLDNWLNVDRNHFNARVSEQDMAETFLRPFEACVREGDVSGVM 257

Query: 113 CSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTR-TPEEAAADA 171
           CS+N +NG P CADP + K TI  +W L GYIVSDC S+  +   Q +   T EEA A  
Sbjct: 258 CSFNNINGIPPCADPRLFKGTIRDEWNLHGYIVSDCWSIETIVEDQKFLDVTGEEAVALN 317

Query: 172 IKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNL 231
           +KAGLDL+CG +       AV  G + + D++ +L+    V MRLG FDG P+     +L
Sbjct: 318 LKAGLDLECGHYYNDSPASAVMAGRVGQHDLDQSLSNLYVVLMRLGFFDGIPA---LASL 374

Query: 232 GPRDVCTPA-HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY 290
           G  D+C  A H +LA +AA QGIVLLKN   TLPL ++++  +A++GPN+D    M+GNY
Sbjct: 375 GKDDICLSAEHIELAREAARQGIVLLKNDNATLPLKSVKN--LALVGPNADAYGAMMGNY 432

Query: 291 AGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGL-DQS 349
           AG  C   +P    S      ++ GC  V C+ +  +  A  AA+ AD T++V+G+ D S
Sbjct: 433 AGPPCRSVSPRDAFSAIGNVTYEMGCGDVLCHNDTYVYKAVEAAKHADTTIIVVGITDVS 492

Query: 350 IEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL--MCGGPVDVSFAKNDPRIGAILW 407
           I  E  DR  LLLPG Q  LV+++AKA+  P++LV+   CGGP+D+SFA+++P I  ILW
Sbjct: 493 IGTEDKDRVDLLLPGYQTHLVNQIAKATTAPIILVVCGHCGGPIDISFARDNPGIEPILW 552

Query: 408 VGYPGQAGGAAIADVLFGRANPG 430
            G+PG+ GG AIADV++G+ NPG
Sbjct: 553 AGFPGEEGGNAIADVVYGKYNPG 575


>gi|222615852|gb|EEE51984.1| hypothetical protein OsJ_33664 [Oryza sativa Japonica Group]
          Length = 753

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 220/434 (50%), Positives = 282/434 (64%), Gaps = 21/434 (4%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRG-LQGNTGSRLK 58
           +YN G A GLT WSPNVNIFRDP   R    PG+             RG   G  G   +
Sbjct: 131 LYNLGQANGLTIWSPNVNIFRDPSGTR----PGD-----------ARRGPRHGEQGIGGE 175

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
            +ACCKH TAYDLD WN V RY+++++V+ QDLEDTYN PFK+CV EGK   +MC YN +
Sbjct: 176 ASACCKHATAYDLDYWNNVVRYNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIMCGYNSI 235

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           NG P CA  D+L   +  +W ++GY+ SDCD+V  + +  HYT +PE+  A +IK G+D+
Sbjct: 236 NGVPACASSDLLTKKVRQEWGMNGYVASDCDAVATIRDAHHYTLSPEDTVAVSIKVGMDV 295

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQP-FGNLGPRDVC 237
           +CG +  +H   AV+ G L E+D++ AL     V+MRLG FDG+P +   +G+LG  DVC
Sbjct: 296 NCGNYTQVHAMAAVQKGNLTEKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVC 355

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           +PAH+ LAL+AA  GIVLLKN A  LPL      ++AVIGPN+D    + GNY G  C  
Sbjct: 356 SPAHKSLALEAAQDGIVLLKNDAGALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCET 415

Query: 298 TTPLQGISRYA--KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI 355
           TTPLQGI  Y   +    AGC   AC  +    AA +A+  +D  VL MGL Q  E + +
Sbjct: 416 TTPLQGIKGYLGDRARFLAGCDSPACAVDATNEAAALAS-SSDHVVLFMGLSQKQEQDGL 474

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  LLLPG QQ L++ VA A+R PV+LVL+ GGPVDV+FAK++P+IGAILW GYPGQAG
Sbjct: 475 DRTSLLLPGEQQGLITAVANAARRPVILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAG 534

Query: 416 GAAIADVLFGRANP 429
           G AIA VLFG  NP
Sbjct: 535 GLAIAKVLFGDHNP 548


>gi|242052713|ref|XP_002455502.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
 gi|241927477|gb|EES00622.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
          Length = 825

 Score =  414 bits (1063), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 215/452 (47%), Positives = 286/452 (63%), Gaps = 25/452 (5%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG--------- 51
           MYN G A LTYWSPN+N+ RDPRWGR  ETPGEDP + G+YA ++VRG+Q          
Sbjct: 169 MYNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDVVIAAGAAA 228

Query: 52  --NTGSR-LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKV 108
             +  SR +KV++CCKH+ AYD+D W   DR  F+A+V ++D+ +T+  PF+ C+ +G  
Sbjct: 229 TADPFSRPIKVSSCCKHFAAYDVDAWFKADRLTFDAQVEERDMVETFERPFEMCIRDGDA 288

Query: 109 ASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTR-TPEEA 167
           + VMCSYN++NG P CAD  +L  T+  QW+L GYIVSDCDSV V+     +   T  EA
Sbjct: 289 SCVMCSYNRINGIPACADARLLSETVRSQWQLHGYIVSDCDSVRVMVRDAKWLNYTGVEA 348

Query: 168 AADAIKAGLDLDCG-------PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFD 220
            A A+KAGLDLDCG        F   +   AVR G ++E DV+ AL    T  MRLG FD
Sbjct: 349 TAAAMKAGLDLDCGMFWEGARDFFTTYGVDAVRQGKIKEADVDNALGNVYTTLMRLGFFD 408

Query: 221 GEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNS 280
           G P    F +LG  DVCT  H++LA  AA QG+VLLKN AR LPL   + ++V+++G   
Sbjct: 409 GMPE---FESLGADDVCTRDHKELAADAARQGMVLLKNDARRLPLDPSKINSVSLVGLLE 465

Query: 281 DVTVT--MIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQAD 338
            +  T  M+G+Y G  C   TP   I +     +   C   AC+  + +G A   A+ AD
Sbjct: 466 HINATDVMLGDYRGKPCRIVTPYDAIRQVVNATYVHACDSGACSTAEGMGRASRTAKIAD 525

Query: 339 ATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKN 398
           AT+++ GL+ S+E E  DR  LLLP  Q   ++ VA+AS  P+VLV+M  G VDVSFA+N
Sbjct: 526 ATIVIAGLNMSVERESNDREDLLLPWNQSSWINAVAEASTTPIVLVIMSAGGVDVSFAQN 585

Query: 399 DPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + +IGAI+W GYPG+ GG AIADVLFG+ NPG
Sbjct: 586 NTKIGAIVWAGYPGEEGGTAIADVLFGKYNPG 617


>gi|226506870|ref|NP_001146482.1| uncharacterized protein LOC100280070 precursor [Zea mays]
 gi|219887469|gb|ACL54109.1| unknown [Zea mays]
 gi|413947917|gb|AFW80566.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 835

 Score =  412 bits (1060), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 215/451 (47%), Positives = 284/451 (62%), Gaps = 24/451 (5%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ---------- 50
           MYN G A LTYWSPN+N+ RDPRWGR  ETPGEDP + G+YA ++VRG+Q          
Sbjct: 181 MYNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDVDDRPYAAA 240

Query: 51  GNTGSR-LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVA 109
            +  SR +KV++CCKH+ AYD+D W   DR  F+A+V ++D+ +T+  PF+ C+ +G  +
Sbjct: 241 ADPFSRPIKVSSCCKHFAAYDVDAWFKADRLTFDAQVEERDMVETFERPFEMCIRDGDAS 300

Query: 110 SVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTR-TPEEAA 168
            VMCSYN++NG P CAD  +L  T+  QW+L GYIVSDCDSV V+     +   T  EA 
Sbjct: 301 CVMCSYNRINGIPACADARLLSETVRSQWQLHGYIVSDCDSVRVMVRDAKWLNYTGVEAT 360

Query: 169 ADAIKAGLDLDCG-------PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDG 221
           A A+KAGLDLDCG        F   +   AVR G ++E DV+ AL+   T  MRLG FDG
Sbjct: 361 AAAMKAGLDLDCGMFWEGARDFFTTYGVDAVRQGKIKEGDVDNALSNVYTTLMRLGFFDG 420

Query: 222 EPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSD 281
            P    F +LG  +VCT  H++LA  AA QG+VLLKN AR LPL   + ++V+++G    
Sbjct: 421 MPE---FESLGASNVCTDGHKELAADAARQGMVLLKNDARRLPLDPNKINSVSLVGLLEH 477

Query: 282 VTVT--MIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADA 339
           +  T  M+G+Y G  C   TP   I       +   C   ACN  + +G A   A+ ADA
Sbjct: 478 INATDVMLGDYRGKPCRIVTPYNAIRNMVNATYVHACDSGACNTAEGMGRASSTAKIADA 537

Query: 340 TVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKND 399
           T+++ GL+ S+E E  DR  LLLP  Q   ++ VA AS  P+VLV+M  G VDVSFA N+
Sbjct: 538 TIVIAGLNMSVERESNDREDLLLPWNQSSWINAVAMASPTPIVLVIMSAGGVDVSFAHNN 597

Query: 400 PRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            +IGAI+W GYPG+ GG AIADVLFG+ NPG
Sbjct: 598 TKIGAIVWAGYPGEEGGTAIADVLFGKYNPG 628


>gi|40363751|dbj|BAD06320.1| putative beta-xylosidase [Triticum aestivum]
          Length = 573

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 199/377 (52%), Positives = 259/377 (68%), Gaps = 2/377 (0%)

Query: 55  SRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCS 114
           S L+ +ACCKH+TAYDL+NW GV R+ F+A+V++QDL DTYN PFK+CV +G  + +MCS
Sbjct: 3   SDLEASACCKHFTAYDLENWKGVTRFAFDAKVTEQDLADTYNPPFKSCVEDGGASGIMCS 62

Query: 115 YNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKA 174
           YN+VNG PTCAD ++L  T  G W  +GYI SDCD+V ++++ Q Y + PE+A AD +KA
Sbjct: 63  YNRVNGVPTCADHNLLSKTARGDWSFNGYITSDCDAVAIIHDVQGYAKAPEDAVADVLKA 122

Query: 175 GLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPR 234
           G+D++CG ++  H   A + G +  ED++ AL     ++MRLG+F+G P    +GN+G  
Sbjct: 123 GMDVNCGGYIQTHGVSAYQQGKITGEDIDRALRNLFAIRMRLGLFNGNPKYNRYGNIGAD 182

Query: 235 DVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA 294
            VC   HQ LALQAA  GIVLLKN A  LPLS  +  +VAVIGPN +    ++GNY G  
Sbjct: 183 QVCKKEHQDLALQAAQDGIVLLKNDAGALPLSKSKVSSVAVIGPNGNNASLLLGNYFGPP 242

Query: 295 CGYTTPLQGISRYAK-TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
           C   TP Q +  Y K      GC    CN +  IG A  AA  AD  VL MGLDQ+ E E
Sbjct: 243 CISVTPFQALQGYVKDATFVQGCNAAVCNVSN-IGEAVHAASSADYVVLFMGLDQNQERE 301

Query: 354 FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 413
            +DR  L LPG Q+ LV++VA A++ PV+LVL+CGGPVDV+FAKN+P+IGAI+W GYPGQ
Sbjct: 302 EVDRLELGLPGMQESLVNKVADAAKKPVILVLLCGGPVDVTFAKNNPKIGAIVWAGYPGQ 361

Query: 414 AGGAAIADVLFGRANPG 430
           AGG AIA VLFG  NPG
Sbjct: 362 AGGIAIAQVLFGEHNPG 378


>gi|348667575|gb|EGZ07400.1| xylosidase [Phytophthora sojae]
          Length = 751

 Score =  407 bits (1047), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 211/444 (47%), Positives = 285/444 (64%), Gaps = 22/444 (4%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG--------- 51
            YN   AGLT+W+PNVNIFRDPRWGRGQETPGEDP LTG+YA ++VRGLQG         
Sbjct: 133 FYNEKNAGLTFWTPNVNIFRDPRWGRGQETPGEDPYLTGEYAVAFVRGLQGEAMEGHENK 192

Query: 52  NTGSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASV 111
           +    LK+++CCKH++AY  +    V R+  +A V+KQD  DTY   F+ CV  G V+S+
Sbjct: 193 DDNKFLKISSCCKHFSAYSQE----VPRHRNDAIVTKQDQADTYFPAFEDCVKRGHVSSI 248

Query: 112 MCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADA 171
           MCSYN VNG P+CAD  +L + +  QW+ DGYI SDC++V  +    H+T++PE+  A  
Sbjct: 249 MCSYNAVNGIPSCADKGLLTDLVRNQWKFDGYITSDCEAVADVIYRHHFTQSPEQTCATT 308

Query: 172 IKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNL 231
           + AG+DL+CG FL  H   A+  G++  E V+ AL     V MRLGMF  E   QPF N+
Sbjct: 309 LDAGMDLNCGEFLRQHLSSAIEQGIVSTEMVHNALKNQFRVMMRLGMF--EKGTQPFSNI 366

Query: 232 GPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLST---LRHHTVAVIGPNSDVTVTMIG 288
               V T AH+QLAL+AA Q +VLLKN   TLPL+T    +  ++A+IGP+ + +  ++G
Sbjct: 367 TKDAVDTAAHRQLALEAARQSVVLLKNEDNTLPLATDVFSKDGSLALIGPHFNASTALLG 426

Query: 289 NYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQL--IGAAEVAARQADATVLVMGL 346
           NY G+     TPL+G+S Y   +  A   G   +G  L     A    ++AD  V+ MGL
Sbjct: 427 NYFGIPSHIVTPLKGVSSYVPNV--AYSLGCKVSGEVLPDFDEAIEVVKKADRVVVFMGL 484

Query: 347 DQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
           DQS E E IDR  L LPG Q  L++R+  A+  P+VLVL+ GG VD+S  KN P++GAI+
Sbjct: 485 DQSQEREEIDRYHLKLPGFQIALLNRILAAASHPIVLVLISGGSVDLSLYKNHPKVGAIV 544

Query: 407 WVGYPGQAGGAAIADVLFGRANPG 430
           + GY GQAGG A+AD+LFG+ +P 
Sbjct: 545 FGGYLGQAGGQALADMLFGKYSPA 568


>gi|357128056|ref|XP_003565692.1| PREDICTED: beta-D-xylosidase 3-like [Brachypodium distachyon]
          Length = 821

 Score =  406 bits (1043), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 211/453 (46%), Positives = 282/453 (62%), Gaps = 28/453 (6%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT------- 53
           MYN G A LTYWSPN+N+ RDPRWGR  ETPGEDP L G++A S+VR +Q          
Sbjct: 165 MYNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFLVGRFAVSFVRAMQDIDDGANAGA 224

Query: 54  ------GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGK 107
                   RLKV++CCKHY AYD+D W G DR  F+A V ++D+ +T+  PF+ CV +G 
Sbjct: 225 GAADPFARRLKVSSCCKHYAAYDVDKWFGADRLSFDANVQERDMVETFERPFEMCVRDGD 284

Query: 108 VASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYT-RTPEE 166
            + VMCSYN++NG P CA+  +L  T+   W+L GYIVSDCDSV V+     +      +
Sbjct: 285 ASCVMCSYNRINGVPACANGRLLTGTVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYDGVQ 344

Query: 167 AAADAIKAGLDLDCG-------PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMF 219
           A A A+KAGLDLDCG        F   +   AVR G L+E +V+ AL +     MRLG F
Sbjct: 345 ATAAAMKAGLDLDCGMFWEGAKDFFTAYGLQAVRQGKLKEAEVDEALGHLYLTLMRLGFF 404

Query: 220 DGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPN 279
           DG P    F +LG  DVCT  H+++A +AA QG+VLLKN    LPL   + +++A++G  
Sbjct: 405 DGSPE---FQSLGASDVCTEEHKEMAAEAARQGMVLLKNDHDRLPLDANKVNSLALVGLL 461

Query: 280 SDVTVT--MIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQA 337
             +  T  M+G+Y G  C   TP + I +         C   AC G   +GAA +AA+  
Sbjct: 462 QHINATDVMLGDYRGKPCRVVTPYEAIRKVVSGTSMQACDKGAC-GTTALGAA-IAAKTV 519

Query: 338 DATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAK 397
           DAT+++ GL+ S+E E  DR  LLLP  Q + ++ VA+ASR P+ LV++  G VD+SFA+
Sbjct: 520 DATIVITGLNMSVEREGNDREDLLLPWDQTQWINAVAEASRDPITLVIISAGGVDISFAQ 579

Query: 398 NDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           N+P+IGAILW GYPG+ GG  IADVLFG+ NPG
Sbjct: 580 NNPKIGAILWAGYPGEEGGTGIADVLFGKYNPG 612


>gi|301110280|ref|XP_002904220.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
 gi|262096346|gb|EEY54398.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
          Length = 709

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 209/443 (47%), Positives = 285/443 (64%), Gaps = 24/443 (5%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN-------- 52
            +N   AGLT+W+PNVNIFRDPRWGRGQETPGEDP LTG+YA ++VRGLQG         
Sbjct: 96  FHNAKDAGLTFWTPNVNIFRDPRWGRGQETPGEDPYLTGEYAVAFVRGLQGEGMEGREVE 155

Query: 53  TGSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVM 112
               LK+++CCKH++AY  +    V R+  NA V+KQD  DTY   F+ CV  G V+S+M
Sbjct: 156 NSKFLKISSCCKHFSAYSQE----VPRHRNNAMVTKQDQADTYFPAFEDCVKRGHVSSIM 211

Query: 113 CSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAI 172
           CSYN VNG P+CAD  +L + + GQW+ DGYI SDC++V  + +  HYT++PE+  A  +
Sbjct: 212 CSYNAVNGIPSCADKGLLTDLVRGQWKFDGYIASDCEAVADVIDHHHYTQSPEQTCATTL 271

Query: 173 KAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLG 232
            AG+DL+CG FL  H   A+  G++  E ++ AL     V MRLGMF+     +PF N+ 
Sbjct: 272 DAGMDLNCGEFLRQHLPKALEQGIVTTEMIHNALKNQFRVLMRLGMFE---KVEPFANIT 328

Query: 233 PRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLST---LRHHTVAVIGPNSDVTVTMIGN 289
              V T  H+QLAL+AA Q IVLLKN   TLPL+T    R  ++A+IGP+ + +  ++GN
Sbjct: 329 KDSVDTTMHRQLALEAARQSIVLLKNDGNTLPLATKDFTRDRSLALIGPHFNASAALLGN 388

Query: 290 YAGVACGYTTPLQGISRYAKTI-HQAGCFGVACNGNQL--IGAAEVAARQADATVLVMGL 346
           Y G+     TPL+GIS++   + H  GC     +G  L     A   A++AD  ++ +GL
Sbjct: 389 YFGIPSHIVTPLEGISQFVPNVAHSLGC---KVSGEVLPDFDDAIAVAKKADRLIVFVGL 445

Query: 347 DQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
           DQS E E IDR  + LP  Q  L+ RV + +  P+V V++ GG VD+S  KN P++GAI+
Sbjct: 446 DQSQEREEIDRYHIGLPAFQSTLLKRVLEVASHPIVFVVISGGCVDLSAYKNHPKVGAIV 505

Query: 407 WVGYPGQAGGAAIADVLFGRANP 429
           + GY GQAGG A+ADVLFG+ NP
Sbjct: 506 FGGYLGQAGGQALADVLFGKYNP 528


>gi|62321271|dbj|BAD94481.1| beta-xylosidase [Arabidopsis thaliana]
          Length = 523

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 204/327 (62%), Positives = 245/327 (74%), Gaps = 2/327 (0%)

Query: 103 VVEGKVASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTR 162
           VV+G VASVMCSYNQVNGKPTCADPD+L   I G+W+L+GYIVSDCDSV VLY  QHYT+
Sbjct: 3   VVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQHYTK 62

Query: 163 TPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGE 222
           TP EAAA +I AGLDL+CG FL  HTE AV+ GL+ E  ++ A++      MRLG FDG 
Sbjct: 63  TPAEAAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLGFFDGN 122

Query: 223 PSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDV 282
           P  Q +G LGP DVCT A+Q+LA  AA QGIVLLKN+   LPLS     T+AVIGPN++V
Sbjct: 123 PKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKNTG-CLPLSPKSIKTLAVIGPNANV 181

Query: 283 TVTMIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVL 342
           T TMIGNY G  C YTTPLQG++    T +  GC  VAC    + GA ++AA  AD +VL
Sbjct: 182 TKTMIGNYEGTPCKYTTPLQGLAGTVSTTYLPGCSNVACAVADVAGATKLAA-TADVSVL 240

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           V+G DQSIEAE  DR  L LPG+QQELV +VAKA++GPV+LV+M GG  D++FAKNDP+I
Sbjct: 241 VIGADQSIEAESRDRVDLRLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAKNDPKI 300

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANP 429
             ILWVGYPG+AGG AIAD++FGR NP
Sbjct: 301 AGILWVGYPGEAGGIAIADIIFGRYNP 327


>gi|90399376|emb|CAJ86207.1| B1011H02.4 [Oryza sativa Indica Group]
          Length = 738

 Score =  404 bits (1039), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 203/433 (46%), Positives = 267/433 (61%), Gaps = 36/433 (8%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS--RLK 58
           M+N G AGLT+W+PN+N+FRDPRWGRGQETPGEDP +   Y+  YV+G Q + G   R+ 
Sbjct: 141 MHNAGQAGLTFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGEEGRMM 200

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           ++ACCKHY AYDL+ W G  RY FNA+V                                
Sbjct: 201 LSACCKHYIAYDLEKWRGFTRYTFNAKV-------------------------------- 228

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           NG P CA  DIL+     +W   GYI SDCD+V +++  Q YT + E++ A  +KAG+D+
Sbjct: 229 NGVPACARKDILQRA-RDEWGFQGYITSDCDAVAIIHENQTYTASDEDSIAVVLKAGMDI 287

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG FL  HT+ A+  G ++EED+N AL    +VQ+RLG FD     Q F  LGP +VCT
Sbjct: 288 NCGSFLIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCT 347

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
             H++LA +A  QG VLLKN    LPL       +A+IGP ++    + G+Y GV C  T
Sbjct: 348 TEHRELAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHST 407

Query: 299 TPLQGISRYA-KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           T ++G+  Y  KT   AGC  V CN     G A  AA++AD  VL+ GL+ + E E  DR
Sbjct: 408 TFVKGMQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDR 467

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             LLLPGRQ +L+  VA  ++ PVVLVLM GGPVDVSFAK+DPRI +ILW+GYPG+ GG 
Sbjct: 468 VSLLLPGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGN 527

Query: 418 AIADVLFGRANPG 430
            + ++LFG+ NPG
Sbjct: 528 VLPEILFGKYNPG 540


>gi|14164501|dbj|BAB55751.1| putative alpha-L-arabinofuranosidase/beta-D- xylosidase isoenzyme
           ARA-I [Oryza sativa Japonica Group]
          Length = 818

 Score =  403 bits (1036), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 216/455 (47%), Positives = 283/455 (62%), Gaps = 34/455 (7%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG------ 54
           MYN G A LTYWSPN+N+ RDPRWGR  ETPGEDP + G+YA ++VRG+Q   G      
Sbjct: 165 MYNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDIDGATTAAS 224

Query: 55  --------SR-LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVE 105
                   SR +KV++CCKHY AYD+D WNG DR  F+ARV ++D+ +T+  PF+ C+ +
Sbjct: 225 AAAATDAFSRPIKVSSCCKHYAAYDVDAWNGTDRLTFDARVQERDMVETFERPFEMCIRD 284

Query: 106 GKVASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYT-RTP 164
           G  + VMCSYN++NG P CAD  +L  T+   W+L GYIVSDCDSV V+     +   T 
Sbjct: 285 GDASCVMCSYNRINGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTG 344

Query: 165 EEAAADAIKAGLDLDCG-------PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLG 217
            EA A A+KAGLDLDCG        F   +   AVR G L+E  V+ AL       MRLG
Sbjct: 345 VEATAAAMKAGLDLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLG 404

Query: 218 MFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIG 277
            FDG P  +   +LG  DVCT  H++LA  AA QG+VLLKN A  LPLS  + ++VA+ G
Sbjct: 405 FFDGIPELE---SLGAADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFG 461

Query: 278 PNSDVTVT--MIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAAR 335
               +  T  M+G+Y G  C   TP  G+ +   +     C   +C+       A  AA+
Sbjct: 462 QLQHINATDVMLGDYRGKPCRVVTPYDGVRKVVSSTSVHACDKGSCD------TAAAAAK 515

Query: 336 QADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSF 395
             DAT++V GL+ S+E E  DR  LLLP  Q   ++ VA+AS  P+VLV+M  G VDVSF
Sbjct: 516 TVDATIVVAGLNMSVERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSF 575

Query: 396 AKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           A+++P+IGA++W GYPG+ GG AIADVLFG+ NPG
Sbjct: 576 AQDNPKIGAVVWAGYPGEEGGTAIADVLFGKYNPG 610


>gi|357153280|ref|XP_003576399.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
           distachyon]
          Length = 807

 Score =  400 bits (1027), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 214/455 (47%), Positives = 280/455 (61%), Gaps = 50/455 (10%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ---------- 50
           MYN G  GLTYWSPN+N+ RDPRWGR  ETPGEDP + G+YA ++VRG+Q          
Sbjct: 173 MYNLGKGGLTYWSPNINVVRDPRWGRALETPGEDPFVVGRYAVNFVRGMQDVDDAAAGFN 232

Query: 51  GNTGSR-LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVA 109
           G+  SR LK +ACCKHY AYD+D+W G  R+ F+ARV+++D+ +T+  PF+ CV +G  +
Sbjct: 233 GDPLSRPLKTSACCKHYAAYDVDDWYGHTRFKFDARVTERDMVETFQRPFEMCVRDGDAS 292

Query: 110 SVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYT-RTPEEAA 168
           +VMCSYN+VNG P CAD  +L  T+   W L GYIVSDCD+V V+ +   +   TP EA+
Sbjct: 293 AVMCSYNRVNGIPACADARLLAGTLRRDWGLHGYIVSDCDAVRVMTDNATWLGYTPAEAS 352

Query: 169 ADAIKAGLDLDCGP------------FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRL 216
           A ++KAGLDLDCG             FL+ +   AVR G +RE D++ AL    T  MRL
Sbjct: 353 AASLKAGLDLDCGESWIVQKGKPVMDFLSTYGMAAVRQGKMRESDIDNALVNLYTTLMRL 412

Query: 217 GMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVI 276
           G FDG P    + +L  +D+C+ AH+ LAL  A Q +VLLKN    LPL   +  +VAV 
Sbjct: 413 GYFDGMPR---YESLDEKDICSEAHRSLALDGARQSMVLLKNLDGLLPLDASKLASVAVR 469

Query: 277 GPNSDV-TVTMIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAAR 335
           GP+++     M G+Y G  C Y TP +GIS+      Q G                    
Sbjct: 470 GPHAEAPEKVMDGDYTGPPCRYITPREGISKDVNISQQGG-------------------- 509

Query: 336 QADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSF 395
             D T+ + G++  IE E  DR  LLLP  Q E + RVA AS  P+VLV++ GG +DVSF
Sbjct: 510 --DVTIYMGGINMHIEREGNDREDLLLPKNQTEEILRVAAASPSPIVLVILSGGGIDVSF 567

Query: 396 AKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           A++ P+IGAILW GYPG  GG AIADV+FGR NPG
Sbjct: 568 AQSHPKIGAILWAGYPGGEGGHAIADVIFGRYNPG 602


>gi|226491558|ref|NP_001146416.1| uncharacterized protein LOC100279996 [Zea mays]
 gi|223975771|gb|ACN32073.1| unknown [Zea mays]
          Length = 507

 Score =  399 bits (1026), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 202/319 (63%), Positives = 240/319 (75%), Gaps = 2/319 (0%)

Query: 112 MCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADA 171
           MCSYNQVNGKPTCAD D+L   I G W+L+GYI SDCDSV VLYN QHYT+TPE+AAA +
Sbjct: 1   MCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAIS 60

Query: 172 IKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNL 231
           IKAGLDL+CG FLA HT  AV+ G L E DV+ A+   +   MRLG FDG+P   PFGNL
Sbjct: 61  IKAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNL 120

Query: 232 GPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA 291
           GP DVCTP++Q+LA +AA QGIVLLKN+ + LPLS     ++AVIGPN++ + TMIGNY 
Sbjct: 121 GPSDVCTPSNQELAREAARQGIVLLKNTGK-LPLSATSIKSMAVIGPNANASFTMIGNYE 179

Query: 292 GVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQL-IGAAEVAARQADATVLVMGLDQSI 350
           G  C YTTPLQG+     T++Q GC  V C+GN L + AA  AA  AD TVLV+G DQSI
Sbjct: 180 GTPCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSI 239

Query: 351 EAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGY 410
           E E +DR  LLLPG+Q +LVS VA AS GP +LV+M GGP D+SFAK+  +I AILWVGY
Sbjct: 240 ERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVGY 299

Query: 411 PGQAGGAAIADVLFGRANP 429
           PG+AGGAAIADVLFG  NP
Sbjct: 300 PGEAGGAAIADVLFGYHNP 318


>gi|224068498|ref|XP_002302758.1| predicted protein [Populus trichocarpa]
 gi|222844484|gb|EEE82031.1| predicted protein [Populus trichocarpa]
          Length = 462

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 194/258 (75%), Positives = 217/258 (84%)

Query: 173 KAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLG 232
           +A LDLDCGPFL  HTE AVR GLL E ++N AL  T+TVQMRLGMFDGEPS++P+GNLG
Sbjct: 5   QASLDLDCGPFLGQHTEDAVRKGLLTEAEINNALLNTLTVQMRLGMFDGEPSSKPYGNLG 64

Query: 233 PRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAG 292
           P DVCTPAHQ+LAL+AA QGIVLLKN    LPLST  H +VA+IGPNS+VTVTMIGNYAG
Sbjct: 65  PTDVCTPAHQELALEAARQGIVLLKNHGPPLPLSTRHHQSVAIIGPNSNVTVTMIGNYAG 124

Query: 293 VACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
           VACGYTTPLQGI RYAKTI+Q GC  VAC  +Q   AA  AARQADATVLVMGLDQSIEA
Sbjct: 125 VACGYTTPLQGIGRYAKTIYQQGCADVACVSDQQFVAAMDAARQADATVLVMGLDQSIEA 184

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E  DR  LLLPGRQQEL+S+VA AS+GP +LVLM GGP+DVSFA+NDP+IG I+W GYPG
Sbjct: 185 ESRDRTELLLPGRQQELISKVAAASKGPTILVLMSGGPIDVSFAENDPKIGGIVWAGYPG 244

Query: 413 QAGGAAIADVLFGRANPG 430
           QAGGAAI+DVLFG  NPG
Sbjct: 245 QAGGAAISDVLFGTTNPG 262


>gi|326431595|gb|EGD77165.1| beta-glucosidase [Salpingoeca sp. ATCC 50818]
          Length = 900

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 210/431 (48%), Positives = 282/431 (65%), Gaps = 4/431 (0%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ-GNTGSRLKV 59
           M N   AG T+W+PN+NI RDPRWGRGQETPGEDP  TG+YAA++V G Q G   + +K 
Sbjct: 283 MNNVQRAGNTFWAPNINIIRDPRWGRGQETPGEDPFATGEYAANFVSGFQDGEDMNYIKA 342

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           ++CCKH+  Y+L+NW+GVDR+H+NA  + QD+ DTY   F+ACV  G+ + +MCSYN VN
Sbjct: 343 SSCCKHFFDYNLENWHGVDRHHYNAIATDQDIADTYLPSFEACVRYGRASGLMCSYNAVN 402

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           G P+CA+ DI+       W  DGYI SDC +V  + N+  +TR   E     ++AG+D D
Sbjct: 403 GVPSCANGDIMTVMARESWGFDGYITSDCGAVADVLNSHKFTRNTSETIRAVLEAGMDTD 462

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG F+  +   A++ G++  E VN AL     VQ RLG+FD   S QP+ N     V TP
Sbjct: 463 CGSFVQQYLAKAMQEGVVPRELVNTALHRLFMVQFRLGLFD-PVSKQPYTNYSVARVNTP 521

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           A+QQLAL+AA QGIVLLKN+   LPL T  H  VA+IGPN+D T  M GNY G A    +
Sbjct: 522 ANQQLALEAAQQGIVLLKNTNARLPLKTGLH--VALIGPNADATTVMQGNYQGTAPFLIS 579

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           P++G   Y+  +  A    VAC       AA  AA++ADA V+V+GLDQ  E+E  DR  
Sbjct: 580 PVRGFKNYSAAVTYAKGCDVACKDTSGFDAAVAAAKEADAVVVVVGLDQGQESEGHDRTS 639

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           + LPG Q++LV++VA A++ P+V+ +M GG VD+S  K +  +  ILW GYPGQ+GG A+
Sbjct: 640 ITLPGHQEDLVAQVAAAAKSPIVVFVMTGGAVDLSTIKANKNVAGILWCGYPGQSGGQAM 699

Query: 420 ADVLFGRANPG 430
           ADV+FG  +PG
Sbjct: 700 ADVVFGAVSPG 710


>gi|340370204|ref|XP_003383636.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 755

 Score =  392 bits (1007), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 207/433 (47%), Positives = 277/433 (63%), Gaps = 11/433 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR--LKVA 60
           N G +GLT+++PN+NI+RDPRWGRGQETPGEDP LT +YAA++V+G+Q  +  R  LK  
Sbjct: 159 NNGQSGLTFFAPNINIYRDPRWGRGQETPGEDPYLTSQYAANFVKGIQEGSEDRRYLKAI 218

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A CKHY AY+L+ +  V R +FNA VS QDLE+TY   FKACV EG+V S+MCSYN +NG
Sbjct: 219 ATCKHYAAYNLERYLDVRRVNFNAIVSDQDLEETYLPAFKACVQEGQVGSIMCSYNAING 278

Query: 121 KPTCADPDILKNTI-HGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
            P CA+ D + N I    W  +GYIVSDC ++  +    +YT       ADA+K G DL+
Sbjct: 279 VPNCAN-DFINNKIARDTWGFEGYIVSDCGAILDIQYKHNYTSDTNITVADALKGGCDLN 337

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG F   + E A     + EED++ +L    T +MRLGMFD  P  QPF     +DV TP
Sbjct: 338 CGHFYEKYMEDAFDNSTITEEDIDKSLTRLFTSRMRLGMFD-PPEIQPFRQYSVKDVNTP 396

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
             Q LAL AA +GIVLL+N    LPL  ++H  +A IGPN+D T  M GNY G+A    +
Sbjct: 397 EAQDLALNAAREGIVLLQNKGSVLPLDIVKHSNIAAIGPNADATHIMQGNYHGIAPYLIS 456

Query: 300 PLQGISRYA-KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           PLQG S       +Q GC  VACN  +    A  A +  DA + V+GL+ + E E  DR 
Sbjct: 457 PLQGFSNLGINATYQIGC-PVACNDTEGFPDAVKAVQGVDAVIAVIGLNNTQEGESHDRT 515

Query: 359 GLLLPGRQQELVSRVAK-ASRG-PVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
            + LPG Q++L+  + K A++G P+++V+M GG VD++  K+     AILW GYPGQ+GG
Sbjct: 516 SIALPGHQEDLLLELKKNAAKGTPLIVVVMSGGSVDLTGVKD--IADAILWAGYPGQSGG 573

Query: 417 AAIADVLFGRANP 429
            AIA+V++G+ NP
Sbjct: 574 QAIAEVIYGKVNP 586


>gi|326523729|dbj|BAJ93035.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 810

 Score =  391 bits (1004), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 205/451 (45%), Positives = 275/451 (60%), Gaps = 31/451 (6%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS----- 55
           MYN G A LTYWSPN+N+ RDPRWGR  ETPGEDP + G+YA S+VR +Q   G+     
Sbjct: 164 MYNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVSFVRAMQDIDGAGPGAG 223

Query: 56  ------RLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVA 109
                  +KV++CCKHY AYD+D W   DR  F+A+V ++D+ +T+  PF+ CV +G  +
Sbjct: 224 ADPFARPIKVSSCCKHYAAYDVDAWLTADRLTFDAQVEERDMIETFERPFEMCVRDGDAS 283

Query: 110 SVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYT-RTPEEAA 168
            VMCSYN++NG P CA+  +L  T+ G+W+L GYIVSDCDSV V+     +      EA 
Sbjct: 284 CVMCSYNRINGVPACANARLLSETVRGEWQLHGYIVSDCDSVRVMVRDAKWLGYNGVEAT 343

Query: 169 ADAIKAGLDLDCG-------PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDG 221
           A A+KAGLDLDCG        F       AVR G LRE +V+ AL       MRLG FDG
Sbjct: 344 AAAMKAGLDLDCGMFWEGAQDFFTAFGLDAVRQGKLRESEVDNALRNLYLTLMRLGFFDG 403

Query: 222 EPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSD 281
            P  +   +LG  DVCT  H++LA  AA QG+VL+KN    LPL T + ++++++G    
Sbjct: 404 IPELE---SLGANDVCTEEHKELAADAARQGMVLIKNDHGRLPLDTSKVNSLSLVGLLQH 460

Query: 282 VTVT--MIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADA 339
           +  T  M+G+Y G  C   TP   I +         C   AC       +     +  DA
Sbjct: 461 INATDVMLGDYRGKPCRVVTPYDAIRKVVSATSMQVCDHGAC-------STAANGKTVDA 513

Query: 340 TVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKND 399
           T+++ GL+ S+E E  DR  LLLP  Q   ++ VA+AS  P++LV++  G VDVSFA+N+
Sbjct: 514 TIVIAGLNMSVEKEGNDREDLLLPWNQTNWINAVAEASPYPIILVIISAGGVDVSFAQNN 573

Query: 400 PRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           P+IGAI+W GYPG+ GG AIADVLFG+ NPG
Sbjct: 574 PKIGAIVWAGYPGEEGGTAIADVLFGKYNPG 604


>gi|163889365|gb|ABY48135.1| beta-D-xylosidase [Medicago truncatula]
          Length = 776

 Score =  388 bits (996), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 200/437 (45%), Positives = 276/437 (63%), Gaps = 34/437 (7%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR---- 56
           M+N G AGL++W+PNVN+FRDPRWGRGQETPGEDP++   YA  +VRG+QG  G +    
Sbjct: 141 MFNVGQAGLSFWAPNVNVFRDPRWGRGQETPGEDPMVGSAYAVEFVRGIQGVDGIKKVLN 200

Query: 57  --------LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKV 108
                   L V+ACCKH+TAYDL+ W    RY+FNA V      +TY  PF+ CV +GK 
Sbjct: 201 DHDSDDDGLMVSACCKHFTAYDLEKWGEFSRYNFNAVV------NTYQPPFRGCVQQGKA 254

Query: 109 ASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGY-IVSDCDSVGVLY-NTQHYTRTPEE 166
           + +MCSYN+VNG P CA  D+L   +  +W  +G  I+     + +L+ + +     P+ 
Sbjct: 255 SCLMCSYNEVNGVPACASKDLL-GLVRNKWGFEGVGILPQTVMLWLLFLSIKSMQNLPKM 313

Query: 167 AAADAIKA-----------GLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMR 215
                +K             +D++CG F+  HTE A+  GL++EED++ AL    +VQMR
Sbjct: 314 LLLMFLKQVFFYVFENLWFCMDINCGTFMLRHTESAIEQGLVKEEDLDRALFNLFSVQMR 373

Query: 216 LGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAV 275
           LG+F+G+P    FG LGP+DVCTP H++LAL+AA QGIVLLKN  + LPL      ++A+
Sbjct: 374 LGLFNGDPEKGKFGKLGPQDVCTPEHKKLALEAARQGIVLLKNDNKFLPLDKKDRVSLAI 433

Query: 276 IGPNSDVTVTMIGNYAGVACGYTTPLQGISRYAKTIHQA-GCFGVACNGNQLIGAAEVAA 334
           IGP +  T  + G Y+G+ C   +   G+  Y KTI  A GC  V C+ +     A   A
Sbjct: 434 IGPMA-TTSELGGGYSGIPCSPRSLYDGLKEYVKTISYAFGCSDVKCDSDDGFAVAIDIA 492

Query: 335 RQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVS 394
           +QAD  V+V GLD ++E E +DR  LLLPG+Q +LVSRVA AS+ PV+LVL  GGP+DVS
Sbjct: 493 KQADFVVIVAGLDTTLETEDLDRVSLLLPGKQMDLVSRVAAASKRPVILVLTGGGPLDVS 552

Query: 395 FAKNDPRIGAILWVGYP 411
           FA+++  I +ILW+GYP
Sbjct: 553 FAESNQLITSILWIGYP 569


>gi|413954831|gb|AFW87480.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 814

 Score =  387 bits (995), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 208/452 (46%), Positives = 274/452 (60%), Gaps = 48/452 (10%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ-------GNT 53
           MYN G  GLTYWSPN+N+ RDPRWGR  ETPGEDP + G+YA ++VRG+Q       G+ 
Sbjct: 182 MYNLGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVAGRYAVNFVRGMQDIPGHYSGDP 241

Query: 54  GSR-LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVM 112
            +R +K +ACCKH+ AYD+DNW+   R+ ++ARVS++D+ +T+  PF+ CV EG V+SVM
Sbjct: 242 SARPIKTSACCKHHAAYDVDNWHNQTRFTYDARVSERDMAETFLRPFEMCVREGDVSSVM 301

Query: 113 CSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLY-NTQHYTRTPEEAAADA 171
           CSYN+VNG P CAD  +L  T+ G+W L+GYIVSDCD+V V+  N      T  E++A +
Sbjct: 302 CSYNRVNGVPACADARLLSGTVRGEWHLNGYIVSDCDAVRVMTDNATWLNFTAAESSAVS 361

Query: 172 IKAGLDLDCGP------------FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMF 219
           ++AG+DLDC              +L+ +   AV  G +RE D++ AL       MRLG F
Sbjct: 362 LRAGMDLDCAESWIEEEGRPLRDYLSEYGMAAVAQGKMRESDIDNALTNLYMTLMRLGYF 421

Query: 220 DGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPN 279
           D  P    + +L   DVCT  H+ LAL  A QGIVLLKN    LPL   +   VAV GP+
Sbjct: 422 DNIPR---YASLNETDVCTDEHKSLALDGARQGIVLLKNDHGLLPLDPKKTLAVAVHGPH 478

Query: 280 SDV-TVTMIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQAD 338
           +      M G+Y G  C Y TP QGISR  K  H                       +A 
Sbjct: 479 ARAPEKIMDGDYTGPPCRYVTPRQGISRDVKISH-----------------------KAK 515

Query: 339 ATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKN 398
            T+ + G++  IE E  DR  LLLP  Q E +   A+AS  P++LV++ GG +D+SFA+ 
Sbjct: 516 MTIYLGGINLYIEREGNDREDLLLPKNQTEEILHFAQASPTPIILVILSGGGIDISFAQK 575

Query: 399 DPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            P+IGAILW GYPG  GG AIADV+FGR NPG
Sbjct: 576 HPKIGAILWAGYPGGEGGNAIADVIFGRYNPG 607


>gi|320170454|gb|EFW47353.1| beta-xylosidase [Capsaspora owczarzaki ATCC 30864]
          Length = 779

 Score =  387 bits (993), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 210/432 (48%), Positives = 277/432 (64%), Gaps = 8/432 (1%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR-LKVAA 61
           N G AGLTY++PN+NIFRDPRWGRGQETPGEDP LT +Y  + V+ LQ    +R LKV A
Sbjct: 146 NAGHAGLTYFTPNINIFRDPRWGRGQETPGEDPYLTSRYVETLVQNLQNGEDARYLKVVA 205

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKHYTAYD+++W G+DR+HFNA VS QDL +T+  PF+ACV  GK AS+MCSYN VNG 
Sbjct: 206 TCKHYTAYDMEDWGGIDRFHFNAVVSDQDLVETFMPPFEACVRVGKGASLMCSYNAVNGI 265

Query: 122 PTCADPDILKNTI-HGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
           P+CAD D + N I   QW  DGYIVSDC ++  +  T +YT T +   A  I+ G DLDC
Sbjct: 266 PSCAD-DFINNEIAREQWGFDGYIVSDCGAIDCIQYTHNYTNTTQATCAAGIQGGCDLDC 324

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           G F   H   A+    L E D++ +L      ++RLG FD   S QP+  +    + +  
Sbjct: 325 GDFYQSHLMDAIGNATLHEADLDFSLRRLFGHRIRLGEFDAA-SIQPYRQIPVSAINSQE 383

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           HQ+LALQ A + IVLL N   TLP S      +A+IGPN+D   T++GNY G A    TP
Sbjct: 384 HQELALQIARESIVLLGNDNNTLPFSLATVRKLAIIGPNADDAETLLGNYYGDAPYLITP 443

Query: 301 LQGISRYAKTIHQAGCFGVACNGNQLIG--AAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           L+G  +   T+      G   N     G  AA  AA+ ADAT++V+GL+Q++E+E +DR 
Sbjct: 444 LKGFQQLDPTLSITFVKGCDVNSTDTSGFVAAAAAAKAADATIVVVGLNQTVESENLDRT 503

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            L+LPG Q EL+  +  A+RGPV+LV+M G P+D+S   +  R  A LW+GYPGQAGG A
Sbjct: 504 TLVLPGVQAELILALTAAARGPVILVVMSGSPIDLSNVIHPVR--AALWIGYPGQAGGRA 561

Query: 419 IADVLFGRANPG 430
           +A+ +FG  +P 
Sbjct: 562 LAEAVFGVFSPA 573


>gi|340370206|ref|XP_003383637.1| PREDICTED: probable beta-D-xylosidase 5-like [Amphimedon
           queenslandica]
          Length = 728

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 200/436 (45%), Positives = 278/436 (63%), Gaps = 14/436 (3%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ-GNTGSR-LKVA 60
           NGG AGLTY++PN+NIFRDPRWGRGQETPGEDP L+ +YAA++V+G+Q G   +R LK  
Sbjct: 132 NGGQAGLTYFAPNINIFRDPRWGRGQETPGEDPYLSSQYAANFVKGMQEGADDTRYLKTI 191

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A CKHY AYDL+N+  + R+ FNA VS QD E+TY   F++CV EGKV S+MCSYN VNG
Sbjct: 192 ATCKHYAAYDLENYLNLSRHTFNAIVSDQDFEETYFPAFRSCVEEGKVGSIMCSYNAVNG 251

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
            P+CA+  I      G+W  +GY+VSDC ++  + N+  YT   ++  A  ++ G DL+C
Sbjct: 252 VPSCANDFINNEVARGKWGFEGYVVSDCGAISDIINSHKYTSNTDDTVAAGLRGGCDLNC 311

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           G F + H + A   G + ++D++ A+    T +MRLGMFD  PS QPF +     V T  
Sbjct: 312 GHFYSDHAQAAYDNGAITDDDIDRAMTRLFTYRMRLGMFD-PPSMQPFRDYTNDKVDTKQ 370

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H+ LAL A+ + IVLL+N+   LPLS   H  +A++GP+      M GNY G A    +P
Sbjct: 371 HEALALDASRESIVLLQNNKDILPLSLTTHRKIALVGPHGQAQGAMQGNYKGTAPYLISP 430

Query: 301 LQGISRYAKTI-HQAGCFGVACNGNQLIGAAEVAA----RQADATVLVMGLDQSIEAEFI 355
           +QG+     ++   AGC  VAC    + G +EV         +A + V+GLD+S E+E  
Sbjct: 431 MQGLQDLGLSVTFAAGCTQVAC--PTIAGFSEVTKLVEEHSIEAIIAVIGLDESQESEGH 488

Query: 356 DRAGLLLPGRQQELVSRVAKASRG--PVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 413
           DR  L LPG+Q +L+  + K +    P ++V+M GGPVD+S  K+     AILW GYPGQ
Sbjct: 489 DRTSLTLPGQQVQLLEDIKKKAVPGIPFIVVVMSGGPVDLSGVKD--IADAILWAGYPGQ 546

Query: 414 AGGAAIADVLFGRANP 429
           +GG AIA+V++G+ NP
Sbjct: 547 SGGQAIAEVIYGKVNP 562


>gi|115486735|ref|NP_001068511.1| Os11g0696400 [Oryza sativa Japonica Group]
 gi|77552754|gb|ABA95551.1| Glycosyl hydrolase family 3 C terminal domain containing protein
           [Oryza sativa Japonica Group]
 gi|113645733|dbj|BAF28874.1| Os11g0696400 [Oryza sativa Japonica Group]
          Length = 816

 Score =  384 bits (985), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 213/456 (46%), Positives = 276/456 (60%), Gaps = 53/456 (11%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ---------- 50
           MYN G  GLTYWSPN+N+ RDPRWGR  ETPGEDP + G+YA ++VRG+Q          
Sbjct: 181 MYNMGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQDIPGHEAVAA 240

Query: 51  -GNTGSR-LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKV 108
            G+  +R LK +ACCKHY AYDLD+W+   R+ F+ARV ++D+ +T+  PF+ CV +G V
Sbjct: 241 GGDPNTRPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPFEMCVRDGDV 300

Query: 109 ASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYT-RTPEEA 167
           +SVMCSYN+VNG P CAD  +L  TI   W L GYIVSDCD+V V+ +   +   T  EA
Sbjct: 301 SSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNATWLGYTGAEA 360

Query: 168 AADAIKAGLDLDCGP-------------FLAIHTEGAVRGGLLREEDVNLALAYTITVQM 214
           +A A+KAGLDLDCG              FL  +   AV  G +RE D++ AL       M
Sbjct: 361 SAAALKAGLDLDCGESWKNDTDGHPLMDFLTTYGMEAVNKGKMRESDIDNALTNQYMTLM 420

Query: 215 RLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVA 274
           RLG FD       + +LG +D+CT  H+ LAL  A QGIVLLKN  + LPL   +   V 
Sbjct: 421 RLGYFD---DIAQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDANKVGFVN 477

Query: 275 VIGPNSDV-TVTMIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVA 333
           V GP+       M G+Y G  C Y TP QG+S+Y +  H                     
Sbjct: 478 VRGPHVQAPEKIMDGDYTGPPCRYVTPRQGVSKYVRFSH--------------------- 516

Query: 334 ARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDV 393
             +A+ T+   GL+ +IE E  DR  +LLP  Q E + RVAKAS  P++LV++ GG +DV
Sbjct: 517 --RANTTIYFGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILVILSGGGIDV 574

Query: 394 SFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           SFA+N+P+IGAILW GYPG  GG AIADV+FG+ NP
Sbjct: 575 SFAQNNPKIGAILWAGYPGGEGGNAIADVIFGKHNP 610


>gi|125535311|gb|EAY81859.1| hypothetical protein OsI_37025 [Oryza sativa Indica Group]
          Length = 816

 Score =  383 bits (984), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 214/456 (46%), Positives = 276/456 (60%), Gaps = 53/456 (11%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ---------- 50
           MYN G  GLTYWSPN+N+ RDPRWGR  ETPGEDP + G+YA ++VRG+Q          
Sbjct: 180 MYNMGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQDIPGHEAVAA 239

Query: 51  -GNTGSR-LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKV 108
            G+  +R LK +ACCKHY AYDLD+W+   R+ F+ARV ++D+ +T+  PF+ CV +G V
Sbjct: 240 GGDPNTRPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPFEMCVRDGDV 299

Query: 109 ASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYT-RTPEEA 167
           +SVMCSYN+VNG P CAD  +L  TI   W L GYIVSDCD+V V+ +   +   T  EA
Sbjct: 300 SSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNATWLGYTGAEA 359

Query: 168 AADAIKAGLDLDCGPFLAIHTEG-------------AVRGGLLREEDVNLALAYTITVQM 214
           +A A+KAGLDLDCG      TEG             AV  G +RE D++ AL       M
Sbjct: 360 SAAALKAGLDLDCGESWKNDTEGHPLMDFLTTYGMEAVNKGKMRESDIDNALTNQYMTLM 419

Query: 215 RLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVA 274
           RLG FD       + +LG +D+CT  H+ LAL  A QGIVLLKN  + LPL   +   V 
Sbjct: 420 RLGYFD---DITQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDANKVGFVN 476

Query: 275 VIGPNSDV-TVTMIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVA 333
           V GP+       M G+Y G  C Y TP QG+S+Y +  H                     
Sbjct: 477 VRGPHVQAPEKIMDGDYTGPPCRYVTPRQGVSKYVRFSH--------------------- 515

Query: 334 ARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDV 393
             +A+ T+   GL+ +IE E  DR  +LLP  Q E + RVAKAS  P++LV++ GG +DV
Sbjct: 516 --RANTTIYFGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILVILSGGGIDV 573

Query: 394 SFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           SFA+N+P+IGAILW GYPG  GG AIADV+FG+ NP
Sbjct: 574 SFAQNNPKIGAILWAGYPGGEGGNAIADVIFGKHNP 609


>gi|242093144|ref|XP_002437062.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
 gi|241915285|gb|EER88429.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
          Length = 809

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 205/454 (45%), Positives = 269/454 (59%), Gaps = 50/454 (11%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR---- 56
           MYN G  GLTYWSPN+N+ RDPRWGR  ETPGEDP + G+YA ++VRG+Q   G      
Sbjct: 175 MYNLGKGGLTYWSPNINVVRDPRWGRALETPGEDPFVAGRYAVNFVRGMQDIPGHDGGGD 234

Query: 57  ------LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVAS 110
                 +K +ACCKHY AYD+D+W+   R+ F+ARVS++D+ +T+  PF+ CV +G  + 
Sbjct: 235 DPSTRPIKTSACCKHYAAYDVDDWHNHTRFTFDARVSERDMAETFLRPFEMCVRDGDASG 294

Query: 111 VMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTR-TPEEAAA 169
           VMCSYN+VNG P CAD  +L  TI G W+L GYIVSDCD+V V+ +   +   T  E++A
Sbjct: 295 VMCSYNRVNGIPACADARLLSGTIRGDWQLHGYIVSDCDAVRVMTDNATWLHFTGAESSA 354

Query: 170 DAIKAGLDLDCGP------------FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLG 217
            +I+AGLDLDC              FL+ + + AV  G +RE D++ AL       MRLG
Sbjct: 355 ASIRAGLDLDCAESWIEEKGRPLRDFLSEYGKAAVAQGKMRESDIDSALRNQYMTLMRLG 414

Query: 218 MFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIG 277
            FD  P    + +L   D+CT  H+ LA   A QG+VLLKN    LPL   +   VAV G
Sbjct: 415 YFDNIPR---YASLNETDICTDEHKSLAHDGARQGMVLLKNDDGLLPLDPEKILAVAVHG 471

Query: 278 PNSDV-TVTMIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQ 336
           P++      M G+Y G  C Y TP QGIS+  K  H                       +
Sbjct: 472 PHARAPEKIMDGDYTGPPCRYVTPRQGISKDVKISH-----------------------R 508

Query: 337 ADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFA 396
           A+ T+ + G++  IE E  DR  LLLP  Q E +   AKAS  P++LV++ GG +D+SFA
Sbjct: 509 ANTTIYLGGINLHIEREGNDREDLLLPKNQTEEILHFAKASPNPIILVILSGGGIDISFA 568

Query: 397 KNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
              P+IGAILW GYPG  GG AIADV+FGR NPG
Sbjct: 569 HKHPKIGAILWAGYPGGEGGNAIADVIFGRYNPG 602


>gi|51507369|emb|CAH18932.1| beta-xylosidase [Pyrus communis]
          Length = 238

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 187/238 (78%), Positives = 205/238 (86%)

Query: 192 VRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQ 251
           +R G + E D+N ALA TITVQMRLGMFDGEPS Q +GNLG  DVC P+  +LAL+AA Q
Sbjct: 1   MRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLGLADVCKPSSNELALEAARQ 60

Query: 252 GIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRYAKTI 311
           GIVLL+N   +LPLST+RH TVAVIGPNSDVT TMIGNYAGVACGYTTPLQGI+RY +TI
Sbjct: 61  GIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAGVACGYTTPLQGIARYTRTI 120

Query: 312 HQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVS 371
           HQAGC  V CNGNQLIGAAEVAARQADATVLV+GLDQSIEAEF DR  LLLPG QQELVS
Sbjct: 121 HQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEAEFRDRTNLLLPGHQQELVS 180

Query: 372 RVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           RVA+ASRGP +LV+M GGP+DV FAKNDPRIGAI+WVGYPGQAGG AIADVLFG  NP
Sbjct: 181 RVARASRGPTILVIMSGGPIDVMFAKNDPRIGAIIWVGYPGQAGGTAIADVLFGTTNP 238


>gi|94442924|emb|CAJ91136.1| beta-xylosidase [Platanus x acerifolia]
          Length = 231

 Score =  380 bits (976), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 181/231 (78%), Positives = 205/231 (88%), Gaps = 3/231 (1%)

Query: 42  AASYVRGLQGNTG--SRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPF 99
           AASYVRGLQ   G  SRLKVAACCKHYTAYDLDNW G+DR+HFNARVSKQDLEDT+NVPF
Sbjct: 1   AASYVRGLQQPYGRDSRLKVAACCKHYTAYDLDNWKGIDRFHFNARVSKQDLEDTFNVPF 60

Query: 100 KACVVEGKVASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQH 159
           + CVVEGKVASVMCSYNQVNG PTCADP++L+NTI G+WRL+GYIV+DCDS+GVLY+TQH
Sbjct: 61  RECVVEGKVASVMCSYNQVNGIPTCADPNLLRNTIRGEWRLNGYIVTDCDSIGVLYDTQH 120

Query: 160 YTRTPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMF 219
           YT TPEE+AADAIKAG+DLDCGPFLA+HT+ A+   +L E  V+ ALA T+ VQMRLGMF
Sbjct: 121 YTSTPEESAADAIKAGVDLDCGPFLAVHTQEAITRRMLSEVYVDGALANTLAVQMRLGMF 180

Query: 220 DGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRH 270
           DGEPSAQPFG+LGPRDVCT AHQQLAL+AA QGIVL+KN   +LPLST RH
Sbjct: 181 DGEPSAQPFGHLGPRDVCTSAHQQLALEAARQGIVLMKNQG-SLPLSTARH 230


>gi|167525174|ref|XP_001746922.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774702|gb|EDQ88329.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1620

 Score =  370 bits (950), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 196/431 (45%), Positives = 261/431 (60%), Gaps = 24/431 (5%)

Query: 1    MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR-LKV 59
            M N   AGLT+W+PN+NI RDPRWGRGQETPGEDP  TG YAA++V G+Q    +R +K 
Sbjct: 1025 MNNVNQAGLTFWAPNINIIRDPRWGRGQETPGEDPYATGLYAANFVPGMQEGEDTRYIKA 1084

Query: 60   AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
            ++CCKH+  Y+L++W+ VDR+HFNA  + QD+ DTY   F++CV  G+ +S+MCSYN VN
Sbjct: 1085 SSCCKHFFDYNLEDWHNVDRHHFNAIATDQDIADTYLPAFESCVRFGRASSLMCSYNAVN 1144

Query: 120  GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
            G P+CA+ DI+       W  DGYI SDC +V  +Y+   Y  T        + AG+D+D
Sbjct: 1145 GVPSCANADIMTTLAREAWGFDGYITSDCGAVEDVYSNHKYYNTTGATVNGVLSAGMDVD 1204

Query: 180  CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
            CG FL+ H   A+  G +    V+ AL     VQ RLGMFD     QP+ NL    V TP
Sbjct: 1205 CGSFLSQHLADAIDSGDVTNATVDQALYNLFRVQFRLGMFD-PAEDQPYLNLTTDAVNTP 1263

Query: 240  AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
             HQQLAL+AA QG+ LL+N    LPL       +A+IGPN++ T  M GNY G A    +
Sbjct: 1264 EHQQLALEAARQGMTLLENRDSRLPLDASSIKQLALIGPNANATGVMQGNYNGKAPFLIS 1323

Query: 300  PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
            P QG+ +Y   +              L   A  AA+ AD  V+V+GLDQ+ E+E  DR  
Sbjct: 1324 PQQGVQQYVSNV-------------ALELGAVTAAKAADTVVMVIGLDQTQESEGHDREI 1370

Query: 360  LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
            + LPG Q ELV++VA AS  P+V+V+M GG VD++  K+   +         GQAGG A+
Sbjct: 1371 IALPGMQAELVAQVANASSSPIVVVVMTGGAVDLTPVKDLDNV---------GQAGGQAL 1421

Query: 420  ADVLFGRANPG 430
            A+ LFG  NPG
Sbjct: 1422 AETLFGDNNPG 1432


>gi|340377241|ref|XP_003387138.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 733

 Score =  365 bits (937), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 197/434 (45%), Positives = 258/434 (59%), Gaps = 12/434 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           + G  GLTY++PN+NI RDPRWGRGQET GEDP LT +YA + VRG QGN     K+ A 
Sbjct: 138 DNGQGGLTYFTPNINIVRDPRWGRGQETAGEDPYLTSQYAVNLVRGAQGNDSEYKKIIAT 197

Query: 63  CKHYTAYDLDNW-NGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
           CKH+ AYDL+++ NG  R  FNA V+KQDLE+TY   F++CV  G V S+MCSYN VNG 
Sbjct: 198 CKHFAAYDLESYINGDVRDSFNAEVTKQDLEETYFPAFRSCVTAGGVGSIMCSYNSVNGV 257

Query: 122 PTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG 181
           P+C D          +W+ DGY+VSDC ++  + N  HYT TP +  A  +K G DL+CG
Sbjct: 258 PSCVDGVFNNKIARNKWKFDGYLVSDCGAIDDVMNKHHYTSTPTDTVAAGLKGGTDLNCG 317

Query: 182 PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAH 241
            F   H   A   G + E D++ A+    T +MRLG+FD  P  QP+       V T  H
Sbjct: 318 SFYQTHAMDAFLNGSITEVDIDRAVGRLFTARMRLGLFD-LPKYQPYSYFNTDVVNTKQH 376

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
           Q LALQAA + IVLL+N+ + LPLS   HH +AV+GPN    VTM G    +A    +P+
Sbjct: 377 QDLALQAARESIVLLQNNGK-LPLSYEDHHKIAVVGPNILANVTMQGISQVIAPYLISPV 435

Query: 302 QGI-SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
            G  S+     +  GC  V C        A    + A A V VMGLDQ IE E +DR  +
Sbjct: 436 DGFKSKGLHVTYSLGC-DVKCIVTDGFHDAFKLVKDAKAVVAVMGLDQGIERETVDREDI 494

Query: 361 LLPGRQQELV-----SRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
            LPG Q + +     +     S  P+++V+M G  VD+S +K+     AILWVGYPGQ+G
Sbjct: 495 FLPGLQDKFLLGLRDTLTNLQSPVPLIVVIMSGSSVDLSESKS--LADAILWVGYPGQSG 552

Query: 416 GAAIADVLFGRANP 429
           G AIA+V++G  NP
Sbjct: 553 GQAIAEVIYGEVNP 566


>gi|334187562|ref|NP_196532.2| Glycosyl hydrolase family protein [Arabidopsis thaliana]
 gi|332004052|gb|AED91435.1| Glycosyl hydrolase family protein [Arabidopsis thaliana]
          Length = 526

 Score =  363 bits (931), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 185/309 (59%), Positives = 226/309 (73%), Gaps = 10/309 (3%)

Query: 130 LKNTIHGQWRLDG--------YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG 181
           L +T+H Q  L          YIVSDCDS+G+LY +QHYT+TPEEAAA +I AGLDL+CG
Sbjct: 24  LVHTVHSQLGLGSPDLRVCFRYIVSDCDSLGILYGSQHYTKTPEEAAAKSILAGLDLNCG 83

Query: 182 PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAH 241
            FL  HTE AV+ GL+ E  +N A++      MRLG FDG P  QP+G LGP+DVCT  +
Sbjct: 84  SFLGNHTENAVKKGLIDEAAINKAISNNFATLMRLGFFDGNPKNQPYGGLGPKDVCTVEN 143

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
           ++LA++ A QGIVLLKNSA +LPLS     T+AVIGPN++VT TMIGNY GVAC YTTPL
Sbjct: 144 RELAVETARQGIVLLKNSAGSLPLSPSAIKTLAVIGPNANVTKTMIGNYEGVACKYTTPL 203

Query: 302 QGISRYAKTI-HQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           QG+ R   T  +  GCF V C    L  A  +AA  ADATVLVMG DQ+IE E +DR  L
Sbjct: 204 QGLERTVLTTKYHRGCFNVTCTEADLDSAKTLAA-SADATVLVMGADQTIEKETLDRIDL 262

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
            LPG+QQELV++VAKA+RGPVVLV+M GG  D++FAKND +I +I+WVGYPG+AGG AIA
Sbjct: 263 NLPGKQQELVTQVAKAARGPVVLVIMSGGGFDITFAKNDEKITSIMWVGYPGEAGGIAIA 322

Query: 421 DVLFGRANP 429
           DV+FGR NP
Sbjct: 323 DVIFGRHNP 331


>gi|300121549|emb|CBK22068.2| unnamed protein product [Blastocystis hominis]
          Length = 690

 Score =  362 bits (930), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 193/433 (44%), Positives = 260/433 (60%), Gaps = 19/433 (4%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR-LKV 59
           M N   A LTYWSPNVN++RDPRWGRGQETPGEDP L   YA  +VRGLQ     R LKV
Sbjct: 93  MNNAERANLTYWSPNVNVYRDPRWGRGQETPGEDPFLVATYAVEFVRGLQEGEDPRYLKV 152

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           +ACCKHY+AYDL+NW+GV+R+ F+A VS +D+ DT+ VPF+ CV +G V+S+MCSYN +N
Sbjct: 153 SACCKHYSAYDLENWHGVERFEFDAIVSDRDMTDTFQVPFEQCVKKGHVSSLMCSYNAIN 212

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           G P CAD ++L  T  G W  +GYI SDC ++  +    HYT   +  A   ++A  DLD
Sbjct: 213 GIPACADRELLYGTARGGWGFEGYITSDCGAIDTIIYNHHYTNDTDTTAMLGVRATCDLD 272

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG F   H   +V  G L+E +V+ ALA    VQMRLG+FD     Q + + G   + T 
Sbjct: 273 CGGFYQQHILHSVESGRLKEAEVDDALANLFKVQMRLGLFD-PVEQQVYTHYGLDKLNTK 331

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
            HQ +AL+AA +GI LLKN    LPLS L+   V V+GP ++    M+GNY G+     T
Sbjct: 332 EHQAMALRAAREGIALLKNQNDFLPLS-LKDKHVVVMGPYAEDAGVMLGNYNGIPEFIVT 390

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQ--ADATVLVMGLDQSIEAEFIDR 357
             QG+                C+   ++ + E  ++    D  V+ +GL+Q IE E +DR
Sbjct: 391 VAQGLRN-------------VCDHVDVVKSLEALSKLEGVDLIVVTVGLNQEIEREGLDR 437

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMC-GGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
             LLLP  Q+ L+  +   +  PVVL L+  GG VD+S  + +  +  +L VGY G  GG
Sbjct: 438 EDLLLPASQRALLDGLLAQTDVPVVLTLLSGGGSVDISAYEQNEHVVGVLAVGYGGMFGG 497

Query: 417 AAIADVLFGRANP 429
            AIA+V+ G  NP
Sbjct: 498 QAIAEVIVGDVNP 510


>gi|340370208|ref|XP_003383638.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 732

 Score =  347 bits (889), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 181/419 (43%), Positives = 249/419 (59%), Gaps = 13/419 (3%)

Query: 20  RDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDLDNW-NGVD 78
           RDPRWGR QETPGEDP L  +YA  +V G QG++   LKV   CKH+  YDL+++ +G  
Sbjct: 157 RDPRWGRAQETPGEDPYLNSQYAIQFVTGAQGDS-KYLKVVTTCKHFAGYDLEDYVDGET 215

Query: 79  RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILKNTIHGQW 138
           R+ FNA+++ QD E+TY   FKACV E  VAS+MCSYN+VNG P+CAD  I        W
Sbjct: 216 RHSFNAKITPQDFEETYYPAFKACVEEANVASIMCSYNEVNGVPSCADGQINNKLARDTW 275

Query: 139 RLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLR 198
             DG+I SDC ++  + N  HYT   ++  A A+K G DL+CG +   H + A   G + 
Sbjct: 276 GFDGFIASDCGAIDDIQNKHHYTNNTDDTVAAALKGGCDLNCGSYYQSHAQSAFLNGTIT 335

Query: 199 EEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKN 258
             ++NLAL    T +M+LGMFD  P  QP+  + P  V +  HQ LAL AA + IVLL+N
Sbjct: 336 IGEINLALTRLFTARMKLGMFD-PPELQPYNAISPDVVNSLEHQALALNAARESIVLLQN 394

Query: 259 SARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRYA--KTIHQAGC 316
           +   LPL+  +H T+AV+GP++  T  M GNY GVA    +P++G         +  +GC
Sbjct: 395 NNDVLPLNFEKHSTIAVVGPHAMATDVMQGNYNGVAPYLISPVEGFENLGIDSVLTASGC 454

Query: 317 FGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRV--- 373
             V C        A   A +ADA + V+GLDQS E+E  DR  L LP  Q + V  +   
Sbjct: 455 -DVNCEVTDGFQDAFDIAVKADAVIAVLGLDQSHESEGHDREDLFLPNLQDKFVQDLKNT 513

Query: 374 --AKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
             A  +  P+++V+M G  VD++  K      AILW GYPGQ+GG AIA++++G+ NP 
Sbjct: 514 LKAAGTNAPLIVVVMSGSSVDLTVTKK--HADAILWAGYPGQSGGQAIAEIIYGKVNPS 570


>gi|409079878|gb|EKM80239.1| hypothetical protein AGABI1DRAFT_120267 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 786

 Score =  341 bits (874), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 186/433 (42%), Positives = 261/433 (60%), Gaps = 12/433 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG--NTGSRLKVA 60
           N G AGL Y++PN+N F+DPRWGRGQETPGEDP    +Y    V GLQG  +    +KVA
Sbjct: 140 NFGRAGLNYFTPNINPFKDPRWGRGQETPGEDPFHLSQYVYHLVDGLQGGIDPWPYIKVA 199

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A CKH+ AYDL+NW G+DR+HF+A+VS+QDL + Y  PF++CV + K ASVMCSYN VNG
Sbjct: 200 ADCKHFAAYDLENWEGIDRFHFDAQVSQQDLSEYYLPPFQSCVRDAKAASVMCSYNSVNG 259

Query: 121 KPTCADPDILKNTIHGQWRLDG--YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P CA   +L++ +   W  D   ++ SDC ++  ++++ ++TR+  EAAA ++KAG D+
Sbjct: 260 VPACASTYLLQDILRDAWGFDDDRWVTSDCWALDKIFDSHNFTRSFAEAAAISLKAGTDI 319

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG   A H   A+   L+  +D+  A     T  +RLG FD   S Q +      DV T
Sbjct: 320 DCGSTFADHLPAALNQSLISRDDLTRAFIRQYTSLIRLGYFDPSDS-QTYRQFDWSDVNT 378

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           P  Q L+ +AA +G+VLLKN    LPL+     T+A+IGP ++ T +M GNY G A   T
Sbjct: 379 PEAQALSRRAAVEGLVLLKNDG-LLPLAP-DGKTIAIIGPYTNATSSMQGNYFGNAPIIT 436

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEV--AARQADATVLVMGLDQSIEAEFID 356
           +P QG       +  A   G   NG    G AE    A+ AD  V V G+D ++E E +D
Sbjct: 437 SPFQGAQDVGFKVVSAA--GTTVNGTSSAGFAEAINTAKAADVVVFVGGIDNTLEREGLD 494

Query: 357 RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
           R+ +  PG Q +LV  +A   + P+++V   GG VD +    + ++ AI+W GYPGQ+GG
Sbjct: 495 RSSISWPGNQLDLVKDLASLGK-PLIVVQFGGGQVDDTEILANKKVQAIIWAGYPGQSGG 553

Query: 417 AAIADVLFGRANP 429
            AI D++ G   P
Sbjct: 554 TAIFDIIVGSTAP 566


>gi|426198356|gb|EKV48282.1| hypothetical protein AGABI2DRAFT_67675 [Agaricus bisporus var.
           bisporus H97]
          Length = 763

 Score =  340 bits (873), Expect = 6e-91,   Method: Compositional matrix adjust.
 Identities = 187/433 (43%), Positives = 261/433 (60%), Gaps = 12/433 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG--NTGSRLKVA 60
           N G AGL Y++PN+N F+DPRWGRGQETPGEDP    +Y    V GLQG  +    +KVA
Sbjct: 140 NFGRAGLNYFTPNINPFKDPRWGRGQETPGEDPFHLSQYVYHLVDGLQGGIDPWPYIKVA 199

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A CKH+ AYDL+NW G+DR+HF+A+VS+QDL + Y  PF++CV + K ASVMCSYN VNG
Sbjct: 200 ADCKHFAAYDLENWEGIDRFHFDAQVSQQDLSEYYLPPFQSCVRDAKAASVMCSYNSVNG 259

Query: 121 KPTCADPDILKNTIHGQWRLDG--YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P CA   +L++ +   W  D   ++ SDC ++  ++++ ++TR+  EAAA ++KAG D+
Sbjct: 260 VPACASTYLLQDILRDAWGFDDDRWVTSDCWALDKIFDSHNFTRSFAEAAAISLKAGTDI 319

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG   A H   A+   L+  +D+  A     T  +RLG FD   S Q +      DV T
Sbjct: 320 DCGSTFADHLPAALNQSLISRDDLTRAFIRQYTSLIRLGYFDPSHS-QTYRQFDWSDVNT 378

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           P  Q L+ +AA +G+VLLKN    LPL+     T+A+IGP ++ T +M GNY G A   T
Sbjct: 379 PEAQALSRRAAVEGLVLLKNDG-LLPLAP-DGKTIAIIGPYTNATSSMQGNYFGNAPFIT 436

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEV--AARQADATVLVMGLDQSIEAEFID 356
           +P QG       +  A   G   NG    G AE    AR AD  V V G+D ++E E +D
Sbjct: 437 SPFQGAQDVGFKVVSAA--GTIVNGTSSAGFAEAINTARAADVVVFVGGIDNTLEREGLD 494

Query: 357 RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
           R+ +  PG Q +LV  +A   + P+++V   GG VD +    + ++ AI+W GYPGQ+GG
Sbjct: 495 RSSISWPGNQLDLVKDLASLGK-PLIVVQFGGGQVDDTEILANEKVQAIIWAGYPGQSGG 553

Query: 417 AAIADVLFGRANP 429
            AI D++ G   P
Sbjct: 554 TAIFDIIVGATAP 566


>gi|440799679|gb|ELR20723.1| betaxylosidase [Acanthamoeba castellanii str. Neff]
          Length = 748

 Score =  339 bits (869), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 191/463 (41%), Positives = 264/463 (57%), Gaps = 39/463 (8%)

Query: 1   MYNGGMAGLTYWSPNV-----------------------NIFRDPRWGRGQETPGEDPVL 37
           + N G+ GL +W+PN+                       +I RDPRWGR  E PGEDP +
Sbjct: 108 LNNEGIGGLDFWAPNIKYSTQPTNKTRQESQLRNAMVCISINRDPRWGRNMEVPGEDPFM 167

Query: 38  TGKYAASYVRGLQGNTGSRL-KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYN 96
           T +Y A ++RGLQ    SR  +V   CKH+ AY L+ W   DR+ F+A VS  D  +TY 
Sbjct: 168 TAQYVAHFMRGLQEGEDSRYPQVVGTCKHFAAYSLEAWKDYDRFMFDAIVSDYDFVETYL 227

Query: 97  VPFKACVVEGKVASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYN 156
             FK C+VEG+  S+MCSYN VNG P+CA+  +L+  +   W  DGY+VSDCD+V  +YN
Sbjct: 228 PAFKGCIVEGRARSIMCSYNSVNGVPSCANDFLLRTILRDSWSFDGYVVSDCDAVDTIYN 287

Query: 157 TQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRL 216
             H+T+TPE A A A+ AG DL+CG F   H   A   G + E++V LA+      +M L
Sbjct: 288 NHHFTKTPEGACAVALHAGTDLNCGDFYQKHLGKAHSEGRVTEDEVRLAVKRLFRQRMEL 347

Query: 217 GMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPL-STLRHHTVAV 275
           GM+D  P+ QP+    P  V +  H  LALQAA + +VLL+N    LPL  ++R   VAV
Sbjct: 348 GMWD-PPAEQPYKQYPPSVVGSREHSDLALQAARESMVLLQNRRGVLPLRKSVRR--VAV 404

Query: 276 IGPNSDVTVTMIGNYAGVACGYTTPLQGISRY----AKTIHQAGCFGVACNGNQL----I 327
           IGPN++ T TM+GNY G  C   T    +S Y    AK       + + C+ +      I
Sbjct: 405 IGPNANATETMLGNYYGSRCHDGTYDCIVSPYLAIKAKLPQALVTYNLGCDVDSTNTTGI 464

Query: 328 GAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMC 387
             A  AA+ AD  ++V+GL+ S+E+E  DR  + LPG Q  L+  +  A+  P V+V+M 
Sbjct: 465 PEAVKAAQAADVAIVVLGLNTSVESEGKDRVAITLPGMQDHLIKSIV-ATNTPTVVVMMH 523

Query: 388 GGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           GG V + + K+  ++  I+   YPG+ GG AIADVLFG  NPG
Sbjct: 524 GGAVAIEWIKD--QVDGIVDAFYPGENGGQAIADVLFGDYNPG 564


>gi|407922988|gb|EKG16078.1| Glycoside hydrolase family 3 [Macrophomina phaseolina MS6]
          Length = 800

 Score =  337 bits (865), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 175/431 (40%), Positives = 260/431 (60%), Gaps = 7/431 (1%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLK-VAA 61
           NGG +GL YW+PN+N ++DPRWGRGQETPGEDP     Y  + +RGL+GN     K + A
Sbjct: 144 NGGRSGLDYWTPNINPYKDPRWGRGQETPGEDPFHLASYVQNLIRGLEGNQNDPYKKIVA 203

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+T YD++NWNG  RY F+A+++ +D+ + Y  PF+AC  E KV + MCSYN VNG 
Sbjct: 204 TCKHFTGYDMENWNGNFRYQFDAQINMRDMVEYYMPPFQACAREAKVGAFMCSYNAVNGV 263

Query: 122 PTCADPDILKNTIHGQW---RLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           PTCADP +L+  +   W   + D ++VSDCD++  +Y    +  + E+A AD + AG DL
Sbjct: 264 PTCADPWLLQTVLREHWGWNQEDQWVVSDCDAIQNVYLPHEWAESREQAVADTLNAGTDL 323

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG +   +  GA   GL+ +  ++ AL  T +  ++LG FD   S QP+  +G +DV +
Sbjct: 324 NCGTYYQRYLPGAYEQGLINDTTLDRALTRTYSSLIKLGYFDNADS-QPYRQIGWQDVNS 382

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
              Q+LAL+AA +GIVLLKN    LPLS     ++A+IG  ++ T  M GNYAGVA    
Sbjct: 383 QHAQELALKAAQEGIVLLKNDG-LLPLSLDGVSSIALIGSWANATEQMQGNYAGVAPYLH 441

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           +PL    +    ++ A     +       GA   AA  +D  ++V G+D  IE+E +DR 
Sbjct: 442 SPLYAAEQLGVKVNYAEGASQSNPTTDQWGAEYTAAENSDVIIVVGGIDNDIESEELDRV 501

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            +   G Q ++++++A   + PV++V M  G +D +   ++  I A+LW GYPGQ GG A
Sbjct: 502 AIAWSGPQLDMITKLATYGK-PVIVVQMGAGQLDSTPLVSNANISALLWGGYPGQDGGTA 560

Query: 419 IADVLFGRANP 429
           + D++ G   P
Sbjct: 561 LFDIITGAVAP 571


>gi|222618262|gb|EEE54394.1| hypothetical protein OsJ_01415 [Oryza sativa Japonica Group]
          Length = 776

 Score =  337 bits (863), Expect = 9e-90,   Method: Compositional matrix adjust.
 Identities = 198/455 (43%), Positives = 254/455 (55%), Gaps = 75/455 (16%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG------ 54
           MYN G A LTYWSPN+N+ RDPRWGR  ETPGEDP + G+YA ++VRG+Q   G      
Sbjct: 164 MYNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDIDGATTAAS 223

Query: 55  --------SR-LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVE 105
                   SR +KV++CCKHY A                                     
Sbjct: 224 AAAATDAFSRPIKVSSCCKHYAA------------------------------------- 246

Query: 106 GKVASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYT-RTP 164
                VMCSYN++NG P CAD  +L  T+   W+L GYIVSDCDSV V+     +   T 
Sbjct: 247 ----CVMCSYNRINGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTG 302

Query: 165 EEAAADAIKAGLDLDCG-------PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLG 217
            EA A A+KAGLDLDCG        F   +   AVR G L+E  V+ AL       MRLG
Sbjct: 303 VEATAAAMKAGLDLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLG 362

Query: 218 MFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIG 277
            FDG P  +   +LG  DVCT  H++LA  AA QG+VLLKN A  LPLS  + ++VA+ G
Sbjct: 363 FFDGIPELE---SLGAADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFG 419

Query: 278 PNSDVTVT--MIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAAR 335
               +  T  M+G+Y G  C   TP  G+ +   +     C   +C+       A  AA+
Sbjct: 420 QLQHINATDVMLGDYRGKPCRVVTPYDGVRKVVSSTSVHACDKGSCD------TAAAAAK 473

Query: 336 QADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSF 395
             DAT++V GL+ S+E E  DR  LLLP  Q   ++ VA+AS  P+VLV+M  G VDVSF
Sbjct: 474 TVDATIVVAGLNMSVERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSF 533

Query: 396 AKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           A+++P+IGA++W GYPG+ GG AIADVLFG+ NPG
Sbjct: 534 AQDNPKIGAVVWAGYPGEEGGTAIADVLFGKYNPG 568


>gi|409079872|gb|EKM80233.1| hypothetical protein AGABI1DRAFT_57801 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 767

 Score =  335 bits (859), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 191/433 (44%), Positives = 257/433 (59%), Gaps = 11/433 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR--LKVA 60
           N   AGL Y++PN+N F+DPRWGRGQETPGEDP    +Y  S + GLQG    R   KVA
Sbjct: 140 NFHRAGLDYFTPNINPFKDPRWGRGQETPGEDPFHVSQYVYSLIDGLQGGIDPRPYFKVA 199

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A CKHY AYDLD+W G+DR+HF+A+VS QDL + Y   F++CV + KVASVMCSYN VNG
Sbjct: 200 ADCKHYAAYDLDSWEGIDRFHFDAKVSLQDLSEYYLPSFQSCVRDAKVASVMCSYNSVNG 259

Query: 121 KPTCADPDILKNTIHGQWRLDG--YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P CA+P +L++ +   W  D   ++ SDCD++G ++ T ++T T  EA ADA+KAG D+
Sbjct: 260 IPACANPYLLQDILRDFWGFDDDRWVTSDCDAIGNIFTTHNFTDTFAEAVADALKAGTDV 319

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG   + H   A+   L+  +D+  AL    T  MRLG FD  P +QP   L   DV  
Sbjct: 320 DCGTSYSTHLPDALNQSLITRDDLERALTRQYTSLMRLGYFD-PPESQPLRQLAWSDVNK 378

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           P  Q LA  AA +G+VLLKN    LP+S     T+A+IGP ++ T  M GNY G A    
Sbjct: 379 PDAQALAHTAAVEGLVLLKNDG-FLPVSA-SGKTIAIIGPYANATKDMQGNYFGTAPFIV 436

Query: 299 TPLQGI--SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFID 356
           TP QG   + + + +  AG   +         AA   A  +D  +   G++ SIE+E  D
Sbjct: 437 TPFQGAVDAGFNEVVSAAGT-SINGTSEADFAAAIAVANSSDIIIFAGGINNSIESEAKD 495

Query: 357 RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
           R  +   G Q  LV ++A   + PVV+V   GG +D S   ++  + A++W GYPGQ+GG
Sbjct: 496 RLTIAWTGNQLSLVKQLASLGK-PVVVVQFGGGQLDDSDLLDNDAVRAVIWAGYPGQSGG 554

Query: 417 AAIADVLFGRANP 429
            AI DV+ G   P
Sbjct: 555 TAIFDVITGAVAP 567


>gi|426198365|gb|EKV48291.1| hypothetical protein AGABI2DRAFT_219902 [Agaricus bisporus var.
           bisporus H97]
          Length = 767

 Score =  335 bits (859), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 191/433 (44%), Positives = 257/433 (59%), Gaps = 11/433 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR--LKVA 60
           N   AGL Y++PN+N F+DPRWGRGQETPGEDP    +Y  S + GLQG    R   KVA
Sbjct: 140 NFHRAGLDYFTPNINPFKDPRWGRGQETPGEDPFHVSQYVYSLIDGLQGGIDPRPYFKVA 199

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A CKHY AYDLD+W G+DR+HF+A+VS QDL + Y   F++CV + KVASVMCSYN VNG
Sbjct: 200 ADCKHYAAYDLDSWEGIDRFHFDAKVSLQDLSEYYLPSFQSCVRDAKVASVMCSYNSVNG 259

Query: 121 KPTCADPDILKNTIHGQWRLDG--YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P CA+P +L++ +   W  D   ++ SDCD++G ++ T ++T T  EA ADA+KAG D+
Sbjct: 260 IPACANPYLLQDILRDFWGFDDDRWVTSDCDAIGNIFTTHNFTDTFAEAVADALKAGTDV 319

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG   + H   A+   L+  +D+  AL    T  MRLG FD  P +QP   L   DV  
Sbjct: 320 DCGTSYSTHLPDALNQSLITRDDLERALTRQYTSLMRLGYFD-PPESQPLRQLAWSDVNK 378

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           P  Q LA  AA +G+VLLKN    LP+S     T+A+IGP ++ T  M GNY G A    
Sbjct: 379 PDAQALAHTAAVEGLVLLKNDG-FLPVSA-SGKTIAIIGPYANATKDMQGNYFGTAPFIV 436

Query: 299 TPLQGI--SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFID 356
           TP QG   + + + +  AG   +         AA   A  +D  +   G++ SIE+E  D
Sbjct: 437 TPFQGAVDAGFNEVVSAAGT-SINGTSEADFAAAIAVANSSDIIIFAGGINNSIESEAKD 495

Query: 357 RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
           R  +   G Q  LV ++A   + PVV+V   GG +D S   ++  + A++W GYPGQ+GG
Sbjct: 496 RLTIAWTGNQLSLVKQLASLGK-PVVVVQFGGGQLDDSDLLDNDAVRAVIWAGYPGQSGG 554

Query: 417 AAIADVLFGRANP 429
            AI DV+ G   P
Sbjct: 555 TAIFDVITGAVAP 567


>gi|389748500|gb|EIM89677.1| glycoside hydrolase family 3 protein [Stereum hirsutum FP-91666
           SS1]
          Length = 770

 Score =  334 bits (856), Expect = 5e-89,   Method: Compositional matrix adjust.
 Identities = 190/433 (43%), Positives = 255/433 (58%), Gaps = 12/433 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR--LKVA 60
           N   +GL +++PN+N F+DPRWGRGQETPGEDP+ T +Y    + GLQG  G     K+ 
Sbjct: 138 NTNHSGLDFFTPNINPFKDPRWGRGQETPGEDPLHTSRYVYQLITGLQGGVGPSPYYKII 197

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A CKH+ AYDL+NW G +R  FNA VS QDL + Y   F++CV + KV SVMCSYN VNG
Sbjct: 198 ADCKHFAAYDLENWEGNNRMAFNAIVSTQDLAEFYTPSFQSCVRDAKVGSVMCSYNAVNG 257

Query: 121 KPTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P C  P +L++ +   + L  D +I SDCD+VG +++  +YT T   A+A A+ AG D+
Sbjct: 258 VPACGSPYLLQDLVRDYFELGNDTWITSDCDAVGNIFDPHNYTTTLTNASAVALLAGTDV 317

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG   +     AV  GL+ + DV  AL       +RLG FD E S  P+  LG  DV T
Sbjct: 318 DCGTSYSETLGEAVSEGLVSKSDVERALVRLYGSLVRLGYFDPEDSV-PYRALGASDVNT 376

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PA Q LA  AA +GIVLLKN    LPLS+   H +A+IGP ++ T  M GNY G+A    
Sbjct: 377 PAAQTLAYTAAVEGIVLLKNDGL-LPLSSNVSH-IALIGPWANATTQMQGNYEGIAPLLI 434

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEV--AARQADATVLVMGLDQSIEAEFID 356
           +PL G +     +      G   +GN   G A+    A  AD  V + G+D ++EAE  D
Sbjct: 435 SPLDGFTSAGFNVSFTN--GTTISGNSTSGFADALSMASAADVIVYIGGIDDTVEAEGQD 492

Query: 357 RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
           R  +  PG Q EL+  +    + P V++ M GG VD +  K +  + A+LW GYPGQAGG
Sbjct: 493 RTSITWPGNQLELIGELGAFGK-PFVVIQMGGGQVDDTELKANSSVNALLWGGYPGQAGG 551

Query: 417 AAIADVLFGRANP 429
            A+AD++ G   P
Sbjct: 552 KALADIITGVQAP 564


>gi|452989371|gb|EME89126.1| glycoside hydrolase family 3 protein [Pseudocercospora fijiensis
           CIRAD86]
          Length = 790

 Score =  332 bits (850), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 177/431 (41%), Positives = 249/431 (57%), Gaps = 7/431 (1%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N   AGL +W+PN+N F+DPRWGRGQETPGEDP     Y  S +RGLQG+  S  KV A 
Sbjct: 141 NDDRAGLDFWTPNINPFKDPRWGRGQETPGEDPYHLSSYVHSLIRGLQGDNPSYKKVVAT 200

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKH+ AYD++NWNG  RY  +A ++ QDL + Y  PF++C  +  V + MCSYN +NG P
Sbjct: 201 CKHFVAYDVENWNGNFRYQLDAHINSQDLVEYYMPPFRSCARDSNVGAFMCSYNSLNGVP 260

Query: 123 TCADPDILKNTIHGQWRL---DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           TCADP +L+  +   W     + ++ SDCDSV  ++   +Y  + EEAAA ++KAG D++
Sbjct: 261 TCADPYLLQTVLREHWNWTAEEQWVTSDCDSVQNVFLYHNYASSREEAAAISLKAGTDIN 320

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG +   H   A   GL+ E DV+ +L       +RLG FDG+    P+ NL   DV TP
Sbjct: 321 CGTYYQEHLPRAYEQGLINETDVDTSLIRQYGSLIRLGYFDGD--RVPYRNLTWNDVSTP 378

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
             Q LAL+AA  GI LLKN    LPL       +A+IG  ++ T  M+GNY G+   + +
Sbjct: 379 YAQDLALKAATSGITLLKNDG-ILPLQITNGTKIALIGDWANATDQMLGNYHGIPPYFHS 437

Query: 300 PLQGISRY-AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           PL    +  A+  +  G  G +            AA ++D  + + G+D+ +EAE  DR 
Sbjct: 438 PLWAAQQTGAEVTYVQGPGGQSDPTTYTWRPIWSAANKSDVIIYIGGMDERVEAEEKDRV 497

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            +   G Q +++ ++A     P ++V M GG +D S    +P I A+LW GYPGQ GG A
Sbjct: 498 SIAWSGPQLDVIGQLADYYDKPTIVVQMGGGSLDSSPLVKNPNIRALLWGGYPGQDGGKA 557

Query: 419 IADVLFGRANP 429
           I D+L G + P
Sbjct: 558 IFDILQGISAP 568


>gi|451992719|gb|EMD85198.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
           C5]
          Length = 781

 Score =  330 bits (847), Expect = 7e-88,   Method: Compositional matrix adjust.
 Identities = 183/434 (42%), Positives = 254/434 (58%), Gaps = 13/434 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN-TGSRLKVAA 61
           N    GL +W+PN+N FRDPRWGRGQETPGED      Y  + + GLQGN T    +V A
Sbjct: 138 NANRTGLDFWTPNINPFRDPRWGRGQETPGEDSYHLSSYVKALIHGLQGNATDPYRRVVA 197

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKHY  YD++NWNG  RY  + ++S+QDL + Y  PF+AC V+  V + MCSYN VNG 
Sbjct: 198 TCKHYAGYDIENWNGNLRYQNDVQISQQDLVEYYLAPFEAC-VQANVGAFMCSYNAVNGA 256

Query: 122 PTCADPDILKNTIHGQWRL---DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           P CADP +L+  +   W     D ++ SDCD++  +Y    ++ T E AAAD++ AG DL
Sbjct: 257 PPCADPYLLQTVLREHWGWSSDDHWVTSDCDAIQNVYLPHQWSSTREGAAADSLNAGTDL 316

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG +L  H  GAV+ GL  E  ++ AL    +  ++LG FD  P  QP+  LG   V T
Sbjct: 317 DCGTYLQTHLPGAVKQGLTDETTLDKALIRQYSSLIKLGYFDA-PENQPYRQLGFDAVAT 375

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
            A Q LAL+AA +GIVLLKN    LP++ L    V + G  ++ T  + GNY GVA   T
Sbjct: 376 SASQALALKAAEEGIVLLKNDG-VLPIN-LGSKQVGIYGDWANATSQLQGNYFGVAKFLT 433

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAA---RQADATVLVMGLDQSIEAEFI 355
           +PL  +      +  AG       G+   GA    +     +D  + V G+D  +E+E  
Sbjct: 434 SPLMALQNLGVDVKYAGNL-PGGQGDPTTGAWSSLSGVITTSDVHIWVGGIDNGVESEDR 492

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR+ L L G Q +++ ++A   + PV++V+M GG +D S    +P+I A+LW GYPGQ G
Sbjct: 493 DRSWLTLTGGQLDVIGQLADTGK-PVIVVIMGGGQIDTSPLIRNPKISAVLWAGYPGQDG 551

Query: 416 GAAIADVLFGRANP 429
           G AI ++L G+A P
Sbjct: 552 GTAIVNILTGKAAP 565


>gi|409041356|gb|EKM50841.1| glycoside hydrolase family 3 protein [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 764

 Score =  328 bits (842), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 187/430 (43%), Positives = 260/430 (60%), Gaps = 7/430 (1%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG--NTGSRLKVA 60
           N G AGL +++PN+N F+DPRWGRGQETPGEDP    +Y    V GLQG  +     KV 
Sbjct: 141 NAGRAGLDFFTPNINPFKDPRWGRGQETPGEDPFHIAQYVYQLVTGLQGGLSPDPYYKVI 200

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A CKH+  YDL+NW G  R  FNA +S QDL + Y   F++CV +  V SVMCSYN VNG
Sbjct: 201 ADCKHFAGYDLENWEGNSRMAFNAIISTQDLAEYYTPSFQSCVRDAHVGSVMCSYNAVNG 260

Query: 121 KPTCADPDILKNTIHGQWRL-DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
            P+CA+  +L++ I G + L DG+I SDCD+V  +++   YT T   A+A A+KAG D+D
Sbjct: 261 IPSCANSYLLQDIIRGHFGLGDGWITSDCDAVANIFSPHQYTTTLVNASAVALKAGTDVD 320

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG   +     AV   L+ E+D+  ++       +RLG FD  P+ QPF  LG  DV TP
Sbjct: 321 CGTTYSQTLVDAVDQNLVTEDDIKNSMIRLYRSLVRLGYFD-SPAEQPFRQLGWSDVNTP 379

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           + Q LAL AA +G+ LLKN   TLPLS+     +A++GP ++ T  M GNY G+A    +
Sbjct: 380 SSQALALTAAEEGVTLLKNDG-TLPLSSAIKR-IALVGPWANATTQMQGNYQGIAPFLVS 437

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           PLQ +      +  A    +    +    AA  A + ADA +   G+D++IE+E  DR  
Sbjct: 438 PLQALQDAGFQVTFANGTAINSTDDSGFAAAVSAVQVADAVIYAGGIDETIESEGNDREI 497

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           +  PG Q +LVS++A   + P V++ M GG VD S  K++  + A++W GYPGQ+GGAAI
Sbjct: 498 ITWPGNQLDLVSQLAAVGK-PFVVLQMGGGQVDSSSLKSNKAVNALIWGGYPGQSGGAAI 556

Query: 420 ADVLFGRANP 429
            ++L G+  P
Sbjct: 557 VNILTGKIAP 566


>gi|296439595|sp|A1CCL9.2|BXLB_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
          Length = 771

 Score =  328 bits (841), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 187/431 (43%), Positives = 263/431 (61%), Gaps = 9/431 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG-SRLKVAA 61
           N G AGL YW+PN+N FRDPRWGRGQETPGEDP+   +Y    V GLQG  G +R ++AA
Sbjct: 145 NAGRAGLDYWTPNINPFRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQGGIGPARPQIAA 204

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+ AYD+++WNGV R+ F+ARVS QDL + Y   FK+CV + +V +VMCSYN +NG 
Sbjct: 205 TCKHFAAYDMEDWNGVSRHEFDARVSTQDLAEFYLPSFKSCVRDAQVDAVMCSYNALNGV 264

Query: 122 PTCADPDILKNTIHGQWRLDG---YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           PTCADP +L+  +   W  D    ++VSDC ++  +Y   +YT+T  EAAA A+ AG DL
Sbjct: 265 PTCADPYLLQTLLREHWDWDQPGHWVVSDCGAIDDIYIGHNYTKTGAEAAAVALNAGTDL 324

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     H   A   GL   + ++ AL    +  ++LG FD     QP+G++G +DV T
Sbjct: 325 DCGTVFPKHLGEAAEQGLYTNQTLDRALVRLYSSLVKLGYFD-PAEKQPYGSIGWKDVDT 383

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PA +QLA +AA +GIVLLKN  +TLPL      T+A+IGP ++ T  M GNY G      
Sbjct: 384 PAAEQLAHKAAVEGIVLLKND-QTLPLKA--KGTLALIGPYANATKQMQGNYQGPPKYIR 440

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           T     +++   +  +    +  +      AA  AA+ AD  +   G+D +IE+E +DR 
Sbjct: 441 TLEWAATQHGYQVQYSPGTAINNSSTAGFAAALAAAKDADVVLYAGGIDNTIESETLDRT 500

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            +  PG Q  L+S ++   + P++++   GG VD +    +P + A+LW GYP Q GGAA
Sbjct: 501 TITWPGNQLSLISELSNLHK-PLIVIQFGGGQVDDTPLLTNPHVNALLWAGYPSQEGGAA 559

Query: 419 IADVLFGRANP 429
           I D+L G+A P
Sbjct: 560 IFDILTGKAAP 570


>gi|121712174|ref|XP_001273702.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
 gi|119401854|gb|EAW12276.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
          Length = 803

 Score =  328 bits (841), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 187/431 (43%), Positives = 263/431 (61%), Gaps = 9/431 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG-SRLKVAA 61
           N G AGL YW+PN+N FRDPRWGRGQETPGEDP+   +Y    V GLQG  G +R ++AA
Sbjct: 177 NAGRAGLDYWTPNINPFRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQGGIGPARPQIAA 236

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+ AYD+++WNGV R+ F+ARVS QDL + Y   FK+CV + +V +VMCSYN +NG 
Sbjct: 237 TCKHFAAYDMEDWNGVSRHEFDARVSTQDLAEFYLPSFKSCVRDAQVDAVMCSYNALNGV 296

Query: 122 PTCADPDILKNTIHGQWRLDG---YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           PTCADP +L+  +   W  D    ++VSDC ++  +Y   +YT+T  EAAA A+ AG DL
Sbjct: 297 PTCADPYLLQTLLREHWDWDQPGHWVVSDCGAIDDIYIGHNYTKTGAEAAAVALNAGTDL 356

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     H   A   GL   + ++ AL    +  ++LG FD     QP+G++G +DV T
Sbjct: 357 DCGTVFPKHLGEAAEQGLYTNQTLDRALVRLYSSLVKLGYFD-PAEKQPYGSIGWKDVDT 415

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PA +QLA +AA +GIVLLKN  +TLPL      T+A+IGP ++ T  M GNY G      
Sbjct: 416 PAAEQLAHKAAVEGIVLLKND-QTLPLKA--KGTLALIGPYANATKQMQGNYQGPPKYIR 472

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           T     +++   +  +    +  +      AA  AA+ AD  +   G+D +IE+E +DR 
Sbjct: 473 TLEWAATQHGYQVQYSPGTAINNSSTAGFAAALAAAKDADVVLYAGGIDNTIESETLDRT 532

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            +  PG Q  L+S ++   + P++++   GG VD +    +P + A+LW GYP Q GGAA
Sbjct: 533 TITWPGNQLSLISELSNLHK-PLIVIQFGGGQVDDTPLLTNPHVNALLWAGYPSQEGGAA 591

Query: 419 IADVLFGRANP 429
           I D+L G+A P
Sbjct: 592 IFDILTGKAAP 602


>gi|451849522|gb|EMD62825.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
          Length = 849

 Score =  326 bits (836), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 180/434 (41%), Positives = 253/434 (58%), Gaps = 13/434 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN-TGSRLKVAA 61
           N    GL +W+PN+N FRDPRWGRGQETPGED      Y  + + GLQGN T    +V A
Sbjct: 206 NVNRTGLDFWTPNINPFRDPRWGRGQETPGEDSYHLSSYVKALIHGLQGNETDPYRRVVA 265

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKHY  YD++NWNG  RY  + ++S+QDL + Y  PF+AC V+  V + MCSYN VNG 
Sbjct: 266 TCKHYAGYDIENWNGNLRYQNDVQISQQDLVEYYLAPFEAC-VQANVGAFMCSYNAVNGA 324

Query: 122 PTCADPDILKNTIHGQWRL---DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           P CADP +L+  +   W     + ++ SDCDS+  +Y    ++ T E AAAD++ AG DL
Sbjct: 325 PPCADPYMLQTVLREHWGWSSDEHWVTSDCDSIQNVYLPHQWSSTREGAAADSLNAGTDL 384

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG +L  H  GAV+ GL  E  ++ AL    +  ++LG FD  P  QP+  LG   V T
Sbjct: 385 DCGTYLQSHLPGAVKQGLTNETTLDNALIRQYSSLIKLGYFD-IPENQPYRQLGFDAVAT 443

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
            A Q LAL+AA +GIVLLKN    LP++      V + G  ++ T  + GNY GVA   T
Sbjct: 444 SASQALALKAAEEGIVLLKNDG-VLPIN-FGSKNVGIYGDWANATSQLQGNYFGVAKFLT 501

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAA---RQADATVLVMGLDQSIEAEFI 355
           +P   + +    +  AG       G+   G+    +     +D  + V G+D  IE+E  
Sbjct: 502 SPYMALEKLGVNVRYAGNL-PGGQGDPTTGSWPRLSGVITTSDVHIWVGGMDNGIESEDR 560

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR+ L L G Q +++ ++A   + PV++++M GG +D S    +P+I A+LW GYPGQ G
Sbjct: 561 DRSWLTLTGSQLDVIGQLADTGK-PVIVIIMGGGQIDTSPLIKNPKISAVLWAGYPGQDG 619

Query: 416 GAAIADVLFGRANP 429
           G AI ++L G+A P
Sbjct: 620 GTAIVNILTGKAAP 633


>gi|452846807|gb|EME48739.1| glycoside hydrolase family 3 protein [Dothistroma septosporum
           NZE10]
          Length = 802

 Score =  325 bits (834), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 175/434 (40%), Positives = 256/434 (58%), Gaps = 13/434 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLK-VAA 61
           N   AGL +W+PN+N F+D RWGRGQETPGEDP     Y A+ + GLQG+   + K V A
Sbjct: 141 NDKRAGLDFWTPNINPFKDSRWGRGQETPGEDPYHLSSYVAALIEGLQGSPDDKYKRVVA 200

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+ AYD+++WNG  RY F+A+VS QDL + Y  PF+ C  +  V + MCSYN +NG 
Sbjct: 201 TCKHFVAYDMESWNGNFRYQFDAQVSSQDLVEYYMPPFQQCARDSNVGAFMCSYNALNGV 260

Query: 122 PTCADPDILKNTIHGQWRLDG---YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           PTCADP +L+  +  +W       ++ SDCD+V  ++    Y  T EEAAA ++KAG D+
Sbjct: 261 PTCADPWLLQTVLREKWNWTSEQQWVTSDCDAVQNVFLPHDYASTREEAAALSLKAGTDI 320

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG +   H   A   GL+   D++++L    +  +RLG FDG   A P+ NL   DV T
Sbjct: 321 NCGTYYQDHLPAAYDQGLINTTDLDISLIRQYSSLVRLGYFDGL--AVPYRNLTWNDVST 378

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           P  QQLA +AA +GI LLKN    LPL+     ++A+IG  ++ T  M+GNY G+   + 
Sbjct: 379 PHAQQLAYKAAAEGITLLKNDG-VLPLTISNGTSIALIGDWANATDQMLGNYDGIPPFFH 437

Query: 299 TPLQGISRYAKTIHQA---GCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI 355
           +PL    +   T++ A   G  G     + L      AA ++D  +   G+D S+E+E +
Sbjct: 438 SPLYAAQQTGATVNFATGPGGQGDPTTDHWL--PVWAAANKSDVIIYAGGIDNSVESEGM 495

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  L   G Q +++ ++A   + PV+++ M GG +D S   N+P + A++W GYPGQ G
Sbjct: 496 DRVSLTWTGAQLDMIGQLAMYGK-PVIVLQMGGGQIDSSPLVNNPNVSALIWGGYPGQDG 554

Query: 416 GAAIADVLFGRANP 429
           G A+ D++ G   P
Sbjct: 555 GVALFDIIRGITAP 568


>gi|347531439|ref|YP_004838202.1| beta-glucosidase [Roseburia hominis A2-183]
 gi|345501587|gb|AEN96270.1| beta-glucosidase [Roseburia hominis A2-183]
          Length = 716

 Score =  325 bits (832), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 183/450 (40%), Positives = 265/450 (58%), Gaps = 34/450 (7%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
            G   GLT+W+PN+NIFRDPRWGRG ET GEDP LT K   +Y+RG+QG     LK AAC
Sbjct: 98  RGIYKGLTFWAPNINIFRDPRWGRGHETYGEDPCLTAKLGCAYIRGIQGKDPDHLKAAAC 157

Query: 63  CKHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
            KH+  +     +G +  R+ F+A+VS  DL DTY   FK CV +  V +VM +YN+VNG
Sbjct: 158 AKHFAVH-----SGPEALRHEFDAKVSLHDLYDTYLYAFKRCVKDAGVEAVMGAYNRVNG 212

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
           +P C    +L++ +  Q+  +G++VSDC ++   +   H T+T EE+AA A+  G DL+C
Sbjct: 213 EPACGSKTLLQDILREQFGFEGHVVSDCWAILDFHEHHHVTKTVEESAAMAVNHGCDLNC 272

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT-P 239
           G    ++   A   GL+ E+ +  A+   + V++RLGM +  PS  P+ N+ P DV   P
Sbjct: 273 GKAF-LYLSRACEQGLVEEKTITEAVERLMDVRIRLGMMEDYPS--PYANI-PYDVVECP 328

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
            H  L+L+A+ + +VLLKN    LPL   + HT+AVIGPN++    ++GNY G +  Y T
Sbjct: 329 EHIALSLEASKRSMVLLKNDNHFLPLKQEQVHTIAVIGPNANSRAALVGNYEGTSSRYIT 388

Query: 300 PLQGISRY----AKTIHQAGCFGVACNGNQLIG-------AAEVAARQADATVLVMGLDQ 348
           PL+GI  Y     + ++  GC  +  +  + +G        A +AA +AD  V+ +GLD 
Sbjct: 389 PLEGIQEYTGEKTRVLYAQGCH-LYKDQVEFLGEPKDRFKEALIAAERADVIVMCLGLDA 447

Query: 349 SIEAE---------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKND 399
            IE E           D+ GL LPG QQEL+  VA   + P+VL ++ G  +D+S+A+  
Sbjct: 448 GIEGEEGDAGNEYASGDKLGLKLPGLQQELLEAVAAVGK-PIVLTVLAGSALDLSWAQEH 506

Query: 400 PRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            +I AIL   YPG  GG AIA+ LFG  +P
Sbjct: 507 AQIRAILDCWYPGARGGKAIAEALFGEFSP 536


>gi|392570764|gb|EIW63936.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
           SS1]
          Length = 781

 Score =  324 bits (830), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 192/432 (44%), Positives = 264/432 (61%), Gaps = 9/432 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR--LKVA 60
           N G AGL YW+PN+N F+DPRWGRGQETPGEDP    +Y  + + GLQG    +   KV 
Sbjct: 143 NAGRAGLDYWTPNINPFKDPRWGRGQETPGEDPFHLSQYVYNLILGLQGGLDPKPYFKVV 202

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A CKH+ AYD+DNW GV RY FNA VS+QDL + Y  PF+ CV + KVASVMCSYN VNG
Sbjct: 203 ADCKHFAAYDMDNWEGVVRYGFNAVVSQQDLSEFYLPPFQTCVRDAKVASVMCSYNAVNG 262

Query: 121 KPTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+CA+  +L++ +   W    D ++ SDCD+V  ++   +YT  P +AAADA+ AG D+
Sbjct: 263 IPSCANSFLLQDVLRDHWGFTDDRWVTSDCDAVQNIFTPHNYTTDPAQAAADALLAGTDI 322

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG F + +   A++ GL+   D+  A        +RLG FD +P+AQP+  LG  DV T
Sbjct: 323 DCGTFSSTYLPEALQRGLVNSTDLRRAAIRQYASLVRLGYFD-DPAAQPYRQLGWSDVNT 381

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
              QQLA  AA +G+VLLKN    LPLS  R   +A+IGP ++ T  + GNY G+A    
Sbjct: 382 LQAQQLAHTAAVEGMVLLKNDG-LLPLSK-RVRKLALIGPWANATRLLQGNYFGIAPYLV 439

Query: 299 TPLQGISRYAKTIHQAGCFGVAC-NGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           +P+QG  +    +       V   N      AA  AA++ADA V   GLD+++E E IDR
Sbjct: 440 SPVQGAQQAGFEVEYVFGTNVTTRNDTSGFAAAVAAAKRADAVVFAGGLDETVEREEIDR 499

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +  PG Q +LV+ + +  + P+++    GG +D +  K    + AI+W GYPGQ+GG 
Sbjct: 500 LNVTWPGNQLDLVAELERVGK-PLIVAQFGGGQLDNTALKRSKAVNAIIWGGYPGQSGGT 558

Query: 418 AIADVLFGRANP 429
           A+ D+L G+A P
Sbjct: 559 ALFDILTGKAAP 570


>gi|223945397|gb|ACN26782.1| unknown [Zea mays]
          Length = 516

 Score =  324 bits (830), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 167/322 (51%), Positives = 216/322 (67%), Gaps = 4/322 (1%)

Query: 112 MCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADA 171
           MCSYN+VNG PTCAD ++L  T    W   GYI SDCD+V ++++ Q Y +T E+A AD 
Sbjct: 1   MCSYNRVNGVPTCADYNLLSTTARQDWGFYGYITSDCDAVAIIHDAQGYAKTAEDAVADV 60

Query: 172 IKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNL 231
           +KAG+D++CG ++  H   A++ G + E+D+N AL     V+MRLG+F+G+P    +G++
Sbjct: 61  LKAGMDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGDI 120

Query: 232 GPRDVCTPAHQQLALQAAHQGIVLLKNS--ARTLPLSTLRHHTVAVIGPNSDVTVTMIGN 289
           GP  VCT  HQ LAL+AA  GIVLLKN   A  LPLS     ++AVIG N++  + + GN
Sbjct: 121 GPDQVCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRGN 180

Query: 290 YAGVACGYTTPLQGISRYAK-TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQ 348
           Y G  C   TPLQ +  Y K T   AGC   ACN   +  A + AA  AD+ VL MGLDQ
Sbjct: 181 YFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAACNVTTIPEAVQ-AASSADSVVLFMGLDQ 239

Query: 349 SIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWV 408
             E E +DR  L LPG+QQ L+  VA A++ PV+LVL+CGGPVDVSFAK +P+IGAILW 
Sbjct: 240 DQEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWA 299

Query: 409 GYPGQAGGAAIADVLFGRANPG 430
           GYPG+AGG AIA VLFG  NPG
Sbjct: 300 GYPGEAGGIAIAQVLFGEHNPG 321


>gi|393247584|gb|EJD55091.1| beta-xylosidase [Auricularia delicata TFB-10046 SS5]
          Length = 763

 Score =  323 bits (829), Expect = 9e-86,   Method: Compositional matrix adjust.
 Identities = 182/434 (41%), Positives = 253/434 (58%), Gaps = 13/434 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR--LKVA 60
           N G +G+ +++PN+N FRDPRWGRGQETPGEDP+   +Y    V  LQG  G     K+ 
Sbjct: 141 NFGFSGIDFFTPNINPFRDPRWGRGQETPGEDPLHISRYVFQLVTALQGGLGPSPYYKIV 200

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A CKH+  YDL++W G+DR+HF+A ++ QDL + Y   F++CV + KV SVMCSYN VNG
Sbjct: 201 ADCKHFAGYDLESWEGIDRFHFDAVITTQDLAEFYTPSFQSCVRDAKVGSVMCSYNSVNG 260

Query: 121 KPTCADPDILKNTIHGQWRL-DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
            P CA   +L++ +   + L DG+I SDCD+V  ++ T ++T T   A+A ++KAG D+D
Sbjct: 261 VPACASSYLLQDIVRDFYGLGDGWITSDCDAVQNVFTTHNFTTTQANASAISLKAGTDVD 320

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG   A     A+  GL+ E+D+  AL       +R G FD  P  QPF  LG  DV TP
Sbjct: 321 CGNVYAQSLGDALDQGLVEEDDLKQALVRLYGSLVRTGYFD-SPEEQPFRQLGWADVDTP 379

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           A ++LAL AA +GIVLLKN    LPLS+     V ++GP  + T  M GNY G A    +
Sbjct: 380 ASRRLALLAAEEGIVLLKNDG-LLPLSSRDVPNVIMVGPWGNATTMMQGNYFGNAPYLVS 438

Query: 300 PLQGI--SRYAKTIHQAGCFGVACNGNQLIGAAE--VAARQADATVLVMGLDQSIEAEFI 355
           P QG   + +  T        V  NG    G  E   AA   D  V V G D  +E E  
Sbjct: 439 PRQGFVDAGFNVTFFNG---TVGTNGTDTSGFDEAVAAAGDTDLIVFVGGPDNVVERESR 495

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  +  PG Q +L+  +A   + P++++ M  G VD ++ K    I A++W GYPGQ+G
Sbjct: 496 DRINITWPGVQLDLIKELAGVGK-PMIVLQMGAGQVDDTWLKESDAINALIWGGYPGQSG 554

Query: 416 GAAIADVLFGRANP 429
           G A+A+++ G+  P
Sbjct: 555 GTALANIVTGKTAP 568


>gi|395334835|gb|EJF67211.1| beta-xylosidase [Dichomitus squalens LYAD-421 SS1]
          Length = 774

 Score =  322 bits (826), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 193/433 (44%), Positives = 262/433 (60%), Gaps = 11/433 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG--NTGSRLKVA 60
           N G AGL YW+PN+N F+DPRWGRGQETPGEDP     Y  + + GLQG  +     KV 
Sbjct: 143 NVGRAGLDYWTPNINPFKDPRWGRGQETPGEDPFHLQGYVYNLILGLQGGLDPTPYFKVV 202

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A CKH+ AYD+DNW G  RY FNA V++QDL + Y   F+ CV + KVASVMCSYN VNG
Sbjct: 203 ADCKHFAAYDMDNWEGNVRYGFNAVVTQQDLSEYYLPSFQTCVRDAKVASVMCSYNAVNG 262

Query: 121 KPTCADPDILKNTIHGQWRLDG--YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+CA+  +L++ +   W  D   ++ SDCD+V  +Y   +YT  P +AAADA+ AG D+
Sbjct: 263 IPSCANSFLLQDILRDYWGFDDTRWVTSDCDAVQNIYTPHNYTDNPAQAAADALLAGTDI 322

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG F + +   A+  GL+   D+  A        +RLG FD  P +QP+  LG  DV T
Sbjct: 323 DCGTFSSTYLPDALSQGLVNATDLKRAAIRQYASLVRLGYFD-PPESQPYRQLGWSDVNT 381

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRH-HTVAVIGPNSDVTVTMIGNYAGVACGY 297
           P  QQLA  AA +G+VLLKN   TLPLS  +H   +A+IGP ++ T  M GNYAG+A   
Sbjct: 382 PEAQQLAHTAAVEGMVLLKNDG-TLPLS--KHVRKLALIGPWANATTLMQGNYAGIAPYL 438

Query: 298 TTPLQGISRYAKTIHQA-GCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFID 356
            +PL G  +    +    G      N      AA  AA++ADA +   GLD+++E E +D
Sbjct: 439 ISPLLGAQQAGFDVEYVFGTNVTTTNDTSGFAAAVAAAKRADAVIFAGGLDETVEREEVD 498

Query: 357 RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
           R  +  PG Q +LV+ +A   + P+++    GG +D S  K+   + AI+W GYPGQ+GG
Sbjct: 499 RLNVTWPGNQLDLVAELASVGK-PLIVAQFGGGQLDDSALKSKRSVNAIIWGGYPGQSGG 557

Query: 417 AAIADVLFGRANP 429
            A+ D+L G+A P
Sbjct: 558 TALFDILTGKAAP 570


>gi|396473219|ref|XP_003839293.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
 gi|312215862|emb|CBX95814.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
          Length = 789

 Score =  322 bits (825), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 180/433 (41%), Positives = 251/433 (57%), Gaps = 10/433 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLK-VAA 61
           N    GL +W+PN+N FRDPRWGRGQETPGED      Y  + + GLQG T    K V A
Sbjct: 137 NANRTGLDFWTPNINPFRDPRWGRGQETPGEDAFHLSSYVKALIAGLQGETTDPYKRVVA 196

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+  YD+++WNG  RY F+A++S+QDL + Y  PF+AC V+  V + MCSYN VNG 
Sbjct: 197 TCKHFAGYDIEDWNGNLRYQFDAQISQQDLVEYYLQPFQAC-VQANVGAFMCSYNAVNGV 255

Query: 122 PTCADPDILKNTIHGQW---RLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           PTCADP +L+  +   W     + ++ SDCD+V  +Y    ++ T E+A ADA+ AG DL
Sbjct: 256 PTCADPYLLQTILREHWGWTNEEQWVTSDCDAVQNIYLPHQWSATREQAVADALIAGTDL 315

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG ++  H  GA   GL+ E  ++ AL    +  +RLG FD + + QP+   G   V T
Sbjct: 316 DCGTYMQEHLPGAFAQGLVNENVLDQALVRQYSSLVRLGWFD-DAADQPYRQFGWDSVAT 374

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
            A Q LA +AA +GIVLLKN    LPLS     ++ V G  ++ T  ++GNYAGV     
Sbjct: 375 DASQALARRAAVEGIVLLKNDG-VLPLSIDSSVSLGVFGDWANATSQLLGNYAGVPTYLH 433

Query: 299 TPLQGISRYAKTIHQAGCF--GVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFID 356
           +PL  + +   TI+ AG    G          +   A   +D  + + G+D SIE E  D
Sbjct: 434 SPLWALQQENLTINYAGGNPGGQGDPTTNRWSSLSGAIATSDILIYIGGIDNSIEEEGHD 493

Query: 357 RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
           R  L   G Q +++ ++A   + P ++V+M GG +D +   N+  I AILW GYPGQ GG
Sbjct: 494 RTSLAWTGAQLDVIFQLAATGK-PTIVVVMGGGQIDSAPLANNANISAILWAGYPGQDGG 552

Query: 417 AAIADVLFGRANP 429
            AI D+L G++ P
Sbjct: 553 PAIVDILTGKSPP 565


>gi|291525508|emb|CBK91095.1| Beta-glucosidase-related glycosidases [Eubacterium rectale DSM
           17629]
          Length = 714

 Score =  321 bits (823), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 179/449 (39%), Positives = 265/449 (59%), Gaps = 34/449 (7%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           +G   GLT+W+PNVNIFRDPRWGRG ET GEDP LTGK   +Y+RGLQG+    LK AAC
Sbjct: 98  HGIYKGLTFWAPNVNIFRDPRWGRGHETYGEDPYLTGKLGCAYIRGLQGDDPDHLKSAAC 157

Query: 63  CKHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
            KH+  +     +G +  R+ F+A+ SK D+ DTY   FK CV + KV +VM +YN+VNG
Sbjct: 158 AKHFAVH-----SGPEAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNG 212

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
           +P C    +LK+ +  ++  +G++VSDC ++   +   H T T EE+AA A+  G DL+C
Sbjct: 213 EPACGSRTLLKDILRDEFGFEGHVVSDCWAILDFHEHHHVTDTVEESAAMAVNNGCDLNC 272

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           G    +H + A   GL+ +E +  A+   + V++RLGM    PS  P+ ++    V    
Sbjct: 273 GSAF-LHLKDAYDKGLVSDEAITAAVERLMEVRIRLGMMKDYPS--PYEDISYEVVECKE 329

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H +L+++AA + +VLLKN    LPL      T+AVIGPN++    +IGNY G +  Y TP
Sbjct: 330 HVELSVEAARRSLVLLKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITP 389

Query: 301 LQGISRY----AKTIHQAGCF-------GVACNGNQLIGAAEVAARQADATVLVMGLDQS 349
           L+G+ +Y     + ++  GC        G+A   ++    A + A Q+D  V+ +GLD +
Sbjct: 390 LEGLQQYLGEDTRVLYAEGCHLYKDKVQGLAEEKDRF-KEALIMAEQSDVVVMCLGLDAT 448

Query: 350 IEAE---------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDP 400
           IE E           D+ GL+LPG Q+EL+  VA   + PV+LVL  G  +D+S+A+   
Sbjct: 449 IEGEEGDAGNEYASGDKLGLMLPGLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAEE-- 505

Query: 401 RIGAILWVGYPGQAGGAAIADVLFGRANP 429
            + AI+   YPG  GG A+A+ +FG  +P
Sbjct: 506 HVDAIIDSWYPGARGGKAVAEAIFGEYSP 534


>gi|291528382|emb|CBK93968.1| Beta-glucosidase-related glycosidases [Eubacterium rectale M104/1]
          Length = 714

 Score =  321 bits (823), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 179/449 (39%), Positives = 265/449 (59%), Gaps = 34/449 (7%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           +G   GLT+W+PNVNIFRDPRWGRG ET GEDP LTGK   +Y+RGLQG+    LK AAC
Sbjct: 98  HGIYKGLTFWAPNVNIFRDPRWGRGHETYGEDPYLTGKLGCAYIRGLQGDDPDHLKSAAC 157

Query: 63  CKHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
            KH+  +     +G +  R+ F+A+ SK D+ DTY   FK CV + KV +VM +YN+VNG
Sbjct: 158 AKHFAVH-----SGPEAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNG 212

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
           +P C    +LK+ +  ++  +G++VSDC ++   +   H T T EE+AA A+  G DL+C
Sbjct: 213 EPACGSRTLLKDILRDEFGFEGHVVSDCWAILDFHEHHHVTDTVEESAAMAVNNGCDLNC 272

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           G    +H + A   GL+ +E +  A+   + V++RLGM    PS  P+ ++    V    
Sbjct: 273 GSAF-LHLKDAYDKGLVSDEAITAAVERLMEVRIRLGMMKDYPS--PYEDISYEVVECKE 329

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H +L+++AA + +VLLKN    LPL      T+AVIGPN++    +IGNY G +  Y TP
Sbjct: 330 HVELSVEAARRSLVLLKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITP 389

Query: 301 LQGISRY----AKTIHQAGCF-------GVACNGNQLIGAAEVAARQADATVLVMGLDQS 349
           L+G+ +Y     + ++  GC        G+A   ++    A + A Q+D  V+ +GLD +
Sbjct: 390 LEGLQQYLGDDTRVLYAEGCHLYKDKVQGLAEEKDRF-KEALIMAEQSDVVVMCLGLDAT 448

Query: 350 IEAE---------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDP 400
           IE E           D+ GL+LPG Q+EL+  VA   + PV+LVL  G  +D+S+A+   
Sbjct: 449 IEGEEGDAGNEYASGDKLGLMLPGLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAEE-- 505

Query: 401 RIGAILWVGYPGQAGGAAIADVLFGRANP 429
            + AI+   YPG  GG A+A+ +FG  +P
Sbjct: 506 HVDAIIDSWYPGARGGKAVAEAIFGEYSP 534


>gi|442803736|ref|YP_007371885.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
           DSM 8532]
 gi|442739586|gb|AGC67275.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
           DSM 8532]
          Length = 715

 Score =  321 bits (822), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 189/447 (42%), Positives = 256/447 (57%), Gaps = 32/447 (7%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G   GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQGN    LK AAC 
Sbjct: 108 GIYKGLTFWSPNINIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQGNHPKYLKAAACA 167

Query: 64  KHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
           KH+  +     +G +  R+ FNA VSK+DL +TY   FKA V E KV SVM +YN+ NG+
Sbjct: 168 KHFAVH-----SGPESLRHEFNAVVSKKDLYETYLPAFKALVQEAKVESVMGAYNRTNGE 222

Query: 122 PTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG 181
           P C    +L + + G+W   G++VSDC ++   +   H T T  E+AA A++ G DL+CG
Sbjct: 223 PCCGSKTLLSDILRGEWGFKGHVVSDCWAIRDFHMHHHVTATAPESAALAVRNGCDLNCG 282

Query: 182 PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAH 241
                +   A++ GL+ EE+++ A+   +  +M+LGMFD E    P+ ++    V    H
Sbjct: 283 NMFG-NLLIALKEGLITEEEIDRAVTRLMITRMKLGMFDPEDQV-PYASISYDFVDCKEH 340

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
           ++LAL  A + IVLLKN    LPL   +  ++AVIGPN+D    +IGNY G A  Y T L
Sbjct: 341 RELALDVAKKSIVLLKNDG-LLPLDRKKIRSIAVIGPNADSRQALIGNYEGTASEYVTVL 399

Query: 302 QGISRYA----KTIHQAGCFGVACNGNQL------IGAAEVAARQADATVLVMGLDQSIE 351
            GI   A    +  +  GC         L      I  A   A  AD  ++ +GLD +IE
Sbjct: 400 DGIREMAGDDVRIYYSVGCHLYKDRVENLGEPGDRIAEAVTCAEHADVVIMCLGLDSTIE 459

Query: 352 AEFI---------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
            E +         D+  L LPG+QQEL+  V  A+  P+VLVL+ G  + V++A  D  I
Sbjct: 460 GEEMHESNIYGSGDKPDLNLPGQQQELLEAVY-ATGKPIVLVLLTGSALAVTWA--DEHI 516

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANP 429
            AIL   YPG  GG AIA VLFG  NP
Sbjct: 517 PAILNAWYPGALGGRAIASVLFGETNP 543


>gi|392590128|gb|EIW79457.1| glycoside hydrolase family 3 protein [Coniophora puteana RWD-64-598
           SS2]
          Length = 770

 Score =  320 bits (821), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 183/433 (42%), Positives = 257/433 (59%), Gaps = 12/433 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG--NTGSRLKVA 60
           N G AGL +W+PN+N F+DPRWGRGQETPGEDP    +Y  + ++GLQG  N     +V 
Sbjct: 139 NYGHAGLDFWTPNINPFKDPRWGRGQETPGEDPYHIAQYVYNLIQGLQGGVNPEPYFQVV 198

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A CKH+  YDL++W    RY F+A ++ QDL + Y   F++C  + +  + MCSYN VNG
Sbjct: 199 ATCKHFAGYDLEDWENNFRYGFDALITTQDLSEFYLPSFQSCYRDAQAGASMCSYNAVNG 258

Query: 121 KPTCADPDILKNTIHGQWRLDG--YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            PTCAD  +L++ +   W  D   ++ SDCD+V  +YN  +YT  P++AAADA++AG DL
Sbjct: 259 IPTCADTYLLQDILRDYWNFDETRWVTSDCDAVENIYNPHNYTALPQQAAADALRAGTDL 318

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSA-QPFGNLGPRDVC 237
           DCG F   +   A    L+ E ++  AL       +RLG FD  P+A QP+   G  +V 
Sbjct: 319 DCGTFYTEYLPLAYNQSLITETELRAALTRQYASLVRLGYFD--PAAQQPYRQYGWSNVD 376

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPL-STLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
           TP  QQLA  AA +GI LLKN   TLPL STL++  +A+IGP ++ T  M GNY GVA  
Sbjct: 377 TPYAQQLAYTAATEGITLLKNDG-TLPLPSTLKN--IALIGPWANATNQMQGNYFGVAPY 433

Query: 297 YTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFID 356
             +PLQG       +       +  N      AA  AAR+ADA V   G+D ++EAE +D
Sbjct: 434 LVSPLQGALAAGYNVTYVFGTNITSNSTAGFAAAIAAAREADAVVYAGGIDVTVEAEAMD 493

Query: 357 RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
           R  +  PG Q +L+  +A   + P V+    GG VD +  K +  + +++W GYPGQ+GG
Sbjct: 494 RYNVTWPGNQLQLIGELAALGK-PFVVAQFGGGQVDDTEIKANASVNSLIWAGYPGQSGG 552

Query: 417 AAIADVLFGRANP 429
            A+ D++ G+  P
Sbjct: 553 QALFDIISGKVAP 565


>gi|238923424|ref|YP_002936940.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
 gi|238875099|gb|ACR74806.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
          Length = 714

 Score =  320 bits (821), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 178/449 (39%), Positives = 265/449 (59%), Gaps = 34/449 (7%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           +G   GLT+W+PNVNIFRDPRWGRG ET GEDP LTGK   +Y+RGLQG+    LK AAC
Sbjct: 98  HGIYKGLTFWAPNVNIFRDPRWGRGHETYGEDPYLTGKLGCAYIRGLQGDDPDHLKSAAC 157

Query: 63  CKHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
            KH+  +     +G +  R+ F+A+ SK D+ DTY   FK CV + KV +VM +YN+VNG
Sbjct: 158 AKHFAVH-----SGPEAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNG 212

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
           +P C    +LK+ +  ++  +G++VSDC ++   +   H T T EE+AA A+  G DL+C
Sbjct: 213 EPACGSRTLLKDILRDEFGFEGHVVSDCWAILDFHEHHHVTDTVEESAAMAVNNGCDLNC 272

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           G    +H + A   G++ +E +  A+   + V++RLGM    PS  P+ ++    V    
Sbjct: 273 GSAF-LHLKDAYDKGMVSDEAITAAVERLMEVRIRLGMMKDYPS--PYEDISYEVVECKE 329

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H +L+++AA + +VLLKN    LPL      T+AVIGPN++    +IGNY G +  Y TP
Sbjct: 330 HVELSVEAARRSLVLLKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITP 389

Query: 301 LQGISRY----AKTIHQAGCF-------GVACNGNQLIGAAEVAARQADATVLVMGLDQS 349
           L+G+ +Y     + ++  GC        G+A   ++    A + A Q+D  V+ +GLD +
Sbjct: 390 LEGLQQYLGEDTRVLYAEGCHLYKDKVQGLAEEKDRF-KEALIMAEQSDVVVMCLGLDAT 448

Query: 350 IEAE---------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDP 400
           IE E           D+ GL+LPG Q+EL+  VA   + PV+LVL  G  +D+S+A+   
Sbjct: 449 IEGEEGDAGNEYASGDKLGLMLPGLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAEE-- 505

Query: 401 RIGAILWVGYPGQAGGAAIADVLFGRANP 429
            + AI+   YPG  GG A+A+ +FG  +P
Sbjct: 506 HVDAIIDSWYPGARGGKAVAEAIFGEYSP 534


>gi|297039776|gb|ADH95739.1| beta-xylosidase [Aspergillus fumigatus]
          Length = 771

 Score =  320 bits (820), Expect = 8e-85,   Method: Compositional matrix adjust.
 Identities = 188/431 (43%), Positives = 256/431 (59%), Gaps = 9/431 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG-SRLKVAA 61
           N G +GL +W+PN+N FRD RWGRGQETPGEDP+   +Y    V GLQ   G +  KV A
Sbjct: 145 NAGRSGLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQNGIGPANPKVVA 204

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+ AY L++WNGV R+ FNA VS QDL + Y  PFK+C  + +V +VMCSYN +NG 
Sbjct: 205 TCKHFAAYGLEDWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDARVDAVMCSYNALNGV 264

Query: 122 PTCADPDILKNTIHGQWRLDG---YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           P CAD  +L+  +   W+ D    +I SDC ++  +YN  ++T TP EAAA A+ AG DL
Sbjct: 265 PACADSYLLQTILREHWKWDEPGRWITSDCGAIDDIYNGHNFTTTPAEAAATALNAGTDL 324

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     +   A   GL   + ++ AL    +  ++LG FD     QP+ ++G  DV T
Sbjct: 325 DCGTVFPKYLGQAADEGLYSNQTLDRALVRLYSSFVKLGYFD-PAEDQPYRSIGWTDVDT 383

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PA + LA +AA +GIVLLKN  +TLPL      T+A+IGP ++ T  M GNY G A    
Sbjct: 384 PAVEALAHKAAGEGIVLLKND-KTLPLKA--KGTLALIGPYANATKQMQGNYEGPAKYIR 440

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           T L   ++    +  A    +  N      AA  AA+QAD  V   G+D +IEAE  DR 
Sbjct: 441 TLLWAATQAGYDVKYAAGTAINTNSTAGFDAALSAAKQADVVVYAGGIDNTIEAEGRDRT 500

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            +  PG Q  L+ +++K  + P+V+V   GG VD S   ++PR+ A+LW GYP Q GG+A
Sbjct: 501 TIAWPGNQVNLIDQLSKIGK-PLVVVQFGGGQVDDSSLLSNPRVNALLWAGYPSQEGGSA 559

Query: 419 IADVLFGRANP 429
           I D+L G+  P
Sbjct: 560 IFDILTGKTAP 570


>gi|115436902|ref|XP_001217674.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|121734342|sp|Q0CB82.1|BXLB_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|114188489|gb|EAU30189.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 765

 Score =  320 bits (820), Expect = 9e-85,   Method: Compositional matrix adjust.
 Identities = 177/432 (40%), Positives = 250/432 (57%), Gaps = 11/432 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG-SRLKVAA 61
           N G AGL YW+PN+N +RDPRWGRGQETPGEDP  T +Y    + GLQ   G  + K+ A
Sbjct: 145 NAGHAGLDYWTPNINPYRDPRWGRGQETPGEDPFHTSRYVYHLIDGLQDGIGPEKPKIVA 204

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+  YD+++W G +RY F+A +S QD+ + Y  PFK C  + KV +VMCSYN VNG 
Sbjct: 205 TCKHFAGYDIEDWEGNERYAFDAVISDQDMAEYYFPPFKTCTRDAKVDAVMCSYNSVNGI 264

Query: 122 PTCADPDILKNTIHGQWRLDG---YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           PTCADP +L+  +   W  +G   ++ SDC ++  +Y    Y      AAA A+ AG DL
Sbjct: 265 PTCADPWLLQTVLREHWEWEGVGHWVTSDCGAIDNIYKDHKYVADGAHAAAVAVNAGTDL 324

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSA-QPFGNLGPRDVC 237
           DCG         A+  GLL    ++ AL    +  ++LG FD  P+A QP+ ++G  DV 
Sbjct: 325 DCGSVYPQFLGSAISQGLLGNRTLDRALTRLYSSLVKLGYFD--PAADQPYRSIGWSDVA 382

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           TP  +QLA  AA +G VLLKN   TLPL   ++ TVA++GP ++ T  + GNY G A   
Sbjct: 383 TPDAEQLAHTAAVEGTVLLKNDG-TLPLK--KNGTVAIVGPYANATTQLQGNYEGTAKYI 439

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
            T L   ++    +  A   G+  N       A  AA+ +D  +   G+D  +EAE +DR
Sbjct: 440 HTMLSAAAQQGYKVKYAPGTGINSNSTSGFEQALNAAKGSDLVIYFGGIDHEVEAEALDR 499

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +  PG Q +L+ +++   + P+V+V   GG VD S   ++  +  +LW GYP QAGGA
Sbjct: 500 TSIAWPGNQLDLIQQLSDLKK-PLVVVQFGGGQVDDSSLLSNAGVNGLLWAGYPSQAGGA 558

Query: 418 AIADVLFGRANP 429
           A+ D+L G+  P
Sbjct: 559 AVFDILTGKTAP 570


>gi|398403795|ref|XP_003853364.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
 gi|339473246|gb|EGP88340.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
          Length = 785

 Score =  320 bits (819), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 178/434 (41%), Positives = 252/434 (58%), Gaps = 15/434 (3%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N   +GL YW+PN+N F+D RWGRGQETPGEDP     Y  S + GLQG+ G   KV A 
Sbjct: 141 NDARSGLDYWTPNINPFKDSRWGRGQETPGEDPYHLSSYVKSLIAGLQGD-GKYKKVVAT 199

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKH+ AYDL+ WNG  RY F+  V  Q+L + Y  PF+AC  +  V + MCSYN +NG P
Sbjct: 200 CKHFVAYDLETWNGNFRYQFDPHVGSQELVEYYMPPFQACARDANVGAFMCSYNSLNGIP 259

Query: 123 TCADPDILKNTIHGQWRL---DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           TCADP +L+  +   W     + ++ SDCDS+  +Y    YT T EEA A ++KAG D++
Sbjct: 260 TCADPYLLQTILREHWNWTSEEQWVTSDCDSIQNVYLPHEYTSTREEAVAVSLKAGTDVN 319

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG +      GA+  GL+ E+D+++AL    +  +RLG FDG  +A  + +L  +DV TP
Sbjct: 320 CGTYYQEFLPGALSLGLVTEKDIDMALIRQYSSLVRLGYFDG--TAVEYRSLSWKDVSTP 377

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
             QQLAL+AA +GI LLKN    LPL+  +   +AVIG  ++ T  M+GNY G+     +
Sbjct: 378 YAQQLALKAAVEGITLLKNDG-ILPLAITKDTKIAVIGDWANATEQMLGNYDGIPPYLHS 436

Query: 300 PLQGISRYAKTIHQAGC---FGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFID 356
           PL    +    +  +G     G     N L      A  +AD  +   G+D  +EAE +D
Sbjct: 437 PLWAAQQTGANVTYSGNPGGQGDPTTNNWL--HIWTAVDEADVILFAGGIDNGVEAEGMD 494

Query: 357 RAGLLLPGRQQELVSRVAKASRG-PVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           R  +   G Q +++ ++  ASRG PV++  M    VD +   N+  I A+LW GYPGQ G
Sbjct: 495 RVSIAWTGAQLDVIGQL--ASRGKPVIVAQMGTNGVDSTPLLNNQNISALLWGGYPGQDG 552

Query: 416 GAAIADVLFGRANP 429
           G A+ D++ G++ P
Sbjct: 553 GVALLDIIQGKSAP 566


>gi|119473971|ref|XP_001258861.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
 gi|292495290|sp|A1DJS5.1|XYND_NEOFI RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|119407014|gb|EAW16964.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
          Length = 771

 Score =  320 bits (819), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 191/433 (44%), Positives = 260/433 (60%), Gaps = 13/433 (3%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG-SRLKVAA 61
           N G AGL +W+PN+N FRD RWGRGQETPGEDP+   +Y    V GLQ   G +  KV A
Sbjct: 145 NAGRAGLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQNGIGPANPKVVA 204

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+ AYDL++WNGV R+ FNA VS QDL + Y  PFK+C  + KV +VMCSYN +NG 
Sbjct: 205 TCKHFAAYDLEDWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDAKVDAVMCSYNALNGV 264

Query: 122 PTCADPDILKNTIHGQWRLDG---YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           P CAD  +L+  +   W+ D    +I  DC ++  +YN  +YT+TP EAAA A+ AG DL
Sbjct: 265 PACADSYLLQTILREHWKWDEPGHWITGDCGAIDDIYNGHNYTKTPAEAAATALNAGTDL 324

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     +   A   GL   + ++ AL    +  ++LG FD     QP+ ++G +DV +
Sbjct: 325 DCGTVFPKYLGQAADEGLYTNKTLDKALVRLYSSLVKLGYFD-PAEDQPYRSIGWKDVDS 383

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PA + LA +AA +GIVLLKN  +TLPL      T+A+IGP ++ T  M GNY G      
Sbjct: 384 PAAEALAHKAAVEGIVLLKND-KTLPLKA--KGTLALIGPYANATKQMQGNYEGPPKYIR 440

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIG--AAEVAARQADATVLVMGLDQSIEAEFID 356
           T L   ++    +      G A N N   G  AA  AA+QAD  V   G+D +IEAE  D
Sbjct: 441 TLLWAATQAGYDVKYVA--GTAINANSTAGFDAALSAAKQADVVVYAGGIDNTIEAEGHD 498

Query: 357 RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
           R  ++ PG Q +L+ +++K  + P+V+V   GG VD S   ++P + A+LW GYP Q GG
Sbjct: 499 RTTIVWPGNQLDLIDQLSKIGK-PLVVVQFGGGQVDDSSLLSNPHVNALLWTGYPSQEGG 557

Query: 417 AAIADVLFGRANP 429
           +AI D+L G+  P
Sbjct: 558 SAIFDILTGKTAP 570


>gi|238508313|ref|XP_002385353.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
 gi|296439537|sp|B8NYD8.1|BXLB_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|220688872|gb|EED45224.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
          Length = 776

 Score =  320 bits (819), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 186/442 (42%), Positives = 258/442 (58%), Gaps = 31/442 (7%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG-SRLKVAA 61
           NGG AGL YW+PN+N FRDPRWGRGQETPGEDP+   +Y    V GLQ   G  R KV A
Sbjct: 143 NGGHAGLDYWTPNINPFRDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPERPKVVA 202

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+ AYDL+NW G++RY F+A VS QDL + Y   FK C  + KV +VMCSYN +NG 
Sbjct: 203 TCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGI 262

Query: 122 PTCADPDILKNTI--HGQWRLDGYIVS-DCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           PTCAD  +L+  +  H  W   G+ V+ DC ++  +Y   HY      AAA A+ AG DL
Sbjct: 263 PTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVADGAHAAAAALNAGTDL 322

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     +   A++ GL   + +N AL    +  ++LG FD     QP+ ++G  +V T
Sbjct: 323 DCGSVFPEYLRSALQQGLYNNQTLNNALIRLYSSLVKLGYFD-PADDQPYRSIGWNEVFT 381

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PA ++LA +A  +GIV+LKN   TLPL +  + TVA+IGP ++ T  + GNY        
Sbjct: 382 PAAEELAHKATVEGIVMLKNDG-TLPLKS--NGTVAIIGPFANATTQLQGNY-------- 430

Query: 299 TPLQGISRYAKTIHQAGCF---------GVACNGNQLIGAAEV--AARQADATVLVMGLD 347
              +G  +Y +T+  A            G   N N   G AE   AA++AD  +   G+D
Sbjct: 431 ---EGPPKYIRTLIWAAVHNGYKVKFSQGTDINSNSSAGFAEAISAAKEADTVIYAGGID 487

Query: 348 QSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILW 407
            +IE E  DR  ++ PG Q +L+ +++   + P+++V   GG VD S    +  +GA+LW
Sbjct: 488 NTIEKESQDRTTIVWPGNQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLW 546

Query: 408 VGYPGQAGGAAIADVLFGRANP 429
            GYP QAGGAA+ D+L G++ P
Sbjct: 547 AGYPSQAGGAAVFDILTGKSAP 568


>gi|389748262|gb|EIM89440.1| hypothetical protein STEHIDRAFT_182874, partial [Stereum hirsutum
           FP-91666 SS1]
          Length = 772

 Score =  319 bits (817), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 185/432 (42%), Positives = 250/432 (57%), Gaps = 9/432 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG--NTGSRLKVA 60
           N   AGLT+++PN+N FRDPRWGRGQETPGEDP    +Y   YV GLQG  +     KV 
Sbjct: 143 NFNYAGLTFFTPNINPFRDPRWGRGQETPGEDPYHLSRYVYQYVVGLQGGLSPDPYYKVL 202

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A CKH  AYD++NW G DR  FNA V+ QDL + Y   F+ C+ + + AS MCSYN VNG
Sbjct: 203 ANCKHVLAYDVENWEGNDRTGFNAVVTTQDLSEFYTPSFQGCLRDAQGASAMCSYNAVNG 262

Query: 121 KPTCADPDILKNTIHGQWRL---DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
            P+CA   ILK+ +   W L   +G+I  DC +V  +Y    YT T   A A A+ AG D
Sbjct: 263 VPSCASSYILKDLVRDFWGLGEREGWITGDCGAVQNIYQPHGYTDTLVNATAVAMDAGTD 322

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           LDCG   + +   AV  GL+    +  AL       +RLG FD     QP+ +    +V 
Sbjct: 323 LDCGDVYSPNLWTAVVEGLITAGQIQTALIRLYGSLIRLGYFD-PAEQQPYRSFDWSNVN 381

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           TP+ Q LA  AA QGIVLL+N    LPLST     +A+IGP ++ T+++ GNYAG+A   
Sbjct: 382 TPSSQDLAYNAAVQGIVLLENDG-LLPLST-NVKNIALIGPMANATLSLQGNYAGIAPFV 439

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
            +P Q        +  A   G++ + N     A  AA+ AD  V V G+D SIEAE  DR
Sbjct: 440 ISPQQAFETAGYNVTFAFGTGISNSDNSGYSEALEAAQGADVVVFVGGIDNSIEAEGQDR 499

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +  PG Q +L+ ++ +  + P+V+V M GG  D S  K +  + A+LW GYPGQ+GG 
Sbjct: 500 TSIEWPGSQLDLIGQLGELGK-PLVVVRMGGGQCDDSTLKANATVNALLWAGYPGQSGGT 558

Query: 418 AIADVLFGRANP 429
           A+ D++ G+ +P
Sbjct: 559 ALVDIISGKQSP 570


>gi|317158006|ref|XP_001826724.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
          Length = 776

 Score =  318 bits (816), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 186/442 (42%), Positives = 258/442 (58%), Gaps = 31/442 (7%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG-SRLKVAA 61
           NGG AGL YW+PN+N FRDPRWGRGQETPGEDP+   +Y    V GLQ   G  R KV A
Sbjct: 143 NGGHAGLDYWTPNINPFRDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPERPKVVA 202

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+ AYDL+NW G++RY F+A VS QDL + Y   FK C  + KV +VMCSYN +NG 
Sbjct: 203 TCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGI 262

Query: 122 PTCADPDILKNTI--HGQWRLDGYIVS-DCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           PTCAD  +L+  +  H  W   G+ V+ DC ++  +Y   HY      AAA A+ AG DL
Sbjct: 263 PTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVADGAHAAAAALNAGTDL 322

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     +   A++ GL   + +N AL    +  ++LG FD     QP+ ++G  +V T
Sbjct: 323 DCGSVFPEYLGSALQQGLYNNQTLNNALIRLYSSLVKLGYFD-PADDQPYRSIGWNEVFT 381

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PA ++LA +A  +GIV+LKN   TLPL +  + TVA+IGP ++ T  + GNY        
Sbjct: 382 PAAEELAHKATVEGIVMLKNDG-TLPLKS--NGTVAIIGPFANATTQLQGNY-------- 430

Query: 299 TPLQGISRYAKTIHQAGCF---------GVACNGNQLIGAAEV--AARQADATVLVMGLD 347
              +G  +Y +T+  A            G   N N   G AE   AA++AD  +   G+D
Sbjct: 431 ---EGPPKYIRTLIWAAVHNGYKVKFSQGTDINSNSSAGFAEAISAAKEADTVIYAGGID 487

Query: 348 QSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILW 407
            +IE E  DR  ++ PG Q +L+ +++   + P+++V   GG VD S    +  +GA+LW
Sbjct: 488 NTIEKESQDRTTIVWPGNQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLW 546

Query: 408 VGYPGQAGGAAIADVLFGRANP 429
            GYP QAGGAA+ D+L G++ P
Sbjct: 547 AGYPSQAGGAAVFDILTGKSAP 568


>gi|121797681|sp|Q2TYT2.1|BXLB_ASPOR RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|83775471|dbj|BAE65591.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 797

 Score =  318 bits (815), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 186/442 (42%), Positives = 258/442 (58%), Gaps = 31/442 (7%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG-SRLKVAA 61
           NGG AGL YW+PN+N FRDPRWGRGQETPGEDP+   +Y    V GLQ   G  R KV A
Sbjct: 164 NGGHAGLDYWTPNINPFRDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPERPKVVA 223

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+ AYDL+NW G++RY F+A VS QDL + Y   FK C  + KV +VMCSYN +NG 
Sbjct: 224 TCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGI 283

Query: 122 PTCADPDILKNTI--HGQWRLDGYIVS-DCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           PTCAD  +L+  +  H  W   G+ V+ DC ++  +Y   HY      AAA A+ AG DL
Sbjct: 284 PTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVADGAHAAAAALNAGTDL 343

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     +   A++ GL   + +N AL    +  ++LG FD     QP+ ++G  +V T
Sbjct: 344 DCGSVFPEYLGSALQQGLYNNQTLNNALIRLYSSLVKLGYFD-PADDQPYRSIGWNEVFT 402

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PA ++LA +A  +GIV+LKN   TLPL +  + TVA+IGP ++ T  + GNY        
Sbjct: 403 PAAEELAHKATVEGIVMLKNDG-TLPLKS--NGTVAIIGPFANATTQLQGNY-------- 451

Query: 299 TPLQGISRYAKTIHQAGCF---------GVACNGNQLIGAAEV--AARQADATVLVMGLD 347
              +G  +Y +T+  A            G   N N   G AE   AA++AD  +   G+D
Sbjct: 452 ---EGPPKYIRTLIWAAVHNGYKVKFSQGTDINSNSSAGFAEAISAAKEADTVIYAGGID 508

Query: 348 QSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILW 407
            +IE E  DR  ++ PG Q +L+ +++   + P+++V   GG VD S    +  +GA+LW
Sbjct: 509 NTIEKESQDRTTIVWPGNQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLW 567

Query: 408 VGYPGQAGGAAIADVLFGRANP 429
            GYP QAGGAA+ D+L G++ P
Sbjct: 568 AGYPSQAGGAAVFDILTGKSAP 589


>gi|332982588|ref|YP_004464029.1| glycoside hydrolase [Mahella australiensis 50-1 BON]
 gi|332700266|gb|AEE97207.1| glycoside hydrolase family 3 domain protein [Mahella australiensis
           50-1 BON]
          Length = 714

 Score =  317 bits (812), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 184/443 (41%), Positives = 257/443 (58%), Gaps = 29/443 (6%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G   GLT+WSPN+NIFRDPRWGRG ET GEDP LT +   ++V+GLQG+    LKV A  
Sbjct: 114 GIYKGLTFWSPNINIFRDPRWGRGHETYGEDPYLTSRMGVAFVKGLQGDDPKYLKVVATP 173

Query: 64  KHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
           KHY  +   +     R+ F+ARVS++DL +TY   F+ CV EGK  S+M +YN+ NG+P 
Sbjct: 174 KHYAVH---SGPESQRHSFDARVSQKDLRETYLPAFEECVKEGKAVSIMGAYNRTNGEPC 230

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPF 183
           CA   +LK+ +  +W  DGY+VSDC ++  ++     T+T  E+AA A+  G +L+CG  
Sbjct: 231 CASKTLLKDILRDEWGFDGYVVSDCGAIDDIHMHHKVTKTAAESAALAVNNGCELNCGKT 290

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC-TPAHQ 242
                + AV  GL+ EE ++ A+    T +MRLGMFD  P    + ++ P DV  +P H+
Sbjct: 291 YEYLCQ-AVEQGLISEETIDQAVIKLFTARMRLGMFD-PPEMVRYAHI-PYDVNDSPEHR 347

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
           +LAL+ A Q IVLLKN    LPLS  +  T+AVIGPN+D    ++ NY G    Y TPL+
Sbjct: 348 ELALETARQSIVLLKNDENILPLSK-KLKTIAVIGPNADDLDVLLANYFGTPSKYVTPLE 406

Query: 303 GI----SRYAKTIHQAGCFGVACNGNQLIGAAEVA--ARQADATVLVMGLDQSIEAEF-- 354
           GI    S   K ++  GC      GN + G  E    A  AD  ++ +GL   IE E   
Sbjct: 407 GIKNKVSPDTKVLYAKGC---EVTGNSVDGFDEAVNIAEMADIVIMCLGLSPRIEGEEGD 463

Query: 355 -------IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILW 407
                   DR  + LPG Q++L+  +    + P+VLVL+ G  + +++A     + AI+ 
Sbjct: 464 VADSDGGGDRLHIDLPGMQEQLLETIYGTGK-PIVLVLLNGSAIAINWAHE--HVPAIIE 520

Query: 408 VGYPGQAGGAAIADVLFGRANPG 430
             YPG+ GG AIADVLFG  NP 
Sbjct: 521 AWYPGEEGGTAIADVLFGDYNPA 543


>gi|392560759|gb|EIW53941.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
           SS1]
          Length = 783

 Score =  315 bits (808), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 190/433 (43%), Positives = 260/433 (60%), Gaps = 11/433 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR--LKVA 60
           N G AGL YW+PN+N F+DPRWGRGQETPGEDP    +Y  + + GLQG    +   KV 
Sbjct: 143 NAGRAGLDYWTPNINPFKDPRWGRGQETPGEDPFHLSQYVYNLILGLQGGLDPKPYFKVV 202

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A CKH+ AYDL+NW G+ R  F+A VS+QDL + Y  PF+ CV + KVASVMCSYN VNG
Sbjct: 203 ADCKHFAAYDLENWEGIVRNGFDAIVSQQDLSEFYLPPFQTCVRDAKVASVMCSYNAVNG 262

Query: 121 KPTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+CA+  +L++ +   W    D ++ SDCD+V  +     YT  P +AAADA+ AG D+
Sbjct: 263 IPSCANSFLLQDVLRDHWGFTDDRWVTSDCDAVENILTPHKYTTDPAQAAADALLAGTDI 322

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG F + +   A++ GL+   D+  A        +RLG FD +P+AQP+  LG  DV T
Sbjct: 323 DCGTFSSTYLPEALQRGLVNSTDLRRAAIRQYASLVRLGYFD-DPAAQPYRQLGWSDVNT 381

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRH-HTVAVIGPNSDVTVTMIGNYAGVACGY 297
           P  QQLA  AA +GIVLLKN    LP S  +H   +A+IGP ++ T  + G+Y GVA   
Sbjct: 382 PQAQQLAHTAAVEGIVLLKNDG-VLPFS--KHVRKLALIGPWANATSLLQGSYIGVAPYL 438

Query: 298 TTPLQGISRYAKTIHQA-GCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFID 356
            +PLQG       +    G      N      AA  A R+ADA V   GLD+++E E  D
Sbjct: 439 VSPLQGAQEAGFEVEYVLGTNVTTQNDMSGFAAAVAAVRRADAVVFAGGLDETVECEGTD 498

Query: 357 RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
           R  +  PG Q +LV+ + +  + P+++    GG +D +  K+   + AI+W GYPGQ+GG
Sbjct: 499 RLNVTWPGNQLDLVAELERVGK-PLIVAQFGGGQLDDTALKHSKAVNAIIWGGYPGQSGG 557

Query: 417 AAIADVLFGRANP 429
            A+ D+L G+A P
Sbjct: 558 TALFDILTGKAAP 570


>gi|391864313|gb|EIT73609.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
          Length = 797

 Score =  315 bits (808), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 185/442 (41%), Positives = 257/442 (58%), Gaps = 31/442 (7%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG-SRLKVAA 61
           NGG AGL YW+PN+N FRDPRWGRGQETPGEDP+   +Y    V GLQ   G  R KV A
Sbjct: 164 NGGHAGLDYWTPNINPFRDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPERPKVVA 223

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+ AYDL+NW G++RY F+A VS QDL + Y   FK C  + KV +VMCSYN +NG 
Sbjct: 224 TCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGI 283

Query: 122 PTCADPDILKNTI--HGQWRLDGYIVS-DCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           PTCAD  +L+  +  H  W   G+ V+ DC ++  +Y   HY      AAA A+ AG DL
Sbjct: 284 PTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVADGAHAAAAALNAGTDL 343

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     +   A++ GL   + +  AL    +  ++LG FD     QP+ ++G  +V T
Sbjct: 344 DCGSVFPEYLGSALQQGLYNNQTLYNALIRLYSSLVKLGYFD-PADDQPYRSIGWNEVFT 402

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PA ++LA +A  +GIV+LKN   TLPL +  + TVA+IGP ++ T  + GNY        
Sbjct: 403 PAAEELAHKATVEGIVMLKNDG-TLPLKS--NGTVAIIGPFANATTQLQGNY-------- 451

Query: 299 TPLQGISRYAKTIHQAGCF---------GVACNGNQLIGAAEV--AARQADATVLVMGLD 347
              +G  +Y +T+  A            G   N N   G AE   AA++AD  +   G+D
Sbjct: 452 ---EGPPKYIRTLIWAAVHNGYKVKFSQGTDINSNSSAGFAEAISAAKEADTVIYAGGID 508

Query: 348 QSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILW 407
            +IE E  DR  ++ PG Q +L+ +++   + P+++V   GG VD S    +  +GA+LW
Sbjct: 509 NTIEKESQDRTTIVWPGNQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLW 567

Query: 408 VGYPGQAGGAAIADVLFGRANP 429
            GYP QAGGAA+ D+L G++ P
Sbjct: 568 AGYPSQAGGAAVFDILTGKSAP 589


>gi|336377735|gb|EGO18896.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 766

 Score =  315 bits (807), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 183/431 (42%), Positives = 258/431 (59%), Gaps = 8/431 (1%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR--LKVA 60
           N G AGL +W+PN+N F+DPRWGRGQETPGEDP    +Y  + V+GLQG    +   +V 
Sbjct: 139 NYGRAGLDFWTPNINPFKDPRWGRGQETPGEDPYHLAQYVYNLVQGLQGGLDPKPYYQVI 198

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           + CKH+ AYDL++W+G  RY F+A V+ QDL + Y   F++C  + KV + MCSYN VNG
Sbjct: 199 STCKHFAAYDLEDWDGNYRYGFDAIVTTQDLSEYYLPSFQSCYRDAKVGAAMCSYNAVNG 258

Query: 121 KPTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+CA+  +L++ +   W    D ++ SDCD+V  +Y+  +YT+TPEEA ADA+KAG D+
Sbjct: 259 IPSCANTYLLQSILRDFWGFAEDRWVTSDCDAVDNIYDPHNYTKTPEEAVADALKAGTDI 318

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG F + +  GA    L+ E ++  AL       +RLG FD     QP+      +V T
Sbjct: 319 DCGTFYSEYLPGAYNQSLITETELRQALIRQYASLVRLGYFD-PTDIQPYRQYNWNNVDT 377

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           P  QQLA QAA +GIVLLKN   TLPLS+     +A+IGP  + T  M GNY GVA    
Sbjct: 378 PQAQQLAYQAAAEGIVLLKNDG-TLPLSS-DIKNIALIGPWGNATGEMQGNYYGVAPYLI 435

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           +PL G       +       +  N      AA  AA+ AD  +   G+D+++E+E  DR 
Sbjct: 436 SPLMGAVATGYNVTYVFGTNITSNDTSGFAAAIAAAQGADVVIYAGGIDETVESEGNDRN 495

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            +  PG Q +LV  +A   + P+V+V   GG VD +  K +  + A+LW GYPGQ+GG+A
Sbjct: 496 YITWPGNQLDLVGELAAVGK-PLVVVQFGGGQVDDTSLKANSTVNALLWAGYPGQSGGSA 554

Query: 419 IADVLFGRANP 429
           + D++ G+  P
Sbjct: 555 LFDIISGKVAP 565


>gi|336365124|gb|EGN93476.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 732

 Score =  315 bits (807), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 183/431 (42%), Positives = 258/431 (59%), Gaps = 8/431 (1%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR--LKVA 60
           N G AGL +W+PN+N F+DPRWGRGQETPGEDP    +Y  + V+GLQG    +   +V 
Sbjct: 124 NYGRAGLDFWTPNINPFKDPRWGRGQETPGEDPYHLAQYVYNLVQGLQGGLDPKPYYQVI 183

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           + CKH+ AYDL++W+G  RY F+A V+ QDL + Y   F++C  + KV + MCSYN VNG
Sbjct: 184 STCKHFAAYDLEDWDGNYRYGFDAIVTTQDLSEYYLPSFQSCYRDAKVGAAMCSYNAVNG 243

Query: 121 KPTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+CA+  +L++ +   W    D ++ SDCD+V  +Y+  +YT+TPEEA ADA+KAG D+
Sbjct: 244 IPSCANTYLLQSILRDFWGFAEDRWVTSDCDAVDNIYDPHNYTKTPEEAVADALKAGTDI 303

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG F + +  GA    L+ E ++  AL       +RLG FD     QP+      +V T
Sbjct: 304 DCGTFYSEYLPGAYNQSLITETELRQALIRQYASLVRLGYFD-PTDIQPYRQYNWNNVDT 362

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           P  QQLA QAA +GIVLLKN   TLPLS+     +A+IGP  + T  M GNY GVA    
Sbjct: 363 PQAQQLAYQAAAEGIVLLKNDG-TLPLSS-DIKNIALIGPWGNATGEMQGNYYGVAPYLI 420

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           +PL G       +       +  N      AA  AA+ AD  +   G+D+++E+E  DR 
Sbjct: 421 SPLMGAVATGYNVTYVFGTNITSNDTSGFAAAIAAAQGADVVIYAGGIDETVESEGNDRN 480

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            +  PG Q +LV  +A   + P+V+V   GG VD +  K +  + A+LW GYPGQ+GG+A
Sbjct: 481 YITWPGNQLDLVGELAAVGK-PLVVVQFGGGQVDDTSLKANSTVNALLWAGYPGQSGGSA 539

Query: 419 IADVLFGRANP 429
           + D++ G+  P
Sbjct: 540 LFDIISGKVAP 550


>gi|302683060|ref|XP_003031211.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
 gi|300104903|gb|EFI96308.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
          Length = 761

 Score =  315 bits (806), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 190/431 (44%), Positives = 252/431 (58%), Gaps = 8/431 (1%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS-RLKVAA 61
           N G+AGL YW+PN+N F+DPRWGRG ETPGEDP+   +Y  + V GLQG      LKVAA
Sbjct: 138 NAGLAGLDYWTPNINPFKDPRWGRGAETPGEDPLHAARYVRTLVEGLQGGIDPPSLKVAA 197

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+ AYDL++W GV RY F+A V+ QDL + Y+ PFK+CV + + ASVMCSYN VNG 
Sbjct: 198 DCKHWAAYDLEDWGGVARYAFDAVVTPQDLAEYYSPPFKSCVRDARAASVMCSYNAVNGV 257

Query: 122 PTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           P CA P +LK  +   W L  D ++ SDCD+VG +Y+   YT      +A ++KAG DLD
Sbjct: 258 PACASPYLLKTVLRDAWGLAEDRWVTSDCDAVGNVYDPHGYTEDFVNGSAVSLKAGSDLD 317

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG   + +   A   GL+ E+D+  AL       + LG FD  P  QP+  +   DV TP
Sbjct: 318 CGTTYSQYLPEAYDRGLIDEDDLKAALTRLYASLVWLGYFDA-PEDQPYRQISWADVNTP 376

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVT-VTMIGNYAGVACGYT 298
           A Q LA  AA +  VLLKN   TLPL T    ++A+IGP ++ + V + GNY G+     
Sbjct: 377 AAQALAYTAAIESFVLLKNDG-TLPL-TDSSLSIALIGPMANASAVQLQGNYNGIPPFAI 434

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
            PLQG       +       V  N    I  A  AA  AD  + V G+D ++E E  DR 
Sbjct: 435 APLQGFLDAGFNVTYVLGTNVTGNDADDIDGAVAAAEAADVVIYVGGIDSTVEEEAKDRT 494

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            +  P  Q  L+S + +A + P+V+V M GG +D +  K    + AILW GYPGQ+GG A
Sbjct: 495 EISWPDNQLALLSALEEAGK-PLVVVQMGGGQLDDTPLKESDAVNAILWAGYPGQSGGTA 553

Query: 419 IADVLFGRANP 429
           IAD + G+  P
Sbjct: 554 IADTVMGKVAP 564


>gi|330934749|ref|XP_003304687.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
 gi|311318569|gb|EFQ87188.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
          Length = 798

 Score =  314 bits (805), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 176/437 (40%), Positives = 249/437 (56%), Gaps = 18/437 (4%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN-TGSRLKVAA 61
           N    GL +W+PN+N FRDPRWGRGQETPGED      Y  + + GLQG  T    +V A
Sbjct: 139 NANRTGLDFWTPNINPFRDPRWGRGQETPGEDAYHLSSYVQALIHGLQGEATDPYKRVVA 198

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+  YD+++WNG  RY  + ++++QDL + Y  PF+AC V+  V + MCSYN VNG 
Sbjct: 199 TCKHFAGYDVEDWNGNLRYQNDVQITQQDLVEYYLAPFQAC-VQANVGAFMCSYNAVNGA 257

Query: 122 PTCADPDILKNTIHGQW---RLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           P CADP +L+  +   W   + + ++  DCD+V  +Y    ++ T   AAAD++ AG D+
Sbjct: 258 PPCADPYLLQTILREHWGWNKEEQWVTGDCDAVQNVYFPHQWSSTRAGAAADSLVAGTDI 317

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
            CG ++  H   A R  LL E  ++LAL    +  +RLG FD  P  QP+  LG   V T
Sbjct: 318 TCGTYMQEHLPAAFRQKLLNESSLDLALIRQYSSLVRLGYFDA-PENQPYRQLGFDAVAT 376

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
            A Q LA +AA +GIVLLKN   TLPLS     TV + G  ++ T  ++GNYAGVA    
Sbjct: 377 NASQALARRAAAEGIVLLKNDG-TLPLSLDSSMTVGLFGDWANATTQLLGNYAGVATYLH 435

Query: 299 TPLQGISRYAKTIHQAGCF------GVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
           +PL  + +    I+ AG              + L GA       +D  + V G+D  +E 
Sbjct: 436 SPLYALKQTGVKINYAGGKPGGQGDPTTNRWSNLYGAYST----SDVLIYVGGIDNGVEE 491

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E  DR  L   G Q +++ ++A+  + PV++V+  GG +D S   N+P I AI+W GYPG
Sbjct: 492 EGHDRGYLTWTGPQLDVIGQLAETGK-PVIVVVTGGGQIDSSPLVNNPNISAIMWAGYPG 550

Query: 413 QAGGAAIADVLFGRANP 429
           Q GG+AI D++ G+  P
Sbjct: 551 QDGGSAIIDIISGKTAP 567


>gi|392596548|gb|EIW85871.1| hypothetical protein CONPUDRAFT_80240 [Coniophora puteana
           RWD-64-598 SS2]
          Length = 770

 Score =  314 bits (805), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 179/431 (41%), Positives = 247/431 (57%), Gaps = 8/431 (1%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG--NTGSRLKVA 60
           N G AGL +W+PN+N F+DPRWGRGQETPGEDP    +Y  + ++GLQG  +     +V 
Sbjct: 139 NYGHAGLDFWTPNINPFKDPRWGRGQETPGEDPYHIAQYVYNLIQGLQGGLDPEPYFQVV 198

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A CKH+  YDL++W+   RY +NA +S QDL + Y   F++C  +    + MCSYN +NG
Sbjct: 199 ATCKHFAGYDLEDWDFNYRYGYNAIISTQDLSEYYLPSFQSCYRDAFAGASMCSYNAING 258

Query: 121 KPTCADPDILKNTIHGQWRLDG--YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            PTCAD  +L++ + G W  D   ++  DCDSV  +Y+  HYT  P++AAADA+KAG D+
Sbjct: 259 IPTCADTYLLQDILRGFWGFDQTRWVTGDCDSVEDIYDFHHYTALPQQAAADALKAGSDI 318

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG F       A    L+ E+D+  AL       +RLG FD   S QP+      +V T
Sbjct: 319 DCGIFYTTWLPLAYTESLITEQDLRAALTRQYASLVRLGYFD-PASEQPYRQYNWSNVDT 377

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
              Q+LA  AA +GI LLKN   TLP S+   + +A+IGP +  T  M GNY G A    
Sbjct: 378 SYAQELAYTAAVEGITLLKNDG-TLPFSSAIKN-IALIGPWTFATTQMQGNYYGNAPYLI 435

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           +P QG       I       V  N      AA  AA+ ADA V V G+D ++EAE +DR 
Sbjct: 436 SPYQGAQLAGYNISYVLETNVTSNTTDGYAAAFTAAQGADAIVFVGGIDNTVEAEAMDRN 495

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            +  P  Q  L+  + K  + P+V+V   GG VD +    +P + A+LW GYPGQ+GG A
Sbjct: 496 DITWPAFQLWLIGELGKLGK-PLVVVQFGGGQVDDTEINANPDVNALLWGGYPGQSGGQA 554

Query: 419 IADVLFGRANP 429
           + D++ G+  P
Sbjct: 555 LFDIISGKVAP 565


>gi|70986056|ref|XP_748529.1| beta-xylosidase [Aspergillus fumigatus Af293]
 gi|74668295|sp|Q4WFI6.1|BXLB_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|296439536|sp|B0Y0I4.1|BXLB_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|66846158|gb|EAL86491.1| beta-xylosidase, putative [Aspergillus fumigatus Af293]
 gi|159128339|gb|EDP53454.1| beta-xylosidase [Aspergillus fumigatus A1163]
          Length = 771

 Score =  314 bits (804), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 189/431 (43%), Positives = 257/431 (59%), Gaps = 9/431 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG-SRLKVAA 61
           N G +GL +W+PN+N FRD RWGRGQETPGEDP+   +Y    V GLQ   G +  KV A
Sbjct: 145 NAGRSGLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQNGIGPANPKVVA 204

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+ AYDL++WNGV R+ FNA VS QDL + Y  PFK+C  + +V +VMCSYN +NG 
Sbjct: 205 TCKHFAAYDLEDWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDARVDAVMCSYNALNGV 264

Query: 122 PTCADPDILKNTIHGQWRLDG---YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           P CAD  +L+  +   W+ D    +I SDC ++  +YN  ++T TP EAAA A+ AG DL
Sbjct: 265 PACADSYLLQTILREHWKWDEPGRWITSDCGAIDDIYNGHNFTTTPAEAAATALNAGTDL 324

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     +   A   GL   + ++ AL    +  ++LG FD     QP+ ++G  DV T
Sbjct: 325 DCGTVFPKYLGQAADEGLYSNQTLDRALVRLYSSLVKLGYFD-PAEDQPYRSIGWTDVDT 383

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PA + LA +AA +GIVLLKN  +TLPL      T+A+IGP ++ T  M GNY G A    
Sbjct: 384 PAAEALAHKAAGEGIVLLKND-KTLPLKA--KGTLALIGPYANATKQMQGNYEGPAKYIR 440

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           T L   ++    +  A    +  N      AA  AA+QAD  V   G+D +IEAE  DR 
Sbjct: 441 TLLWAATQAGYDVKYAAGTAINTNSTAGFDAALSAAKQADVVVYAGGIDNTIEAEGRDRT 500

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            +  PG Q  L+ +++K  + P+V+V   GG VD S   ++PR+ A+LW GYP Q GG+A
Sbjct: 501 TIAWPGNQVNLIDQLSKIGK-PLVVVQFGGGQVDDSSLLSNPRVNALLWAGYPSQEGGSA 559

Query: 419 IADVLFGRANP 429
           I D+L G+  P
Sbjct: 560 IFDILTGKTAP 570


>gi|302683012|ref|XP_003031187.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
 gi|300104879|gb|EFI96284.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
          Length = 752

 Score =  311 bits (798), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 186/431 (43%), Positives = 245/431 (56%), Gaps = 8/431 (1%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS-RLKVAA 61
           N G AGL YW+PN+N F+DPRWGRG ETPGEDP+   +Y  S V GLQG      LKVAA
Sbjct: 137 NAGRAGLDYWTPNINPFKDPRWGRGAETPGEDPLHAARYVRSLVEGLQGGIDPPSLKVAA 196

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+ AYDL+NW GV RY F+A V+ QDL + Y  PF++CV + + AS MCSYN VNG 
Sbjct: 197 ACKHWAAYDLENWGGVTRYAFDAVVTPQDLAEYYAPPFRSCVRDARAASAMCSYNAVNGV 256

Query: 122 PTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           P CA P +LK  +   W L  D ++ SDC +VG +Y+   YT     A+  ++KAG DL+
Sbjct: 257 PACASPYLLKTVLRDAWGLAEDRWVTSDCGAVGNVYDPHGYTEDLVNASTVSLKAGTDLN 316

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG     +   A   GL+ E+D+  AL       + LG FD  P  QP+  +   DV TP
Sbjct: 317 CGTNYTQYLPEAYDRGLIDEDDLKAALTRLYASLVWLGYFDA-PEDQPYRQITWADVNTP 375

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVT-VTMIGNYAGVACGYT 298
             Q LA  AA +  VLLKN   TLPL T    ++A+IGP ++ + + M+GNY G+     
Sbjct: 376 EAQALAYTAAIKSFVLLKNDG-TLPL-TDSTLSLALIGPMANASALQMLGNYFGIPPFVI 433

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
            PLQG       +       V  N      AA  AA  AD  + V G+D ++E E  DR 
Sbjct: 434 APLQGFLDAGFNVTYVLGTNVTGNDAGSFDAAVAAAEAADVVIYVGGIDNTLEMEEKDRT 493

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            +  P  Q  L+S +    + P+V+V M GG +D +  K    + AILW GYPGQ+GG A
Sbjct: 494 EISWPDNQLALLSALEGVGK-PLVVVQMGGGQLDDTPLKESDAVNAILWAGYPGQSGGTA 552

Query: 419 IADVLFGRANP 429
           IAD + G+  P
Sbjct: 553 IADTVTGKVAP 563


>gi|23304843|emb|CAD48309.1| beta-xylosidase B [Clostridium stercorarium]
          Length = 715

 Score =  310 bits (795), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 185/445 (41%), Positives = 252/445 (56%), Gaps = 28/445 (6%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G   GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQGN    LK    C
Sbjct: 108 GIYKGLTFWSPNINIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQGNHPKYLKAGGMC 167

Query: 64  KHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
           K+   + +   +   R+ FNA VSK+DL +TY   FKA V E KV SVM +YN+ NG+P 
Sbjct: 168 KNILPFTVVPESL--RHEFNAVVSKKDLYETYLPAFKALVQEAKVESVMGAYNRTNGEPC 225

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPF 183
           C    +L + + G+W   G++VSDC ++   +   H T T  E+AA A++ G DL+CG  
Sbjct: 226 CGSKTLLSDILRGEWGFKGHVVSDCWAIRDFHMHHHVTATAPESAALAVRNGCDLNCGNM 285

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
              +   A++ GL+ EE+++ A+   +  +M+LGMFD E    P+ ++     C   H++
Sbjct: 286 FG-NLLIALKEGLITEEEIDRAVTRLMITRMKLGMFDPEDQV-PYASISSFVDCK-EHRE 342

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL  A + IVLLKN    LPL   +  ++AVIGPN+D    +IGNY G A  Y T L G
Sbjct: 343 LALDVAKKSIVLLKNDG-LLPLDRKKIRSIAVIGPNADSRQALIGNYEGTASEYVTVLDG 401

Query: 304 ISRYA----KTIHQAGCFGVACNGNQL------IGAAEVAARQADATVLVMGLDQSIEAE 353
           I   A    +  +  GC         L      I  A   A  AD  ++ +GLD +IE E
Sbjct: 402 IREMAGDDVRIYYSVGCHLYKDRVENLGEPGDRIAEAVTCAEHADVVIMCLGLDSTIEGE 461

Query: 354 FI---------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGA 404
            +         D+  L LPG+QQEL+  V  A+  P+VLVL+ G  + V++A  D  I A
Sbjct: 462 EMHESNIYGSGDKPDLNLPGQQQELLEAVY-ATGKPIVLVLLTGSALAVTWA--DEHIPA 518

Query: 405 ILWVGYPGQAGGAAIADVLFGRANP 429
           IL   YPG  GG AIA VLFG  NP
Sbjct: 519 ILNAWYPGALGGRAIASVLFGETNP 543


>gi|242216161|ref|XP_002473890.1| beta-xylosidase [Postia placenta Mad-698-R]
 gi|220726990|gb|EED80923.1| beta-xylosidase [Postia placenta Mad-698-R]
          Length = 741

 Score =  309 bits (792), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 177/431 (41%), Positives = 245/431 (56%), Gaps = 8/431 (1%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLK-VAA 61
           N   +G+ +W+PN+N F+DPRWGRGQETPGEDP     Y  + + GLQG      K + A
Sbjct: 136 NANRSGIDFWTPNINPFKDPRWGRGQETPGEDPFHLQSYVYNLITGLQGGLDPEYKRIVA 195

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+ AYDL+NW G  RY F+A VS QDL + Y   F+ C  +  V S MCSYN VNG 
Sbjct: 196 TCKHFAAYDLENWEGNVRYGFDALVSLQDLSEFYTRSFRTCARDANVGSFMCSYNAVNGV 255

Query: 122 PTCADPDILKNTIHGQW---RLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           P+CA+  +L++ +   W     D YI SDCD++  +Y   +YT T  E  ADA+ AG DL
Sbjct: 256 PSCANSYLLQDILRDHWGWTNEDQYITSDCDAIQNIYEPHYYTATRAETVADALNAGTDL 315

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG +   +   A   GL  E  +N AL       ++LG FD     QP+  +G  +V T
Sbjct: 316 DCGEYYPENLGAAYDQGLFTESTLNRALIRQYAALVKLGYFD-PADIQPYRQIGWANVST 374

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           P  ++LA  AA +GI LLKN   TLPLS     T+A+IGP ++ T  M GNY GVA    
Sbjct: 375 PEAEELAYTAAVEGITLLKNDG-TLPLSP-SIKTIALIGPWANATTQMQGNYYGVAPYLI 432

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           +PL        T++ +   GV         AA  AA  ADA +   G+D ++EAE +DR 
Sbjct: 433 SPLMAAEELGFTVYYSAGPGVDDPTTSSFPAAFAAAEAADAIIYAGGIDITVEAEAMDRY 492

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            L  PG Q + + +++   + P++++   GG +D S    +P + A++W GYPGQ+GG A
Sbjct: 493 TLDWPGVQPDFIDQLSLLGK-PLIVLQFGGGQIDDSALLPNPGVNALVWGGYPGQSGGKA 551

Query: 419 IADVLFGRANP 429
           I D++ G A P
Sbjct: 552 IMDIIVGNAAP 562


>gi|344303941|gb|EGW34190.1| hypothetical protein SPAPADRAFT_65353 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 788

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 181/435 (41%), Positives = 246/435 (56%), Gaps = 13/435 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR-----L 57
           N G AGL ++SPN+N FRD RWGRGQE   E PVL G YA +YV+GLQG   S      L
Sbjct: 160 NVGRAGLDFYSPNINPFRDARWGRGQEVASESPVLVGNYALNYVQGLQGGLDSNQNDDTL 219

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           +VAA CKH+  YD+++WN   R  +NA +S QDL D Y   F++CV + K A  MCSYN 
Sbjct: 220 QVAATCKHFVGYDMESWNQHSRLGYNAIISDQDLADFYLPTFQSCVRDAKAAGAMCSYNA 279

Query: 118 VNGKPTCADPDILKNTIHGQWRL-DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGL 176
           VNG P CA    L   +   +   +G I SDCD++  ++N   Y +    AAADAIKAG+
Sbjct: 280 VNGVPACASEFFLNTVLRDGFDFQNGVIHSDCDAIYNVWNPHLYAQDLGGAAADAIKAGV 339

Query: 177 DLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDV 236
           D++CG     +   A+    + E  +  ++    +  +RLG FD  P    +      DV
Sbjct: 340 DVNCGDTYQNNLGYALGNKTINENQIRTSVTRQYSNLIRLGYFD-SPQTNKYRKYDWNDV 398

Query: 237 CTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
            TP   QLA QAA +GI LLKN   TLP +  +   VAVIGP ++ T  M+G+YAG    
Sbjct: 399 STPQANQLAYQAAVEGIALLKNDG-TLPFNKQKVRKVAVIGPWANATTQMLGDYAGTPPY 457

Query: 297 YTTPLQGISRYAKTIHQAGCFGVACNGNQLIG--AAEVAARQADATVLVMGLDQSIEAEF 354
             +PLQG       +  A   G   N     G  AA  AA+ ADA V   G+D S+E E 
Sbjct: 458 MISPLQGAQSEGFQVEYA--LGTQINTTDTSGYTAALNAAKGADAIVYFGGIDNSVENEA 515

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
           +DR  L  PG Q +LVS+++   + P+V++   GG +D +  KN+  + AI++ GYPGQ+
Sbjct: 516 LDRESLAWPGNQLDLVSKLSGLKK-PLVVLQFGGGQIDDTEIKNNKNVNAIVYAGYPGQS 574

Query: 415 GGAAIADVLFGRANP 429
           GG AI D+L G+  P
Sbjct: 575 GGTAIWDILSGKYAP 589


>gi|296081550|emb|CBI20073.3| unnamed protein product [Vitis vinifera]
          Length = 309

 Score =  308 bits (789), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 151/239 (63%), Positives = 177/239 (74%), Gaps = 18/239 (7%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ-GNTGS--RL 57
           MYN G+AGLT+WSPNVNIF+DPRWGRGQETPGEDP+L+ KYA+ YVRGLQ  + GS  RL
Sbjct: 86  MYNVGLAGLTFWSPNVNIFQDPRWGRGQETPGEDPLLSSKYASGYVRGLQQSDDGSPDRL 145

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           KVAACCKHYTAYDLDNW GVD +HFNA V+ QD++DT+  PFK+CV++G VASVMCSYNQ
Sbjct: 146 KVAACCKHYTAYDLDNWKGVDCFHFNAVVTNQDMDDTFQPPFKSCVIDGNVASVMCSYNQ 205

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
                       L    H Q     Y+VSDCDSV V YN+QHYT+TPEEA A AI AGLD
Sbjct: 206 F----------YLIYKYHLQ-----YLVSDCDSVDVFYNSQHYTKTPEEAVAKAILAGLD 250

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDV 236
           L+CG FL  HTE AV+GGL+ E  V+ A++      MRLG FDG PS   +G LGP+DV
Sbjct: 251 LNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGPKDV 309


>gi|330836687|ref|YP_004411328.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
 gi|329748590|gb|AEC01946.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
          Length = 709

 Score =  307 bits (787), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 177/443 (39%), Positives = 257/443 (58%), Gaps = 33/443 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPNVNIFRDPRWGRGQET GEDP LT + A +++RG+QG+ G  LK AAC KH+ 
Sbjct: 108 GLTFWSPNVNIFRDPRWGRGQETYGEDPYLTARLAVAFIRGIQGD-GKYLKAAACAKHFA 166

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+ARVS++DL +TY   FKA V E +V  VM +YN+VNG P CA
Sbjct: 167 VH-----SGPEALRHEFDARVSQKDLHETYLSAFKAAVKEAQVEIVMGAYNRVNGVPACA 221

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
             ++L + +  +W  +G++VSD +++  ++   HY        A A+KAG +L C   +A
Sbjct: 222 SHELLSDILRSEWGFEGHVVSDYEALEDIFKHHHYVADEAHTMAVALKAGCNL-CAGKIA 280

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
            H   +V  GL+ E+++  A+    T ++ +GM   +    P+ ++G  +  TP H QLA
Sbjct: 281 RHLRSSVDEGLISEDEITEAVERLFTTRIMMGMMADDC---PYDSIGYEENDTPEHHQLA 337

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           ++AA +  VLLKN    LPL   +  ++AVIGPN++    + GNY G A  Y T L+GI 
Sbjct: 338 VEAASRSFVLLKNDG-LLPLEMEKISSIAVIGPNANSRKMLEGNYNGTASRYVTVLEGIQ 396

Query: 306 R---------YAKTIHQAGCFGVACNG-NQLIGAAEVAARQADATVLVMGLDQSIEAE-- 353
                     Y++  H    F  + +G N  +  A  AA+ AD  VL +GLD ++E E  
Sbjct: 397 DLVGDSVRVWYSEGCHLYKNFHSSLSGRNDRLAEAVSAAQHADVVVLCLGLDATLEGEEG 456

Query: 354 -------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
                    D+  L LPGRQQ L+  +    + PV+L+L  G  + +   +ND  + AIL
Sbjct: 457 DVEVGFGSGDKPNLSLPGRQQLLLDTMLTVGK-PVILLLASGSALTLGGRENDENLKAIL 515

Query: 407 WVGYPGQAGGAAIADVLFGRANP 429
            + YPG  GG A+ADVLFGR  P
Sbjct: 516 QIWYPGAMGGKAVADVLFGRRAP 538


>gi|242813865|ref|XP_002486253.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
 gi|218714592|gb|EED14015.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
          Length = 893

 Score =  305 bits (781), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 178/432 (41%), Positives = 251/432 (58%), Gaps = 9/432 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ-GNTGSRLKVAA 61
           N G AGL +W+PN+N FRDPRWGRGQETPGEDP    +Y  + V GLQ G   +  +V A
Sbjct: 267 NYGNAGLDFWTPNINPFRDPRWGRGQETPGEDPYHIARYVYNLVDGLQNGIAPANPRVVA 326

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+  YD+++W G  RY FNA +S QDL + Y  PFK+C  + +V ++MCSYN VNG 
Sbjct: 327 TCKHFAGYDIEDWEGNSRYGFNAIISTQDLSEYYLPPFKSCARDAQVDAIMCSYNAVNGI 386

Query: 122 PTCADPDILKNTIHGQWRLDG---YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           PTCAD  +L   +   W  +    ++ SDCD+V  +Y+   YT +   AAADA+ AG +L
Sbjct: 387 PTCADSYLLDTILRDHWNWNQTGHWVTSDCDAVDNIYSDHRYTSSLAAAAADALNAGTNL 446

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG  ++ +   A    L +   +N AL Y  +  +RLG FD E S   + +LG  DV T
Sbjct: 447 DCGTTMSNNLAAAAAQDLFKNATLNSALVYLYSSLVRLGWFDSEDSQ--YSSLGWSDVGT 504

Query: 239 PAHQQLALQAAHQGIVLLKNS-ARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
            A QQLA +AA +GIVLLKN   + LPLS     T+A+IGP ++ T  + GNY G     
Sbjct: 505 TASQQLANRAAVEGIVLLKNDHKKVLPLSQ-HGQTIALIGPYANATTQLQGNYYGTPAYI 563

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
            T + G  +   T+      G+         AA  AA+ AD  +   G+D SIEAE +DR
Sbjct: 564 RTLVWGAEQMGYTVQYEAGTGINSTDTSGFAAAVAAAKTADIVIYAGGIDNSIEAEAMDR 623

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +   G Q +L+ ++++  + P+V++   GG +D S    +  + A+LW GYP Q GG 
Sbjct: 624 NTIAWTGNQLQLIDQLSQVGK-PLVVLQFGGGQLDDSALLQNENVNALLWCGYPSQTGGQ 682

Query: 418 AIADVLFGRANP 429
           A+ D+L G++ P
Sbjct: 683 AVFDILTGQSAP 694


>gi|322512556|gb|ADX05682.1| putative carbohydrate-active enzyme [uncultured organism]
          Length = 717

 Score =  304 bits (779), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 177/443 (39%), Positives = 253/443 (57%), Gaps = 31/443 (6%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPNVNIFRDPRWGRG ET GEDP L+G+   S++RG+QG+    +KVAAC KH+ 
Sbjct: 111 GLTFWSPNVNIFRDPRWGRGHETFGEDPFLSGRLGVSFIRGMQGDDERYMKVAACAKHFA 170

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ FNA VS+QDL +TY   F ACV E  V +VM +YN+ NG+  C 
Sbjct: 171 VH-----SGPEDQRHSFNAVVSEQDLRETYLPAFHACVTEAGVEAVMGAYNRTNGEACCG 225

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
              +L + + G+W   G++ SDC ++   +     T+  EE  A A+ +G DL+CG  L 
Sbjct: 226 SKKLLVDILRGEWGFRGHVTSDCWALKDFHEFHMVTKNQEETVALAMNSGCDLNCGN-LY 284

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           +H   AVR GL+ E  ++ A+    T +M+LG+FD      P+  +G   V T A+++L 
Sbjct: 285 VHLLQAVRDGLVEESVIDRAVTRLFTTRMKLGLFDRSEEV-PYNGIGYDRVDTEANRKLN 343

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
            +A+ + + LLKN+   LPL   +  T+ V+GPN+D    ++GNY G A  Y T L GI 
Sbjct: 344 REASRRTVCLLKNADGLLPLDISKLRTIGVVGPNADNRKALVGNYEGTASEYVTVLDGIR 403

Query: 306 RYA----KTIHQAGC--FGVACNG----NQLIGAAEVAARQADATVLVMGLDQSIEA--- 352
             A    + ++  GC  F     G    N  I  A   A  +D  + VMGLD  +E    
Sbjct: 404 ELAGDDVRVVYSEGCHLFRDRVQGLGQPNDRIAEARAVAELSDVVIAVMGLDPGLEGEEG 463

Query: 353 ----EFI--DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
               EF   D+  L LPG Q E++  + ++ + PVVLVL+ G  + + +A+    + AIL
Sbjct: 464 DQGNEFASGDKPNLELPGLQGEVLKALVESGK-PVVLVLLGGSALAIPWAEE--HVPAIL 520

Query: 407 WVGYPGQAGGAAIADVLFGRANP 429
              YPG  GG A+ADVLFGRA P
Sbjct: 521 DAWYPGAQGGRAVADVLFGRACP 543


>gi|443893988|dbj|GAC71176.1| hypothetical protein PANT_1d00031 [Pseudozyma antarctica T-34]
          Length = 759

 Score =  304 bits (779), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 177/434 (40%), Positives = 245/434 (56%), Gaps = 12/434 (2%)

Query: 3   NGGMAGLTYWSP-NVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAA 61
           N G AGL  +SP N+N FRDPRWGRGQET GEDP+   +YA   V+GLQG     L++AA
Sbjct: 140 NAGKAGLNMYSPLNINCFRDPRWGRGQETTGEDPLHMSRYAVKMVQGLQGPNQDELRLAA 199

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKHY AYDL+ W+GV+RY F+A+VS+Q+L + Y   F+ACV +GK  ++M SYN VN  
Sbjct: 200 TCKHYLAYDLEKWDGVERYQFDAQVSRQELAEFYLPQFRACVRDGKAVTLMTSYNAVNNV 259

Query: 122 PTCADPDILKNTIHGQWRLD---GYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           P  A    L+     +W LD    Y+ SDCD+V  +++  HY  +  +AAAD+I AG DL
Sbjct: 260 PPSASRYYLETLARKEWGLDKKHNYVTSDCDAVANVFDGHHYADSYVQAAADSINAGTDL 319

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG   + +   A+   L   E +  A+A     Q+RLG+FD +   QP   LG   V T
Sbjct: 320 NCGATYSDNLGQALEQNLTDVETIRTAVARMYASQVRLGLFDPK-QGQPLRELGWEHVNT 378

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
            A Q LA  +A   + LLKN+  TLP+       VAVIGP S+ T  + GNYAG      
Sbjct: 379 KAAQDLAYSSAAASVTLLKNNG-TLPVDGATK--VAVIGPYSNATFALRGNYAGPGPFAI 435

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVA---ARQADATVLVMGLDQSIEAEFI 355
           T  +   R       +   G   +G      AE A   A++AD  +   G+D +IE+E +
Sbjct: 436 TMTEAAQRVFSQATISSANGTTISGTYNHTDAEAAMQLAKEADLVIFAGGIDPTIESEEL 495

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DRA +  P  Q +L+  +   ++  + +V   GG +D +  K D  IGA+LW GYPGQ+G
Sbjct: 496 DRATIAWPPNQLQLIHALGGMAKK-MAVVQFGGGQIDGASIKADGNIGALLWAGYPGQSG 554

Query: 416 GAAIADVLFGRANP 429
             A+ DV+ G   P
Sbjct: 555 ALAVMDVIAGNTAP 568


>gi|189203341|ref|XP_001938006.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187985105|gb|EDU50593.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 761

 Score =  304 bits (778), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 173/437 (39%), Positives = 248/437 (56%), Gaps = 18/437 (4%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAA 61
           N    GL +W+PN+N FRDPRWGRGQETPGED      Y  + + GLQG +T    +V A
Sbjct: 102 NANRTGLDFWTPNINPFRDPRWGRGQETPGEDAYHLSSYVQALIHGLQGESTDPYKRVVA 161

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+  YD+++WNG  RY  + ++++Q+L + Y  PF+AC V+  V + MCSYN VNG 
Sbjct: 162 TCKHFAGYDVEDWNGNLRYQNDVQITQQELVEYYLAPFQAC-VQANVGAFMCSYNAVNGA 220

Query: 122 PTCADPDILKNTIHGQW---RLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           P CADP +L+  +   W     + ++  DCD+V  +Y    ++ T   AAAD++ AG D+
Sbjct: 221 PPCADPYLLQTILREHWGWTNEEQWVTGDCDAVQNVYLPHQWSPTRAGAAADSLVAGTDV 280

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
            CG ++  H   A +  LL E  ++ AL    +  +RLG FD   + QP+  LG   V T
Sbjct: 281 TCGTYMQEHLPAAFQQKLLNESSLDQALIRQYSSLVRLGYFDASEN-QPYRQLGFDAVAT 339

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
            A Q LA +AA +GIVLLKN   TLPLS     TV + G  ++ T  ++GNYAGVA    
Sbjct: 340 NASQALARRAAAEGIVLLKNDG-TLPLSLDSSVTVGLFGDWANATSQLLGNYAGVATYLH 398

Query: 299 TPLQGISRYAKTIHQAGCF------GVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
           +PL  + +    I+ AG              + L GA       +D  + V G+D S+E 
Sbjct: 399 SPLYALEQTGVKINYAGGNPGGQGDPTTNRWSNLYGAYST----SDVLIYVGGIDNSVEE 454

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E  DR  L   G Q +++ ++A   + PV++V+  GG +D S   N+P I AI+W GYPG
Sbjct: 455 EGRDRGYLTWTGAQLDVIGQLADTGK-PVIVVVTGGGQIDSSPLVNNPNISAIMWAGYPG 513

Query: 413 QAGGAAIADVLFGRANP 429
           Q GG+AI D++ G+  P
Sbjct: 514 QDGGSAIIDIIGGKTAP 530


>gi|344302281|gb|EGW32586.1| hypothetical protein SPAPADRAFT_51129 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 788

 Score =  303 bits (777), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 179/433 (41%), Positives = 242/433 (55%), Gaps = 9/433 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR-----L 57
           N G AGL Y+SPN+N F+DPRWGRGQE   E PVL G YA +YV+GLQG   S      L
Sbjct: 160 NVGRAGLDYYSPNINPFKDPRWGRGQEVASESPVLVGNYALNYVQGLQGGIDSNPNDDTL 219

Query: 58  KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQ 117
           +VAA CKH+  YD+++W    R  +NA +S QDL D Y   F++CV + K A  MCSYN 
Sbjct: 220 QVAATCKHFAGYDMESWKQHSRLGYNAIISDQDLADYYFPTFQSCVRDAKAAGAMCSYNA 279

Query: 118 VNGKPTCADPDILKNTIHGQWRL-DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGL 176
           +NG P CA    L   I   +   +G I SDCDS+  ++N   Y +    AAAD IKAG+
Sbjct: 280 INGIPVCASEFFLGTVIREGFDFQNGVIHSDCDSLYSIWNPHLYVQDLGAAAADGIKAGV 339

Query: 177 DLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDV 236
           D++CG     +   A+    + E+ +  ++    +  +RLG FD  P    +      DV
Sbjct: 340 DVNCGDTYQNNLGYALGNKTINEDQIRASVTRQYSNLIRLGYFD-SPQTNKYRTYNWSDV 398

Query: 237 CTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
            T    QLA QAA +GI LLKN   TLP +  +   VAVIGP ++ T  M+G+YAG    
Sbjct: 399 STSQANQLAYQAAVEGITLLKNDG-TLPFNKDKVKNVAVIGPWANATTDMLGDYAGTPPY 457

Query: 297 YTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFID 356
             +PLQG       +  A    +         AA  AA+ ADA V   G+D SIE E +D
Sbjct: 458 LISPLQGAQDSGFKVQYAYGTQINTTLTTNYTAALNAAKGADAIVYFGGIDNSIENEALD 517

Query: 357 RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
           R  L  PG Q +LVS+++  ++ P+V+V    G VD +  KN+  + +I++ GYPGQ+GG
Sbjct: 518 RESLAWPGNQLDLVSKLSGLNK-PLVVVQFGAGQVDDTEIKNNNNVNSIVYAGYPGQSGG 576

Query: 417 AAIADVLFGRANP 429
            AI DVL G   P
Sbjct: 577 TAIWDVLNGIYAP 589


>gi|67523807|ref|XP_659963.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
 gi|74597492|sp|Q5BAS1.1|XYND_EMENI RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|40745314|gb|EAA64470.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
 gi|259487761|tpe|CBF86686.1| TPA: Beta-xylosidase (EC 3.2.1.37)
           [Source:UniProtKB/TrEMBL;Acc:O42810] [Aspergillus
           nidulans FGSC A4]
          Length = 803

 Score =  303 bits (777), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 167/430 (38%), Positives = 245/430 (56%), Gaps = 8/430 (1%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR-LKVAA 61
           N G+ G+  +SPN+N FR P WGRGQETPGED  LT  Y   Y+  LQG      LK+ A
Sbjct: 160 NAGLGGVDVYSPNINTFRHPVWGRGQETPGEDAFLTSVYGYEYITALQGGVDPETLKIIA 219

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
             KHY  YD+++WN   R   + ++++Q+L + Y  PF     + KV SVMCSYN VNG 
Sbjct: 220 TAKHYAGYDIESWNNHSRLGNDMQITQQELSEYYTPPFIVASRDAKVRSVMCSYNAVNGV 279

Query: 122 PTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           P+CA+   L+  +   +    DGY+  DC +V  ++N   Y      A+AD+I AG D+D
Sbjct: 280 PSCANKFFLQTLLRDTFEFSEDGYVSGDCGAVYNVWNPHGYASNEAAASADSILAGTDID 339

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG     H+E A    L+   D+   +    +  ++ G FDGE +  P+ ++   DV + 
Sbjct: 340 CGTSYQWHSEDAFEDSLVSRSDIERGVIRLYSNLVQAGYFDGEDA--PYRDITWDDVLST 397

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
               +A +AA +GIVLLKN   TLPLS     +VAVIGP ++VT  + GNY G A    +
Sbjct: 398 DAWNIAYEAAVEGIVLLKND-ETLPLSK-DIKSVAVIGPWANVTEELQGNYFGPAPYLIS 455

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           PL G       +H A    +  +       A  AA+QADA +   G+D +IEAE +DR  
Sbjct: 456 PLTGFRDSGLDVHYALGTNLTSHSTSGFEEALTAAKQADAIIFAGGIDNTIEAEAMDREN 515

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           +  PG Q +L+S++++  + P+V++ M GG VD S  K++  + A++W GYPGQ+GG A+
Sbjct: 516 ITWPGNQLDLISKLSELGK-PLVVLQMGGGQVDSSSLKDNDNVNALIWGGYPGQSGGHAL 574

Query: 420 ADVLFGRANP 429
           AD++ G+  P
Sbjct: 575 ADIITGKRAP 584


>gi|156062754|ref|XP_001597299.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980]
 gi|154696829|gb|EDN96567.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 758

 Score =  303 bits (777), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 179/432 (41%), Positives = 241/432 (55%), Gaps = 10/432 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS--RLKVA 60
           N    GL +W+PN+N ++DPRWGRGQETPGEDP     Y  + V GLQG        K  
Sbjct: 135 NANRYGLNFWTPNINPYKDPRWGRGQETPGEDPFHVSSYVNALVTGLQGGLDDLPYKKGV 194

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A CKHY  YDL+N  G+ RY F+A ++ QDL D Y   F+ C  +  V S+MCSYN VNG
Sbjct: 195 ATCKHYAGYDLENGGGIQRYAFDAIINSQDLRDYYLPSFQQCARDSNVQSIMCSYNAVNG 254

Query: 121 KPTCADPDILKNTIHGQW---RLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
            PTCAD  +L++ +   W     D ++ SDCD+V  ++++ +YT TPE+AAADA+ AG D
Sbjct: 255 VPTCADDWLLQSLLREHWGWVEEDQWVTSDCDAVQNIWDSHNYTSTPEQAAADALNAGTD 314

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           LDCG F   +   A    L     ++ +L       +RLG FD   S QP+  LG  DV 
Sbjct: 315 LDCGGFWPTYLGSAYNQSLYNISTLDRSLTRRYASLVRLGYFD-PASIQPYRQLGWSDVS 373

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           TP+ +QLALQAA  GIVLLKN    LPL +     VA+IGP ++ T  M GNY G A   
Sbjct: 374 TPSAEQLALQAAEDGIVLLKNDG-ILPLPS-NITNVALIGPWANATTQMQGNYYGQAPYL 431

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
            +PL         +       +         AA  AA++AD  + + G+D SIEAE  DR
Sbjct: 432 HSPLIAAQNAGFHVTYVQGADIDSTNTTEFTAAIAAAKKADVIIYIGGIDNSIEAEAKDR 491

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +  P  Q  LV+++A  S  P+++  M G  +D S    +  +  I+W GYPGQ GG 
Sbjct: 492 KTIAWPSSQISLVNQLANLSI-PLIISQM-GTMIDSSSLLTNRGVNGIIWAGYPGQDGGT 549

Query: 418 AIADVLFGRANP 429
           AI ++L G+  P
Sbjct: 550 AIFNILTGKTAP 561


>gi|402225863|gb|EJU05924.1| hypothetical protein DACRYDRAFT_113532 [Dacryopinax sp. DJM-731
           SS1]
          Length = 778

 Score =  303 bits (777), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 172/439 (39%), Positives = 245/439 (55%), Gaps = 15/439 (3%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ--GNTGSRLKVA 60
           N   +GL +W+PN+N  RDPRWGR  ETPGEDP     Y A  V GLQ  G+     K+ 
Sbjct: 145 NFNHSGLDFWTPNINPVRDPRWGRSLETPGEDPFHLASYVAKLVTGLQFGGDDPKYQKLV 204

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A CKHY  YDL+NW G  RY F+A +S QDL + +  PF+ C  +  V SVMCSYN VNG
Sbjct: 205 ATCKHYAGYDLENWGGYARYGFDAVISNQDLVEYFLPPFQTCARDVNVTSVMCSYNAVNG 264

Query: 121 KPTCADPDILKNTIHGQWRLD--------GYIVSDCDSVGVLYNTQHYTRTPEEAAADAI 172
            P+CA+  +L++ +   W  +         Y+ SDCD+V  +Y   +YT TPE+A A ++
Sbjct: 265 IPSCANDYLLQSLLRTYWGWEPDSESLNAHYVTSDCDAVSNIYYPHNYTITPEQAVAVSL 324

Query: 173 KAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLG 232
           KAG DLDCG F A     +   GL  + D++ AL  +      LG FD     Q +    
Sbjct: 325 KAGTDLDCGTFYAEWLPSSYEQGLFHQTDIDRALIRSYAALFLLGYFD-PAEGQIYRQYN 383

Query: 233 PRDVCTPAHQQLALQAAHQGIVLLKNSARTLPL-STLRHHTVAVIGPNSDVTVTMIGNYA 291
             ++ T   QQLA  AA +GI LLKN    LPL ST+ +  +A+IGP ++ T  M GNY 
Sbjct: 384 WANINTDYAQQLAYTAAWEGITLLKNIDDMLPLPSTMTN--IALIGPWANATTQMQGNYQ 441

Query: 292 GVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIE 351
           G+A    +PL  + +    +       +  N      AA  AA+ AD T+ + G+D ++E
Sbjct: 442 GIAPFLHSPLYALQQRGINVTYVLGTNITSNSTAGFAAALAAAQTADLTLYIGGIDITVE 501

Query: 352 AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYP 411
           AE +DR  +  PG Q +L++++A  S   +++  M GG +D +    +P++  +LW GYP
Sbjct: 502 AEAMDRVNITWPGNQLDLIAQLANVSTH-LIVYQMGGGQIDDTVLLENPKVHGLLWGGYP 560

Query: 412 GQAGGAAIADVLFGRANPG 430
           GQ GG A+ D+L+G   P 
Sbjct: 561 GQDGGTAMIDILYGSRAPA 579


>gi|378730020|gb|EHY56479.1| beta-glucosidase, variant [Exophiala dermatitidis NIH/UT8656]
 gi|378730021|gb|EHY56480.1| beta-glucosidase [Exophiala dermatitidis NIH/UT8656]
          Length = 783

 Score =  303 bits (775), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 183/437 (41%), Positives = 256/437 (58%), Gaps = 19/437 (4%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLK-VAA 61
           N   +GL +W+PN+N ++DPRWGRGQETPGED      Y A+ + GLQG     +K V A
Sbjct: 137 NVNRSGLDFWTPNINPYKDPRWGRGQETPGEDTFHLKSYVAALIDGLQGGLNPPIKKVIA 196

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+ AYDL++W   DRY+F+A VS QDL + Y  PF+ C  + +V S+MCSYN +NG 
Sbjct: 197 TCKHFVAYDLEDWITTDRYNFDAIVSTQDLAEYYMQPFQTCARDARVGSIMCSYNAMNGV 256

Query: 122 PTCADPDILKNTI--HGQWRLDG-YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           PTCADP IL+  +  H  W  DG Y+ SDCD++  +Y   +Y  T E+A ADA+ AG DL
Sbjct: 257 PTCADPYILQTVLREHWNWTDDGQYVTSDCDAIQNIYAPHYYEPTREQAVADALTAGTDL 316

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG +   H   A   GL  +  ++  +    +  ++LG FD  PSA P+ +L   DV T
Sbjct: 317 NCGTYYQTHLPAAFSEGLFNQTVIDQTITRLYSALIKLGYFD-PPSATPYRSLNWSDVST 375

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLS--TLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
           PA + LAL+AA +GIVLLKN    LPLS  T ++ TVA+IG  ++ T TM GNY G+A  
Sbjct: 376 PAAEALALKAAEEGIVLLKNDG-LLPLSFPTDKNTTVAIIGGWANATTTMQGNYFGIAPY 434

Query: 297 YTTPLQGISRYAK-TIHQAGCFGVACNG--NQLIGAAEVAARQADATVLVMGLDQSIEAE 353
             +PL  + +         G FGV      ++L+G    AA +AD  ++  GL  S E+E
Sbjct: 435 LHSPLYALQQLPNINAVYGGGFGVPTTDGWDELLG----AAGEADLIIIADGLTTSDESE 490

Query: 354 FIDRAGLLLPGRQQELVSRVAKAS-RGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
             DR  +   G Q   +  + + S  G   + L  G  +D +   N+P I A++W GYPG
Sbjct: 491 SNDRYTI---GWQPAAIDIINQLSGMGKPTVFLQMGDQLDNTPLLNNPNISALIWGGYPG 547

Query: 413 QAGGAAIADVLFGRANP 429
            AGG A+ ++L G+A P
Sbjct: 548 MAGGDALINILTGKAAP 564


>gi|410628680|ref|ZP_11339398.1| beta-glucosidase [Glaciecola mesophila KMM 241]
 gi|410151684|dbj|GAC26167.1| beta-glucosidase [Glaciecola mesophila KMM 241]
          Length = 732

 Score =  302 bits (773), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 173/442 (39%), Positives = 256/442 (57%), Gaps = 27/442 (6%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G  AGLT+W+PNVNIFRDPRWGRGQET GEDP+LT +   ++V+GLQG+    LK A   
Sbjct: 131 GQYAGLTFWTPNVNIFRDPRWGRGQETYGEDPLLTSQMGTAFVKGLQGDDPKYLKSAGVA 190

Query: 64  KHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
           KH+  +     +G +  R+ F+   SK+DL +TY   F+A V + KVA VMC+YN V G+
Sbjct: 191 KHFAVH-----SGPESLRHQFDVEPSKKDLYETYLPAFEALVTQAKVAGVMCAYNGVYGQ 245

Query: 122 PTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG 181
           P+CA   +L   +  +W+ +GY+VSDC ++   ++    T    E+AA A++AG+DL+CG
Sbjct: 246 PSCASEFLLGEMLKKKWQFNGYVVSDCGALHDFHSGHKVTHNRVESAALALRAGVDLNCG 305

Query: 182 PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSA-QPFGNLGPRDVCTPA 240
                  + A   GL+ +  ++  L   + ++ RLG+FD  PS   P   +G   + +  
Sbjct: 306 FTYEKSLKAAFEEGLITQSLIDQRLKNLLMIRFRLGLFD--PSELNPHNAIGQEVIHSLE 363

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H +LA + A + IVLLKN  + LPLS        V GP +  +  ++GNY G++    T 
Sbjct: 364 HIELARKVAAKSIVLLKNEKQVLPLSK-DIKVPYVTGPFAASSDMLMGNYYGISDSLVTV 422

Query: 301 LQGI----SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI- 355
           L+GI    S  +   ++AG      N N L  A EV A+ ADA + V+G+   +E E + 
Sbjct: 423 LEGIAGKVSLGSSLNYRAGALPFHSNINPLNWAPEV-AKTADAVIAVVGISADMEGEEVD 481

Query: 356 --------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILW 407
                   DR  + LP  Q + V ++A+  +GP++LV+  G PVD+S  + DP   AILW
Sbjct: 482 AIASADRGDRVAITLPQNQVDYVKQLAENKKGPLILVVAAGSPVDIS--ELDPLADAILW 539

Query: 408 VGYPGQAGGAAIADVLFGRANP 429
           + YPG+ GG A+ADV+FG  NP
Sbjct: 540 IWYPGEQGGNAVADVIFGDTNP 561


>gi|373955483|ref|ZP_09615443.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373892083|gb|EHQ27980.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 738

 Score =  302 bits (773), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 181/444 (40%), Positives = 245/444 (55%), Gaps = 35/444 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYW+PN+NIFRDPRWGRGQET GEDP LTG+  +++V+GLQG+    LK A C KHY 
Sbjct: 139 GLTYWTPNINIFRDPRWGRGQETYGEDPFLTGEMGSAFVKGLQGDDPRYLKAAGCAKHYA 198

Query: 68  AY----DLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
            +    DL       R+ FN  +S  DL DTY   F+  VV+ KV  VMC+YN   G+P 
Sbjct: 199 VHSGPEDL-------RHKFNTDISDYDLWDTYLPAFRKLVVDAKVTGVMCAYNAFKGQPC 251

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLY--NTQHYTRTPEEAAADAIKAGLDLDCG 181
           C    ++ + +H +W+  GY+ SDC  +   Y  NT       E AAADA+  G D++CG
Sbjct: 252 CGSDLLMNSILHDKWKFTGYVTSDCGGIDDFYRENTHQTQPDAESAAADAVLHGTDVECG 311

Query: 182 PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAH 241
                    AV+ G L E+ ++ +L    +V+ +LGMFD    A  +  +G   +  PAH
Sbjct: 312 NVTYKSLVKAVKDGKLSEKQIDQSLKRLFSVRFKLGMFD-PADAVKYNQIGKDALEAPAH 370

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
              AL+ AHQ IVLLKN    LPLS      +AV+GPN+D  V+++GNY G      T L
Sbjct: 371 GAQALKMAHQSIVLLKNEGNLLPLSK-NLKKIAVLGPNADNAVSVLGNYNGTPSRIVTAL 429

Query: 302 QGISRY----AKTIHQAGCFGVACNGNQLIGAAEVA-ARQADATVLVMGLDQSIEAEFI- 355
           QGI        + I+      VA +  +   AA  A  + ADA + + G+   +E E + 
Sbjct: 430 QGIKNKLPAGTEVIYDKAVDYVADSAARYNYAAMAAKVKDADAIIYIGGISPELEGEEMP 489

Query: 356 ---------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSF-AKNDPRIGAI 405
                    DR+ +LLPG Q EL+ +  KA+  PVV V+M G  +   + A+N P   AI
Sbjct: 490 VSKPGFHGGDRSTILLPGVQTELL-KALKATGKPVVFVMMTGSAIATPWEAENLP---AI 545

Query: 406 LWVGYPGQAGGAAIADVLFGRANP 429
           +   Y GQA G AIADVLFG  NP
Sbjct: 546 VNAWYGGQAAGTAIADVLFGDYNP 569


>gi|291518645|emb|CBK73866.1| Beta-glucosidase-related glycosidases [Butyrivibrio fibrisolvens
           16/4]
          Length = 713

 Score =  301 bits (770), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 173/441 (39%), Positives = 246/441 (55%), Gaps = 29/441 (6%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYW+PNVNIFRDPRWGRG ET GEDP LTG+   +YVRGLQG+     K AAC KH+ 
Sbjct: 103 GLTYWAPNVNIFRDPRWGRGHETYGEDPYLTGQLGMAYVRGLQGDDLDNPKSAACAKHFA 162

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +   +    +R+HF+A+V+ QDL DTY   FK  V + KV +VM +YN+VNG+P C   
Sbjct: 163 VH---SGPEAERHHFDAKVNDQDLYDTYLYAFKRLVKDAKVEAVMGAYNRVNGEPACGSK 219

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +LK+ + G W  +G++VSDC ++   +     T    E+AA A+  G DL+CG      
Sbjct: 220 RLLKDILRGDWGFEGHVVSDCWAIRDFHENHKVTGCEVESAALAVNNGCDLNCGCVYE-K 278

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              A +  L+ EE +  ++   I +++RLG      S   + ++    V    H++LA++
Sbjct: 279 LLYAYKANLVTEETITESVERLIELRLRLGTLPERRS--KYDDIPYEVVECKEHKELAIE 336

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRY 307
           AA + +VLLKN    LPL      T+ VIGPNS+  + ++GNY G++  Y T L+GI +Y
Sbjct: 337 AAKRSMVLLKNDG-LLPLKKDEIKTIGVIGPNSNSRMALVGNYEGISSEYITVLEGIQQY 395

Query: 308 ----AKTIHQAGC------FGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE---- 353
                +  H  G         V          A   A  +D  VL MGLD +IE E    
Sbjct: 396 VGDDVRVFHSDGTPLWKDRMHVLSEARDTFAEAMAVAEHSDVVVLAMGLDSTIEGEEGDA 455

Query: 354 -----FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWV 408
                  D+ GL LPG QQEL+ ++    + PVVL+++ G  +D+S+A  +  + AI+  
Sbjct: 456 GNEFGSGDKKGLKLPGLQQELLEKITAIGK-PVVLLVLAGSAMDLSWANEN--VNAIMHC 512

Query: 409 GYPGQAGGAAIADVLFGRANP 429
            YPG  GG AIA VLFG  +P
Sbjct: 513 WYPGARGGKAIAQVLFGEDSP 533


>gi|67902828|ref|XP_681670.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
 gi|74592887|sp|Q5ATH9.1|BXLB_EMENI RecName: Full=Exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|40747867|gb|EAA67023.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
 gi|259484335|tpe|CBF80465.1| TPA: beta-1,4-xylosidase (Eurofung) [Aspergillus nidulans FGSC A4]
          Length = 763

 Score =  301 bits (770), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 181/430 (42%), Positives = 243/430 (56%), Gaps = 7/430 (1%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N   AG+ YW+PNVN F+DPRWGRGQETPGEDP+   +Y   +V GLQG+   + KV A 
Sbjct: 141 NSDHAGIDYWTPNVNPFKDPRWGRGQETPGEDPLHCSRYVKEFVGGLQGDDPEKPKVVAT 200

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKH  AYDL+ W GV R+ F+A+VS  DL + Y  PFK C V+  V + MCSYN +NG P
Sbjct: 201 CKHLAAYDLEEWGGVSRFEFDAKVSAVDLLEYYLPPFKTCAVDASVGAFMCSYNALNGVP 260

Query: 123 TCADPDILKNTIHGQWRLDG---YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
            CAD  +L+  +   W  +G   ++  DC +V  +    HY  +  EAAA A+ AG+DLD
Sbjct: 261 ACADRYLLQTVLREHWGWEGPGHWVTGDCGAVERIQTYHHYVESGPEAAAAALNAGVDLD 320

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG +L  +   A R GL+  E ++ AL    T  ++LG FD     QP  +LG  DV T 
Sbjct: 321 CGTWLPSYLGEAERQGLISNETLDAALTRLYTSLVQLGYFD-PAEGQPLRSLGWDDVATS 379

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
             ++LA   A QG VLLKN   TLPL    + T+A+IGP  + T  +  NYAG A    T
Sbjct: 380 EAEELAKTVAIQGTVLLKNIDWTLPLKA--NGTLALIGPFINFTTELQSNYAGPAKHIPT 437

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
            ++   R    +  A    V          A   A +ADA +   G+D ++E E +DR  
Sbjct: 438 MIEAAERLGYNVLTAPGTEVNSTSTDGFDDALAIAAEADALIFFGGIDNTVEEESLDRTR 497

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           +  PG Q+EL+  +A+  R P+ +V   GG VD S       +GAI+W GYP QAGGA +
Sbjct: 498 IDWPGNQEELILELAELGR-PLTVVQFGGGQVDDSALLASAGVGAIVWAGYPSQAGGAGV 556

Query: 420 ADVLFGRANP 429
            DVL G+A P
Sbjct: 557 FDVLTGKAAP 566


>gi|83774566|dbj|BAE64689.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 822

 Score =  301 bits (770), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 172/435 (39%), Positives = 249/435 (57%), Gaps = 21/435 (4%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N G +G  +W+PN+N FRDPRWGRGQETPGEDP++   Y  ++V GLQG+     +V A 
Sbjct: 183 NNGFSGFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKNKQVIAT 242

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKHY  YDL+      RY  N   ++QDL D +  PFK CV +  V S+MCSYN V+G P
Sbjct: 243 CKHYAVYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIP 298

Query: 123 TCADPDILKNTIHGQWRLDG---YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
            CA+  +L   +   W  +    Y+VSDC +V  ++   ++T T E AA+ A+ AG+DL+
Sbjct: 299 ACANEYLLSEVLRKHWNFNSDYHYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLE 358

Query: 180 CG-PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           CG  +L ++   A     ++  D +LA  Y  +    +G FDG      +  L   DV T
Sbjct: 359 CGSSYLKLNESLAANQTSVKVMDQSLARLY--SALFTVGFFDG----GKYDKLDFSDVST 412

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPL-STLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           P  Q LA +AA +G+ LLKN    LPL S  ++ +VAVIGP ++ T  M G+Y+G A   
Sbjct: 413 PDAQALAYEAAVEGMTLLKND-DLLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAPYL 471

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAE--VAARQADATVLVMGLDQSIEAEFI 355
            +PL+        ++ A   G A N     G  E   AA ++D  + + G+D S+E+E +
Sbjct: 472 ISPLEAFGDSRWKVNYA--LGTAMNNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETL 529

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  L  PG Q +L++ ++K S+ P+V+V   GG VD S    +  I A++W GYP Q+G
Sbjct: 530 DRTSLTWPGNQLDLITSLSKLSK-PLVVVQFGGGQVDDSDILKNKDIQALVWAGYPSQSG 588

Query: 416 GAAIADVLFGRANPG 430
           G A+ DVL G+ +P 
Sbjct: 589 GTALLDVLVGKRSPA 603


>gi|2920706|emb|CAA73902.1| beta-xylosidase [Emericella nidulans]
          Length = 802

 Score =  301 bits (770), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 166/430 (38%), Positives = 244/430 (56%), Gaps = 8/430 (1%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAA 61
           N G+ G+  +SPN+N FR P WGRGQETPGED  LT  Y   Y+  LQG       K+ A
Sbjct: 159 NAGLGGVDVYSPNINTFRHPVWGRGQETPGEDAFLTSVYGYEYITALQGAVDPETSKIIA 218

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
             KHY  YD+++WN   R   + ++++Q+L + Y  PF     + KV SVMCSYN VNG 
Sbjct: 219 TAKHYAGYDIESWNNHSRLGNDMQITQQELSEYYTPPFIVASRDAKVRSVMCSYNAVNGV 278

Query: 122 PTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           P+CA+   L+  +   +    DGY+  DC +V  ++N   Y      A+AD+I AG D+D
Sbjct: 279 PSCANKFFLQTLLRDTFEFSEDGYVSGDCGAVYNVWNPHGYASNEAAASADSILAGTDID 338

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG     H+E A    L+   D+   +    +  ++ G FDGE +  P+ ++   DV + 
Sbjct: 339 CGTSYQWHSEDAFEDSLVSRSDIERGVIRLYSNLVQAGYFDGEDA--PYRDITWDDVLST 396

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
               +A +AA +GIVLLKN   TLPLS     +VAVIGP ++VT  + GNY G A    +
Sbjct: 397 DAWNIAYEAAVEGIVLLKND-ETLPLSK-DIKSVAVIGPWANVTEELQGNYFGPAPYLIS 454

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           PL G       +H A    +  +       A  AA+QADA +   G+D +IEAE +DR  
Sbjct: 455 PLTGFRDSGLDVHYALGTNLTSHSTSGFEEALTAAKQADAIIFAGGIDNTIEAEAMDREN 514

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           +  PG Q +L+S++++  + P+V++ M GG VD S  K++  + A++W GYPGQ+GG A+
Sbjct: 515 ITWPGNQLDLISKLSELGK-PLVVLQMGGGQVDSSSLKDNDNVNALIWGGYPGQSGGHAL 573

Query: 420 ADVLFGRANP 429
           AD++ G+  P
Sbjct: 574 ADIITGKRAP 583


>gi|403412992|emb|CCL99692.1| predicted protein [Fibroporia radiculosa]
          Length = 760

 Score =  300 bits (769), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 171/431 (39%), Positives = 247/431 (57%), Gaps = 8/431 (1%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLK-VAA 61
           N   +GL +W+PN+N F+DPRWGRGQETPGEDP     Y  + + GLQG      K + A
Sbjct: 136 NANRSGLDFWTPNINPFKDPRWGRGQETPGEDPFHLQSYVYNLITGLQGGLDPEYKRIVA 195

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKHY  YDL+NW G  RY F+A +S QDL + Y   F+ C  +  V + MCSYN VNG 
Sbjct: 196 TCKHYAGYDLENWEGNVRYGFDALISIQDLSEFYTRSFETCARDANVGAFMCSYNAVNGV 255

Query: 122 PTCADPDILKNTIHGQWRL---DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           P+CA+  +L++ + G W     D +I SDCD++  +Y   +Y  T E   ADA+ AG DL
Sbjct: 256 PSCANSYLLQDILRGHWNWTSDDQWITSDCDAIQNIYEPHYYAPTRELTVADALNAGADL 315

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG +   +   A   GL  E  ++ AL       ++LG FD   + QP+  +G  +V T
Sbjct: 316 DCGTYYPENLGAAYDEGLFAESTLDRALIRQYASLVKLGYFDPAEN-QPYRQIGWANVST 374

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           P  ++LA +AA +GI L+KN   TLPLS     ++A+IGP ++ T  M GNY G      
Sbjct: 375 PEAEELAYRAAVEGITLIKNDG-TLPLSP-SIKSLALIGPWANATTQMQGNYYGQPPYLI 432

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           +PL        T++ +   GV         AA  AA+ ADA + + G+D ++EAE +DR 
Sbjct: 433 SPLMAAEALNYTVYYSPGPGVDDPTTSSFPAAFAAAQAADAIIYIGGIDTTVEAEAMDRY 492

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            L  PG Q + + ++++  + P+V++ M GG VD S    +  + A++W GYPGQ+GG A
Sbjct: 493 TLDWPGVQPDFIDQLSQFGK-PLVVLQMGGGQVDDSCLLPNTNVNALIWGGYPGQSGGTA 551

Query: 419 IADVLFGRANP 429
           + D++ G A P
Sbjct: 552 LMDIIVGNAAP 562


>gi|317156541|ref|XP_001825822.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
          Length = 882

 Score =  300 bits (769), Expect = 8e-79,   Method: Compositional matrix adjust.
 Identities = 172/435 (39%), Positives = 249/435 (57%), Gaps = 21/435 (4%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N G +G  +W+PN+N FRDPRWGRGQETPGEDP++   Y  ++V GLQG+     +V A 
Sbjct: 243 NNGFSGFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKNKQVIAT 302

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKHY  YDL+      RY  N   ++QDL D +  PFK CV +  V S+MCSYN V+G P
Sbjct: 303 CKHYAVYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIP 358

Query: 123 TCADPDILKNTIHGQWRLDG---YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
            CA+  +L   +   W  +    Y+VSDC +V  ++   ++T T E AA+ A+ AG+DL+
Sbjct: 359 ACANEYLLSEVLRKHWNFNSDYHYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLE 418

Query: 180 CG-PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           CG  +L ++   A     ++  D +LA  Y  +    +G FDG      +  L   DV T
Sbjct: 419 CGSSYLKLNESLAANQTSVKVMDQSLARLY--SALFTVGFFDG----GKYDKLDFSDVST 472

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPL-STLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           P  Q LA +AA +G+ LLKN    LPL S  ++ +VAVIGP ++ T  M G+Y+G A   
Sbjct: 473 PDAQALAYEAAVEGMTLLKND-DLLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAPYL 531

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAE--VAARQADATVLVMGLDQSIEAEFI 355
            +PL+        ++ A   G A N     G  E   AA ++D  + + G+D S+E+E +
Sbjct: 532 ISPLEAFGDSRWKVNYA--LGTAMNNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETL 589

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  L  PG Q +L++ ++K S+ P+V+V   GG VD S    +  I A++W GYP Q+G
Sbjct: 590 DRTSLTWPGNQLDLITSLSKLSK-PLVVVQFGGGQVDDSDILKNKDIQALVWAGYPSQSG 648

Query: 416 GAAIADVLFGRANPG 430
           G A+ DVL G+ +P 
Sbjct: 649 GTALLDVLVGKRSPA 663


>gi|391865040|gb|EIT74331.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
          Length = 822

 Score =  299 bits (766), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 171/435 (39%), Positives = 248/435 (57%), Gaps = 21/435 (4%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N G +G  +W+PN+N FRDPRWGRGQETPGEDP++   Y  ++V GLQG+     +V A 
Sbjct: 183 NNGFSGFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKNKQVIAT 242

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKHY  YDL+      RY  N   ++QDL D +  PFK CV +  V S+MCSYN V+G P
Sbjct: 243 CKHYAVYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIP 298

Query: 123 TCADPDILKNTIHGQWRLDG---YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
            CA+  +L   +   W  +    Y+VSDC +V  ++   ++T T E AA+ A+ AG+DL+
Sbjct: 299 ACANEYLLDEVLRKHWNFNSDYYYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLE 358

Query: 180 CG-PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           CG  +L ++   A     ++  D +LA  Y  +    +G FDG      +  L   DV T
Sbjct: 359 CGSSYLKLNESLAANQTSVKVMDRSLARLY--SALFTVGFFDG----GKYDKLDFSDVST 412

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           P  Q LA +AA +G+ LLKN    LPL    ++ +VAVIGP ++ T  M G+Y+G A   
Sbjct: 413 PDAQALAYEAAVEGMTLLKND-DLLPLDFPHKYKSVAVIGPFANATTQMQGDYSGDAPYL 471

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAE--VAARQADATVLVMGLDQSIEAEFI 355
            +PL+        ++ A   G A N     G  E   AA ++D  + + G+D S+E+E +
Sbjct: 472 ISPLEAFGDSRWKVNYA--LGTAINNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETL 529

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  L  PG Q +L++ ++K S+ P+V+V   GG VD S    +  I A++W GYP Q+G
Sbjct: 530 DRTSLAWPGNQLDLITSLSKLSK-PLVVVQFGGGQVDDSAILKNKDIQALVWAGYPSQSG 588

Query: 416 GAAIADVLFGRANPG 430
           G A+ DVL G+ +P 
Sbjct: 589 GTALLDVLVGKRSPA 603


>gi|392962219|ref|ZP_10327666.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           DSM 17108]
 gi|392452977|gb|EIW29882.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           DSM 17108]
          Length = 724

 Score =  299 bits (766), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 171/446 (38%), Positives = 247/446 (55%), Gaps = 27/446 (6%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           +G   GLT+WSPNVNIFRDPRWGRGQET GEDP LTG+   S+++GLQG     L+ AAC
Sbjct: 106 HGIYKGLTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQDKKYLRAAAC 165

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
            KH+  +        +R+ F+A VS +DL +TY   FK CV E  V +VM +YN+VNG+P
Sbjct: 166 AKHFAVHSGPE---SERHRFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNRVNGEP 222

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP 182
            C    +LK T+  +W   G++VSDC ++   +     T +  E+ A A+  G DL+CG 
Sbjct: 223 CCGSNILLKETLRQEWGFTGHVVSDCWAIKDFHENHRVTSSAPESVALALNNGCDLNCGN 282

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
            + ++   A + GL+ EE +N A+   +  +M+LG+FD   +  P+ N+G        H+
Sbjct: 283 -MYLNLLIAYQEGLVTEEAINTAVTRLMLTRMKLGLFDAAENV-PYTNIGFHQNDCQEHR 340

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
           + AL+ + + +VLLKN    LPL      ++AVIGPN++    + GNY G A  Y T L+
Sbjct: 341 EFALEVSKKTLVLLKNENHLLPLDRNTISSIAVIGPNANSREALTGNYFGTASNYITVLE 400

Query: 303 GIS----RYAKTIHQAGCFGVACNGNQL------IGAAEVAARQADATVLVMGLDQSIEA 352
           GI     +     +  GC         L         A   A +AD  V+ MGLD SIE 
Sbjct: 401 GIREAVGKDTMVSYAQGCHLYRDKAENLGEERDRFAEAVSTAERADLVVMCMGLDASIEG 460

Query: 353 E---------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIG 403
           E           D+ GL LPG QQEL+  + K  + P++LVL+ G  + V++A    ++ 
Sbjct: 461 EEGDVSNEYASGDKLGLNLPGLQQELLEVIYKTGK-PIILVLLAGSALAVTWAAE--KVP 517

Query: 404 AILWVGYPGQAGGAAIADVLFGRANP 429
           AI+   YPG  GG A+A  +FG  +P
Sbjct: 518 AIIQAWYPGAEGGKALASAIFGEYSP 543


>gi|288870210|ref|ZP_06113312.2| beta-glucosidase [Clostridium hathewayi DSM 13479]
 gi|288868024|gb|EFD00323.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
          Length = 730

 Score =  299 bits (766), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 169/443 (38%), Positives = 250/443 (56%), Gaps = 30/443 (6%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT W+PN+NIFRDPRWGRG ET GEDP LT +    Y+RGLQG+    LK AAC KH+ 
Sbjct: 119 GLTLWAPNINIFRDPRWGRGHETYGEDPWLTSRLGIRYIRGLQGSHEKYLKTAACVKHFA 178

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+A VS++DL +TY   F+ACV +G V +VM +YN+VNG P C 
Sbjct: 179 VH-----SGPEELRHSFDAEVSEKDLRETYLPAFEACVKDGDVEAVMGAYNRVNGVPCCG 233

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
           +  +L+  +  +W   G++VSDC ++   +     T +P E+ + A+  G DL+CG    
Sbjct: 234 NEYLLETILRKEWGFHGHVVSDCWAIKDFHEGHGVTDSPVESVSMAMNHGCDLNCGNLFT 293

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
              + AV+ G ++EE ++ A+    T +++LG         P+  +   +V +PA ++L 
Sbjct: 294 YLIQ-AVKEGKVKEERLDEAVIRLFTTRLKLGALGKMEEDDPYAGISYLEVDSPAMKKLN 352

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
             AA + +VLLKN+   LP+ T R+ T+ VIGPN+D    ++GNY G A  Y T L+GI 
Sbjct: 353 RSAAGKSVVLLKNTEGLLPIDTKRYKTIGVIGPNADSRRALVGNYEGTASEYVTVLEGIR 412

Query: 306 R----YAKTIHQAGCFGVACN------GNQLIGAAEVAARQADATVLVMGLDQSIEAEF- 354
                 A+ ++  GC     N       N  +   +   R++D  +  MGLD ++E E  
Sbjct: 413 EAAEPEARVLYSEGCHLYKSNVSGLGARNDRLSEVKGICRESDIVIACMGLDSTLEGEQG 472

Query: 355 --------IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
                    D+  L+LPG QQ+++   A  S  PVVLVL+ G  + V++A  D  + AIL
Sbjct: 473 DTGNIYAGGDKPDLMLPGLQQKILE-TAYDSGKPVVLVLLAGSAMAVTWA--DEHLPAIL 529

Query: 407 WVGYPGQAGGAAIADVLFGRANP 429
              YPG  GG  +ADVLFG  NP
Sbjct: 530 TAWYPGAEGGRGVADVLFGTVNP 552


>gi|238492365|ref|XP_002377419.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
 gi|220695913|gb|EED52255.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
          Length = 775

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 171/435 (39%), Positives = 249/435 (57%), Gaps = 21/435 (4%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N G +G  +W+PN+N FRDPRWGRGQETPGEDP++   Y  ++V GLQG+     +V A 
Sbjct: 136 NNGFSGFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKNKQVIAT 195

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKHY  YDL+      RY  N   ++QDL + +  PFK CV +  V S+MCSYN V+G P
Sbjct: 196 CKHYAVYDLE----TGRYGNNYNPTQQDLSEYFLAPFKTCVRDTDVGSIMCSYNSVSGIP 251

Query: 123 TCADPDILKNTIHGQWRLDG---YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
            CA+  +L   +   W  +    Y+VSDC +V  ++   ++T T E AA+ A+ AG+DL+
Sbjct: 252 ACANEYLLDEVLRKHWNFNSDYHYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLE 311

Query: 180 CG-PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           CG  +L ++   A     ++  D +LA  Y  +    +G FDG      +  L   DV T
Sbjct: 312 CGSSYLKLNESLAANQTSVKVMDQSLARLY--SALFTVGFFDG----GKYDKLDFSDVST 365

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPL-STLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           P  Q LA +AA +G+ LLKN    LPL S  ++ +VAVIGP ++ T  M G+Y+G A   
Sbjct: 366 PDAQALAYEAAVEGMTLLKND-DLLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAPYL 424

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAE--VAARQADATVLVMGLDQSIEAEFI 355
            +PL+        ++ A   G A N     G  E   AA ++D  + + G+D S+E+E +
Sbjct: 425 ISPLEAFGDSRWKVNYA--LGTAINNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETL 482

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  L  PG Q +L++ ++K S+ P+V+V   GG VD S    +  I A++W GYP Q+G
Sbjct: 483 DRTSLAWPGNQLDLITSLSKLSK-PLVVVQFGGGQVDDSAILKNKDIQALVWAGYPSQSG 541

Query: 416 GAAIADVLFGRANPG 430
           G A+ DVL G+ +P 
Sbjct: 542 GTALLDVLVGKRSPA 556


>gi|150019484|ref|YP_001311738.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
           8052]
 gi|149905949|gb|ABR36782.1| glycoside hydrolase, family 3 domain protein [Clostridium
           beijerinckii NCIMB 8052]
          Length = 709

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 177/445 (39%), Positives = 258/445 (57%), Gaps = 39/445 (8%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           G+T+WSPNVNIFRDPRWGRG ET GEDP LT +   ++V+GLQG  G  LK AAC KH+ 
Sbjct: 110 GITFWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGE-GKYLKAAACAKHFA 168

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+A VSK+DL +TY   F+ACV EG V +VM +YN+ NG+P C 
Sbjct: 169 VH-----SGPEGLRHEFDAVVSKKDLYETYLPAFEACVKEGDVEAVMGAYNRTNGEPCCG 223

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
              +L++ + G+W   G++VSDC ++   +     T T  E+AA A+K G DL+CG  + 
Sbjct: 224 SKTLLRDILRGKWNFKGHVVSDCWAIADFHLHHRVTSTATESAALAMKNGCDLNCGN-VY 282

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT-PAHQQL 244
           +    A + GL+ EED+  A    +  ++RLGMFD E       N  P ++     H +L
Sbjct: 283 LQLLLAYKEGLVTEEDITTAAERLMATRIRLGMFDEECEY----NKIPYELNDCKEHNEL 338

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           +L+AA   +VLLKN+   LPL+     ++AVIGPN+D  + + GNY+G A  Y T L+GI
Sbjct: 339 SLKAARNSMVLLKNNG-ILPLNKNNLKSIAVIGPNADSQIMLKGNYSGTASRYITVLEGI 397

Query: 305 SRY----AKTIHQAGCF-------GVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
                   +  +  GC         +A   ++L  A  +A R +D  +L +GLD +IE E
Sbjct: 398 HEAVGEDVRVYYSEGCHLFRDRVEELAEPNDRLKEAISIAER-SDVAILCLGLDSTIEGE 456

Query: 354 ---------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGA 404
                      D+A L LPGRQQEL+ ++ +    PV+LV+  G    ++F   + +  A
Sbjct: 457 QGDAGNSEGAGDKASLNLPGRQQELLEKIIETG-TPVILVIGAGSA--LTFNNAEDKCSA 513

Query: 405 ILWVGYPGQAGGAAIADVLFGRANP 429
           IL   YPG  GG A+AD++FG+ +P
Sbjct: 514 ILDAWYPGSRGGRAVADLIFGKCSP 538


>gi|242786966|ref|XP_002480909.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
 gi|218721056|gb|EED20475.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
          Length = 757

 Score =  298 bits (763), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 162/429 (37%), Positives = 247/429 (57%), Gaps = 16/429 (3%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N G +G  +W+PN+N FRDPRWGRGQETPGED  +   Y  +++ GLQG+     +V A 
Sbjct: 119 NNGFSGFDFWAPNINPFRDPRWGRGQETPGEDSFVVQSYIRNFIPGLQGDDPEDKQVIAT 178

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKHY AYDL+      RY  +   ++QDL D +  PFK CV +  V S+MC+YN V+G P
Sbjct: 179 CKHYAAYDLE----TGRYGNDYNPTQQDLADYFLAPFKTCVRDTGVGSIMCAYNAVDGIP 234

Query: 123 TCADPDILKNTIHGQWRLDG---YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           TCA   +L   +   W       Y+VSDC +V  ++   ++T T E AA+ ++ AG+DL+
Sbjct: 235 TCASEYLLDQVLRKHWNFTADYNYVVSDCGAVTDIWQYHNFTDTEEAAASVSLNAGVDLE 294

Query: 180 CG-PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           CG  +L ++   A     ++  D  L   Y  +    +G FDG      +  LG  DV T
Sbjct: 295 CGSSYLKLNESLAANQTTVQALDQALTRLY--SALFTVGFFDG----GKYTALGFADVST 348

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPL-STLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           P  Q LA +AA +G+ LLKN  R LP+ S+ ++ +VA+IGP ++ T  M G+Y+G+    
Sbjct: 349 PEAQSLAYEAAVEGMTLLKNDKRLLPIRSSHKYKSVALIGPFANATTQMQGDYSGIPPFL 408

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
            +PL+    +   ++ A   G+         +A  AA ++D  + + G+D SIEAE +DR
Sbjct: 409 ISPLEAFKGHDWEVNYAMGTGINNQTTTGFASALAAAEKSDLVIYLGGIDNSIEAETLDR 468

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             L  PG Q +LV++++K  + P+++V   GG +D S    +  + A++W GYP Q+GG+
Sbjct: 469 TSLTWPGNQLDLVTQLSKLHK-PLIVVQFGGGQLDDSALLQNEGVQALVWAGYPSQSGGS 527

Query: 418 AIADVLFGR 426
           A+ DVL G+
Sbjct: 528 ALLDVLLGK 536


>gi|421077748|ref|ZP_15538711.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           JBW45]
 gi|392524151|gb|EIW47314.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           JBW45]
          Length = 750

 Score =  298 bits (762), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 173/449 (38%), Positives = 250/449 (55%), Gaps = 33/449 (7%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           +G   GLT+WSPNVNIFRDPRWGRGQET GEDP LTG+   S+++GLQG     L+ AAC
Sbjct: 132 HGIYKGLTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQDKKYLRAAAC 191

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
            KH+  +        +R+ F+A VS +DL +TY   FK CV E  V +VM +YN+VNG+P
Sbjct: 192 AKHFAVHSGPE---SERHSFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNRVNGEP 248

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP 182
            C    +LK T+  +W   G++VSDC ++   +     T +  E+ A A+  G DL+CG 
Sbjct: 249 CCGSNMLLKETLRQEWGFTGHVVSDCWAIKDFHENHRVTSSAPESVALALNNGCDLNCGN 308

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
            + ++   A + GL+ EE +N A+   +  +M+LG+FD   +  P+ N+G        H+
Sbjct: 309 -MYLNLLIAYQEGLVTEEAINTAVTRLMLTRMKLGLFDTAENV-PYTNIGFHQNDCQEHR 366

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
           + AL+ + + +VLLKN    LPL      ++AVIGPN++    + GNY G A  Y T L+
Sbjct: 367 EFALEVSKKTLVLLKNENNLLPLDRNTISSIAVIGPNANSREALTGNYCGTASNYITVLE 426

Query: 303 GISR---------YAKTIH----QAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQS 349
           GI           YA+  H    +A   G A         A   A +AD  V+ MGLD S
Sbjct: 427 GIREAVGKDTIVSYAQGCHLYRDKAENLGEA---RDRFAEAVSTAERADIVVMCMGLDAS 483

Query: 350 IEAE---------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDP 400
           IE E           D+ GL LPG QQEL+  + +  + P++LVL+ G  + V++A    
Sbjct: 484 IEGEEGDVSNEYASGDKLGLNLPGLQQELLEVIYQTGK-PIILVLLAGSALAVTWAAE-- 540

Query: 401 RIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++ AI+   YPG  GG A+A  +FG  +P
Sbjct: 541 KVPAIIQAWYPGAEGGKALASAIFGEYSP 569


>gi|150019782|ref|YP_001312036.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
           8052]
 gi|149906247|gb|ABR37080.1| glycoside hydrolase, family 3 domain protein [Clostridium
           beijerinckii NCIMB 8052]
          Length = 709

 Score =  298 bits (762), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 180/449 (40%), Positives = 260/449 (57%), Gaps = 39/449 (8%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G   GLTYWSPN+NIFRDPRWGRG ET GEDP LT +   ++++GLQG  G  LK+AAC 
Sbjct: 106 GIYKGLTYWSPNINIFRDPRWGRGHETYGEDPYLTSRLGVAFIKGLQGE-GKYLKLAACA 164

Query: 64  KHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
           KH+  +     +G +  R+ FNA V+K+DL +TY   F+ACV E  V SVM +YN+ NG+
Sbjct: 165 KHFAVH-----SGPEGLRHEFNAVVNKKDLYETYLPAFEACVKEANVESVMGAYNRTNGE 219

Query: 122 PTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG 181
           P C    +LK+ + G+W   G++VSDC ++   +     T T  E+ A AI+ G DL+CG
Sbjct: 220 PCCGSKTLLKDILRGKWGFKGHVVSDCWALADFHLHHMVTSTATESVALAIENGCDLNCG 279

Query: 182 PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC-TPA 240
             + ++   A + GL+ EE +  A    +T + +LGMFD E       N  P +V  +  
Sbjct: 280 N-MYLNLLLAYKEGLVTEEQITTAAERLMTTRFKLGMFDEECEY----NKIPYEVNDSRE 334

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H ++AL A+ + +VLLKN+  TLPL      ++AVIGPN++  + + GNY+G A  YTT 
Sbjct: 335 HNEVALIASRKSMVLLKNNG-TLPLDKSNLKSIAVIGPNANSEIMLKGNYSGTASKYTTI 393

Query: 301 LQGIS----RYAKTIHQAGCF-------GVACNGNQLIGAAEVAARQADATVLVMGLDQS 349
           L+GI        +  +  GC         +A   ++L  A  VA R +D  VL +GLD +
Sbjct: 394 LEGIHDAVGNDVRVYYSEGCHLFKDKVEDLARPDDRLSEAISVAER-SDVVVLCLGLDST 452

Query: 350 IEAE---------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDP 400
           IE E           D+  L LPGRQQ L+ +V +  + PV++VL  G  + ++ A  + 
Sbjct: 453 IEGEQGDAGNSYGAGDKENLNLPGRQQNLLEKVLEVGK-PVIVVLGAGSALTLNGA--EE 509

Query: 401 RIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +  AIL   YPG  GG A+AD+LFG+ +P
Sbjct: 510 KCAAILNAWYPGSHGGTAVADILFGKCSP 538


>gi|413919686|gb|AFW59618.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 475

 Score =  297 bits (761), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 156/257 (60%), Positives = 188/257 (73%), Gaps = 2/257 (0%)

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AGLDL+CG FLA HT  AV+ G L E DV+ A+   +   MRLG FDG+P   PFGNLGP
Sbjct: 31  AGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNLGP 90

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
            DVCTP++Q+LA +AA QGIVLLKN+ + LPLS     ++AVIGPN++ + TMIGNY G 
Sbjct: 91  SDVCTPSNQELAREAARQGIVLLKNTGK-LPLSATSIKSMAVIGPNANASFTMIGNYEGT 149

Query: 294 ACGYTTPLQGISRYAKTIHQAGCFGVACNGNQL-IGAAEVAARQADATVLVMGLDQSIEA 352
            C YTTPLQG+     T++Q GC  V C+GN L + AA  AA  AD TVLV+G DQSIE 
Sbjct: 150 PCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSIER 209

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E +DR  LLLPG+Q +LVS VA AS GP +LV+M GGP D+SFAK+  +I AILWVGYPG
Sbjct: 210 ESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVGYPG 269

Query: 413 QAGGAAIADVLFGRANP 429
           +AGGAAIADVLFG  NP
Sbjct: 270 EAGGAAIADVLFGYHNP 286


>gi|240146254|ref|ZP_04744855.1| beta-glucosidase [Roseburia intestinalis L1-82]
 gi|257201613|gb|EEU99897.1| beta-glucosidase [Roseburia intestinalis L1-82]
 gi|291539969|emb|CBL13080.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
           XB6B4]
          Length = 710

 Score =  296 bits (759), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 170/443 (38%), Positives = 245/443 (55%), Gaps = 31/443 (6%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+W+PNVNIFRDPRWGRG ET GEDP LT +    Y+ GLQG+  + LK AAC KH+ 
Sbjct: 111 GLTFWAPNVNIFRDPRWGRGHETFGEDPYLTSRLGVRYIEGLQGHDENYLKAAACAKHFA 170

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+A V++QDL +TY   F+ACV EGKV +VM +YN+ NG P C 
Sbjct: 171 VH-----SGPEAVRHEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMGAYNRTNGVPCCG 225

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
           +  +L + +  +W   G++ SDC ++   +   H T T  E+ A A+  G DL+CG    
Sbjct: 226 NKRLLIDILRKEWGFSGHVTSDCWAIRDFHEGHHVTGTAIESVAMAMNNGCDLNCGTLFG 285

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
              + AVR GL++EE ++ A+      +M+LG+FD +    P+  +      +   ++L 
Sbjct: 286 FLVQ-AVRQGLVKEERLDEAVTNLFMARMKLGVFD-KKEENPYDKIPYLAADSREMKKLN 343

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
              A + +VLLKN    LPL   +  TV VIGPN+D    ++GNY G A  Y T L+GI 
Sbjct: 344 EAVARRTVVLLKNKEHILPLDKNKIKTVGVIGPNADSRRALVGNYEGTASRYITVLEGIE 403

Query: 306 RY----AKTIHQAGCF------GVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE-- 353
            Y     + ++  GC             N  +       +++D  V V+GLD  IE E  
Sbjct: 404 DYVGDDVRVLYSEGCHLYKDRTSNLAQENDRMSEVLGVCKESDVVVAVLGLDAGIEGEEG 463

Query: 354 -------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
                    D+  L LPG Q+E++       + PV+LVL+ G  + V++A  D  + AI+
Sbjct: 464 DAGNEYGSGDKPDLNLPGLQEEILEAAVSCGK-PVILVLLSGSALAVNWA--DEHVDAIV 520

Query: 407 WVGYPGQAGGAAIADVLFGRANP 429
              YPG  GGAAIAD+LFG ANP
Sbjct: 521 QGWYPGARGGAAIADILFGEANP 543


>gi|291537442|emb|CBL10554.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
           M50/1]
          Length = 710

 Score =  296 bits (758), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 169/443 (38%), Positives = 245/443 (55%), Gaps = 31/443 (6%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+W+PNVNIFRDPRWGRG ET GEDP LT +    Y+ GLQG+  + LK AAC KH+ 
Sbjct: 111 GLTFWAPNVNIFRDPRWGRGHETFGEDPYLTSRLGVRYIEGLQGHDENYLKAAACAKHFA 170

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+A V++QDL +TY   F+ACV EGKV +VM +YN+ NG P C 
Sbjct: 171 VH-----SGPEAVRHEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMGAYNRTNGVPCCG 225

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
           +  +L + +  +W   G++ SDC ++   +   H T T  E+ A A+  G DL+CG    
Sbjct: 226 NKRLLIDILRKEWGFSGHVTSDCWAIRDFHEGHHVTGTAIESVAMAMNNGCDLNCGTLFG 285

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
              + AVR GL++EE ++ A+      +M+LG+FD +    P+  +      +   ++L 
Sbjct: 286 FLVQ-AVRQGLVKEERLDEAVTNLFMARMKLGVFD-KKEENPYDKIPYLAADSREMKKLN 343

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
              A + +VLLKN    LPL   +  T+ VIGPN+D    ++GNY G A  Y T L+GI 
Sbjct: 344 EAVARRTVVLLKNKEHILPLDKNKIKTIGVIGPNADSRRALVGNYEGTASRYITVLEGIE 403

Query: 306 RY----AKTIHQAGCF------GVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE-- 353
            Y     + ++  GC             N  +       +++D  V V+GLD  IE E  
Sbjct: 404 DYVGDDVRVLYSEGCHLYKDRTSNLAQENDRMSEVLGVCKESDVVVAVLGLDAGIEGEEG 463

Query: 354 -------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
                    D+  L LPG Q+E++       + PV+LVL+ G  + V++A  D  + AI+
Sbjct: 464 DAGNEYGSGDKPDLNLPGLQEEILEAAVSCGK-PVILVLLSGSALAVNWA--DEHVDAIV 520

Query: 407 WVGYPGQAGGAAIADVLFGRANP 429
              YPG  GGAAIAD+LFG ANP
Sbjct: 521 QGWYPGARGGAAIADILFGEANP 543


>gi|421060771|ref|ZP_15523202.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           B3]
 gi|421065248|ref|ZP_15527033.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A12]
 gi|421073214|ref|ZP_15534285.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A11]
 gi|392444242|gb|EIW21677.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A11]
 gi|392454445|gb|EIW31278.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           B3]
 gi|392459366|gb|EIW35779.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A12]
          Length = 724

 Score =  296 bits (758), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 173/449 (38%), Positives = 249/449 (55%), Gaps = 33/449 (7%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           +G   GLT+WSPNVNIFRDPRWGRGQET GEDP LTG+   S+++GLQG     L+ AAC
Sbjct: 106 HGIYKGLTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQDKKYLRAAAC 165

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
            KH+  +        +R+ F+A VS +DL +TY   FK CV E  V +VM +YN+VNG+P
Sbjct: 166 AKHFAVHSGPE---SERHSFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNRVNGEP 222

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP 182
            C    +LK T+  +W   G++VSDC ++   +     T +  E+ A A+  G DL+CG 
Sbjct: 223 CCGSNMLLKETLRREWGFTGHVVSDCWAIKDFHENHRVTSSAPESVAMALNNGCDLNCGN 282

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
            + ++   A + GL+ EE +N A+   +  +M+LG+FD   +  P+  +G        H+
Sbjct: 283 -MYLNLLIAYQEGLVTEEAINTAVTRLMLTRMKLGLFDTAENV-PYTKIGFHQNDCQEHR 340

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
           + AL+ + + +VLLKN    LPL      ++AVIGPN++    + GNY G A  Y T L+
Sbjct: 341 EFALEVSKKTLVLLKNENNLLPLDRNTISSIAVIGPNANSREALTGNYCGTASNYITVLE 400

Query: 303 GISR---------YAKTIH----QAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQS 349
           GI           YA+  H    +A   G A         A   A +AD  V+ MGLD S
Sbjct: 401 GIREAVGKDTMVSYAQGCHLYRDKAENLGEA---RDRFAEAVSTAERADIVVMCMGLDAS 457

Query: 350 IEAE---------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDP 400
           IE E           D+ GL LPG QQEL+  + +  + P++LVL+ G  + V++A    
Sbjct: 458 IEGEEGDVSNEYASGDKLGLNLPGLQQELLEVIYQTGK-PIILVLLAGSALAVTWAAE-- 514

Query: 401 RIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +I AI+   YPG  GG A+A  +FG  +P
Sbjct: 515 KIPAIIQAWYPGAEGGKALASAIFGEYSP 543


>gi|410617070|ref|ZP_11328046.1| beta-glucosidase [Glaciecola polaris LMG 21857]
 gi|410163339|dbj|GAC32184.1| beta-glucosidase [Glaciecola polaris LMG 21857]
          Length = 731

 Score =  296 bits (758), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 169/441 (38%), Positives = 248/441 (56%), Gaps = 25/441 (5%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G  AGLT+W+PNVNIFRDPRWGRGQET GEDP LT +   ++V+GLQG+    LK A   
Sbjct: 130 GQYAGLTFWTPNVNIFRDPRWGRGQETYGEDPFLTAQMGTAFVKGLQGDDPKYLKSAGVA 189

Query: 64  KHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
           KH+  +     +G +  R+HF+   S++DL +TY   F+A V + KVA VMC+YN VNG+
Sbjct: 190 KHFAVH-----SGPESLRHHFDVEPSQKDLYETYLPAFEALVTQAKVAGVMCAYNAVNGE 244

Query: 122 PTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG 181
           P CA   +L   +  QW   GYIVSDC ++         T++  E+AA A+++G++L+CG
Sbjct: 245 PACASAQLLDGILKKQWGFHGYIVSDCGALNDFQAGHKVTKSGPESAALALQSGVNLNCG 304

Query: 182 PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAH 241
                  + A+   L+  E ++  L   + ++ +LG FD      P+  + P  + +P H
Sbjct: 305 STYEHFLKAALEQNLVPLELIDQRLTQLLMIRFQLGFFD-PAGLNPYNEVTPDVIHSPEH 363

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
             L+   A + IVLLKN    LPLS        V GP +  +  +IGNY G++    + L
Sbjct: 364 INLSRDVARKSIVLLKNDNHVLPLSK-DIKVPYVTGPFAASSDMLIGNYYGISDSLVSVL 422

Query: 302 QGI----SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI-- 355
           +GI    S  +   +++G      N N L  A +V A+ ADA + V+G+   +E E +  
Sbjct: 423 EGIAGKVSLGSSLNYRSGSLPFHNNINPLNWAPQV-AKTADAVIAVVGVSADMEGEEVDA 481

Query: 356 -------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWV 408
                  DR  + LP  Q + V ++A   +GP++LV+  G PVD+S    +P   AILW+
Sbjct: 482 IASADRGDRVAITLPQNQVDYVKQLAAHKKGPLILVVAAGSPVDIS--DLEPLADAILWI 539

Query: 409 GYPGQAGGAAIADVLFGRANP 429
            YPG+ GG A+ADVLFG  NP
Sbjct: 540 WYPGEQGGNAVADVLFGDTNP 560


>gi|359409694|ref|ZP_09202159.1| Beta-glucosidase [Clostridium sp. DL-VIII]
 gi|357168578|gb|EHI96752.1| Beta-glucosidase [Clostridium sp. DL-VIII]
          Length = 723

 Score =  296 bits (758), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 181/445 (40%), Positives = 257/445 (57%), Gaps = 39/445 (8%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYWSPNVNIFRDPRWGRG ET GEDP LT +   ++V+GLQG+ G  LK++AC KH+ 
Sbjct: 123 GLTYWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGD-GKYLKLSACAKHFA 181

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ FNA VS++DL +TY   F+ACV E  V SVM +YN+ NG+P C 
Sbjct: 182 VH-----SGPESLRHEFNAVVSQKDLHETYLPAFEACVKEANVESVMGAYNRTNGEPCCG 236

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
              +LK+ + G+W   G++VSDC ++   +     T T  E+ A AI+ G DL+CG  + 
Sbjct: 237 SKALLKDILRGKWGFKGHVVSDCWALADFHMHHKVTSTATESVALAIENGCDLNCGN-MY 295

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT-PAHQQL 244
           ++   A + GL+ EE +  A    +T + +LGMFD +       N  P +V     H Q+
Sbjct: 296 LNLLLAYKEGLVTEEQITTAAERLMTTRFKLGMFDEDCEY----NQIPYEVNDCKEHNQV 351

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           +L+A+ + +VLLKN+   LPL   +   VAVIGPN++  + + GNY+G A  YTT L GI
Sbjct: 352 SLEASRKSMVLLKNNG-ILPLDKSKLKAVAVIGPNANSEIMLKGNYSGTASKYTTILDGI 410

Query: 305 SRY----AKTIHQAGCF-------GVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
                   +  +  GC         +A   ++L  A  VA R AD  +L +GLD +IE E
Sbjct: 411 HDVLDDDVRVYYSEGCHLYKEKVEDLARRDDRLAEAVSVAER-ADVVILCLGLDSTIEGE 469

Query: 354 ---------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGA 404
                      D+  L LPG QQEL+ +V +  + PVV+VL  G  + ++ A  + R  A
Sbjct: 470 QGDAGNGYGAGDKLDLNLPGIQQELLEKVLETGK-PVVVVLGTGSGLTLNGA--EERCAA 526

Query: 405 ILWVGYPGQAGGAAIADVLFGRANP 429
           IL   YPG  GG A AD+LFG+ +P
Sbjct: 527 ILNAWYPGSHGGTAAADILFGKCSP 551


>gi|410723195|ref|ZP_11362440.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
           Maddingley MBC34-26]
 gi|410603399|gb|EKQ57833.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
           Maddingley MBC34-26]
          Length = 709

 Score =  296 bits (757), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 179/449 (39%), Positives = 256/449 (57%), Gaps = 39/449 (8%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G   GLTYWSPNVNIFRDPRWGRG ET GEDP LT +   ++++GLQG  G  LK+AAC 
Sbjct: 106 GIYKGLTYWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFIKGLQGE-GKYLKLAACA 164

Query: 64  KHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
           KH+  +     +G +  R+ FNA V K+DL +TY   F+ACV E  V SVM +YN+ NG+
Sbjct: 165 KHFAVH-----SGPEGLRHEFNAVVEKKDLYETYLPAFEACVKEANVESVMGAYNRTNGE 219

Query: 122 PTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG 181
           P C    +LK+ + G+W   G++VSDC ++   +     T T  E+ A AI+ G DL+CG
Sbjct: 220 PCCGSKTLLKDILRGKWGFKGHVVSDCWALADFHLHHMITSTATESVALAIENGCDLNCG 279

Query: 182 PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT-PA 240
             + ++   A + GL+ EE +  A    +T + +LGMFD +       N  P +V     
Sbjct: 280 N-MYLNLLLAYKEGLVTEEQITTAAERLMTTRFKLGMFDEDCEY----NRIPYEVNDCKE 334

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H ++AL A+ + +VLLKN   TLPL      ++AVIGPN++  + + GNY+G A  YTT 
Sbjct: 335 HNEIALIASRKSMVLLKNDG-TLPLDKSSLKSIAVIGPNANSEIMLKGNYSGTASKYTTI 393

Query: 301 LQGISRYA----KTIHQAGCF-------GVACNGNQLIGAAEVAARQADATVLVMGLDQS 349
           L+GI        +  +  GC         +A   ++L  A  VA R +D  +L +GLD +
Sbjct: 394 LEGIHNAVGDNIRVYYSEGCHLFKDKVEDLAGPDDRLSEAISVAER-SDVVILCLGLDST 452

Query: 350 IEAE---------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDP 400
           IE E           D+  L LPGRQQ L+ +V +  + PV++VL  G    ++F   + 
Sbjct: 453 IEGEQGDAGNSYGAGDKESLNLPGRQQNLLEKVLEVGK-PVIVVLGAGSA--LTFNGAEE 509

Query: 401 RIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +  AIL   YPG  GG A+AD+LFG+ +P
Sbjct: 510 KCAAILNAWYPGSHGGTAVADILFGKCSP 538


>gi|292495634|sp|A1CND4.2|XYND_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
          Length = 792

 Score =  295 bits (755), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 166/433 (38%), Positives = 246/433 (56%), Gaps = 13/433 (3%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPV-LTGKYAASYVRGLQGNTGSR-LKVA 60
           N G  GL  +SPN+N FR P WGRGQETPGED   L+  YA  Y+ G+QG    + LK+ 
Sbjct: 162 NAGRYGLDVYSPNINSFRHPVWGRGQETPGEDAYCLSSAYAYEYITGIQGGVDPKSLKLV 221

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KHY  YD++NW+G  R   +  +++QDL + Y   F     + KV SVMCSYN VNG
Sbjct: 222 ATAKHYAGYDIENWDGHSRLGNDMNITQQDLSEYYTPQFLVAARDAKVRSVMCSYNAVNG 281

Query: 121 KPTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+CA+   L+  +   +    DGYI SDCDS   ++N   Y      AAAD+I+AG D+
Sbjct: 282 VPSCANSFFLQTLLRDTFGFVEDGYISSDCDSAYNVFNPHEYAANVSSAAADSIRAGTDI 341

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     + + AV   LL   D+   +    +  MRLG FDG  SA  + NL   DV T
Sbjct: 342 DCGTTYQYYFDEAVDQNLLSRADIERGVIRLYSNLMRLGYFDGNSSA--YRNLTWNDVVT 399

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
                ++ +   +G VLLKN   TLPLS ++R  ++A++GP  +V+  + GNY G A   
Sbjct: 400 TNSWNISYEV--EGTVLLKNDG-TLPLSESIR--SIALVGPWMNVSTQLQGNYFGPAPYL 454

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
            +PL         ++ A    ++ N       A  AA+++DA +   G+D S+EAE +DR
Sbjct: 455 ISPLDAFRDSHLDVNYAFGTNISSNSTDGFSKALSAAKKSDAIIFAGGIDNSLEAETLDR 514

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +  PG+Q EL+ ++++  + P++++ M GG VD S  K++  + +++W GYPGQ+GG 
Sbjct: 515 MNITWPGKQLELIDQLSQLGK-PLIVLQMGGGQVDSSLLKSNKNVNSLIWGGYPGQSGGQ 573

Query: 418 AIADVLFGRANPG 430
           A+ D++ G+  P 
Sbjct: 574 ALLDIITGKRAPA 586


>gi|212531051|ref|XP_002145682.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
 gi|210071046|gb|EEA25135.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
          Length = 799

 Score =  295 bits (755), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 170/433 (39%), Positives = 248/433 (57%), Gaps = 13/433 (3%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDP-VLTGKYAASYVRGLQGNTG-SRLKVA 60
           N G  GL  ++PN+N FR P WGRGQETPGED   LT  YA  Y+ GLQG      LK+A
Sbjct: 164 NVGRYGLDAYAPNINGFRSPLWGRGQETPGEDANFLTSSYAYEYITGLQGGIDPDNLKIA 223

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KH+  YDL+NW G  R  F+AR+++QDL + Y   F A     K  S MCSYN VN 
Sbjct: 224 ATAKHFAGYDLENWGGNSRLGFDARITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNA 283

Query: 121 KPTCADPDILKNTIHGQWRLD--GYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+C+   +L+  +  QW     GY+ SDCD+V  ++N   Y      AAA++++AG D+
Sbjct: 284 IPSCSSSFLLQTLLREQWDFPEYGYVSSDCDAVYNVFNPHGYASNQSSAAAESLRAGTDI 343

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG   + H   +   G +   ++  ++    +  ++LG FDG+ +   +  LG  DV T
Sbjct: 344 DCGQTYSWHLNQSFIEGSVTRGEIERSILRLYSNLVKLGYFDGDKNE--YRQLGWNDVVT 401

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
                ++ +AA +GIVLLKN    LPLS     +VA++GP ++ T  + GNY G A    
Sbjct: 402 TDAWNISYEAAVEGIVLLKNDG-VLPLSK-NVKSVALVGPWANATKQLQGNYFGTAPYLI 459

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEV--AARQADATVLVMGLDQSIEAEFID 356
           TPLQG S     ++ A   G   +GN   G A    AA+++D  V + G+D +IEAE  D
Sbjct: 460 TPLQGASDAGYKVNYA--LGTNISGNTTDGFANALSAAKKSDVIVYLGGIDNTIEAEGTD 517

Query: 357 RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
           R  +  P  Q +L+ ++++  + P+V++ M GG VD S  K++ ++ A++W GYPGQ+GG
Sbjct: 518 RMNVTWPRNQLDLIQQLSQTGK-PLVVLQMGGGQVDSSSIKSNSKVNALIWGGYPGQSGG 576

Query: 417 AAIADVLFGRANP 429
            AI D+L G+  P
Sbjct: 577 KAIFDILKGKRAP 589


>gi|6573772|gb|AAF17692.1|AC009243_19 F28K19.27 [Arabidopsis thaliana]
          Length = 696

 Score =  295 bits (754), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 144/285 (50%), Positives = 203/285 (71%), Gaps = 2/285 (0%)

Query: 147 DCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLAL 206
           DCD+V ++Y+ Q Y ++PE+A AD +KAG+D++CG +L  HT+ A++   + E D++ AL
Sbjct: 221 DCDAVSIIYDAQGYAKSPEDAVADVLKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRAL 280

Query: 207 AYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLS 266
               +V++RLG+F+G+P+  P+GN+ P +VC+PAHQ LAL AA  GIVLLKN+ + LP S
Sbjct: 281 LNLFSVRIRLGLFNGDPTKLPYGNISPNEVCSPAHQALALDAARNGIVLLKNNLKLLPFS 340

Query: 267 TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRYAKT-IHQAGCFGVACNGNQ 325
                ++AVIGPN+ V  T++GNYAG  C   TPL  +  Y K  ++  GC  VAC+ N 
Sbjct: 341 KRSVSSLAVIGPNAHVVKTLLGNYAGPPCKTVTPLDALRSYVKNAVYHQGCDSVACS-NA 399

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
            I  A   A+ AD  VL+MGLDQ+ E E  DR  L LPG+QQEL++ VA A++ PVVLVL
Sbjct: 400 AIDQAVAIAKNADHVVLIMGLDQTQEKEDFDRVDLSLPGKQQELITSVANAAKKPVVLVL 459

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +CGGPVD+SFA N+ +IG+I+W GYPG+AGG AI++++FG  NPG
Sbjct: 460 ICGGPVDISFAANNNKIGSIIWAGYPGEAGGIAISEIIFGDHNPG 504



 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 53/86 (61%), Positives = 67/86 (77%), Gaps = 8/86 (9%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT------ 53
           +YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++TG YA +YVRGLQG++      
Sbjct: 138 VYNAGQANGMTFWAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDSFDGRKT 197

Query: 54  -GSRLKVAACCKHYTAYDLDNWNGVD 78
             + L+ +ACCKH+TAYDLD W   D
Sbjct: 198 LSNHLQASACCKHFTAYDLDRWKDCD 223


>gi|60729621|pir||JC7966 xylan 1,4-beta-xylosidase (EC 3.2.1.37) - Talaromyces emersonii
 gi|21326570|gb|AAL32053.2|AF439746_1 beta-xylosidase [Rasamsonia emersonii]
          Length = 796

 Score =  294 bits (753), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 167/431 (38%), Positives = 241/431 (55%), Gaps = 9/431 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPV-LTGKYAASYVRGLQGNTG-SRLKVA 60
           N G  GL  ++PN+N FR P WGRGQETPGED   L+  YA  Y+ GLQG      +K+ 
Sbjct: 163 NAGRYGLDSYAPNINGFRSPLWGRGQETPGEDAFFLSSAYAYEYITGLQGGVDPEHVKIV 222

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KH+  YDL+NW  V R   NA +++QDL + Y   F A     K  S+MCSYN VNG
Sbjct: 223 ATAKHFAGYDLENWGNVSRLGSNAIITQQDLSEYYTPQFLASARYAKTRSLMCSYNAVNG 282

Query: 121 KPTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+C++   L+  +   +    DGY+ SDCD+V  ++N   Y      AAAD++ AG D+
Sbjct: 283 VPSCSNSFFLQTLLRESFNFVDDGYVSSDCDAVYNVFNPHGYALNQSGAAADSLLAGTDI 342

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG  +  H   +     +   D+  +L       +RLG FDG  S   + NL   DV T
Sbjct: 343 DCGQTMPWHLNESFYERYVSRGDIEKSLTRLYANLVRLGYFDGNNSV--YRNLNWNDVVT 400

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
                ++ +AA +GI LLKN   TLPLS  +  ++A+IGP ++ TV M GNY G      
Sbjct: 401 TDAWNISYEAAVEGITLLKNDG-TLPLSK-KVRSIALIGPWANATVQMQGNYYGTPPYLI 458

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           +PL+       T++ A    ++ +  Q    A  AA+++D  +   G+D +IEAE  DR 
Sbjct: 459 SPLEAAKASGFTVNYAFGTNISTDSTQWFAEAISAAKKSDVIIYAGGIDNTIEAEGQDRT 518

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            L  PG Q +L+ +++K  + P+V++ M GG VD S  K +  + A++W GYPGQ+GGAA
Sbjct: 519 DLKWPGNQLDLIEQLSKVGK-PLVVLQMGGGQVDSSSLKANKNVNALVWGGYPGQSGGAA 577

Query: 419 IADVLFGRANP 429
           + D+L G+  P
Sbjct: 578 LFDILTGKRAP 588


>gi|375150455|ref|YP_005012896.1| Beta-glucosidase [Niastella koreensis GR20-10]
 gi|361064501|gb|AEW03493.1| Beta-glucosidase [Niastella koreensis GR20-10]
          Length = 711

 Score =  294 bits (752), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 172/447 (38%), Positives = 245/447 (54%), Gaps = 46/447 (10%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT     ++V+GLQGN    LK +AC KH+ 
Sbjct: 122 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTAHMGTAFVKGLQGNDPRYLKASACAKHFA 181

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +     NG  R+ FNA V ++DL +TY   F A V  G V SVMC+YN+VN +P C+  
Sbjct: 182 VHSGPE-NG--RHTFNAIVDEKDLRETYLYAFHALVDAG-VESVMCAYNRVNDQPCCSGN 237

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L + +  +W+  G++V+DC ++  ++       +  E AA AIKAG++LDC   L   
Sbjct: 238 FLLNSILRNEWKFKGHVVTDCGALDDIFMRHKVMPSGVEVAAAAIKAGVNLDCSNVLQKD 297

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
            E AV   LL E+D++ +LA+ +  Q++LG +D +P+A PF   G   V   AH  LA  
Sbjct: 298 VEKAVEQKLLNEKDIDSSLAHLLRTQIKLGFYD-DPTANPFYKYGADSVANTAHATLARA 356

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI--- 304
            A Q +VLLKNS + LPL   ++  + V+G NS     ++GNY GV+    + ++GI   
Sbjct: 357 MAQQSMVLLKNSNQLLPLDKKKYPAIMVVGTNSASMDALLGNYHGVSNRAVSFVEGITNA 416

Query: 305 ------------SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
                       S Y  T H  G +               AA  AD TV V+GL    E 
Sbjct: 417 VDAGTRVEYDQGSDYNDTTHFGGIW---------------AAGNADITVAVIGLTPVYEG 461

Query: 353 E----FI-----DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIG 403
           E    F+     D+  + LP      +  + KA++ P++ V+  G  VD+S    +P   
Sbjct: 462 EEGDAFLAAKGGDKPDMSLPAAHIAFMKALRKANKKPIIAVITAGSAVDISAI--EPYAD 519

Query: 404 AILWVGYPGQAGGAAIADVLFGRANPG 430
           AIL   YPG+ GG A+AD+LFG+ +P 
Sbjct: 520 AILLAWYPGEQGGNALADILFGKVSPA 546


>gi|347832625|emb|CCD48322.1| glycoside hydrolase family 3 protein [Botryotinia fuckeliana]
          Length = 772

 Score =  294 bits (752), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 182/427 (42%), Positives = 244/427 (57%), Gaps = 10/427 (2%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS--RLKVAACCKH 65
           GL +W+PN+N ++DPRWGRGQETPGEDP  T  Y  + + GLQG        K  A CKH
Sbjct: 142 GLNFWTPNINPYKDPRWGRGQETPGEDPFHTSSYVNALITGLQGGLDDLPYKKGVATCKH 201

Query: 66  YTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
           +  YDL++ +G  RY F+A +  QDL D Y  PF+ C  +  V SVMCSYN +NG PTCA
Sbjct: 202 FAGYDLESSDGAIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSVMCSYNAMNGVPTCA 261

Query: 126 DPDILKNTIHGQW---RLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP 182
           D  +L+  +   W     D ++ SDCD+V  +++  +YT TPE++AADA+ AG DLDCG 
Sbjct: 262 DDWLLQTLLREHWGWTEEDQWVTSDCDAVKNIWDYHNYTLTPEQSAADALNAGTDLDCGT 321

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
           F   +   A   GL     ++ +LA      +RLG FD  PS QP+  L   +V TPA Q
Sbjct: 322 FWPTYLGSAYDQGLYDISTLDRSLARRYASLVRLGYFD-PPSVQPYRQLNWDNVSTPAAQ 380

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
           QLALQAA  GIVLLKN    LPLS+     VA+IGP ++ T  M GNY G A    +PL 
Sbjct: 381 QLALQAAEDGIVLLKNDG-ILPLSS-NITNVALIGPLANATKQMQGNYYGTAPYLRSPLI 438

Query: 303 GISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLL 362
                   +       +         AA  AA+ AD  + V G+D SIEAE IDR  +  
Sbjct: 439 AAQNAGFKVTYVQGADIDSQNTTDFSAAISAAQSADLVIYVGGIDNSIEAEEIDRTSISW 498

Query: 363 PGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADV 422
           P  Q  L++++A  S  P+++  M G  +D S   ++  + A+LW GYPGQ GG AI ++
Sbjct: 499 PSSQLSLINQLANLST-PLIISQM-GCMIDSSSLLSNTGVNALLWAGYPGQDGGTAIFNI 556

Query: 423 LFGRANP 429
           L G+  P
Sbjct: 557 LTGKTAP 563


>gi|348604625|dbj|BAK96214.1| beta-xylosidase [Acremonium cellulolyticus]
          Length = 797

 Score =  293 bits (750), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 168/432 (38%), Positives = 245/432 (56%), Gaps = 11/432 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDP-VLTGKYAASYVRGLQGNTG-SRLKVA 60
           N G  GL  ++PN+N FR P WGRGQETPGED   L+  YA  Y+ GLQG      LKV 
Sbjct: 164 NAGRYGLDAYAPNINGFRSPLWGRGQETPGEDANFLSSSYAYEYITGLQGGVDPDHLKVV 223

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KH+  YDL+NW G  R  F+A +++QDL + Y   F A     K  S MCSYN VNG
Sbjct: 224 ATAKHFAGYDLENWGGNSRLGFDASITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNG 283

Query: 121 KPTCADPDILKNTIHGQWRLD--GYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+C+   +L+  +   W     GY+ SDCD+V  ++N   Y      AAAD+++AG D+
Sbjct: 284 VPSCSSSFLLQTLLRDNWDFPEYGYVSSDCDAVYNVFNPHGYASNQSAAAADSLRAGTDI 343

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     +   +   G +   ++  ++    +  ++LG FDG+ S   +  LG  DV T
Sbjct: 344 DCGQTYPWNLNQSFIEGSVTRGEIERSIVRLYSNLVKLGYFDGDKSE--YRQLGWNDVVT 401

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRH-HTVAVIGPNSDVTVTMIGNYAGVACGY 297
                ++ +AA +GIVLLKN    LPLS  +H  ++A+IGP ++ T  + GNY G A   
Sbjct: 402 TDAWNISYEAAVEGIVLLKNDG-ILPLS--KHVKSIALIGPWANATEQLQGNYYGTAPYL 458

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
            TPLQG S     ++ A    +  N  +    A  AA+++D  V + G+D +IEAE  DR
Sbjct: 459 ITPLQGASDAGYKVNYALGTNILGNTTEGFADALSAAKKSDVIVYLGGIDNTIEAEGTDR 518

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +  PG Q +L+ ++++  + P+V++ M GG VD S  K + ++ A++W GYPGQ+GG 
Sbjct: 519 MNVTWPGNQLDLIQQLSQTGK-PLVVLQMGGGQVDSSSIKANSKVNALVWGGYPGQSGGT 577

Query: 418 AIADVLFGRANP 429
           AI D+L G+  P
Sbjct: 578 AIFDILSGKRVP 589


>gi|160881137|ref|YP_001560105.1| glycoside hydrolase family 3 [Clostridium phytofermentans ISDg]
 gi|160429803|gb|ABX43366.1| glycoside hydrolase family 3 domain protein [Clostridium
           phytofermentans ISDg]
          Length = 717

 Score =  293 bits (750), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 169/443 (38%), Positives = 246/443 (55%), Gaps = 31/443 (6%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPNVNIFRDPRWGRG ET GEDP L+G     +V G+QG+  + LK AAC KH+ 
Sbjct: 108 GLTFWSPNVNIFRDPRWGRGHETFGEDPFLSGTLGGRFVDGIQGHDETYLKAAACAKHFA 167

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ FNA VS+QDL +TY   FK  V E KV +VM +YN+ NG+P C 
Sbjct: 168 VH-----SGPEDIRHSFNAEVSEQDLRETYLPAFKKLVKEHKVEAVMGAYNRTNGEPCCG 222

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
              +L++ + G+W   G++ SDC ++   +     T    E+ A A+  G DL+CG  L 
Sbjct: 223 SKTLLEDILRGEWEFVGHVTSDCWAIKDFHEHHMVTSNAVESVALAMNRGCDLNCGN-LY 281

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           ++   AVR GL+ EE ++ AL    T +M+LG+FD E S  PF  +    V T + ++L 
Sbjct: 282 VNLLQAVRDGLVEEETIDTALIRLFTTRMKLGLFDKEESI-PFNTITYDQVDTKSSKELN 340

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           ++A+ + +VLLKN    LPL+  +  +V VIGPN++    ++GNY G A  Y T L+GI 
Sbjct: 341 IKASKKCVVLLKNEDNILPLNPKKITSVGVIGPNANNRNALVGNYEGTASEYITVLEGIK 400

Query: 306 RY----AKTIHQAGC------FGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE-- 353
           +      +     GC             N  I         +D  +  +GLD  +E E  
Sbjct: 401 QVVPEDVRVYFSEGCHLFKNKLSNLSQENDRIAEVRAVCEHSDVVIACLGLDPGLEGEEG 460

Query: 354 -----FI--DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
                F   D+  L LPG Q++++  + +  + PV+L+L+ G  + V +A  D  I AIL
Sbjct: 461 DQGNQFASGDKKTLALPGIQEDVLKTIYECGK-PVILILLSGSALAVPWA--DEHIPAIL 517

Query: 407 WVGYPGQAGGAAIADVLFGRANP 429
              YPG  GG AIA+++FG  NP
Sbjct: 518 QGWYPGAQGGRAIAELIFGDGNP 540


>gi|336435507|ref|ZP_08615222.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
           1_4_56FAA]
 gi|336000960|gb|EGN31106.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
           1_4_56FAA]
          Length = 717

 Score =  293 bits (749), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 176/445 (39%), Positives = 256/445 (57%), Gaps = 38/445 (8%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+W+PNVNIFRDPRWGRG ET GEDP L+ +   SY+RGLQG+ G  +K AAC KH+ 
Sbjct: 108 GLTFWAPNVNIFRDPRWGRGHETYGEDPYLSSRLGVSYIRGLQGD-GETMKAAACAKHFA 166

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+A VS++DL +TY   F+ACV EG V +VM +YN VNG+P C 
Sbjct: 167 VH-----SGPEALRHEFDAEVSEKDLRETYLPAFQACVQEGHVEAVMGAYNCVNGEPCCG 221

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
              +LK  +  +W  DG++VSDC ++   +     T TP ++AA A++AG DL+CG    
Sbjct: 222 SETLLKKILREEWGFDGHVVSDCWAIKDFHENHLVTGTPVQSAALAMEAGCDLNCG-VTY 280

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDG-EPSAQPFGNLGPRDVCTPAHQQL 244
           +H   A + GL+ E  +  A     T +  LGMFDG E  + P+  +  ++     H+ L
Sbjct: 281 LHLVHACQEGLVTEAQITEAAIRLFTTRFLLGMFDGSEYDSVPYTVVECKE-----HRDL 335

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           + +AA + IVLLKN+   LPL   +  T+ +IGPN+D    +IGNY G +  Y T L+G+
Sbjct: 336 SERAARESIVLLKNNG-ILPLDREKLKTIGIIGPNADSRKALIGNYHGTSSEYITVLEGV 394

Query: 305 SRY----AKTIHQAGCFGVACNGNQL------IGAAEVAARQADATVLVMGLDQSIEAE- 353
            R      + ++  GC         L      +  A + AR++D  +L +GLD+++E E 
Sbjct: 395 RRLVGDEVRILYSDGCHLYENKTENLAREQDRLSEARIVARESDVVILCLGLDETLEGEE 454

Query: 354 --------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAI 405
                     D+  L LP  Q+ L+  VA   + P VL LM G  +D+SFA+      AI
Sbjct: 455 GDTGNSYASGDKVDLRLPKSQRMLMEAVA-MEKKPTVLCLMAGSDIDLSFAEK--HFDAI 511

Query: 406 LWVGYPGQAGGAAIADVLFGRANPG 430
           + + YPG  GGAA AD+LFG+ +P 
Sbjct: 512 VDLWYPGAYGGAAAADILFGKCSPS 536


>gi|366163035|ref|ZP_09462790.1| glycoside hydrolase family 3 [Acetivibrio cellulolyticus CD2]
          Length = 705

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 169/445 (37%), Positives = 248/445 (55%), Gaps = 40/445 (8%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT WSPN+NIFRDPRWGRG ET GEDP L+GK   ++++GLQG+    +  AAC KH+ 
Sbjct: 105 GLTIWSPNINIFRDPRWGRGHETYGEDPFLSGKLGVAFIKGLQGDKDVMM-TAACVKHFA 163

Query: 68  AY----DLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
           AY    DL       R+ FNA V+K+DL +TY   F+ CV + KV +VM  YN+ NG+P 
Sbjct: 164 AYSGPEDL-------RHGFNAEVTKKDLWETYLPAFETCVKDAKVEAVMGGYNRTNGEPC 216

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPF 183
           C    +L++ +  +W  +G++VSDC ++   +     T+TPEE+ A AI AG DL+CG  
Sbjct: 217 CGSYTLLRDILREKWGFEGHVVSDCWAIKDFHTDHMVTKTPEESVALAIDAGCDLNCGN- 275

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
           + +    A++ GL+ EE +  A     T + +LG+F+G      F N+    V    H++
Sbjct: 276 MYLMLLIALQEGLITEEHITRAAVRIFTTRFKLGLFEG----SEFDNIPYEVVECSEHKE 331

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           +A++AA +  VLLKN    LP++     T+ VIGPN++  + + GNY G +  Y T L+G
Sbjct: 332 MAIEAARKSAVLLKNDG-ILPINKGAIKTIGVIGPNANSRIALKGNYHGTSSRYITLLEG 390

Query: 304 IS----RYAKTIHQAGC------FGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
           I        + ++  GC        V    N  +  A   A  +D  VL +GLD++IE E
Sbjct: 391 IQDEVGDEVRVLYSNGCELVKDRTEVLAYANDRLAEAVTVAEHSDLVVLCLGLDETIEGE 450

Query: 354 FI---------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGA 404
                      D+  L LP  Q+ L+ ++  A+  P VL LM G  +++S+A        
Sbjct: 451 QSDEGNNGGSGDKKDLDLPEVQKSLLEKIV-ATGKPTVLCLMAGSAINLSYAHE--HCNG 507

Query: 405 ILWVGYPGQAGGAAIADVLFGRANP 429
           IL   YPG  GG A+AD+LFG A+P
Sbjct: 508 ILLTWYPGARGGKAVADILFGNASP 532


>gi|171678585|ref|XP_001904242.1| hypothetical protein [Podospora anserina S mat+]
 gi|170937362|emb|CAP62020.1| unnamed protein product [Podospora anserina S mat+]
          Length = 800

 Score =  292 bits (747), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 171/438 (39%), Positives = 242/438 (55%), Gaps = 16/438 (3%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR-LKVAA 61
           N G +GL YW+PNVN F+DPRWGRG ETPGED +L  +YAA+ ++GL+G    +  +V A
Sbjct: 143 NAGFSGLDYWTPNVNPFKDPRWGRGSETPGEDVLLVKRYAAAMIKGLEGPVPEKERRVVA 202

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKHY A D ++WNG  R++FNA++S QD+ + Y +PF+ CV + +V S+MC+YN VNG 
Sbjct: 203 TCKHYAANDFEDWNGATRHNFNAKISLQDMAEYYFMPFQQCVRDSRVGSIMCAYNAVNGV 262

Query: 122 PTCADPDILKNTIHGQWRL---DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           P+CA P +L+  +   W     + YI SDC++V  +     Y  T  E  A + +AG+D 
Sbjct: 263 PSCASPYLLQTILREHWNWTEHNNYITSDCEAVLDVSLNHKYAATNAEGTAISFEAGMDT 322

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
            C    +    GA   GLL+E  V+ AL       +R G FDG+ S   + +LG  DV  
Sbjct: 323 SCEYEGSSDIPGAWSQGLLKESTVDRALLRLYEGIVRAGYFDGKQSL--YSSLGWADVNK 380

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTL----RHHTVAVIGPNSDVTVTMIGNYAGVA 294
           P+ Q+L+LQAA  G VLLKN   TLPLS L    R   VA+IG  SD    + G Y+G A
Sbjct: 381 PSAQKLSLQAAVDGTVLLKNDG-TLPLSDLLDKSRPKKVAMIGFWSDAKDKLRGGYSGTA 439

Query: 295 CGYTTPLQGISRYAKTIHQAG---CFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIE 351
               TP    S+       A          +       A  AA+ AD  +   G+D S  
Sbjct: 440 AYLHTPAYAASQLGIPFSTASGPILHSDLASNQSWTDNAMAAAKDADYILYFGGIDTSAA 499

Query: 352 AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYP 411
            E  DR  L  PG Q  L++ +   S+   ++VL  G  +D +   ++P+I AILW  +P
Sbjct: 500 GETKDRYDLDWPGAQLSLINLLTTLSK--PLIVLQMGDQLDNTPLLSNPKINAILWANWP 557

Query: 412 GQAGGAAIADVLFGRANP 429
           GQ GG A+ +++ G  +P
Sbjct: 558 GQDGGTAVMELVTGLKSP 575


>gi|291548352|emb|CBL21460.1| Beta-glucosidase-related glycosidases [Ruminococcus sp. SR1/5]
          Length = 697

 Score =  291 bits (745), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 172/443 (38%), Positives = 243/443 (54%), Gaps = 37/443 (8%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPNVNIFRDPRWGRG ET GEDP LT +   S+V+ LQGN G  +K AAC KH+ 
Sbjct: 104 GLTFWSPNVNIFRDPRWGRGHETYGEDPYLTKELGVSFVKALQGN-GDTMKAAACAKHFA 162

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+A  S +D+E+TY   F+  V E KV +VM +YN+ NG+P C 
Sbjct: 163 VH-----SGPEALRHEFDAEASAKDMEETYLPAFEGLVKEAKVEAVMGAYNRTNGEPCCG 217

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
            P  L+  + G+W+  G+ VSDC ++   +     T T  E+AA AI  G DL+CG    
Sbjct: 218 SP-TLQKKLRGEWKFQGHFVSDCWAIRDFHEHHMVTDTAVESAALAINNGCDLNCGN-TY 275

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           +H   A   GL+ EE +  A     T +  LG+FDG      + NL   +V +P H   A
Sbjct: 276 LHIMKAYEKGLVTEETITRAAVRLFTTRYLLGLFDG----SEYDNLSYMEVESPRHLDAA 331

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
            +AA +  VLLKN+   LPL   +  T+ +IGPN+D    +IGNY G A  Y T  +GI 
Sbjct: 332 EKAAEKSFVLLKNNG-ILPLDKEKLKTIGIIGPNADSRQALIGNYHGTASRYITIQEGIQ 390

Query: 306 RY----AKTIHQAGCFGVACNGNQL------IGAAEVAARQADATVLVMGLDQSIEAE-- 353
            Y     + +   GC         L      I  A+V A  +D  +L MGLD+++E E  
Sbjct: 391 DYVGDDVRILTSRGCDLFRDRTEHLAFTRDRIAEAKVVAENSDVVILCMGLDETLEGEEG 450

Query: 354 -----FI--DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
                ++  D+  + LPG Q+EL+  +A   + PVV  L+ G  +D+ +A    +  A++
Sbjct: 451 DTGNSYVSGDKEDIELPGVQRELMEAIADTGK-PVVFCLLAGSDLDLKYAAE--KFDAVM 507

Query: 407 WVGYPGQAGGAAIADVLFGRANP 429
            + YPG  GG A A VLFG  +P
Sbjct: 508 MLWYPGCQGGKAAAKVLFGEISP 530


>gi|358061481|ref|ZP_09148135.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
           WAL-18680]
 gi|356700240|gb|EHI61746.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
           WAL-18680]
          Length = 695

 Score =  291 bits (744), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 177/444 (39%), Positives = 257/444 (57%), Gaps = 38/444 (8%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPNVNIFRDPRWGRG ET GEDP LT +   ++VRGLQG+ G  LK+AAC KH+ 
Sbjct: 104 GLTFWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGTAFVRGLQGD-GEHLKIAACAKHFA 162

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F A  SK+DL +TY   F+ACV E  V SVM +YN  +G+P CA
Sbjct: 163 VH-----SGPEALRHEFWADTSKKDLWETYLPAFEACVKEAHVESVMGAYNSYHGEPCCA 217

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
           +  +++  + GQW  +G+ VSDC ++   +     T T  E+AA A+K G DL+CG    
Sbjct: 218 NTLLMEEILRGQWGFEGHFVSDCWAIRDFHMNYMVTDTAMESAALAVKKGCDLNCGNTY- 276

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           +    A   GLL +  V  A+    T +  LGM  GE +   + ++    V    H++LA
Sbjct: 277 LQVLKACEEGLLDDACVTEAVVRLFTTRYLLGM--GEETE--YDDIPYEVVECKEHRELA 332

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           ++AA + +VLLKN    LPL   + +T+AVIGPN+D    +IGNY G +  YTT L+GI 
Sbjct: 333 VEAARRSMVLLKNDG-LLPLHAEKLNTIAVIGPNADNRTALIGNYHGTSSCYTTILEGIQ 391

Query: 306 ----RYAKTIHQAGCF-------GVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE- 353
                  + ++  GC         +A  G++L   A + A+ +D  VL +GLD+++E E 
Sbjct: 392 DAVGEDVRVLYAEGCHLFKDRVEHLAVAGDRL-SEARIVAKHSDVVVLCVGLDETLEGEE 450

Query: 354 --------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAI 405
                     D+  LLLP  Q+ L+  +    + PVV+  M G  +D+S A+   + GA+
Sbjct: 451 GDTGNSHASGDKKDLLLPESQRRLMEEILNLGK-PVVVCNMSGSAIDLSLAQE--KAGAV 507

Query: 406 LWVGYPGQAGGAAIADVLFGRANP 429
           + V YPG  GG A+AD+LFG+A+P
Sbjct: 508 IQVWYPGAEGGRALADLLFGKASP 531


>gi|343428088|emb|CBQ71612.1| related to Beta-xylosidase [Sporisorium reilianum SRZ2]
          Length = 698

 Score =  291 bits (744), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 180/445 (40%), Positives = 235/445 (52%), Gaps = 24/445 (5%)

Query: 3   NGGMAGLTYWSP-NVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-----NTGSR 56
           N G AGL  +SP N+N FRDPRWGRGQET GEDP+   +YA   VRGLQG         R
Sbjct: 139 NAGRAGLNMYSPLNINAFRDPRWGRGQETVGEDPLHLSRYAVRVVRGLQGPAAQDEANPR 198

Query: 57  LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYN 116
           L +AA CKHY AYDL+   GV+RY F+A VS QDL D +   F+ACV +G   ++M SYN
Sbjct: 199 LTLAATCKHYLAYDLEASAGVERYQFDALVSNQDLADLHLPQFRACVRDGGATTLMTSYN 258

Query: 117 QVNGKPTCADPDILKNTIHGQWRLD---GYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
            VNG P  A    L+      W LD    Y+ SDCD+V  +Y+  HY      AAA ++ 
Sbjct: 259 AVNGVPPSASKYYLETLARDTWGLDKHHNYVTSDCDAVANVYDAHHYAADYVHAAAASLN 318

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AG DLDCG         A+   L     +  A+       +RLG FD    AQP   LG 
Sbjct: 319 AGTDLDCGATYRDSLAAALAQNLTDVATIRRAVTRMYGSLVRLGYFDAA-EAQPLRQLGW 377

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
           +DV  PA Q+LA +AA   I LLKN   TLPL      T+A+IGP ++ T  + GNYAG 
Sbjct: 378 KDVNAPAAQKLAYEAAAASITLLKNRQSTLPLRETAGKTIALIGPYTNATFALRGNYAGP 437

Query: 294 ACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAE--------VAARQADATVLVMG 345
           +    TP     R     H      V+ NG  + G  +          A+ AD  V   G
Sbjct: 438 SPLVITPFDAARRTFSDAHI-----VSANGTSIAGPYDTATASAALATAKSADIIVYAGG 492

Query: 346 LDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAI 405
           +D ++E E +DR  +  P  Q  L+  +A   +  +V+V   GG VD +  K D  +GA+
Sbjct: 493 IDPTVEGESLDRRDIAWPANQLRLIQELAALGKV-LVVVQFGGGQVDGALLKGDDGVGAL 551

Query: 406 LWVGYPGQAGGAAIADVLFGRANPG 430
           +W GYPGQ+G  A+ D+L G+  P 
Sbjct: 552 VWAGYPGQSGALALMDILAGKRAPA 576


>gi|336425135|ref|ZP_08605165.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336013044|gb|EGN42933.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 705

 Score =  290 bits (743), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 171/445 (38%), Positives = 248/445 (55%), Gaps = 38/445 (8%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR-LKVAACCKHY 66
           GLT+WSPNVNIFRDPRWGRG ET GEDP LT +   +YV GLQG+     +K AAC KH+
Sbjct: 105 GLTFWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAYVEGLQGSQDDDFMKTAACAKHF 164

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+A+ SK+D+ +TY   F+ACV E  V +VM +YN+ NG+P C
Sbjct: 165 AVH-----SGPESVRHEFDAQASKKDMYETYLPAFEACVKEAGVEAVMGAYNRTNGEPCC 219

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFL 184
             P +++N +  +W   G+ VSDC ++   +     T+TPEE+AA A+K+G D++CG   
Sbjct: 220 GSPTLIQNILREEWDFQGHYVSDCWAIADFHMHHMVTKTPEESAALALKSGCDVNCG-VT 278

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
            +H   A + GL+ EE++  A     T +  LG FD       + ++    V    H +L
Sbjct: 279 YLHLLKAYQQGLVTEEEITQAAERLFTTRFLLGCFD----KNEYDDIPYEVVECKEHLEL 334

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           A + A + +VLLKN    LPL+     T+ VIGPN+D    ++GNY G +  Y T L+GI
Sbjct: 335 AQKMAKESMVLLKNDG-ILPLNKDGLKTIGVIGPNADSRTPLVGNYHGTSSRYITLLEGI 393

Query: 305 SRY----AKTIHQAGCF-------GVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
             +     +  +  GC        G+    ++ I  A   A  +D  VL +GLD+++E E
Sbjct: 394 QDFVGEDVRVYYSEGCHIYKDRVEGLGWKQDR-ISEALTVAEHSDVVVLCLGLDENLEGE 452

Query: 354 ---------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGA 404
                      D+  L LP  Q+EL+  VA   + PVVL +M G  +D+ FA     + A
Sbjct: 453 EGDTGNSYASGDKKDLELPESQRELLEAVAGCGK-PVVLCMMSGSAIDMQFAAE--HVNA 509

Query: 405 ILWVGYPGQAGGAAIADVLFGRANP 429
           IL V YPG  GG A A++LFG  +P
Sbjct: 510 ILQVWYPGARGGKAAAEILFGACSP 534


>gi|451821678|ref|YP_007457879.1| periplasmic beta-glucosidase BglX [Clostridium
           saccharoperbutylacetonicum N1-4(HMT)]
 gi|451787657|gb|AGF58625.1| periplasmic beta-glucosidase BglX [Clostridium
           saccharoperbutylacetonicum N1-4(HMT)]
          Length = 710

 Score =  290 bits (741), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 172/444 (38%), Positives = 248/444 (55%), Gaps = 36/444 (8%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPNVNIFRDPRWGRG ET GEDP LT +   ++V+GLQG+    LK+AAC KH+ 
Sbjct: 110 GLTFWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGDE-KYLKIAACAKHFA 168

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ FNA VSK+DL +TY   F+ACV E  V +VM +YN+ N +P C 
Sbjct: 169 VH-----SGPEGLRHEFNAVVSKKDLYETYLPAFEACVKEADVEAVMGAYNRTNDEPCCG 223

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
              +LK+ + G+W+  G++VSDC ++   +     T T  E+AA AIK G DL+CG  + 
Sbjct: 224 SSLLLKDILRGKWQFKGHVVSDCWAIADFHLYHGVTSTATESAALAIKNGCDLNCGN-VY 282

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           +    A + GL+ EED+  A    +  ++RLGMFD E     F  +         H +++
Sbjct: 283 LQMLLAYKEGLVTEEDITRAAERLMATRIRLGMFDEECE---FNKIPYTMNDCKEHHEVS 339

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI- 304
           L A+ + IV+L+N+   LPL   +  ++ +IGPN+D  + + GNY G A  Y T L+GI 
Sbjct: 340 LMASRKSIVMLRNNG-LLPLDKSKLKSIGIIGPNADSELMLKGNYFGTASKYITVLEGIH 398

Query: 305 ----SRYAKTIHQAGCFGVACNGNQL------IGAAEVAARQADATVLVMGLDQSIEAE- 353
               S   +  +  GC         L      +  A   A  +D  +L +GLD SIE E 
Sbjct: 399 EAVDSENIRIFYSEGCHLYKDRVQDLAEPDDRMAEAVTVAEHSDVVILCLGLDSSIEGEQ 458

Query: 354 --------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAI 405
                     D+  L LPG+QQEL+ +V  A+  PV++VL  G    ++    +    AI
Sbjct: 459 GDAGNSDGAGDKLNLNLPGKQQELLEKVI-ATGKPVIVVLGAGSA--LTLQGQEENCAAI 515

Query: 406 LWVGYPGQAGGAAIADVLFGRANP 429
           L   YPG  GG AIAD++FG+ +P
Sbjct: 516 LNAWYPGSFGGRAIADLIFGKCSP 539


>gi|255284060|ref|ZP_05348615.1| beta-glucosidase [Bryantella formatexigens DSM 14469]
 gi|255265405|gb|EET58610.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Marvinbryantia formatexigens DSM 14469]
          Length = 700

 Score =  289 bits (739), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 176/444 (39%), Positives = 249/444 (56%), Gaps = 39/444 (8%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPNVNIFRDPRWGRG ET GEDP LT +   ++V+GLQG+ G  +K AAC KH+ 
Sbjct: 104 GLTFWSPNVNIFRDPRWGRGHETYGEDPCLTSRLGVAFVKGLQGD-GETMKAAACAKHFA 162

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ FNA  S +D+E+TY   F+A V E  V +VM +YN+ NG+  CA
Sbjct: 163 VH-----SGPEAVRHEFNAEASAKDMEETYLPAFEALVKEADVEAVMGAYNRTNGEACCA 217

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
            P +L+  +   W  +G+ VSDC ++   +     T T +E+AA AI +G DL+CG    
Sbjct: 218 SP-VLQKILREDWGFEGHFVSDCWAIRDFHEHHMLTATAKESAAMAINSGCDLNCGNTY- 275

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           +H   A R GL+ EE +  A     T +  LG+FDG      + ++    V +  H  LA
Sbjct: 276 LHILHAYRDGLVSEETITEAAVRLFTTRFLLGLFDG----SEYDDIPYTVVESKEHLALA 331

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
            +AA +  VLLKN+   LPL   R  TV VIGPN+D    + GNY G A  Y T  QG+ 
Sbjct: 332 EKAALESAVLLKNNG-ILPLKKERLRTVGVIGPNADSRAALAGNYHGTASRYETIQQGLQ 390

Query: 306 RY----AKTIHQAGCF-------GVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE- 353
            Y     + +   GC         +A  G++L   A++ A  +D  +L +GLD+++E E 
Sbjct: 391 DYLGEDVRVLTSVGCALSEDRTEKLALAGDRL-AEAQIVAENSDVVILCLGLDETLEGEE 449

Query: 354 --------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAI 405
                     D+  LLLP  Q++L+  VA A+  PVVL +M G  +D+S+A       AI
Sbjct: 450 GDTGNSYASGDKETLLLPEAQRDLMEAVA-ATGKPVVLCMMSGSDLDMSYAAE--HFDAI 506

Query: 406 LWVGYPGQAGGAAIADVLFGRANP 429
           L + YPG  GG+A A +LFG  +P
Sbjct: 507 LQLWYPGSQGGSAAAKLLFGEVSP 530


>gi|346225847|ref|ZP_08846989.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
 gi|346227016|ref|ZP_08848158.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
          Length = 718

 Score =  289 bits (739), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 178/447 (39%), Positives = 248/447 (55%), Gaps = 35/447 (7%)

Query: 2   YNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAA 61
           Y G   GLT+W+PNVNIFRDPRWGRGQET GEDP LT +   S+V+GLQGN    LKVAA
Sbjct: 116 YRGQYKGLTFWTPNVNIFRDPRWGRGQETYGEDPFLTSRIGVSFVKGLQGNHPKYLKVAA 175

Query: 62  CCKHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
             KHY  +     NG +  R+ F+A+VS +DL +TY   F+A V E  V  VM +YN+ N
Sbjct: 176 LAKHYAVH-----NGPEALRHEFDAKVSMKDLWETYLPAFEALVKEAGVEGVMGAYNRTN 230

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           G P CA P +++  +  +W  DGY VSDC ++   Y       TPEEAAA A+ AG +L+
Sbjct: 231 GDPCCAHPYLMQEVLREKWGFDGYYVSDCGAIMDFYTGHKIVDTPEEAAAMALNAGCNLN 290

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG   A   + ++  GL  EE+++ ++      ++RLG+F  E  A P+  +    + + 
Sbjct: 291 CGDTYASLLK-SLEKGLTTEEEIDRSVKQLFKTRLRLGLFAPE-GAVPYDTISTDVIRSK 348

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
            HQ+LAL+AA + +VLLKN A TLP++      V V GP +     ++ NY GV+   TT
Sbjct: 349 EHQKLALEAARKSVVLLKNEANTLPVAR-DVKKVYVTGPTATHVQALLANYYGVSEDMTT 407

Query: 300 PLQGI----SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE-- 353
            L+GI    S      ++ G      N N +   +  AA  AD TV  +G+ Q IE E  
Sbjct: 408 ILEGIVGKVSPQTSVQYRQGALLYEANRNTMDWFSGAAA-SADVTVACLGISQLIEGEEG 466

Query: 354 -------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI---- 402
                    DR    LP  Q + + R+ +AS   +V+V+  G  + +      P I    
Sbjct: 467 EAIASEHRGDRERTRLPQNQIDFLKRI-RASAKKLVVVITSGSAISL------PEIYDMA 519

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANP 429
            A+L+V YPG+ GG A+ADVLFG A P
Sbjct: 520 DALLYVWYPGEQGGKAVADVLFGDAVP 546


>gi|333379783|ref|ZP_08471502.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
           22836]
 gi|332884929|gb|EGK05184.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
           22836]
          Length = 737

 Score =  289 bits (739), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 167/442 (37%), Positives = 242/442 (54%), Gaps = 31/442 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYW+PN+NIFRDPRWGRGQET GEDP LTG    S+V GLQG+    LK AAC KHY 
Sbjct: 133 GLTYWTPNINIFRDPRWGRGQETYGEDPFLTGVLGKSFVAGLQGDDTKYLKAAACAKHYA 192

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ FN  V+  DL DTY   F+  VVE KVA VMC+YN  NG+P C 
Sbjct: 193 VH-----SGPENTRHTFNTFVTDYDLWDTYLPAFRNLVVEAKVAGVMCAYNAYNGEPCCG 247

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEE--AAADAIKAGLDLDCGPF 183
           +  +++  +  +W   GY+ SDC ++   Y  QH+   P+   AAADA+  G D+DCG  
Sbjct: 248 NNFLMQEILREKWNFTGYVTSDCGAIDDFY--QHHKTHPDAKYAAADAVYNGTDIDCGNE 305

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                  AV+ G++ E+ ++++L    T++ RLGMFD   + + +  +    + +  H+ 
Sbjct: 306 AYKALVDAVKTGIITEKQIDISLKRLFTIRFRLGMFDPAENVK-YSQISTSVLESQKHKD 364

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+   + IVLLKN   TLPLS  +   VAV+GPN++  V+++GNY G      TP + 
Sbjct: 365 LALKITRESIVLLKNENNTLPLSK-KLKKVAVVGPNANNEVSVLGNYNGFPTEIVTPYEA 423

Query: 304 ISRY---AKTIHQAGCFGV--ACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI--- 355
           + +    A+ I++ G   V  + N  + + A     +  D  + V G+   +E E +   
Sbjct: 424 VKQKLKGAEVIYEKGIDFVTPSTNSKEEVSALVKRLKDVDVVIFVGGISPELEGEEMPVK 483

Query: 356 -------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWV 408
                  DR  + LP  Q + +  +  A + P V V+M G  +   +   +  I AI+  
Sbjct: 484 IEGFTGGDRTSIKLPKIQTDFMKALV-AEKIPTVFVMMTGSAIATEWESQN--IPAIVNA 540

Query: 409 GYPGQAGGAAIADVLFGRANPG 430
            Y GQ  G AIADVLFG  NP 
Sbjct: 541 WYGGQDAGTAIADVLFGDYNPS 562


>gi|388857998|emb|CCF48443.1| related to Beta-xylosidase [Ustilago hordei]
          Length = 782

 Score =  288 bits (737), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 172/445 (38%), Positives = 245/445 (55%), Gaps = 19/445 (4%)

Query: 3   NGGMAGLTYWSP-NVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG------NTGS 55
           N G AGL  +SP N+N FRDPRWGRGQET GEDP+   ++A S V GLQG        G+
Sbjct: 142 NAGKAGLNLYSPLNINCFRDPRWGRGQETVGEDPLHMSRFAVSIVHGLQGPHAQNEAEGN 201

Query: 56  RLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSY 115
           +L VAA CKH+ AYDL+ ++  +RY F+A VSKQDL D +   F+ACV +G   ++M SY
Sbjct: 202 KLTVAATCKHFLAYDLEQYDRGERYQFDAIVSKQDLSDFHLPQFRACVRDGGATTLMTSY 261

Query: 116 NQVNGKPTCADPDILKNTIHGQWRLD---GYIVSDCDSVGVLYNTQHYTRTPEEAAADAI 172
           N VN  P  A    L+      W LD    Y+ SDCD+V  +Y+   Y +   EAAA +I
Sbjct: 262 NAVNNVPPSASKYYLQTLARQAWGLDKTHNYVTSDCDAVANVYDGHRYAQNYVEAAAKSI 321

Query: 173 KAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLG 232
            AG DLDCG   + +   A++  L     +  A+       +RLG FD +P++QP   L 
Sbjct: 322 NAGTDLDCGATYSENLGAALKQKLTDIATIRRAVIRMYASLVRLGYFD-DPASQPLRQLT 380

Query: 233 PRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAG 292
            +DV +P+ Q+LA  +A   I LLKN   TLP+   +   +A+IGP ++V+ +  GNYAG
Sbjct: 381 WKDVNSPSSQRLAYTSALSSITLLKNLDSTLPIKQ-KPTKIAIIGPYTNVSTSFSGNYAG 439

Query: 293 VACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAAR---QADATVLVMGLDQS 349
            A    T +   S+           G   +G  +   A+ A +    AD+ V   G+D S
Sbjct: 440 PAAFNMTMVHAASQVFPDAKIVWVNGTDISGPYIPSDAQDAVKLTSDADSVVFAGGIDAS 499

Query: 350 IEAEFIDRAGLLLPGRQQELVSRVA----KASRGPVVLVLMCGGPVDVSFAKNDPRIGAI 405
           IE E  DR  +  P  Q  L+  ++    K  +  +V+V   GG +D +  K+D  +GA+
Sbjct: 500 IERESHDRKDIAWPPNQLRLIHELSQSRKKDKKSKLVVVQFGGGQLDGASLKSDDAVGAL 559

Query: 406 LWVGYPGQAGGAAIADVLFGRANPG 430
           +W GYPGQ+   A+ D+L G+A P 
Sbjct: 560 VWAGYPGQSASLAVWDILAGKAVPA 584


>gi|326791674|ref|YP_004309495.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
 gi|326542438|gb|ADZ84297.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
          Length = 696

 Score =  288 bits (736), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 171/442 (38%), Positives = 246/442 (55%), Gaps = 35/442 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPNVNIFRDPRWGRG ET GEDP LT +   ++V+GLQG  G  LK AAC KH+ 
Sbjct: 104 GLTFWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGEEG--LKTAACAKHFA 161

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +        DR+HF+ARVS++DL +TY   F+A V E +V SVM +YN+ NG+P C  P
Sbjct: 162 VHSGPE---ADRHHFDARVSQKDLWETYLPAFEALVKEAEVESVMGAYNRTNGEPCCGSP 218

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            ++K+ +  +W   G+ VSDC ++   +     T T +E+AA A+K+G DL+CG    +H
Sbjct: 219 TLMKDILREKWGFQGHYVSDCWAIKDFHEHHMVTSTAQESAALALKSGCDLNCGN-TYLH 277

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEP-SAQPFGNLGPRDVCTPAHQQLAL 246
              A + GL+ EE++  A     T +  LG+FDG    A P+       V +  H  +A 
Sbjct: 278 ILMAYQNGLVTEEEITTAAERLFTTRYLLGLFDGSTYDAIPY-----EVVESKPHLSVAD 332

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISR 306
           +A  + IVLLKN+   LPL+     T+ VIGPN++    +IGNY G +  Y T L+G+ +
Sbjct: 333 EATAKSIVLLKNNG-LLPLNKESIKTIGVIGPNANSRKALIGNYHGTSSQYITILEGLQK 391

Query: 307 ----YAKTIHQAGCFGVACNGNQL------IGAAEVAARQADATVLVMGLDQSIEAE--- 353
                 + ++  G    A     L      +  A++ A+ +D  ++ +GLD+++E E   
Sbjct: 392 EVGDEVRILYSEGSHLYADRVEPLAYQRDRLSEAKIVAKHSDVVIVCVGLDETLEGEEGD 451

Query: 354 ------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILW 407
                   D+  L LP  QQELV  +AK  + PV+L L  G  +D+ +A  D    A+L 
Sbjct: 452 TGNAYASGDKRDLALPEPQQELVEAMAKMGK-PVILCLSAGSAIDLQYA--DAHYDAVLQ 508

Query: 408 VGYPGQAGGAAIADVLFGRANP 429
             YPG  GG  IA  L G   P
Sbjct: 509 AWYPGARGGQVIAKALLGEIVP 530


>gi|333381510|ref|ZP_08473192.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332830480|gb|EGK03108.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 738

 Score =  287 bits (735), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 174/442 (39%), Positives = 239/442 (54%), Gaps = 31/442 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYW+PNVNIFRDPRWGRGQET GEDP LTG    S+V GLQG+    LK AAC KHY 
Sbjct: 132 GLTYWTPNVNIFRDPRWGRGQETYGEDPYLTGALGKSFVAGLQGDDSQYLKAAACAKHYA 191

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ FN  V+  DL DTY   F+  VV+ KVA VMC+YN  +G+P C 
Sbjct: 192 VH-----SGPENTRHTFNTFVTTFDLWDTYLPAFRDLVVDAKVAGVMCAYNAFSGEPCCG 246

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEE--AAADAIKAGLDLDCGPF 183
           +  +++  +  +W   GY+ SDC ++   Y  +H+   P+   AAADA+ +G D+DCG  
Sbjct: 247 NNLLMQEILRDKWGFTGYVTSDCGAIDDFY--RHHKTHPDAKYAAADAVYSGTDIDCGNE 304

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                  AV+ GL+ EE ++++L     ++ RLGMFD     + F  +    + +  H+ 
Sbjct: 305 AYKALVDAVKTGLITEEQIDISLKRLFEIRFRLGMFDPAEDVK-FSKIPLSVLESQPHKD 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+   + IVLLKN    LPLS  +   VAVIGPN+D  V+++GNY G      TP + 
Sbjct: 364 LALKITRESIVLLKNENNFLPLSK-KLKKVAVIGPNADNEVSVLGNYNGFPTQIITPYKA 422

Query: 304 ISRYAK---TIHQAGCFGVACNGNQLIGAAEVAARQA--DATVLVMGLDQSIEAEFI--- 355
           I    K    I++ G   V  + N     A +A R    D  +   G+   +E E +   
Sbjct: 423 IKNKLKNTEVIYEKGIDFVKPSENSKEEIAALAKRLKGMDVVIFAGGISPELEGEEMPVK 482

Query: 356 -------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWV 408
                  DR  + LP  Q EL+ +  KA R P V V+M G  +   +   +  + AIL  
Sbjct: 483 IEGFTGGDRTSIKLPKIQTELM-QALKAERIPTVFVMMTGSAIAAEWESQN--VPAILNA 539

Query: 409 GYPGQAGGAAIADVLFGRANPG 430
            Y GQ  G AIADVLFG  NP 
Sbjct: 540 WYGGQDAGTAIADVLFGDYNPS 561


>gi|386347261|ref|YP_006045510.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
           6578]
 gi|339412228|gb|AEJ61793.1| glycoside hydrolase family 3 domain protein [Spirochaeta
           thermophila DSM 6578]
          Length = 693

 Score =  287 bits (734), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 175/439 (39%), Positives = 255/439 (58%), Gaps = 34/439 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPN+NI+RDPRWGRGQET GEDP LT K   S+V+GLQG+    ++VAAC KHY 
Sbjct: 101 GLTFWSPNINIYRDPRWGRGQETYGEDPFLTSKIGVSFVKGLQGDHPYYMRVAACAKHYA 160

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+ARVS++DL +TY   F+A V  G V +VM +YN+VNG+P C 
Sbjct: 161 VH-----SGPEGLRHVFDARVSEKDLWETYLPAFEALVKAG-VEAVMGAYNRVNGEPACG 214

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
              +L   +  +W   G++VSDC ++   +     T+ P E+ A A++AG DL+CG    
Sbjct: 215 SKRLLDEILRKRWGFKGHVVSDCWAIADFHLHHKVTKDPIESIAMALEAGCDLNCGNTYE 274

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
            H   AV+ G++ EE V+ ++A  ++   RLG+F  +    P+  L   D+   AH+ LA
Sbjct: 275 -HLLDAVKAGVVSEELVDRSVARLLSTLDRLGLFTDD---HPYARLSLSDIDWEAHRALA 330

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
            +AA + +VLLKN+   LP    +   + V GPN+   V ++GNYAGV+    T L+GI+
Sbjct: 331 REAAEKSVVLLKNNG-ILPFDRQKLRYIYVTGPNAANPVALLGNYAGVSSRLVTVLEGIT 389

Query: 306 RYA----KTIHQAGCFGVACNGNQL--IGAAEVAARQADATVLVMGLDQSIEAE------ 353
            YA       ++ GC      GN++  I  A   AR AD TV VMG D ++E E      
Sbjct: 390 GYAGPGITVTYKIGC---PLQGNKINPIDWASGVARYADVTVAVMGRDSTVEGEEGDAIF 446

Query: 354 ---FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGY 410
              + D + L LP  Q E + R+ +  + P+V+VL+ G P  V   + +    AI++  Y
Sbjct: 447 SDNYGDLSDLDLPREQIEYLRRIKEIGK-PLVVVLLSGAP--VCSPELEELADAIVYAWY 503

Query: 411 PGQAGGAAIADVLFGRANP 429
           PG+ GG AIA VLFG  +P
Sbjct: 504 PGEEGGNAIARVLFGEISP 522


>gi|449303062|gb|EMC99070.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
           10762]
          Length = 786

 Score =  287 bits (734), Expect = 9e-75,   Method: Compositional matrix adjust.
 Identities = 159/430 (36%), Positives = 237/430 (55%), Gaps = 8/430 (1%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPV-LTGKYAASYVRGLQGNTGSRL-KVA 60
           N G  GL  ++PN+N FR P WGRGQETPGED   L+  YA  Y+ G+QG     + K+ 
Sbjct: 159 NAGRYGLDSYAPNINGFRSPVWGRGQETPGEDAFFLSSLYAYEYITGMQGGKAPAVPKLV 218

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KH+  YD++NWN   R   +  +++QDL   Y   F++ +   K   +MCSYN VNG
Sbjct: 219 AVPKHFAGYDIENWNNNSRLGLDVNITQQDLAGYYTPQFRSAIQNAKALGLMCSYNAVNG 278

Query: 121 KPTCADPDILKNTIHGQWRL-DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
            P+C++   L+      W   +G++ SDCD+V  +YN   Y      A AD+++AG D+D
Sbjct: 279 VPSCSNSFFLQTLARDTWGFGNGFVSSDCDAVYNVYNPHGYAANTTGAVADSLRAGTDID 338

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG     +   A   GL+   D+ LAL    +  +  G FDG  S   + NLG  DV T 
Sbjct: 339 CGTSYPFYLVPAFNAGLVSRNDIELALTRYYSGLVMQGYFDGNSSL--YRNLGWNDVLTT 396

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
               ++ +AA +GI LLKN   TLPLS     +VA+IGP ++ T+ + GNY   A    +
Sbjct: 397 DAWNISYEAAVEGITLLKNDG-TLPLSK-STRSVALIGPWANATLQLQGNYYAAAPYLIS 454

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           PLQ       T++      ++         A   A+Q+D  +   G+D SIEAE +DR  
Sbjct: 455 PLQAFRASGMTVNFVNGTTISSTNTSGFAEAITLAQQSDVIIYAGGIDNSIEAEGLDRQN 514

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           +  PG Q +L+ ++++  + P+V++ M GG VD S  KN+ ++ A++W GYPGQ+GG A+
Sbjct: 515 ITWPGNQLDLIYQLSQVGK-PLVVLQMGGGQVDSSALKNNSKVNALVWGGYPGQSGGQAL 573

Query: 420 ADVLFGRANP 429
            D++ G   P
Sbjct: 574 FDIIMGNRAP 583


>gi|253579611|ref|ZP_04856880.1| glycoside hydrolase, family 3 domain-containing protein
           [Ruminococcus sp. 5_1_39B_FAA]
 gi|251849112|gb|EES77073.1| glycoside hydrolase, family 3 domain-containing protein
           [Ruminococcus sp. 5_1_39BFAA]
          Length = 706

 Score =  286 bits (733), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 165/444 (37%), Positives = 251/444 (56%), Gaps = 38/444 (8%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPNVNIFRDPRWGRG ET GEDP LT +    +V G+QG+ G  +K AAC KHY 
Sbjct: 109 GLTFWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGVKFVEGIQGD-GPVMKAAACAKHYA 167

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+A+ S +D+ +TY   F+A V E  V +VM +YN+ NG+P CA
Sbjct: 168 VH-----SGPESLRHEFDAQASMKDMWETYLPAFEALVTEADVEAVMGAYNRTNGEPCCA 222

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
              ++++ + G+W+ +G+  SDC ++   +     T TP ++AA A+ AG DL+CG    
Sbjct: 223 HKYLMEDVLRGKWKFEGHYTSDCWAIRDFHEHHMVTSTPRQSAAMALNAGCDLNCGNTY- 281

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDG-EPSAQPFGNLGPRDVCTPAHQQL 244
           +H  GA + GL+ EE +  +    +T +  LG+FDG E    P+  +  ++     H   
Sbjct: 282 LHMMGAYQDGLVTEEKITESAVRLLTTRYLLGLFDGSEYDKIPYSVVECKE-----HIDE 336

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           AL+ A +  VLLKN    LP+   + +T+ VIGPN+D    +IGNY G +  Y T L+GI
Sbjct: 337 ALKMARKSCVLLKNDG-VLPIDKTKVNTIGVIGPNADSRAALIGNYHGTSSEYITVLEGI 395

Query: 305 SRYA----KTIHQAGCFGVACNGNQL------IGAAEVAARQADATVLVMGLDQSIEAE- 353
              A    + ++  GC         L      I  A + A  +D  +L +GL++++E E 
Sbjct: 396 REEAGDDVRILYSQGCDLYKDKVENLAWDQDRISEAVITAENSDVVILCVGLNETLEGEE 455

Query: 354 --------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAI 405
                     D+  L LP  Q+EL+ +V    + P ++VLM G  +D+++A+++     I
Sbjct: 456 GDTGNSDASGDKVDLHLPKVQEELIEKVTAVGK-PTIVVLMAGSAIDLNYAQDN--CNGI 512

Query: 406 LWVGYPGQAGGAAIADVLFGRANP 429
           L   YPG  GG AIAD+LFG+ +P
Sbjct: 513 LLAWYPGARGGRAIADLLFGKESP 536


>gi|261368518|ref|ZP_05981401.1| beta-glucosidase [Subdoligranulum variabile DSM 15176]
 gi|282569400|gb|EFB74935.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Subdoligranulum variabile DSM 15176]
          Length = 717

 Score =  286 bits (733), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 166/443 (37%), Positives = 241/443 (54%), Gaps = 31/443 (6%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT W+PNVNIFRDPRWGRG ET GEDP LT +    +V G+QG+    L+ AAC KH+ 
Sbjct: 109 GLTIWAPNVNIFRDPRWGRGHETYGEDPYLTSRLGVRFVEGMQGDDPDYLRAAACAKHFA 168

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R++F+A+VS+QDL +TY   F+A V E  V +VM +YN+ NG+P C 
Sbjct: 169 VH-----SGPEDQRHYFDAKVSQQDLWETYLPAFRALVKEAGVEAVMGAYNRTNGEPCCG 223

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
              +L + + G+W   G++ SDC ++   +     T  P ++ A A+  G DL+CG   A
Sbjct: 224 SKTLLVDILRGKWNFQGHVTSDCWAIKDFHEGHMVTSGPVDSVALAVNNGCDLNCGDLYA 283

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
            + E AV  G ++EE ++ +L    T +M+LGMFD E    P+  +G   V +   Q L 
Sbjct: 284 -YLEEAVAEGKVKEETIDRSLVRLFTTRMKLGMFDAEEKV-PYNKIGYDAVDSREMQALN 341

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           L+ A + +VLLKN   TLPL   + H VAV+GPN+D    ++GNY G A  Y T L GI 
Sbjct: 342 LEVAEKILVLLKNENHTLPLDKSKLHRVAVVGPNADNRKALVGNYEGTASRYVTVLDGIQ 401

Query: 306 RY----AKTIHQAGCFGVA------CNGNQLIGAAEVAARQADATVLVMGLDQSIEAE-- 353
            Y     +  +  GC   A         N+LI        + D  +  +GLD  +E E  
Sbjct: 402 EYLGEDVQVRYSEGCHLYADKIQGLAKSNELISEVRGVCAECDVVICCLGLDAGLEGEEG 461

Query: 354 -----FI--DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
                F   D+  L LPG Q+ ++    ++ + PVV+V++ G  + +  A+      A+L
Sbjct: 462 DQGNQFASGDKQSLSLPGNQESVLKACIESGK-PVVVVVLSGSALALGTAQEGA--AAVL 518

Query: 407 WVGYPGQAGGAAIADVLFGRANP 429
              YPG  GG A+A  LFG  NP
Sbjct: 519 QAWYPGAQGGRAVARALFGECNP 541


>gi|310795958|gb|EFQ31419.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Glomerella graminicola M1.001]
          Length = 824

 Score =  286 bits (731), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 173/426 (40%), Positives = 236/426 (55%), Gaps = 16/426 (3%)

Query: 13  SPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDLD 72
           SPN+N F+DPRWGRGQETPGEDP     Y A+ + GL+G   S+ K+ A CKHY A D +
Sbjct: 198 SPNINAFKDPRWGRGQETPGEDPFHLQNYVAAMLTGLEGGDPSK-KLIATCKHYAANDFE 256

Query: 73  NWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILKN 132
           N+ GVDR  F+A ++ QDL + Y  PFK C V+ KV S MCSYN +NG+P CA+P +L++
Sbjct: 257 NYKGVDRAGFDANITTQDLSEYYLPPFKTCAVDKKVGSFMCSYNAINGEPLCANPYLLED 316

Query: 133 TIHGQWRLDG---YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHT- 188
            +   W  +G   Y+ +DCD V ++ +  HY      AAA A+KAG DL+C  F      
Sbjct: 317 ILRQHWGWNGDGQYVSTDCDCVALMVSHHHYAPDLGHAAAWAMKAGTDLECNAFPGSEAL 376

Query: 189 EGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQA 248
           + A    L+ E++V+ +L    T  + +G FD     QP  +L   DV T   Q+LA QA
Sbjct: 377 QLAWNQSLISEKEVDKSLTRMYTALVSVGQFDSA-RGQPLRSLSWDDVNTKEAQKLAYQA 435

Query: 249 AHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRY 307
             +G VLLKN    LPLS   R    A+IGP  + T  M GNY G A    +  Q     
Sbjct: 436 VIEGAVLLKNDG-ILPLSAAWREKKYALIGPWINATTQMQGNYFGPAPYLISLYQA---- 490

Query: 308 AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVM---GLDQSIEAEFIDRAGLLLPG 364
           AK       + +    N    + + A   A A  L++   G+D ++EAE  DR  L  P 
Sbjct: 491 AKEFGLDFTYSLGSRINSTDDSFKQALDSAHAAALIVFAGGVDNTLEAETRDRKTLAWPE 550

Query: 365 RQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLF 424
            Q +L+ R   A   PV+++   GG VD +    +  I A+LW GYPGQ+GG A+ D+LF
Sbjct: 551 SQLDLL-RAVSALGKPVIVLQFGGGQVDDTELLANHSINALLWGGYPGQSGGKAVIDLLF 609

Query: 425 GRANPG 430
           GRA P 
Sbjct: 610 GRAAPA 615


>gi|121809149|sp|Q4AEG8.1|XYND_ASPAW RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|73486695|dbj|BAE19756.1| beta-xylosidase [Aspergillus awamori]
          Length = 804

 Score =  285 bits (729), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 167/437 (38%), Positives = 243/437 (55%), Gaps = 10/437 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAA 61
           N G  GL  ++PN+N FR P WGRGQETPGED  L   YA  Y+ G+QG +  S LK+AA
Sbjct: 168 NAGRYGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPESNLKLAA 227

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
             KHY  YD++NW+   R   +  +++QDL + Y   F     + KV SVMC+YN VNG 
Sbjct: 228 TAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGV 287

Query: 122 PTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           P CAD   L+  +   +     GY+ SDCD+   +YN   Y  +   AAA+AI AG D+D
Sbjct: 288 PACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDID 347

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSA--QPFGNLGPRDVC 237
           CG     H   ++  G L  +D+   +    T  ++ G FD   +    P+ +L   DV 
Sbjct: 348 CGTTYQWHLNESITAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVL 407

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRH----HTVAVIGPNSDVTVTMIGNYAGV 293
                 ++ QAA QGIVLLKNS   LPL+   +     TVA+IGP ++ T  ++GNY G 
Sbjct: 408 ETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGN 467

Query: 294 ACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
           A    +P          ++ A   G++        AA  AA+ AD  +   G+D ++EAE
Sbjct: 468 APYMISPRAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAE 527

Query: 354 FIDRAGLLLPGRQQELVSRVAKAS-RGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
            +DR  +  PG Q +L+ ++A A+ + P++++ M GG VD S  KN+ ++ A+LW GYPG
Sbjct: 528 ALDRESIAWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTKVSALLWGGYPG 587

Query: 413 QAGGAAIADVLFGRANP 429
           Q+GG A+ D++ G+ NP
Sbjct: 588 QSGGFALRDIITGKKNP 604


>gi|297738404|emb|CBI27605.3| unnamed protein product [Vitis vinifera]
          Length = 581

 Score =  285 bits (728), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 147/375 (39%), Positives = 218/375 (58%), Gaps = 49/375 (13%)

Query: 57  LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYN 116
           LKV++CCKHY  YD+D+W           VS+QD+++T+  PF+                
Sbjct: 19  LKVSSCCKHYATYDIDSW---------LNVSEQDMKETFFSPFE---------------- 53

Query: 117 QVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE-EAAADAIKAG 175
                               +W L GYIVSDC  + V+ + Q+Y    + +A A  ++AG
Sbjct: 54  ------------------RDEWDLHGYIVSDCYGLEVIVDNQNYLNESKVDAVAKTLQAG 95

Query: 176 LDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD 235
           LDL+CG +       +V  G + + +++ AL     + MR+G FDG P+   + +LG +D
Sbjct: 96  LDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPA---YESLGLKD 152

Query: 236 VCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC 295
           +C   H +LA +AA QGIVLLKN    LPL   +   + ++GP+++ T  MIGNYAG+  
Sbjct: 153 ICAADHIELAREAARQGIVLLKNDYEVLPLKPGKK--LVLVGPHANATEVMIGNYAGLPY 210

Query: 296 GYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI 355
            Y +PL+  S      +  GC   +C+ +     A+ AA+ A+ T++ +G D SIEAEF+
Sbjct: 211 KYVSPLEAFSAIGNVTYATGCLDASCSNDTYFSEAKEAAKFAEVTIIFVGTDLSIEAEFV 270

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR   LLPG Q EL+ +VA+ S GPV+LV++ G  +D++FAKN+PRI AILWVG+PG+ G
Sbjct: 271 DRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGEQG 330

Query: 416 GAAIADVLFGRANPG 430
           G AIADV+FG+ NPG
Sbjct: 331 GHAIADVVFGKYNPG 345


>gi|222635353|gb|EEE65485.1| hypothetical protein OsJ_20901 [Oryza sativa Japonica Group]
          Length = 481

 Score =  284 bits (727), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 137/187 (73%), Positives = 158/187 (84%), Gaps = 2/187 (1%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS--RLK 58
           MYNGG AGLT+WSPNVNIFRDPRWGRGQETPGEDP +  +YAA+YVRGLQ    S  RLK
Sbjct: 1   MYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQQQQPSSGRLK 60

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           +AACCKH+TAYDLDNW+G DR+HFNA V++QDLEDT+NVPF++CVV+G+ ASVMCSYNQV
Sbjct: 61  LAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSCVVDGRAASVMCSYNQV 120

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           NG PTCAD   L+ TI  +W L GYIVSDCDSV V Y+ QHYTRT E+A A  ++AGLDL
Sbjct: 121 NGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHYTRTREDAVAATLRAGLDL 180

Query: 179 DCGPFLA 185
           DCGPFLA
Sbjct: 181 DCGPFLA 187



 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 50/68 (73%), Positives = 58/68 (85%)

Query: 363 PGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADV 422
           PGRQ EL+S VAKAS+GPV+LVLM GGP+D+ FA+NDP+I  ILW GYPGQAGG AIADV
Sbjct: 220 PGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQNDPKIAGILWAGYPGQAGGQAIADV 279

Query: 423 LFGRANPG 430
           +FG  NPG
Sbjct: 280 IFGHHNPG 287


>gi|291167620|dbj|BAI82526.1| 1,4-beta-D-xylosidase [Aureobasidium pullulans var. melanogenum]
          Length = 805

 Score =  284 bits (726), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 167/432 (38%), Positives = 245/432 (56%), Gaps = 10/432 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAA 61
           N G AGL +W+PN+N +RDPRWGRGQETPGEDP     Y  S + GLQG     + K+ A
Sbjct: 146 NYGRAGLDFWTPNINPYRDPRWGRGQETPGEDPYHLSSYVHSLIMGLQGGEDPEIRKITA 205

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+  YD+++WNG  RY  + ++ ++DL + Y   F++C  +  V + MC+Y+ +NG 
Sbjct: 206 TCKHFAGYDIESWNGNLRYQNDVQIPQRDLVEYYLPSFRSCARDSNVGAFMCTYSALNGV 265

Query: 122 PTCADPDILKNTIHGQW---RLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           PTCADP +L + +   W     + ++ SDCDS+  ++   +++ T + AAA A+ AG DL
Sbjct: 266 PTCADPWLLNDVLREHWGWTNEEQWVTSDCDSIQNIFLPHNFSDTRQGAAAAALNAGTDL 325

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG +   H   A   GL+ +  V+ AL    T  +R G FDG P+A  + NL   DV T
Sbjct: 326 DCGTYYQHHLPLAYSQGLINQTTVDQALVRLYTSLVRTGYFDG-PNAM-YRNLTWSDVGT 383

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
              QQLALQAA +G+VLLKN    LPLS      +A+IG  ++ T  M GNY GV     
Sbjct: 384 THAQQLALQAAEEGMVLLKNDG-LLPLSISNGTKIALIGSWANATTQMQGNYYGVPTYLH 442

Query: 299 TPLQGISRY-AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           +PL    +  A+  +  G  G              AA +AD  + + G+D S+EAE +DR
Sbjct: 443 SPLYAAQQTGAQVFYAQGPGGQGDPTTDHWLPVWTAAEKADIIIYIGGVDISVEAEGMDR 502

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +   G Q +++  +A   + P+VL  M G  +D +   N+  I A++W GYPGQ GG 
Sbjct: 503 EDINWTGAQLDIIGELAMYGK-PMVLAQM-GDQLDNTPIVNNANISALIWGGYPGQDGGV 560

Query: 418 AIADVLFGRANP 429
           A+ +++ G+  P
Sbjct: 561 ALFNIITGKTAP 572


>gi|280977785|gb|ACZ98610.1| glucosidase [Cellulosilyticum ruminicola]
          Length = 711

 Score =  284 bits (726), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 171/444 (38%), Positives = 247/444 (55%), Gaps = 35/444 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR-LKVAACCKHY 66
           G+T+W+PN+NIFRDPRWGRG ET GEDP LT +   ++++GLQG+     LK AAC KH+
Sbjct: 102 GMTFWAPNINIFRDPRWGRGHETYGEDPYLTARLGVAFIKGLQGDENEDYLKAAACAKHF 161

Query: 67  TAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
             +        DR+HF+A VSK+DL +TY   F+A V E  V  VM +YN+VNG+P C  
Sbjct: 162 AVHSGPE---EDRHHFDAIVSKKDLYETYLPAFEAAVKEANVIGVMGAYNRVNGEPACGS 218

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAI 186
             +L + +   W  DGYIVSDC ++   +     T T  E+AA AI  G +L+CG    +
Sbjct: 219 KTLLVDILKKDWGFDGYIVSDCWAIRDFHTEHMVTHTAAESAALAINNGCELNCGN-TYL 277

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT-PAHQQLA 245
           H   A + GL++EE +  A    + ++M+LG+FD         N  P  V     H+++A
Sbjct: 278 HMLEAHQEGLVKEEIITEAAEKLMRIRMQLGLFDKNCKY----NEIPYAVNDCKVHREVA 333

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           L+A+ + +V+LKN    LPL+  +  ++ +IGP ++    + GNY G A  YTT ++GI 
Sbjct: 334 LEASRRSMVMLKNDG-ILPLNKDKLKSIGIIGPTANNRTVLEGNYNGTASRYTTFVEGIQ 392

Query: 306 RY----AKTIHQAGCFGVACNG-------NQLIGAAEVAARQADATVLVMGLDQSIEAE- 353
            Y     +  +  GC   A NG       N     A + A Q+D  VL +GLD +IE E 
Sbjct: 393 DYVGDDVRVYYSEGCHLFA-NGMSNLAWENDREAEALIVAEQSDVVVLCLGLDSTIEGEQ 451

Query: 354 ------FI--DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAI 405
                 F   D+  L L GRQQ+L+ +V    + PV+LVL  G  + +++A  D    AI
Sbjct: 452 GDTGNAFAGGDKLSLNLIGRQQQLLEKVVAVGK-PVILVLSTGSAMAINYA--DEHCNAI 508

Query: 406 LWVGYPGQAGGAAIADVLFGRANP 429
               YPG  GG A+A +LFG  +P
Sbjct: 509 FQTWYPGAQGGKALAQLLFGEYSP 532


>gi|145230215|ref|XP_001389416.1| exo-1,4-beta-xylosidase xlnD [Aspergillus niger CBS 513.88]
 gi|74626559|sp|O00089.2|XYND_ASPNG RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|292495287|sp|A2QA27.1|XYND_ASPNC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|2181180|emb|CAB06417.1| xylosidase [Aspergillus niger]
 gi|134055533|emb|CAK37179.1| xylosidase xlnD-Aspergillus niger
 gi|350638468|gb|EHA26824.1| hypothetical protein ASPNIDRAFT_205670 [Aspergillus niger ATCC
           1015]
          Length = 804

 Score =  284 bits (726), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 167/437 (38%), Positives = 242/437 (55%), Gaps = 10/437 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAA 61
           N G  GL  ++PN+N FR P WGRGQETPGED  L   YA  Y+ G+QG +  S LK+AA
Sbjct: 168 NAGRYGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPESNLKLAA 227

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
             KHY  YD++NW+   R   +  +++QDL + Y   F     + KV SVMC+YN VNG 
Sbjct: 228 TAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGV 287

Query: 122 PTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           P CAD   L+  +   +     GY+ SDCD+   +YN   Y  +   AAA+AI AG D+D
Sbjct: 288 PACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDID 347

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSA--QPFGNLGPRDVC 237
           CG     H   ++  G L  +D+   +    T  ++ G FD   +    P+ +L   DV 
Sbjct: 348 CGTTYQWHLNESIAAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVL 407

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRH----HTVAVIGPNSDVTVTMIGNYAGV 293
                 ++ QAA QGIVLLKNS   LPL+   +     TVA+IGP ++ T  ++GNY G 
Sbjct: 408 ETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGN 467

Query: 294 ACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
           A    +P          ++ A   G++        AA  AA+ AD  +   G+D ++EAE
Sbjct: 468 APYMISPRAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAE 527

Query: 354 FIDRAGLLLPGRQQELVSRVAKAS-RGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
            +DR  +  PG Q +L+ ++A A+ + P++++ M GG VD S  KN+  + A+LW GYPG
Sbjct: 528 ALDRESIAWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPG 587

Query: 413 QAGGAAIADVLFGRANP 429
           Q+GG A+ D++ G+ NP
Sbjct: 588 QSGGFALRDIITGKKNP 604


>gi|290889355|gb|ADD69953.1| xylosidase HistTag [synthetic construct]
          Length = 810

 Score =  284 bits (726), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 167/437 (38%), Positives = 242/437 (55%), Gaps = 10/437 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAA 61
           N G  GL  ++PN+N FR P WGRGQETPGED  L   YA  Y+ G+QG +  S LK+AA
Sbjct: 168 NAGRYGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPESNLKLAA 227

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
             KHY  YD++NW+   R   +  +++QDL + Y   F     + KV SVMC+YN VNG 
Sbjct: 228 TAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGV 287

Query: 122 PTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           P CAD   L+  +   +     GY+ SDCD+   +YN   Y  +   AAA+AI AG D+D
Sbjct: 288 PACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDID 347

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSA--QPFGNLGPRDVC 237
           CG     H   ++  G L  +D+   +    T  ++ G FD   +    P+ +L   DV 
Sbjct: 348 CGTTYQWHLNESIAAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVL 407

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRH----HTVAVIGPNSDVTVTMIGNYAGV 293
                 ++ QAA QGIVLLKNS   LPL+   +     TVA+IGP ++ T  ++GNY G 
Sbjct: 408 ETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGN 467

Query: 294 ACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
           A    +P          ++ A   G++        AA  AA+ AD  +   G+D ++EAE
Sbjct: 468 APYMISPRAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAE 527

Query: 354 FIDRAGLLLPGRQQELVSRVAKAS-RGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
            +DR  +  PG Q +L+ ++A A+ + P++++ M GG VD S  KN+  + A+LW GYPG
Sbjct: 528 ALDRESIAWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPG 587

Query: 413 QAGGAAIADVLFGRANP 429
           Q+GG A+ D++ G+ NP
Sbjct: 588 QSGGFALRDIITGKKNP 604


>gi|429850127|gb|ELA25427.1| glycoside hydrolase family 3 protein [Colletotrichum
           gloeosporioides Nara gc5]
          Length = 918

 Score =  284 bits (726), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 170/434 (39%), Positives = 244/434 (56%), Gaps = 15/434 (3%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N G +GL  +SPN+N F+DPRWGRGQETPGED     KY ++ + GL+G+   + K+ A 
Sbjct: 272 NAGRSGLDLYSPNINAFKDPRWGRGQETPGEDTFHLQKYVSAMLSGLEGDDPDK-KLIAT 330

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKHY A D +N+ GVDR  FNA +S QDL + Y  PFK C VE  V S MCSYN +NG P
Sbjct: 331 CKHYAANDFENYKGVDRSGFNAVISTQDLSEYYLPPFKTCAVEKNVGSFMCSYNGINGTP 390

Query: 123 TCADPDILKNTIHGQWRLDG---YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
            CA+  ++++ +   W  +G   Y+ +DCD V ++ +  HY      AAA +++AG DL+
Sbjct: 391 LCANSYLIEDILRKHWGWNGDGQYVSTDCDCVALMVSYHHYAPDLGHAAAWSMQAGTDLE 450

Query: 180 CGPFLAIHT-EGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           C  F      + A    L+ E+DV+ AL    T  + +G+FD +    P  +LG  +V T
Sbjct: 451 CNAFPGSEALQSAWNQSLISEKDVDKALTRMYTSLVSVGLFDLD-RKDPLRSLGWDEVNT 509

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
              Q LA +AA +G VL+KN    LPLS       A+IGP    T  M GNY G A    
Sbjct: 510 KEAQDLAYRAAVEGAVLMKNDG-ILPLSPDSSKKYALIGPWVSATTQMQGNYFGPAPYLI 568

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQ---LIGAAEVAARQADATVLVMGLDQSIEAEFI 355
           +P +     AK +     + +    N+       A  AA+ AD  + + G+D ++E E +
Sbjct: 569 SPRKA----AKDLGLDFTYFLGSRTNKSDSSFAQAIKAAQAADVVIFMGGVDNTLEQETL 624

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  L  P  Q +L+  +++  + P+V++   GG VD +    +  + AILW GYPGQ+G
Sbjct: 625 DRNTLAWPEPQLQLLRALSEVGK-PLVVLQFGGGQVDDTELLANDSVNAILWGGYPGQSG 683

Query: 416 GAAIADVLFGRANP 429
           G AI D++FGRA P
Sbjct: 684 GKAILDIVFGRAAP 697


>gi|326789672|ref|YP_004307493.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
 gi|326540436|gb|ADZ82295.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
          Length = 704

 Score =  284 bits (726), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 171/444 (38%), Positives = 248/444 (55%), Gaps = 33/444 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           G+T W+PN+NIFRDPRWGRG ET GEDP LT +   +++ GLQG+      K AAC KH+
Sbjct: 103 GMTLWAPNINIFRDPRWGRGHETYGEDPYLTSRLGVAFIHGLQGDENHHYWKAAACAKHF 162

Query: 67  TAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
             +        +R+HF+A VSK+DL +TY   F+A V +GKVA +M +YN+VNG+P C  
Sbjct: 163 AVHSGPE---EERHHFDAVVSKKDLYETYLPAFEAAVTKGKVAGMMGAYNRVNGEPACGS 219

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAI 186
             +L++ +  +W  DGY+VSDC ++   +     T T  E+AA AI  G  L+CG    +
Sbjct: 220 KVLLQDILKEEWGFDGYVVSDCWAIRDFHTEHMVTHTATESAALAINNGCQLNCGN-TYL 278

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT-PAHQQLA 245
           H   A + GL+ EE +  +    + ++M+LG+FD         N  P +V     H+ +A
Sbjct: 279 HMLQAYKEGLVTEETITKSAQKLMAIRMKLGLFDKNCEY----NKIPYEVNDCKVHRDIA 334

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           L  A + +VLLKN+   LPL+  +   + VIGP ++    + GNY G A  YTT L+GI 
Sbjct: 335 LDVARRSMVLLKNNG-ILPLNLKQTKAIGVIGPTANSRTVLQGNYFGTASRYTTFLEGIQ 393

Query: 306 RY----AKTIHQAGC--FGVACNG----NQLIGAAEVAARQADATVLVMGLDQSIEAE-- 353
            Y    A+  +  GC  F  + +G    N  +  A + A Q+D  +L +GLD SIE E  
Sbjct: 394 DYVGDAARVYYAEGCHLFKNSISGLSWENDRLSEALIVAEQSDVVILCLGLDASIEGEQG 453

Query: 354 -----FI--DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
                F   D++ L L GRQQ L+  V K  + P +L+L  G  + +  A+      AIL
Sbjct: 454 DTGNAFAAGDKSDLNLIGRQQLLLEEVLKIGK-PTILILSSGSAMAIHTAQE--YCEAIL 510

Query: 407 WVGYPGQAGGAAIADVLFGRANPG 430
              YPGQ+GG A+A +LFG  +P 
Sbjct: 511 ETWYPGQSGGKALAQLLFGEYSPS 534


>gi|367046937|ref|XP_003653848.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
 gi|347001111|gb|AEO67512.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
          Length = 923

 Score =  283 bits (723), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 177/432 (40%), Positives = 246/432 (56%), Gaps = 14/432 (3%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N G++G  +W+PN+N FRDPRWGRG ETPGED     +Y    + GLQG+     ++ A 
Sbjct: 269 NHGLSGFDFWTPNINPFRDPRWGRGPETPGEDAFRIQQYIRHLIPGLQGSDPLDKQIIAT 328

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKHY  YD++      RY ++      DL + Y  PFK CV +  + SVMCSYN V+G P
Sbjct: 329 CKHYAVYDVE----TGRYEYDYDPQPHDLAEYYLAPFKTCVRDVGIGSVMCSYNAVDGIP 384

Query: 123 TCADPDILKNTIHGQWRLD---GYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
            CA   +L++ +   W       Y+VSDCD+V  +Y+  ++T +P  AAA A+ AG DL+
Sbjct: 385 ACASEYLLQSVLRDHWGFTEPYQYVVSDCDAVRFIYSPHNFTDSPAAAAAVALNAGTDLE 444

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG    ++   ++   +  E  ++ AL    T    +G FDG  SA+ +G LG   V T 
Sbjct: 445 CGSTY-LNLNQSLASNMTTEAALDRALTRLYTALHTIGFFDG--SAR-YGGLGWDAVGTG 500

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
             Q LA QAA  G VLLKN    LPL + R   +AVIGP ++ T  M GNY G A    +
Sbjct: 501 DAQVLAYQAAVDGAVLLKNEKSLLPLDSKRLRKLAVIGPWANATTQMQGNYFGQAAYLVS 560

Query: 300 PLQGISRY--AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           PL        A  +  A   G+A N      AA  AA+ ADA V + G+D S+E+E +DR
Sbjct: 561 PLAAFQSAWGADNVLFANGTGIAGNSTAGFAAALAAAKAADAVVFLGGVDNSVESESLDR 620

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +  PG Q +L++++A   + P+V+V   GG +D S    +PR+GA+LW GYPGQAGGA
Sbjct: 621 TAISWPGNQLDLIAQLAAVGK-PLVVVQCGGGQLDDSALLANPRVGALLWAGYPGQAGGA 679

Query: 418 AIADVLFGRANP 429
           AIAD+L G+  P
Sbjct: 680 AIADLLTGKQAP 691


>gi|292495285|sp|B6EY09.1|XYND_ASPJA RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|211970990|dbj|BAG82824.1| 1,4-beta-D-xylosidase [Aspergillus japonicus]
          Length = 804

 Score =  282 bits (722), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 168/437 (38%), Positives = 242/437 (55%), Gaps = 11/437 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDP-VLTGKYAASYVRGLQGNTG-SRLKVA 60
           N G  GL  +SPN+N FR P WGRGQETPGED   LT  YA  Y+ G+QG      LK+A
Sbjct: 159 NAGRFGLDVYSPNINTFRHPVWGRGQETPGEDAYTLTAAYAYEYITGIQGGVNPEHLKLA 218

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KH+  YD++NW+   R   +  +++QDL + Y   F     +  V S MCSYN VNG
Sbjct: 219 ATAKHFAGYDIENWDNHSRLGNDVNITQQDLAEYYTPQFLVAARDAHVHSFMCSYNAVNG 278

Query: 121 KPTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+C++   L+  +   +     GY+  DC +V  ++N   Y      AAADAI AG D+
Sbjct: 279 VPSCSNTFFLQTLLRDTFSFVDHGYVSGDCGAVYGVFNPHGYAANEPSAAADAILAGTDI 338

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDG-EPSAQPFGNLGPRDVC 237
           DCG     H   ++  G +  +D+           + LG FDG   S+ P+ +LG  DV 
Sbjct: 339 DCGTSYQYHFNESITTGAVARDDIERGFIRLYANLVELGYFDGNSSSSNPYRSLGWPDVQ 398

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTL---RHHTVAVIGPNSDVTVTMIGNYAGVA 294
                 ++ +AA +GIVLLKN   TLPL++    ++ ++A+IGP ++ T  + GNY G A
Sbjct: 399 KTDAWNISYEAAVEGIVLLKNDG-TLPLASPSEGKNKSIALIGPWANATTQLQGNYYGDA 457

Query: 295 CGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEF 354
               +P+   +    T+H A    ++ N      AA  AAR AD  V + G+D +IEAE 
Sbjct: 458 PYLISPVDAFTAAGYTVHYAPGTEISTNSTANFSAALSAARAADTIVFLGGIDNTIEAEA 517

Query: 355 IDRAGLLLPGRQQELVSRVA--KASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
            DR+ +  PG Q EL+S++A  K+   P+V+  M GG VD S  K++ ++ A+LW GYPG
Sbjct: 518 QDRSSIAWPGNQLELISQLAAQKSDDQPLVVYQMGGGQVDSSALKSNAKVNALLWGGYPG 577

Query: 413 QAGGAAIADVLFGRANP 429
           Q+GG A+ D+L G   P
Sbjct: 578 QSGGLALRDILTGARAP 594


>gi|410098444|ref|ZP_11293422.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409222318|gb|EKN15263.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 738

 Score =  282 bits (721), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 171/443 (38%), Positives = 236/443 (53%), Gaps = 32/443 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYW+PN+NIFRDPRWGRGQET GEDP LT K  ++ V GL+GN    LK  AC KHY 
Sbjct: 135 GLTYWTPNINIFRDPRWGRGQETYGEDPYLTAKMGSAIVHGLEGNNPEYLKSVACAKHYA 194

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +     N   R+ ++ARVS  DL DTY   F+  V + KV  VMC+YN+  G P C   
Sbjct: 195 VHSGPEHN---RHSYDARVSMYDLWDTYLPAFRELVTKAKVHGVMCAYNRFEGTPCCGHN 251

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVL--YNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
           ++L++ +  QW+ DGY+ SDC +V     Y+  H   T  EA ADA+  G DL+CG    
Sbjct: 252 ELLQDILRNQWKFDGYVTSDCWAVSDFAKYHKTHSNDT--EAVADAVLNGTDLECGNLYQ 309

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQ-PFGNLGPRDVCTPAHQQL 244
              +G V  GL+ E+D+N++LA    +Q +LGM+D  P+ + P+ ++G   +   AH++ 
Sbjct: 310 KLQQG-VEKGLISEKDINVSLARLFEIQFKLGMYD--PADRVPYASIGREVIECDAHKKH 366

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           A + A + +VLLKN+   LPL+  +   +A+IGPN D   T++ NY G      TP + +
Sbjct: 367 AYEMAQKSMVLLKNNKNILPLNASKIKRIALIGPNMDNGSTLLANYFGTPSEIITPYKSL 426

Query: 305 -SRYAKTIHQAGCFGV----ACNGNQLIGAAEVAARQADATVLVMGLDQSIE-------- 351
             R+  +I      GV       G          A++AD  + V G+    E        
Sbjct: 427 QKRFGNSIQIDTLTGVGIVQKLEGAPSFAQVAAQAKKADIIIFVGGISADYEGEAGDAGA 486

Query: 352 -----AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
                    DR  + LP  Q EL+  + K  R P++LV M G    +SF        AIL
Sbjct: 487 AGYGGFASGDRTTMKLPPVQTELMKELKKTGR-PLILVNMSGSV--MSFDWESRNADAIL 543

Query: 407 WVGYPGQAGGAAIADVLFGRANP 429
              Y GQA G AI DVLFG  NP
Sbjct: 544 QAWYGGQAAGDAITDVLFGDYNP 566


>gi|436410475|gb|AGB57183.1| beta-xylosidase [Aspergillus sp. BCC125]
          Length = 804

 Score =  282 bits (721), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 167/437 (38%), Positives = 240/437 (54%), Gaps = 10/437 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAA 61
           N G  GL  ++PN+N FR P WGRGQETPGED  L   YA  Y+ G+QG +  S LK+AA
Sbjct: 168 NAGRYGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAIYAYEYITGIQGPDPDSNLKLAA 227

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
             KHY  YD++NW+   R   +  +++QDL + Y   F     + KV SVMC+YN VNG 
Sbjct: 228 TAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGV 287

Query: 122 PTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           P CAD   L+  +   +     GY+ SDCD+   +YN   Y  +   AAA+AI AG D+D
Sbjct: 288 PACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDID 347

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSA--QPFGNLGPRDVC 237
           CG     H   ++  G L  +D+   +    T  ++ G FD   +    P+ +L   DV 
Sbjct: 348 CGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVL 407

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRH----HTVAVIGPNSDVTVTMIGNYAGV 293
                 ++ QAA QGIVLLKNS   LPL+   +     TVA+IGP ++ T  ++GNY G 
Sbjct: 408 ETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGN 467

Query: 294 ACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
           A    +P          ++ A   G++        AA  AA+ AD  +   G+D ++EAE
Sbjct: 468 APYMISPRAAFEEAGYNVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAE 527

Query: 354 FIDRAGLLLPGRQQELVSRVA-KASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
            +DR  +  PG Q +L+ ++A  A   P++++ M GG VD S  KN+  + A+LW GYPG
Sbjct: 528 ALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPG 587

Query: 413 QAGGAAIADVLFGRANP 429
           Q+GG A+ D++ G+ NP
Sbjct: 588 QSGGFALRDIITGKKNP 604


>gi|402308386|ref|ZP_10827395.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
 gi|400375830|gb|EJP28725.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
          Length = 721

 Score =  282 bits (721), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 172/435 (39%), Positives = 249/435 (57%), Gaps = 23/435 (5%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT+WSPNVNIFRDPRWGRG ET GEDP+L+G    +YVRGLQG+    LK  AC KHY
Sbjct: 127 AGLTFWSPNVNIFRDPRWGRGMETYGEDPLLSGMLGTAYVRGLQGDDAFYLKTGACAKHY 186

Query: 67  TAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
             +         R+  +   S++DL +TY   FK  V +G+V +VM +YN+V G+P    
Sbjct: 187 AVHSGPEGT---RHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGS 243

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PFLA 185
             +L + +   W  +G+IVSDCD++   Y    Y +TPEEA A AIKAGL+++CG  F A
Sbjct: 244 KYLLTDILRKSWGFNGHIVSDCDAINDFYGGHRYVKTPEEACAAAIKAGLNVECGHTFKA 303

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           +  +GA+  GLL E D++ AL   +  +++LG+ + + SA P+ +    ++C+PAH  LA
Sbjct: 304 M--QGALDQGLLAEADLDRALFPLVMTRLKLGILEPD-SACPYNSYDESEICSPAHTALA 360

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI- 304
           L+AA + +VLLKN+   LPL      T+ V GP +     ++GNY G++  Y+T LQGI 
Sbjct: 361 LRAADEAMVLLKNNG-ILPLDK-NIRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIV 418

Query: 305 SRYAK--TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE--------- 353
           SR +   +++    F         +  A   A  A+  ++VMG + ++E E         
Sbjct: 419 SRVSSGTSVNFRPAFMQITEELNDMNWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASAS 478

Query: 354 FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 413
             DR G+ LP  Q   + RV     G +V+VL  G P+D+   K      A++   YPGQ
Sbjct: 479 RGDRVGIGLPASQLNYLRRVKARKGGRIVVVLTGGSPIDLR--KISKLADAVVMAWYPGQ 536

Query: 414 AGGAAIADVLFGRAN 428
            GG A+ D+LFG  N
Sbjct: 537 EGGEALGDLLFGDKN 551


>gi|310792973|gb|EFQ28434.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Glomerella graminicola M1.001]
          Length = 728

 Score =  281 bits (720), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 156/430 (36%), Positives = 244/430 (56%), Gaps = 15/430 (3%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N G AG  +W+PN+N FRDPRWGRGQETPGED ++   Y  SYV GLQG+  +   + A 
Sbjct: 87  NYGYAGFNFWTPNMNAFRDPRWGRGQETPGEDVLVVSNYVQSYVTGLQGSDPTDKVIIAA 146

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKH+ AYD++     + Y+     ++QDL+D Y   F+ CV +  V +VMCSYN V+G P
Sbjct: 147 CKHFAAYDIETARRANNYN----PTQQDLQDYYLPAFRRCVRDSHVGTVMCSYNSVDGIP 202

Query: 123 TCADPDILKNTIHGQWRLDG---YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
            C+   +LK  +   W       ++VSDC +V  ++   ++T T ++AA+ ++ AG DL+
Sbjct: 203 ACSSEYLLKEVLRDTWGFTNDYQFVVSDCGAVTDVWLLHNFTNTEQDAASVSMAAGTDLE 262

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG    +H  G++    + +E V+ AL         +G FDG   +    +LG  DV T 
Sbjct: 263 CGSSY-LHLNGSLADKQVTQERVDEALTRLYKALFTVGYFDGSSHS----SLGWSDVSTI 317

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
             QQ+A +AA  G+ LLKN    LPL+  ++ +VA+IGP ++ T  M GNY G A    +
Sbjct: 318 DAQQIACEAARAGMTLLKNDG-VLPLADGKYKSVALIGPFANATTQMQGNYFGRAPFVRS 376

Query: 300 PLQGISRYAK-TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           PL   ++ +   ++ A    +    +     A  AA+ +D  +   G+D +IEAE +DR 
Sbjct: 377 PLWAFTQQSSLQVNYAAGTDINSTSDSGFADALAAAKNSDIVIFCGGIDTTIEAETLDRV 436

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            +  PG Q +L+S+++   + P+V+    GG VD +   ++  + A+ W G PGQAGG A
Sbjct: 437 SITWPGNQLDLISQLSMLGK-PLVVAQFGGGQVDDTALVDNANVNALFWAGLPGQAGGLA 495

Query: 419 IADVLFGRAN 428
           + D++ G+A+
Sbjct: 496 MYDLVVGKAS 505


>gi|307719075|ref|YP_003874607.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
           6192]
 gi|306532800|gb|ADN02334.1| glycoside hydrolase family 3 [Spirochaeta thermophila DSM 6192]
          Length = 693

 Score =  281 bits (720), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 174/439 (39%), Positives = 255/439 (58%), Gaps = 34/439 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPN+NI+RDPRWGRGQET GEDP LT K   ++V+GLQG+    L+VAAC KHY 
Sbjct: 101 GLTFWSPNINIYRDPRWGRGQETYGEDPFLTSKIGVAFVKGLQGDHPYYLRVAACAKHYA 160

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+ARVS++DL +TY   F+A V  G V +VM +YN+VNG+P C 
Sbjct: 161 VH-----SGPEGLRHVFDARVSEKDLWETYLPAFEALVKAG-VEAVMGAYNRVNGEPACG 214

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
              +L+  +  +W   G++VSDC ++   +     T+ P E+ A A++AG DL+CG    
Sbjct: 215 SKRLLEEILRKKWGFKGHVVSDCWAIADFHLHHKVTKDPIESIAMALEAGCDLNCGNTYE 274

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
            H   AV+ G + EE V+ ++A  ++   RLG+F  +    P+  L   D+   AH+ LA
Sbjct: 275 -HLLDAVKAGAVSEELVDRSVARLLSTLDRLGLFTDD---HPYVRLSLADIDWEAHRALA 330

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
            +AA + +VLLKN+   LPL   +   + V GPN+   V ++GNYAGV+    T L+GI+
Sbjct: 331 REAAEKSVVLLKNNG-ILPLDRRKLRYIYVTGPNAANPVALLGNYAGVSSRLVTVLEGIT 389

Query: 306 RYA----KTIHQAGCFGVACNGNQL--IGAAEVAARQADATVLVMGLDQSIEAE------ 353
            YA       ++ GC      GN++  I  A   AR AD TV VMG D ++E E      
Sbjct: 390 GYAGPGITVTYKIGC---PLQGNKINPIDWASGVARYADVTVAVMGRDSAVEGEEGDAIF 446

Query: 354 ---FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGY 410
              + D + L L   Q + + R+ +  + P+V+VL+ G P  V   + +    AI++  Y
Sbjct: 447 SDNYGDLSDLNLSREQIDYLRRIKEIGK-PLVVVLLSGAP--VCSPELEELADAIVYAWY 503

Query: 411 PGQAGGAAIADVLFGRANP 429
           PG+ GG AIA VLFG  +P
Sbjct: 504 PGEEGGNAIARVLFGEVSP 522


>gi|182415033|ref|YP_001820099.1| Beta-glucosidase [Opitutus terrae PB90-1]
 gi|177842247|gb|ACB76499.1| Beta-glucosidase [Opitutus terrae PB90-1]
          Length = 905

 Score =  281 bits (720), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 181/445 (40%), Positives = 248/445 (55%), Gaps = 34/445 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYW+PN+N+FRDPRWGRGQET GEDP LT +    +V+G+QG+    +   AC KHY 
Sbjct: 312 GLTYWAPNINLFRDPRWGRGQETYGEDPFLTAEIGIEFVKGVQGDDPRYMLAMACAKHYA 371

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +         R+ FNA + ++DL DTY   F+  V EGKVA VM +YN VNG P  A+ 
Sbjct: 372 VHSGPERT---RHSFNAEIPERDLFDTYLPHFERVVREGKVAGVMSAYNAVNGVPASANS 428

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQ--HYTRTPEEAAADAIKAGLDLDC-GPFL 184
            +L   +  +W  +GY+ SDCD++  +Y  +  HY +T EEAAA A+KAG +L C G + 
Sbjct: 429 FLLTELLRKRWGFEGYVPSDCDAIRDIYGEKQHHYVKTAEEAAALAVKAGCNLCCGGDYN 488

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQ-PFGNLGPRDVCTPAHQQ 243
           A+    AV+ GL+ E+D++ AL +T+  + RLG+FD  P+ Q PF     +D   PAH Q
Sbjct: 489 ALVR--AVQQGLVTEKDLDGALYHTLWTRFRLGLFD--PAEQVPFSGYTLKDNDLPAHSQ 544

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           +AL+ A Q IVLLKN   TLPL   +   +AVIGPN+     + GNY G A    + L  
Sbjct: 545 VALELARQAIVLLKNDG-TLPLDRTKLKQIAVIGPNAASKSMLEGNYHGSASRSISILDD 603

Query: 304 ISRYA----KTIHQAGC-----FGVACNGNQ----------LIGAAEVAARQADATVLVM 344
           I        K  H  G       G A    Q          L   A   A +ADA + V 
Sbjct: 604 IRNLVGSEIKITHAMGSPVTTKPGTAPWSGQDNTTDRPVAELKAEALKLAAEADAIIYVG 663

Query: 345 GLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGA 404
           G+  + E E  DR  + LP  Q++L+ R   A+  PVV+V   G  + +++   D  + A
Sbjct: 664 GITPAQEGESFDRESIELPSEQEDLI-RALHATGKPVVMVNCSGSAMALTW--QDENLPA 720

Query: 405 ILWVGYPGQAGGAAIADVLFGRANP 429
           I+   YPGQ GG A+A+VLFG  NP
Sbjct: 721 IVQAWYPGQEGGRAVAEVLFGETNP 745


>gi|358365439|dbj|GAA82061.1| beta-xylosidase [Aspergillus kawachii IFO 4308]
          Length = 788

 Score =  281 bits (720), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 168/437 (38%), Positives = 240/437 (54%), Gaps = 10/437 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAA 61
           N G  GL  ++PN+N FR P WGRGQETPGED  L   YA  Y+ G+QG +  S LK+AA
Sbjct: 152 NAGRYGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPDSNLKLAA 211

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
             KHY  YD++NW+   R   +  +++QDL + Y   F     + KV SVMC+YN VNG 
Sbjct: 212 TAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGV 271

Query: 122 PTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           P CAD   L+  +   +     GY+ SDCD+   +YN   Y  +   AAA+AI AG D+D
Sbjct: 272 PACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDID 331

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSA--QPFGNLGPRDVC 237
           CG     H   ++  G L  +D+   +    T  ++ G FD   +    P+ +L   DV 
Sbjct: 332 CGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVL 391

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRH----HTVAVIGPNSDVTVTMIGNYAGV 293
                 ++ QAA QGIVLLKNS   LPL+   +     TVA+IGP ++ T  ++GNY G 
Sbjct: 392 ETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGN 451

Query: 294 ACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
           A    +P          ++ A   G++        AA  AAR AD  +   G+D ++EAE
Sbjct: 452 APYMISPRAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAE 511

Query: 354 FIDRAGLLLPGRQQELVSRVA-KASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
            +DR  +  PG Q +L+ ++A  A   P++++ M GG VD S  KN+  + A+LW GYPG
Sbjct: 512 ALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPG 571

Query: 413 QAGGAAIADVLFGRANP 429
           Q+GG A+ D++ G+ NP
Sbjct: 572 QSGGFALRDIITGKKNP 588


>gi|194400335|gb|ACF61038.1| beta-xylosidase [Aspergillus awamori]
          Length = 804

 Score =  281 bits (719), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 168/437 (38%), Positives = 240/437 (54%), Gaps = 10/437 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAA 61
           N G  GL  ++PN+N FR P WGRGQETPGED  L   YA  Y+ G+QG +  S LK+AA
Sbjct: 168 NAGRYGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPDSNLKLAA 227

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
             KHY  YD++NW+   R   +  +++QDL + Y   F     + KV SVMC+YN VNG 
Sbjct: 228 TAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGV 287

Query: 122 PTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           P CAD   L+  +   +     GY+ SDCD+   +YN   Y  +   AAA+AI AG D+D
Sbjct: 288 PACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDID 347

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSA--QPFGNLGPRDVC 237
           CG     H   ++  G L  +D+   +    T  ++ G FD   +    P+ +L   DV 
Sbjct: 348 CGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVL 407

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRH----HTVAVIGPNSDVTVTMIGNYAGV 293
                 ++ QAA QGIVLLKNS   LPL+   +     TVA+IGP ++ T  ++GNY G 
Sbjct: 408 ETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGN 467

Query: 294 ACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
           A    +P          ++ A   G++        AA  AAR AD  +   G+D ++EAE
Sbjct: 468 APYMISPRAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAE 527

Query: 354 FIDRAGLLLPGRQQELVSRVA-KASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
            +DR  +  PG Q +L+ ++A  A   P++++ M GG VD S  KN+  + A+LW GYPG
Sbjct: 528 ALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPG 587

Query: 413 QAGGAAIADVLFGRANP 429
           Q+GG A+ D++ G+ NP
Sbjct: 588 QSGGFALRDIITGKKNP 604


>gi|288924872|ref|ZP_06418809.1| beta-glucosidase [Prevotella buccae D17]
 gi|288338659|gb|EFC77008.1| beta-glucosidase [Prevotella buccae D17]
          Length = 721

 Score =  281 bits (718), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 171/435 (39%), Positives = 247/435 (56%), Gaps = 23/435 (5%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT+WSPNVNIFRDPRWGRG ET GEDP+L+G    +YVRGLQG+    LK  AC KHY
Sbjct: 127 AGLTFWSPNVNIFRDPRWGRGMETYGEDPLLSGMLGTAYVRGLQGDDAFYLKTGACAKHY 186

Query: 67  TAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
             +         R+  +   S++DL +TY   FK  V +G+V +VM +YN+V G+P    
Sbjct: 187 AVHSGPEGT---RHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGS 243

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PFLA 185
             +L + +   W  +G+IVSDCD++   Y    Y +TPEEA A AIKAGL+++CG  F A
Sbjct: 244 KYLLTDILRKSWGFNGHIVSDCDAINDFYGGHRYVKTPEEACAAAIKAGLNVECGHTFKA 303

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           +  +GA+  GLL E D++ AL   +  +++LG+ + + SA P+ +    ++C+PAH  LA
Sbjct: 304 M--QGALDQGLLAEADLDRALFPLVMTRLKLGILEPD-SACPYNSYDESEICSPAHTALA 360

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI- 304
           L+AA + +VLLKN+   LPL      T+ V GP +     ++GNY G++  Y+T LQGI 
Sbjct: 361 LRAADEAMVLLKNNG-ILPLDK-NIRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIV 418

Query: 305 SRYAK--TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE--------- 353
           SR +   +++    F         +  A   A  A+  ++VMG + ++E E         
Sbjct: 419 SRVSSGTSVNFRPAFMQITEELNDMNWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASAS 478

Query: 354 FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 413
             DR G+ LP  Q   + RV     G +V+VL  G P+D+           + W  YPGQ
Sbjct: 479 RGDRVGIGLPASQMNYLRRVKARKGGRIVVVLTGGSPIDLREISKLADAVVMAW--YPGQ 536

Query: 414 AGGAAIADVLFGRAN 428
            GG A+ D+LFG  N
Sbjct: 537 EGGEALGDLLFGDKN 551


>gi|238483831|ref|XP_002373154.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
 gi|292495283|sp|B8MYV0.1|XYND_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|220701204|gb|EED57542.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
          Length = 797

 Score =  281 bits (718), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 159/433 (36%), Positives = 236/433 (54%), Gaps = 10/433 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPV-LTGKYAASYVRGLQGNTGSR-LKVA 60
           N G  GL  +SPN+N FR P WGRGQETPGED   L   YA  Y+ G+QG   +  LK+ 
Sbjct: 162 NAGRYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQGGVDANPLKLI 221

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KHY  YD++NW+   R   + ++++QDL + Y   F     + KV SVMCSYN VNG
Sbjct: 222 ATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNG 281

Query: 121 KPTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+C++   L+  +   +    DGY+  DC +V  ++N   Y      AAAD+I+AG D+
Sbjct: 282 VPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDI 341

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     H + +     +  +D+   +       +R G FDG+ S  P+ N+   DV +
Sbjct: 342 DCGVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDGKTS--PYRNITWSDVVS 399

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRH-HTVAVIGPNSDVTVTMIGNYAGVACGY 297
              Q L+ +AA Q IVLLKN    LPL+T     T+A+IGP ++ T  M+GNY G A   
Sbjct: 400 TNAQNLSYEAAAQSIVLLKNDG-ILPLTTSSSTKTIALIGPWANATTQMLGNYYGPAPYL 458

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLI-GAAEVAARQADATVLVMGLDQSIEAEFID 356
            +PLQ        I          + +      A   A++AD  +   G+D ++E E  D
Sbjct: 459 ISPLQAFQDSEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQD 518

Query: 357 RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
           R+ +  P  Q  L++++A   + P++++ M GG VD S  KN+  + A++W GYPGQ+GG
Sbjct: 519 RSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSGG 577

Query: 417 AAIADVLFGRANP 429
            A+AD++ G+  P
Sbjct: 578 QALADIITGKRAP 590


>gi|292495281|sp|C0STH4.1|XYND_ASPAC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|225878711|dbj|BAH30675.1| beta-xylosidase [Aspergillus aculeatus]
          Length = 805

 Score =  281 bits (718), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 168/437 (38%), Positives = 241/437 (55%), Gaps = 11/437 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDP-VLTGKYAASYVRGLQGNTG-SRLKVA 60
           N G  GL  +SPN+N FR P WGRGQETPGED   LT  YA  Y+ G+QG      LK+A
Sbjct: 160 NAGRFGLDVYSPNINTFRHPVWGRGQETPGEDAYTLTAAYAYEYITGIQGGVNPEHLKLA 219

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KH+  YD++NW+   R   +  +++QDL + Y   F     +  V S MCSYN VNG
Sbjct: 220 ATAKHFAGYDIENWDNHSRLGNDVNITQQDLAEYYTPQFLVAARDAHVHSFMCSYNAVNG 279

Query: 121 KPTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+C++   L+  +   +     GY+  DC +V  ++N   Y      AAADAI AG D+
Sbjct: 280 VPSCSNTFFLQTLLRDTFSFVDHGYVSGDCGAVYGVFNPHGYAANEPSAAADAILAGTDI 339

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDG-EPSAQPFGNLGPRDVC 237
           DCG     H   ++  G +  +D+           + LG FDG   S+ P+ +LG  DV 
Sbjct: 340 DCGTSYQYHFNESITTGAVARDDIERGFIRLYANLVELGYFDGNSSSSNPYRSLGWPDVQ 399

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTL---RHHTVAVIGPNSDVTVTMIGNYAGVA 294
                 ++ +AA +GIVLLKN   TLPL++    ++ ++A+IGP ++ T  + GNY G A
Sbjct: 400 KTDAWNISYEAAVEGIVLLKNDG-TLPLASPSEGKNKSIALIGPWANATTQLQGNYYGDA 458

Query: 295 CGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEF 354
               +P+   +    T+H A    ++ N      AA  AAR AD  V + G+D +IEAE 
Sbjct: 459 PYLISPVDAFTAAGYTVHYAPGTEISTNSTANFSAALSAARAADTIVFLGGIDNTIEAEA 518

Query: 355 IDRAGLLLPGRQQELVSRVA--KASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
            DR+ +  PG Q EL+S++A  K+   P+V+  M GG VD S  K + ++ A+LW GYPG
Sbjct: 519 QDRSSIAWPGNQLELISQLAAQKSDDQPLVVYQMGGGQVDSSSLKFNAKVNALLWGGYPG 578

Query: 413 QAGGAAIADVLFGRANP 429
           Q+GG A+ D+L G   P
Sbjct: 579 QSGGLALRDILTGARAP 595


>gi|373952439|ref|ZP_09612399.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373889039|gb|EHQ24936.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 721

 Score =  280 bits (717), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 162/446 (36%), Positives = 237/446 (53%), Gaps = 46/446 (10%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT K   +YV GLQG     LK +A  KH+ 
Sbjct: 135 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTSKMGNAYVHGLQGTDPLHLKTSATAKHFV 194

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           A+        +R +F+A V ++DL DTY   FK+ +V+G V S+M +YN+VNG P   + 
Sbjct: 195 AHSGPEG---ERDYFDALVDEKDLRDTYLYAFKS-LVDGGVESIMTAYNRVNGVPNSINK 250

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            ++ + +  +W   G++V+DC ++  +Y T        E AA AIKAG+DLDC       
Sbjct: 251 TLVNDIVIKEWGFKGHVVTDCGALDDVYKTHKVLPNRMEVAAAAIKAGVDLDCSSIFQTD 310

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              A+   LL E+ V+ ALA  ++ Q +LG FD  PS+ PF + G   +   +H  LA Q
Sbjct: 311 IINAINNKLLTEKQVDAALAAVLSTQFKLGFFDA-PSSSPFYSFGADSIHNDSHVMLARQ 369

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS-- 305
            A + +VLLKN  + LPL    + ++ V+GPN+     ++ +Y GV+      ++GI+  
Sbjct: 370 MAQKSMVLLKNDKQILPLKMQNYSSIMVVGPNAASLDALVASYHGVSSKAVNFVEGITAA 429

Query: 306 -------------RYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
                         Y  T H  G +G               A  AD TV V+GL   +E 
Sbjct: 430 VDKGTRVEYDLGADYRDTTHFGGIWG---------------AGNADVTVAVIGLTPVLEG 474

Query: 353 E----FI-----DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIG 403
           E    F+     D+  L LP      +  + K+ + P++ V+  G  VD+  A   P   
Sbjct: 475 EAGDAFLSQTGGDKKDLSLPAGDIAFMKALRKSVKKPIIAVVTSGSDVDI--AAIAPYAD 532

Query: 404 AILWVGYPGQAGGAAIADVLFGRANP 429
           A++   YPG+ GG A+AD+LFG+ +P
Sbjct: 533 AVILAWYPGEQGGNALADILFGKISP 558


>gi|255957137|ref|XP_002569321.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211591032|emb|CAP97251.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 791

 Score =  280 bits (717), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 166/430 (38%), Positives = 238/430 (55%), Gaps = 8/430 (1%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR-LKVAA 61
           NGG  GL  +SPN+N FR P WGRGQETPGED  L   Y   Y+ GLQG    + LK+AA
Sbjct: 160 NGGRYGLDVYSPNINSFRHPVWGRGQETPGEDIQLCSVYGLEYITGLQGGLDPKELKLAA 219

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
             KH+  YD++NW    R   +  +S  D    Y   F   V + +V SVM SYN VNG 
Sbjct: 220 TAKHFAGYDIENWGNHSRLGNDMSISAFDFASYYAPQFVTAVRDARVHSVMASYNAVNGV 279

Query: 122 PTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           P  A+  +L+  +   W    DGY+ SDCDSV  ++N   Y  +   AAA +I+AG D+D
Sbjct: 280 PASANSFLLQTLLRDTWNFVEDGYVSSDCDSVYNVFNPHGYASSASLAAAKSIQAGTDID 339

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG    ++   +   G +   ++  A     +  + LG FDG+ S   + +L   DV   
Sbjct: 340 CGATYQLYLNQSFTQGEISRSEIERAATRFYSNLVSLGYFDGDNSK--YRDLDWSDVVAT 397

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
               ++ +AA +GIVLLKN   TLPLS    H+VA+IGP ++VT TM GNY G A   T 
Sbjct: 398 DAWNISYEAAVEGIVLLKNDG-TLPLSK-DTHSVALIGPWANVTTTMQGNYYGAAPYLTG 455

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           PL  +      ++ A    ++        AA  AAR++D  +   G+D S+EAE +DR  
Sbjct: 456 PLAALQASDLDVNYAFGTNISSETTSGFEAALSAARKSDVVIFAGGIDNSVEAEGVDRET 515

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           +  PG Q +L+ ++++  + P+V++ M GG VD S  K +  + +++W GYPGQ+GG AI
Sbjct: 516 ITWPGNQLQLIEQLSELGK-PLVVLQMGGGQVDSSSLKANKNVNSLVWGGYPGQSGGPAI 574

Query: 420 ADVLFGRANP 429
            D+L G+  P
Sbjct: 575 LDILTGKRAP 584


>gi|365135698|ref|ZP_09343911.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
           4_3_54A2FAA]
 gi|363612160|gb|EHL63713.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
           4_3_54A2FAA]
          Length = 643

 Score =  280 bits (716), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 172/443 (38%), Positives = 247/443 (55%), Gaps = 31/443 (6%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ-GNTGSRLKVAAC 62
           G   GLT+WSPN+N+FRDPRWGRG ET GEDP+LTG+   +++RGLQ G      K+ A 
Sbjct: 98  GIYQGLTFWSPNINLFRDPRWGRGHETYGEDPLLTGRMGTAFIRGLQEGEDSQYRKLDAT 157

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
            KH+ A+         R+ FNA VS +D+ D+Y   F+ C+   K A+VM +YN++NG+P
Sbjct: 158 VKHFAAHSGPE---AGRHSFNAEVSAEDMADSYLWAFRYCIEHAKPAAVMGAYNRINGEP 214

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP 182
            CA    LK  ++ +W+ DGY+VSDC ++  +    H T+  +E+AA A+  G  L+CG 
Sbjct: 215 ACASSTYLKGVLYEEWKFDGYVVSDCGAIQDINENHHVTKNEKESAALAVNNGCQLNCGK 274

Query: 183 FLAIH-TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAH 241
             A H  + AV  GL+ E+ V  A+      + RLGMFD   S   + ++    +    H
Sbjct: 275 --AYHWVKAAVEDGLISEDTVTCAVERLFEARFRLGMFD---SDCVYDSIPMNVIECRKH 329

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
           ++L  + A + IVLLKN+   LPL+  +  T+AVIGPN+D    ++GNY G    +TT L
Sbjct: 330 RELNRKMAQESIVLLKNNG-ILPLNPEK--TIAVIGPNADDKTVLLGNYNGTPSHWTTLL 386

Query: 302 QGISRYAK--TIHQAGCFGVACN----GNQLIGAAEVAARQADATVLVMGLDQSIE---- 351
           +GI   A+    +  G   V         + +  A   A+ AD  VL +GL   +E    
Sbjct: 387 RGIQDQARGEVYYARGSVLVEKEALPWAEKPLHEAIYTAKAADVVVLCLGLSPLLEGEEG 446

Query: 352 -----AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
                A+  DR  + LP  QQ+L+  +    + PVVLV + GG VD+  A  D R  AIL
Sbjct: 447 DAYNGADSGDRKDISLPDIQQQLLCAILDTEK-PVVLVNVSGGCVDLRQA--DERCAAIL 503

Query: 407 WVGYPGQAGGAAIADVLFGRANP 429
              YPG  GG A+AD+LFGR +P
Sbjct: 504 QCFYPGAEGGNALADILFGRVSP 526


>gi|323447708|gb|EGB03620.1| hypothetical protein AURANDRAFT_72703 [Aureococcus anophagefferens]
          Length = 744

 Score =  279 bits (714), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 169/443 (38%), Positives = 234/443 (52%), Gaps = 65/443 (14%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG--NTGSRLK 58
           + N G A  TYW+P VN+ R+PRWGR  E PGEDP LTG+YA  +V G Q        L+
Sbjct: 129 LMNAGAAYSTYWAPVVNLAREPRWGRNIEVPGEDPYLTGEYATEFVGGFQAAPEDPYHLQ 188

Query: 59  VAACCKHYTAYDLDN--------WNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVAS 110
            +ACCKHY A +L+N        W   DR H ++ V+++DL D+Y VPF+ACV +GKV+S
Sbjct: 189 ASACCKHYVANELENTRQPDGEQW---DRQHVDSNVTQRDLVDSYMVPFQACVEKGKVSS 245

Query: 111 VMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAAD 170
           +MCSYN VNG P+CA+  +L+      W  DGYI SDCD+   +Y+  HY  TPEEA AD
Sbjct: 246 LMCSYNAVNGVPSCANDWLLRTVARDAWHFDGYITSDCDADSNVYDAHHYAATPEEAVAD 305

Query: 171 AIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFD-GEPSAQPFG 229
            +KAG D+DC  F+  H   A+  GL+ E D++  L     V++RLG FD    +A+P G
Sbjct: 306 VLKAGTDVDCQSFVGQHARSALDKGLITEADMDARLVNLFKVRLRLGHFDLSFDAAKPRG 365

Query: 230 NLGPRD----VCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVT 285
            L   D    VC+ AH   +++   Q   LLKN    LPL      T AV+GPN+ ++  
Sbjct: 366 PLDEIDADAVVCSDAHLDASMEGLAQSATLLKNDG-ALPLKP--SGTAAVVGPNALLS-- 420

Query: 286 MIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG 345
                     GY  P                                    ADA VL +G
Sbjct: 421 ------KADAGYYGPTDA---------------------------------ADAVVLAVG 441

Query: 346 LDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVS--FAKNDPRIG 403
            D +  AE  D   ++    Q EL+  VA AS  PVV+V+    P+D++   A++D ++G
Sbjct: 442 TDLTWAAEGKDATSIVFTAAQLELIDAVATASATPVVVVVFSATPLDLTPLLARSDGKVG 501

Query: 404 AILWVGYPGQAGGAAIADVLFGR 426
           A++ VG P       + D+L+GR
Sbjct: 502 AVVHVGQP-SVTVKGLGDLLYGR 523


>gi|319641744|ref|ZP_07996426.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
 gi|317386631|gb|EFV67528.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
          Length = 702

 Score =  279 bits (714), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 172/445 (38%), Positives = 237/445 (53%), Gaps = 34/445 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYW+PN+NIFRDPRWGRGQET GEDP LT K  A+ VRGL+G     LK  AC KHY 
Sbjct: 99  GLTYWTPNINIFRDPRWGRGQETYGEDPYLTAKMGAAIVRGLEGEDPHYLKSVACAKHYA 158

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +    +N   R+ F+AR S  DL DTY   F+  V + KV  VMC+YN++NG+P C + 
Sbjct: 159 VHSGPEYN---RHSFDARPSVFDLWDTYMPAFRELVTKAKVHGVMCAYNRLNGQPCCGND 215

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEE--AAADAIKAGLDLDCGPFL 184
            +L + +  QW  DGY+ SDC     L +   + +T PE   A +DA+ AG DL+CG   
Sbjct: 216 PLLVDILRNQWHFDGYVTSDC---WALKDFAEFHKTHPEHTIAMSDALLAGTDLECGNLY 272

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQ-PFGNLGPRDVCTPAHQQ 243
            +  EG V+ GL  E D+N++L+   T+  ++GMFD  P+ + P+ ++G   +   AH+Q
Sbjct: 273 HLLAEG-VKKGLHSERDINVSLSRLFTILFKIGMFD--PAERVPYSSIGREVLECEAHKQ 329

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
            A + A + IVLL+N    LPL   +  ++A+IGPN+D   T + NY G      TP   
Sbjct: 330 HAERMAKESIVLLENKNHILPLDASKIKSIALIGPNADNGQTQLANYFGTPSEIVTPYMS 389

Query: 304 ISRY---AKTIHQAGCFGVACNGNQLIGAAEVA--ARQADATVLVMGLDQSIE------- 351
           + R       I+     G+           +VA  A Q+D  V V G+    E       
Sbjct: 390 LKRRLGDKIKINYLPGVGIVDKLKDAPSFVQVAHKAAQSDVIVFVSGISADYEGEAGDAG 449

Query: 352 ------AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAI 405
                     DR  + LP  Q EL+ ++ K  R P+++V M G    +SF        A+
Sbjct: 450 AAGYGGFASGDRTTMQLPLVQIELLKKLKKTGR-PLIIVNMSGSV--MSFEWESQNADAL 506

Query: 406 LWVGYPGQAGGAAIADVLFGRANPG 430
           L   Y GQA G AI DVLFG  NP 
Sbjct: 507 LQAWYGGQAAGDAIVDVLFGHCNPA 531


>gi|345519864|ref|ZP_08799275.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
 gi|254836262|gb|EET16571.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
          Length = 736

 Score =  279 bits (714), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 172/445 (38%), Positives = 237/445 (53%), Gaps = 34/445 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYW+PN+NIFRDPRWGRGQET GEDP LT K  A+ VRGL+G     LK  AC KHY 
Sbjct: 133 GLTYWTPNINIFRDPRWGRGQETYGEDPYLTAKMGAAIVRGLEGEDPHYLKSVACAKHYA 192

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +    +N   R+ F+AR S  DL DTY   F+  V + KV  VMC+YN++NG+P C + 
Sbjct: 193 VHSGPEYN---RHSFDARPSVFDLWDTYMPAFRELVTKAKVHGVMCAYNRLNGQPCCGND 249

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEE--AAADAIKAGLDLDCGPFL 184
            +L + +  QW  DGY+ SDC     L +   + +T PE   A +DA+ AG DL+CG   
Sbjct: 250 PLLVDILRNQWHFDGYVTSDC---WALKDFAEFHKTHPEHTIAMSDALLAGTDLECGNLY 306

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQ-PFGNLGPRDVCTPAHQQ 243
            +  EG V+ GL  E D+N++L+   T+  ++GMFD  P+ + P+ ++G   +   AH+Q
Sbjct: 307 HLLAEG-VKKGLHSERDINVSLSRLFTILFKIGMFD--PAERVPYSSIGREVLECEAHKQ 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
            A + A + IVLL+N    LPL   +  ++A+IGPN+D   T + NY G      TP   
Sbjct: 364 HAERMAKESIVLLENKNHILPLDASKIKSIALIGPNADNGQTQLANYFGTPSEIVTPYMS 423

Query: 304 ISRY---AKTIHQAGCFGVACNGNQLIGAAEVA--ARQADATVLVMGLDQSIE------- 351
           + R       I+     G+           +VA  A Q+D  V V G+    E       
Sbjct: 424 LKRRLGDKIKINYLPGVGIVDKLKDAPSFVQVAHKAAQSDVIVFVSGISADYEGEAGDAG 483

Query: 352 ------AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAI 405
                     DR  + LP  Q EL+ ++ K  R P+++V M G    +SF        A+
Sbjct: 484 AAGYGGFASGDRTTMQLPLVQIELLKKLKKTGR-PLIIVNMSGSV--MSFEWESQNADAL 540

Query: 406 LWVGYPGQAGGAAIADVLFGRANPG 430
           L   Y GQA G AI DVLFG  NP 
Sbjct: 541 LQAWYGGQAAGDAIVDVLFGHCNPA 565


>gi|354508473|gb|AER26905.1| beta-xylosidase 3 [synthetic construct]
          Length = 778

 Score =  279 bits (714), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 167/437 (38%), Positives = 240/437 (54%), Gaps = 10/437 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAA 61
           N G  GL  ++PN+N FR P WGRGQETPGED  L   YA  Y+ G+QG +  S LK+AA
Sbjct: 142 NAGRYGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPDSNLKLAA 201

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
             KHY  YD++NW+   R   +  +++QDL + Y   F     + KV SVMC+YN V+G 
Sbjct: 202 TAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVDGV 261

Query: 122 PTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           P CAD   L+  +   +     GY+ SDCD+   +YN   Y  +   AAA+AI AG D+D
Sbjct: 262 PACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDID 321

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSA--QPFGNLGPRDVC 237
           CG     H   ++  G L  +D+   +    T  ++ G FD   +    P+ +L   DV 
Sbjct: 322 CGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVL 381

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRH----HTVAVIGPNSDVTVTMIGNYAGV 293
                 ++ QAA QGIVLLKNS   LPL+   +     TVA+IGP ++ T  ++GNY G 
Sbjct: 382 ETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGN 441

Query: 294 ACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
           A    +P          ++ A   G++        AA  AAR AD  +   G+D ++EAE
Sbjct: 442 APYMISPRAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAE 501

Query: 354 FIDRAGLLLPGRQQELVSRVA-KASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
            +DR  +  PG Q +L+ ++A  A   P++++ M GG VD S  KN+  + A+LW GYPG
Sbjct: 502 ALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPG 561

Query: 413 QAGGAAIADVLFGRANP 429
           Q+GG A+ D++ G+ NP
Sbjct: 562 QSGGFALRDIITGKKNP 578


>gi|4235093|gb|AAD13106.1| beta-xylosidase [Aspergillus niger]
          Length = 804

 Score =  279 bits (714), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 167/437 (38%), Positives = 240/437 (54%), Gaps = 10/437 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAA 61
           N G  GL  ++PN+N FR P WGRGQETPGED  L   YA  Y+ G+QG +  S LK+AA
Sbjct: 168 NAGRYGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPDSNLKLAA 227

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
             KHY  YD++NW+   R   +  +++QDL + Y   F     + KV SVMC+YN V+G 
Sbjct: 228 TAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVDGV 287

Query: 122 PTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           P CAD   L+  +   +     GY+ SDCD+   +YN   Y  +   AAA+AI AG D+D
Sbjct: 288 PACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDID 347

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSA--QPFGNLGPRDVC 237
           CG     H   ++  G L  +D+   +    T  ++ G FD   +    P+ +L   DV 
Sbjct: 348 CGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVL 407

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRH----HTVAVIGPNSDVTVTMIGNYAGV 293
                 ++ QAA QGIVLLKNS   LPL+   +     TVA+IGP ++ T  ++GNY G 
Sbjct: 408 ETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGN 467

Query: 294 ACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
           A    +P          ++ A   G++        AA  AAR AD  +   G+D ++EAE
Sbjct: 468 APYMISPRAAFEEAGYKVNFAEGTGISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAE 527

Query: 354 FIDRAGLLLPGRQQELVSRVA-KASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
            +DR  +  PG Q +L+ ++A  A   P++++ M GG VD S  KN+  + A+LW GYPG
Sbjct: 528 ALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPG 587

Query: 413 QAGGAAIADVLFGRANP 429
           Q+GG A+ D++ G+ NP
Sbjct: 588 QSGGFALRDIITGKKNP 604


>gi|266619450|ref|ZP_06112385.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
 gi|288869013|gb|EFD01312.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
          Length = 714

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 172/443 (38%), Positives = 241/443 (54%), Gaps = 35/443 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           G+T+WSPN+NIFRDPRWGRG ET GEDP LTG+   +Y++G+QGN G RLK AAC KH+ 
Sbjct: 110 GITFWSPNINIFRDPRWGRGHETYGEDPCLTGRMGTAYIKGMQGN-GKRLKAAACVKHFA 168

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           A+         R+ FN+ VSK+DL +TY   F+ CV E  V  VM  YN++NG+  C   
Sbjct: 169 AHSGPEKG---RHSFNSVVSKKDLTETYFPAFERCVKEAGVEGVMGGYNRLNGEAACGSH 225

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            ++   +  +W  DGY VSDC ++   +     T TP+E+AA A+K+G DL+CG  + +H
Sbjct: 226 HLITEILREKWGFDGYYVSDCGAIKDFHMHHGLTDTPQESAALALKSGCDLNCGA-VYLH 284

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA-HQQLAL 246
              A   GL+  ED++ A+ + +  +MRLGMFD            P ++   A H  LAL
Sbjct: 285 VMSAYNQGLVSAEDIDRAVTHLMMTRMRLGMFDQHTEFDEI----PYEINDCAEHHGLAL 340

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS- 305
           +AA + +VLLKN    LPL      TVAVIGPN D    + GNY G A    T L+GI  
Sbjct: 341 KAAEESMVLLKNDG-ILPLDKTALKTVAVIGPNGDSEEILKGNYNGTATEKYTILEGIRA 399

Query: 306 ----------RYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEF- 354
                          +++     +A   ++L  A  +A R +D   L +GL+ ++E E  
Sbjct: 400 VLGKETRIFCSEGSHLYRDNVENLAEADDRLKEAVSMAVR-SDVVFLCLGLNGTLEGEEG 458

Query: 355 --------IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
                    D+A L LP  Q  L+  V   +  PV+L+L  G  + +++A       AIL
Sbjct: 459 DANNSYAGADKADLNLPESQMRLLKAVC-GTGTPVILLLAAGSAMAINYAAE--HCSAIL 515

Query: 407 WVGYPGQAGGAAIADVLFGRANP 429
            + YPGQ GG A A +L G A P
Sbjct: 516 HIWYPGQMGGLAAARLLTGEAVP 538


>gi|336471692|gb|EGO59853.1| hypothetical protein NEUTE1DRAFT_99999 [Neurospora tetrasperma FGSC
           2508]
 gi|350292807|gb|EGZ74002.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
          Length = 770

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 170/435 (39%), Positives = 240/435 (55%), Gaps = 13/435 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N G +G  YW+PNVN F+DPRWGRG ETPGED +   +YAAS +RGLQG    R +V A 
Sbjct: 145 NAGFSGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIRGLQGPARER-RVVAT 203

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKHY A D ++WNG  R+ FNA+V+ QDL + Y  PF+ C  + KV S+MCSYN VNG P
Sbjct: 204 CKHYAANDFEDWNGSTRHDFNAKVTLQDLAEYYLSPFQQCARDSKVGSIMCSYNAVNGVP 263

Query: 123 TCADPDILKNTI--HGQWRLDG-YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
            CA+  +++  +  H  W   G YI SDC++V  +    HY  T  E  A A +AG+D  
Sbjct: 264 ACANTYLMQTILREHWNWTAPGNYITSDCEAVLDISANHHYAETNAEGTALAFEAGIDSS 323

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           C    +    GA   GLL +  V+ AL       +R+G FDG  S   + +LG +DV +P
Sbjct: 324 CEYESSSDIPGAWTQGLLEQSTVDRALKRIYEGLVRVGYFDGNHSE--YASLGWKDVNSP 381

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLS--TLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
             Q++ALQAA +GIVLLKN  +TLPL   T     +A+IG  ++   T+ G Y+G     
Sbjct: 382 KSQEVALQAAVEGIVLLKND-KTLPLDLRTDPKSKLAMIGFWANDPKTLSGGYSGKPAFE 440

Query: 298 TTPLQGISRYAKTIHQAG--CFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI 355
            +P+        ++  AG      + + +    AA  AA+ A+  +   G D S   E  
Sbjct: 441 HSPVYAAQAMGFSVTTAGGPVLQNSTSNDTWTQAALEAAKDANYILYFGGQDTSAAGETK 500

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  +  P  Q +L++ ++K  + P+V+V M G  +D +       + AILW  + GQ G
Sbjct: 501 DRTTINWPEAQLQLITTLSKLGK-PLVVVQM-GDQLDNTPLLAAKAVNAILWANWLGQDG 558

Query: 416 GAAIADVLFGRANPG 430
           G A+  +L G  NP 
Sbjct: 559 GTAVMQILTGLKNPA 573


>gi|164429277|ref|XP_958209.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
 gi|16945419|emb|CAB91343.2| related to xylan 1, 4-beta-xylosidase [Neurospora crassa]
 gi|157073010|gb|EAA28973.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
          Length = 774

 Score =  278 bits (711), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 168/434 (38%), Positives = 239/434 (55%), Gaps = 13/434 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N G +G  YW+PNVN F+DPRWGRG ETPGED +   +YAAS +RGLQG    R +V A 
Sbjct: 145 NAGFSGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIRGLQGPLPER-RVVAT 203

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKHY A D ++WNG  R+ F+A+V+ QDL + Y  PF+ C  + KV S+MCSYN VNG P
Sbjct: 204 CKHYAANDFEDWNGSTRHDFDAKVTLQDLAEYYLSPFQQCARDSKVGSIMCSYNAVNGVP 263

Query: 123 TCADPDILKNTI--HGQWRLDG-YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
            CA+  +++  +  H  W   G YI SDC++V  ++   HY +T  E  A A +AG D  
Sbjct: 264 ACANTYLMQTILREHWNWTAPGNYITSDCEAVLDIFANHHYAKTNAEGTALAFEAGTDSS 323

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           C    +    GA   GLL +  V+ AL       +R+G FDG  S   + +LG +DV +P
Sbjct: 324 CEYESSSDIPGAWTQGLLEQSTVDRALTRLYEGLVRVGYFDGNHSE--YASLGWKDVNSP 381

Query: 240 AHQQLALQAAHQGIVLLKNSARTLP--LSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
             Q++ALQ A +GIVLLKN  +TLP  L T     +A+IG  ++   T+ G Y+G     
Sbjct: 382 KSQEVALQTAVEGIVLLKND-QTLPLGLKTDPKSKLAMIGFWANDPKTLSGGYSGKPAFE 440

Query: 298 TTPLQGISRYAKTIHQAG--CFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI 355
            +P+         +  AG      + + +    AA  AA+ A+  +   GLD S   E  
Sbjct: 441 HSPVYAAEAMGFNVTTAGGPVLQNSTSNDTWTQAALEAAQDANYILYFGGLDTSAAGETK 500

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  +  P  Q +L+  + K  + P+V+V M G  +D +       + +ILW  +PGQ G
Sbjct: 501 DRTTINWPEAQLQLIKTLTKLGK-PLVVVQM-GDQLDNTPLLATKTVNSILWANWPGQDG 558

Query: 416 GAAIADVLFGRANP 429
           G A+  +L G  +P
Sbjct: 559 GTAVMQILTGLKSP 572


>gi|340519849|gb|EGR50086.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
          Length = 796

 Score =  278 bits (711), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 174/432 (40%), Positives = 245/432 (56%), Gaps = 18/432 (4%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAA 61
           N G AGL +W+PN+N FRDPRWGRG ETPGED      Y  + V GLQG       +  +
Sbjct: 179 NYGFAGLDFWTPNINPFRDPRWGRGMETPGEDAFRIQGYVLALVDGLQGGIDPDFYRTLS 238

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+ AYD++N     R   N   ++QD+ D Y   F+ CV + KVAS+MC+YN V+G 
Sbjct: 239 TCKHFAAYDIEN----GRTANNLSPTQQDMADYYLPMFETCVRDAKVASIMCAYNAVDGV 294

Query: 122 PTCADPDILKNTIHGQWRLD---GYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           P CAD  +L++ +   +       Y+VSDCD+V  +++  HY     +AAA +I AG DL
Sbjct: 295 PACADSYLLQDVLRDTYGFTEDFNYVVSDCDAVENVFDPHHYAANLTQAAAMSINAGTDL 354

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG    +    +V+ GL  E  ++ +L    +  +++G FD +P+   + +LG  +V T
Sbjct: 355 DCGSSYNV-LNASVQAGLTTEATLDKSLIRLYSALVKVGYFD-QPAE--YNSLGWGNVNT 410

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
              Q LA  AA +G+ LLKN   TLPLS TL +  VAVIGP ++VT  M GNYAG A   
Sbjct: 411 TQSQALAHDAATEGMTLLKNDG-TLPLSRTLSN--VAVIGPWANVTTQMQGNYAGTAPLL 467

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
             PL    +  + +  A    +         AA  AA  +D  V + G+D S+E E  DR
Sbjct: 468 VNPLSVFQQKWRNVKYAQGTAINSQDTSGFNAALSAASSSDVIVYLGGIDISVENEGFDR 527

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
           + +  PG Q  L+S++A   + P+V+V   GG +D S   ++ ++ +ILW GYPGQ GG 
Sbjct: 528 SSITWPGNQLNLISQLANLGK-PLVIVQFGGGQIDDSALLSNSKVNSILWAGYPGQDGGN 586

Query: 418 AIADVLFGRANP 429
           AI DVL G ANP
Sbjct: 587 AIFDVLTG-ANP 597


>gi|372208556|ref|ZP_09496358.1| beta-glucosidase [Flavobacteriaceae bacterium S85]
          Length = 729

 Score =  278 bits (710), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 169/442 (38%), Positives = 238/442 (53%), Gaps = 25/442 (5%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G  AGLT+W+PNVNIFRDPRWGRGQET GEDP LT +   ++V+GLQGN    LK AAC 
Sbjct: 127 GQYAGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGVAFVKGLQGNHPKYLKSAACA 186

Query: 64  KHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
           KH+  +     +G +  R+HFNA  SK+DL +TY   F+A V +  V  VM +YN V G 
Sbjct: 187 KHFAVH-----SGPEELRHHFNANPSKKDLYETYLPAFEALVKQANVEGVMSAYNAVYGV 241

Query: 122 PTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG 181
           P  +   +LK T+   W  DGYIVSDC ++G ++      +T  EAAA A+KAG++L+CG
Sbjct: 242 PAGSSEFLLKETLRKSWGFDGYIVSDCGALGDIFKGHKQVKTMPEAAAVALKAGVNLNCG 301

Query: 182 PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAH 241
                  E AV+ GL+ EE ++  L   +  + +LG FD +  A P+  +    + +  H
Sbjct: 302 YVYNGALEKAVQQGLVSEELIDTRLKQLLKTRFKLGFFDPK-EANPYNAIPTSVIHSDDH 360

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
             LA + A + IVLLKN   TLPL         V GP +  +  ++ NY G+     + L
Sbjct: 361 IALARKTAQKSIVLLKNKNHTLPLDK-NIKVPYVTGPFASSSDVLLANYYGMTTNLVSVL 419

Query: 302 QGI----SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI-- 355
           +GI    S      ++ G      N N    A  V A+ ADA + V+GL    E E +  
Sbjct: 420 EGIADKVSLGTSLNYRMGALPFNKNLNPKNWAPNV-AKTADAVIAVVGLSADFEGEEVDA 478

Query: 356 -------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWV 408
                  D+  L LP  Q + V  +A   +GP++LV+  G  V +    +      ++W 
Sbjct: 479 IASPNKGDKKDLKLPQNQIDYVKEMAAKKKGPLILVVASGSAVALGELYDLADAIVLMW- 537

Query: 409 GYPGQAGGAAIADVLFGRANPG 430
            YPG+ GG A+ADVLFG  +P 
Sbjct: 538 -YPGEQGGNAVADVLFGDVSPS 558


>gi|333494646|gb|AEF56854.1| putative glycosyl hydrolase [synthetic construct]
          Length = 743

 Score =  277 bits (709), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 181/442 (40%), Positives = 250/442 (56%), Gaps = 31/442 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSP +NI RDPRWGRGQET GEDP LT + A S++RG+QG  G  LK AAC KH+ 
Sbjct: 135 GLTFWSPTINIDRDPRWGRGQETYGEDPYLTSRLAVSFIRGIQGR-GRYLKAAACAKHFA 193

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +        +R+ FNA VS++DL +TY   F+A V E KVA VM +YN+VNG+P C   
Sbjct: 194 VHSGPE---SERHQFNAEVSQKDLWETYLPAFEASVKEAKVAGVMGAYNRVNGEPCCGSG 250

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L + + G+W   GY+ SDC ++  +      T+T EE++A A+K+G DL+CG   A  
Sbjct: 251 TLLGDVLRGEWEFGGYVTSDCWAIKDINEGHGVTKTIEESSALAVKSGCDLNCGCAYASL 310

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
            + A R GL+ E++++ A+   +  +MRLGMFD  P   P+ ++         H+  AL+
Sbjct: 311 VK-AYRAGLIGEKEIDTAVHRLMLTRMRLGMFDA-PEKVPYSSIPYEKNDCAEHRAFALE 368

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISR- 306
            A + +VLL+N +  LPL   R  +VAVIGPN+D  V + GNY G A  Y T L GI   
Sbjct: 369 VAEKSLVLLRNRSGFLPLDRSRIRSVAVIGPNADSRVALEGNYNGTASEYVTVLDGIREA 428

Query: 307 --------YAKTIHQ-AGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE---- 353
                   YA+  H      G     N  +  A  AA +AD  V+ +GL++ IE E    
Sbjct: 429 VGDRARVYYAEGSHLFRNSMGGLSQKNDRLAEAAAAAERADVAVVCLGLNRDIEGEEGDP 488

Query: 354 -----FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWV 408
                  D+  L LPG Q+EL+  V KA+  PVVLVL+ G  + V++A  D    A++  
Sbjct: 489 SNEYPAGDKRDLRLPGLQEELLETV-KATGTPVVLVLLSGSALAVNWA--DENADAVVQA 545

Query: 409 GYPG-QAGGAAIADVLFGRANP 429
            YPG QA G   A  LFG   P
Sbjct: 546 WYPGAQAEGRRGA--LFGIIRP 565


>gi|225878709|dbj|BAH30674.1| beta-xylosidase [Aspergillus aculeatus]
          Length = 785

 Score =  277 bits (709), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 170/431 (39%), Positives = 241/431 (55%), Gaps = 11/431 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDP-VLTGKYAASYVRGLQGNTGSRLKVAA 61
           N G  GL  +SPN+N FR P WGRGQETPGED   L   YA  Y+ GLQGN  +  K+ A
Sbjct: 161 NAGRYGLNAFSPNINAFRHPVWGRGQETPGEDANCLCSAYAYEYITGLQGN-ATNPKIIA 219

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
             KHY  YD++NW    R+  +  +++QDL + +   F   V + +V SVM SYN VNG 
Sbjct: 220 TAKHYAGYDIENWRQRSRFGNDLNITQQDLAEYFTPQFVVAVRDAQVRSVMPSYNAVNGV 279

Query: 122 PTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           P+ A+  +L+  +   W    DGY+ SDCD+V  ++N   Y      A+A +++AG D+D
Sbjct: 280 PSSANTFLLQTLVRDSWGFIQDGYMASDCDAVYNVFNPHGYAANLSSASAMSLRAGTDID 339

Query: 180 CG-PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           CG  +L    E   +G + R E +  A+    +  +  G FDG P A P+ +L   DV  
Sbjct: 340 CGISYLTTLNESLTQGQISRSE-IERAVTRFYSNLVSAGYFDG-PDA-PYRDLSWSDVVR 396

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
                +A +AA  G+VLLKN    LPLS      VA+IGP ++ T  M GNY GVA   T
Sbjct: 397 TNRWNVAYEAAVAGVVLLKNDG-VLPLSKSVQR-VALIGPWANATEQMQGNYHGVAPYLT 454

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           +PL  +      ++ A    +  N      AA  AA ++D  +   G+D ++EAE +DRA
Sbjct: 455 SPLAAVQASGLEVNYAFGTNITSNVTNCFAAALAAAEKSDIIIFAGGIDNTLEAEELDRA 514

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            +  PG Q EL+ R+ +  + P+V++ M GG VD S  K   ++GA+LW GYPGQAGG A
Sbjct: 515 NITWPGNQLELIHRLGELGK-PLVVLQMGGGQVDSSALKASEKVGALLWGGYPGQAGGQA 573

Query: 419 IADVLFGRANP 429
           + D+L G+  P
Sbjct: 574 LWDILTGQRAP 584


>gi|315607899|ref|ZP_07882892.1| beta-glucosidase [Prevotella buccae ATCC 33574]
 gi|315250368|gb|EFU30364.1| beta-glucosidase [Prevotella buccae ATCC 33574]
          Length = 721

 Score =  276 bits (707), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 170/435 (39%), Positives = 248/435 (57%), Gaps = 23/435 (5%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT+WSPNVNIFRD RWGRG ET GEDP+L+G    +YVRGLQG+    LK  AC KHY
Sbjct: 127 AGLTFWSPNVNIFRDLRWGRGMETYGEDPLLSGMLGTAYVRGLQGDDAFYLKTGACAKHY 186

Query: 67  TAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
             +         R+  +   S++DL +TY   FK  V +G+V +VM +YN+V G+P    
Sbjct: 187 AVHSGPEGT---RHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGS 243

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PFLA 185
             +L + +   W  +G+IVSDCD++   Y    Y +TPEEA A AIKAGL+++CG  F A
Sbjct: 244 KYLLTDILRKSWGFNGHIVSDCDAINDFYGGHRYVKTPEEACAAAIKAGLNVECGHTFKA 303

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           +  +GA+  GLL E D++ AL   +  +++LG+ + + SA P+ +    ++C+PAH  LA
Sbjct: 304 M--QGALDQGLLAEADLDRALFPLVMTRLKLGILEPD-SACPYNSYDESEICSPAHTALA 360

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI- 304
           L+AA + +VLLKN+   LPL      T+ V GP +     ++GNY G++  Y+T LQGI 
Sbjct: 361 LRAADEAMVLLKNNG-ILPLDK-NIRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIV 418

Query: 305 SRYAK--TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE--------- 353
           SR +   +++    F         +  A   A  A+  ++VMG + ++E E         
Sbjct: 419 SRVSSGTSVNFRPAFMQITEELNDMNWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASAS 478

Query: 354 FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 413
             DR G+ LP  Q   + RV     G +V+VL  G P+D+   +      A++   YPGQ
Sbjct: 479 RGDRVGIGLPASQLNYLRRVKARKGGRIVVVLTGGSPIDLR--EISKLADAVVMAWYPGQ 536

Query: 414 AGGAAIADVLFGRAN 428
            GG A+ D+LFG  N
Sbjct: 537 EGGEALGDLLFGDKN 551


>gi|374372635|ref|ZP_09630297.1| Beta-glucosidase [Niabella soli DSM 19437]
 gi|373235166|gb|EHP54957.1| Beta-glucosidase [Niabella soli DSM 19437]
          Length = 734

 Score =  276 bits (707), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 166/443 (37%), Positives = 231/443 (52%), Gaps = 33/443 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYW+PN+NIFRDPRWGRGQET GEDP LT     ++VRGLQG     LK AAC KHY 
Sbjct: 135 GLTYWTPNINIFRDPRWGRGQETYGEDPFLTAMLGRAFVRGLQGEDPKYLKAAACAKHYA 194

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+  VS  DL +TY   FK  V   KVA VMC+YN    KP C 
Sbjct: 195 IH-----SGPEAVRHSFDVDVSDYDLWNTYLPAFKELVTHAKVAGVMCAYNAFRKKPCCG 249

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
              ++ + +  QW   GY+ SDC ++   +N        E AA DA+  G D++CG    
Sbjct: 250 SDLLMTDILRRQWGFTGYVTSDCGAIDDFFNYHKTHPNAEAAAIDAVTNGTDVECGNRAY 309

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           +    AV+ G + E++++ ++     ++MRLGMFD   S   +    P  + +  H+  A
Sbjct: 310 LTLTDAVKTGRIAEKEIDRSVKRLFMIRMRLGMFD-PVSMVSYAQTSPAVLESAPHKAQA 368

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           L+ A + IVLLKN    LPLS      +AV+GPN+D ++ ++GNY G      T L GI 
Sbjct: 369 LKMAQESIVLLKNENHLLPLSK-SIKKIAVVGPNADNSIAVLGNYNGTPSKIVTALDGIK 427

Query: 306 R---------YAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI- 355
                     Y K ++      +   G     A     + ADA + V G+   +E E + 
Sbjct: 428 AKLGTNGSVVYEKAVNFTNA--MLPEGKTDFAALTSRVKDADAIIFVGGISPQLEGEEMK 485

Query: 356 ---------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
                    DR  +LLP  Q E + +  KA+  PVV V+M G  + + + + +  I AI+
Sbjct: 486 VNEPGFNSGDRTTILLPTVQTEAM-KALKATGKPVVFVMMTGSALAIPWEQEN--IPAIV 542

Query: 407 WVGYPGQAGGAAIADVLFGRANP 429
              Y GQA G AIADVLFG  NP
Sbjct: 543 NAWYGGQAAGTAIADVLFGDYNP 565


>gi|242771939|ref|XP_002477942.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
 gi|218721561|gb|EED20979.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
          Length = 797

 Score =  276 bits (707), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 169/433 (39%), Positives = 242/433 (55%), Gaps = 13/433 (3%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDP-VLTGKYAASYVRGLQGNTG-SRLKVA 60
           N G  GL  ++PN+N FR P WGRGQETPGED   L+  YA  Y+ GLQG      LK+ 
Sbjct: 164 NVGRYGLDAYAPNINGFRSPLWGRGQETPGEDAGFLSSSYAYEYITGLQGGVDPEHLKIV 223

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KH+  YDL+NWN   R  F+A +++QDL + Y   F A     K  S MCSYN VNG
Sbjct: 224 ATAKHFAGYDLENWNNNSRLGFDASITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNG 283

Query: 121 KPTCADPDILKNTIHGQWRLD--GYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+C+   +L+  +   W     GY+ SDCD+   ++N   Y      AAAD+++AG D+
Sbjct: 284 VPSCSSSFLLQTLLRENWDFPDYGYVSSDCDAAYNVFNPHGYAINISAAAADSLRAGTDI 343

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     +   +   G +   ++  +L    +  ++LG FDG  S   +  LG  DV  
Sbjct: 344 DCGQTYPWYLNQSFIEGSVTRGEIERSLIRLYSNLVKLGYFDGNQSE--YRQLGWNDVVA 401

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
                ++ +AA +GIVLLKN    LPLS  +  +VAVIGP ++ T  + GNY G A    
Sbjct: 402 TDAWNISYEAAVEGIVLLKNDG-VLPLSE-KLKSVAVIGPWANATQQLQGNYFGPAPYLI 459

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIG--AAEVAARQADATVLVMGLDQSIEAEFID 356
           TPLQ        ++ A  FG    GN   G  AA  AA+++D  + + G+D +IEAE  D
Sbjct: 460 TPLQAARDAGYKVNYA--FGTNILGNTTDGFAAALSAAKKSDVIIYLGGIDNTIEAEGTD 517

Query: 357 RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
           R  +  PG Q +L+ ++++  + P+V++ M GG VD S  K++  + A++W GYPGQ+GG
Sbjct: 518 RMNVTWPGNQLDLIQQLSQTGK-PLVVLQMGGGQVDSSSLKSNNNVNALVWGGYPGQSGG 576

Query: 417 AAIADVLFGRANP 429
            AI D+L G+  P
Sbjct: 577 KAIFDILSGKRAP 589


>gi|329745495|gb|AEB98984.1| xylosidase precursor [synthetic construct]
          Length = 804

 Score =  276 bits (707), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 166/437 (37%), Positives = 240/437 (54%), Gaps = 10/437 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAA 61
           N G  GL  ++PN+N FR P  GRGQETPGED  L   YA  Y+ G+QG +  S LK+AA
Sbjct: 168 NAGRYGLDVYAPNINTFRHPVRGRGQETPGEDVSLAAVYAYEYITGIQGPDPDSNLKLAA 227

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
             KHY  YD++NW+   R   +  +++QDL + Y   F     + KV SVMC+YN VNG 
Sbjct: 228 TAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGV 287

Query: 122 PTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           P CAD   L+  +   +     GY+ SDCD+   +YN   Y  +   AAA+AI AG D+D
Sbjct: 288 PACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDID 347

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSA--QPFGNLGPRDVC 237
           CG     H   ++  G L  +D+   +    T  ++ G FD   +    P+ +L   DV 
Sbjct: 348 CGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVL 407

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRH----HTVAVIGPNSDVTVTMIGNYAGV 293
                 ++ QAA QGIVLLKNS + LPL+   +     TVA+IGP ++ T  ++GNY G 
Sbjct: 408 ETDAWNISYQAATQGIVLLKNSNKVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGN 467

Query: 294 ACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
           A    +P          ++ A   G++        AA  AA+ AD  +   G+D ++EAE
Sbjct: 468 APYMISPRVAFEEAGYNVNFAERTGISSTNTSGFAAALSAAQSADVIIYAGGIDNTLEAE 527

Query: 354 FIDRAGLLLPGRQQELVSRVA-KASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
            +DR  +  PG Q +L+ ++A  A   P++++ M GG VD S  KN+  + A+LW GYPG
Sbjct: 528 ALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPG 587

Query: 413 QAGGAAIADVLFGRANP 429
           Q+GG A+ D++ G+ NP
Sbjct: 588 QSGGFALRDIITGKKNP 604


>gi|425780840|gb|EKV18836.1| Beta-xylosidase XylA [Penicillium digitatum PHI26]
 gi|425783077|gb|EKV20946.1| Beta-xylosidase XylA [Penicillium digitatum Pd1]
          Length = 792

 Score =  276 bits (706), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 162/430 (37%), Positives = 240/430 (55%), Gaps = 8/430 (1%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR-LKVAA 61
           NGG  GL  ++PN+N FR P WGRGQETPGED  L   Y   Y+ G+QG    R LK+AA
Sbjct: 161 NGGRYGLDVYAPNINSFRHPVWGRGQETPGEDVQLCSIYGVEYITGIQGGLNPRDLKLAA 220

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
             KH+  YDL+NW    R   N  +S  DL   Y   F   V + +V SVM SYN VNG 
Sbjct: 221 TAKHFAGYDLENWGNHSRLGNNVAISSFDLASYYTPQFITAVRDARVHSVMSSYNAVNGV 280

Query: 122 PTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           P+ A+  +L+  +   W    DGY+ SDCD+V  ++N   Y  +   AAA +I+AG D+D
Sbjct: 281 PSSANSFLLQTLLRETWNFVEDGYVSSDCDAVFNVFNPHGYASSASLAAAKSIQAGTDID 340

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG    ++   ++    +   ++  A+    +  + LG FDG+ S   + +L   DV   
Sbjct: 341 CGATYQLYLNESLSHDEISRSEIERAVTRFYSTLVSLGYFDGDNSK--YRHLHWPDVVAT 398

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
               ++ +AA +GIVLLKN   TLPLS     +VA+IGP ++VT T+ GNY G A   T 
Sbjct: 399 DAWNISYEAAVEGIVLLKNDG-TLPLSN-NTRSVALIGPWANVTTTLQGNYYGAAPYLTG 456

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           PL  +      ++ A    ++ +      AA  AA +++  +   G+D ++EAE +DR  
Sbjct: 457 PLAALQASNLDVNYAFGTNISSDSTSGFEAALSAAGKSEVIIFAGGIDNTVEAEGVDRES 516

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           +  PG Q +L+ +++K  + P+V++ M GG VD S  K +  + +++W GYPGQ+GG AI
Sbjct: 517 ITWPGNQLQLIEQLSKLGK-PLVVLQMGGGQVDSSSLKANKNVNSLVWGGYPGQSGGPAI 575

Query: 420 ADVLFGRANP 429
            D+L G+  P
Sbjct: 576 LDILTGKRAP 585


>gi|238578959|ref|XP_002388893.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
 gi|215450599|gb|EEB89823.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
          Length = 658

 Score =  276 bits (706), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 167/428 (39%), Positives = 240/428 (56%), Gaps = 25/428 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG-SRLKVAACCKHY 66
           GL +++PN+N F+DPRWGRGQETPGEDP    +Y    V GLQG  G + LK+AA CKH+
Sbjct: 52  GLDFFTPNINPFKDPRWGRGQETPGEDPFHISQYVYQLVTGLQGGVGPTNLKIAADCKHW 111

Query: 67  TAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
            AYDL+N  GV R+ F+A+V+ QDL + Y+  F++C+ + KVAS+MCSYN VNG P+CA+
Sbjct: 112 AAYDLENL-GVSRFEFDAKVTMQDLAEFYSPSFQSCIRDAKVASIMCSYNAVNGIPSCAN 170

Query: 127 PDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFL 184
             +L+      W L  + +I  DC +VG ++   HYT  P    A A+ AG D+DC    
Sbjct: 171 RYLLQTLARDFWGLGEEQWITGDCGAVGNIFARHHYTDDPANGTAVALNAGTDIDCDSGA 230

Query: 185 AIHTEG---AVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAH 241
           A +++    A+   L+ E+ +  A+       +RL                  DV T   
Sbjct: 231 AAYSQNLGQALNRSLVSEDQLRTAVTRQYNSLVRLSW---------------DDVNTEPA 275

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
           QQLA QAA +GIVLLKN    LPL++     VAV+GP ++ T  M  NY G+A    +P 
Sbjct: 276 QQLAYQAAVEGIVLLKNDG-ILPLAS-SVKKVAVVGPMANATTQMQSNYNGIAPFLVSPQ 333

Query: 302 QGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLL 361
           Q        +  A   G+  +      AA  AA  AD    V G+D +IE E  DR  + 
Sbjct: 334 QAFRNAGFNVTFANGTGLNSSDTSGFSAAIAAADDADVVFYVGGIDTTIEREDRDRPEIS 393

Query: 362 LPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIAD 421
             G Q  LV ++A   + P++++ M GG VD S  +++  + A++W GYPGQ+GG A+ D
Sbjct: 394 WTGNQLALVQQLASLGK-PLIVLQMGGGQVDSSSLRDNTSVNALIWGGYPGQSGGTALVD 452

Query: 422 VLFGRANP 429
           ++ G+  P
Sbjct: 453 LITGKQAP 460


>gi|358393086|gb|EHK42487.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
           206040]
          Length = 794

 Score =  276 bits (705), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 158/431 (36%), Positives = 243/431 (56%), Gaps = 10/431 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDP-VLTGKYAASYVRGLQGNTG-SRLKVA 60
           N G  GL  ++PN+N FR P WGRGQETPGED  VLT  Y   Y+ G+QG      LK+A
Sbjct: 164 NNGRYGLDVYAPNINGFRSPLWGRGQETPGEDANVLTSAYTYEYITGMQGGVDPENLKIA 223

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KH+  YDL+N+N   R  F+A +++QDL + Y   F A     K  S MC+YN VNG
Sbjct: 224 ATAKHFAGYDLENYNNQSRLGFDAIITQQDLSEYYTPQFLAASRYAKSHSFMCAYNSVNG 283

Query: 121 KPTCADPDILKNTIHGQWRLD--GYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+C++   L+  +   W     GY+ SDCD++  ++N  +Y  +   AAAD++KAG D+
Sbjct: 284 VPSCSNSFFLQTLLRESWGFPEYGYVSSDCDAIYNVWNPHNYANSQSSAAADSLKAGTDI 343

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     H   +   G +   ++  ++       +RLG FD +     + +LG +DV  
Sbjct: 344 DCGQTYPWHLNESFVAGTVSRGEIERSVTRLYANLVRLGYFDKK---NEYRSLGWKDVVK 400

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
                ++ +AA +GIVLLKN   TLPLS  +  ++A+IGP  + T  + GNY G A    
Sbjct: 401 TDAWNISYEAAVEGIVLLKNDG-TLPLSK-KVRSIALIGPWVNATEQLQGNYFGTAPYLI 458

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           +PLQ   +    ++     G+          A  AA+++DA + + G+D +IE E  DR 
Sbjct: 459 SPLQAAKKAGYEVNYELGTGINNQTTAGFAKAIAAAKKSDAIIFIGGIDNTIEQEGADRT 518

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            +  PG Q +L+ ++++  + P+V++ M GG VD S  K++ ++ +++W GYPGQ+GG A
Sbjct: 519 DIAWPGNQLDLIKQLSEVGK-PLVVLQMGGGQVDSSSIKSNKKVNSLVWGGYPGQSGGYA 577

Query: 419 IADVLFGRANP 429
           + D+L G+  P
Sbjct: 578 LFDILSGKRAP 588


>gi|169767016|ref|XP_001817979.1| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
 gi|121805502|sp|Q2UR38.1|XYND_ASPOR RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|83765834|dbj|BAE55977.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 798

 Score =  276 bits (705), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 158/434 (36%), Positives = 236/434 (54%), Gaps = 11/434 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPV-LTGKYAASYVRGLQGNTGSR-LKVA 60
           N G  GL  +SPN+N FR P WGRGQETPGED   L   YA  Y+ G+QG   +  LK+ 
Sbjct: 162 NAGRYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQGGVDANPLKLI 221

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KHY  YD++NW+   R   + ++++QDL + Y   F     + KV SVMCSYN VNG
Sbjct: 222 ATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNG 281

Query: 121 KPTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+C++   L+  +   +    DGY+  DC +V  ++N   Y      AAAD+I+AG D+
Sbjct: 282 VPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDI 341

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     H + +     +  +D+   +       +R G FDG+ S  P+ N+   DV +
Sbjct: 342 DCGVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDGKTS--PYRNITWSDVVS 399

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPL--STLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
              Q L+ +AA Q IVLLKN    LPL  ++    T+A+IGP ++ T  M+GNY G A  
Sbjct: 400 TNAQNLSYEAAAQSIVLLKNDG-ILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPY 458

Query: 297 YTTPLQGISRYAKTIHQAGCFGVACNGNQLI-GAAEVAARQADATVLVMGLDQSIEAEFI 355
             +PLQ        I          + +      A   A++AD  +   G+D ++E E  
Sbjct: 459 LISPLQAFQDSEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQ 518

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR+ +  P  Q  L++++A   + P++++ M GG VD S  KN+  + A++W GYPGQ+G
Sbjct: 519 DRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSG 577

Query: 416 GAAIADVLFGRANP 429
           G A+AD++ G+  P
Sbjct: 578 GQALADIITGKRAP 591


>gi|391872736|gb|EIT81831.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
          Length = 798

 Score =  276 bits (705), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 158/434 (36%), Positives = 236/434 (54%), Gaps = 11/434 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPV-LTGKYAASYVRGLQGNTGSR-LKVA 60
           N G  GL  +SPN+N FR P WGRGQETPGED   L   YA  Y+ G+QG   +  LK+ 
Sbjct: 162 NAGRYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQGGVDANPLKLI 221

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KHY  YD++NW+   R   + ++++QDL + Y   F     + KV SVMCSYN VNG
Sbjct: 222 ATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNG 281

Query: 121 KPTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+C++   L+  +   +    DGY+  DC +V  ++N   Y      AAAD+I+AG D+
Sbjct: 282 VPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDI 341

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     H + +     +  +D+   +       +R G FDG+ S  P+ N+   DV +
Sbjct: 342 DCGVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDGKTS--PYRNITWSDVVS 399

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPL--STLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
              Q L+ +AA Q IVLLKN    LPL  ++    T+A+IGP ++ T  M+GNY G A  
Sbjct: 400 TNAQNLSYEAAAQSIVLLKNDG-ILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPY 458

Query: 297 YTTPLQGISRYAKTIHQAGCFGVACNGNQLI-GAAEVAARQADATVLVMGLDQSIEAEFI 355
             +PLQ        I          + +      A   A++AD  +   G+D ++E E  
Sbjct: 459 LISPLQAFQDSEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQ 518

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR+ +  P  Q  L++++A   + P++++ M GG VD S  KN+  + A++W GYPGQ+G
Sbjct: 519 DRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSG 577

Query: 416 GAAIADVLFGRANP 429
           G A+AD++ G+  P
Sbjct: 578 GQALADIITGKRAP 591


>gi|326202986|ref|ZP_08192853.1| glycoside hydrolase family 3 domain protein [Clostridium
           papyrosolvens DSM 2782]
 gi|325987063|gb|EGD47892.1| glycoside hydrolase family 3 domain protein [Clostridium
           papyrosolvens DSM 2782]
          Length = 712

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 168/444 (37%), Positives = 251/444 (56%), Gaps = 35/444 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           G+T+WSPNVNIFRDPRWGRG ET GEDP LT +   ++V+GLQG+ G  LK AAC KHY 
Sbjct: 111 GITFWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGD-GKYLKTAACAKHYA 169

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +   +    DR+ F+A VS++DL +TY   F+A V E KV S+M +YN+ NG+P     
Sbjct: 170 VH---SGPEDDRHFFDAIVSQKDLYETYLPAFEALVKEAKVESIMGAYNRTNGEPCNGSK 226

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP-FLAI 186
            +LK+ +   W  DG++VSDC ++   +     T+TP E+ A A+K+G DL+CG  +L I
Sbjct: 227 TLLKDILRDGWGFDGHVVSDCWAIKDFHEGHGVTKTPTESVALALKSGCDLNCGNMYLLI 286

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLAL 246
               A++ GL+ EED++ A    +T +M+LGMFD +     F N+      +  H +++L
Sbjct: 287 LL--ALKEGLITEEDIDRAAIRLMTTRMKLGMFDDDCE---FDNIPYELNDSAEHNKISL 341

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI-- 304
           +AA + +VLLKN    LPL + +   VAVIGPN+D ++ +  NY+G      T ++GI  
Sbjct: 342 EAAKKSMVLLKNDG-LLPLDSKKIKNVAVIGPNADSSLALRANYSGTPSQNVTIIEGIRK 400

Query: 305 --SRYAKTIHQAGCFGVACNGNQL------IGAAEVAARQADATVLVMGLDQSIEAEFI- 355
             S   +  +  G          L      +  A  AA ++D  VL +GLD S+E E   
Sbjct: 401 RVSENTRVWYAMGSHLFLNRDEDLAQPDDRLKEAVSAAERSDVVVLCLGLDASVEGEQND 460

Query: 356 ----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAI 405
                     D+A L LP  Q+ L++ V  A+  P ++ L+ G  + +  A +  +  AI
Sbjct: 461 QGTVILDAGGDKADLNLPESQRNLLNAVL-ATGKPTIVALLSGSALSIGDAAD--KAAAI 517

Query: 406 LWVGYPGQAGGAAIADVLFGRANP 429
           +   YPG  GG A A+++FG  +P
Sbjct: 518 VQCWYPGAIGGLAFAEMIFGDYSP 541


>gi|3135209|dbj|BAA28267.1| beta-xylosidase A [Aspergillus oryzae]
          Length = 798

 Score =  275 bits (704), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 158/434 (36%), Positives = 236/434 (54%), Gaps = 11/434 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPV-LTGKYAASYVRGLQGNTGSR-LKVA 60
           N G  GL  +SPN+N FR P WGRGQETPGED   L   YA  Y+ G+QG   +  LK+ 
Sbjct: 162 NAGRYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQGGVDANPLKLI 221

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KHY  YD++NW+   R   + ++++QDL + Y   F     + KV SVMCSYN VNG
Sbjct: 222 ATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNG 281

Query: 121 KPTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+C++   L+  +   +    DGY+  DC +V  ++N   Y      AAAD+I+AG D+
Sbjct: 282 VPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDI 341

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     H + +     +  +D+   +       +R G FDG+ S  P+ N+   DV +
Sbjct: 342 DCGVSYPRHFQESFHDQEVSRQDLERGVIRLYASLIRAGYFDGKTS--PYRNITWSDVVS 399

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPL--STLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
              Q L+ +AA Q IVLLKN    LPL  ++    T+A+IGP ++ T  M+GNY G A  
Sbjct: 400 TNAQNLSYEAAAQSIVLLKNDG-ILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPY 458

Query: 297 YTTPLQGISRYAKTIHQAGCFGVACNGNQLI-GAAEVAARQADATVLVMGLDQSIEAEFI 355
             +PLQ        I          + +      A   A++AD  +   G+D ++E E  
Sbjct: 459 LISPLQAFQDSEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQ 518

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR+ +  P  Q  L++++A   + P++++ M GG VD S  KN+  + A++W GYPGQ+G
Sbjct: 519 DRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSG 577

Query: 416 GAAIADVLFGRANP 429
           G A+AD++ G+  P
Sbjct: 578 GQALADIITGKRAP 591


>gi|167519969|ref|XP_001744324.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777410|gb|EDQ91027.1| predicted protein [Monosiga brevicollis MX1]
          Length = 721

 Score =  275 bits (704), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 166/430 (38%), Positives = 234/430 (54%), Gaps = 9/430 (2%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGL-QGNTGSRLKVAACCKHY 66
            L+ + P +NI RD RWGR QE+  EDP L G YA ++V GL Q N+   L+ A  CKH 
Sbjct: 161 ALSIYGPTMNIIRDGRWGRSQESVSEDPWLNGLYAVNFVLGLEQRNSSKYLQAATSCKHL 220

Query: 67  TAYDLDNWNG-VDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            AY  + +N  + R+ FNA + + D+ DTY   F+ACV  G V  +MCSYN VNG P CA
Sbjct: 221 FAYSFEGYNNTLTRHSFNAVIDELDIHDTYLPAFRACVELGHVQQIMCSYNSVNGIPACA 280

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
             D+  + +   W  +G IVSDCD+V  +YNT +YTRTPE+A   A++ G DLDCG F +
Sbjct: 281 RGDVQNDRVRKAWGFEGLIVSDCDAVADIYNTHNYTRTPEDAVTVALQGGCDLDCGDFYS 340

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
            H   AV+  L     +  ++   + ++  LG FD + S  P+  LG   + TP  +  +
Sbjct: 341 QHLASAVQQNLTTLAALQQSMTRVLEMRFLLGEFDPDTSV-PYRQLGREAIDTPFARDSS 399

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY-TTPLQGI 304
           L+A+ + +VLL+N  + LP++      VA+IGP  ++T  M+G        + TT  QG 
Sbjct: 400 LRASRESVVLLENRIKLLPVTLSADIKVALIGPYVNLTTIMMGGKLDYTPSFITTYFQGF 459

Query: 305 SRYAKT--IHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLL 362
                T      GC   A     L  A ++A  QAD  VL +GL   IE E  DR  L L
Sbjct: 460 QAIGITHLTSSPGCNITAPLPGALDKAVQIAT-QADLVVLTLGLSSDIEHEGGDRETLGL 518

Query: 363 PGRQQELVSRVAKA-SRGPVVLVLMCGGPVDVSFAK-NDPRIGAILWVGYPGQAGGAAIA 420
           P  QQ+L   ++ A     +V+VL+ GGPV V   K    R   I+   Y GQ+ G A+A
Sbjct: 519 PTPQQDLYDAISAAIPSSKLVVVLVNGGPVSVDRIKYGIARTPTIIEAFYGGQSAGTALA 578

Query: 421 DVLFGRANPG 430
           + +FG+ NP 
Sbjct: 579 ETIFGQNNPS 588


>gi|2723496|dbj|BAA24107.1| beta-1,4-xylosidase [Aspergillus oryzae]
          Length = 798

 Score =  275 bits (704), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 158/434 (36%), Positives = 236/434 (54%), Gaps = 11/434 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPV-LTGKYAASYVRGLQGNTGSR-LKVA 60
           N G  GL  +SPN+N FR P WGRGQETPGED   L   YA  Y+ G+QG   +  LK+ 
Sbjct: 162 NAGRYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQGGVDANPLKLI 221

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KHY  YD++NW+   R   + ++++QDL + Y   F     + KV SVMCSYN VNG
Sbjct: 222 ATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNG 281

Query: 121 KPTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+C++   L+  +   +    DGY+  DC +V  ++N   Y      AAAD+I+AG D+
Sbjct: 282 VPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDI 341

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     H + +     +  +D+   +       +R G FDG+ S  P+ N+   DV +
Sbjct: 342 DCGVSYPRHFQESFHDQEVSRQDLERGVIRLYASLIRAGYFDGKTS--PYRNITWSDVVS 399

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPL--STLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
              Q L+ +AA Q IVLLKN    LPL  ++    T+A+IGP ++ T  M+GNY G A  
Sbjct: 400 TNAQNLSYEAAAQSIVLLKNDG-ILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPY 458

Query: 297 YTTPLQGISRYAKTIHQAGCFGVACNGNQLI-GAAEVAARQADATVLVMGLDQSIEAEFI 355
             +PLQ        I          + +      A   A++AD  +   G+D ++E E  
Sbjct: 459 LISPLQAFQDSEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQ 518

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR+ +  P  Q  L++++A   + P++++ M GG VD S  KN+  + A++W GYPGQ+G
Sbjct: 519 DRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSG 577

Query: 416 GAAIADVLFGRANP 429
           G A+AD++ G+  P
Sbjct: 578 GQALADIITGKRAP 591


>gi|302669556|ref|YP_003829516.1| beta-xylosidase [Butyrivibrio proteoclasticus B316]
 gi|302394029|gb|ADL32934.1| beta-xylosidase Xyl3A [Butyrivibrio proteoclasticus B316]
          Length = 709

 Score =  275 bits (703), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 163/443 (36%), Positives = 251/443 (56%), Gaps = 34/443 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+W+PNVNIFRDPRWGRG ET GEDP LT + A  +V+ +QG+ G  +K AAC KH+ 
Sbjct: 104 GLTFWAPNVNIFRDPRWGRGHETYGEDPFLTSRLAVPFVKAMQGD-GEYMKAAACAKHFA 162

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +   +    +R+ F+A+ SK+DLE+TY   F+A V E +V +VM +YN+ NG+P CA+ 
Sbjct: 163 VH---SGPEGERHFFDAKASKKDLEETYLPAFEALVKEAEVEAVMGAYNRTNGEPCCANK 219

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            ++ +T+ G+W   G+ VSDC ++   +     T +PEE+A  A++ G DL+CG      
Sbjct: 220 PLMVDTLRGKWGFQGHFVSDCWAIKDFHENHKVTSSPEESAKLALEMGCDLNCGCTYQSI 279

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
             G VR GL+ E+ +  +     T +  LGMFD       F  +    V    H  +A +
Sbjct: 280 MNG-VRAGLIDEKLITESCERLFTTRFLLGMFD----KTEFDEIPYEKVECKEHLAVAKR 334

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRY 307
           AA + +VLLKN    LPL+     T+ V+GPN++  +++IGNY G +  Y T L+GI   
Sbjct: 335 AARESVVLLKNDG-LLPLNKDSIKTIGVVGPNANSRLSLIGNYHGTSSRYITVLEGIQDK 393

Query: 308 ----AKTIHQAGCFGVACNGNQL--------IGAAEVAARQADATVLVMGLDQSIEAE-- 353
                + ++  GC     N + L        +  A+  A  +D  V+V+GLD+++E E  
Sbjct: 394 VGDDVRVLYSEGCDIFQNNISNLADPNLPDRLSEAQAVADHSDVVVVVVGLDENLEGEEG 453

Query: 354 -----FI--DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
                F   D+  L LP  Q++L++ V    + P +++ M G  +D+S A+++    A+L
Sbjct: 454 DAGNQFASGDKINLNLPLSQRQLLNAVLDCGK-PTIVIDMAGSAIDLSKAQDEA--NAVL 510

Query: 407 WVGYPGQAGGAAIADVLFGRANP 429
              YPG  GGA +AD+LFG  +P
Sbjct: 511 QAFYPGARGGADVADILFGDVSP 533


>gi|330947691|ref|XP_003306937.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
 gi|311315273|gb|EFQ84970.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
          Length = 756

 Score =  275 bits (703), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 160/429 (37%), Positives = 236/429 (55%), Gaps = 11/429 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           NGG+A + +W+P++N FRDPRWGRG ETPGED +    Y  S + GL+G+   R K+ A 
Sbjct: 135 NGGIAPVDFWTPDINPFRDPRWGRGSETPGEDILRIKGYTKSLLSGLEGDKAQR-KIIAT 193

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKHY  YD++NWNG DR+HF+A+++ QDL + +  PF+ C  + KV S MCSYN VNG P
Sbjct: 194 CKHYVGYDVENWNGTDRHHFDAKITTQDLAEYFMPPFQQCARDSKVGSFMCSYNAVNGVP 253

Query: 123 TCADPDILKNTIHGQWRL---DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           TCAD  +L++ +   W     + YI SDC++V  +     Y  T +EA A A   G+DL 
Sbjct: 254 TCADTYVLEDILRKHWNWTDSNNYITSDCEAVKDISLRHKYVATLQEATAIAFNNGMDLS 313

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           C         GA   GLL    ++ AL       +  G FDG  +A  + +LG +D+ TP
Sbjct: 314 CEYSGTSDIPGAFSQGLLNVSVIDRALTRQYEGLVHAGYFDG--AAATYAHLGVQDINTP 371

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
             Q+L LQ A +G+ LLKN   TLPLS      VA++G  ++ T  + G Y+G A    T
Sbjct: 372 EAQKLVLQVAAEGLTLLKND-DTLPLSLKSGSKVAMVGFWANTTSKLSGIYSGPAPYLHT 430

Query: 300 PLQGISRYAKTIHQAGCFGVACNG--NQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           P+   ++    +  A    +  +G  +     A  AA+++D  +   GLD S  AE  DR
Sbjct: 431 PVYAGNKLGLDMAVATGPILQTSGAADNWTTTALNAAKKSDFILYFGGLDPSAAAEGSDR 490

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +  P  Q +L++++A   +  VV+ L  G  VD +       + +++W  +PGQ GG 
Sbjct: 491 TDISWPSAQIDLITKLAALGKPLVVIAL--GDMVDHTPILKMKGVNSLIWANWPGQDGGT 548

Query: 418 AIADVLFGR 426
           A+  V+ G 
Sbjct: 549 AVMQVITGE 557


>gi|7671419|emb|CAB89360.1| beta-glucosidase-like protein [Arabidopsis thaliana]
 gi|9758998|dbj|BAB09525.1| unnamed protein product [Arabidopsis thaliana]
          Length = 411

 Score =  275 bits (703), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 138/217 (63%), Positives = 166/217 (76%), Gaps = 2/217 (0%)

Query: 214 MRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTV 273
           MRLG FDG P  QP+G LGP+DVCT  +++LA++ A QGIVLLKNSA +LPLS     T+
Sbjct: 1   MRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSAGSLPLSPSAIKTL 60

Query: 274 AVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRYAKTI-HQAGCFGVACNGNQLIGAAEV 332
           AVIGPN++VT TMIGNY GVAC YTTPLQG+ R   T  +  GCF V C    L  A  +
Sbjct: 61  AVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNVTCTEADLDSAKTL 120

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA  ADATVLVMG DQ+IE E +DR  L LPG+QQELV++VAKA+RGPVVLV+M GG  D
Sbjct: 121 AA-SADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGPVVLVIMSGGGFD 179

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++FAKND +I +I+WVGYPG+AGG AIADV+FGR NP
Sbjct: 180 ITFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNP 216


>gi|376259588|ref|YP_005146308.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
 gi|373943582|gb|AEY64503.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
          Length = 712

 Score =  275 bits (702), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 171/445 (38%), Positives = 252/445 (56%), Gaps = 37/445 (8%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           G+T+WSPNVNIFRDPRWGRG ET GEDP LT +   ++V+GLQG+ G  LK AAC KH+ 
Sbjct: 111 GITFWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGD-GKYLKTAACAKHFA 169

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +   +    DR+HF+A VS++DL +TY   F+A V E KV SVM +YN+ NG+P     
Sbjct: 170 VH---SGPEDDRHHFDAVVSQKDLYETYLPAFEALVKEAKVESVMGAYNRTNGEPCNGSK 226

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP-FLAI 186
            +LK+ +   W  DG++VSDC ++   +     T+TP E+ A A+K+G DL+CG  +L I
Sbjct: 227 TLLKDILRDGWGFDGHVVSDCWAIKDFHEGHGVTKTPTESVALALKSGCDLNCGNMYLLI 286

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLAL 246
               A++ G + EED++ A    +T +MRLGMFD +     F  +      +  H +L+L
Sbjct: 287 LL--ALKEGRITEEDIDRAAIRLMTTRMRLGMFDDDCE---FDKIPYELNDSVEHNKLSL 341

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI-- 304
           +AA + +VLLKN    LPL + +   +AVIGPN+D ++ +  NY+G      T L GI  
Sbjct: 342 EAAKKSMVLLKNDG-LLPLDSKKIKNIAVIGPNADSSLALRANYSGTPSQNITILDGIRK 400

Query: 305 --SRYAKTIHQAGCF-------GVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI 355
             S   +  +  G          +A   ++L  A  VA R +D  VL +GLD S+E E  
Sbjct: 401 RVSEDTRVWYSVGSHLFMNREEDLAQPDDRLKEAVSVAER-SDVVVLCLGLDASVEGEQN 459

Query: 356 -----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGA 404
                      D+A L LP  Q+ L++ V  A+  P ++ L+ G  + +  A +  +  A
Sbjct: 460 DQGTVILDAGGDKADLNLPESQRNLLNAVL-ATGKPTIVALLSGSALSIGDAAD--KAAA 516

Query: 405 ILWVGYPGQAGGAAIADVLFGRANP 429
           I+   YPG  GG A A+++FG  +P
Sbjct: 517 IVQCWYPGSRGGLAFAEMIFGDYSP 541


>gi|125534110|gb|EAY80658.1| hypothetical protein OsI_35835 [Oryza sativa Indica Group]
          Length = 511

 Score =  274 bits (701), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 150/290 (51%), Positives = 192/290 (66%), Gaps = 4/290 (1%)

Query: 143 YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDV 202
           Y+ SDCD+V  + +  HYT +PE+  A +IKAG+D++CG +  +H   AV+ G L E+D+
Sbjct: 16  YVASDCDAVATIRDAHHYTLSPEDTVAVSIKAGMDVNCGNYTQVHAMAAVQKGNLTEKDI 75

Query: 203 NLALAYTITVQMRLGMFDGEPSAQP-FGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSAR 261
           + AL     V+MRLG FDG+P +   +G+LG  DVC+PAH+ LAL+AA  GIVLLKN A 
Sbjct: 76  DRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKNDAG 135

Query: 262 TLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRYA--KTIHQAGCFGV 319
            LPL      ++AVIGPN+D    + GNY G  C  TTPLQGI  Y   +    AGC   
Sbjct: 136 ALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGCDSP 195

Query: 320 ACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRG 379
           AC       AA +A+  +D  VL MGL Q  E E +DR  LLLPG QQ L++ VA A+R 
Sbjct: 196 ACAVAATNEAAALAS-SSDHVVLFMGLSQKQEQEGLDRTSLLLPGEQQGLITAVANAARR 254

Query: 380 PVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           PV+LVL+ GGPVDV+FAK++P+IGAIL  GYPGQAGG AIA VLFG  NP
Sbjct: 255 PVILVLLTGGPVDVTFAKDNPKIGAILLAGYPGQAGGLAIAKVLFGDHNP 304


>gi|115387056|ref|XP_001210069.1| predicted protein [Aspergillus terreus NIH2624]
 gi|114191067|gb|EAU32767.1| predicted protein [Aspergillus terreus NIH2624]
          Length = 908

 Score =  274 bits (701), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 173/435 (39%), Positives = 249/435 (57%), Gaps = 22/435 (5%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N G +G  +W+PN+N FRDPRWGRGQETPGED  +   Y  +++ GLQG+     +V A 
Sbjct: 266 NNGFSGFDFWAPNINGFRDPRWGRGQETPGEDTFVAQNYIRNFIPGLQGDDPKNKQVIAT 325

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKHY  YDL+      RY  N   ++QDL D +  PFK CV +  V S+MCSYN V+G P
Sbjct: 326 CKHYAVYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIP 381

Query: 123 TCADPDILKNTIHGQWRLDG---YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
            CA+  +L   +   W  +    Y+VSDC++V  ++   ++T T E AAA A+ AG+DL+
Sbjct: 382 ACANEYLLDEVLRKHWGFNADYHYVVSDCNAVTDIWQYHNFTDTEEAAAAVALNAGVDLE 441

Query: 180 CG-PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           CG  +L ++   A     ++  D +LA  Y  +    +G FDG      + +L   DV  
Sbjct: 442 CGSSYLKLNESLAANQTSVKAMDQSLARLY--SALFTIGFFDG----GKYDHLDFSDVSI 495

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPL-STLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           PA Q LA +AA +G+ LLKN    LPL S  ++ +VAVIGP ++ T  M G Y+G A   
Sbjct: 496 PAAQALAYEAAVEGMTLLKNDG-LLPLHSQHKYKSVAVIGPFANATTQMQGGYSGNAPYL 554

Query: 298 TTPLQGI-SRYAKTIHQAGCFGVACNGNQLIG--AAEVAARQADATVLVMGLDQSIEAEF 354
            +PL    S +   ++ A   G A N     G  A+  AA+++D  V + G+D SIE+E 
Sbjct: 555 ISPLVAFESDHRWKVNYA--VGTAINDQNTTGFEASLAAAKKSDLIVYLGGIDNSIESET 612

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
           IDR  L  PG Q +L+  ++  S+ P+V+V   GG VD S    +  I A++W GYP Q+
Sbjct: 613 IDRTSLAWPGNQLDLIKSLSNLSK-PMVVVQFGGGQVDDSALLENKDIQALIWAGYPSQS 671

Query: 415 GGAAIADVLFGRANP 429
           GG A+ D+L G+ +P
Sbjct: 672 GGTALLDILVGKRSP 686


>gi|154313073|ref|XP_001555863.1| hypothetical protein BC1G_05538 [Botryotinia fuckeliana B05.10]
          Length = 755

 Score =  274 bits (701), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 176/427 (41%), Positives = 237/427 (55%), Gaps = 27/427 (6%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS--RLKVAACCKH 65
           GL +W+PN+N ++DPRWGRGQETPGEDP  T  Y  + + GLQG        K  A CKH
Sbjct: 142 GLNFWTPNINPYKDPRWGRGQETPGEDPFHTSSYVNALITGLQGGLDDLPYKKGVATCKH 201

Query: 66  YTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
           +  YDL+N +G  RY F+A +  QDL D Y  PF+ C  +  V SVMCSYN +NG PTCA
Sbjct: 202 FAGYDLENSDGAIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSVMCSYNAMNGVPTCA 261

Query: 126 DPDILKNTIHGQW---RLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP 182
           D  +L+  +   W     D ++ SDCD+V  +++  +YT TPE++AADA+ AG DLDCG 
Sbjct: 262 DDWLLQTLLREHWGWTEEDQWVTSDCDAVKNIWDYHNYTLTPEQSAADALNAGTDLDCGT 321

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
           F   +   A   GL     ++ +LA      +RLG FD  PS QP+  L   +V TPA Q
Sbjct: 322 FWPTYLGSAYDQGLYDISTLDRSLARRYASLVRLGYFD-PPSVQPYRQLNWDNVSTPAAQ 380

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
           QLALQAA  GIVLLKN    LPLS+     VA+IGP ++ T  M GNY G A    +PL 
Sbjct: 381 QLALQAAEDGIVLLKNDG-ILPLSS-NITNVALIGPLANATKQMQGNYYGTAPYLRSPLI 438

Query: 303 GISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLL 362
                   +       +         AA  AA+ AD  + V G+D SIEAE         
Sbjct: 439 AAQNAGFKVTYVQGADIDSQNTTDFSAAISAAQSADLVIYVGGIDNSIEAE--------- 489

Query: 363 PGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADV 422
                E+++ ++     P+++  M G  +D S   ++  + A+LW GYPGQ GG AI ++
Sbjct: 490 -----EILANLST----PLIISQM-GCMIDSSSLLSNTGVNALLWAGYPGQDGGTAIFNI 539

Query: 423 LFGRANP 429
           L G+  P
Sbjct: 540 LTGKTAP 546


>gi|409385818|ref|ZP_11238358.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
 gi|399206850|emb|CCK19273.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
          Length = 695

 Score =  274 bits (700), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 164/443 (37%), Positives = 246/443 (55%), Gaps = 32/443 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPN+N+FRDPRWGRGQET GEDP LT +   ++++GLQG  G  L++AAC KH+ 
Sbjct: 106 GLTFWSPNINLFRDPRWGRGQETYGEDPFLTAQIGVAFIKGLQGE-GKYLRLAACTKHFA 164

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +   +    DR++F+A V+ +DL + Y   FKA + E  V S M +YN +NG+P C + 
Sbjct: 165 VH---SGPEADRHYFDAVVNPKDLNEFYLPQFKAAIEEADVESFMGAYNAINGQPACVNE 221

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
           +++  T+ G+W  +G++VSD  ++  ++   HYT+T  E  A A+K G +L C   ++  
Sbjct: 222 ELIAKTLLGKWGFEGHVVSDYAALEDVHENHHYTQTAAETMALAMKIGTNL-CAGKISDA 280

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA-HQQLAL 246
              AV  GL+ E ++  ++    T  +RLGMF  +          P +V   A H+ L+L
Sbjct: 281 LFEAVGKGLVTETEITASVVKLYTTHVRLGMFAEDNDYDTI----PYEVNASAEHEMLSL 336

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI-- 304
           +AA + +VLLKN    LPLS     +VAVIGP +     + GNYAG A  Y T + GI  
Sbjct: 337 KAAEKSMVLLKND-NFLPLSQSEIKSVAVIGPTARNIGALEGNYAGTANHYETFVSGIQQ 395

Query: 305 --SRYAKTIHQAGCFGVA-------CNGNQLIGAAEVAARQADATVLVMGLDQSIEAE-- 353
             S  A+  +  GC   A          N+    A +AA  AD  VL +GLD +IE E  
Sbjct: 396 ALSNQARVTYALGCHLYADHAESSLSRANERESEAIIAAEHADIAVLCVGLDPTIEGEQG 455

Query: 354 -------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
                    D+  L LPG+Q+ L+ +V +  +  V+LVL  G  + +   +    + AI+
Sbjct: 456 DAGNVYGSGDKPSLSLPGQQKRLIEKVLETGK-TVILVLTSGSALSLEGLEKHTGVKAII 514

Query: 407 WVGYPGQAGGAAIADVLFGRANP 429
              YPG  GG A+A++L G+ +P
Sbjct: 515 QAWYPGAHGGTALANILLGKVSP 537


>gi|333995841|ref|YP_004528454.1| beta-glucosidase [Treponema azotonutricium ZAS-9]
 gi|333737309|gb|AEF83258.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
           (Beta-D-glucoside glucohydrolase) [Treponema
           azotonutricium ZAS-9]
          Length = 706

 Score =  274 bits (700), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 170/451 (37%), Positives = 251/451 (55%), Gaps = 46/451 (10%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+W+PNVNIFRDPRWGRGQET GEDP LTG+   ++++GLQG+    LKVAAC KHY 
Sbjct: 101 GLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGRIGLAFMKGLQGDDTEHLKVAACAKHYA 160

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+A VSK+DL +TY   FK  +VE  V +VM +YN+  G+P   
Sbjct: 161 VH-----SGPEKLRHTFDAVVSKKDLFETYLPAFK-LLVENGVEAVMGAYNRTLGEPCGG 214

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG---P 182
              +LK  + G+W   G++ SDC ++   +     T++PEE+AA A+ AG DL+CG   P
Sbjct: 215 STYLLKEILRGRWGFKGHVTSDCWAIRDFHENHKVTKSPEESAAMALNAGCDLNCGCTYP 274

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
           +L +    + + GL+ +E ++ AL   +  + +LG+FD  P   P+ NLG   V    H+
Sbjct: 275 YLTV----SHKKGLVTDETIDTALTRLLRTRFKLGLFD-PPEQDPYRNLGNDIVGCEKHR 329

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
            LAL+AA + IVLLKN +  LPL       + ++GP +   +T++ NY G++    T L+
Sbjct: 330 NLALEAAQKSIVLLKNDSNILPLDD-SARKILLMGPGAANILTLLANYYGMSSRLVTILE 388

Query: 303 GISRYAKTI------HQAGCFGVACN--GNQLIGAAEVAAR-------QADATVLVMGLD 347
           G++   KT       ++ G      N   N   G+  V A        + D  + V GLD
Sbjct: 389 GLAEKIKTKTAISFEYRQGSLMYEPNHLSNVPFGSTGVDAEAPIYGLDEIDLVIAVYGLD 448

Query: 348 QSIEAEF---------IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKN 398
            S+E E           DR  + LP  Q   + R+ KA +  VVL+L  G P+    A  
Sbjct: 449 GSMEGEEGDSIASDANGDRDTIELPSWQLNFLRRIRKAGK-KVVLILTGGSPI----AFP 503

Query: 399 DPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +    A+L+  YPG+ GG A+AD+LFG  +P
Sbjct: 504 EDLADAVLFAWYPGEQGGNAVADILFGDVSP 534


>gi|320161274|ref|YP_004174498.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
 gi|319995127|dbj|BAJ63898.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
          Length = 712

 Score =  274 bits (700), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 165/434 (38%), Positives = 244/434 (56%), Gaps = 20/434 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT WSPN+NIFRDPRWGRGQET GEDP LTG+  A++VRGLQG     LK AAC KHYT
Sbjct: 114 GLTMWSPNINIFRDPRWGRGQETWGEDPYLTGEMGAAFVRGLQGKDPHYLKTAACAKHYT 173

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +   +    +R+ FNA V++++L DTY   FK  V E KV +VM +YN+  G+P C  P
Sbjct: 174 VH---SGPEKERHTFNAIVTRRELFDTYLPAFKKLVTEAKVEAVMGAYNRTLGEPCCGSP 230

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC-GPFLAI 186
            +LK  +  QW   G++VSDC ++   +     T+   E+AA  IK G D+ C   +   
Sbjct: 231 YLLKEILRNQWGFKGHVVSDCGAINDFHLHHQVTKDGAESAALGIKNGCDMACICTYSYE 290

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLAL 246
           +   A+  GL+ EED++ AL  T+  + +LG+FD +    P+ ++    V   AH++LA 
Sbjct: 291 NLTEALNRGLITEEDIDHALRNTLRTRFKLGLFDPQEKV-PYAHISMSVVGCEAHRKLAY 349

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG-IS 305
           + A +  VLLKN    LP+      ++ ++GPN+     ++GNY G++   TT ++G + 
Sbjct: 350 ETAVKSAVLLKNHNHILPVKP-DVKSILIVGPNAGNVHVLLGNYYGLSDSMTTFMEGLVG 408

Query: 306 RYAKTIHQAGCFGVACNGNQLIGA--AEVAARQADATVLVMGLDQSIEAEF--------I 355
           R  + +      G     ++ I    +  +A   D  +  MGL   +E E          
Sbjct: 409 RLPEGVRMEFMPGSLLTDSKKIKNDWSVASAASFDLVIAFMGLSPLLEGEEGEAILSDNG 468

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  + LP  QQE +  +A A+   +VLVL  G  + ++  ++   + AILWVGYPGQ G
Sbjct: 469 DREDIALPKAQQEYIRDLA-ATGAKIVLVLTGGSAIALNGIED--LVEAILWVGYPGQEG 525

Query: 416 GAAIADVLFGRANP 429
           G AIAD++FG  +P
Sbjct: 526 GRAIADLIFGDHSP 539


>gi|291544853|emb|CBL17962.1| Beta-glucosidase-related glycosidases [Ruminococcus champanellensis
           18P13]
          Length = 697

 Score =  273 bits (699), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 171/441 (38%), Positives = 249/441 (56%), Gaps = 38/441 (8%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT W+PN+N+FRDPRWGRG ET GEDP LT +   ++V+G+QG  G  LK AAC KH+ 
Sbjct: 114 GLTLWAPNINLFRDPRWGRGHETYGEDPFLTARLGVAFVKGMQGE-GKVLKAAACAKHFA 172

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+A+VS +DLE++Y   F A V E KV  VM +YN+VNG+P+CA
Sbjct: 173 VH-----SGPEALRHSFDAQVSPKDLEESYLPAFHALVAEAKVEGVMGAYNRVNGEPSCA 227

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
            P +L + +H QW   GY VSDC ++   +     T+   E+AA A++ G DL+CG    
Sbjct: 228 SP-MLMDKLH-QWGFAGYFVSDCWAIQDFHKHHGVTKNVTESAALALRTGCDLNCGNTY- 284

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           ++   A+  GL+   D+  A    +  ++RLG+FD EP    F       + +PAH+ ++
Sbjct: 285 LYVLAALEEGLIDAADIRRACIRVLRTRIRLGLFDPEPH---FAACTYDTIASPAHKAVS 341

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           L  A + +VLLKN    LPL   + H +AVIGPN+D    + GNY G A  Y T L+GI 
Sbjct: 342 LSCAEKSMVLLKNDG-ILPLDLSKLHAIAVIGPNADSRAALEGNYCGTADRYVTFLEGIQ 400

Query: 306 R-YAKTIHQA-GCFGVACNGNQLIGAAE------VAARQADATVLVMGLDQSIEA----- 352
             +   +H A GC       + L  A +       AA  +D  +L +GLD ++E      
Sbjct: 401 DAFPGRVHYAQGCHLYKDRTSNLAMADDRYAEALAAAEASDVVILCLGLDATLEGEEGDT 460

Query: 353 --EFI--DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWV 408
             EF   D+A L LP  Q +L+ ++    + PV+LVL  G  ++   + N     A+L  
Sbjct: 461 GNEFSSGDKADLRLPPPQCKLLEKLHAVGK-PVILVLAAGSALNPEISCN-----AVLQA 514

Query: 409 GYPGQAGGAAIADVLFGRANP 429
            YPGQ GG A+A +LFG+ +P
Sbjct: 515 WYPGQCGGQALAHILFGKVSP 535


>gi|389632743|ref|XP_003714024.1| beta-xylosidase [Magnaporthe oryzae 70-15]
 gi|351646357|gb|EHA54217.1| beta-xylosidase [Magnaporthe oryzae 70-15]
          Length = 847

 Score =  273 bits (697), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 169/447 (37%), Positives = 243/447 (54%), Gaps = 25/447 (5%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N G+AGL +W+PN+N ++DPRWGRG ETPGED +   KY  + +RGL+G+  +  K+ A 
Sbjct: 200 NAGLAGLDWWTPNINPYKDPRWGRGMETPGEDALRISKYVKALLRGLEGSDPTTRKMVAN 259

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV---- 118
           CKHY A DL+ WNGV RY+F+A V+ QDL + Y   FK C  +  V S MC+YN +    
Sbjct: 260 CKHYAANDLERWNGVTRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGSFMCAYNAMSIKG 319

Query: 119 -----NGKPTCADPDILKNTIHGQW---RLDGYIVSDCDSVGVLYNTQHYTRTPEEAAAD 170
                NG P CA   ++ + +   W     + +I SDC++V  ++N  H++ T EEAA  
Sbjct: 320 KDLSWNGTPVCASKYLMNDILREHWGWKEHNNWITSDCNAVLHMWNQHHWSDTREEAAGS 379

Query: 171 AIKAGLDLDC--GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF 228
           A  AG D  C    +     +GA   GLL E+ V+ AL       +R G FDG P A P+
Sbjct: 380 AYTAGTDTVCEVSNYDKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRAGYFDG-PDA-PY 437

Query: 229 GNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLS----TLRHHTVAVIGPNSDVTV 284
            N+   DV TP  ++LA ++A +G+VL KN+   LP+       +  TVA+IG   D   
Sbjct: 438 RNITWADVNTPEARKLAHRSAVEGMVLTKNNG-VLPIKLEELQKKGKTVALIGNWVDNGE 496

Query: 285 TMIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQ--LIGAAEVAARQADATVL 342
            M+G Y+G+A    TPL         +  AG       G++      A  AA QAD  + 
Sbjct: 497 QMLGTYSGIAPFRNTPLAAAKALNLKMVTAGGPVNQSTGSRDSWTRPALNAAIQADVVLY 556

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
             G+D S+EAE  DR  L  P  Q +L+S +  ++ G   +V+  G  +D +   ++  I
Sbjct: 557 FGGIDLSVEAEDRDRYSLAWPSAQAKLLSDI--SALGKPTVVVQLGTMLDDTALLDNKNI 614

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANP 429
            AI+W GYPGQ GG A  D++ G+  P
Sbjct: 615 SAIIWAGYPGQDGGTAAFDIITGKTAP 641


>gi|440472411|gb|ELQ41274.1| beta-xylosidase [Magnaporthe oryzae Y34]
 gi|440484691|gb|ELQ64724.1| beta-xylosidase [Magnaporthe oryzae P131]
          Length = 792

 Score =  273 bits (697), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 170/447 (38%), Positives = 243/447 (54%), Gaps = 25/447 (5%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N G+AGL +W+PN+N ++DPRWGRG ETPGED +   KY  + +RGL+G+  +  K+ A 
Sbjct: 145 NAGLAGLDWWTPNINPYKDPRWGRGMETPGEDALRISKYVKALLRGLEGSDPTTRKMVAN 204

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV---- 118
           CKHY A DL+ WNGV RY+F+A V+ QDL + Y   FK C  +  V S MC+YN +    
Sbjct: 205 CKHYAANDLERWNGVTRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGSFMCAYNAMSIKG 264

Query: 119 -----NGKPTCADPDILKNTIHGQW---RLDGYIVSDCDSVGVLYNTQHYTRTPEEAAAD 170
                NG P CA   ++ + +   W     + +I SDC++V  ++N  H++ T EEAA  
Sbjct: 265 KDLSWNGTPVCASKYLMNDILREHWGWKEHNNWITSDCNAVLHMWNQHHWSDTREEAAGS 324

Query: 171 AIKAGLDLDC--GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF 228
           A  AG D  C    +     +GA   GLL E+ V+ AL       +R G FDG P A P+
Sbjct: 325 AYTAGTDTVCEVSNYDKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRAGYFDG-PDA-PY 382

Query: 229 GNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLS----TLRHHTVAVIGPNSDVTV 284
            N+   DV TP  ++LA ++A +G+VL KN+   LP+       +  TVA+IG   D   
Sbjct: 383 RNITWADVNTPEARKLAHRSAVEGMVLTKNNG-VLPIKLEELQKKGKTVALIGNWVDNGE 441

Query: 285 TMIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQ--LIGAAEVAARQADATVL 342
            M+G Y+G+A    TPL         +  AG       G++      A  AA QAD  + 
Sbjct: 442 QMLGTYSGIAPFRNTPLAAAKALNLKMVTAGGPVNQSTGSRDSWTRPALNAAIQADVVLY 501

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
             G+D S+EAE  DR  L  P  Q +L+S ++   +  VV+ L  G  +D +   ++  I
Sbjct: 502 FGGIDLSVEAEDRDRYSLAWPSAQAKLLSDISALGKPTVVVQL--GTMLDDTALLDNKNI 559

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANP 429
            AI+W GYPGQ GG A  D++ G+  P
Sbjct: 560 SAIIWAGYPGQDGGTAAFDIITGKTAP 586


>gi|220927661|ref|YP_002504570.1| glycoside hydrolase [Clostridium cellulolyticum H10]
 gi|219997989|gb|ACL74590.1| glycoside hydrolase family 3 domain protein [Clostridium
           cellulolyticum H10]
          Length = 712

 Score =  272 bits (695), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 169/446 (37%), Positives = 253/446 (56%), Gaps = 39/446 (8%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           G+T+WSPNVNIFRDPRWGRG ET GEDP LT +   ++V+GLQG+ G  LK AAC KH+ 
Sbjct: 111 GITFWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGD-GKYLKSAACAKHFA 169

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +   +    DR+HFNA  S++D+ +TY   F+A V E KV SVM +YN+ NG+P     
Sbjct: 170 VH---SGPEDDRHHFNAVASQKDMYETYLPAFEALVKEAKVESVMGAYNRTNGEPCNGSK 226

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP-FLAI 186
            +LK+ +   W  DG++VSDC ++   +     T+TP E+ A A+K G DL+CG  +L I
Sbjct: 227 TLLKDILRDDWGFDGHVVSDCWAIKDFHEGHGVTKTPTESVALALKNGCDLNCGNMYLLI 286

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC-TPAHQQLA 245
               A++ G + EED++ A    +T +M+LGMFD +          P +V  +  H +L+
Sbjct: 287 LL--ALKEGKITEEDIDRAAIRLMTTRMKLGMFDDDCEFDKI----PYEVNDSIEHNKLS 340

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI- 304
           L+AA + +VLLKN+   LPL + +   +AVIGPN+D ++ +  NY+G      T L G+ 
Sbjct: 341 LEAARKSMVLLKNNG-LLPLDSKKIKNIAVIGPNADSSLALRANYSGTPSHNITILDGVR 399

Query: 305 SRYAKTIHQAGCFG----------VACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEF 354
           SR ++        G          +A   ++L  A  +A R +D  VL +GLD S+E E 
Sbjct: 400 SRVSEDTRVWYSLGSHLFMNREEDLAQPDDRLKEAVSMAER-SDVVVLCLGLDASVEGEQ 458

Query: 355 I-----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIG 403
                       D+A L LP  Q+ L++ V  A+  P ++ L+ G  + +  A +  +  
Sbjct: 459 NDQGTVILDAGGDKADLNLPESQRNLLNAVL-ATGKPTIVALLSGSALSIGDAAD--KAA 515

Query: 404 AILWVGYPGQAGGAAIADVLFGRANP 429
           AI+   YPG  GG A A+++FG  +P
Sbjct: 516 AIVQCWYPGSKGGLAFAEMIFGDYSP 541


>gi|268610157|ref|ZP_06143884.1| glycoside hydrolase family 3 protein [Ruminococcus flavefaciens
           FD-1]
          Length = 690

 Score =  271 bits (692), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 170/442 (38%), Positives = 244/442 (55%), Gaps = 40/442 (9%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL  WSPNVNIFRDPRWGRGQET GEDP LT +   ++ +GLQG  G  LK AAC KH  
Sbjct: 109 GLCLWSPNVNIFRDPRWGRGQETYGEDPYLTTRLGVAFAKGLQGE-GEVLKTAACAKHLA 167

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+A  S +D+E+TY   F+A V E KV  VM +YN+VNG+P CA
Sbjct: 168 VH-----SGPEAIRHEFDAVASPKDMEETYLPAFEALVKEAKVEGVMGAYNRVNGEPACA 222

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
              ++      +W  DGY VSDC ++   +     T+T  E+AA A+K G DL+CG    
Sbjct: 223 SKFLMGKL--DEWGFDGYFVSDCWAIRDFHTNHMVTKTAPESAAMALKLGCDLNCGN-TY 279

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           +H   A   GL+ +ED+  A  + +  ++RLGMFD E     +  L    V    ++  A
Sbjct: 280 LHLLHAYNEGLINDEDIKKACTHLMRTRVRLGMFDDETE---YDKLDYSIVANEENKAYA 336

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
            + + + +V+LKN+   LPL   +  T+ VIGPN+D    + GNY G A  Y T L+GI 
Sbjct: 337 RKCSERSMVMLKNNG-ILPLDPSKIKTIGVIGPNADSRPALEGNYNGRADRYITFLEGIQ 395

Query: 306 R-------YAKTIH--QAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA---- 352
                   Y++  H  +  C G+A   ++L   AE+    +D  VL +GLD +IE     
Sbjct: 396 DAFGGRVLYSEGSHLYKDRCMGLAVADDRL-SEAEIVTEHSDVVVLCVGLDATIEGEEGD 454

Query: 353 ---EFI--DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILW 407
              EF   D+  L LP  Q++LV  V +  + PV++V   G  ++V     +    A++ 
Sbjct: 455 TGNEFSSGDKNDLRLPEAQRKLVETVMRKGK-PVIIVTAAGSAINV-----EADCDALIH 508

Query: 408 VGYPGQAGGAAIADVLFGRANP 429
             YPGQ GG A+AD+LFG+ +P
Sbjct: 509 AWYPGQFGGTALADILFGKISP 530


>gi|292495632|sp|Q0CMH8.2|XYND_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
          Length = 793

 Score =  270 bits (691), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 165/432 (38%), Positives = 241/432 (55%), Gaps = 11/432 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPV-LTGKYAASYVRGLQGNTGSR-LKVA 60
           N G  GL  ++PN+N FR P WGRGQETPGED   L   YA  Y+ G+QG      LK+ 
Sbjct: 162 NVGRYGLDTYAPNINSFRHPVWGRGQETPGEDAYYLASTYAYEYITGIQGGVDPETLKLV 221

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KHY  YD++NW+G  R   + ++++QDL + Y   F     + KV SVMCSYN VNG
Sbjct: 222 ATAKHYAGYDIENWDGHSRLGNDMQITQQDLSEYYTPQFLVSARDAKVHSVMCSYNAVNG 281

Query: 121 KPTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+C++   L+  +   +    DGY+  DC +V   +N   Y      A+AD+I+AG D+
Sbjct: 282 VPSCSNSFFLQTLLRETFGFVEDGYVSGDCGAVYNAFNPHEYAANESSASADSIRAGTDI 341

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     H   A   G +  +D+   +    T  +RLG FDG  S   + +L   DV T
Sbjct: 342 DCGTSYQYHFTNAFDEGEISRQDIERGVIRLYTNLVRLGYFDGNSSQ--YRDLTWSDVQT 399

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
                ++ +AA +G VLLKN   TLPL+ ++R  +VA+IGP ++ T  M GNY G A   
Sbjct: 400 TDAWNISHEAAVEGTVLLKNDG-TLPLADSIR--SVALIGPWANATTQMQGNYYGPAPYL 456

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           T+PL  +      +H A    ++         A  AAR+ADA +   G+D +IE E +DR
Sbjct: 457 TSPLAALEASDLDVHYAFGTNISSTTTAGFADALAAARKADAIIFAGGIDNTIEGEALDR 516

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +  PG Q +L+++++   + P+V++ M GG VD S  K++  + A+LW GYPGQ+GG 
Sbjct: 517 MNITWPGNQLDLINQLSALGK-PLVVLQMGGGQVDSSALKHNTNVSALLWGGYPGQSGGT 575

Query: 418 AIADVLFGRANP 429
           A+ D++ G   P
Sbjct: 576 ALLDIIRGVRAP 587


>gi|70996610|ref|XP_753060.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
 gi|74672055|sp|Q4WRB0.1|XYND_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|66850695|gb|EAL91022.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
          Length = 792

 Score =  270 bits (691), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 155/431 (35%), Positives = 237/431 (54%), Gaps = 9/431 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPV-LTGKYAASYVRGLQGNTG-SRLKVA 60
           N G  GL  ++PN+N FR   WGRGQETPGED   L   YA  Y+ G+QG      LK+ 
Sbjct: 162 NVGRYGLDVYAPNINAFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQGGVDPEHLKLV 221

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KHY  YDL+NW+G  R   +  +++Q+L + Y   F     + KV SVMCSYN VNG
Sbjct: 222 ATAKHYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVNG 281

Query: 121 KPTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+CA+   L+  +   +    DGY+ SDCDS   ++N   +      AAAD+I+AG D+
Sbjct: 282 VPSCANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANITGAAADSIRAGTDI 341

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     +   A     +   ++   +    +  +RLG FDG  S   + +L   DV T
Sbjct: 342 DCGTTYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSV--YRDLTWNDVVT 399

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
                ++ +AA +GIVLLKN   TLPL+     +VA+IGP  +VT  + GNY G A    
Sbjct: 400 TDAWNISYEAAVEGIVLLKNDG-TLPLAK-SVRSVALIGPWMNVTTQLQGNYFGPAPYLI 457

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           +PL         ++ A    ++ +       A  AA+++D  +   G+D ++EAE +DR 
Sbjct: 458 SPLNAFQNSDFDVNYAFGTNISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDRM 517

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            +  PG Q +L+ ++++  + P++++ M GG VD S  K++  + +++W GYPGQ+GG A
Sbjct: 518 NITWPGNQLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQA 576

Query: 419 IADVLFGRANP 429
           + D++ G+  P
Sbjct: 577 LLDIITGKRAP 587


>gi|218186207|gb|EEC68634.1| hypothetical protein OsI_37026 [Oryza sativa Indica Group]
          Length = 1241

 Score =  270 bits (691), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 151/319 (47%), Positives = 194/319 (60%), Gaps = 30/319 (9%)

Query: 1    MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ---------- 50
            MYN G  GLTYWSPN+N+ RDPRWGR  ETPGEDP + G+YA ++VRG+Q          
Sbjct: 925  MYNMGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQDIPGHEAVAA 984

Query: 51   -GNTGSR-LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKV 108
             G+  +R LK +ACCKHY AYDLD+W+   R+ F+ARV ++D+ +T+  PF+ CV +G V
Sbjct: 985  GGDPNTRPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPFEMCVRDGDV 1044

Query: 109  ASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYT-RTPEEA 167
            +SVMCSYN+VNG P CAD  +L  TI   W L GYIVSDCD+V V+ +   +   T  EA
Sbjct: 1045 SSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNATWLGYTGAEA 1104

Query: 168  AADAIKAGLDLDCGP-------------FLAIHTEGAVRGGLLREEDVNLALAYTITVQM 214
            +A A+KAGLDLDCG              FL  +   AV  G +RE D++ AL       M
Sbjct: 1105 SAAALKAGLDLDCGESWKNETDGHPLMDFLTTYGMEAVNKGKMRESDIDNALTNQYMTLM 1164

Query: 215  RLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVA 274
            RLG FD       + +LG +D+CT  H+ LAL  A QGIVLLKN  + LPL   +   V 
Sbjct: 1165 RLGYFD---DIAQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDANKVGFVN 1221

Query: 275  VIGPNSDV-TVTMIGNYAG 292
            V GP+       M G+Y G
Sbjct: 1222 VRGPHVQAPEKIMDGDYTG 1240


>gi|292495282|sp|B0XP71.1|XYND_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|159131796|gb|EDP56909.1| beta-xylosidase XylA [Aspergillus fumigatus A1163]
          Length = 792

 Score =  270 bits (690), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 155/431 (35%), Positives = 237/431 (54%), Gaps = 9/431 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPV-LTGKYAASYVRGLQGNTG-SRLKVA 60
           N G  GL  ++PN+N FR   WGRGQETPGED   L   YA  Y+ G+QG      LK+ 
Sbjct: 162 NVGRYGLDVYAPNINAFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQGGVDPEHLKLV 221

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KHY  YDL+NW+G  R   +  +++Q+L + Y   F     + KV SVMCSYN VNG
Sbjct: 222 ATAKHYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVNG 281

Query: 121 KPTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+CA+   L+  +   +    DGY+ SDCDS   ++N   +      AAAD+I+AG D+
Sbjct: 282 VPSCANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANITGAAADSIRAGTDI 341

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     +   A     +   ++   +    +  +RLG FDG  S   + +L   DV T
Sbjct: 342 DCGTTYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSV--YRDLTWNDVVT 399

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
                ++ +AA +GIVLLKN   TLPL+     +VA+IGP  +VT  + GNY G A    
Sbjct: 400 TDAWNISYEAAVEGIVLLKNDG-TLPLAK-SVRSVALIGPWMNVTTQLQGNYFGPAPYLI 457

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           +PL         ++ A    ++ +       A  AA+++D  +   G+D ++EAE +DR 
Sbjct: 458 SPLNAFQNSDFDVNYAFGTNISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDRM 517

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            +  PG Q +L+ ++++  + P++++ M GG VD S  K++  + +++W GYPGQ+GG A
Sbjct: 518 NITWPGNQLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQA 576

Query: 419 IADVLFGRANP 429
           + D++ G+  P
Sbjct: 577 LLDIITGKRAP 587


>gi|76160898|gb|ABA40420.1| Xld [Aspergillus fumigatus]
          Length = 792

 Score =  270 bits (690), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 155/431 (35%), Positives = 237/431 (54%), Gaps = 9/431 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPV-LTGKYAASYVRGLQGNTG-SRLKVA 60
           N G  GL  ++PN+N FR   WGRGQETPGED   L   YA  Y+ G+QG      LK+ 
Sbjct: 162 NVGRYGLDVYAPNINAFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQGGVDPEHLKLV 221

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KHY  YDL+NW+G  R   +  +++Q+L + Y   F     + KV SVMCSYN VNG
Sbjct: 222 ATAKHYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVNG 281

Query: 121 KPTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+CA+   L+  +   +    DGY+ SDCDS   ++N   +      AAAD+I+AG D+
Sbjct: 282 VPSCANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANITGAAADSIRAGTDI 341

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     +   A     +   ++   +    +  +RLG FDG  S   + +L   DV T
Sbjct: 342 DCGTTYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSV--YRDLTWNDVVT 399

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
                ++ +AA +GIVLLKN   TLPL+     +VA+IGP  +VT  + GNY G A    
Sbjct: 400 TDAWNISYEAAVEGIVLLKNDG-TLPLAK-SVRSVALIGPWMNVTTQLQGNYFGPAPYLI 457

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           +PL         ++ A    ++ +       A  AA+++D  +   G+D ++EAE +DR 
Sbjct: 458 SPLNAFQNSDFDVNYAFGTNISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDRM 517

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            +  PG Q +L+ ++++  + P++++ M GG VD S  K++  + +++W GYPGQ+GG A
Sbjct: 518 NITWPGNQLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQA 576

Query: 419 IADVLFGRANP 429
           + D++ G+  P
Sbjct: 577 LLDIITGKRAP 587


>gi|332638085|ref|ZP_08416948.1| glycoside hydrolase family 3 protein [Weissella cibaria KACC 11862]
          Length = 713

 Score =  270 bits (690), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 168/445 (37%), Positives = 241/445 (54%), Gaps = 36/445 (8%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPNVNIFRDPRWGRG ET GEDP LT K+  ++++GLQG     LK+AA  KH+ 
Sbjct: 108 GLTFWSPNVNIFRDPRWGRGHETYGEDPFLTSKFGVAFIKGLQGQ-AKYLKLAATAKHFA 166

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +      G+ R+ F+A VS +DL +TY   FKA V E  V S+M +YN V+G P     
Sbjct: 167 VHS--GPEGL-RHGFDAVVSDKDLYETYLPAFKAAVEEADVESIMTAYNAVDGVPASVSE 223

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPF-LAI 186
            +L++ +H +W  +G++VSD  +   ++    YT+   E    AIKAGL+L  G    ++
Sbjct: 224 MLLRDILHDKWSFEGHVVSDYMAPEDVHENHKYTKDAAETMGLAIKAGLNLVAGHIEQSL 283

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMF--DGEPSAQPFGNLGPRDVCTPAHQQL 244
           H   A+  GL+ EE++  A+      ++RLGMF  D E  A P+         T AH  L
Sbjct: 284 HE--ALNRGLVTEEEITNAVISLYATRVRLGMFATDNEYDAIPY-----EANDTKAHNNL 336

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           +  AA +  VLLKN    LPL       +AV+GPN+   + ++GNY G      T L+GI
Sbjct: 337 SEIAAEKSFVLLKNDG-VLPLRKETMEAIAVVGPNAHSEIALLGNYFGTPSRSYTILEGI 395

Query: 305 SRYAKT---IHQAGCFGV--------ACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
                    +H +   GV            ++    A +AA  +D  V V+GLD +IE E
Sbjct: 396 QERLGDDVRVHYSIGSGVFQDHAAEPLAKADERESEAIIAAEHSDVIVAVLGLDSTIEGE 455

Query: 354 ---------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGA 404
                      D+  L LPGRQ++L+ R+    + PVV++L  G  + +   +N P + A
Sbjct: 456 EGDAGNSQGAGDKPNLSLPGRQRQLLERLLAVGK-PVVVLLASGSSLQLDGLENHPNLRA 514

Query: 405 ILWVGYPGQAGGAAIADVLFGRANP 429
           I+ + YPG  GG A+ADVLFG  +P
Sbjct: 515 IMQIWYPGARGGLAVADVLFGTVSP 539


>gi|348684865|gb|EGZ24680.1| family 3 glycoside hydrolase [Phytophthora sojae]
          Length = 769

 Score =  270 bits (690), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 174/445 (39%), Positives = 244/445 (54%), Gaps = 26/445 (5%)

Query: 2   YNGG-MAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR---- 56
           Y GG   GL  WSPN+NI RDPRWGR  ETP EDP++  KY  +Y RGLQ   G R    
Sbjct: 141 YKGGPHLGLDCWSPNININRDPRWGRNTETPSEDPLVNSKYGVAYTRGLQ--EGKRQDPR 198

Query: 57  -LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSY 115
            L+     KHY AY  +N+ GV+R  F+A VS  D  DTY   F++ VV+G    VMCSY
Sbjct: 199 FLQAVVTLKHYAAYSYENYGGVNRMEFDAIVSPYDFADTYFPAFRSSVVDGNAKGVMCSY 258

Query: 116 NQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAG 175
           N VNG P CA+ ++++  + G    DGY+ SD  +V  + +  HY  +  EAA  AI AG
Sbjct: 259 NSVNGIPMCANKELVETLLRGTLGFDGYVTSDSGAVEAISDMHHYADSQCEAARLAILAG 318

Query: 176 LDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD 235
            D++ G       +  V    L E+ ++ AL +T+ ++  LG+FD     QP+ N+ P +
Sbjct: 319 TDINSGKSYEACLKTLVDDNQLEEKALDDALRHTLKLRFELGLFD-PIDDQPYWNVTPSE 377

Query: 236 VCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC 295
           V T A + L+L A  + +V+L+N+A  LPL   +   +AV+GP++     ++GNY G  C
Sbjct: 378 VNTAAAKALSLNATRKSLVMLQNNASVLPLQ--KGVKLAVLGPHAKSKRGLLGNYLGQMC 435

Query: 296 G--------YTTPLQGI---SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVM 344
                      TPL  I   +  + T    GC G++ N       A  AA++ADA VL +
Sbjct: 436 HGDYDEVGCVQTPLDAIRAANGASNTTFAEGC-GISGNSTAGFEKAVAAAKEADAVVLFL 494

Query: 345 GLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGA 404
           G+D+SIE E  DR  + LP  Q +L+ RV    R P V+VL+ GG +     +   R  A
Sbjct: 495 GIDKSIEGEVGDRNNIDLPNIQMQLLQRVHAVGR-PTVVVLINGGVIGAE--EIIERTDA 551

Query: 405 ILWVGYPGQAGGAAIADVLFGRANP 429
           ++   YPG  G  A+ADVLFG  NP
Sbjct: 552 LVEAFYPGFFGARAMADVLFGDTNP 576


>gi|390630430|ref|ZP_10258413.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
 gi|390484359|emb|CCF30761.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
          Length = 674

 Score =  270 bits (690), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 168/445 (37%), Positives = 239/445 (53%), Gaps = 36/445 (8%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPNVNIFRDPRWGRG ET GEDP LT K+  ++++GLQG     LK+AA  KH+ 
Sbjct: 69  GLTFWSPNVNIFRDPRWGRGHETYGEDPFLTSKFGVAFIKGLQGQ-AKYLKLAATAKHFA 127

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +      G+ R+ F+A VS +DL +TY   FKA V E  V S+M +YN V+G P     
Sbjct: 128 VHS--GPEGL-RHGFDAVVSDKDLYETYLPAFKAAVEEADVESIMTAYNAVDGVPASVSE 184

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPF-LAI 186
            +LK+ +H +W  +G++VSD  +   ++    YT+   E    AIKAGL+L  G    ++
Sbjct: 185 MLLKDILHDKWSFEGHVVSDYMAPEDVHENHKYTKDAAETMGLAIKAGLNLVAGHIEQSL 244

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMF--DGEPSAQPFGNLGPRDVCTPAHQQL 244
           H   A+  GL+ EE++  A+      ++RLGMF  D E  A P+         T AH  L
Sbjct: 245 HE--ALDRGLVTEEEITNAVISLYATRVRLGMFATDNEYDAIPY-----EANDTKAHNNL 297

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           +  AA +  VLLKN    LPL       +AV+GPN+   + ++GNY G      T L+GI
Sbjct: 298 SEIAAEKSFVLLKNDG-VLPLRKETMEAIAVVGPNAHSEIALLGNYFGTPSRSYTILEGI 356

Query: 305 -SRYAKTIHQAGCFGVA----------CNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
             R    +      G               ++    A +AA  +D  V V+GLD +IE E
Sbjct: 357 QERLGDDVRVHYSIGSGLFQDHAAEPLAKADERESEAVIAAEHSDVVVAVLGLDSTIEGE 416

Query: 354 ---------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGA 404
                      D+  L LPGRQ++L+ R+    + PVV++L  G  + +   +N P + A
Sbjct: 417 EGDAGNSQGAGDKPNLSLPGRQRQLLERLLAVGK-PVVVLLASGSSLQLDGLENHPNLRA 475

Query: 405 ILWVGYPGQAGGAAIADVLFGRANP 429
           I+ + YPG  GG A+ADVLFG  +P
Sbjct: 476 IMQIWYPGARGGLAVADVLFGAVSP 500


>gi|374316077|ref|YP_005062505.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
           str. Grapes]
 gi|359351721|gb|AEV29495.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
           str. Grapes]
          Length = 701

 Score =  270 bits (690), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 165/448 (36%), Positives = 247/448 (55%), Gaps = 36/448 (8%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G   GLT WSPNVNIFRDPRWGRGQET GEDP L  +   S+++GLQG+ G  LK AAC 
Sbjct: 100 GIYKGLTLWSPNVNIFRDPRWGRGQETYGEDPYLASQLGVSFIQGLQGD-GPYLKTAACV 158

Query: 64  KHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
           KH+  +     +G +  R+ FNA VS++DL +TY   F+ACV EG+V +VM +Y+ VNG+
Sbjct: 159 KHFAVH-----SGPEPLRHDFNAIVSRKDLYETYLPAFEACVKEGEVNAVMGAYSAVNGE 213

Query: 122 PTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG 181
           P C  P ++ + +   W  +G  +SDC ++   +     T+   ++ A A+ AG DL+CG
Sbjct: 214 PCCGSPFLITDILRNDWGFEGMYISDCWAIRDFHLNHAVTKNQVDSVALALNAGCDLNCG 273

Query: 182 -PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
             +L++  E A + GL+  + +  A    +T +  LG+F  + +   + N+G     T  
Sbjct: 274 CEYLSL--EKAYQQGLIDRKTITQACIRVMTTRFALGLFSEDCT---YSNIGYEQNDTEE 328

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H+++A +A+   +VLLKN    LPL +   H +A+IGPN+D    + GNY G +  YTT 
Sbjct: 329 HRKVAFKASCNSLVLLKNDG-MLPLDSRSLHAIAIIGPNADSREALWGNYHGTSSTYTTV 387

Query: 301 LQGISR-YAKTIHQAGCFGVACNGNQL---------IGAAEVAARQADATVLVMGLDQSI 350
           L+G  +   +++      G A    +L         I  A   A  +D  +L +G D+++
Sbjct: 388 LEGFRKTLGESVKVKYSQGSAIQKEKLERLAEPNDRIAEAIAVATVSDTIILCLGYDETV 447

Query: 351 EAEFI---------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPR 401
           E E           D+  L LP  Q+ L+  VA   + P+VLVL+ GG +D    +  P 
Sbjct: 448 EGEMHDDGNGGWAGDKQDLRLPPCQRALLKAVASTGK-PIVLVLLSGGAIDPEIERF-PN 505

Query: 402 IGAILWVGYPGQAGGAAIADVLFGRANP 429
           + A+L   YPGQ GG AIA  + G  NP
Sbjct: 506 VKALLQGWYPGQEGGLAIAHTILGLNNP 533


>gi|348684866|gb|EGZ24681.1| hypothetical protein PHYSODRAFT_325770 [Phytophthora sojae]
          Length = 805

 Score =  270 bits (690), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 174/445 (39%), Positives = 244/445 (54%), Gaps = 26/445 (5%)

Query: 2   YNGG-MAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR---- 56
           Y GG   GL  WSPN+NI RDPRWGR  ETP EDP++  KY  +Y RGLQ   G R    
Sbjct: 142 YKGGPHLGLDCWSPNININRDPRWGRNTETPSEDPLVNSKYGVAYTRGLQ--EGKRQDPR 199

Query: 57  -LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSY 115
            L+     KHY AY  +N+ GV+R  F+A VS  D  DTY   F++ VV+G    VMCSY
Sbjct: 200 FLQAVVTLKHYAAYSYENYGGVNRMEFDAIVSPYDFADTYFPAFRSSVVDGNAKGVMCSY 259

Query: 116 NQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAG 175
           N VNG P CA+ ++++  + G    DGY+ SD  +V  + +  HY  +  EAA  AI AG
Sbjct: 260 NSVNGIPMCANKELVETLLRGTLGFDGYVTSDSGAVEAISDMHHYADSQCEAARLAILAG 319

Query: 176 LDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD 235
            D++ G       +  V    L E+ ++ AL +T+ ++  LG+FD     QP+ N+ P +
Sbjct: 320 TDINSGKSYEACLKTLVDDNQLEEKALDDALRHTLKLRFELGLFD-PIDDQPYWNVTPSE 378

Query: 236 VCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC 295
           V T A + L+L A  + +V+L+N+A  LPL   +   +AV+GP++     ++GNY G  C
Sbjct: 379 VNTAAAKALSLNATRKSLVMLQNNASVLPLQ--KGVKLAVLGPHAKSKRGLLGNYLGQMC 436

Query: 296 G--------YTTPLQGI---SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVM 344
                      TPL  I   +  + T    GC G++ N       A  AA++ADA VL +
Sbjct: 437 HGDYDEVGCVQTPLDAIRAANGASNTTFAEGC-GISGNSTAGFEKAVAAAKEADAVVLFL 495

Query: 345 GLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGA 404
           G+D+SIE E  DR  + LP  Q +L+ RV    R P V+VL+ GG +     +   R  A
Sbjct: 496 GIDKSIEGEVGDRNNIDLPNIQMQLLQRVHAVGR-PTVVVLINGGVIGAE--EIIERTDA 552

Query: 405 ILWVGYPGQAGGAAIADVLFGRANP 429
           ++   YPG  G  A+ADVLFG  NP
Sbjct: 553 LVEAFYPGFFGARAMADVLFGDTNP 577


>gi|115397385|ref|XP_001214284.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
 gi|114192475|gb|EAU34175.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
          Length = 776

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 165/432 (38%), Positives = 241/432 (55%), Gaps = 11/432 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPV-LTGKYAASYVRGLQGNTGSR-LKVA 60
           N G  GL  ++PN+N FR P WGRGQETPGED   L   YA  Y+ G+QG      LK+ 
Sbjct: 180 NVGRYGLDTYAPNINSFRHPVWGRGQETPGEDAYYLASTYAYEYITGIQGGVDPETLKLV 239

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KHY  YD++NW+G  R   + ++++QDL + Y   F     + KV SVMCSYN VNG
Sbjct: 240 ATAKHYAGYDIENWDGHSRLGNDMQITQQDLSEYYTPQFLVSARDAKVHSVMCSYNAVNG 299

Query: 121 KPTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+C++   L+  +   +    DGY+  DC +V   +N   Y      A+AD+I+AG D+
Sbjct: 300 VPSCSNSFFLQTLLRETFGFVEDGYVSGDCGAVYNAFNPHEYAANESSASADSIRAGTDI 359

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     H   A   G +  +D+   +    T  +RLG FDG  S   + +L   DV T
Sbjct: 360 DCGTSYQYHFTNAFDEGEISRQDIERGVIRLYTNLVRLGYFDGNSSQ--YRDLTWSDVQT 417

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
                ++ +AA +G VLLKN   TLPL+ ++R  +VA+IGP ++ T  M GNY G A   
Sbjct: 418 TDAWNISHEAAVEGTVLLKNDG-TLPLADSIR--SVALIGPWANATTQMQGNYYGPAPYL 474

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           T+PL  +      +H A    ++         A  AAR+ADA +   G+D +IE E +DR
Sbjct: 475 TSPLAALEASDLDVHYAFGTNISSTTTAGFADALAAARKADAIIFAGGIDNTIEGEALDR 534

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +  PG Q +L+++++   + P+V++ M GG VD S  K++  + A+LW GYPGQ+GG 
Sbjct: 535 MNITWPGNQLDLINQLSALGK-PLVVLQMGGGQVDSSALKHNTNVSALLWGGYPGQSGGT 593

Query: 418 AIADVLFGRANP 429
           A+ D++ G   P
Sbjct: 594 ALLDIIRGVRAP 605


>gi|358382857|gb|EHK20527.1| hypothetical protein TRIVIDRAFT_192759 [Trichoderma virens Gv29-8]
          Length = 860

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 169/432 (39%), Positives = 241/432 (55%), Gaps = 18/432 (4%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAA 61
           N G AGL +W+PN+N FRDPRWGRG ETPGED      Y  S + GLQG       +  +
Sbjct: 243 NYGFAGLDFWTPNINPFRDPRWGRGMETPGEDAFRIQGYVLSLINGLQGGIDPDFFRTIS 302

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+ AYD++N     R   N   ++QD+ D Y   F+ CV + KV S+MC+YN VNG 
Sbjct: 303 TCKHFAAYDIEN----GRTANNLSPTQQDMADYYLPMFETCVRDAKVGSIMCAYNSVNGV 358

Query: 122 PTCADPDILKNTIHGQWRLD---GYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           P CAD  +L++ +   +       Y+VSDCD+V  +Y+  HY     +AAA ++ AG DL
Sbjct: 359 PACADSYLLQSVLRDGYGFTEDFNYVVSDCDAVENVYDPHHYAANLTQAAAMSLNAGTDL 418

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG    +    +V+ G+  E  ++ +L    +  +++G FD +P+   + +LG  +V T
Sbjct: 419 DCGSSYNV-LNASVQAGMTTEATLDKSLIRLYSALIKVGWFD-QPAK--YSSLGWGNVNT 474

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
              + LA  AA  G+ LLKN   TLPLS TL++  VAVIGP  + T  + GNYAG A   
Sbjct: 475 TQTRALAHDAATGGMTLLKNDG-TLPLSPTLQN--VAVIGPWVNATTQLQGNYAGTAPVL 531

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
             PL    +  + +  A    +         AA  AA  +D  V + G+D S+E E  DR
Sbjct: 532 VNPLTVFQQKWRNVKYAQGTAINSQDTSGFNAAISAASSSDVIVYLGGIDISVENEGFDR 591

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +  PG Q  L+S++A   + P+V+V   GG +D S   ++ ++ +ILW GYPGQ GG 
Sbjct: 592 TAITWPGNQLSLISQLANLGK-PLVIVQFGGGQIDDSSLLSNSKVNSILWAGYPGQEGGN 650

Query: 418 AIADVLFGRANP 429
           A+ DVL G ANP
Sbjct: 651 ALFDVLTG-ANP 661


>gi|189201569|ref|XP_001937121.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187984220|gb|EDU49708.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 756

 Score =  269 bits (688), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 158/429 (36%), Positives = 235/429 (54%), Gaps = 11/429 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           NGG+A + +W+P++N FRDPRWGRG ETPGED +    Y  S + GL+G+   R K+ A 
Sbjct: 135 NGGIAPVDFWTPDINPFRDPRWGRGSETPGEDILRIKGYTKSLLSGLEGDKAQR-KIIAT 193

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKHY  YD+++WNG DR+ F+A+++ QDL + +  PF+ C  + KV S MCSYN VNG P
Sbjct: 194 CKHYVGYDMEDWNGTDRHSFDAKITTQDLAEYFMPPFQQCARDSKVGSFMCSYNAVNGVP 253

Query: 123 TCADPDILKNTIHGQWRL---DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           TCAD  +L++ +   W     + YI SDC++V  +     Y  T +EA A A   G+DL 
Sbjct: 254 TCADTYVLEDILRKHWNWTDSNNYITSDCEAVKDISLRHKYVATLQEATAIAFNNGMDLS 313

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           C    +    GA   GLL    ++ AL       +  G FDG  +A  + NLG +D+ TP
Sbjct: 314 CEYSGSSDIPGAFSQGLLNVSVIDRALTRQYEGLVHAGYFDG--AAATYANLGVQDINTP 371

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
             Q+L LQ A +G+ LLKN   TLPLS      VA++G  ++ +  + G Y+G A     
Sbjct: 372 EAQKLVLQVAAEGLTLLKND-DTLPLSLKSGSKVAMVGFWANDSSKLSGIYSGPAPYLHN 430

Query: 300 PLQGISRYAKTIHQAGCFGVACNG--NQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           P+   ++    +  A    +  +G  +     A  AA+++D  +   GLD S  AE  DR
Sbjct: 431 PVYAGNKLGLDMAVATGPILQKSGAADNWTTKALDAAKKSDTILYFGGLDPSAAAEGSDR 490

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             +  P  Q +L++++A   +  VV+ L  G  VD     N   + +++W  +PGQ GG 
Sbjct: 491 TDISWPSAQIDLITKLAALGKPLVVIAL--GDMVDHMPILNMKGVNSLIWANWPGQDGGT 548

Query: 418 AIADVLFGR 426
           A+  V+ G 
Sbjct: 549 AVMQVITGE 557


>gi|358397360|gb|EHK46735.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
           206040]
          Length = 865

 Score =  269 bits (688), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 174/434 (40%), Positives = 242/434 (55%), Gaps = 22/434 (5%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG-SRLKVAA 61
           N G AGL +W+PN+N FRDPRWGRG ETPGED      Y  + + GLQG       ++ A
Sbjct: 248 NYGFAGLDFWTPNINPFRDPRWGRGMETPGEDAFRIQGYVLALISGLQGGINPDFFRIIA 307

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+ AYD++N     R   N   ++QD+ D Y   F+ CV + KV SVMC+YN V+G 
Sbjct: 308 TCKHFAAYDIEN----GRTGNNLNPTQQDMADYYLPMFETCVRDAKVGSVMCAYNAVDGI 363

Query: 122 PTCADPDILKNTIHGQWRLD---GYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           P CA   +L++ +   +       Y+VSDCD+V  +++  HY     EAAA ++ AG DL
Sbjct: 364 PACASEYLLQDVLRDGFGFTEDFNYVVSDCDAVDNVFDPHHYASNLTEAAALSLNAGTDL 423

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG    +    +V   L  E  +N +L    +  +++G FD +PS   + +L   +V T
Sbjct: 424 DCGSSYNV-LNASVEAALTSEAALNQSLVRLYSALIKVGYFD-QPSE--YKSLSWANVNT 479

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
             +Q LA  AA  G+ LLKN   TLPLS TL +  VA+IGP  + T  M GNYAG A   
Sbjct: 480 TQNQALAHDAATGGMTLLKNDG-TLPLSRTLSN--VAIIGPWVNATTQMQGNYAGTAPFL 536

Query: 298 TTPLQGISRYAKTIHQAGCFGVACNGNQLIG--AAEVAARQADATVLVMGLDQSIEAEFI 355
             PL    +    +  A   G A N     G  AA  AA  +D  V + G+D ++E E  
Sbjct: 537 VNPLDVFQQKWGNVKYAQ--GTAINSQDTSGFSAALSAASSSDVIVYLGGIDITVENEGF 594

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  ++ PG Q +L+S++A   + P+V+V   GG +D S   ++P + +ILW GYPGQ G
Sbjct: 595 DRGSIVWPGNQLDLISQLANLGK-PLVIVQFGGGQIDDSSLLSNPNVRSILWAGYPGQDG 653

Query: 416 GAAIADVLFGRANP 429
           G A+ DVL G ANP
Sbjct: 654 GNAVFDVLTG-ANP 666


>gi|2791278|emb|CAA93248.1| beta-xylosidase [Trichoderma reesei]
 gi|340519464|gb|EGR49702.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
          Length = 797

 Score =  268 bits (684), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 160/431 (37%), Positives = 242/431 (56%), Gaps = 10/431 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPV-LTGKYAASYVRGLQGNTG-SRLKVA 60
           N G  GL  ++PNVN FR P WGRGQETPGED   L+  Y   Y+ G+QG      LKVA
Sbjct: 163 NSGRYGLDVYAPNVNGFRSPLWGRGQETPGEDAFFLSSAYTYEYITGIQGGVDPEHLKVA 222

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KH+  YDL+NWN   R  F+A +++QDL + Y   F A     K  S+MC+YN VNG
Sbjct: 223 ATVKHFAGYDLENWNNQSRLGFDAIITQQDLSEYYTPQFLAAARYAKSRSLMCAYNSVNG 282

Query: 121 KPTCADPDILKNTIHGQWRLD--GYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+CA+   L+  +   W     GY+ SDCD+V  ++N   Y      AAA +++AG D+
Sbjct: 283 VPSCANSFFLQTLLRESWGFPEWGYVSSDCDAVYNVFNPHDYASNQSSAAASSLRAGTDI 342

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     H   +   G +   ++  ++       +RLG FD +     + +LG +DV  
Sbjct: 343 DCGQTYPWHLNESFVAGEVSRGEIERSVTRLYANLVRLGYFDKK---NQYRSLGWKDVVK 399

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
                ++ +AA +GIVLLKN   TLPLS  +  ++A+IGP ++ T  M GNY G A    
Sbjct: 400 TDAWNISYEAAVEGIVLLKNDG-TLPLSK-KVRSIALIGPWANATTQMQGNYYGPAPYLI 457

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           +PL+   +    ++      +A N       A  AA+++DA + + G+D +IE E  DR 
Sbjct: 458 SPLEAAKKAGYHVNFELGTEIAGNSTTGFAKAIAAAKKSDAIIYLGGIDNTIEQEGADRT 517

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            +  PG Q +L+ ++++  + P+V++ M GG VD S  K++ ++ +++W GYPGQ+GG A
Sbjct: 518 DIAWPGNQLDLIKQLSEVGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVA 576

Query: 419 IADVLFGRANP 429
           + D+L G+  P
Sbjct: 577 LFDILSGKRAP 587


>gi|402074909|gb|EJT70380.1| hypothetical protein GGTG_11406 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 793

 Score =  268 bits (684), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 173/450 (38%), Positives = 241/450 (53%), Gaps = 30/450 (6%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAA 61
           NG  +GL YW+PNVN F+DPRWGRG ETPGED +   +YAAS ++GL+G +     +V +
Sbjct: 147 NGRWSGLDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIKGLEGPHPEKERRVVS 206

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKHY A D ++WNG  R+ F+AR+S QDL + Y +PF+ C  + +V S+MC+YN VNG 
Sbjct: 207 TCKHYAANDFEDWNGTSRHDFDARISAQDLAEYYLMPFQQCARDSRVGSIMCAYNAVNGV 266

Query: 122 PTCADPDILKNTIHGQWRLDG---YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           P+CA+  +L   +   W   G   Y+ SDC++V  +     Y RT  E  A   +AG D 
Sbjct: 267 PSCANSYLLDTVLRKHWGWTGHNNYVTSDCEAVLDVSAGHKYARTNAEGTAMCFEAGTDT 326

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
            C    +    GA   GLLREE ++ AL       +R+G FDG  SA  F ++   DV  
Sbjct: 327 SCEYTPSSDIRGAYAQGLLREETMDRALLRLYEGLVRVGYFDGNSSA--FSDISWADVNA 384

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPL-------------STLRHHTVAVIGPNSDVTVT 285
           PA Q L+LQ+A +GIV+LKN   TLPL             S+     +A+IG  +D    
Sbjct: 385 PAAQDLSLQSAVEGIVMLKNDG-TLPLPLGAKCSSKSKKRSSSGGPKLAMIGFWADAPEK 443

Query: 286 MIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNG------NQLIGAAEVAARQADA 339
           + G Y+G A    TP     +    +  AG  G    G      +     A  AA  AD 
Sbjct: 444 LRGGYSGTAAYLRTPAYAARQMGLDVVTAG--GPVLQGAAAAAADNWTAPALAAAEGADY 501

Query: 340 TVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKND 399
            V   GLD++   E  DR  +  PG Q  LV R+A   + P+V+V M G  +D +    +
Sbjct: 502 IVYFGGLDETAAGENKDRWDVEWPGAQLALVKRLAALGK-PLVVVQM-GDQLDGTPLLAN 559

Query: 400 PRIGAILWVGYPGQAGGAAIADVLFGRANP 429
             +GA+LW  +PGQ GG A+  +L G A+P
Sbjct: 560 AGVGAVLWASWPGQDGGPAVMRLLSGAASP 589


>gi|367053033|ref|XP_003656895.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
 gi|347004160|gb|AEO70559.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
          Length = 758

 Score =  268 bits (684), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 176/437 (40%), Positives = 243/437 (55%), Gaps = 15/437 (3%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N G AGL +W+PN+N +RDPRWGRG ETPGEDPV    Y  S +RGL+G    + KV A 
Sbjct: 120 NAGSAGLDFWTPNINPWRDPRWGRGSETPGEDPVRIKGYVRSLLRGLEGEESIK-KVIAT 178

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKHY AYDL+ W+ + RY F+A VS QDL + Y  PF+ C  + KV S+MCSYN +NG P
Sbjct: 179 CKHYAAYDLERWHNITRYEFDAIVSLQDLSEYYLPPFQQCARDSKVGSIMCSYNSLNGTP 238

Query: 123 TCADPDILKNTIHGQWRL---DGYIVSDCDSVGVLYNTQH-YTRTPEEAAADAIKAGLDL 178
            CA+  ++ + +   WR    + YI SDC+++      +H +T+T  EAAA A  AG D 
Sbjct: 239 ACANTYLMDDILRKHWRWTEDNNYITSDCNAIKDFLPDEHNFTQTAAEAAAAAYTAGTDT 298

Query: 179 DC---GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD 235
            C   G        GA    LL EE ++ AL       +R G FD   SA P+ ++G  D
Sbjct: 299 VCEVAGSPPYTDVVGAYDQKLLSEEVIDRALRRLYEGLVRAGYFDPA-SASPYRDIGWSD 357

Query: 236 VCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC 295
           V T   Q LALQ+A  G+VLLKN   TLP+  L   TVA+IG  +  T +M+G Y+G+  
Sbjct: 358 VNTAEAQALALQSASDGLVLLKNDG-TLPIK-LEGKTVALIGHWASGTRSMLGGYSGIPP 415

Query: 296 GYTTPLQGISRYAKTI-HQAGCFGVACNGNQLIGA-AEVAARQADATVLVMGLDQSIEAE 353
            Y +P+    +   T  + +G    A        A A  AA ++D  +   GLDQS+ +E
Sbjct: 416 YYHSPVYAAGQLNLTYKYASGPVAPASAARDTWTADALSAANKSDVILYFGGLDQSVASE 475

Query: 354 FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 413
             DR  +  P  Q  L+  +  A  G  ++V+  G  VD +    +P + AILW GYPGQ
Sbjct: 476 DKDRDSIAWPPAQLTLIQTL--AGLGKPLVVIQLGDQVDDTPLLTNPNVSAILWAGYPGQ 533

Query: 414 AGGAAIADVLFGRANPG 430
           +GG A+ + + G + P 
Sbjct: 534 SGGTAVLNAITGVSPPA 550


>gi|373460527|ref|ZP_09552278.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
 gi|371955145|gb|EHO72949.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
          Length = 699

 Score =  268 bits (684), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 165/435 (37%), Positives = 246/435 (56%), Gaps = 26/435 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPN+NIFRDPRWGRG ET GEDP LTG    +YV+G+QGN    LKVAAC KHY 
Sbjct: 107 GLTFWSPNINIFRDPRWGRGMETYGEDPFLTGTLGIAYVQGMQGNDPFYLKVAACGKHYA 166

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +         R+  N   +K+DL +TY   FK  V +G V ++M +YN+V G+      
Sbjct: 167 VHSGPE---ATRHEANVSPTKRDLFETYLPAFKMLVQQGHVEAIMGAYNRVYGEACSGSK 223

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PFLAI 186
            +L + +  QW   G+IVSDCD+V  ++      +T  EA A AIKAGL+++CG  F A+
Sbjct: 224 YLLTDVLRKQWGFRGHIVSDCDAVADIHAGHKIVKTEAEACAIAIKAGLNIECGHTFEAM 283

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLAL 246
             + AV   LL E++++ AL   +  +++LG+ + +    P+  +   ++C+P H  LA 
Sbjct: 284 --KQAVAQKLLTEQEIDRALLPLMMTRLKLGILEYDAEC-PYNEVKETEICSPEHIALAR 340

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISR 306
           +AA + +VLLKN+   LPL     HT+ + GP +  +  ++GNY G++  Y T LQGI+ 
Sbjct: 341 KAATESMVLLKNNG-ILPLDK-NLHTLFIAGPGASDSFWLMGNYFGISNRYCTYLQGIAD 398

Query: 307 YAKT---IHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE---------F 354
              +   ++    FG +      I  A   A  A+ T++VMG + ++E E          
Sbjct: 399 KVSSGTAVNFRPAFGESTPTKNTINWALDEAIAAEKTIVVMGNNGNLEGEEGESIASETR 458

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIG-AILWVGYPGQ 413
            DR  + LP  Q + + R  KA +  +V+VL  G P+DV   +   R+  A++   YPGQ
Sbjct: 459 GDRVSMRLPASQMKFL-RDLKARKNGIVVVLTGGSPIDV---REISRLADAVVMAWYPGQ 514

Query: 414 AGGAAIADVLFGRAN 428
            GG A+AD+LFG  N
Sbjct: 515 EGGYALADLLFGDEN 529


>gi|348684872|gb|EGZ24687.1| family 3 glycoside hydrolase [Phytophthora sojae]
          Length = 805

 Score =  268 bits (684), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 170/438 (38%), Positives = 236/438 (53%), Gaps = 19/438 (4%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR-LKVAAC 62
           G   GL  WSPN+NI RDPRWGR  ETP EDP++  KY  +Y RGLQ     R L+    
Sbjct: 144 GPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTRGLQEGKDKRFLQAVVT 203

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
            KHY AY  ++++G+DR  FNA+VS+ D  DTY   F A VVEGK   VMCSYN VNG P
Sbjct: 204 LKHYAAYSYEHYDGIDRMAFNAQVSRYDFADTYLPAFHASVVEGKAKGVMCSYNSVNGMP 263

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP 182
            CA+  +    +      DGYI SD  ++  +Y  +HYT++  EA   AI +G D++ G 
Sbjct: 264 MCANEQLNTKLLREALGFDGYITSDSGAIEGIYRQRHYTKSLCEAGRLAIMSGTDVNSGS 323

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
                    V  G L E+ V+ A+  T+ ++  LG+FD     QP+ ++ P +V     +
Sbjct: 324 VYKKCLADLVTSGQLPEKAVDDAMRRTLKLRFELGLFD-PIDDQPYWHVAPSEVGKTESK 382

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG------ 296
           QL+L+   + IVLL+N    LPL   +   +AVIGP++     ++GNY G  C       
Sbjct: 383 QLSLELTRKSIVLLQNHGNVLPLR--KGKKLAVIGPHAKAKRALLGNYLGQMCHGDYLEV 440

Query: 297 --YTTPLQGISR---YAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIE 351
               TPL+ I+     + T++  G  G+         AAE AAR ADA VL +G+D SIE
Sbjct: 441 GCVQTPLEAITAANGASNTVYAKGS-GINDTSTADFDAAEAAARGADAVVLFLGIDTSIE 499

Query: 352 AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYP 411
            E  DR  + +P  Q +L+ RV +A + P V+VL  GG V     +       +    YP
Sbjct: 500 REAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFNGGVVGAE--ELILHTDGVAEAFYP 556

Query: 412 GQAGGAAIADVLFGRANP 429
           G  G  A++D+LFG A P
Sbjct: 557 GFFGAQAVSDILFGDAIP 574


>gi|255690205|ref|ZP_05413880.1| xylosidase/arabinosidase [Bacteroides finegoldii DSM 17565]
 gi|260624224|gb|EEX47095.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
            finegoldii DSM 17565]
          Length = 1425

 Score =  267 bits (683), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 165/437 (37%), Positives = 238/437 (54%), Gaps = 26/437 (5%)

Query: 9    LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
            LTYW+PN+NIFRDPRWGRGQET GEDP LT K   ++V GLQG+    LK +AC KHY  
Sbjct: 821  LTYWTPNINIFRDPRWGRGQETYGEDPYLTSKIGKAFVLGLQGDDPRYLKASACAKHYAV 880

Query: 69   YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            +     N   R+ FN+ VS  DL DTY   F+  VV+  V+ VMC+YN   G+P C +  
Sbjct: 881  HSGPEKN---RHSFNSDVSTYDLWDTYLPAFRTLVVDANVSGVMCAYNAFKGQPCCGNDL 937

Query: 129  ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHT 188
            ++++ +  +W   GY+ SDC ++  ++N          AAADA+  G DLDCG    +  
Sbjct: 938  LMQSILRDKWNFKGYVTSDCGAIDDIFNHHKAHPDAATAAADAVFHGTDLDCGQSAYLAL 997

Query: 189  EGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQ-PFGNLGPRDVCTPAHQQLALQ 247
              AV+ G++ E+ +++++    T++ RLG+FD  P+ Q  + ++    +    HQ LA Q
Sbjct: 998  VKAVKNGIITEKQLDVSVKRLFTIRFRLGLFD--PAEQVDYAHIPISVLECKKHQDLAKQ 1055

Query: 248  AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISR- 306
             A + +VLLKN  R LPL   +   V V+GPN+D    ++GNY G      TPLQ I   
Sbjct: 1056 LARESMVLLKND-RLLPLQKNKLKKVVVMGPNADCKDALLGNYNGHPSRMLTPLQAIRER 1114

Query: 307  ---YAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI-------- 355
                A+ ++ +G   +       +      A+ ADA + + G+   +E E +        
Sbjct: 1115 LKGVAEVVYVSGIDYINTVSEDELKRYVNQAKGADAVIFIGGISPRLEGEEMSVNKDGFD 1174

Query: 356  --DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSF-AKNDPRIGAILWVGYPG 412
              DR  + LP  Q +L+  +  A R P V V+M G  + + + AK+ P   AIL   Y G
Sbjct: 1175 GGDRTSIALPTVQTQLMKALV-AGRIPTVFVMMTGSALAIPWEAKHVP---AILNAWYGG 1230

Query: 413  QAGGAAIADVLFGRANP 429
            Q GG AIADVLFG  NP
Sbjct: 1231 QYGGEAIADVLFGDYNP 1247


>gi|169611757|ref|XP_001799296.1| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
 gi|160702362|gb|EAT83185.2| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
          Length = 755

 Score =  267 bits (682), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 161/430 (37%), Positives = 230/430 (53%), Gaps = 13/430 (3%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           NGG+A + +W+P++N FRDPRWGRG ETPGED V    Y    + GL+G+   R K+ A 
Sbjct: 131 NGGVAPVDFWTPDINPFRDPRWGRGSETPGEDIVRIKGYTKHLLAGLEGDKPQR-KIIAT 189

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKHY  YD++ W G+DR+ FNA+++ QDL + Y  PF+ C  + KV S MCSYN VNG P
Sbjct: 190 CKHYVGYDMEAWGGIDRHSFNAKINMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNGVP 249

Query: 123 TCADPDILKNTIHGQWRL---DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           TCAD  +L+  +   W     + YI SDC++V  +     Y +T  E    A  AG+D  
Sbjct: 250 TCADTYVLQTILRDHWNWTESNNYITSDCEAVKDISLKHKYAKTNAEGTGLAFTAGMDNS 309

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           C    +    GA     L    ++ AL       +R G FDG  +A  + NLG +D+ TP
Sbjct: 310 CEYTGSSDIPGAFNQSYLSIPTIDRALKRQYEGLVRAGYFDG--AAATYANLGVKDINTP 367

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
             QQL+LQ A +G+VLLKN   TLPLS      VA++G  ++ T  + G Y+G A    +
Sbjct: 368 EAQQLSLQVASEGLVLLKND-DTLPLSLTNGSKVAMLGFWANDTSKLSGIYSGPAPYLRS 426

Query: 300 PLQGISRYAKTIHQA-GCFGVACNG---NQLIGAAEVAARQADATVLVMGLDQSIEAEFI 355
           P+    +    +  A G      N    +     A  AA ++D  +   GLD S  AE  
Sbjct: 427 PVWAGQKLGLDMAIASGPILQQSNSSTRDNWTTNALAAAEKSDYILYFGGLDPSAAAEGF 486

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  +  P  Q +L+ ++A   +  VVLVL  G  +D S       + +++W  +PGQ G
Sbjct: 487 DRNSIAWPTAQVDLIKKLAAIGKPLVVLVL--GDLMDNSPLLELDGVNSVIWANWPGQDG 544

Query: 416 GAAIADVLFG 425
           G+A+  V+ G
Sbjct: 545 GSAVMQVVTG 554


>gi|310797011|gb|EFQ32472.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Glomerella graminicola M1.001]
          Length = 767

 Score =  266 bits (681), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 165/433 (38%), Positives = 231/433 (53%), Gaps = 11/433 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAA 61
           N G AGL YW+PNVN F+DPRWGRG ETPGED +   +YA    RGL G   G + +V +
Sbjct: 146 NAGWAGLDYWTPNVNPFKDPRWGRGSETPGEDVLRVKRYAEYITRGLDGPVPGEQRRVIS 205

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKHY   D ++WNG  R+ F+A+++ QDL + Y +PF+ C  + KV S+MC+YN VNG 
Sbjct: 206 TCKHYAGNDFEDWNGTSRHDFDAKITAQDLAEYYLMPFQQCARDSKVGSIMCAYNAVNGV 265

Query: 122 PTCADPDILKNTIHGQWRL---DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           P+CA+  +L+N +   W     + Y+ SDC++V  +     Y  T     A   +AG+D 
Sbjct: 266 PSCANEYLLQNILREHWNWTEHNNYVTSDCEAVLDVSANHKYAPTNAAGTAICFEAGMDT 325

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
            C    +    GA   GLL+EE V+ AL       +R G FDG  +   +  LG +DV +
Sbjct: 326 SCEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDGHEAI--YAKLGWKDVNS 383

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
              Q LALQAA +GIVLLKN+  TLPL     H VA+IG  +D    + G Y+G A    
Sbjct: 384 AEAQSLALQAAVEGIVLLKNNG-TLPLDLKPSHKVAMIGFWADAPDKLQGGYSGRAAHLH 442

Query: 299 TPLQGISRYAKTIHQAGCFGVACN--GNQLIGAAEVAARQADATVLVMGLDQSIEAEFID 356
           TP     +    I  A    +  N   +    AA  AA  AD  +   GLD S   E +D
Sbjct: 443 TPAYAARQLGLDITLASGPVLQRNNASDNWTAAALEAAEGADYILYFGGLDTSAAGETLD 502

Query: 357 RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
           R  L  P  Q  L+ +++   + P+V+ L+     D    + D  + +ILW  +PGQ GG
Sbjct: 503 RTDLEWPEAQLMLIKKLSALGK-PLVVNLLGDQLDDTPLLQLD-EVSSILWANWPGQDGG 560

Query: 417 AAIADVLFGRANP 429
            AI  ++ G  +P
Sbjct: 561 VAIMKLITGEKSP 573


>gi|301090543|ref|XP_002895482.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
 gi|262098232|gb|EEY56284.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
          Length = 809

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 169/441 (38%), Positives = 238/441 (53%), Gaps = 23/441 (5%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR-LKVAAC 62
           G   GL  WSPN+NI RDPRWGR  ETP EDP++  KY  +Y +GLQ     R L+    
Sbjct: 145 GPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTKGLQEGKDKRFLQAVVT 204

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
            KHY AY  ++++G+DR  FNA VS+ D  DTY   F+A VV GK   VMCSYN VNG P
Sbjct: 205 LKHYAAYSYEHYDGIDRMAFNAVVSRYDFADTYLPAFEASVVHGKAKGVMCSYNSVNGMP 264

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP 182
            CA+  +    +      DGYI SD  ++  +Y+ +HYT+T  EA   AI +G D++ G 
Sbjct: 265 MCANEQLNSKLLRDALGFDGYITSDSGAIAGIYHQRHYTKTLCEAGRLAILSGTDVNSGS 324

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
                    V  G L E+ V+ A+  T+ ++  LG+FD     QP+ ++ P +V T   +
Sbjct: 325 VYKQCLAELVTSGQLPEKAVDDAMRRTLKLRFELGLFD-PIDDQPYWHVAPNEVNTAESK 383

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG------ 296
           QL+L  + + IVLL+N    LPL+  +   +AVIGP++     ++GNY G  C       
Sbjct: 384 QLSLDLSRKSIVLLQNHGNILPLA--KGKKLAVIGPHAAAKRALLGNYLGQMCHGDYLEV 441

Query: 297 --YTTPLQGISR---YAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIE 351
               TPL+ I+     + T++  G  G+          AE AAR+A+  VL +G+D SIE
Sbjct: 442 GCVQTPLEAITIANGASNTLYAKGS-GINDTSTAGFDEAEAAARKAETVVLFLGIDTSIE 500

Query: 352 AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVS--FAKNDPRIGAILWVG 409
            E  DR  + +P  Q +L+ RV +A + P V+VL  GG V         D  + A     
Sbjct: 501 REAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFNGGVVGAEELILHTDGVVEAF---- 555

Query: 410 YPGQAGGAAIADVLFGRANPG 430
           YPG  G  A++D+LFG A P 
Sbjct: 556 YPGFFGAQAVSDILFGDAIPS 576


>gi|317057539|ref|YP_004106006.1| glycoside hydrolase family protein [Ruminococcus albus 7]
 gi|315449808|gb|ADU23372.1| glycoside hydrolase family 3 domain protein [Ruminococcus albus 7]
          Length = 691

 Score =  266 bits (679), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 168/441 (38%), Positives = 252/441 (57%), Gaps = 38/441 (8%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT W+PN+NIFRDPRWGRG ET GEDP LT K   + VRGLQG+ G  +K AAC KH+ 
Sbjct: 109 GLTLWAPNINIFRDPRWGRGHETFGEDPYLTTKNGMAVVRGLQGD-GKVIKAAACAKHFA 167

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+A+ + +D+E+TY   F+A V E KV SVM +YN+VNG+P CA
Sbjct: 168 VH-----SGPEAIRHSFDAKANAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPACA 222

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
             + L + +  +W  DGY VSDC ++   +     T    E+ A A+KAG D++CG    
Sbjct: 223 -SNFLMDKLK-EWEFDGYFVSDCWAIRDFHENHMVTANAIESTAMALKAGCDVNCGCTYQ 280

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
            +   A+  G + +ED+  A  + +  ++RLGMFD +     + ++    V    H+ ++
Sbjct: 281 -NLLVALEKGAVTKEDIRTACVHLMRTRIRLGMFDKKTE---YDDIPYDKVACKEHKAIS 336

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI- 304
           L+ A + +V+L+N+   LP+ T ++ T+AVIGPN+D    + GNY G++  YTT L GI 
Sbjct: 337 LECAEKSLVMLENNG-ILPVDTSKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGIQ 395

Query: 305 SRY-AKTIHQAGCFGVACNGNQLIGAAE------VAARQADATVLVMGLDQSIEA----- 352
            R+  + I   GC       + L  A +       AA+ AD T+L +GLD +IE      
Sbjct: 396 DRFDGRVIFAEGCHLYKDRVSNLAQAGDRYAEAVAAAKFADMTILCLGLDATIEGEEGDT 455

Query: 353 --EFI--DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWV 408
             EF   D+ GL LP  Q+ELV ++    + PVV V+  G  ++      + +  A++  
Sbjct: 456 GNEFSSGDKNGLTLPPPQRELVKKIMAVGK-PVVTVVCAGSAINT-----ESKPDALIHA 509

Query: 409 GYPGQAGGAAIADVLFGRANP 429
            YPG  GG A+A+VLFG  +P
Sbjct: 510 FYPGAEGGKALAEVLFGDVSP 530


>gi|301118693|ref|XP_002907074.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
 gi|262105586|gb|EEY63638.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
          Length = 809

 Score =  266 bits (679), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 169/440 (38%), Positives = 238/440 (54%), Gaps = 23/440 (5%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR-LKVAAC 62
           G   GL  WSPN+NI RDPRWGR  ETP EDP++  KY  +Y +GLQ     R L+    
Sbjct: 145 GPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTKGLQEGKDKRFLQAVVT 204

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
            KHY AY  ++++G+DR  FNA VS+ D  DTY   F+A VV GK   VMCSYN VNG P
Sbjct: 205 LKHYAAYSYEHYDGIDRMAFNAVVSRYDFADTYLPAFEASVVHGKAKGVMCSYNSVNGMP 264

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP 182
            CA+  +    +      DGYI SD  ++  +Y+ +HYT+T  EA   AI +G D++ G 
Sbjct: 265 MCANEQLNSKLLRDALGFDGYITSDSGAIAGIYHQRHYTKTLCEAGRLAILSGTDVNSGS 324

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
                    V  G L E+ V+ A+  T+ ++  LG+FD     QP+ ++ P +V T   +
Sbjct: 325 VYKQCLAELVTSGQLPEKAVDDAMRRTLKLRFELGLFD-PIDDQPYWHVAPNEVNTAESK 383

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG------ 296
           QL+L  + + IVLL+N    LPL+  +   +AVIGP++     ++GNY G  C       
Sbjct: 384 QLSLDLSRKSIVLLQNHGNILPLA--KGKKLAVIGPHAAAKRALLGNYLGQMCHGDYLEV 441

Query: 297 --YTTPLQGISR---YAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIE 351
               TPL+ I+     + T++  G  G+          AE AAR+A+  VL +G+D SIE
Sbjct: 442 GCVQTPLEAITIANGASNTLYAKGS-GINDTSTGGFDEAEAAARKAETVVLFLGIDTSIE 500

Query: 352 AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVS--FAKNDPRIGAILWVG 409
            E  DR  + +P  Q +L+ RV +A + P V+VL  GG V         D  + A     
Sbjct: 501 REAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFNGGVVGAEELILHTDGVVEAF---- 555

Query: 410 YPGQAGGAAIADVLFGRANP 429
           YPG  G  A++D+LFG A P
Sbjct: 556 YPGFFGAQAVSDILFGDAIP 575


>gi|5690010|emb|CAB51937.1| Family 3 Glycoside Hydrolase [Ruminococcus flavefaciens]
          Length = 690

 Score =  266 bits (679), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 166/440 (37%), Positives = 243/440 (55%), Gaps = 36/440 (8%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT WSPNVNIFRDPRWGRGQET GEDP LT     +Y +GLQG+ G  L+ AAC KH+ 
Sbjct: 111 GLTLWSPNVNIFRDPRWGRGQETYGEDPYLTSCLGVAYAKGLQGD-GKVLRTAACAKHFA 169

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +   +     R+ F+A+ + +D+ +TY   F+A V + KV SVM +YN+VNG+P CA  
Sbjct: 170 VH---SGPEATRHEFDAKANMKDMTETYIAAFEALVKDAKVESVMGAYNRVNGEPACA-S 225

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
           D + N +  +W  DG+ VSDC ++   +     T+T  E+AA A+K G DL+CG    +H
Sbjct: 226 DFVMNKLE-EWGFDGHFVSDCWAIRDFHTNHGVTKTAPESAALALKKGCDLNCGN-TYLH 283

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              A   GL+ EED+  +    +  ++RLGMFD    +  +  L    V    H++ +L+
Sbjct: 284 LLAAFNEGLINEEDLRRSCIKLMRTRVRLGMFD---KSTEYDGLDYDIVACDEHKEFSLR 340

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISR- 306
            + + +VLLKN+   LPL   ++ T+ VIGPN+D    + GNY G A  Y T L GI   
Sbjct: 341 CSERSMVLLKNNG-ILPLDGSKYKTIGVIGPNADSVPALEGNYNGKADEYITFLSGIREA 399

Query: 307 ------YAKTIH--QAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA------ 352
                 Y +  H  +  C G+A   ++L   AE+  R    +  +  LD +IE       
Sbjct: 400 HDGRVLYTEGSHLYKDRCMGLALPDDRL-SEAEIITRTLRCSGSLCWLDATIEGEEGDTG 458

Query: 353 -EFI--DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVG 409
            EF   D+  L LP  Q++LV  V    + PV++V   G  ++V     +    A++   
Sbjct: 459 NEFSSGDKNDLRLPESQRKLVKTVMAKGK-PVIIVTAAGSAINV-----EADCDALIQAW 512

Query: 410 YPGQAGGAAIADVLFGRANP 429
           YPGQ GG A+A++LFG+ +P
Sbjct: 513 YPGQLGGRALANILFGKVSP 532


>gi|171695518|ref|XP_001912683.1| hypothetical protein [Podospora anserina S mat+]
 gi|170948001|emb|CAP60165.1| unnamed protein product [Podospora anserina S mat+]
          Length = 805

 Score =  266 bits (679), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 172/456 (37%), Positives = 243/456 (53%), Gaps = 34/456 (7%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRG------------------QETPGEDPVLTGKYAAS 44
           N  +AGL YW+PN+N ++DPRWGRG                  Q+TPGEDPV    Y  +
Sbjct: 159 NAELAGLDYWTPNINPYKDPRWGRGHEVCYLSLLFRAVQLLRTQKTPGEDPVHIKGYVQA 218

Query: 45  YVRGLQGNTGSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVV 104
            + GL+G    R KV A CKH+ AYDL+ W G  RY FNA V+ QDL + Y  PF+ C  
Sbjct: 219 LLEGLEGRDKIR-KVIATCKHFAAYDLERWQGALRYRFNAVVTSQDLSEYYLQPFQQCAR 277

Query: 105 EGKVASVMCSYNQVNGKPTCADPDILKNTIHGQWRL---DGYIVSDCDSV-GVLYNTQHY 160
           + KV S MCSYN +NG P CA   ++ + +   W     + YI SDC+++   L N  ++
Sbjct: 278 DSKVGSFMCSYNALNGTPACASTYLMDDILRKHWNWTEHNNYITSDCNAIQDFLPNFHNF 337

Query: 161 TRTPEEAAADAIKAGLDLDCG-PFLAIHTE--GAVRGGLLREEDVNLALAYTITVQMRLG 217
           ++TP +AAADA  AG D  C  P     T+  GA    LL EE ++ AL       +R G
Sbjct: 338 SQTPAQAAADAYNAGTDTVCEVPGYPPLTDVIGAYNQSLLSEEIIDRALRRLYEGLIRAG 397

Query: 218 MFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIG 277
             D   S  P+  +    V TP  Q LALQ+A  GIVLLKN+   LPL  L + T+A+IG
Sbjct: 398 YLDSA-SPHPYTKISWSQVNTPKAQALALQSATDGIVLLKNNG-LLPLD-LTNKTIALIG 454

Query: 278 PNSDVTVTMIGNYAGVACGYTTPLQGISRYAKTIHQAG---CFGVACNGNQLIGAAEVAA 334
             ++ T  M+G Y+G+   Y  P+   ++   T H A            +     A  AA
Sbjct: 455 HWANATRQMLGGYSGIPPYYANPIYAATQLNVTFHHAPGPVNQSSPSTNDTWTSPALSAA 514

Query: 335 RQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVS 394
            ++D  + + G D SI AE  DR  +  P  Q  L++ +A+  +  +V  L  G  VD +
Sbjct: 515 SKSDIILYLGGTDLSIAAEDRDRDSIAWPSAQLSLLTSLAQMGKPTIVARL--GDQVDDT 572

Query: 395 FAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
              ++P I +ILWVGYPGQ+GG A+ +++ G ++P 
Sbjct: 573 PLLSNPNISSILWVGYPGQSGGTALLNIITGVSSPA 608


>gi|380293100|gb|AFD50200.1| beta-xylosidase [Hypocrea orientalis]
          Length = 797

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 162/431 (37%), Positives = 243/431 (56%), Gaps = 10/431 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPV-LTGKYAASYVRGLQGNTG-SRLKVA 60
           N G  GL  ++PNVN FR P WGRGQETPGED   L+  Y   Y+ G+QG     +LKVA
Sbjct: 163 NSGRYGLDVYAPNVNGFRSPLWGRGQETPGEDAFFLSSAYTYEYITGIQGGVDPEQLKVA 222

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KH+  YDL+NWN   R  F+A +++QDL + Y   F A     K  S+MCSYN VNG
Sbjct: 223 ATVKHFAGYDLENWNNQSRLGFDAIITQQDLSEYYTPQFLAAARYAKSRSLMCSYNSVNG 282

Query: 121 KPTCADPDILKNTIHGQWRLD--GYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+CA+   L+  +   W     GY+ SDCD+V  ++N   Y      AAA +++AG D+
Sbjct: 283 VPSCANSFFLQTLLRESWGFPEWGYVSSDCDAVYNVFNPHDYASNQSSAAASSLRAGTDI 342

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     H   +   G +   ++  ++       +RLG FD +     + +LG +DV  
Sbjct: 343 DCGQTYPWHLNESFVAGEVTRGEIERSVTRLYANLVRLGYFDKK---NQYRSLGWKDVVK 399

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
                ++ +AA +GIVLLKN   TLPLS  +  ++A+IGP ++ T  M GNY G A    
Sbjct: 400 TDAWNISYEAAVEGIVLLKNDG-TLPLSK-KVRSIALIGPWANATTQMQGNYFGPAPYLI 457

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           +PL+   +    ++      +A N       A  AA+++DA V + G+D +IE E  DR 
Sbjct: 458 SPLEAAKKAGYHVNFELGTEIAGNSTAGFAKAIAAAKKSDAIVYLGGIDNTIEQEGADRT 517

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            +  PG Q +L+ ++++  + P+V++ M GG VD S  K++ ++ +++W GYPGQ+GG A
Sbjct: 518 DIAWPGNQLDLIKQLSEVGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVA 576

Query: 419 IADVLFGRANP 429
           + D+L G+  P
Sbjct: 577 LFDILSGKRAP 587


>gi|339499234|ref|YP_004697269.1| beta-glucosidase [Spirochaeta caldaria DSM 7334]
 gi|338833583|gb|AEJ18761.1| Beta-glucosidase [Spirochaeta caldaria DSM 7334]
          Length = 699

 Score =  264 bits (675), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 168/441 (38%), Positives = 243/441 (55%), Gaps = 38/441 (8%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+W+PN+NIFRDPRWGRGQET GEDPVLT +   +YVRGLQG+    L+ AAC KH+ 
Sbjct: 106 GLTFWAPNINIFRDPRWGRGQETYGEDPVLTSRLGTAYVRGLQGSDPYYLRAAACAKHFA 165

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ FNA VS++DLE+TY   FKA V  G V SVM +YN+VNG+P C 
Sbjct: 166 VH-----SGPEGLRHTFNAEVSQKDLEETYLPAFKALVKSG-VESVMGAYNRVNGEPACG 219

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
              +LK  +  +W+  G++VSDC ++   +     T    E+ A A+++G DL+CG    
Sbjct: 220 STYLLKQKLREEWQFQGHVVSDCWAICDFHKNHKVTNDILESIALALRSGCDLNCGDAYN 279

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
              E AV  G + E+D+N A+   +    +LG+   +    P+  +    +    H  LA
Sbjct: 280 YLAE-AVLKGYVTEDDINRAVVRLLITLDKLGLIHDD---GPYQGITIHQIDWKKHDSLA 335

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           L+AA + IVLLKN+   LPL   +   + V GPN+  +  ++GNYAGV+    T L+ I 
Sbjct: 336 LEAAEKSIVLLKNNG-VLPLKKDKISYIYVTGPNATNSDALLGNYAGVSSRLLTVLEAIV 394

Query: 306 RYA----KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE-------- 353
             A       ++ GC       N    A+ V  + AD T+ VMG D S+E E        
Sbjct: 395 EEAGPEITVTYKKGCPLAERRVNPNDWASGV-TKYADVTIAVMGRDTSVEGEEGDAILSS 453

Query: 354 -FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI----GAILWV 408
            + D   L L   Q   + ++ K S  P+++VLM G P+        P +     AIL  
Sbjct: 454 TYGDFEDLNLNDEQLSYLHKL-KESGKPLIVVLMGGAPI------CSPELHEIADAILVA 506

Query: 409 GYPGQAGGAAIADVLFGRANP 429
            YPGQAGG A+++++FG+ NP
Sbjct: 507 WYPGQAGGTAVSNIVFGKTNP 527


>gi|255590044|ref|XP_002535159.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
 gi|223523880|gb|EEF27223.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
          Length = 449

 Score =  264 bits (674), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 126/256 (49%), Positives = 179/256 (69%), Gaps = 2/256 (0%)

Query: 176 LDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD 235
           +D++CG +L  +T+ AV    + E +++ AL    +++MRLG+F+G P+  P+G++    
Sbjct: 1   MDVNCGNYLKNYTKSAVEKKKVSESEIDRALHNLFSIRMRLGLFNGNPTKLPYGDISADQ 60

Query: 236 VCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC 295
           VC+  HQ +AL+AA  GIVLLKNS + LPLS  +  ++A+IGPN+D +  ++GNYAG  C
Sbjct: 61  VCSQEHQAVALEAARDGIVLLKNSNQLLPLSKSKTTSLAIIGPNADNSTILVGNYAGPPC 120

Query: 296 GYTTPLQGISRYAKTI-HQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEF 354
              TP QG+  Y KT  +  GC  VAC+   +  A ++A ++AD  VLVMGLDQ+ E E 
Sbjct: 121 KTVTPFQGLQNYIKTTKYHPGCSTVACSSAAIDQAIKIA-KEADQVVLVMGLDQTQEREE 179

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
            DR  L+LPG+QQEL+  VA+A++ PVVLVL+CGGPVD+SFAK D  IG ILW GYPG+A
Sbjct: 180 HDRVDLVLPGKQQELIISVARAAKKPVVLVLLCGGPVDISFAKYDRNIGGILWAGYPGEA 239

Query: 415 GGAAIADVLFGRANPG 430
           GG A+A+++FG  NPG
Sbjct: 240 GGIALAEIIFGNHNPG 255


>gi|402493386|ref|ZP_10840139.1| beta-glucosidase [Aquimarina agarilytica ZC1]
          Length = 734

 Score =  263 bits (673), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 172/446 (38%), Positives = 249/446 (55%), Gaps = 35/446 (7%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G  AGLT+W+PN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQGN    LK AAC 
Sbjct: 142 GKYAGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSQIGVNFVKGLQGNHPKYLKSAACA 201

Query: 64  KHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
           KHY  +     +G +  R+ F+A  SK+D+ +TY   F+A V E KV  VM +YN+VNG+
Sbjct: 202 KHYAVH-----SGPEELRHEFDAIASKKDMAETYLPAFEALVKEAKVEGVMGAYNRVNGE 256

Query: 122 PTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG 181
             CA P +L+  +   W   GYIVSDC ++  L+     T+T EE+AA A+  GL+++CG
Sbjct: 257 GACASPYLLEKLLKDTWGFKGYIVSDCWALSDLHKFHKVTQTAEESAAAALNVGLNVNCG 316

Query: 182 ---PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPS-AQPFGNLGPRDVC 237
              P L    +GA++ GL  E+ ++  L + +  + +LG FD  PS   P+  +    V 
Sbjct: 317 NVYPAL----DGAIKQGLTSEKQLDNVLQHQLLTRFKLGFFD--PSNNNPYNKITTDVVD 370

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           + AH+ +AL+AA + IVLLKN+   L        +V V GPN+     ++GNY GV    
Sbjct: 371 SEAHRAIALEAAQKSIVLLKNNNNLL-PLKKDLKSVYVAGPNAAREDVLLGNYYGVTSKT 429

Query: 298 TTPLQGI----SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
            T L GI    S      ++ G      N N +  +    +R AD  ++VMGL  + E E
Sbjct: 430 QTILDGIVSKVSAGTSINYKQGLLPFQKNVNPIDWSTGEISR-ADVGIIVMGLSGNYEGE 488

Query: 354 ---------FIDRAGLLLPGRQQELVSRV-AKASRGPVVLVLMCGGPVDVSFAKNDPRIG 403
                      DR  + LP  Q + + ++ AK +  P+VLVL  G P+ +    +   + 
Sbjct: 489 EGEAIASESKGDRVDIRLPQNQIDYIKKIKAKNTGNPLVLVLTGGSPIAMPEVYD--LVD 546

Query: 404 AILWVGYPGQAGGAAIADVLFGRANP 429
           AI++  YPG+ GG A+AD+LFG   P
Sbjct: 547 AIVFAWYPGEEGGQAVADILFGDVVP 572


>gi|372209074|ref|ZP_09496876.1| glycoside hydrolase family protein [Flavobacteriaceae bacterium
           S85]
          Length = 727

 Score =  263 bits (673), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 171/441 (38%), Positives = 241/441 (54%), Gaps = 30/441 (6%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT+W+PNVNIFRDPRWGRGQET GEDP L  +   ++V+GLQG+  + LK AAC KHY
Sbjct: 125 AGLTFWTPNVNIFRDPRWGRGQETFGEDPYLMTQMGVAFVKGLQGDDPNYLKSAACAKHY 184

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R  FNA  ++QDL +TY   F+A V +  V  VM ++N V G P  
Sbjct: 185 AVH-----SGPESLRLEFNAVPTQQDLYETYLPAFEALVKDANVEGVMPAHNAVFGAPMA 239

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFL 184
           A+  +L + +  +W  DGY+V+DC ++  +     Y  +   AAA A+KAG +L+CG   
Sbjct: 240 ANKFLLTDVLRDRWGFDGYVVTDCGAIKQIKVGHKYVDSEVAAAAVALKAGTNLNCG--- 296

Query: 185 AIHTE--GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
           A + E   A+  GL+ EE V+         + RLGMFD + S  P+  +GP  + +  H 
Sbjct: 297 ATYKELKKAIDQGLVTEELVHERTKQLFKTRFRLGMFDKDLSKNPYSKIGPELIHSKEHI 356

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
           +LA +AA + IV+LKN    LPL T       V GP ++ +  ++G+Y GV+ G  T L 
Sbjct: 357 ELAREAAQKSIVMLKNKNNLLPLPT-DIKVPYVTGPFANSSDMLMGSYYGVSPGVVTILA 415

Query: 303 GI----SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI--- 355
           GI    S      +++G      N N    A  VA   +D T+ V+GL    E E +   
Sbjct: 416 GITDAVSLGTSLNYRSGALPFQKNINPKNWAPNVAG-MSDVTICVVGLTADREGEGVDAI 474

Query: 356 ------DRAGLLLPGRQQELVSRV-AKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWV 408
                 DR  L LP  Q   V ++ AK    P+VLV+  G P  VS    +    AIL +
Sbjct: 475 ASNHKGDRLDLKLPENQINYVKQLAAKKKDKPLVLVIASGSP--VSLEGIEEHCDAILQI 532

Query: 409 GYPGQAGGAAIADVLFGRANP 429
            YPG+ GG A+ADVLFG+ +P
Sbjct: 533 WYPGEQGGNAVADVLFGKVSP 553


>gi|325970053|ref|YP_004246244.1| beta-glucosidase [Sphaerochaeta globus str. Buddy]
 gi|324025291|gb|ADY12050.1| Beta-glucosidase [Sphaerochaeta globus str. Buddy]
          Length = 698

 Score =  263 bits (673), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 163/444 (36%), Positives = 245/444 (55%), Gaps = 36/444 (8%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT WSPNVNIFRDPRWGRGQET GEDP LT + A +++RGLQG  G  LK A+C KH+ 
Sbjct: 101 GLTVWSPNVNIFRDPRWGRGQETFGEDPYLTARLAVAFIRGLQGE-GPVLKTASCVKHFA 159

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
           A+     +G +  R+ FNA V K+DLE+TY   F + V E K  +VM +Y+ +N +P CA
Sbjct: 160 AH-----SGPEPLRHGFNAVVGKKDLEETYLPAFASAVKEAKADAVMGAYSALNDEPCCA 214

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PFL 184
              +++ T+  +W  +G  +SDC ++   +     T+  EE+AA A+K G DL CG  + 
Sbjct: 215 SSFLMEETLRLRWGFEGMYISDCWAIRDFHLNHKVTKNEEESAALALKRGCDLACGCEYQ 274

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
           ++  E A + GL+  E +  A    +T + +LG FD       +  LG   + +  H  L
Sbjct: 275 SL--EKAFQKGLITREQIKKAAIRVMTTRFKLGQFD---QGTAYDTLGLESLDSDEHAAL 329

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           A +A+ + +VLLKN A  LPL       +AVIGPN+D    + GNY G +  Y T L+G+
Sbjct: 330 AFEASCRSLVLLKNDA-LLPLKKEAVSCLAVIGPNADSRQALWGNYHGTSSRYVTILEGL 388

Query: 305 SRY----AKTIHQAGCFGVACNGNQL------IGAAEVAARQADATVLVMGLDQSIEAEF 354
             Y     + ++  G         +L      +  A   A+ +D  VL +GL++++E E 
Sbjct: 389 RDYVGSSTRILYSEGSNLTKNKVERLAKDDDRLSEAVFMAKASDVVVLCLGLNETVEGEM 448

Query: 355 I---------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAI 405
                     D+  L LP  Q++L+  VA+  + P+++VL+ GG +D    +    + A+
Sbjct: 449 HDDGNGGWAGDKDDLRLPLCQRKLLKAVAETGK-PIIVVLLSGGSLDPEIEQY-ANVKAL 506

Query: 406 LWVGYPGQAGGAAIADVLFGRANP 429
           +   YPGQ GG AIA +L+G   P
Sbjct: 507 IQAWYPGQEGGKAIAHLLYGALCP 530


>gi|295134875|ref|YP_003585551.1| beta-glucosidase [Zunongwangia profunda SM-A87]
 gi|294982890|gb|ADF53355.1| beta-glucosidase [Zunongwangia profunda SM-A87]
          Length = 735

 Score =  263 bits (671), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 166/447 (37%), Positives = 247/447 (55%), Gaps = 33/447 (7%)

Query: 2   YNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAA 61
           Y+    GLT+W+PNVNIFRDPRWGRGQET GEDP LT K   ++V+GLQG+    LK AA
Sbjct: 133 YHRKYGGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSKLGEAFVKGLQGDNDKYLKTAA 192

Query: 62  CCKHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
             KHY  +     +G +  R+ FNA VS++DL +TY   FK  +V+  V ++MC+YN  N
Sbjct: 193 AAKHYAVH-----SGPEKLRHEFNADVSEKDLWETYLPAFK-TLVDANVETIMCAYNSTN 246

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           G+P CA+  ++ + +  +W  +G++VSDC ++    +      +PE AAA A++ G++L+
Sbjct: 247 GEPCCANNRLINDILRDKWGFNGHVVSDCWALQDFVSGHDIVESPEAAAALAVEVGIELN 306

Query: 180 CGP---FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDV 236
           CG    FLA     AV  GL+ EE V+  L   +  + +LG+FD E S  P+  +G   +
Sbjct: 307 CGDTYNFLA----KAVEDGLVSEELVDKRLHKLLETRFKLGLFDPEES-NPYNKIGVEVM 361

Query: 237 CTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
            +  H+ LA + A + IVLLKN    LPL         + GPN+     ++GNY GV   
Sbjct: 362 NSDEHRALARETARKSIVLLKNDG-VLPLKN-NLSKYFITGPNATNIEVLLGNYHGVNPD 419

Query: 297 YTTPLQGISRYAKTIHQAGC-FGVACN--GNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
             T L+GI++  K   Q     G   N         A   A  +DAT +VMG+   +E E
Sbjct: 420 MVTVLEGIAKAIKPESQLQYRMGTRLNLPNENPQDWASPNAGNSDATFVVMGISGLLEGE 479

Query: 354 ---------FIDRAGLLLPGRQQELVSRVAKASRG-PVVLVLMCGGPVDVSFAKNDPRIG 403
                    F DR    LP  Q + + +V++A+   PVV ++  G P++++         
Sbjct: 480 EGESIASPTFGDRMDYNLPQNQIDYLQKVSEAAEDRPVVAIVTGGSPMNLTEVHK--LAD 537

Query: 404 AILWVGYPGQAGGAAIADVLFGRANPG 430
           A+L V YPG+ GG A+AD++FG+ +P 
Sbjct: 538 AVLLVWYPGEEGGNAVADIIFGKNSPS 564


>gi|358385386|gb|EHK22983.1| glycoside hydrolase family 3 protein [Trichoderma virens Gv29-8]
          Length = 795

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 158/431 (36%), Positives = 239/431 (55%), Gaps = 10/431 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDP-VLTGKYAASYVRGLQGNTG-SRLKVA 60
           N G  GL  ++PN+N FR P WGRGQETPGED  VLT  Y   Y+ G+QG      LK+A
Sbjct: 163 NSGRYGLDVYAPNINGFRSPLWGRGQETPGEDANVLTSAYTYEYITGMQGGVDPENLKIA 222

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KH+  YDL+NWN   R  F+A +++QDL + Y   F A     K  S MC+YN VNG
Sbjct: 223 ATAKHFAGYDLENWNNQSRLGFDAIITQQDLSEYYTPQFLAASRYAKSHSFMCAYNSVNG 282

Query: 121 KPTCADPDILKNTIHGQWRLD--GYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+CA+   L+  +   W     GY+ SDCD+V  ++N   Y      AAA +++AG D+
Sbjct: 283 VPSCANSFFLQTLLRESWGFPEWGYVSSDCDAVYNVWNPHDYASNQSSAAASSLRAGTDI 342

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     H   +   G +   ++  ++       +RLG FD +     + +LG +DV  
Sbjct: 343 DCGQTYPWHLNESFVAGEVSRGEIERSVTRLYANLVRLGYFDKK---NEYRSLGWKDVVK 399

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
                ++ +AA +GIVLLKN   TLPLS  +  ++A+IGP ++ T  M GNY G A    
Sbjct: 400 TDAWNISYEAAVEGIVLLKNDG-TLPLSK-KVRSIALIGPWANATTQMQGNYFGAAPYLI 457

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           +PL+   +    ++       A         A  AA+++DA +   G+D ++E E  DR 
Sbjct: 458 SPLEAAKKAGYQVNFELGTETASTSTAGFAKAIAAAKKSDAIIFAGGIDNTVEQEGADRT 517

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            +  PG Q +L+ ++++  + P+V++ M GG VD S  K++ ++ +++W GYPGQ+GG A
Sbjct: 518 DIAWPGNQLDLIKQLSELGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVA 576

Query: 419 IADVLFGRANP 429
           + D+L G+  P
Sbjct: 577 LFDILSGKRAP 587


>gi|291530120|emb|CBK95705.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum 70/3]
          Length = 689

 Score =  262 bits (669), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 164/440 (37%), Positives = 246/440 (55%), Gaps = 39/440 (8%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT W+PN+NIFRDPRWGRG ET GEDP LT +   ++V+G+QG     L+ AAC KH+ 
Sbjct: 109 GLTLWAPNINIFRDPRWGRGHETYGEDPYLTSRLGVNFVKGIQGEE-KYLRAAACAKHFA 167

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+ARVS++DLE+TY   FKA V EG+V  VM +YN+VNG+P+CA
Sbjct: 168 VH-----SGPESLRHEFDARVSEKDLEETYLPAFKALVKEGRVEGVMGAYNRVNGEPSCA 222

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
              ++      +W  DGY VSDC ++   +     T T  ++AA A+KAG D++CG    
Sbjct: 223 SEKLMGKL--REWGFDGYFVSDCGAIRDFHTNHKITDTAPQSAAMALKAGCDVNCGNTY- 279

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           +H   A+  GL+ ++D+  A  + +  ++RLG  D       F +L    +    ++ L+
Sbjct: 280 LHILAALEEGLITKQDIRTACIHALRTRIRLGQLDD----NEFDDLPFDIIACDGNKALS 335

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           L+AA + +VLL N    LPL   R  ++AVIGPN+D    ++GNY G      T L+GI 
Sbjct: 336 LEAAEKSMVLLHNDG-ILPLDKSRISSIAVIGPNADSRAALLGNYEGTPDRSVTFLEGIQ 394

Query: 306 RY--AKTIHQAGC-------FGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE--- 353
                +  +  GC        G+A  G++    A  A   AD TV+ +GLD ++E E   
Sbjct: 395 DAFDGRVYYAEGCQLFRDRTQGLALPGDRYA-EAVAACEAADVTVVCVGLDSTLEGEEGD 453

Query: 354 ----FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVG 409
                 D+  L LP  Q+ L+ ++ K +  P+++VL  G  V+     N     A++   
Sbjct: 454 TENKSGDKPDLRLPEVQRVLLQKL-KDTGKPLIIVLAAGSSVNTECEGN-----ALINAW 507

Query: 410 YPGQAGGAAIADVLFGRANP 429
           YPGQ GG A+A++LFG  +P
Sbjct: 508 YPGQYGGKALAEILFGEVSP 527


>gi|291556907|emb|CBL34024.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum V10Sc8a]
          Length = 691

 Score =  261 bits (668), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 166/442 (37%), Positives = 247/442 (55%), Gaps = 41/442 (9%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT W+PN+NIFRDPRWGRG ET GEDP LT +   S+V+G+QG     L+ AAC KH+ 
Sbjct: 109 GLTLWAPNINIFRDPRWGRGHETYGEDPYLTSRLGVSFVKGIQGEE-EYLRAAACAKHFA 167

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+ARVS++D+E+TY   FKA V EG+V  VM +YN+VNG+P+CA
Sbjct: 168 VH-----SGPESLRHEFDARVSEKDMEETYLPAFKALVKEGRVEGVMGAYNRVNGEPSCA 222

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
              ++      +W  DGY VSDC ++   + T   T T  ++AA A+KAG D++CG    
Sbjct: 223 SEKLMGKL--REWGFDGYFVSDCWAIRDFHTTHKITDTAPQSAAMALKAGCDVNCGN-TY 279

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           +H   A+  GL+ ++D+  A  + +  ++RLG  D       F +L    +    ++ L+
Sbjct: 280 LHILAALEEGLITKQDIRTACIHALRTRIRLGQLDD----NEFDDLPFDIIACDGNKALS 335

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           L+AA + +VLL N    LPL   R  ++AVIGPN+D    ++GNY G      T L+GI 
Sbjct: 336 LEAAEKSMVLLHNDG-ILPLDKSRISSIAVIGPNADSRAALLGNYNGTPDRSVTFLEGIQ 394

Query: 306 RY--AKTIHQAGC-------FGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA---- 352
                +  +  GC        G+A  G++    A  A   AD TV+ +GLD ++E     
Sbjct: 395 DAFDGRVYYAEGCQLFRDRTQGLALPGDRYA-EAVAACEAADVTVICVGLDATLEGEEGD 453

Query: 353 ---EFI--DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILW 407
              EF   D+  L LP  Q+ L+  + K +  P+++VL  G  V+     N     A++ 
Sbjct: 454 TGNEFASGDKPDLRLPEVQRVLLQNL-KDTGKPLIIVLAAGSSVNTECEGN-----ALIN 507

Query: 408 VGYPGQAGGAAIADVLFGRANP 429
             YPGQ GG A+A++LFG  +P
Sbjct: 508 AWYPGQYGGKALAEILFGEVSP 529


>gi|219887077|gb|ACL53913.1| unknown [Zea mays]
 gi|224035251|gb|ACN36701.1| unknown [Zea mays]
 gi|413919685|gb|AFW59617.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 405

 Score =  261 bits (668), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 136/217 (62%), Positives = 163/217 (75%), Gaps = 2/217 (0%)

Query: 214 MRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTV 273
           MRLG FDG+P   PFGNLGP DVCTP++Q+LA +AA QGIVLLKN+ + LPLS     ++
Sbjct: 1   MRLGFFDGDPRELPFGNLGPSDVCTPSNQELAREAARQGIVLLKNTGK-LPLSATSIKSM 59

Query: 274 AVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQL-IGAAEV 332
           AVIGPN++ + TMIGNY G  C YTTPLQG+     T++Q GC  V C+GN L + AA  
Sbjct: 60  AVIGPNANASFTMIGNYEGTPCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATK 119

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA  AD TVLV+G DQSIE E +DR  LLLPG+Q +LVS VA AS GP +LV+M GGP D
Sbjct: 120 AAASADVTVLVVGADQSIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFD 179

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +SFAK+  +I AILWVGYPG+AGGAAIADVLFG  NP
Sbjct: 180 ISFAKSSDKIAAILWVGYPGEAGGAAIADVLFGYHNP 216


>gi|332377068|gb|AEE64772.1| Xyl3A [Ruminococcus albus 8]
          Length = 691

 Score =  261 bits (667), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 168/442 (38%), Positives = 249/442 (56%), Gaps = 40/442 (9%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT W+PN+NIFRDPRWGR  ET GEDP LT +   + VRGLQG+ G  +K AAC KH+ 
Sbjct: 109 GLTLWAPNINIFRDPRWGRSHETFGEDPYLTAQNGKAVVRGLQGD-GKVMKAAACAKHFA 167

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+A+   +D+E+TY   F+A V E KV SVM +YN+VNG+P CA
Sbjct: 168 VH-----SGPEALRHSFDAKADAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPACA 222

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
              +++     +W  DGY VSDC ++   +     T    E+AA A+KAG D++CG    
Sbjct: 223 SDYLMEKL--KEWEFDGYFVSDCWAIRDFHEHHMVTANAVESAAMALKAGCDVNCGCTYQ 280

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
            +   A+  GL+ +E +  A  + +  ++RLGMFD       F ++    V    H+ ++
Sbjct: 281 -NLLAALDKGLITKEQIRTACVHLMRTRIRLGMFDKHTD---FDDIPYSKVACAEHKAVS 336

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI- 304
           L+ A + +VLLKN+   LPL   ++ T+AVIGPN+D    + GNY G++  YTT L GI 
Sbjct: 337 LECAEKSLVLLKNNG-ILPLDDKKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGIQ 395

Query: 305 SRY-AKTIHQAGCF-------GVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA---- 352
            R+  + I   GC        G+A  G++    A  AA+ AD  ++ +GLD +IE     
Sbjct: 396 DRFEGRVIFAEGCHLYKKSISGLAQAGDRYA-EAVAAAKNADLVIMCVGLDATIEGEEGD 454

Query: 353 ---EFI--DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILW 407
              EF   D+ GL LP  Q+ LV ++    + PVV V+  G  ++      + +  A++ 
Sbjct: 455 TGNEFSSGDKNGLTLPPPQKILVEKIMSVGK-PVVTVVCAGSAINT-----ESQPDALIH 508

Query: 408 VGYPGQAGGAAIADVLFGRANP 429
             YPG  GG A+A+VLFG  +P
Sbjct: 509 AFYPGAEGGKALAEVLFGDVSP 530


>gi|325679939|ref|ZP_08159508.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
           albus 8]
 gi|324108377|gb|EGC02624.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
           albus 8]
          Length = 691

 Score =  261 bits (667), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 168/442 (38%), Positives = 249/442 (56%), Gaps = 40/442 (9%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT W+PN+NIFRDPRWGRG ET GEDP LT +   + VRGLQG+ G  +K AAC KH+ 
Sbjct: 109 GLTLWAPNINIFRDPRWGRGHETFGEDPYLTAQNGKAVVRGLQGD-GKVMKAAACAKHFA 167

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+A+   +D+E+TY   F+A V E KV SVM +YN+VNG+P CA
Sbjct: 168 VH-----SGPEALRHSFDAKADAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPACA 222

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
              +++     +W  DGY VSDC ++   +     T    E+AA A+KAG D++CG    
Sbjct: 223 SDYLMEKL--KEWEFDGYFVSDCWAIRDFHEHHMVTANAVESAAMALKAGCDVNCGCTYQ 280

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
            +   A+  GL+ +E +  A  + +  ++RLGMFD       F ++    V    H+ ++
Sbjct: 281 -NLLAALDKGLITKEQIRTACVHLMRTRIRLGMFDKHTD---FDDIPYSKVACAEHKAVS 336

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI- 304
           L+ A + +VLLKN+   LPL   ++ T+AVIGPN+D    + GNY G++  YTT L GI 
Sbjct: 337 LECAEKSLVLLKNNG-ILPLDDKKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGIQ 395

Query: 305 SRY-AKTIHQAGCF-------GVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA---- 352
            R+  + I   GC        G+A  G++    A  AA+ AD  ++ +GLD +IE     
Sbjct: 396 DRFEGRVIFAEGCHLYKKSISGLAQAGDRYA-EAVAAAKNADLVIMCVGLDATIEGEEGD 454

Query: 353 ---EFI--DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILW 407
              EF   D+ GL LP  Q+ LV ++    + PVV V+  G  ++      + +  A++ 
Sbjct: 455 TGNEFSSGDKNGLTLPPPQKILVEKIMSVGK-PVVTVVCAGSAINT-----ESQPDALIH 508

Query: 408 VGYPGQAGGAAIADVLFGRANP 429
             YPG  G  A+A+VLFG  +P
Sbjct: 509 AFYPGAEGSKALAEVLFGDVSP 530


>gi|313202830|ref|YP_004041487.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312442146|gb|ADQ78502.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 742

 Score =  261 bits (666), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 162/440 (36%), Positives = 239/440 (54%), Gaps = 24/440 (5%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G  +G+T+++PNVNIFRDPRWGRGQET GEDP LT +   +YV+G+QGN    LK AAC 
Sbjct: 141 GQYSGITFYAPNVNIFRDPRWGRGQETYGEDPFLTSRMGVAYVKGMQGNDPKYLKTAACA 200

Query: 64  KHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
           KHY  +     +G +  R+ ++A    +D  +TY   F+  V EGKV SVMC+YN+  GK
Sbjct: 201 KHYVVH-----SGPEALRHSYDAEPPMKDFMETYVPAFETLVKEGKVESVMCAYNRTFGK 255

Query: 122 PTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG 181
           P C    +L + +  +W   GY+ +DC ++   Y      +   EA A AIK+G++L+CG
Sbjct: 256 PCCGSSFLLHDLLREKWGFTGYVTTDCWAIQNFYLHHGAAKDSLEACALAIKSGVNLNCG 315

Query: 182 PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAH 241
                +   AVR GL+ E++V+ AL+  +  + RLG+FD  P+  P+  +    + +  +
Sbjct: 316 NEFN-YLPAAVRKGLVTEKEVDEALSQLLRTRFRLGLFDS-PNENPYAKIKEEVIGSQQN 373

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
             LA +AA + +VLL+N   TLPL      ++ V+GP +     ++GNY GV    TT +
Sbjct: 374 IDLAYEAAAKSLVLLQNKNNTLPLKK-DMKSLYVVGPYAANQDILLGNYNGVNSRLTTIM 432

Query: 302 QGI----SRYAKTIHQAGCFGVACNGNQL-IGAAEVAARQADATVL-VMGLDQSIEAEFI 355
           Q I    S      ++ G    A N N +     E A   A   V  + G+ +  E E  
Sbjct: 433 QAIVGKVSAGTSVNYRIGVEPSAPNKNSMNYSIGEAADADAVVAVFGISGVFEGEEGEST 492

Query: 356 ------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVG 409
                 DR  L LP  Q + +  + K  + P++LVL  G P+      +   + AIL+V 
Sbjct: 493 ASTSRGDRLDLNLPQNQLDYLRELKKKCKKPIILVLTGGSPICTPELAD--MVDAILFVW 550

Query: 410 YPGQAGGAAIADVLFGRANP 429
           YPGQ GG A+ADV+FG  NP
Sbjct: 551 YPGQEGGHAVADVIFGDVNP 570


>gi|427385932|ref|ZP_18882239.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
           12058]
 gi|425726971|gb|EKU89834.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
           12058]
          Length = 732

 Score =  261 bits (666), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 173/446 (38%), Positives = 232/446 (52%), Gaps = 28/446 (6%)

Query: 2   YNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAA 61
           + G   GL Y SP +NI R+P WGR  E  GEDP  TG+   +YV+GLQG+    LK+A+
Sbjct: 134 FRGEHKGLIYRSPVINISRNPYWGRINEIFGEDPYHTGRMGVAYVKGLQGDDSHYLKLAS 193

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
             KHY   +++    VDR   +A+V ++ L + +   FK C+VEGK  SVM SYN +NG 
Sbjct: 194 TLKHYAVNNVE----VDRMKLSAQVPERMLYEYWLPHFKDCIVEGKAQSVMASYNAINGV 249

Query: 122 PTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP-EEAAADAIKAGLDLDC 180
           P   +  +L + +  QW  +G++VSD   V  +    H  +   EEA   +I AG D   
Sbjct: 250 PNNINKLLLTDILKNQWGHEGFVVSDLGGVKTMVEGHHQRQISCEEAVGRSIMAGCDFSD 309

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
             +   +   A+R G L EE +N AL   + V+ RLG FD   S  P+  + P  +    
Sbjct: 310 AEYEK-YIPDALRKGYLTEERLNDALRRVLLVRFRLGEFDDFKSV-PYSRISPDVIGCKE 367

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H+ L+L+AA + IVLLKN  + LP+       VAVIGP +D+     GNY GV     TP
Sbjct: 368 HRNLSLEAARKSIVLLKNEKKLLPIDRSIIKRVAVIGPYADLFNQ--GNYGGVPKDPVTP 425

Query: 301 LQGISR-YAKTIHQAGCFG-----VACNGNQLI-----GAAEVA-----ARQADATVLVM 344
           LQGI       +    C G     V     Q I       AE+      AR +D   L +
Sbjct: 426 LQGIKNAVGNNVEVLYCKGAQITPVKVRKGQPIPPRFDKEAEMKKAVEMARNSDVVFLFV 485

Query: 345 GLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGA 404
           G    IE E  DR  L+LPG Q ELV  V + ++  VV+VLM  GPV V   K +  I A
Sbjct: 486 GTTADIEVEGRDRKTLVLPGNQNELVKAVYEVNKK-VVVVLMSAGPVAVPEVKKN--IPA 542

Query: 405 ILWVGYPGQAGGAAIADVLFGRANPG 430
           +L   +PG  GG AIADVLFG  NPG
Sbjct: 543 VLQAWWPGDEGGNAIADVLFGDYNPG 568


>gi|390340546|ref|XP_001186857.2| PREDICTED: probable beta-D-xylosidase 2-like [Strongylocentrotus
           purpuratus]
          Length = 623

 Score =  260 bits (665), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 166/434 (38%), Positives = 245/434 (56%), Gaps = 16/434 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL  +SP +NI R P WGR QET GEDP LTG+ A S+V GLQGN    L   A CKH+ 
Sbjct: 144 GLNCFSPVINIMRHPYWGRNQETYGEDPYLTGELAKSFVWGLQGNHPRYLLTNAGCKHFA 203

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           AY         R+ F+A+VS +DL+ T+   FK C+  G   SVMCSYN VNG P CA+ 
Sbjct: 204 AYSGPENYPSSRFSFDAKVSDKDLQVTFFPAFKECIKAGTY-SVMCSYNSVNGIPACANS 262

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG---PFL 184
            +L + +  +W   GY+VSD  ++ +     +YT +  + A  ++KAG +LD G   P +
Sbjct: 263 YLLNDVLRTEWGFKGYVVSDQRALELEELAHNYTTSYLDTAIKSLKAGCNLDLGTTKPAV 322

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC-TPAHQQ 243
             +   AV  G+L  +D+  ++A     ++RLG FD  P   P+  L    V  +P HQ+
Sbjct: 323 YDYLAEAVELGMLTAQDLRDSIAPLFYTRLRLGEFD-PPDHNPYVKLNVDQVVESPEHQE 381

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG--YTTPL 301
           +AL+AA +  VL+KN   TLP+     HT+AV+GP ++ +  + G+YA        TT L
Sbjct: 382 IALKAALKSFVLVKNDGSTLPIEG-TIHTLAVVGPFANNSKLLFGDYAPNPDPRFVTTVL 440

Query: 302 QGISRYA-KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           +G+S  A KT H +GC    C      G    A   AD  V+ +G    +E+E  DR  +
Sbjct: 441 EGLSPMATKTRHASGCPSPKCVTYDQQGVLN-AVTGADVVVVCLGTGIELESEGNDRRDM 499

Query: 361 LLPGRQQELVSRVAKASRG-PVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           LLPG+Q++L+   A+ + G PV+L+L   GP+++++A + P + AI+   +P QA G A+
Sbjct: 500 LLPGKQEQLLQDAARYAAGKPVILLLFNAGPLNITWALSSPSVQAIVECFFPAQATGVAL 559

Query: 420 ADVLFGR---ANPG 430
             ++F     ANPG
Sbjct: 560 R-MMFQNAPGANPG 572


>gi|380696433|ref|ZP_09861292.1| glycoside hydrolase [Bacteroides faecis MAJ27]
          Length = 739

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 163/442 (36%), Positives = 236/442 (53%), Gaps = 32/442 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
            LTYW+PN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQG+    LK +AC KHY 
Sbjct: 133 ALTYWTPNINIFRDPRWGRGQETYGEDPYLTARIGEAFVQGLQGDNPRYLKASACAKHYA 192

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +     N   R+ FN+ VS  DL DTY   F+  VV+ KV+ VMC+YN   G+P C + 
Sbjct: 193 VHSGPEKN---RHSFNSDVSTYDLWDTYLPAFRTLVVDAKVSGVMCAYNAFQGQPCCGND 249

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            ++++ +  +W   GY+ SDC ++  ++N          AAADA+  G DLDCG    + 
Sbjct: 250 LLMQSILRDKWNFTGYVTSDCGAIDDIFNHHKTHPDAATAAADAVFHGTDLDCGHSAYLA 309

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDG----EPSAQPFGNLGPRDVCTPAHQQ 243
              AV+ G++ E+ +++++    T++ RLG+FD     + +  P   L  R      HQ 
Sbjct: 310 LVKAVKDGIITEKQLDVSVKRLFTIRFRLGLFDPVELVDYARIPISILECR-----KHQD 364

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LA Q A + +VLLKN  + LPL   +   V V+GPN+D   +++GNY G      TPLQ 
Sbjct: 365 LAKQLARESMVLLKND-QLLPLQKNKLKKVVVMGPNADSRESLLGNYNGNPSRMLTPLQA 423

Query: 304 ISR----YAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI---- 355
           I      + +  +  G   V       +      A+ ADA + + G+   +E E +    
Sbjct: 424 IRERLGGWTEVEYIEGVDHVNTISADDLKQYVNRAKGADAVIFIGGISPRLEGEEMPVSK 483

Query: 356 ------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSF-AKNDPRIGAILWV 408
                 DR  + LP  Q +++ +   A   P V V+M G  + + + A+N P   AIL  
Sbjct: 484 DGFDGGDRTTIALPAVQTQMM-KAWVAEHIPTVFVMMTGSALAIPWEAQNVP---AILNA 539

Query: 409 GYPGQAGGAAIADVLFGRANPG 430
            Y GQ GG AIADVLFG  NP 
Sbjct: 540 WYGGQYGGEAIADVLFGDYNPS 561


>gi|367032987|ref|XP_003665776.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
 gi|347013048|gb|AEO60531.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
          Length = 835

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 170/454 (37%), Positives = 231/454 (50%), Gaps = 31/454 (6%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL----- 57
           N G +GL YW+PNVN FRDPRWGRG ETPGED V   +YAAS +RGL+G + S       
Sbjct: 132 NAGWSGLDYWTPNVNPFRDPRWGRGSETPGEDVVRLKRYAASMIRGLEGRSSSSSSCSFG 191

Query: 58  ------KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASV 111
                 +V + CKHY   D ++WNG  R+ F+A +S QDL + Y  PF+ C  + +V SV
Sbjct: 192 SGGEPPRVISTCKHYAGNDFEDWNGTTRHDFDAVISAQDLAEYYLAPFQQCARDSRVGSV 251

Query: 112 MCSYNQVNGKPTCADPDILKNTIHGQWRL---DGYIVSDCDSVGVLYNTQHYTRTPEEAA 168
           MC+YN VNG P+CA+  ++   + G W     D Y+ SDC++V  +    HY  T  E  
Sbjct: 252 MCAYNAVNGVPSCANSYLMNTILRGHWNWTEHDNYVTSDCEAVLDVSAHHHYADTNAEGT 311

Query: 169 ADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF 228
               +AG+D  C    +    GA  GG L    V+ AL       +R+G FDG  S  P 
Sbjct: 312 GLCFEAGMDTSCEYEGSSDIPGASAGGFLTWPAVDRALTRLYRSLVRVGYFDGPES--PH 369

Query: 229 GNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLR---------HHTVAVIGPN 279
            +LG  DV  P  Q+LAL+AA +GIVLLKN   TLPL                VA+IG  
Sbjct: 370 ASLGWADVNRPEAQELALRAAVEGIVLLKNDNDTLPLPLPDDVVVTADGGRRRVAMIGFW 429

Query: 280 SDVTVTMIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQ----LIGAAEVAAR 335
           +D    + G Y+G      +P     +    +  AG   +  + ++        A  AA 
Sbjct: 430 ADAPDKLFGGYSGAPPFARSPASAARQLGWNVTVAGGPVLEGDSDEEEDTWTAPAVEAAA 489

Query: 336 QADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSF 395
            AD  V   GLD S   E  DR  +  P  Q  L+S +A+  + PVV+V M     D   
Sbjct: 490 DADYIVYFGGLDTSAAGETKDRMTIGWPAAQLALISELARLGK-PVVVVQMGDQLDDTPL 548

Query: 396 AKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            + D  +GA+LW  +PGQ GG A+  +L G  +P
Sbjct: 549 FELD-GVGAVLWANWPGQDGGTAVVRLLSGAESP 581


>gi|167751044|ref|ZP_02423171.1| hypothetical protein EUBSIR_02029 [Eubacterium siraeum DSM 15702]
 gi|167655962|gb|EDS00092.1| glycosyl hydrolase family 3 C-terminal domain protein [Eubacterium
           siraeum DSM 15702]
          Length = 691

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 164/442 (37%), Positives = 248/442 (56%), Gaps = 41/442 (9%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT W+PN+NIFRDPRWGRG ET GEDP LT +   ++V+G+QG     L+ AAC KH+ 
Sbjct: 109 GLTLWAPNINIFRDPRWGRGHETYGEDPYLTSRLGVNFVKGIQGEE-EYLRAAACAKHFA 167

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+ARVS++D+E+TY   FKA V EG+V  VM +YN+VNG+P+CA
Sbjct: 168 VH-----SGPESLRHEFDARVSEKDMEETYLPAFKALVKEGRVEGVMGAYNRVNGEPSCA 222

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
              ++      +W  DGY VSDC ++   + T   T T  ++AA A+KAG D++CG    
Sbjct: 223 SEKLMGKL--REWGFDGYFVSDCWAIRDFHTTHKITDTAPQSAAMALKAGCDVNCGN-TY 279

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           +H   A+  GL+ ++++  A  + +  ++RLG  D       F +L    +    ++ L+
Sbjct: 280 LHILAALEEGLITKQNIRTACIHALRTRIRLGQLDD----NEFDDLPFDIIACDGNKALS 335

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           L+AA + +VLL N    LPL   R  ++AVIGPN+D    ++GNY G      T L+GI 
Sbjct: 336 LEAAEKSMVLLHNDG-ILPLDKSRISSIAVIGPNADSRAALLGNYNGTPDRSVTFLEGIQ 394

Query: 306 RY--AKTIHQAGC-------FGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA---- 352
                +  +  GC        G+A  G++    A  A   AD TV+ +GLD ++E     
Sbjct: 395 DAFDGRVYYAEGCQLFRDRTQGLALPGDRYA-EAVAACEAADVTVVCVGLDATLEGEEGD 453

Query: 353 ---EFI--DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILW 407
              EF   D+  L LP  Q+ L+ ++ K +  P+++VL  G  V+     N     A++ 
Sbjct: 454 TGNEFASGDKPDLRLPEVQRVLLQKL-KDTGKPLIIVLAAGSSVNTECEGN-----ALIN 507

Query: 408 VGYPGQAGGAAIADVLFGRANP 429
             YPGQ GG A+A++LFG  +P
Sbjct: 508 AWYPGQYGGKALAEILFGEVSP 529


>gi|373954937|ref|ZP_09614897.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373891537|gb|EHQ27434.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 723

 Score =  259 bits (661), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 156/443 (35%), Positives = 245/443 (55%), Gaps = 28/443 (6%)

Query: 2   YNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAA 61
           YN    GLT+W+PN+NIFRDPRWGRGQET GEDP LT +   ++++GLQG+    LKVAA
Sbjct: 123 YNEKYGGLTFWTPNINIFRDPRWGRGQETYGEDPFLTSQIGVAFIQGLQGDDPEHLKVAA 182

Query: 62  CCKHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           C KH+  +     +G +  R+ FNA  S +DL +TY   FKA +V  +V +VMC+YN+ N
Sbjct: 183 CAKHFAVH-----SGPERLRHSFNAIASPKDLRETYLPAFKA-LVNARVEAVMCAYNRTN 236

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
            +  C    +L   +  +W   G++VSDC ++   Y          EA A A+K G+DL+
Sbjct: 237 SEVCCGSNLLLDQILRDEWHFTGHVVSDCGAIVDFYMGHKVVPGQPEAVALAVKHGVDLN 296

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG       E AV+ GL+ E++++ ALA  +  + +LG+FD + ++ P+ N+    + + 
Sbjct: 297 CGDEYPALIE-AVKRGLITEKEIDKALATLLKTRFKLGLFDPKQNS-PYNNIPVSVINST 354

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
            H+ LA + A + IVLLKN  + LPL         + GPN+     ++GNY GV    +T
Sbjct: 355 DHRALAKEVALKSIVLLKNE-KCLPLKN-NLSKYYITGPNAASVDALMGNYYGVNPHMST 412

Query: 300 PLQGISRY----AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE-- 353
            L+GI+      ++  ++ G   +  + N  I      A+ +D T +VMG+   +E E  
Sbjct: 413 ILEGIAGAIQPGSQMQYKPGIL-LDRDNNNPIDWTTGDAKASDVTFVVMGITGLLEGEEG 471

Query: 354 -------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
                  + DR    LP  Q + + ++ K ++  VV ++  G P+++S         A+L
Sbjct: 472 EAIASPNYGDRLDYNLPKNQIDFLRKIRKGNKNKVVAIITGGSPMNLSEVHE--LADAVL 529

Query: 407 WVGYPGQAGGAAIADVLFGRANP 429
              YPG+ GG A+AD+LFG+ +P
Sbjct: 530 LAWYPGEEGGNAVADILFGKVSP 552


>gi|365120422|ref|ZP_09338009.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363647477|gb|EHL86692.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 735

 Score =  258 bits (660), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 161/438 (36%), Positives = 236/438 (53%), Gaps = 28/438 (6%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYW+PN+NIFRDPRWGRGQET GEDP LTG    ++V GLQG+    LK +AC KHY 
Sbjct: 135 GLTYWTPNINIFRDPRWGRGQETYGEDPYLTGTLGKAFVNGLQGDDPKYLKASACAKHYA 194

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +   +   + R+ FN  VS  DL DTY   F+  VV+ KV+SVMC+YN + G+P C + 
Sbjct: 195 VH---SGPEISRHFFNTEVSMYDLWDTYLPAFRDLVVDAKVSSVMCAYNALAGQPCCGND 251

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYT-RTPEEAAADAIKAGLDLDCGPFLAI 186
            ++++ +  QW+  GY+ SDC ++      +H T      A+ADA+  G DL+CG  + +
Sbjct: 252 LLMQDILRKQWKFTGYVTSDCGAIDDF--LKHKTHADAAHASADAVLHGTDLECGQNIYV 309

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT-PAHQQLA 245
               AV+ GL+ E  ++ ++      + RLG+FD  P+ +      P  V     H+ LA
Sbjct: 310 KLVDAVKQGLITEAQIDESVKRLFMTRFRLGLFD--PADRVKYADTPLSVLECDEHKALA 367

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           L+ + + +VLLKN    LPL       +AVIGPN+D +  ++GNY G      TPL+ I 
Sbjct: 368 LKMSRESVVLLKND-NVLPLRK-NLKKIAVIGPNADDSTVVLGNYNGFPSKVITPLEAIR 425

Query: 306 ----RYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI------ 355
               +  + I+      V  +  + + A     +  D  + V G+   +E E +      
Sbjct: 426 SKVGKRTQVIYDRAIDCVKPSDEKTLNALIERLKGVDQVIFVGGISPRLEGEELPISVDG 485

Query: 356 ----DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYP 411
               DR  + LP  Q EL+ ++ +A   PV+ V+M G  + + +   +  I AIL   Y 
Sbjct: 486 FRGGDRTTIALPEVQTELMKKMKEAGL-PVIFVMMTGSALGIEWESQN--IPAILNAWYG 542

Query: 412 GQAGGAAIADVLFGRANP 429
           GQ  G AIADVLFG  NP
Sbjct: 543 GQFAGQAIADVLFGDYNP 560


>gi|85813774|emb|CAJ65923.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
          Length = 704

 Score =  258 bits (660), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 131/285 (45%), Positives = 189/285 (66%), Gaps = 7/285 (2%)

Query: 147 DCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLAL 206
           DCD+V VL+  Q Y +TPE+A ADA+K+G+      +L  +T+ AV    +   +++ AL
Sbjct: 229 DCDAVNVLHVEQKYAKTPEDAVADALKSGIS-----YLRNYTKSAVEKKKVTVSEIDRAL 283

Query: 207 AYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLS 266
               + +MRLG+F+G+P+ Q + ++GP  VC+  HQ LAL+AA  GIVLLKN+ R LPLS
Sbjct: 284 HNLFSTRMRLGLFNGDPTKQLYSDIGPDQVCSQEHQALALEAALDGIVLLKNADRLLPLS 343

Query: 267 TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRYAKTI-HQAGCFGVACNGNQ 325
                ++AVIGPN+  +  ++GNY G AC   T L+G+  Y  +  ++ GC  V+C    
Sbjct: 344 KSGISSLAVIGPNAHNSTNLLGNYFGPACKNVTILEGLRNYVSSASYEKGCNNVSCTSAA 403

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
                E+A  + D  +LVMGLDQS E E +DR  L+LPG+Q  L++ VAKA++ P+VLVL
Sbjct: 404 KKKPVEMAQTE-DQVILVMGLDQSQEKERLDRMDLVLPGKQPTLITAVAKAAKRPIVLVL 462

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G P+DV+FAKN+ +IG+ILW GYPGQAG  A+A ++FG  NPG
Sbjct: 463 LGGSPMDVTFAKNNRKIGSILWAGYPGQAGATALAQIIFGEHNPG 507



 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 56/98 (57%), Positives = 73/98 (74%), Gaps = 10/98 (10%)

Query: 1   MYNGG-MAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT------ 53
           +YN G + GL +W+PNVNIFRDPRWGRGQETPGEDP++ GKY AS+VRG+QG++      
Sbjct: 147 LYNAGQVTGLGFWAPNVNIFRDPRWGRGQETPGEDPLVVGKYGASFVRGVQGDSFEGEST 206

Query: 54  -GSRLKVAACCKHYTAYDLDNW--NGVDRYHFNARVSK 88
            G  L+ +ACCKHYTA+DLDNW  + V+  H   + +K
Sbjct: 207 LGDHLQASACCKHYTAHDLDNWDCDAVNVLHVEQKYAK 244


>gi|358380569|gb|EHK18247.1| glycoside hydrolase family 3 protein, partial [Trichoderma virens
           Gv29-8]
          Length = 722

 Score =  258 bits (660), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 164/431 (38%), Positives = 239/431 (55%), Gaps = 19/431 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG-SRLKVAACCKH 65
           AGL YW+PN+N +RDPRWGRGQETPGEDP    +YA +YV GL+G  G ++ KV + CKH
Sbjct: 107 AGLDYWAPNINPYRDPRWGRGQETPGEDPYHVAQYAYNYVVGLKGGVGPAKSKVVSTCKH 166

Query: 66  YTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
           +  YD+++ +GV R  +NA +S QDL + Y   F++C  + K  +VMCSYN VNG P+CA
Sbjct: 167 FAGYDIEDSDGVVRGSYNAIISTQDLAEYYLPSFRSCFRDAKTGAVMCSYNAVNGHPSCA 226

Query: 126 DPDILKNTIHGQWRLDG---YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP 182
           +  +L   +   W       ++  DC +V  ++N  H  ++  +  A AI  G DLDCG 
Sbjct: 227 NSYMLDTVLRDHWGWGSSAHWVTGDCGAVDGVFNQHHVGQSAAQGVAFAINNGTDLDCGT 286

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
             A +   AV+     E  ++ AL+   +  + LG FD  P  Q +  LG  DV TP+ Q
Sbjct: 287 AYASNIASAVQNNYTTEAQLDQALSRLYSSLIVLGYFD-PPEGQEYRTLGVSDVNTPSTQ 345

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGP-NSDVTVTMIGNYAGVACGYTTPL 301
           +LA  A  +GI +       LP+  +   TV  +GP  ++ +V+M GNY GVA   T P+
Sbjct: 346 KLAYTALVEGINI-------LPIRPMG-QTVLFVGPWANNASVSMFGNYNGVAPYKTIPV 397

Query: 302 QGISRYA---KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
              +  A      +  G   V  N      AA  AA++AD  V + G+D+ +EAE  DR 
Sbjct: 398 PTANSSAYNWNVTYSQGLQYVLSNDTSQFAAAVSAAQEADVVVYIGGIDEQVEAEAHDRT 457

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            +  PG Q  L+ ++A     PVV+V + GG VD S    +  +  +LW+GYPGQ  G+ 
Sbjct: 458 SIDWPGAQLNLIKQLAAVK--PVVVVQVGGGQVDDSSLLQNKNVKGLLWMGYPGQEFGSG 515

Query: 419 IADVLFGRANP 429
           + D+L G + P
Sbjct: 516 LIDILSGASAP 526


>gi|291240563|ref|XP_002740191.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 747

 Score =  258 bits (659), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 157/422 (37%), Positives = 226/422 (53%), Gaps = 10/422 (2%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G   GL+ +SP +NI R P WGR QET GEDP L+G  AASYV GLQGN    +   A C
Sbjct: 135 GDHKGLSCFSPVINIARHPLWGRIQETYGEDPYLSGTLAASYVNGLQGNHPRYVTANAGC 194

Query: 64  KHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
           KH+ AY         R  F+A+VS +DL  T+   F  C+  G   S+MCSYN +NG P 
Sbjct: 195 KHFDAYAGPEDIPSSRSTFDAKVSDRDLRMTFLPAFHECIQAG-THSLMCSYNSINGVPA 253

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPF 183
           CA+  +L + +  +W   GY++SD  +V  +Y+  HYT+   + A   + +GL+L+    
Sbjct: 254 CANKKLLTDILRTEWNFTGYVISDQSAVEKVYDAHHYTKDMLDTAIACVNSGLNLELSSN 313

Query: 184 LA----IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           L     + T  AV+ G +  + V   ++     +MRLG FD  P   P+  L    + + 
Sbjct: 314 LEDNVMMQTTKAVKQGNVTMKTVKARVSPLFYTRMRLGEFD-PPEMNPYSKLDLSIIQSQ 372

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT- 298
            HQ+L+L+AA +  VLLKN  R LPL   +   +AV+GP +D    + G+Y+     YT 
Sbjct: 373 EHQELSLKAAAKSFVLLKNENRFLPLKE-KIDKLAVVGPLADNVDALYGDYSATPNNYTV 431

Query: 299 TPLQGISRYA-KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           TP  G++R A  T + +GC    C      G  + A   AD  V+ +G    IE+E  DR
Sbjct: 432 TPRNGLARLAGNTSYASGCDNPKCRKYD-SGQVKSAVSGADMVVVCVGTGTDIESEGNDR 490

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             L LPG+Q  L+    K    PV+L+L   GP+DVS+A  +P +  I+   +P QA G 
Sbjct: 491 HELALPGKQLSLLQDAVKFGTKPVILLLFNAGPLDVSWAVENPAVQTIVACFFPAQATGD 550

Query: 418 AI 419
           A+
Sbjct: 551 AL 552


>gi|116197206|ref|XP_001224415.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
 gi|88181114|gb|EAQ88582.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
          Length = 735

 Score =  258 bits (659), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 162/436 (37%), Positives = 234/436 (53%), Gaps = 17/436 (3%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N G A L +W+PNVN FRDPRWGRG ETPGED     K+A ++V+G+QG T S  +V A 
Sbjct: 115 NNGRAHLDFWTPNVNPFRDPRWGRGHETPGEDAFRNKKWAEAFVQGMQG-TESTHRVIAT 173

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKHY AYDL+N     R++F+A+VS QDL + Y  PF+ C  + KV S+MCSYN VNG P
Sbjct: 174 CKHYAAYDLENSGSTTRFNFDAKVSTQDLAEYYLPPFQQCARDSKVGSIMCSYNAVNGVP 233

Query: 123 TCADPDILKNTIHGQWRL---DGYIVSDCDSVGVLYNT---QHYTRTPEEAAADAIKAGL 176
            CA P ++   +   W     + Y+VSDCD+V  L N      Y  +   A   +++AG 
Sbjct: 234 ACASPYLMDTILRKHWNWTDQNQYVVSDCDAVYYLGNANGGHRYKSSYAAAIGASLEAGC 293

Query: 177 DLDCGPFLAIHTE--GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPR 234
           D  C        +   A       +  ++ A+   +   ++ G FDG  S   + NL   
Sbjct: 294 DNMCWATGGTTPDPASAFNSRQFTQATLDKAMLRQMQGLVKAGYFDGPNSL--YRNLTAA 351

Query: 235 DVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHT-VAVIGPNSDVTVTMIGNYAGV 293
           DV T   +  AL+AA +GIVLLKN    LPL+    +T VA+IG  ++    M+G Y+G 
Sbjct: 352 DVNTQVARDTALKAAEEGIVLLKND-NILPLTLGGSNTQVAMIGFWANAADKMLGGYSGS 410

Query: 294 ACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
                 P+        T++     G     N    AA  AA+++   +   G+D ++E E
Sbjct: 411 PPFSHDPVTAARSMGITVNYVN--GPLTQTNADTSAAVNAAQKSSVVIFFGGIDNTVEKE 468

Query: 354 FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 413
             DR  +  P  Q  ++ R+A+  + PV++V M G  VD +   + P + AILW GYPGQ
Sbjct: 469 SQDRTSIAWPSGQLTMIQRLAQTGK-PVIVVRM-GTHVDDTPLLSIPNVKAILWAGYPGQ 526

Query: 414 AGGAAIADVLFGRANP 429
            GG A+ +++ G A+P
Sbjct: 527 DGGTAVMNLITGLASP 542


>gi|224068504|ref|XP_002302759.1| predicted protein [Populus trichocarpa]
 gi|222844485|gb|EEE82032.1| predicted protein [Populus trichocarpa]
          Length = 273

 Score =  258 bits (659), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 114/133 (85%), Positives = 125/133 (93%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           M+NGG+AGLTYWSPNVNIFRDPRWGRGQETPGEDPV+ GKYAASYVRGLQGN G RLKVA
Sbjct: 141 MFNGGVAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNDGDRLKVA 200

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           ACCKH+TAYDLDNWNGVDR+HFNA+VSKQD+EDT++VPF+ CV EGKVASVMCSYNQVNG
Sbjct: 201 ACCKHFTAYDLDNWNGVDRFHFNAQVSKQDMEDTFDVPFRMCVKEGKVASVMCSYNQVNG 260

Query: 121 KPTCADPDILKNT 133
            PTCADP +LK T
Sbjct: 261 IPTCADPKLLKKT 273


>gi|336463686|gb|EGO51926.1| hypothetical protein NEUTE1DRAFT_125528 [Neurospora tetrasperma
           FGSC 2508]
          Length = 788

 Score =  257 bits (656), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 177/464 (38%), Positives = 245/464 (52%), Gaps = 42/464 (9%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N G  GL YW+PNVN ++DPRWGRG ETPGEDP+    Y  + + GL+GN   R KV A 
Sbjct: 81  NFGFGGLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYVKAMLAGLEGNETVR-KVIAT 139

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV---- 118
           CKHY AYDL+ W+G+ RY F A V+ QDL + Y  PF+ C  + KV S+MCSYN +    
Sbjct: 140 CKHYAAYDLERWHGLTRYEFEAIVTLQDLSEYYLPPFQQCARDSKVGSIMCSYNALTIRD 199

Query: 119 -------------NGKPTCADP---DILKNTIHGQW-RLDGYIVSDCDSV-GVLYNTQHY 160
                          +P CA+     IL++  H  W   + YI SDC+++   L +  ++
Sbjct: 200 MAGGNPDEIINLTTAQPACANTYLMTILRD--HWNWTEHNNYITSDCNAILDFLPDNHNF 257

Query: 161 TRTPEEAAADAIKAGLDLDCGPFLAIHTE--GAVRGGLLREEDVNLALAYTITVQMRLGM 218
           ++TP EAAA A KAG D  C    +  T+  GA    LL E  ++ AL       +R G 
Sbjct: 258 SQTPAEAAAAAYKAGTDTVCEVSGSPLTDVVGAYNQSLLPEAVIDTALRRLYEGLIRAGY 317

Query: 219 FD------------GEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLS 266
            D            G  S+  +  L   DV TP+ Q+LAL++A +GIVLLKNS   LPL 
Sbjct: 318 LDHGRSSAVAGGDGGSFSSPAYDALNWEDVNTPSTQELALRSATEGIVLLKNSGSLLPLD 377

Query: 267 TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRYAKTIHQA-GCFGVACNGNQ 325
                 VA+IG  ++ T TM G Y+G+   Y  PL    +   +   A G    A + + 
Sbjct: 378 FSSGKKVALIGHWANATGTMRGPYSGIPPFYHNPLYAAQQLNLSFSYANGPVVNASDPDT 437

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
               A  AA  AD  +   G D ++ +E +DR  +  P  Q +L+S +  A  G  ++V+
Sbjct: 438 WTAPALAAAEGADVVLYFGGTDTTVASEDLDRESIAWPKAQMKLLSEL--AGLGKPLVVI 495

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
             G  VD SF   +  + +ILWVGYPGQ+GG A+ DVL G+  P
Sbjct: 496 QLGDQVDDSFLLENGNVSSILWVGYPGQSGGTAVFDVLTGKKAP 539


>gi|367028614|ref|XP_003663591.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
 gi|347010860|gb|AEO58346.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
          Length = 760

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 169/436 (38%), Positives = 232/436 (53%), Gaps = 17/436 (3%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N G A L +W+PNVN FRDPRWGRG ETPGED     K+A ++V+G+QG  G   +V A 
Sbjct: 139 NNGRAHLDFWTPNVNPFRDPRWGRGHETPGEDAFKNKKWAEAFVKGMQG-PGPTHRVIAT 197

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKHY AYDL+N     R++F+A+VS QDL + Y  PF+ C  + KV S+MCSYN VN  P
Sbjct: 198 CKHYAAYDLENSGSTTRFNFDAKVSTQDLAEYYLPPFQQCARDSKVGSIMCSYNAVNEIP 257

Query: 123 TCADPDILKNTIHGQWRLDG---YIVSDCDSVGVLYNTQHYTR-TPEEAAA--DAIKAGL 176
            CA+P ++   +   W       YIVSDCD+V  L N     R  P  AAA   +++AG 
Sbjct: 258 ACANPYLMDTILRKHWNWTDEHQYIVSDCDAVYYLGNANGGHRYKPSYAAAIGASLEAGC 317

Query: 177 DLDCGPF--LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPR 234
           D  C      A     A   G   +  ++ A+   +   +  G FDG P    + NL   
Sbjct: 318 DNMCWATGGTAPDPASAFNSGQFSQTTLDTAILRQMQGLVLAGYFDG-PGGM-YRNLSVA 375

Query: 235 DVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTL-RHHTVAVIGPNSDVTVTMIGNYAGV 293
           DV T   Q  AL+AA  GIVLLKN    LPLS    +  VA+IG  ++    M+G Y+G 
Sbjct: 376 DVNTQTAQDTALKAAEGGIVLLKNDG-ILPLSVNGSNFQVAMIGFWANAADKMLGGYSGS 434

Query: 294 ACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
                 P+        T++         NG+    AA  AA++++A V   G+D ++E E
Sbjct: 435 PPFNHDPVTAARSMGITVNYVNGPLTQPNGD--TSAALNAAQKSNAVVFFGGIDNTVEKE 492

Query: 354 FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 413
             DR  +  P  Q  L+ R+A+   G  V+V+  G  VD +   + P + AILW GYPGQ
Sbjct: 493 SQDRTSIEWPSGQLALIRRLAET--GKPVIVVRLGTHVDDTPLLSIPNVRAILWAGYPGQ 550

Query: 414 AGGAAIADVLFGRANP 429
            GG A+  ++ G A+P
Sbjct: 551 DGGTAVVKIITGLASP 566


>gi|325192664|emb|CCA27085.1| unnamed protein product [Albugo laibachii Nc14]
          Length = 2278

 Score =  256 bits (654), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 171/447 (38%), Positives = 231/447 (51%), Gaps = 39/447 (8%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR-LKVAAC 62
           G   GL  WSPN+NI RDPRWGR  ETP EDP +  KY  +Y +GLQ    SR L+    
Sbjct: 144 GPHIGLDCWSPNININRDPRWGRAMETPSEDPYVNAKYGVAYTKGLQEGQDSRFLQAVVT 203

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
            KHY AY  +N+ G DR  F+A VS  D  DTY   F+A VV+GK   +MCSYN +NG P
Sbjct: 204 LKHYLAYSYENYGGTDRTQFDAIVSAYDFADTYFPAFEASVVDGKAKGIMCSYNSLNGIP 263

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL---- 178
           TCA+   L   +      DGYI SD  ++  +++   YT+T  EA   A+++G+D+    
Sbjct: 264 TCAN-KWLNQLLRDDLEFDGYITSDTGAIQGIFDGHKYTKTLCEATKIAMESGVDICSGN 322

Query: 179 ---DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD 235
              +C   LA  T  +          ++ A+  T+ ++ +LG+FD     QP  + GP D
Sbjct: 323 AYWNCLKQLANSTNFSA--------SIDEAIRRTLKLRFQLGLFDA-IGDQP--HFGPED 371

Query: 236 VCTPAHQQLALQAAHQGIVLLKNSARTLPLST-LRHHTVAVIGPNSDVTVTMIGNYAGVA 294
           V T    QL+L  A + IVLL+N   TLPL   LR   +AVIGP+S     ++GNY G  
Sbjct: 372 VRTAKSLQLSLDLARKSIVLLQNHGNTLPLRLGLR---IAVIGPHSMTRRGIMGNYYGQL 428

Query: 295 CG--------YTTPLQGISRY---AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLV 343
           C           +PL+ I        T H  GC G+          A  A R AD  VL 
Sbjct: 429 CHGDYDEVRCIQSPLEAIQSVNGRNNTHHVNGC-GINDTSTAEFDDALQAVRTADVAVLF 487

Query: 344 MGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIG 403
           +G+D SIE E  DR  + +P  Q EL+  +  A + P V+VL  GG + +   K      
Sbjct: 488 LGIDISIERESKDRDNIDVPHIQLELLKAIRVAGK-PTVVVLFNGGILGIE--KLILYAD 544

Query: 404 AILWVGYPGQAGGAAIADVLFGRANPG 430
           ++L   YPG  G  AIA++LFG  NP 
Sbjct: 545 SVLEAFYPGFFGAQAIAEILFGSINPS 571


>gi|449299051|gb|EMC95065.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
           10762]
          Length = 849

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 161/443 (36%), Positives = 234/443 (52%), Gaps = 21/443 (4%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL----- 57
           N    GL Y++PN+N F+DPRWGRG ETPGEDP+    Y  + + GL+G           
Sbjct: 145 NAARGGLDYFTPNINPFKDPRWGRGSETPGEDPLRIQGYVKNLLIGLEGTDDGYFNTSHS 204

Query: 58  ---KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCS 114
              K+ A CKH+  YDL++W+G  RY ++A ++ QDL + Y  PF+ C  +  VAS+MCS
Sbjct: 205 GYKKMIATCKHFAGYDLEDWDGYIRYGYDAEITTQDLAEYYLPPFQTCARDQNVASIMCS 264

Query: 115 YNQVNGKPTCADPDILKNTI---HGQWRLD-GYIVSDCDSVGVLYNTQHYTRTPEEAAAD 170
           YN VN  P CA+   L+ TI   H  W +D  YI SDC+++  +Y   +Y+     AA  
Sbjct: 265 YNSVNSVPACAN-SYLQETILREHWGWTIDNNYITSDCNAISDIYYNHNYSVNNAAAAGL 323

Query: 171 AIKAGLDLDC---GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQP 227
           ++  G+D  C      +     G+  GG + E  +  AL       +  G FD   S+ P
Sbjct: 324 SLSNGMDTACIVANTGVMTDVNGSYYGGYVTEATITTALIRQYEALVIAGYFD-PASSNP 382

Query: 228 FGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMI 287
           + ++G   V TPA Q LA QAA +G  LLKN+   LP        VA+IG  ++ T  M 
Sbjct: 383 YRSIGWSSVNTPAAQTLARQAATEGTTLLKNTG-LLPYKFTSQTKVAMIGMWANGTSQMQ 441

Query: 288 GNYAGVACGYTTPLQGISRYAKTIHQA-GCFGVACNGNQLIGAAEVAARQADATVLVMGL 346
           G Y+G A    +PL   S+   + + A G        +     A  AA+ AD  +   G+
Sbjct: 442 GGYSGPAPYLHSPLYAASQLGLSYNYANGPINQTTLTSNYSQNATAAAQNADVILFFGGI 501

Query: 347 DQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
           D S+EAE +DR  +  PG QQ L++++  A+ G  ++VL  G  +D +   ++  I A++
Sbjct: 502 DWSVEAEAMDRYQIAWPGAQQALIAQL--AALGKPMIVLQMGSMLDATPILSNNNISALV 559

Query: 407 WVGYPGQAGGAAIADVLFGRANP 429
           WVGYPGQ GG A  D+L G   P
Sbjct: 560 WVGYPGQDGGVAAFDILTGAVAP 582


>gi|373852136|ref|ZP_09594936.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
 gi|372474365|gb|EHP34375.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
          Length = 740

 Score =  255 bits (651), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 165/441 (37%), Positives = 243/441 (55%), Gaps = 28/441 (6%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHY 66
           GLT+W+PN+N+FRDPRWGRGQET GEDP LT + AA++VRGLQG+T  + LK+AAC KHY
Sbjct: 131 GLTFWTPNINLFRDPRWGRGQETWGEDPHLTARLAAAFVRGLQGDTPDTHLKLAACAKHY 190

Query: 67  TAYD-LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +   +N    +R+ FNARV+  DL D+Y   F+  V   +V SVM +YN+   +P CA
Sbjct: 191 AVHSGPEN----ERHTFNARVTPHDLWDSYLPAFEHLVRHARVESVMGAYNRTLDEPCCA 246

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
              +L + +  +W  +G++VSDC ++  ++ T   T  P E+AA A+  G DL CG    
Sbjct: 247 SQFLLLDILRERWGFEGHVVSDCWALRDIHETHRITTDPVESAALALTKGCDLACGTTFE 306

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFD-GEPSAQPFGN-LGPRDVCT-PAHQ 242
           +  E AV+ GL+ E D++ AL+  +  + +LGMFD  + +  P+ N   P  + T  AH 
Sbjct: 307 LLGE-AVQRGLITEADIDRALSRHLRARFKLGMFDPADDNRNPWSNPPAPEAIVTCAAHT 365

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
            LA +AA    VLL+N    LPL      ++ + GP +     ++GNY G+     T L 
Sbjct: 366 ALACEAAVASCVLLQNHNHILPLRP-DVRSIYITGPLAATQDALLGNYYGLPPRAITLLD 424

Query: 303 GIS----RYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE----- 353
           G++       +  ++ G        N L   AE      D T+  +GL   +E E     
Sbjct: 425 GLAAALPEGIRADYRPGALLSTPKQNAL-EWAEFDCASCDVTIACLGLTALLEGEEGEAI 483

Query: 354 ----FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVG 409
                 DR  + LP  Q+  +  + +  RG  V+V++ GG   +S      ++ AILW G
Sbjct: 484 ASSLHGDRDDISLPPPQRLFLESLIQ--RGARVIVILFGGSA-LSLGPLADKVEAILWAG 540

Query: 410 YPGQAGGAAIADVLFGRANPG 430
           YPGQ GG A+AD+L GRA+P 
Sbjct: 541 YPGQEGGRALADILLGRASPS 561


>gi|164428543|ref|XP_964543.2| hypothetical protein NCU00709 [Neurospora crassa OR74A]
 gi|157072187|gb|EAA35307.2| hypothetical protein NCU00709 [Neurospora crassa OR74A]
          Length = 786

 Score =  255 bits (651), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 178/464 (38%), Positives = 247/464 (53%), Gaps = 43/464 (9%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N G  GL YW+PNVN ++DPRWGRG ETPGEDP+    Y  + + GL+GN   R KV A 
Sbjct: 81  NFGFGGLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYVKAILAGLEGNETVR-KVIAT 139

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYN------ 116
           CKHY AYDL+ W+G+ RY F A V+ QDL + Y  PF+ C  + KV S+MCSYN      
Sbjct: 140 CKHYAAYDLERWHGLTRYEFEAIVTLQDLSEYYLPPFQQCARDSKVGSIMCSYNALTIRD 199

Query: 117 --------QVN---GKPTCADP---DILKNTIHGQW-RLDGYIVSDCDSV-GVLYNTQHY 160
                   ++N    +P CA P    IL++  H  W   + YI SDC+++   L +  ++
Sbjct: 200 MASGKPDEEINLTTAQPACAKPYLMTILRD--HWNWTEHNNYITSDCNAILDFLPDNHNF 257

Query: 161 TRTPEEAAADAIKAGLDLDCGPFLAIHTE--GAVRGGLLREEDVNLALAYTITVQMRLGM 218
           ++TP EAAA A KAG D  C    +  T+  GA    LL E  ++ AL       +R G 
Sbjct: 258 SQTPAEAAAAAYKAGTDTVCEVSGSPLTDVVGAYNQSLLPEAVIDTALRRLYEGLIRAGY 317

Query: 219 FD------------GEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLS 266
            D            G  S+  +  L   DV TP+ Q+LAL++A +GIVLLKN+   LPL 
Sbjct: 318 LDHGRSSAVAGGDGGSFSSPAYDALNWEDVNTPSTQELALRSATEGIVLLKNAGSLLPLD 377

Query: 267 TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRYAKTIHQA-GCFGVACNGNQ 325
                 VA+IG  ++ T TM G Y+G+   Y  PL    +   +   A G    A + + 
Sbjct: 378 -FSGKKVALIGHWANATGTMRGPYSGIPPFYHNPLYAAQQLNLSFSYANGPVVNASDPDT 436

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
               A  AA  AD  +   G D ++ +E +DR  +  P  Q +L+S +  A  G  ++V+
Sbjct: 437 WTAPALAAAEGADVVLYFGGTDTTVASEDLDRESIAWPETQMQLLSEL--AGLGKPLVVI 494

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
             G  VD S   N+  + +ILWVGYPGQ+GG A+ DVL G+  P
Sbjct: 495 QLGDQVDDSSLLNNGNVSSILWVGYPGQSGGTAVFDVLTGKKAP 538


>gi|118489157|gb|ABK96385.1| unknown [Populus trichocarpa x Populus deltoides]
          Length = 343

 Score =  254 bits (649), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 124/145 (85%), Positives = 130/145 (89%)

Query: 286 MIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG 345
           MIGNYAGVACGYTTPLQGI RYAKT+H +GC  V CNGNQ   AAEVAAR ADAT+LVMG
Sbjct: 1   MIGNYAGVACGYTTPLQGIRRYAKTVHLSGCNDVFCNGNQQFNAAEVAARHADATILVMG 60

Query: 346 LDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAI 405
           LDQSIEAEF DR GLLLPG QQELVSRVA+ASRGP +LVLM GGP+DVSFAKNDPRIGAI
Sbjct: 61  LDQSIEAEFRDRKGLLLPGYQQELVSRVARASRGPTILVLMSGGPIDVSFAKNDPRIGAI 120

Query: 406 LWVGYPGQAGGAAIADVLFGRANPG 430
           LWVGYPGQAGGAAIADVLFG ANPG
Sbjct: 121 LWVGYPGQAGGAAIADVLFGTANPG 145


>gi|350295750|gb|EGZ76727.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
          Length = 839

 Score =  254 bits (649), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 178/464 (38%), Positives = 246/464 (53%), Gaps = 43/464 (9%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N G  GL YW+PNVN ++DPRWGRG ETPGEDP+    Y  + + GL+GN   R KV A 
Sbjct: 140 NFGFGGLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYVKAMLAGLEGNETVR-KVIAT 198

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV---- 118
           CKHY AYDL+ W+G+ RY F A V+ QDL + Y  PF+ C  + KV S+MCSYN +    
Sbjct: 199 CKHYAAYDLERWHGLTRYEFEAIVTLQDLSEYYLPPFQQCARDSKVGSIMCSYNALTIRD 258

Query: 119 --------------NGKPTCADP---DILKNTIHGQW-RLDGYIVSDCDSV-GVLYNTQH 159
                           +P CA+     IL++  H  W   + YI SDC+++   L +  +
Sbjct: 259 MAGGSKPDEIINLTTAQPACANTYLMTILRD--HWNWTEHNNYITSDCNAILDFLPDNHN 316

Query: 160 YTRTPEEAAADAIKAGLDLDCGPFLAIHTE--GAVRGGLLREEDVNLALAYTITVQMRLG 217
           +++TP EAAA A KAG D  C    +  T+  GA    LL E  ++ AL       +R G
Sbjct: 317 FSQTPAEAAAAAYKAGTDTVCEVSGSPLTDVVGAYNQSLLPEAVIDTALRRLYEGLIRAG 376

Query: 218 MFD-----------GEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLS 266
             D           G  S+  +  L   DV TP+ Q+LAL++A +GIVLLKNS   LPL 
Sbjct: 377 YLDHGRSAVAGGDGGSFSSPAYDALNWNDVNTPSTQELALRSATEGIVLLKNSGSLLPLD 436

Query: 267 TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRYAKTIHQA-GCFGVACNGNQ 325
                 VA+IG  ++ T TM G Y+G+   Y  PL    +   ++  A G    A + + 
Sbjct: 437 -FSGKKVALIGHWANATGTMRGPYSGIPPFYHNPLYAAQQLNLSLSYANGPVVNASDPDT 495

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
               A  AA  AD  +   G D ++ +E +DR  +  P  Q +L+S +  A  G  V+V+
Sbjct: 496 WTAPALAAAEGADVVLYFGGTDTTVASEDLDRESIAWPEAQMKLLSEL--AGLGKPVVVI 553

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
             G  VD S   N+  + +ILWVGYPGQ+GG A+ DVL G+  P
Sbjct: 554 QLGDQVDDSSLLNNGNVSSILWVGYPGQSGGTAVFDVLTGKKAP 597


>gi|109897152|ref|YP_660407.1| beta-glucosidase [Pseudoalteromonas atlantica T6c]
 gi|109699433|gb|ABG39353.1| Beta-glucosidase [Pseudoalteromonas atlantica T6c]
          Length = 733

 Score =  254 bits (649), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 159/441 (36%), Positives = 236/441 (53%), Gaps = 31/441 (7%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           +GLT+W+PN+NIFRDPRWGRGQET GEDP LT +   + V GLQG+    LK AA  KH+
Sbjct: 130 SGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGDHPKYLKTAAAAKHF 189

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+A  S++D+ +TY   F+A V E  V +VM +YN+VNG P  
Sbjct: 190 AVH-----SGPEALRHEFDAIASEKDMYETYFPAFEALVTEADVETVMAAYNRVNGHPAG 244

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFL 184
               +L   +  +W   G+IVSDC  +   +     T    E+AA AI  G DL+CG   
Sbjct: 245 GSDFLLNTVLRDKWGFSGHIVSDCWGLADFHEYHKVTANAVESAALAINTGTDLNCG--- 301

Query: 185 AIHTE--GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
           +++T    AV  GL+ E+ ++  L   +  + +LG FD +    P+ ++    V + AH 
Sbjct: 302 SVYTALPDAVEAGLVDEKTIDTRLHKVLATKFKLGFFDPKDD-NPYNSISADVVNSDAHA 360

Query: 243 QLALQAAHQGIVLLKNSARTLPL-STLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
            +A + A + IVLL+N  + LPL   +R+  V V GP +  +  ++GNY G++   T  L
Sbjct: 361 DVAYEMAVKSIVLLQNENQVLPLDKNIRN--VYVTGPFASSSEVLLGNYYGLSGKTTNIL 418

Query: 302 QGIS---RYAKTI-HQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE---- 353
            GI+       TI ++ G      N N +      A +  D  + VMGL  + E E    
Sbjct: 419 DGITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEGEEGEA 478

Query: 354 -----FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWV 408
                  DR  L LP  Q E + ++ K +  PV++VL  G PV+V+         AI++ 
Sbjct: 479 IASPHKGDRLSLDLPEHQIEFLRKLRKDNDKPVIVVLTAGTPVNVTEIAQ--LADAIVFA 536

Query: 409 GYPGQAGGAAIADVLFGRANP 429
            YPGQ GG A+AD+LFG  +P
Sbjct: 537 WYPGQEGGKAVADILFGERSP 557


>gi|398406144|ref|XP_003854538.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
 gi|339474421|gb|EGP89514.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
          Length = 884

 Score =  254 bits (649), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 161/435 (37%), Positives = 233/435 (53%), Gaps = 22/435 (5%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG--NTGSRLKVA 60
           N   +G  +W+PN+N F DPRWGRG E P ED     +Y AS V GLQG        ++ 
Sbjct: 250 NYAHSGYDFWTPNMNTFLDPRWGRGLEVPTEDSFHAQRYVASLVPGLQGGKEKTDHKQII 309

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A CKH+  YD++     +R+  N   + QDL + Y   FK CV +  V S+MCSYN V G
Sbjct: 310 ATCKHFAVYDVE----TNRHAQNYEPTPQDLGEYYLPAFKTCVRDVNVGSIMCSYNAVYG 365

Query: 121 KPTCADPDILKNTIHGQWRLDG---YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD 177
            P CA    L++ +  QW  +    Y+ SDC++V  ++   ++T T   AAA A+ AG D
Sbjct: 366 VPACASEYFLQDVLRDQWNFNEPYHYVTSDCEAVKDIWTPHNFTDTEPAAAAVALNAGTD 425

Query: 178 LDCG-PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDV 236
            +CG  +L ++T  +V      E  ++++L         +G FDG+P    +  L   DV
Sbjct: 426 TNCGTSYLQLNT--SVANNWTTEAQMDISLTRLYNALFTVGYFDGQPE---YDGLSFADV 480

Query: 237 CTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
            TP  Q  A +AA +GI LLKN    LPL    +++VA+IGP ++ T  M G Y G+A  
Sbjct: 481 STPFAQATAYRAASEGITLLKNDG-LLPLKK-SYNSVALIGPWANATTQMQGIYQGIAPY 538

Query: 297 YTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEV--AARQADATVLVMGLDQSIEAEF 354
             +PL      A+  H +   G A N     G A    AAR AD  +   G+D SIE E 
Sbjct: 539 LVSPLAAAQ--AQWGHISFTNGTAINSTNTTGFASALSAARDADVIIYAGGIDSSIEKES 596

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
            DR  +  PG Q +LV ++++  + P+V+V   GG VD S    +  + +++W GYPGQ 
Sbjct: 597 RDRTSISWPGNQLDLVQQLSELGK-PLVVVQFGGGQVDDSALLRNKNVNSLVWAGYPGQD 655

Query: 415 GGAAIADVLFGRANP 429
           GG+A+ DVL G+ +P
Sbjct: 656 GGSALIDVLVGKQSP 670


>gi|291240561|ref|XP_002740190.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 763

 Score =  254 bits (648), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 163/443 (36%), Positives = 236/443 (53%), Gaps = 24/443 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL+ +SP +NI R P WGR QET GEDP L G+  ++YVRGLQG+    +   A CKH+ 
Sbjct: 136 GLSCFSPVINIARHPLWGRNQETYGEDPFLIGELGSAYVRGLQGDHPRYVLANAGCKHFD 195

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +       V R+ F+A+V ++D + T+   F  CV  G V SVMCSYN++N  P CA+ 
Sbjct: 196 VHGGPEDIPVSRFSFDAKVFERDWQMTFLPAFHECVKAG-VYSVMCSYNRINEVPACANT 254

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD----CGPF 183
            +L + +  +W  DGY+VSD  +V  +  + HYT +  +  A A+ AG +LD     G  
Sbjct: 255 RLLTDILRKEWGFDGYVVSDEGAVEFIMTSHHYTDSIVDTVASAVNAGCNLDLAFPVGDG 314

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
           + I    AV  G ++E+ V   +      +MRLG FD  P   P+ NL    V +  H++
Sbjct: 315 MYIKIGDAVTAGKIKEKTVVERVKPLFYTRMRLGEFD-PPELNPYANLNLSVVQSEEHRE 373

Query: 244 LALQAAHQG-----IVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG-- 296
           LA++AA Q       VLLK   R LPL TL +  +AVIGP +D    + G+Y+       
Sbjct: 374 LAVKAALQSFVLLNFVLLKREGRVLPLDTLVNK-LAVIGPFADNPSYLFGDYSPNPDKEF 432

Query: 297 YTTPLQGISRYAK-TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI 355
             TP +G+S  A+ T    GC    C         + A   AD  V+ +G    IEAEF+
Sbjct: 433 VVTPCKGLSNAARDTRCTPGCLTAPCT-TYFSEMVKAAVTGADLIVVCLGTGVKIEAEFV 491

Query: 356 DRAGLLLPGRQQELVSRVAKASRG-PVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
           DR+ L LPG+Q +L+  V K + G P++L+L   GP+D+ +A  +P I  I+   +P QA
Sbjct: 492 DRSDLSLPGKQFQLLQDVVKYANGKPIILLLFNAGPLDIVWAVENPAIQVIVACFFPSQA 551

Query: 415 GGAAIADVLF-------GRANPG 430
            G A+  +         G  NPG
Sbjct: 552 TGDALYRMFMNTHGVDTGNGNPG 574


>gi|115436096|ref|NP_001042806.1| Os01g0296700 [Oryza sativa Japonica Group]
 gi|113532337|dbj|BAF04720.1| Os01g0296700, partial [Oryza sativa Japonica Group]
          Length = 522

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 147/323 (45%), Positives = 192/323 (59%), Gaps = 19/323 (5%)

Query: 118 VNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYT-RTPEEAAADAIKAGL 176
           +NG P CAD  +L  T+   W+L GYIVSDCDSV V+     +   T  EA A A+KAGL
Sbjct: 1   INGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTGVEATAAAMKAGL 60

Query: 177 DLDCG-------PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFG 229
           DLDCG        F   +   AVR G L+E  V+ AL       MRLG FDG P  +   
Sbjct: 61  DLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLGFFDGIPELE--- 117

Query: 230 NLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVT--MI 287
           +LG  DVCT  H++LA  AA QG+VLLKN A  LPLS  + ++VA+ G    +  T  M+
Sbjct: 118 SLGAADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFGQLQHINATDVML 177

Query: 288 GNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLD 347
           G+Y G  C   TP  G+ +   +     C   +C+       A  AA+  DAT++V GL+
Sbjct: 178 GDYRGKPCRVVTPYDGVRKVVSSTSVHACDKGSCD------TAAAAAKTVDATIVVAGLN 231

Query: 348 QSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILW 407
            S+E E  DR  LLLP  Q   ++ VA+AS  P+VLV+M  G VDVSFA+++P+IGA++W
Sbjct: 232 MSVERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSFAQDNPKIGAVVW 291

Query: 408 VGYPGQAGGAAIADVLFGRANPG 430
            GYPG+ GG AIADVLFG+ NPG
Sbjct: 292 AGYPGEEGGTAIADVLFGKYNPG 314


>gi|326427096|gb|EGD72666.1| hypothetical protein PTSG_04397 [Salpingoeca sp. ATCC 50818]
          Length = 614

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 158/427 (37%), Positives = 229/427 (53%), Gaps = 19/427 (4%)

Query: 13  SPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHYTAYDL 71
           SPN+NI RDPRWGR QE P EDP+L G++   Y  GLQ    SR  KV    KH+ AY L
Sbjct: 11  SPNININRDPRWGRNQEVPSEDPLLNGEFGKLYTMGLQQGEDSRYTKVVVTLKHWDAYSL 70

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
           ++ +G  R++F+A+VS   L DTY   F+  V+EG    VMCSYN +NG+PTC  P +L 
Sbjct: 71  EDSDGFTRHNFDAKVSNFALMDTYWPAFRKAVMEGNAKGVMCSYNALNGRPTCTHP-LLT 129

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAG-LDLDCGPFLAIHTEG 190
             +   W+ DGY+ SD  ++  +Y   HYT     A A A++ G  D+D G         
Sbjct: 130 KVLRDIWKFDGYVTSDTGAIEDIYAKHHYTANASAAVAAALRDGRCDMDSGAVYHDALLD 189

Query: 191 AVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAH 250
           AV  G    +DV+ AL  T+ ++  LG+FD     QP+  +    + T   Q L ++   
Sbjct: 190 AVNSGECSMDDVDRALYNTLKLRFELGLFD-PIEDQPYWRINASSINTTYAQDLNMKITL 248

Query: 251 QGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG------YTTPL--- 301
           + ++LL+N    LP    R   VAVIGP+ +    ++GNY G  C        T+PL   
Sbjct: 249 ESMILLQNHNNALPFKKGR--KVAVIGPHINAQEALVGNYLGQLCPDDSFDCITSPLAAI 306

Query: 302 QGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLL 361
           + I+  + T+   G   +AC    +  A  V A+ AD  VL++G++ +IEAE  DR  + 
Sbjct: 307 EAINGMSNTVSAMGSGVLACTDASIQEAVNV-AKDADYVVLLIGINDTIEAESNDRTSID 365

Query: 362 LPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIAD 421
           LP  Q +L + +A  ++     VL+ GG + +   K   ++ AI+  GYPG  GGAAIA 
Sbjct: 366 LPQCQHKLTAAIAHLNKT-TAAVLINGGMLAIEQEKK--QLPAIIEAGYPGFYGGAAIAK 422

Query: 422 VLFGRAN 428
            +FG  N
Sbjct: 423 TIFGDNN 429


>gi|443692971|gb|ELT94448.1| hypothetical protein CAPTEDRAFT_221920 [Capitella teleta]
          Length = 757

 Score =  253 bits (645), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 154/419 (36%), Positives = 235/419 (56%), Gaps = 11/419 (2%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL+ +SP +NI R P WGR QET GEDP+L+G  A S+VRGLQG+    L+  A CKH+ 
Sbjct: 131 GLSCFSPVINIMRHPLWGRNQETYGEDPLLSGTLAQSFVRGLQGDDPRYLRANAGCKHFD 190

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +       V R+ F+A+V+ +D   T+   FK CV  G   S+MCSYN++NG P CA+ 
Sbjct: 191 VHGGPEDIPVSRFSFDAKVNMRDWRMTFLPQFKMCVDAGSY-SLMCSYNRINGIPACANK 249

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L +    +W   GYIVSD  ++  +    HYT +       AIKAG +L+ G    ++
Sbjct: 250 QLLTDITRDEWGFHGYIVSDSGAISNIKEQHHYTNSTVATVVAAIKAGTNLELGGGSNMY 309

Query: 188 TEG---AVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
                 A++ GLL E+++   +   +  ++RLG FD E     +  +G   + +P H++ 
Sbjct: 310 YPKQLDAMKQGLLTEKEIRDNVRPLLYTRLRLGEFDPEAMVD-YNKIGVDVIQSPEHREQ 368

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAG-VACGYT-TPLQ 302
           A++AA+ G VLLKN    LP+   ++  +A++GP ++ T  + G Y+  V   +T T  +
Sbjct: 369 AVKAAYMGFVLLKNHNNLLPIKK-QYSKLAIVGPFTNATSELFGTYSSEVNLKFTSTIFE 427

Query: 303 GISRYAKTIHQA-GCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLL 361
           G+S    +   A GC   AC+G  +    E A   AD  ++ +G  Q  E+E  DRA L 
Sbjct: 428 GLSPLGGSTRSANGCTNSACSG-YVRDDVETAVAGADLVIVALGSGQRFESEGNDRAYLD 486

Query: 362 LPGRQQELVSRVAKASRG-PVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           L G Q +++      S G PV+LVL+  GP+D+++AK DP + AIL  GYP Q+ G A+
Sbjct: 487 LHGHQLDILKDAVFFSNGAPVILVLINAGPLDITWAKLDPGVTAILSCGYPAQSTGEAL 545


>gi|443695317|gb|ELT96258.1| hypothetical protein CAPTEDRAFT_179825 [Capitella teleta]
          Length = 750

 Score =  252 bits (644), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 157/428 (36%), Positives = 241/428 (56%), Gaps = 14/428 (3%)

Query: 5   GMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           G  G+T +SP +NI R P WGR QET GEDP L+G+ A+ YV GLQG+    L+V+A CK
Sbjct: 143 GAHGITCFSPVINILRHPLWGRNQETYGEDPYLSGELASQYVSGLQGDDPRYLRVSAGCK 202

Query: 65  HYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           H+ A+   +   V ++ F+A++ ++DL+ T+   FK C+   K  +VMCS+N +NG P+C
Sbjct: 203 HFDAHGGPDTIPVRKFGFDAKIEERDLQMTFLPAFKKCIA-AKPYNVMCSFNSINGVPSC 261

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD-CGPF 183
           A+  +L + +  QW  +G++VSD  +V  ++   HY  + E AA +AIK+G +++  G F
Sbjct: 262 ANKRLLTDVLRAQWGYEGFVVSDDAAVEYIFTEHHYNSSFETAAVEAIKSGCNMELVGKF 321

Query: 184 LAIHTE--GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAH 241
              + +   A+   L+ ++++   +      +  LG FD  P+  PF  +    V +  H
Sbjct: 322 DPSYWQLTKALNEHLITKDELMENVRPVFLTRFLLGEFD-PPALNPFNQITKDVVLSAEH 380

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA--CGYTT 299
           Q+LAL+AA +  VLLKN    LPL      TVAV+GP S+ T  +IG+Y+         T
Sbjct: 381 QRLALEAAVKSFVLLKNDRNFLPLLKNSLKTVAVVGPMSNYTDGLIGDYSTDTDPSLILT 440

Query: 300 PLQGISRYAKTIHQA-GCFGVACNGNQLIGAAEVAARQADATVLVMGLDQS--IEAEFID 356
           PL GI + A  +  A GC    C   +   A +VAA    A V+ + L     +EAE  D
Sbjct: 441 PLHGIKKLAPNVQFASGCSNSTCTDYR---ATDVAAAVDGAQVVFVALGTGFIVEAENND 497

Query: 357 RAGLLLPGRQQELVSRVAKASRG-PVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           R+ ++LPG Q +L+      + G PVVL+L  GGP+DV+FA+    I +I+   +P    
Sbjct: 498 RSDIVLPGAQLQLLKDAVYHANGRPVVLLLFNGGPLDVTFAQLTSGIVSIVECFFPAMMT 557

Query: 416 GAAIADVL 423
           G AI  +L
Sbjct: 558 GEAIYRML 565


>gi|361127339|gb|EHK99311.1| putative exo-1,4-beta-xylosidase bxlB [Glarea lozoyensis 74030]
          Length = 569

 Score =  252 bits (643), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 151/432 (34%), Positives = 217/432 (50%), Gaps = 45/432 (10%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           NG  +G+ +W+PN+N F+DPRWGRG ETPGED +    Y A+ +RGL+GN   R ++ A 
Sbjct: 167 NGNHSGIDFWTPNINPFKDPRWGRGSETPGEDTLRLKGYVAALLRGLEGNKAQR-RIIAT 225

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKHY A DL++WNGV R+ F+A++S QDL + Y  PF+ C  + KV S MCSYN VNG P
Sbjct: 226 CKHYAANDLESWNGVTRHDFDAKISMQDLAEYYLQPFQQCARDSKVGSFMCSYNSVNGVP 285

Query: 123 TCADPDILKNTIHGQWRL---DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
            CA+  +L+  +   W     + Y+ SDC++V  +    HY  T     A A  AG D  
Sbjct: 286 ACANKYLLQTILRDHWNWTSENQYVTSDCEAVQDISLNHHYASTNAAGTALAFNAGTDSS 345

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           C                                    G FDG  S   + +LG  DV TP
Sbjct: 346 C----------------------------------EAGYFDG--SKALYSSLGWSDVNTP 369

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
             QQLALQA   GIV+LKN   TLPL       VA+IG  +  +  + G Y+G A    T
Sbjct: 370 QAQQLALQATVDGIVMLKNDG-TLPLKLDSKSKVAMIGFWASDSSKLQGGYSGKAPYLRT 428

Query: 300 PLQGISR--YAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
           P+    +  +   +        A   +     A  AA ++D  +   GLD S  AE +DR
Sbjct: 429 PVYAAQQLGFTPNVATGPVQQSASATDNWTTNALAAASKSDYILYFGGLDTSAAAEGVDR 488

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             L  P  Q  L+ ++  ++ G  ++++  G  +D +    +  + +ILW  +PGQ GG 
Sbjct: 489 TSLEWPSAQLALIKKL--SALGKPLIIIQEGDQMDNTPLLTNKGVSSILWASWPGQDGGP 546

Query: 418 AIADVLFGRANP 429
           A+  ++ G  +P
Sbjct: 547 AVMQIISGAKSP 558


>gi|348688508|gb|EGZ28322.1| family 3 glycoside hydrolase [Phytophthora sojae]
          Length = 701

 Score =  252 bits (643), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 162/431 (37%), Positives = 228/431 (52%), Gaps = 31/431 (7%)

Query: 20  RDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR---LKVAACCKHYTAYDLDNWNG 76
           RDPRWGR  ETP EDP++  KY  +Y +GLQ         L+     KHY AY  +N+ G
Sbjct: 113 RDPRWGRNTETPSEDPLVNSKYGVAYTKGLQEGKHEDPRYLQAVVTLKHYVAYSYENYGG 172

Query: 77  VDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILKNTIHG 136
            +R  FNA VS  D  DTY   F++ +V+G    VMCSYN VNG P CA+ ++    + G
Sbjct: 173 GNRKTFNAIVSPYDFADTYFPAFRSSIVDGNAKGVMCSYNSVNGVPACANNELENKLLRG 232

Query: 137 QWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP---FLAIHTEGAVR 193
               DGYI SD  ++  + +  HY  T  EAA  AI AG D++ G    ++A   E  V 
Sbjct: 233 MLGFDGYITSDSGAIEAISDWLHYVPTRCEAARLAILAGTDVNSGRGFGYMACLKE-LVE 291

Query: 194 GGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGI 253
              L  + V+  L +T+ ++  LG+FD     QP+  + P DV T A ++L+L  A + I
Sbjct: 292 SNQLDVKVVDDVLRHTLKLRFELGLFD-PIEDQPYWKVTPNDVNTDAAKKLSLDLARKSI 350

Query: 254 VLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG--------YTTPLQGIS 305
           VLL+N+   LPL   R   +AV+GP++     ++GNY G  C           TP + +S
Sbjct: 351 VLLQNNQPVLPLR--RGVKLAVVGPHAQAKRALLGNYLGQMCHGDYNEVGCIKTPFEAVS 408

Query: 306 RYAKTIHQAGCFGVACN--GNQLIGAAEV--AARQADATVLVMGLDQSIEAEFIDRAGLL 361
             A     +  + + CN  GN   G  E   A + A+A VL +G+D+S+EAE  DR  + 
Sbjct: 409 --ASNGDSSTTYALGCNVTGNSTAGFVEAVKAVQGAEAVVLFLGIDKSVEAEVRDRNNID 466

Query: 362 LPGRQQELVSRVAKASRGPVVLVLMCGGPVDVS--FAKNDPRIGAILWVGYPGQAGGAAI 419
           LP  Q +L+ RV +A   P V+VLM GG +       + D  + A     YPG  G  A+
Sbjct: 467 LPAIQVQLLQRV-RAVGKPTVVVLMNGGVLTAEDIIGQTDALVEAF----YPGFFGAQAM 521

Query: 420 ADVLFGRANPG 430
            D+LFG ANPG
Sbjct: 522 TDILFGDANPG 532


>gi|317474362|ref|ZP_07933636.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316909043|gb|EFV30723.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 723

 Score =  251 bits (642), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 164/449 (36%), Positives = 232/449 (51%), Gaps = 39/449 (8%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKH 65
            GLT+W+PN+NIFRDPRWGRGQET GEDP LT +   S V GLQG   ++  K  AC KH
Sbjct: 129 TGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLSVVNGLQGPQNTKYNKTHACAKH 188

Query: 66  YTAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           Y  +    WN   R+ FNA  ++ +DL +TY   F+  V++G V  VMC+YN+  G P C
Sbjct: 189 YAVHSGPEWN---RHSFNAENINPRDLWETYLPAFQDLVIQGNVKEVMCAYNRFEGDPCC 245

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLY-NTQHYT-RTPEEAAADAIKAGLDLDCG- 181
               +L N +  +W   G +VSDC ++   Y   +H T +   +A+A A+ +G DL+CG 
Sbjct: 246 GSDRLLINILRNEWNYKGLVVSDCGAIDNFYFKGRHETHKNKADASAAAVLSGTDLECGR 305

Query: 182 PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAH 241
            +  + +  AV+ GL+ E  ++ +L   +  +  LG  D      P+  L    +   AH
Sbjct: 306 SYTGLIS--AVKEGLINESAIDQSLCRLMKARFELGEMD---DTTPWDQLPDSLLSCHAH 360

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
           QQLALQ A + + LL+N    LPL   +  TVA+IGPN++ +V    NY G      T L
Sbjct: 361 QQLALQMARESMTLLQNHKNILPLD--KEMTVALIGPNANDSVMQWANYNGFPVHTITLL 418

Query: 302 QGISRY----------AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIE 351
           +G+++Y           K I       V    N +      AA+ AD  +   G+  S+E
Sbjct: 419 EGLTQYLPQERLIYIPQKNIEVQKYPWVNYYPNDIQAVINQAAK-ADVIIYAGGISASLE 477

Query: 352 AEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPR 401
            E +          DR  + LP  Q++LV +  KA+  P+V V   G    +        
Sbjct: 478 GEEMDVDAEGFRGGDRTTIELPNVQRKLV-KALKATGKPIVFVNFSG--CAMGLQPESQI 534

Query: 402 IGAILWVGYPGQAGGAAIADVLFGRANPG 430
             AIL   YPGQAGG AIA+VLFG  NP 
Sbjct: 535 CDAILQAWYPGQAGGTAIAEVLFGDYNPA 563


>gi|410648100|ref|ZP_11358515.1| beta-glucosidase [Glaciecola agarilytica NO2]
 gi|410132388|dbj|GAC06914.1| beta-glucosidase [Glaciecola agarilytica NO2]
          Length = 733

 Score =  251 bits (642), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 157/441 (35%), Positives = 235/441 (53%), Gaps = 31/441 (7%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           +GLT+W+PN+NIFRDPRWGRGQET GEDP LT +   + V GLQG+    LK AA  KH+
Sbjct: 130 SGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGDHPKYLKTAAAAKHF 189

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+A  S +D+ +TY   F+A V E  V +VM +YN+VNG P  
Sbjct: 190 AVH-----SGPEALRHEFDAIASPKDMYETYFPAFEALVTEANVETVMAAYNRVNGHPAG 244

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFL 184
               +L   +  +W   G++VSDC  +   +     T    E+AA AI  G DL+CG   
Sbjct: 245 GSDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESAALAINTGTDLNCG--- 301

Query: 185 AIHTE--GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
           A++     AV  GL+ E+ ++  L+  +  + +LG FD +    P+ N+    V + AH 
Sbjct: 302 AVYNALPDAVEAGLVDEKTIDKRLSKVLATKFKLGFFDPKDD-NPYNNISADVVNSEAHA 360

Query: 243 QLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
           Q+A + A + IVLL+N    LPL   +R+  + V GP +  +  ++GNY G++   T  L
Sbjct: 361 QVAYEMAVKSIVLLQNKNNILPLDRNIRN--LYVTGPFASSSEVLLGNYYGLSGKTTNIL 418

Query: 302 QGIS---RYAKTI-HQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE---- 353
            GI+       TI ++ G      N N +      A +  D  + VMGL  + E E    
Sbjct: 419 DGITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEGEEGEA 478

Query: 354 -----FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWV 408
                  DR  L LP  Q   + ++ K +  PV++VL  G PV+++  +      AI++ 
Sbjct: 479 IASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTAGTPVNLT--EIAELADAIVFA 536

Query: 409 GYPGQAGGAAIADVLFGRANP 429
            YPGQ GG A+AD+LFG  +P
Sbjct: 537 WYPGQEGGKAVADILFGERSP 557


>gi|167521708|ref|XP_001745192.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776150|gb|EDQ89770.1| predicted protein [Monosiga brevicollis MX1]
          Length = 614

 Score =  251 bits (641), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 160/429 (37%), Positives = 228/429 (53%), Gaps = 18/429 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL  WSPN+NI R P WGR QE PGEDP + G++  +Y  GLQG+  + L+     KH+ 
Sbjct: 155 GLDTWSPNINIARSPLWGRNQEVPGEDPFMNGQFGKAYTLGLQGDDDTYLQAIVTLKHWD 214

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           AY L++ +G  R++FNA VS   L DTY   F+  V EGK   VMCSYN VNG PTCA P
Sbjct: 215 AYSLEDSDGATRHNFNAIVSNFSLMDTYWPAFRVAVTEGKAKGVMCSYNAVNGIPTCAHP 274

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAG-LDLDCGPFLAI 186
            +L+  +   W+ DGY+ SD  +V  + +   YT +   AA  AI+ G  D+D G     
Sbjct: 275 -LLRTVLRDLWKFDGYVSSDTGAVEDISDNHKYTPSWATAACAAIRDGQTDIDSGAVYMK 333

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLAL 246
                V  G  R EDV+ AL  T+ ++  LG+FD   + Q + ++    V T A +   +
Sbjct: 334 SLLQGVSEGHCRMEDVDNALRNTLRLRFELGLFDPVEN-QSYWHVPLAAVNTNASRATNM 392

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG------YTTP 300
               + +VLL+N    LPL++  +  VA+IGP++     M+GNY G  C         +P
Sbjct: 393 LHTLESMVLLQNKNNVLPLAS--NTKVALIGPHAKAQEDMVGNYLGQLCPDNNFDCVVSP 450

Query: 301 LQGISRYAKT---IHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDR 357
              +     T    +  G     C+ + +  A  VA   AD  VL++G+D+SIEAE  DR
Sbjct: 451 HDALVSILGTDAVTYAPGTNVTTCSQSHIDEAVSVAT-AADVAVLMLGIDESIEAESNDR 509

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             + LP  Q +L S +    + P V+VL+ GG + +   K   +  AI+  GYPG  GG 
Sbjct: 510 KSIDLPECQHQLASAIFAVGK-PTVIVLLNGGMLAIENEKQ--QADAIIEAGYPGFYGGT 566

Query: 418 AIADVLFGR 426
           AIA  L G+
Sbjct: 567 AIAQTLTGQ 575


>gi|429738050|ref|ZP_19271875.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
           F0055]
 gi|429161155|gb|EKY03583.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
           F0055]
          Length = 722

 Score =  251 bits (641), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 157/437 (35%), Positives = 242/437 (55%), Gaps = 27/437 (6%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT+W+PNVNIFRDPRWGRG ET GEDP LTG    ++V+G+QG+    LK AAC KH+
Sbjct: 128 AGLTFWAPNVNIFRDPRWGRGMETYGEDPFLTGTLGTAFVKGMQGDDPFYLKAAACGKHF 187

Query: 67  TAYDLDNWNGVDRYHFNARV--SKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +R    A V  +K+DL +TY   FK  V +GKV S+M +Y ++ G+   
Sbjct: 188 AVH-----SGPERTRHTANVEPTKRDLYETYLPAFKMLVQKGKVESIMGAYQRLYGESCS 242

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFL 184
               +L + +   W   G++VSDC +V  +Y      ++  EA A AIKAGL+L+CG  +
Sbjct: 243 GSKYLLTDILRKDWGFKGHVVSDCGAVTDMYEGHKLVKSEAEAVAFAIKAGLNLECGNSM 302

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
               + A++  L+ E+D++ AL   +  +++LG+   + +A P+       + + A++++
Sbjct: 303 RTMKD-AIQQKLITEKDLDKALLPLMMTRLKLGILQPD-AACPYNEFPESVIGSEANRKI 360

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           A QAA + +VLLKN+   LP++     T+ V GP +     ++GNY G++  Y+T L+GI
Sbjct: 361 AEQAAEESMVLLKNNG-VLPIAK-DIRTLFVTGPGATDAYYLMGNYFGLSNRYSTYLEGI 418

Query: 305 ----SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIE--------- 351
               S      ++ G   V  N N  +  +   +R A+ ++L+MG   + E         
Sbjct: 419 VGKVSNGTSVNYKQGFMQVFKNLND-VNWSVSESRGAEVSILIMGNSGNTEGEEGDAIAS 477

Query: 352 AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYP 411
           AE  DR  L LP  Q E +  V+K     +V+VL  G P+DV           + W  YP
Sbjct: 478 AERGDRVNLRLPDSQMEYLREVSKDRTNKLVVVLTGGSPIDVKEITELADAVVMAW--YP 535

Query: 412 GQAGGAAIADVLFGRAN 428
           GQ GG A+A++LFG AN
Sbjct: 536 GQEGGVALANLLFGDAN 552


>gi|410639677|ref|ZP_11350222.1| beta-glucosidase [Glaciecola chathamensis S18K6]
 gi|410140558|dbj|GAC08409.1| beta-glucosidase [Glaciecola chathamensis S18K6]
          Length = 733

 Score =  251 bits (641), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 157/441 (35%), Positives = 234/441 (53%), Gaps = 31/441 (7%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           +GLT+W+PN+NIFRDPRWGRGQET GEDP LT +   + V GLQG+    LK AA  KH+
Sbjct: 130 SGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGDHPKYLKTAAAAKHF 189

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+A  S +D+ +TY   F+A V E  V +VM +YN+VNG P  
Sbjct: 190 AVH-----SGPEALRHEFDAIASPKDMYETYFPAFEALVTEANVETVMAAYNRVNGHPAG 244

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFL 184
               +L   +  +W   G++VSDC  +   +     T    E+AA AI  G DL+CG   
Sbjct: 245 GSDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESAALAINTGTDLNCG--- 301

Query: 185 AIHTE--GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
           A++     AV  GL+ E+ ++  L+  +  + +LG FD +    P+ N+    V + AH 
Sbjct: 302 AVYNALPDAVEAGLVDEKTIDKRLSKVLATKFKLGFFDPKDD-NPYNNISADVVNSEAHA 360

Query: 243 QLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
           Q+A + A + IVLL+N    LPL   +R+  + V GP +  +  ++GNY G++   T  L
Sbjct: 361 QVAYEMAVKSIVLLQNKNNILPLDRNIRN--LYVTGPFASSSEVLLGNYYGLSGKTTNIL 418

Query: 302 QGIS---RYAKTI-HQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE---- 353
            GI+       TI ++ G      N N +      A +  D  + VMGL  + E E    
Sbjct: 419 DGITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEGEEGEA 478

Query: 354 -----FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWV 408
                  DR  L LP  Q   + ++ K +  PV++VL  G PV+++         AI++ 
Sbjct: 479 IASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTAGTPVNLTEIAE--LADAIVFA 536

Query: 409 GYPGQAGGAAIADVLFGRANP 429
            YPGQ GG A+AD+LFG  +P
Sbjct: 537 WYPGQEGGKAVADILFGERSP 557


>gi|332307852|ref|YP_004435703.1| glycoside hydrolase family protein [Glaciecola sp. 4H-3-7+YE-5]
 gi|332175181|gb|AEE24435.1| glycoside hydrolase family 3 domain protein [Glaciecola sp.
           4H-3-7+YE-5]
          Length = 733

 Score =  251 bits (640), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 156/441 (35%), Positives = 234/441 (53%), Gaps = 31/441 (7%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           +GLT+W+PN+NIFRDPRWGRGQET GEDP LT +   + V GLQG+    LK AA  KH+
Sbjct: 130 SGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGDHPKYLKTAAAAKHF 189

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+A  S +D+ +TY   F+A + E  V +VM +YN+VNG P  
Sbjct: 190 AVH-----SGPEALRHEFDAIASPKDMYETYFPAFEALITEANVETVMAAYNRVNGHPAG 244

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFL 184
               +L   +  +W   G++VSDC  +   +     T    E+AA AI  G DL+CG   
Sbjct: 245 GSDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESAALAINTGTDLNCG--- 301

Query: 185 AIHTE--GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
           A++     AV  GL+ E+ ++  L+  +  + +LG FD +    P+ N+    V + AH 
Sbjct: 302 AVYNALPDAVEAGLVDEKTIDKRLSKVLATKFKLGFFDPKDD-NPYNNISADVVNSEAHA 360

Query: 243 QLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
           Q+A + A + IVLL+N    LPL   +R+  + V GP +  +  ++GNY G++   T  L
Sbjct: 361 QVAYEMAVKSIVLLQNKNNILPLDRNIRN--LYVTGPFASSSEVLLGNYYGLSGKTTNIL 418

Query: 302 QGIS---RYAKTI-HQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE---- 353
            GI+       TI ++ G      N N +      A +  D  + VMGL  + E E    
Sbjct: 419 DGITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEGEEGEA 478

Query: 354 -----FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWV 408
                  DR  L LP  Q   + ++ K +  PV++VL  G PV+++         AI++ 
Sbjct: 479 IASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTAGTPVNLTEIAE--LADAIVFA 536

Query: 409 GYPGQAGGAAIADVLFGRANP 429
            YPGQ GG A+AD+LFG  +P
Sbjct: 537 WYPGQEGGKAVADILFGERSP 557


>gi|359473580|ref|XP_003631325.1| PREDICTED: protein BRASSINOSTEROID INSENSITIVE 1-like [Vitis
           vinifera]
          Length = 785

 Score =  250 bits (638), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 125/290 (43%), Positives = 185/290 (63%), Gaps = 6/290 (2%)

Query: 143 YIVSDCDSVGVLYNTQHYTRTPE-EAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREED 201
           YIVSDC  + V+ + Q+Y    + +A A  ++AGLDL+CG +       +V  G + + +
Sbjct: 10  YIVSDCYGLEVIVDNQNYLNESKVDAVAKTLQAGLDLECGHYYTDALNESVLTGKVSQYE 69

Query: 202 VNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSAR 261
           ++ AL     + MR+G FDG P+   + +LG +D+C   H +LA +AA QGIVLLKN   
Sbjct: 70  LDRALKNIYVLLMRVGYFDGIPA---YESLGLKDICAADHIELAREAARQGIVLLKNDYE 126

Query: 262 TLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVAC 321
            LPL   +   + ++GP+++ T  MIGNYAG+   Y +PL+  S      +  GC   +C
Sbjct: 127 VLPLKPGKK--LVLVGPHANATEVMIGNYAGLPYKYVSPLEAFSAIGNVTYATGCLDASC 184

Query: 322 NGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPV 381
           + +     A+ AA+ A+ T++ +G D SIEAEF+DR   LLPG Q EL+ +VA+ S GPV
Sbjct: 185 SNDTYFSEAKEAAKFAEVTIIFVGTDLSIEAEFVDRVDFLLPGNQTELIKQVAEVSSGPV 244

Query: 382 VLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPGT 431
           +LV++ G  +D++FAKN+PRI AILWVG+PG+ GG AIADV+FG+ NP T
Sbjct: 245 ILVVLSGSNIDITFAKNNPRISAILWVGFPGEQGGHAIADVVFGKYNPDT 294


>gi|363742357|ref|XP_003642627.1| PREDICTED: probable beta-D-xylosidase 5-like [Gallus gallus]
          Length = 748

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 160/427 (37%), Positives = 231/427 (54%), Gaps = 11/427 (2%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL+ +SP +NI R P WGR QET GEDP LT + A S+V+GLQG     +K +A CKH++
Sbjct: 142 GLSCFSPVLNIMRHPLWGRNQETYGEDPYLTAELATSFVQGLQGQHPRYIKASAGCKHFS 201

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +       V R  F+A+V ++D   T+   F+ACV  G   S MCSYN++NG P CA+ 
Sbjct: 202 VHGGPENIPVSRLSFDAKVLERDWHTTFLPQFQACVRAGSY-SFMCSYNRINGVPACANK 260

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA-- 185
            +L + + G+W  +GY+VSD  +V ++     YT T  E A  ++ AGL+L+    +   
Sbjct: 261 KLLTDILRGEWGFEGYVVSDEGAVELILLGHRYTHTFLETAIASVNAGLNLELSYGMRNN 320

Query: 186 --IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
             +H   A+  G +  E +   +      ++RLG FD  P+  P+  L    V +  H+ 
Sbjct: 321 VFMHIPKALAMGNITLEMLRDRVRPLFYTRLRLGEFD-PPAMNPYNALELSVVQSSEHRN 379

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG--YTTPL 301
           L+L+AA +  VLLKN   TLPL  L    +AV+GP +D    + G+YA V       TP 
Sbjct: 380 LSLEAAIKSFVLLKNQRDTLPLRELHGKRLAVVGPFADNPRVLFGDYAPVPEPQYIYTPR 439

Query: 302 QGISRY-AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
           +G+    A     AGC    C         E A R AD  ++ +G    +E E  DR  L
Sbjct: 440 RGLQTLPANVSFAAGCREPRCWVYSR-DEVENAVRGADVVLVCLGTGIDVEMEARDRKDL 498

Query: 361 LLPGRQQELVSRVAKASRG-PVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
            LPG Q +L+    +A+ G PV+L+L   GP+DVS+A+    +GAIL   +P QA G AI
Sbjct: 499 SLPGHQLQLLQDAVRAAAGHPVILLLFNAGPLDVSWAQLHDGVGAILACFFPAQATGLAI 558

Query: 420 ADVLFGR 426
           A VL G+
Sbjct: 559 ASVLLGK 565


>gi|291240559|ref|XP_002740189.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 745

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 154/437 (35%), Positives = 230/437 (52%), Gaps = 14/437 (3%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G   GL+ +SP +NI R P WGR QET GEDP L+G  AAS+V GLQGN    +   A C
Sbjct: 135 GDHKGLSCFSPVINIARHPLWGRIQETYGEDPYLSGTLAASFVTGLQGNHPRYVTANAGC 194

Query: 64  KHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
           KH+ AY         R  F+A+VS +DL  T+   F  C+  G   S+MCSYN +NG P 
Sbjct: 195 KHFDAYAGPENIPSSRSTFDAKVSDRDLRMTFLPAFHECIQAG-TYSLMCSYNSINGVPA 253

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPF 183
           CA+  +L + +  +W   GY++SD  +V  +Y+  HYT+   + A   + +GL+L+    
Sbjct: 254 CANKKLLTDILRTEWNFTGYVISDQSAVEKVYDAHHYTKDMLDTAIACVNSGLNLELSSN 313

Query: 184 LA----IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           L     + T  AV+ G +  + V   ++     +MRLG FD  P   P+  L    + + 
Sbjct: 314 LTDNVMMQTTKAVKQGNVTMKTVKARVSPLFYTRMRLGEFD-PPEMNPYSKLDLSIIQSQ 372

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT- 298
            HQ+L+L+AA +  VLLKN  R LPL   +   +AV+GP  D  + + G+ +      T 
Sbjct: 373 EHQELSLKAAAKSFVLLKNENRFLPLKE-KIDKLAVVGPFGDNPIEIYGSKSPDVSNLTV 431

Query: 299 TPLQGISRYAK--TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFID 356
           TP  G+S+ A+  T   +GC   AC         +   R  D  V+ +G    +E E  D
Sbjct: 432 TPRYGLSKIARLATTFASGCLSPACTEYDPKSTKQAIDR-VDMVVVCLGTGNEVENEAHD 490

Query: 357 RAGLLLPGRQQELVSR-VAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           R+ L LPG+Q  L+   V  A+  PV+L+L   GP+D+++A ++P I  I+   +P Q  
Sbjct: 491 RSELTLPGQQLRLLQDAVTFAADKPVILLLFNAGPLDITWAVSNPAIPVIVECFFPAQTT 550

Query: 416 GAAIADVLFGR--ANPG 430
           G A+  +      +NPG
Sbjct: 551 GTALYHLFVNSPGSNPG 567


>gi|121308314|dbj|BAF43576.1| arabinofuranosidase/xylosidase homolog [Prunus persica]
          Length = 349

 Score =  249 bits (635), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 129/150 (86%), Positives = 136/150 (90%)

Query: 281 DVTVTMIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADAT 340
           DVTVTMIGNYAGVACGYTTPLQGI RY +TIHQAGC  V CNGNQL GAAE AARQADAT
Sbjct: 1   DVTVTMIGNYAGVACGYTTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADAT 60

Query: 341 VLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDP 400
           VLVMGLDQSIEAEF+DRAGLLLPG QQELVSRVA+ASRGP +LVLM GGP+DV+FAKNDP
Sbjct: 61  VLVMGLDQSIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDP 120

Query: 401 RIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           RI AI+WVGYPGQAGG AIADVLFG  NPG
Sbjct: 121 RISAIIWVGYPGQAGGTAIADVLFGTTNPG 150


>gi|336261464|ref|XP_003345521.1| hypothetical protein SMAC_07509 [Sordaria macrospora k-hell]
 gi|380088197|emb|CCC13872.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 762

 Score =  248 bits (634), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 157/422 (37%), Positives = 229/422 (54%), Gaps = 14/422 (3%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR-LKVAA 61
           N G +G  YW+PNVN F+DPRWGRG ETPGED +   +YAAS +RGL+G    R  ++ A
Sbjct: 152 NAGFSGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIRGLEGPVRERERRIVA 211

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKHY A D ++WNG  R+ FNA+V+ QDL + Y  PF+ C  + KV S+MCSYN VNG 
Sbjct: 212 TCKHYAANDFEDWNGSTRHDFNAKVTLQDLAEYYLSPFQQCARDSKVGSIMCSYNAVNGV 271

Query: 122 PTCADPDILKNTIHGQWRLDG---YIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           P CA+  +++  +   W       YI SDC++V  +    HY +T  E  A A +AG+D 
Sbjct: 272 PACANTYLMQTILRDHWNWTAPGNYITSDCEAVLDISANHHYAKTNAEGTALAFEAGIDS 331

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
            C    +    GA   GLL++  V+ AL       +++G FDG  S   + +LG   V  
Sbjct: 332 SCEYEGSSDILGAWTQGLLKQSTVDRALRRLYEGLVQVGYFDGNRSE--YASLGWNHVNR 389

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRH---HTVAVIGPNSDVTVTMIGNYAGVAC 295
           P  Q++ALQAA +GIVLLKN  +TLPL   ++     +A+IG  ++   T+ G Y+G   
Sbjct: 390 PKSQEVALQAAVEGIVLLKND-KTLPLGVKKNGPKLKLAMIGFWANDPKTLSGGYSGTPA 448

Query: 296 GYTTPLQGISRYAKTIHQAG--CFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
              +P+         +  AG      + + +    AA  AA+ A+  +   G D S   E
Sbjct: 449 FEHSPVYATQAMGFKVTTAGGPVLQNSTSKDTWTQAALAAAKDANYILYFGGQDTSAAGE 508

Query: 354 FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 413
             DR  +  P  Q +L++ ++K  + P+V+V M G  +D +       I +ILW  +P  
Sbjct: 509 TKDRTTINWPEAQLQLITDLSKLGK-PLVVVQM-GDQLDNTPLLASKAINSILWANWPVP 566

Query: 414 AG 415
           AG
Sbjct: 567 AG 568


>gi|194700280|gb|ACF84224.1| unknown [Zea mays]
          Length = 452

 Score =  248 bits (634), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 133/258 (51%), Positives = 171/258 (66%), Gaps = 4/258 (1%)

Query: 176 LDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD 235
           +D++CG ++  H   A++ G + E+D+N AL     V+MRLG+F+G+P    +G++GP  
Sbjct: 1   MDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGDIGPDQ 60

Query: 236 VCTPAHQQLALQAAHQGIVLLKNS--ARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
           VCT  HQ LAL+AA  GIVLLKN   A  LPLS     ++AVIG N++  + + GNY G 
Sbjct: 61  VCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRGNYFGP 120

Query: 294 ACGYTTPLQGISRYAK-TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
            C   TPLQ +  Y K T   AGC   ACN   +  A + AA  AD+ VL MGLDQ  E 
Sbjct: 121 PCVTVTPLQVLQGYVKDTSFVAGCNSAACNVTTIPEAVQ-AASSADSVVLFMGLDQDQER 179

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E +DR  L LPG+QQ L+  VA A++ PV+LVL+CGGPVDVSFAK +P+IGAILW GYPG
Sbjct: 180 EEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPG 239

Query: 413 QAGGAAIADVLFGRANPG 430
           +AGG AIA VLFG  NPG
Sbjct: 240 EAGGIAIAQVLFGEHNPG 257


>gi|397690575|ref|YP_006527829.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
 gi|395812067|gb|AFN74816.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
          Length = 860

 Score =  248 bits (634), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 129/317 (40%), Positives = 192/317 (60%), Gaps = 11/317 (3%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G   G++ W+PN+NIFRDPRWGRG ET GEDP LTG+ A S+++GLQG     LK  A  
Sbjct: 124 GIYQGISLWAPNINIFRDPRWGRGMETYGEDPYLTGELAVSFIKGLQGQDKKYLKTIATP 183

Query: 64  KHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
           KH   +        +R+HFNA VS  DL +TY   FK  +++GK  SVMC+YN++ GK  
Sbjct: 184 KHLAVHSGPE---PERHHFNALVSNYDLNETYLPHFKKSIMKGKAYSVMCAYNRLRGKAC 240

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-P 182
           C    +L + +  +W  +G +VSDC +V  ++N+     +PE+AAA A+ +G DL+CG  
Sbjct: 241 CGHDTLLTDILRNKWGFEGIVVSDCWAVYDIFNSHKIVDSPEKAAALAVSSGTDLECGNT 300

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
           FL++  + A R GL+ E++++ AL   +  + +LGMFD  P    +  +    +    ++
Sbjct: 301 FLSL--KNAYRDGLITEKEIDSALRRVLLARFKLGMFD-PPEIVSYSQIDESYLDNSYNR 357

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
           ++AL+AA + IVLLKN  + LPL +   + +AVIGPN+D   +++GNY G    Y TPLQ
Sbjct: 358 EIALEAARKSIVLLKNDNKLLPLDS-SINKIAVIGPNADNLESLLGNYHGFPSEYITPLQ 416

Query: 303 GISRYAK---TIHQAGC 316
            I R  K     ++ GC
Sbjct: 417 AIRRVLKNGEVFYEKGC 433



 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/107 (38%), Positives = 60/107 (56%), Gaps = 13/107 (12%)

Query: 333 AARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVV 382
            A ++DA ++ MGL   +E E +          DR  L LP  Q +L+ ++    + PV+
Sbjct: 594 TALKSDAVIMFMGLCPRMEGEALKIKLDGFKGGDRLKLSLPANQLKLIKKIHSTGK-PVI 652

Query: 383 LVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           LVL+ GGP+   +   +  I AIL   YPGQAGG AI DV++G+ NP
Sbjct: 653 LVLLNGGPISTVWESEN--IPAILEAWYPGQAGGRAITDVIWGKYNP 697


>gi|371776901|ref|ZP_09483223.1| beta-glucosidase [Anaerophaga sp. HS1]
          Length = 720

 Score =  248 bits (633), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 157/440 (35%), Positives = 243/440 (55%), Gaps = 32/440 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR-LKVAACCKHY 66
           G+T+W+PN+NIFRDPRWGRGQET GEDP LT     ++V+GLQG+   R LK AA  KH+
Sbjct: 134 GITFWTPNINIFRDPRWGRGQETYGEDPFLTASMGKAFVKGLQGSEPERRLKTAAGAKHF 193

Query: 67  TAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
             +        DR+HFNA V ++DL +TY   FKA +VE  V ++MC+YN+VNG+P C  
Sbjct: 194 AVHSGPE---ADRHHFNAVVDEKDLRETYLPAFKA-LVENGVTTIMCAYNRVNGEPCCTG 249

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAI 186
             +L++ +  +W   G +V+DC ++  ++       T  E AA A+KAG++LDC   L  
Sbjct: 250 KTLLQDILRDEWGFKGQVVTDCWALDDIWLRHKTIPTRVEVAAAAVKAGVNLDCANILQE 309

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLAL 246
             + A+   LL  E V+ AL  T+  Q++LG +D +PS  P+ + G   V    H  LA 
Sbjct: 310 DVQDAIEKRLLTLEQVDSALLPTLQTQLKLGFYD-DPSHSPYRHYGIDSVNNSYHISLAK 368

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISR 306
           +AA + +VLLKN    LPL      ++ V+G N+     + GNY G++    T ++G+ +
Sbjct: 369 EAAEKSMVLLKNDG-ILPLKKDTISSIMVVGENAASISALTGNYHGLSGNMVTFVEGLVK 427

Query: 307 YAKTIHQAGCFGVACNGNQLIGAAEV-------AARQADATVLVMGLDQSIEAE----FI 355
                  AG  G++   +     A+        AA   D T+ V+GL   +E E    F+
Sbjct: 428 -------AGGPGMSVQYDYGCSFADTSHFGGIWAAGFTDVTIAVIGLSPLLEGEHGDAFL 480

Query: 356 -----DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGY 410
                D+  L +P   +  + ++ ++   PV+ V+  G  +D+S    +P   AI++  Y
Sbjct: 481 SNWGGDKKDLRMPRSHEIYLKKLRESHNHPVIAVVTGGSALDISAI--EPYADAIIYAWY 538

Query: 411 PGQAGGAAIADVLFGRANPG 430
           PG+ GG A+AD++FG  +P 
Sbjct: 539 PGEQGGTALADLIFGEVSPS 558


>gi|393782428|ref|ZP_10370612.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
           CL02T12C01]
 gi|392673256|gb|EIY66719.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
           CL02T12C01]
          Length = 596

 Score =  247 bits (631), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 160/434 (36%), Positives = 236/434 (54%), Gaps = 30/434 (6%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           +TYWSPNVNIFRDPRWGRGQET GEDP LT +   +YVRGLQGN    LK AAC KHY  
Sbjct: 1   MTYWSPNVNIFRDPRWGRGQETYGEDPYLTAEIGKAYVRGLQGNDPFFLKAAACAKHYAV 60

Query: 69  YDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
           +     +G +  R+ FNA  SK+DL +TY   F+A V E KV +VM +YN+V G+     
Sbjct: 61  H-----SGPEALRHEFNASPSKRDLFETYLPAFEALVKEAKVEAVMGAYNRVYGESASGS 115

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC-GPFLA 185
             +L + +  +W   G++VSDC +V  +Y      +   EA+A A+K+GL+L+C G F A
Sbjct: 116 FFLLTDILRKKWGFKGHVVSDCGAVDDIYGGHKIAKDVAEASAIALKSGLNLNCGGSFHA 175

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           +    A+   L+ E D++ AL   +  +++LG    +  + P+ N+    + +  H  +A
Sbjct: 176 LKE--ALERKLITEVDLDNALMPLMMTRLKLGNLTDDDES-PYKNISDSVIASYTHAMVA 232

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
            + A + +VLLKN+  TLPL      T+ V GP +  T  M+GNY GV+    T LQGI+
Sbjct: 233 REVAQKSMVLLKNNNHTLPLKK-DVKTIFVTGPYAADTYVMMGNYYGVSPRSNTFLQGIA 291

Query: 306 RY----AKTIHQAGCFGVACNGNQL-IGAAEVAARQADATVLVMGL---DQSIEAEFI-- 355
                     ++ G      N N       EV  R A+  ++V+GL   D+  E + I  
Sbjct: 292 AKVSGGTSINYKIGILPTTPNMNPADWTVGEV--RAAEVAIVVIGLSGIDEGEEGDAIAS 349

Query: 356 ----DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYP 411
               D+  L LP  Q + +  +++     +V V+  G P+D+   +      A++   YP
Sbjct: 350 SHRGDKQNLKLPEHQLKFLRDISRNRWNKLVTVITGGSPIDLE--EVSELSDAVIMAWYP 407

Query: 412 GQAGGAAIADVLFG 425
           GQ GG A+ D+LFG
Sbjct: 408 GQEGGMALGDLLFG 421


>gi|390956994|ref|YP_006420751.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
           18391]
 gi|390411912|gb|AFL87416.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
           18391]
          Length = 742

 Score =  247 bits (630), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 166/436 (38%), Positives = 236/436 (54%), Gaps = 23/436 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL Y SP +NI RDPRWGR QE   EDP+LTG+   +YVRGLQG+    LK+AA  KH+ 
Sbjct: 141 GLVYRSPVINISRDPRWGRIQEVFSEDPLLTGRMGVAYVRGLQGDDLQHLKLAATVKHFA 200

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
             ++++     R H NA V +++L + +   ++A ++E    SVM SYN +NG P   + 
Sbjct: 201 VNNVES----GRQHLNADVDERNLFEFWLPHWRAAIMEAHAQSVMSSYNAINGMPDAVNH 256

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNT---------QHYTRTPEEAAADAIKAGLDL 178
            +L + +  +W  DG++  D  +V +L  T         QH++  P  AAA AI+AG D 
Sbjct: 257 WLLTDVLRKKWGFDGFVTDDLGAVALLSGTRATNTSEPGQHFSEDPVVAAAAAIRAGNDS 316

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           D   F   +   AV+ GLL E+DV+ AL   + V  RLG +D  P A  +  +G   V +
Sbjct: 317 DDVEF-ETNLPLAVQRGLLTEKDVDGALRNVLRVGFRLGAYD-PPQASKYSRIGMDVVRS 374

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
            AH+ L+ + A + + LL N  + LPL   +  +VAVIGP         GNY G     T
Sbjct: 375 QAHRDLSQRVAEESMTLLLNRRQFLPLQRDQVKSVAVIGPAGGEAYE-TGNYYGTPAVKT 433

Query: 299 TPLQGISRY----AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEF 354
           +  +G+        K  ++ G   V    ++ I  A   AR++D  VL +G +  +EAE 
Sbjct: 434 SVTEGLRALLGSGVKVEYEKGAGYVDLADDKEIERAANLARKSDVVVLCLGTNLQVEAEG 493

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
            DR  L LPG QQ L+  V  A+   V LVLM  GP+ V++A +   + AIL   YPG+ 
Sbjct: 494 RDRRDLNLPGAQQRLLEAV-YAANPKVALVLMNAGPLGVTWAHD--HVPAILSAWYPGEL 550

Query: 415 GGAAIADVLFGRANPG 430
           GGAAIA  LFG  NPG
Sbjct: 551 GGAAIARTLFGLNNPG 566


>gi|405955586|gb|EKC22647.1| Putative beta-D-xylosidase 2 [Crassostrea gigas]
          Length = 745

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 150/434 (34%), Positives = 241/434 (55%), Gaps = 13/434 (2%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G   G++ +SP +NI R P WGR QET GEDP L+G+ AA +V+ LQG+  + ++  A C
Sbjct: 137 GDHKGISCFSPVINIMRHPLWGRNQETYGEDPFLSGELAAIFVKCLQGDDPTYIRANAGC 196

Query: 64  KHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
           KH+  +       V R+ F+A+VS++D   T+   FK CV  G   S+MCS+N++NG P 
Sbjct: 197 KHFDVHGGPENIPVSRFSFDAKVSERDWRLTFLPAFKRCVQAGSY-SLMCSFNRINGVPA 255

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPF 183
           C +  +L + +  +W   GY+VSD +++  +    HYT    + AA  +KAG +L+    
Sbjct: 256 CGNKRLLTDILRTEWGFTGYVVSDQEAIENIMTYHHYTNNSVDTAALCVKAGCNLELSTN 315

Query: 184 LA----IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
                  +   A++ G L +ED+  +++     +MRLG FD  P   P+  +    + + 
Sbjct: 316 EVKPTYFYIIDALKAGKLDKEDLVKSVSPLFYTRMRLGEFD-PPDHNPYNFIDLSVIQSE 374

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA-GVACGY- 297
            H+ ++L AA +  VLLKN    LP++ L   T++V+GP +D     IG+YA  V   Y 
Sbjct: 375 EHRAISLNAAMKSFVLLKNKGGFLPITKL-FDTISVLGPMADNKYQQIGSYAPDVMPSYT 433

Query: 298 TTPLQGISRYAKTI-HQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFID 356
           TTPLQG+S+ +K + + AGC   AC+        + A   +D   + +G    IE E  D
Sbjct: 434 TTPLQGLSKLSKRVQYAAGCNDNACSKYNRT-EIQRAVNSSDIFFVCLGTGPMIENEDHD 492

Query: 357 RAGLLLPGRQQELVSR-VAKASRG-PVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
           RA + LPG+Q +L+   +  +++G P+VL+L  GGPV++++A    R+ AI+   +P Q 
Sbjct: 493 RASMELPGQQAQLLKDAIMFSAKGVPIVLLLFNGGPVNITWADRSDRVVAIMECFFPAQE 552

Query: 415 GGAAIADVLFGRAN 428
            G A+  V+    N
Sbjct: 553 TGEAVLRVVTNTGN 566


>gi|282877070|ref|ZP_06285912.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
           buccalis ATCC 35310]
 gi|281300752|gb|EFA93079.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
           buccalis ATCC 35310]
          Length = 721

 Score =  245 bits (626), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 158/443 (35%), Positives = 228/443 (51%), Gaps = 31/443 (6%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR--LKVAACCKHY 66
           ++ W+PNVNIFRDPRWGRGQET GEDP LT +   + V GLQG  G     K  AC KH+
Sbjct: 132 VSLWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGCAVVEGLQGGKGPHKYYKAFACAKHF 191

Query: 67  TAYDLDNWNGVDRYHFNA-RVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+  +   VS +D  +TY   FK  V  G V  VMC+YN ++G+P C+
Sbjct: 192 AVHSGPEWN---RHSISIDDVSPRDFHETYLPAFKHLVQVGGVKEVMCAYNSIDGEPCCS 248

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE--EAAADAIKAGLDLDCGPF 183
           D  +L+  +  +W   G +VSDC ++  ++    +   P+   A+A A+K G D+ CG  
Sbjct: 249 DQRLLEQLLRDEWGFKGIVVSDCGAIDDIWRKGFHEVEPDAAHASARAVKGGTDMSCGQT 308

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                E AVR G + EE ++ +L   I  +M+LG FD + S   +  +  +DV TPA ++
Sbjct: 309 YGSLPE-AVRLGKVTEERIDKSLKRLIVGRMQLGEFDPD-SITRWNAISMKDVSTPASRE 366

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           +AL+ A + + LL N    LPLS  +   V V+GPN++ +V M GNY G      T L G
Sbjct: 367 VALKMARETMTLLHNPMHALPLSK-QLKQVVVMGPNANDSVMMWGNYNGTPHHTVTILDG 425

Query: 304 ISR-----YAKTIHQAGCFGVACNGNQLIGAAEVAARQAD--ATVLVMGLDQSIEAEFI- 355
           I R       K I   G       GNQ +   ++     D    + V G+   +E E + 
Sbjct: 426 IRRKIGAQRVKFIEGCGLVEPHRRGNQALTTQQLVEEVGDNKTVIFVGGISPQLEGEQLE 485

Query: 356 ---------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
                    DR  + LP  Q+E+++ +  A  G  V+++ C G   +          AIL
Sbjct: 486 VEAKGFKGGDRVTIELPQVQREMIAALHAA--GKQVIMVNCSGSA-IGLVPEVTHTDAIL 542

Query: 407 WVGYPGQAGGAAIADVLFGRANP 429
              YPG+ GG A+ADVLFG  NP
Sbjct: 543 QAWYPGERGGEAVADVLFGDYNP 565


>gi|371777036|ref|ZP_09483358.1| glycoside hydrolase [Anaerophaga sp. HS1]
          Length = 890

 Score =  244 bits (622), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 136/308 (44%), Positives = 182/308 (59%), Gaps = 15/308 (4%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G   GLT+W+PN+NIFRDPRWGRG ET GEDP LTG+ A  Y++GLQG+    LK+ A  
Sbjct: 141 GIYQGLTFWTPNINIFRDPRWGRGMETYGEDPFLTGELAVDYIKGLQGDDDRYLKLVATS 200

Query: 64  KHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
           KH+  +        DR+HF+AR S +D   TY   FK  + E  V SVMC+YN+ NG P 
Sbjct: 201 KHFLVHSGPE---PDRHHFDARTSARDSLMTYTPHFKKTIQEAGVYSVMCAYNRYNGLPC 257

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCG 181
           C    + +N +  +W   GYIVSDC +V   Y   H+   P  EEAAA A+KAG DL+CG
Sbjct: 258 CGSKPV-ENLLRNEWGFKGYIVSDCWAVADFYKKGHHEVVPTVEEAAAMAVKAGTDLNCG 316

Query: 182 ---PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
              P L      AV+ GL+ EE++++ +   +  ++RLGMFD  P   P+ N+    V +
Sbjct: 317 NSYPALV----DAVKQGLVSEEEIDVLVKRLMEARLRLGMFD-PPEMVPYTNIPYSVVDS 371

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
             H++LAL AA + +VLLKN   TLPL       VAVIGPN++    ++ NY G      
Sbjct: 372 KEHRELALIAARKSMVLLKNDNNTLPLDK-NVKNVAVIGPNANNLDVLLANYNGYPSNPV 430

Query: 299 TPLQGISR 306
           TPL GI +
Sbjct: 431 TPLDGIRQ 438



 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 47/119 (39%), Positives = 63/119 (52%), Gaps = 15/119 (12%)

Query: 323 GNQLIGAAEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSR 372
           G  L   A   A  +D  ++ MGL  ++E E +          DR  + LP  Q +LV  
Sbjct: 603 GRNLKKEAIQIAAASDVVLMFMGLSPNLEGEEMPVNVPGFSGGDRVDIKLPQIQTDLVKA 662

Query: 373 VAKASRGPVVLVLMCGGPVDVSF-AKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +    + PVVLVL+ G  + +++ A+N P   AIL   YPGQAGG AIADVLFG  NP 
Sbjct: 663 IMSLGK-PVVLVLLNGSALAINWEAENVP---AILEAWYPGQAGGTAIADVLFGDYNPA 717


>gi|189467437|ref|ZP_03016222.1| hypothetical protein BACINT_03826 [Bacteroides intestinalis DSM
           17393]
 gi|189435701|gb|EDV04686.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 863

 Score =  243 bits (621), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 129/303 (42%), Positives = 177/303 (58%), Gaps = 4/303 (1%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G   GLTYWSPN+NIFRDPRWGRGQET GEDP LT     S+V+GLQG+    LK +AC 
Sbjct: 128 GIFRGLTYWSPNINIFRDPRWGRGQETYGEDPFLTASIGVSFVKGLQGDDPVYLKSSACA 187

Query: 64  KHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
           KHY  +    WN   R+ ++A+V+  DL DTY   FK  VVEGKV  VMC+YN   G+P 
Sbjct: 188 KHYAVHSGPEWN---RHTYDAKVNNHDLWDTYLPAFKELVVEGKVTGVMCAYNSFFGQPC 244

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPF 183
           C +  ++ + +   W+  GY+ SDC +V   YNT    +    A+ADA+  G D +CG  
Sbjct: 245 CGNDLLMMDILRNHWKFGGYVTSDCGAVEDFYNTHKTHQDAAAASADAVLHGTDCECGNG 304

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                  AV  GL+ E+ ++ +L     ++ RLGMFD +    P+ N+    +   AH+ 
Sbjct: 305 AYRALADAVLRGLITEKQIDESLKKLFEIRFRLGMFDPDDRV-PYSNIPLSVLECDAHKA 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
            AL+ A Q IVLLKN  + LPL+  +   +AV+GPN+D    ++ NY G     TT L+G
Sbjct: 364 HALKIARQSIVLLKNQDQLLPLNKNKIKKIAVVGPNADDKSVLLANYYGYPSHITTALEG 423

Query: 304 ISR 306
           I +
Sbjct: 424 IQK 426



 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 40/107 (37%), Positives = 54/107 (50%), Gaps = 13/107 (12%)

Query: 333 AARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVV 382
           A + AD  + V GL   +E E +          DR  + +P  QQ L+  +  A+  PVV
Sbjct: 595 AVKDADVIIFVGGLSAKVEGEEMGVEIEGFKRGDRTSISIPSVQQNLLKELY-ATGKPVV 653

Query: 383 LVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            V+M G  + + +      + AIL   Y GQAGG AIADVLFG  NP
Sbjct: 654 FVMMTGSALGLEW--ESAHLPAILNAWYGGQAGGQAIADVLFGDYNP 698


>gi|380509734|ref|ZP_09853141.1| beta-glucosidase-related glycosidase [Xanthomonas sacchari NCPPB
           4393]
          Length = 883

 Score =  243 bits (620), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 140/318 (44%), Positives = 189/318 (59%), Gaps = 9/318 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           +G   GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQG+     K+ A 
Sbjct: 124 HGRYQGLTFWSPNINIFRDPRWGRGQETYGEDPYLTARMGVAFVQGLQGDDPVYRKLDAT 183

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
            KH+  +        DR+HF+AR SK+DL DTY   F+A V EGKV +VM +YN+V G+ 
Sbjct: 184 AKHFAVHSGPE---ADRHHFDARPSKRDLYDTYLPAFEALVKEGKVDAVMGAYNRVYGES 240

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP 182
             A   +L++ +   W   GY+VSDC ++  ++   H   + E AAA A+K G +L+CG 
Sbjct: 241 ASASQFLLRDVLRRDWGFTGYVVSDCWAIVDIWKHHHLAPSREAAAALAVKNGTELECGQ 300

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
             A     AVR GL+ E +++ A+    T +MRLGMFD  P    +  +       PAH 
Sbjct: 301 EYAT-LPAAVRQGLIGEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQVPAHD 358

Query: 243 QLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
            LALQAA + +VLLKN    LPLS TL+   +AV+GP +D T+ ++GNY G      T L
Sbjct: 359 ALALQAAQESLVLLKNDG-VLPLSRTLKR--IAVVGPTADDTMALLGNYFGTPAAPVTIL 415

Query: 302 QGISRYAKTIHQAGCFGV 319
           QGI   AK I      GV
Sbjct: 416 QGIRDAAKGIEVRYARGV 433



 Score = 58.2 bits (139), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 42/107 (39%), Positives = 54/107 (50%), Gaps = 13/107 (12%)

Query: 333 AARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVV 382
           AAR AD  V V GL   +E E +          DR  L LP  Q+ L+  +  A+  PVV
Sbjct: 611 AARNADVVVFVGGLTGDVEGEEMKVDYPGFAGGDRTDLRLPAPQRALLEAL-HATGKPVV 669

Query: 383 LVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +VL  G  + V +A+    + AIL   YPGQ GG A+   LFG  NP
Sbjct: 670 MVLTGGSALAVDWAQA--HLPAILMSWYPGQRGGTAVGQALFGEVNP 714


>gi|451996250|gb|EMD88717.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
           C5]
          Length = 763

 Score =  243 bits (619), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 149/434 (34%), Positives = 225/434 (51%), Gaps = 15/434 (3%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           NGG+A + YW+P++N  RD RWGR  E+PGED      Y  + + GL+G+   R K+ A 
Sbjct: 137 NGGVAPMDYWTPDINPVRDIRWGRASESPGEDIRRIKGYTKALLAGLEGDQAQR-KIIAT 195

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKHY  YD++ W G DR++F+A+++ QDL + Y  PF+ C  + KV S MCSYN VNG P
Sbjct: 196 CKHYVGYDMEAWGGYDRHNFSAKITMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNGVP 255

Query: 123 TCADPDILKNTIHGQWRL---DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           TCAD  +L+  +   W     + YI SDC++V  +     Y  T  +  A A   G+DL 
Sbjct: 256 TCADTYVLQTILRDHWNWTDSNNYITSDCEAVADISENHKYVETLAQGTALAFAKGMDLS 315

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           C    +    GA   GLL    ++ AL       +  G FDG  +   + NL   D+ TP
Sbjct: 316 CEYSGSSDIPGAWSQGLLNLSVIDKALTRQYEGLVHAGYFDGAKAT--YANLSYNDINTP 373

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
             +QL+LQ   +G+V+LKN   TLPL   +   VA+IG  ++ +  + G Y+G      +
Sbjct: 374 EARQLSLQVTSEGLVMLKND-HTLPLPLTKGSKVAMIGFWANDSSKLQGIYSGPPPYRHS 432

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQL----IGAAEVAARQADATVLVMGLDQSIEAEFI 355
           P+    +    +  A  +G     + +       A  AA ++D  +   G D ++  E  
Sbjct: 433 PVFAGEQMG--LDMAIAWGPMIQNSSVPDNWTTNALDAAEKSDYILYFGGQDWTVAQEGY 490

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  +  P  Q +L++++AK  +  VV+ L  G   D S   +   I +I+W  +PGQ G
Sbjct: 491 DRTTISFPQVQIDLLAKLAKLGKPLVVITL--GDMTDHSPLLSMEGINSIIWANWPGQDG 548

Query: 416 GAAIADVLFGRANP 429
           G AI +V+ G   P
Sbjct: 549 GPAILNVISGVHAP 562


>gi|373951852|ref|ZP_09611812.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373888452|gb|EHQ24349.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 871

 Score =  242 bits (617), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 130/303 (42%), Positives = 177/303 (58%), Gaps = 13/303 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYW+PN+NIFRDPRWGRGQET GEDP LTG   +++V GLQGN    LK AAC KHY 
Sbjct: 135 GLTYWTPNINIFRDPRWGRGQETYGEDPFLTGAMGSAFVSGLQGNDPKYLKAAACAKHYA 194

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ FNA +S  DL DTY   FK  VV+ KVA VMC+YN    +P C 
Sbjct: 195 VH-----SGPEPLRHVFNADISTYDLWDTYLPAFKKLVVDDKVAGVMCAYNAFKTQPCCG 249

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
              ++ + +  QW+  GY+ SDC  +   +       T E+A+ DA+  G D++CG    
Sbjct: 250 SDLLMVDILRNQWKFSGYVTSDCGGIDDFFKNHKTHATAEDASTDAVLHGTDIECGTDAY 309

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC-TPAHQQL 244
                AV+ G + E  +++++     ++ RLGMFD  PS        P  V  +P HQ  
Sbjct: 310 KSLVAAVKEGKISETQIDISVKRLFMIRFRLGMFD--PSDVVKYAQTPVSVLESPEHQAH 367

Query: 245 ALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           AL+ A Q +VLLKN+  TLPLS T+R   + V+GPN+D  + ++GNY G     TT  QG
Sbjct: 368 ALKMARQSVVLLKNANHTLPLSKTIRK--IVVLGPNADNPIAILGNYNGTPSNLTTVYQG 425

Query: 304 ISR 306
           I +
Sbjct: 426 IRQ 428



 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 40/105 (38%), Positives = 54/105 (51%), Gaps = 15/105 (14%)

Query: 337 ADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLM 386
           ADA V V G+   +E E +          DR  + LP  Q  L+ +  +A+  PVV V+M
Sbjct: 603 ADAIVYVGGISPQLEGEEMQVNYPGFNGGDRTSIQLPAAQTNLM-KTLQATGKPVVFVMM 661

Query: 387 CGGPVDVSF-AKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            G  +   + A+N P   AI+   Y GQA G A+ADVLFG  NP 
Sbjct: 662 TGSALATPWEAENIP---AIVNAWYGGQAAGTAVADVLFGDYNPA 703


>gi|423279990|ref|ZP_17258903.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
           610]
 gi|404584326|gb|EKA88991.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
           610]
          Length = 722

 Score =  242 bits (617), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 143/423 (33%), Positives = 226/423 (53%), Gaps = 16/423 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYWSP +N+ RDPRWGR +ET GEDP LT +   ++V+GLQG+  + LK  A  KH+ 
Sbjct: 148 GLTYWSPTINMARDPRWGRNEETYGEDPYLTSRLGVAFVKGLQGDHPAYLKTVATIKHFV 207

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           A + +N    +R+  ++++  + L + Y   ++ACV E  V SVM +YN  NG P     
Sbjct: 208 ANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEAGVQSVMTAYNAFNGVPPSGSR 263

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L   +  +W  DG++VSDC ++GV+        + EEAAA  + +G DL+CG      
Sbjct: 264 WLLGEVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEK 323

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              AV+ GL+ E  ++ AL   +T + +LG FD      P+ +   + +      +LA +
Sbjct: 324 LVQAVKQGLISEATIDQALTRVLTARFKLGEFD-PMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRY 307
           AA + +VLLKN    LPLS  +  +VAV+GP +D     +G Y+G      T L+G+   
Sbjct: 383 AAVKSVVLLKNE-NLLPLSKEKTKSVAVVGPFAD--HNYLGGYSGQPPYSVTLLKGVKDL 439

Query: 308 AKTIHQAGCF-GVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQ 366
                +     G+  + + ++     A +  D  ++ +G D+ +  E  D   + LP  Q
Sbjct: 440 MGKRGKVNYLNGIGASRDSIVA----AVKGVDVVLVALGSDEKMARENHDMTSIYLPEEQ 495

Query: 367 QELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGR 426
           ++L+  + + +   +VLV   G P+   +A  D  I AI+   YPGQ  G A+AD+LFG 
Sbjct: 496 EKLLKAIYQVN-PRIVLVFHSGNPLTSEWA--DVHIPAIMQAWYPGQEAGRALADLLFGN 552

Query: 427 ANP 429
            NP
Sbjct: 553 ENP 555


>gi|313145345|ref|ZP_07807538.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
 gi|313134112|gb|EFR51472.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
          Length = 722

 Score =  242 bits (617), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 143/423 (33%), Positives = 226/423 (53%), Gaps = 16/423 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYWSP +N+ RDPRWGR +ET GEDP LT +   ++V+GLQG+  + LK  A  KH+ 
Sbjct: 148 GLTYWSPTINMARDPRWGRNEETYGEDPYLTSRLGVAFVKGLQGDHPAYLKTVATIKHFV 207

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           A + +N    +R+  ++++  + L + Y   ++ACV E  V SVM +YN  NG P     
Sbjct: 208 ANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEAGVQSVMTAYNAFNGVPPSGSR 263

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L   +  +W  DG++VSDC ++GV+        + EEAAA  + +G DL+CG      
Sbjct: 264 WLLGEVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEK 323

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              AV+ GL+ E  ++ AL   +T + +LG FD      P+ +   + +      +LA +
Sbjct: 324 LVQAVKQGLISEATIDQALTRVLTARFKLGEFD-PMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRY 307
           AA + +VLLKN    LPLS  +  +VAV+GP +D     +G Y+G      T L+G+   
Sbjct: 383 AAVKSVVLLKNE-NLLPLSKEKTKSVAVVGPFAD--HNYLGGYSGQPPYSVTLLKGVKDL 439

Query: 308 AKTIHQAGCF-GVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQ 366
                +     G+  + + ++     A +  D  ++ +G D+ +  E  D   + LP  Q
Sbjct: 440 MGKRGKVNYLNGIGASRDSIVA----AVKGVDVVLVALGSDEKMARENHDMTSIYLPEEQ 495

Query: 367 QELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGR 426
           ++L+  + + +   +VLV   G P+   +A  D  I AI+   YPGQ  G A+AD+LFG 
Sbjct: 496 EKLLKAIYQVN-PRIVLVFHSGNPLTSEWA--DVHIPAIMQAWYPGQEAGRALADLLFGN 552

Query: 427 ANP 429
            NP
Sbjct: 553 ENP 555


>gi|451851086|gb|EMD64387.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
          Length = 763

 Score =  242 bits (617), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 148/434 (34%), Positives = 226/434 (52%), Gaps = 15/434 (3%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           NGG+A + YW+P++N  RD RWGR  E+PGED      Y  + + GL+G+   R K+ A 
Sbjct: 137 NGGVAPVDYWTPDINPVRDIRWGRASESPGEDIRRIKGYTKALLAGLEGDQAQR-KIIAT 195

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKHY  YD++ W G DR++F+A+++ QDL + Y  PF+ C  + KV S MCSYN VNG P
Sbjct: 196 CKHYVGYDMEAWGGYDRHNFSAKITMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNGIP 255

Query: 123 TCADPDILKNTIHGQWRL---DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD 179
           TCAD  +L+  +   W     + YI SDC++V  +     Y  T  +  A A   G+DL 
Sbjct: 256 TCADTYVLQTILRDHWNWTDSNNYITSDCEAVADISENHKYVETLAQGTALAFAKGMDLS 315

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           C    +    GA   GLL    ++ AL       +  G FDG  +   + NL  +D+ TP
Sbjct: 316 CEYTGSSDIPGAWAQGLLNISVIDKALTRQYEGLVHAGYFDGAKAT--YANLSYKDINTP 373

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
             +QL+LQ   +G+V+LKN   TLPL   +   VA+IG  ++ +  + G Y+G      +
Sbjct: 374 EARQLSLQVTSEGLVMLKND-HTLPLPLTKGSKVAMIGFWANDSSKLQGIYSGPPPYRHS 432

Query: 300 PLQGISRYAKTIHQAGCFGVACNGNQL----IGAAEVAARQADATVLVMGLDQSIEAEFI 355
           P+    +    +  A  +G     + +       A  AA ++D  +   G D ++  E  
Sbjct: 433 PVFAGEQMG--LDMAIAWGPMIQNSSVPDNWTTNALDAAEKSDYILYFGGQDWTVAQEGY 490

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  +  P  Q +L++++AK  +  VV+ L  G   D S   +   + +I+W  +PGQ G
Sbjct: 491 DRTTISFPQVQIDLLTKLAKLGKPLVVITL--GDMTDHSPLLSMEGVNSIIWANWPGQDG 548

Query: 416 GAAIADVLFGRANP 429
           G AI +V+ G   P
Sbjct: 549 GPAILNVVSGAHAP 562


>gi|116621778|ref|YP_823934.1| glycoside hydrolase family 3 protein [Candidatus Solibacter
           usitatus Ellin6076]
 gi|116224940|gb|ABJ83649.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
           usitatus Ellin6076]
          Length = 850

 Score =  242 bits (617), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 142/353 (40%), Positives = 202/353 (57%), Gaps = 20/353 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT + A ++++G+QG      KV A  KHY 
Sbjct: 131 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTSRMAVAFIKGMQGEDPHYYKVIATAKHYA 190

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +         R+ F+ + S +DL DTY   F+A +VE +  S+MC+YN+V+G P CA  
Sbjct: 191 VHSGPE---SSRHQFDVKPSPRDLADTYLPAFRASIVEARADSLMCAYNRVDGIPACAST 247

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
           D+L+  + G+W   G++VSDC +V  ++   HY      A+A A+KAG DL CG      
Sbjct: 248 DLLEKRLRGEWGFQGFVVSDCGAVSDIFRGHHYQPDAASASAVAVKAGTDLTCGNEYRAL 307

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
            + AV+ GL+ E ++N +L      + +LGMFD  P   PF N+   +V +  H+++AL+
Sbjct: 308 VD-AVKTGLITEPEINRSLERLFVARFKLGMFD-PPERVPFSNIPYSEVDSAGHRKIALE 365

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRY 307
           AA + IVLLKN   TLPL +     +AVIGP +D    ++GNY G +    TPL GI   
Sbjct: 366 AARKSIVLLKNDG-TLPLKS-SIKKIAVIGPAADDAEALLGNYNGFSSLQVTPLAGIE-- 421

Query: 308 AKTIHQAGCFGVACNGNQLIGAAEVAARQA--DATVLV--MGLDQSIEAEFID 356
               HQ   +         +GA   A  QA   A+VL    G  + ++AE+ D
Sbjct: 422 ----HQ---WAGKAEVRYALGANYTAQSQAPLPASVLTPPTGTGRGLQAEYFD 467



 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 58/114 (50%), Gaps = 13/114 (11%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAK 375
           L+ AA  A   AD T+  +GL+ S+E E +          DR  L LP  Q++L+   A 
Sbjct: 593 LLAAAIEAVSNADVTLAFVGLNPSLEGEEMPVSVPGFQGGDRTNLELPEPQEKLIE-AAI 651

Query: 376 ASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           A+  PVV+VL  G  V ++FA       A+L   Y G+  G AIAD L G  NP
Sbjct: 652 ATGKPVVVVLASGSAVAMNFAAQ--HASALLETWYNGEETGTAIADTLAGINNP 703


>gi|424661938|ref|ZP_18098975.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
           616]
 gi|404578249|gb|EKA82984.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
           616]
          Length = 722

 Score =  241 bits (616), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 142/423 (33%), Positives = 226/423 (53%), Gaps = 16/423 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYWSP +N+ RDPRWGR +ET GEDP LT +   ++V+GLQG+  + LK  A  KH+ 
Sbjct: 148 GLTYWSPTINMARDPRWGRNEETYGEDPYLTSRLGVAFVKGLQGDHPAYLKTVATIKHFV 207

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           A + +N    +R+  ++++  + L + Y   ++ACV E  V SVM +YN  NG P     
Sbjct: 208 ANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEADVQSVMTAYNAFNGVPPSGSR 263

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L   +  +W  DG++VSDC ++GV+        + EEAAA  + +G DL+CG      
Sbjct: 264 WLLGEVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEK 323

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              AV+ GL+ E  ++ AL   +T + +LG FD      P+ +   + +      +LA +
Sbjct: 324 LVQAVKQGLISEATIDQALTRVLTARFKLGEFD-PMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRY 307
           AA + +VLLKN    LPLS  +  +VAV+GP +D     +G Y+G      T L+G+   
Sbjct: 383 AAVKSVVLLKNE-NLLPLSKEKTKSVAVVGPFAD--HNYLGGYSGQPPYSITLLKGVKDL 439

Query: 308 AKTIHQAGCF-GVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQ 366
                +     G+  + + ++     A +  D  ++ +G D+ +  E  D   + LP  Q
Sbjct: 440 MGKRGKVNYLNGIGASRDSIVA----AVKGVDVVLVALGSDEKMARENHDMTSIYLPEEQ 495

Query: 367 QELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGR 426
           ++L+  + + +   +VLV   G P+   +A  D  I AI+   YPGQ  G A+A++LFG 
Sbjct: 496 EKLLKAIYQVN-PRIVLVFHSGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGN 552

Query: 427 ANP 429
            NP
Sbjct: 553 ENP 555


>gi|389736853|ref|ZP_10190363.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
 gi|388438821|gb|EIL95541.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
          Length = 868

 Score =  241 bits (615), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 132/303 (43%), Positives = 183/303 (60%), Gaps = 9/303 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           +G   GLT WSPN+NIFRDPRWGRGQET GEDP LTG+ A ++VRG+QG+     +  A 
Sbjct: 122 HGLFQGLTLWSPNINIFRDPRWGRGQETYGEDPYLTGQLAVAFVRGIQGDDPQHPRAIAT 181

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
            KH+ A+   +     R  F+  VS  DLEDTY   F+  VV+G   SVMC+YN ++G P
Sbjct: 182 PKHFVAH---SGPEAGRDSFDVDVSPHDLEDTYLPAFRTAVVDGHAGSVMCAYNALHGTP 238

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP 182
            CA+  +L   +   W   GY+VSDCD+VG + +  ++     +A+  A++AG DLDCG 
Sbjct: 239 ACANAGLLDTRLRKDWGFAGYVVSDCDAVGDIASYHYFKPDDVQASVAAVQAGTDLDCGH 298

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
             A   + AVR G + E  ++ +L    T + RLG   G     P+  +G   + +PAH+
Sbjct: 299 TYASLAQ-AVRQGDIAESALDASLVRLFTARYRLGEL-GSRGNDPYARIGADQIDSPAHR 356

Query: 243 QLALQAAHQGIVLLKNSARTLPLST-LRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
           +LALQAA + +VLLKN+  TLPL   +R   +AVIGP++D   T+  NY G A    TPL
Sbjct: 357 KLALQAALESLVLLKNAHSTLPLHAGMR---LAVIGPDADALETLEANYHGTARHPVTPL 413

Query: 302 QGI 304
           QG+
Sbjct: 414 QGL 416



 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 44/110 (40%), Positives = 58/110 (52%), Gaps = 13/110 (11%)

Query: 331 EVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGP 380
           E     ADA V  +GL   +E E +          DR  + LP  Q+ L+ R A+AS  P
Sbjct: 592 ERVLHDADAVVAFIGLSPDVEGEQLRIDVPGFDGGDRTDIGLPAPQRALLER-ARASGKP 650

Query: 381 VVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +++VL+ G  V + +A+      AIL   YPGQAGG AIA VL G  NPG
Sbjct: 651 LIVVLLSGSAVALDWAQQ--HADAILAAWYPGQAGGTAIAQVLAGDYNPG 698


>gi|297740661|emb|CBI30843.3| unnamed protein product [Vitis vinifera]
          Length = 401

 Score =  241 bits (615), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 117/236 (49%), Positives = 161/236 (68%), Gaps = 3/236 (1%)

Query: 195 GLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIV 254
           G  REEDV+ +L     V  ++G FDG PS   + +L  +D+CT  H +LA  AA QGIV
Sbjct: 3   GKAREEDVDTSLRNLYIVLTQVGFFDGIPS---YESLDKKDLCTKEHIELAADAARQGIV 59

Query: 255 LLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRYAKTIHQA 314
           LLKN   TLPL   +   +A+IGP+++ T+ M+GNYAGV C Y++PL G S Y K  ++ 
Sbjct: 60  LLKNINETLPLDPAKLKNLALIGPHANATIEMLGNYAGVPCQYSSPLDGFSAYGKVTYEM 119

Query: 315 GCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVA 374
           GC  V C+    I  A  A++ ADAT+L++GLD+++E E +DR  LLLPG Q EL+ +V 
Sbjct: 120 GCNNVTCDNKTFIMPAVEASKNADATILLVGLDKTVEGEGLDRNDLLLPGYQTELILQVI 179

Query: 375 KASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            AS+GP++LV+M G  VD+SF+K D R+ AILW GYPG+ GG AIADV++G+ NPG
Sbjct: 180 VASKGPIILVIMSGSAVDISFSKTDDRVKAILWAGYPGEEGGRAIADVVYGKYNPG 235


>gi|121700633|ref|XP_001268581.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
 gi|119396724|gb|EAW07155.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
          Length = 743

 Score =  241 bits (614), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 144/432 (33%), Positives = 215/432 (49%), Gaps = 60/432 (13%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPV-LTGKYAASYVRGLQGNTGSR-LKVA 60
           N G  GL  +SPN+N FR P WGRGQETPGED   L+  YA  Y+ G+QG    + LK+ 
Sbjct: 162 NAGRYGLDVYSPNINSFRHPVWGRGQETPGEDAYCLSSAYAYEYITGIQGGVDPKSLKLV 221

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           A  KHY  YD++NW+G  R   +  +++QDL + Y   F     + KV SVMCSYN VNG
Sbjct: 222 ATAKHYAGYDIENWDGHSRLGNDMNITQQDLSEYYTPQFLVAARDAKVRSVMCSYNAVNG 281

Query: 121 KPTCADPDILKNTIHGQWRL--DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            P+CA+   L+  +   +    DGYI SDCDS   ++N   Y      AAAD+I+AG D+
Sbjct: 282 VPSCANSFFLQTLLRDTFGFVEDGYISSDCDSAYNVFNPHEYAANVSSAAADSIRAGTDI 341

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           DCG     + + AV        D NL                                  
Sbjct: 342 DCGTTYQYYFDEAV--------DQNL---------------------------------- 359

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
                L+     +G++ L         S L       +GP  +V+  + GNY G A    
Sbjct: 360 -----LSRADIERGVIRL--------YSNLMRLGYFDVGPWMNVSTQLQGNYFGPAPYLI 406

Query: 299 TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRA 358
           +PL         ++ A    ++ N       A  AA+++DA +   G+D S+EAE +DR 
Sbjct: 407 SPLDAFRDSHLDVNYAFGTNISSNSTDGFSKALSAAKKSDAIIFAGGIDNSLEAETLDRM 466

Query: 359 GLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAA 418
            +  PG+Q EL+ ++++  + P++++ M GG VD S  K++  + +++W GYPGQ+GG A
Sbjct: 467 NITWPGKQLELIDQLSQLGK-PLIVLQMGGGQVDSSLLKSNKNVNSLIWGGYPGQSGGQA 525

Query: 419 IADVLFGRANPG 430
           + D++ G+  P 
Sbjct: 526 LLDIITGKRAPA 537


>gi|323344407|ref|ZP_08084632.1| beta-glucosidase [Prevotella oralis ATCC 33269]
 gi|323094534|gb|EFZ37110.1| beta-glucosidase [Prevotella oralis ATCC 33269]
          Length = 722

 Score =  241 bits (614), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 153/437 (35%), Positives = 237/437 (54%), Gaps = 27/437 (6%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT+W+PNVNIFRDPRWGRG ET GEDP LTG    ++V+G+QGN    LK AAC KH+
Sbjct: 128 AGLTFWAPNVNIFRDPRWGRGMETYGEDPFLTGVLGTAFVKGMQGNDPFYLKAAACGKHF 187

Query: 67  TAYDLDNWNGVDRYHFNARV--SKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +R    A V  +K DL +TY   FK  V +GKV S+M +Y ++ G+   
Sbjct: 188 AVH-----SGPERTRHTANVEPTKHDLYETYLPAFKMLVQQGKVESIMGAYQRLYGESCS 242

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFL 184
               +L + +   W   G++VSDC +V  +Y      ++  EA A AIKAGL+L+CG  +
Sbjct: 243 GSKYLLTDILRKDWGFKGHVVSDCGAVTDMYEGHKLVKSEAEAVAFAIKAGLNLECGNSM 302

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
               + A++  L+ E+D++ AL   +  +++LG+   +  A P+       + +  ++ +
Sbjct: 303 RTMKD-ALKQKLITEKDLDKALLPLMMTRLKLGILQPD-VACPYNEFPESVIGSIDNRNI 360

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           A +AA + +VLLKN    LP++     T+ V GP +     ++GNY G++  Y+T L+GI
Sbjct: 361 AQRAAEESMVLLKNDG-VLPIAK-DIRTLFVTGPGATDAYYLMGNYFGLSDRYSTYLEGI 418

Query: 305 ----SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIE--------- 351
               S      ++ G   V  N N  +  +   +R A+ ++++MG   + E         
Sbjct: 419 VGKVSNGTSVNYKQGFMQVFKNLND-VNWSVSESRGAEVSIIIMGNSGNTEGEEGDAIAS 477

Query: 352 AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYP 411
           +E  DR  L LP  Q + +  V+K     +V+VL  G P+DV           + W  YP
Sbjct: 478 SERGDRVDLRLPEPQMQYLREVSKDRTNKLVVVLTGGSPIDVKEITELADAVVMAW--YP 535

Query: 412 GQAGGAAIADVLFGRAN 428
           GQ GG A+A++LFG AN
Sbjct: 536 GQEGGVALANLLFGDAN 552


>gi|399025517|ref|ZP_10727513.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
           CF314]
 gi|398077894|gb|EJL68841.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
           CF314]
          Length = 875

 Score =  241 bits (614), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 124/304 (40%), Positives = 178/304 (58%), Gaps = 10/304 (3%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G   GLT+W+PN+NIFRDPRWGRGQET GEDP LT     + V+GLQGN     K  AC 
Sbjct: 127 GRYEGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQGNDPKYFKTHACA 186

Query: 64  KHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
           KH+  +    WN   R+ +NA VSK+DL +TY   FK+ V+EG V  VMC+YN  +G+P 
Sbjct: 187 KHFAVHSGPEWN---RHSYNAEVSKRDLYETYLPAFKSLVLEGNVREVMCAYNAFDGQPC 243

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEE--AAADAIKAGLDLDCG 181
           CA   +L   + G+W+ DG +VSDC ++   Y  +++   P+E   AADA+K   DL+CG
Sbjct: 244 CASNTLLNEILRGKWKYDGMVVSDCWALADFYQEKYHGTHPDEKSTAADALKHSTDLECG 303

Query: 182 PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC-TPA 240
                +   ++ GGL+ E+D+++++   +     LGM D  P +    N  P  V  +  
Sbjct: 304 DTYN-NLNKSLAGGLITEKDIDISMRRILKGWFELGMLD--PKSSVLWNQIPYSVVDSDE 360

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H++ AL+ A + IVL+KN    LP +      +AV+GPN+D  +  +GNY G      T 
Sbjct: 361 HKKQALKMAQKSIVLMKNENNILPFNK-NIKKIAVVGPNADDEMMQLGNYNGTPSSIVTI 419

Query: 301 LQGI 304
           L+GI
Sbjct: 420 LEGI 423



 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 38/115 (33%), Positives = 57/115 (49%), Gaps = 15/115 (13%)

Query: 327 IGAAEVAARQADATVLVM--GLDQSIEAEFI----------DRAGLLLPGRQQELVSRVA 374
           +  A V  +  DA V+V   GL  S+E E +          D+  + LP  Q+EL++ + 
Sbjct: 590 VNFASVKEKVKDADVIVFAGGLSPSLEGEEMLVNAEGFKGGDKTSIELPKVQRELLAELR 649

Query: 375 KASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           K  + PVV VL  G    +   +++     +L   Y GQ+GG A+ADVL G  NP
Sbjct: 650 KTGK-PVVFVLCTGS--SLGLEQDEKNYDVLLNAWYGGQSGGTAVADVLAGDYNP 701


>gi|397642422|gb|EJK75223.1| hypothetical protein THAOC_03061, partial [Thalassiosira oceanica]
          Length = 534

 Score =  241 bits (614), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 157/409 (38%), Positives = 216/409 (52%), Gaps = 53/409 (12%)

Query: 51  GNTGSRLKVAACCKHYTAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACV------ 103
           G  G R ++AA CKH  AY L+     DR++F+A  + + D E TY   F ACV      
Sbjct: 2   GGAGLRPRIAATCKHLAAYSLE----TDRFNFSADGIDRTDWEGTYLPAFDACVHAERFL 57

Query: 104 VEGKVAS-------------VMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDS 150
           +E   AS             VMCSYN ++G P CADP +LK+ +   W   G +VSDC +
Sbjct: 58  LEHYNASGGGGGGQDRGALGVMCSYNAIDGVPACADPALLKDMLRRDWNFTGLVVSDCWA 117

Query: 151 VGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTI 210
           V  +++   +  + EEA   A+++G+DLDCG         A    LL E+D++ AL+   
Sbjct: 118 VDNIHSNHRFVASYEEAVGLALRSGVDLDCGNTFQDFGRLAYDESLLDEDDIDEALSRLF 177

Query: 211 TVQMRLGMFD--GEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSART-----L 263
            V M LG FD   EP A+   +          H QLAL+AA Q IVLLKN         L
Sbjct: 178 RVLMDLGYFDETDEPDAKSSDD-------EMEHDQLALEAALQSIVLLKNGINEDEPGPL 230

Query: 264 PLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRYAKTIHQAGCFGVACNG 323
           PLS  +H  +A+ GP +D    ++GNY G+     TPL G+++           GV    
Sbjct: 231 PLSLAKHKEIALFGPLADNQTVLLGNYHGLPSTIVTPLMGLAK----------MGVEVAF 280

Query: 324 NQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQEL---VSRVAKASRGP 380
            Q     +       AT+LV+GLDQS+EAE  DR  LLLP  Q++L   +SR +K    P
Sbjct: 281 RQRASVCDFHGES--ATILVVGLDQSLEAEDQDRTTLLLPVEQRDLIKTISRCSKVRDLP 338

Query: 381 VVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           VVLV++ GG VD+S  KN   I A++ + YPGQ GG+A+A VL+G  NP
Sbjct: 339 VVLVVVSGGMVDLSRYKNSSDIDAMIHMSYPGQNGGSALAQVLYGAYNP 387


>gi|413925161|gb|AFW65093.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 323

 Score =  241 bits (614), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 105/175 (60%), Positives = 136/175 (77%), Gaps = 2/175 (1%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-K 58
           ++N G A GLT WSPNVNIFRDPRWGRGQETPGEDP +  +YA ++VRG+QGN+ S L +
Sbjct: 146 LFNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNSSSSLLQ 205

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
            +ACCKH TAYDL++WNGV RY F ARV++QDLEDT+N PF++CVVE K + VMC+Y  +
Sbjct: 206 TSACCKHATAYDLEDWNGVARYSFVARVTEQDLEDTFNPPFRSCVVEAKASCVMCAYTAI 265

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           NG P CA+ D+L  T+ G W LDGY+ SDCD+V ++ + Q Y  TPE+A A ++K
Sbjct: 266 NGVPACANSDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVAVSLK 320


>gi|427386425|ref|ZP_18882622.1| hypothetical protein HMPREF9447_03655 [Bacteroides oleiciplenus YIT
           12058]
 gi|425726465|gb|EKU89330.1| hypothetical protein HMPREF9447_03655 [Bacteroides oleiciplenus YIT
           12058]
          Length = 864

 Score =  240 bits (612), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 128/302 (42%), Positives = 176/302 (58%), Gaps = 4/302 (1%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G   GLTYWSPN+NIFRDPRWGRGQET GEDP LT     ++V+GLQG+    LK +AC 
Sbjct: 128 GIFRGLTYWSPNINIFRDPRWGRGQETYGEDPYLTAAIGVAFVKGLQGDDPVYLKSSACA 187

Query: 64  KHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
           KHY  +    WN   R+ +NA VS  DL DTY   F+  VV+ KV  VMC+YN    +P 
Sbjct: 188 KHYAVHSGPEWN---RHTYNAEVSNHDLWDTYLPAFRELVVDAKVTGVMCAYNSFFEQPC 244

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPF 183
           C +  ++ + +  QW+ DGY+ SDC ++   YNT +      EA+ADA+  G D +CG  
Sbjct: 245 CGNDLLMMDILRNQWKFDGYVTSDCGAIEDFYNTHNTHEDAAEASADAVLHGTDCECGNG 304

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                  A+  GL+ EE V+++L     ++ RLGMFD +    P+ ++    +   AH+ 
Sbjct: 305 AYRALADAIVRGLITEEQVDVSLKKLFEIRFRLGMFDPDDRV-PYSDIPISVLECDAHKA 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
            AL+ A Q IVLLKN  + LPL   +   +AV+GPN+D    ++ NY G     TT L+G
Sbjct: 364 HALKMARQSIVLLKNEKQLLPLDMNKIKKIAVVGPNADDKSVLLANYYGYPSCVTTVLEG 423

Query: 304 IS 305
           I 
Sbjct: 424 IK 425



 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 40/107 (37%), Positives = 54/107 (50%), Gaps = 13/107 (12%)

Query: 333 AARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVV 382
           + + AD  V V GL   +E E +          DR  + +P  QQ L+  +  A+  PV+
Sbjct: 595 SVKDADVVVFVGGLSAKVEGEEMKVEIDGFKRGDRTSISIPVVQQNLLKELY-ATGKPVI 653

Query: 383 LVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            +LM G  V + +      + AIL   Y GQAGG AIADVLFG  NP
Sbjct: 654 FILMTGSAVGLEWESE--HLPAILNAWYGGQAGGQAIADVLFGDYNP 698


>gi|315500297|ref|YP_004089100.1| glycoside hydrolase family 3 domain protein [Asticcacaulis
           excentricus CB 48]
 gi|315418309|gb|ADU14949.1| glycoside hydrolase family 3 domain protein [Asticcacaulis
           excentricus CB 48]
          Length = 882

 Score =  240 bits (612), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 134/308 (43%), Positives = 175/308 (56%), Gaps = 10/308 (3%)

Query: 6   MAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKH 65
             GLT WSPN+NIFRDPRWGRGQET GEDP LT +   ++VRGLQG+     +  A  KH
Sbjct: 139 FGGLTVWSPNINIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQGDDPVYYRTVATPKH 198

Query: 66  YTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
           Y  +         R+  N   S  DL DTY   F+A + EG+  S+MC+YN +NG+P CA
Sbjct: 199 YAVHSGPE---AGRHRDNVNPSPYDLADTYLPAFRATITEGQAGSIMCAYNAINGQPACA 255

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVG-VLYNTQH-YTRTPEEAAADAIKAGLDLDCG-P 182
           + D+L   +   W   GY+VSDCD+VG + Y T H Y  TPEE    A + G DL CG  
Sbjct: 256 NEDLLVKYLRKDWGFKGYVVSDCDAVGDIYYKTSHAYRPTPEEGVTAAYQVGTDLICGNA 315

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
             A H   AVR GLL E+ ++ AL    T + +LG FD  P A+ F  +   D  TPA++
Sbjct: 316 NEADHLTRAVRQGLLPEKTLDTALIRLFTARFKLGQFD--PPAKVFPKITAEDYDTPANR 373

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
             + + A   +VLLKN    LPL       +AVIGPN+D   +++GNY G      T L 
Sbjct: 374 DFSQKVAESAMVLLKNENNLLPLKG-EPRQIAVIGPNADSMDSLVGNYNGDPSHPVTVLS 432

Query: 303 GI-SRYAK 309
           GI +R+ K
Sbjct: 433 GIRARFPK 440



 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 42/111 (37%), Positives = 64/111 (57%), Gaps = 13/111 (11%)

Query: 329 AAEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASR 378
           +A  AA++AD  V V GL Q +E E +          DR  L LP  QQ+++ +V+ A +
Sbjct: 598 SAVAAAKEADLVVFVAGLSQRVEGEEMRVETEGFSGGDRTTLNLPPAQQKVLEQVSAAGK 657

Query: 379 GPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            PVVLVL+ G  + +++A  D  + AI+   YPG  GGAA+A ++ G  +P
Sbjct: 658 -PVVLVLINGSALGINWA--DKNVPAIIEAWYPGGQGGAAVARLIAGDYSP 705


>gi|443717728|gb|ELU08656.1| hypothetical protein CAPTEDRAFT_228276 [Capitella teleta]
          Length = 731

 Score =  239 bits (611), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 148/435 (34%), Positives = 234/435 (53%), Gaps = 13/435 (2%)

Query: 5   GMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           G  G+  ++P +NI R P WGR QET GEDP L+G+ +  +V+GLQG+    ++ +  CK
Sbjct: 126 GHRGVNCFTPVINIMRHPLWGRNQETYGEDPWLSGQLSVGFVKGLQGDHPRYIQASGGCK 185

Query: 65  HYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           H+  ++      V R+ F+A+VS++D   T+   FK CV  G + ++MCSYN++NG P C
Sbjct: 186 HFDVHNGPENIPVSRFGFDAKVSERDWRMTFLPQFKTCVEAGSI-NIMCSYNRINGVPAC 244

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD----C 180
           A+  +L + +  +W  +GY++SD  ++  +     YT+T  EAAAD++KAG +++     
Sbjct: 245 ANKKLLTDILRKEWGFNGYVISDSGAIENIVYHHKYTKTLAEAAADSVKAGCNVELTGAT 304

Query: 181 GPFLA-IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           G  +A  +   AV+  L+ EE++   L   +  +MR G FD      PF  +    V + 
Sbjct: 305 GSGVAYFNLLNAVKQNLISEEELRENLKKPMYSRMRQGEFD-PVDMNPFTKIDMSVVLSQ 363

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA--GVACGY 297
            HQ LA++A+    VL+KN  R LPL   R   +A+IGP +D   T+ G+Y         
Sbjct: 364 EHQDLAVKASAMSFVLMKNLNRVLPLKK-RFDRLAIIGPFADNAETLFGDYIPNWDPKFV 422

Query: 298 TTPLQGISRYAKTIHQA-GCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFID 356
           +TP +G+      +  A GC   +C  N    A E A + A    + +G+  ++E E  D
Sbjct: 423 STPYEGLKSLGDDVRYASGCDDPSCT-NYDPKAIEKAVKGAQFVFVCLGVGSNLEREGHD 481

Query: 357 RAGLLLPGRQQELVSRVAKASR-GPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           RA L LPG Q +++      SR  P+VLVL   GPVD+++ K  P +  I+   YP    
Sbjct: 482 RADLDLPGYQLQILKDAEFFSREAPLVLVLFNAGPVDLTWPKLSPEVDGIIECFYPAMGT 541

Query: 416 GAAIADVLFGRANPG 430
           G A+  V+    + G
Sbjct: 542 GKALYQVVTATGDDG 556


>gi|325919363|ref|ZP_08181395.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
 gi|325550152|gb|EGD20974.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
          Length = 876

 Score =  239 bits (611), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 133/298 (44%), Positives = 179/298 (60%), Gaps = 10/298 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQ   G   K+ A  KH+ 
Sbjct: 125 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKLDATAKHFA 184

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +   +    DR+HF+   S++DL +TY   F+A V EGKVA+VM +YN+VNG+   A  
Sbjct: 185 VH---SGPEADRHHFDVHPSERDLHETYLPAFQALVQEGKVAAVMGAYNRVNGESASAST 241

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
             L+  +   W  DGYIVSDC ++  ++       TPE AAA  +K G DLDCG   A  
Sbjct: 242 R-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYAA- 299

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              AVR GL+ E  ++ AL   +T +MRLGMFD  P+  P+  +      +P H  LA +
Sbjct: 300 LPAAVRAGLIDEATIDTALKRLMTTRMRLGMFD-PPAKVPWAQIPASANQSPQHDALARR 358

Query: 248 AAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
            A + +VLLKN    LPL  TL+   +AVIGP +D  ++++GNY G      T LQGI
Sbjct: 359 TARESLVLLKNDG-VLPLKPTLKR--IAVIGPTADDPMSLLGNYYGTPAAPVTILQGI 413



 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/108 (39%), Positives = 60/108 (55%), Gaps = 13/108 (12%)

Query: 333 AARQADATVLVMGLDQSIEAEFID----------RAGLLLPGRQQELVSRVAKASRGPVV 382
           AAR A+  V V GL   +E E +D          R    LP  Q+EL+  + +A+  PVV
Sbjct: 606 AARDAEVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQAL-QATGTPVV 664

Query: 383 LVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            VL  G  + + +A+    + AIL   YPGQ GG+A+ DVLFG+A+PG
Sbjct: 665 AVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGSAVGDVLFGQASPG 710


>gi|375357164|ref|YP_005109936.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
 gi|301161845|emb|CBW21389.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
          Length = 722

 Score =  239 bits (611), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 144/426 (33%), Positives = 229/426 (53%), Gaps = 22/426 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYWSP +N+ RDPRWGR +ET GEDP LT +   ++V+GLQG+  + LK  A  KH+ 
Sbjct: 148 GLTYWSPTINMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFV 207

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           A + +N    +R+  ++++  + L + Y   ++ACV E    SVM +YN  NG P     
Sbjct: 208 ANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L + +  +W  DG++VSDC ++GV+        + EEAAA  + +G DL+CG      
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEK 323

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              AV  GL+ E  ++ AL   +T + +LG FD      P+ +   + +      +LA +
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFD-PMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG---- 303
           AA + +VLLKN A  LPL+  +  +VAV+GP +D     +G Y+G      + L+G    
Sbjct: 383 AAVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKEL 439

Query: 304 ISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLP 363
           I +  K  +  G    A +  Q++       + AD  ++ +G D+ +  E  D   + LP
Sbjct: 440 IGKKGKVTYLNGMGTSADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLP 492

Query: 364 GRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVL 423
             Q++L+ ++ + +   +VLV   G P+   +A  D  I AI+   YPGQ  G A+A++L
Sbjct: 493 EEQEKLLKKIYQVN-PRIVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLL 549

Query: 424 FGRANP 429
           FG  NP
Sbjct: 550 FGNENP 555


>gi|265765457|ref|ZP_06093732.1| beta-xylosidase [Bacteroides sp. 2_1_16]
 gi|263254841|gb|EEZ26275.1| beta-xylosidase [Bacteroides sp. 2_1_16]
          Length = 722

 Score =  239 bits (611), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 144/426 (33%), Positives = 229/426 (53%), Gaps = 22/426 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYWSP +N+ RDPRWGR +ET GEDP LT +   ++V+GLQG+  + LK  A  KH+ 
Sbjct: 148 GLTYWSPTINMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFV 207

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           A + +N    +R+  ++++  + L + Y   ++ACV E    SVM +YN  NG P     
Sbjct: 208 ANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L + +  +W  DG++VSDC ++GV+        + EEAAA  + +G DL+CG      
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEK 323

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              AV  GL+ E  ++ AL   +T + +LG FD      P+ +   + +      +LA +
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFD-PMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG---- 303
           AA + +VLLKN A  LPL+  +  +VAV+GP +D     +G Y+G      + L+G    
Sbjct: 383 AAVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKEL 439

Query: 304 ISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLP 363
           I +  K  +  G    A +  Q++       + AD  ++ +G D+ +  E  D   + LP
Sbjct: 440 IGKKGKVTYLNGMGTSADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLP 492

Query: 364 GRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVL 423
             Q++L+ ++ + +   +VLV   G P+   +A  D  I AI+   YPGQ  G A+A++L
Sbjct: 493 EEQEKLLKKIYQVN-PRIVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLL 549

Query: 424 FGRANP 429
           FG  NP
Sbjct: 550 FGNENP 555


>gi|374312362|ref|YP_005058792.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
 gi|358754372|gb|AEU37762.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
          Length = 874

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 142/358 (39%), Positives = 193/358 (53%), Gaps = 19/358 (5%)

Query: 6   MAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKH 65
             GLT WSPN+NIFRDPRWGRGQET GEDP LT      +V G+QGN    LK  A  KH
Sbjct: 130 FGGLTVWSPNINIFRDPRWGRGQETYGEDPFLTATLGTQFVEGVQGNDPFYLKADATPKH 189

Query: 66  YTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
           + A+     +G +  R  FNA VS  DL DTY   F A       A++MCSYN+++G P+
Sbjct: 190 FAAH-----SGPEEGRDSFNAVVSPHDLADTYLPAFHALTTNAHAAALMCSYNEIDGTPS 244

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPF 183
           CA  + L++ +  +W   GY+VSDCD+VG +    H+       AADA+ AG+DLDCG  
Sbjct: 245 CASGNNLQDLVRERWGFKGYVVSDCDAVGNIAGYHHFATDNAHGAADALNAGVDLDCGNT 304

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A  ++ ++   L  E  +N AL   +  ++RLGM D   S  P+ ++G  ++ +PAH  
Sbjct: 305 YAALSK-SLDQNLTTEAKLNQALHRLLLARVRLGMLD-PLSCSPYRDIGAEELDSPAHHT 362

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+AA + IVLLKN    LPL       V+VIGP +D+   +  NY G A    TPL G
Sbjct: 363 LALRAAEESIVLLKNDG-VLPLQA-STQKVSVIGPTADMVKVLEANYHGTALHPITPLDG 420

Query: 304 I-SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
             SR+    +  G            G +    R A       G    ++AE+ D+A L
Sbjct: 421 FRSRFHDVSYAQGSLLAE-------GVSAPVPRNALRVAAAPGSSAGLQAEYFDKASL 471



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 56/114 (49%), Gaps = 13/114 (11%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAK 375
           L+  A   A ++D  V  +GL   +E E +          DR  L LP  Q+ L+SR+ +
Sbjct: 594 LLDQAVQTAAKSDVIVAFVGLSPDLEGEALQLRLKGFNGGDRTSLDLPEAQRTLLSRLTQ 653

Query: 376 ASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
             + PV++VL  G  V +     D     +L   YPG+AGG A+A +L G  NP
Sbjct: 654 LHK-PVIIVLTSGSGVALGPEAKD--AAGVLEAWYPGEAGGEALAGILAGNVNP 704


>gi|255572559|ref|XP_002527213.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
 gi|223533389|gb|EEF35139.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
          Length = 454

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 122/256 (47%), Positives = 167/256 (65%), Gaps = 1/256 (0%)

Query: 176 LDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD 235
           +D++CG +   + + AV  G LREED++ AL    +VQ+RLG+FDG+     F  LGP D
Sbjct: 1   MDINCGSYAIRNAQSAVDKGKLREEDIDRALLNLFSVQLRLGLFDGDRINGHFSKLGPED 60

Query: 236 VCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC 295
           VCT  H++LAL+AA QGIVLLKN  + LPL+     ++A+IGP ++   ++ G+Y G +C
Sbjct: 61  VCTEEHKKLALEAARQGIVLLKNEKKFLPLNKKAVSSLAIIGPLANNGGSLGGDYTGYSC 120

Query: 296 GYTTPLQGISRYAK-TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEF 354
              +   G+  Y K T +  GC  V+C+ +     A   A+ AD  ++V G+D S E E 
Sbjct: 121 NPQSLFDGVQAYIKRTSYAVGCSNVSCDSDDQFPEAIHIAKTADFVIVVAGIDLSQETED 180

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
            DR  LLLPG+Q  LVS VA AS+ PV+LVL  GGPVDVSFAK D RI +ILW+GYPG+A
Sbjct: 181 RDRISLLLPGKQMALVSYVAAASKKPVILVLTGGGPVDVSFAKRDSRIASILWIGYPGEA 240

Query: 415 GGAAIADVLFGRANPG 430
           G  A+AD++FG  NPG
Sbjct: 241 GAKALADIIFGEYNPG 256


>gi|383117083|ref|ZP_09937830.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
 gi|251947612|gb|EES87894.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
          Length = 722

 Score =  239 bits (609), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 144/426 (33%), Positives = 229/426 (53%), Gaps = 22/426 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYWSP +N+ RDPRWGR +ET GEDP LT +   ++V+GLQG+  + LK  A  KH+ 
Sbjct: 148 GLTYWSPTINMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFV 207

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           A + +N    +R+  ++++  + L + Y   ++ACV E    SVM +YN  NG P     
Sbjct: 208 ANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L + +  +W  DG++VSDC ++GV+        + EEAAA  + +G DL+CG      
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEK 323

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              AV  GL+ E  ++ AL   +T + +LG FD      P+ +   + +      +LA +
Sbjct: 324 LVQAVEQGLISEVAIDRALTRVLTARFKLGEFD-PMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG---- 303
           AA + +VLLKN A  LPL+  +  +VAV+GP +D     +G Y+G      + L+G    
Sbjct: 383 AAVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKEL 439

Query: 304 ISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLP 363
           I +  K  +  G    A +  Q++       + AD  ++ +G D+ +  E  D   + LP
Sbjct: 440 IGKKGKVTYLNGMGTSADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLP 492

Query: 364 GRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVL 423
             Q++L+ ++ + +   +VLV   G P+   +A  D  I AI+   YPGQ  G A+A++L
Sbjct: 493 EEQEKLLKKIYQVN-PRIVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLL 549

Query: 424 FGRANP 429
           FG  NP
Sbjct: 550 FGNENP 555


>gi|423258868|ref|ZP_17239791.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
           CL07T00C01]
 gi|423264161|ref|ZP_17243164.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
           CL07T12C05]
 gi|387776448|gb|EIK38548.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
           CL07T00C01]
 gi|392706427|gb|EIY99550.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
           CL07T12C05]
          Length = 722

 Score =  238 bits (608), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 144/426 (33%), Positives = 228/426 (53%), Gaps = 22/426 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYWSP +N+ RDPRWGR +ET GEDP LT +   ++V+GLQG+  + LK  A  KH+ 
Sbjct: 148 GLTYWSPTINMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFV 207

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           A + +N    +R+  ++++  + L + Y   ++ACV E    SVM +YN  NG P     
Sbjct: 208 ANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L + +  +W  DG++VSDC ++GV+        + EEAAA  + +G DL+CG      
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEK 323

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              AV  GL+ E  ++ AL   +T + +LG FD      P+ +   + +      +LA +
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFD-PMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG---- 303
           AA + +VLLKN A  LPL+  +  +VAV+GP +D     +G Y+G      + L+G    
Sbjct: 383 AAVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKEL 439

Query: 304 ISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLP 363
           I +  K  +  G    A +  Q++       + AD  ++ +G D+ +  E  D   + LP
Sbjct: 440 IGKKGKVTYLNGMGTSADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLP 492

Query: 364 GRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVL 423
             Q++L+  + + +   +VLV   G P+   +A  D  I AI+   YPGQ  G A+A++L
Sbjct: 493 EEQEKLLKEIYQVN-PRIVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLL 549

Query: 424 FGRANP 429
           FG  NP
Sbjct: 550 FGNENP 555


>gi|336408348|ref|ZP_08588841.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
 gi|423248801|ref|ZP_17229817.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
           CL03T00C08]
 gi|423253750|ref|ZP_17234681.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
           CL03T12C07]
 gi|335937826|gb|EGM99722.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
 gi|392655379|gb|EIY49022.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
           CL03T12C07]
 gi|392657742|gb|EIY51373.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
           CL03T00C08]
          Length = 722

 Score =  238 bits (608), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 144/426 (33%), Positives = 228/426 (53%), Gaps = 22/426 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYWSP +N+ RDPRWGR +ET GEDP LT +   ++V+GLQG+  + LK  A  KH+ 
Sbjct: 148 GLTYWSPTINMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFV 207

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           A + +N    +R+  ++++  + L + Y   ++ACV E    SVM +YN  NG P     
Sbjct: 208 ANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L + +  +W  DG++VSDC ++GV+        + EEAAA  + +G DL+CG      
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEK 323

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              AV  GL+ E  ++ AL   +T + +LG FD      P+ +   + +      +LA +
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFD-PMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG---- 303
           AA + +VLLKN A  LPL+  +  +VAV+GP +D     +G Y+G      + L+G    
Sbjct: 383 AAVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKEL 439

Query: 304 ISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLP 363
           I +  K  +  G    A +  Q++       + AD  ++ +G D+ +  E  D   + LP
Sbjct: 440 IGKKGKVTYLNGMGTSADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLP 492

Query: 364 GRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVL 423
             Q++L+  + + +   +VLV   G P+   +A  D  I AI+   YPGQ  G A+A++L
Sbjct: 493 EEQEKLLKEIYQVN-PRIVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLL 549

Query: 424 FGRANP 429
           FG  NP
Sbjct: 550 FGNENP 555


>gi|449489074|ref|XP_002195511.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Taeniopygia guttata]
          Length = 685

 Score =  238 bits (607), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 164/434 (37%), Positives = 232/434 (53%), Gaps = 25/434 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL+ +SP +NI R P WGR QET GEDP L+G+ A S+V+GLQG     +K +A CKH++
Sbjct: 81  GLSCFSPVLNIMRHPLWGRNQETYGEDPFLSGELARSFVQGLQGPHPRYVKASAGCKHFS 140

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +       +  Y     V ++D   T+   F+ACV  G   S MCSYN++NG P CA+ 
Sbjct: 141 VH--GGHENILLYLLT--VLERDWRMTFLPQFQACVRAGSY-SFMCSYNRINGVPACANK 195

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG------ 181
            +L + + G+W  DGY+VSD  +V ++    HYTR+  E A  ++ AG +L+        
Sbjct: 196 KLLTDILRGEWGFDGYVVSDEGAVELIMLGHHYTRSFLETAVASVNAGCNLELSYGMRNN 255

Query: 182 PFLAIHTEGAVRGGLLRE--EDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
            F+ I  E    G +  +   D    L YT   +MRLG FD  P+  P+ +L    V +P
Sbjct: 256 VFMRI-PEALAMGNITLQMLRDRVRPLFYT---RMRLGEFD-PPAMNPYSSLDLSVVQSP 310

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLST--LRHHTVAVIGPNSDVTVTMIGNYAGVACG- 296
            H+ L+L+AA +  VLLKN   TLPL    L    +AV+GP +D    + G+YA V    
Sbjct: 311 EHRNLSLEAAVKSFVLLKNVRGTLPLKAQDLSSQHLAVVGPFADNPRVLFGDYAPVPEPR 370

Query: 297 -YTTPLQGISRY-AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEF 354
              TP +G+    A     AGC    C         +V    AD  ++ +G    +E E 
Sbjct: 371 YIYTPRRGLEMLGANVSFAAGCSEPRCQRYSRAELVKVVG-AADVVLVCLGTGVDVETEA 429

Query: 355 IDRAGLLLPGRQQELVSRVAKASRG-PVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 413
            DR+ L LPG Q EL+    +A+ G PV+L+L   GP+DVS+A+    +GAIL   +P Q
Sbjct: 430 KDRSDLSLPGHQLELLQDAVQAAAGRPVILLLFNAGPLDVSWAQAHDGVGAILACFFPAQ 489

Query: 414 AGGAAIADVLFGRA 427
           A G AIA VL G A
Sbjct: 490 ATGLAIARVLLGEA 503


>gi|329851587|ref|ZP_08266344.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
 gi|328840433|gb|EGF90005.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
          Length = 883

 Score =  238 bits (607), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 132/300 (44%), Positives = 173/300 (57%), Gaps = 10/300 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQG      +  A  KHY 
Sbjct: 143 GLTVWSPNINIFRDPRWGRGQETYGEDPYLTARIGVAFVKGLQGEDPVYYRTIATPKHYA 202

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +         R+  N   S+ DLEDTY   F+A +VEGK  S+MC+YN ++G+P CA+ 
Sbjct: 203 VHSGPE---ASRHRDNINPSRYDLEDTYLPAFRATIVEGKAVSIMCAYNAIDGQPACAND 259

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLY--NTQHYTRTPEEAAADAIKAGLDLDCG-PFL 184
           D+L   +   W   G++VSDCD+VG +Y   + HY  TPEE    A +AG DL CG    
Sbjct: 260 DLLVKHLRQDWGFKGFVVSDCDAVGDIYYKTSHHYRPTPEEGVTVAYQAGTDLICGNANE 319

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
           A H   AVR G+L E  V+ AL    + + +LG FD  P AQ F  +   D  T A++  
Sbjct: 320 ADHVASAVRKGILPESLVDTALVRLFSARFKLGQFD--PPAQVFPAITADDYDTQANRDF 377

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           +   A   +VLLKN    LPL +    T+AVIGPN+D   +++GNY G      T L GI
Sbjct: 378 SQHVAESAMVLLKNDG-LLPLKS-EPRTIAVIGPNADTMDSLVGNYNGDPSHPVTVLAGI 435



 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 41/110 (37%), Positives = 63/110 (57%), Gaps = 13/110 (11%)

Query: 330 AEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRG 379
           A  AA+++D  + V GL Q +E E +          DR  L LP  QQ+++ +V+ A+  
Sbjct: 600 AVAAAKESDLVIFVAGLSQRVEGEEMRVETPGFSGGDRTSLDLPPVQQKVLEQVS-ATGK 658

Query: 380 PVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           PVVLVL+ G  + V++A  D  + AI+   YPG  GGAA+A ++ G  +P
Sbjct: 659 PVVLVLINGSALSVNWA--DKNVPAIVEAWYPGGQGGAAVARLIAGDFSP 706


>gi|60680313|ref|YP_210457.1| glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
 gi|60491747|emb|CAH06504.1| putative glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
          Length = 722

 Score =  238 bits (606), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 143/426 (33%), Positives = 228/426 (53%), Gaps = 22/426 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYWSP +N+ RDPRWGR +ET GEDP LT +   ++V+GLQG+  + LK  A  KH+ 
Sbjct: 148 GLTYWSPTINMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFV 207

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           A + +N    +R+  ++++  + L + Y   ++ACV E    SVM +YN  NG P     
Sbjct: 208 ANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L + +  +W  DG++VSDC ++GV+        + EEAAA  + +G DL+CG      
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEK 323

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              AV  GL+ E  ++ AL   +T + +LG FD      P+ +   + +      +LA +
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFD-PMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG---- 303
           AA + +VLLKN A  LPL+  +  +VAV+GP +D     +G Y+G      + L+G    
Sbjct: 383 AAVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKEL 439

Query: 304 ISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLP 363
           I +  K  +  G    A +  Q++       + AD  ++ +G D+ +  E  D   + LP
Sbjct: 440 IGKKGKVTYLNGMGTSADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLP 492

Query: 364 GRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVL 423
             Q++ + ++ + +   +VLV   G P+   +A  D  I AI+   YPGQ  G A+A++L
Sbjct: 493 EEQEKFLKKIYQVN-PRIVLVFHTGNPLTSEWA--DTHILAIMQAWYPGQEAGRALANLL 549

Query: 424 FGRANP 429
           FG  NP
Sbjct: 550 FGNENP 555


>gi|389636381|ref|XP_003715843.1| beta-xylosidase [Magnaporthe oryzae 70-15]
 gi|351648176|gb|EHA56036.1| beta-xylosidase [Magnaporthe oryzae 70-15]
 gi|440480767|gb|ELQ61414.1| beta-xylosidase [Magnaporthe oryzae P131]
          Length = 517

 Score =  238 bits (606), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 145/378 (38%), Positives = 199/378 (52%), Gaps = 11/378 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAA 61
           N G AG  YW+PNVN F+DPRWGRG ETPGED +   +YA    RGL G     + ++ +
Sbjct: 144 NSGWAGFDYWTPNVNAFKDPRWGRGSETPGEDVLRIKRYAEYITRGLDGPVPNEQRRIIS 203

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKHY   D ++WNG  R+ FNA+++ QDL + Y  PF+ C  + KV S+MC+YN VNG 
Sbjct: 204 TCKHYAGNDFEDWNGTTRHDFNAKITMQDLAEYYLKPFQQCARDSKVGSIMCAYNAVNGV 263

Query: 122 PTCADPDILKNTIHGQWRL---DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           P+CA+  +L+  +   W+    + Y+ SDC++V  +    HY  T     A   +AG+D 
Sbjct: 264 PSCANKYLLQTILRDHWKWTEHNNYVTSDCEAVLDVSANHHYAPTNAAGTAICFEAGMDT 323

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
            C    +    GA   GLL+EE V+ AL       +R G FDGE +   + +L  + V +
Sbjct: 324 SCEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDGEEAM--YADLDWQHVNS 381

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
              Q LALQAA +G+VLLKN+  TLPL     H +A+IG  +D    + G Y+G A    
Sbjct: 382 AEAQSLALQAAVEGMVLLKNNG-TLPLDLDPSHKIAMIGFWADAPEKLQGGYSGRAHHLY 440

Query: 299 TPLQGISRYAKTIHQAGCFGVACNG---NQLIGAAEVAARQADATVLVMGLDQSIEAEFI 355
           +P     +    I  A    +  N    N    A E AA  AD  +   GLD S   E +
Sbjct: 441 SPAFAARQLGLDITVASGPVLQDNNASDNWTTNALE-AASGADYILYFGGLDTSAAGETL 499

Query: 356 DRAGLLLPGRQQELVSRV 373
           DR  L  P  Q  LV  V
Sbjct: 500 DRTDLDWPEAQLTLVKVV 517


>gi|440476402|gb|ELQ45004.1| beta-xylosidase, partial [Magnaporthe oryzae Y34]
          Length = 515

 Score =  237 bits (604), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 144/375 (38%), Positives = 198/375 (52%), Gaps = 11/375 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAA 61
           N G AG  YW+PNVN F+DPRWGRG ETPGED +   +YA    RGL G     + ++ +
Sbjct: 144 NSGWAGFDYWTPNVNAFKDPRWGRGSETPGEDVLRIKRYAEYITRGLDGPVPNEQRRIIS 203

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKHY   D ++WNG  R+ FNA+++ QDL + Y  PF+ C  + KV S+MC+YN VNG 
Sbjct: 204 TCKHYAGNDFEDWNGTTRHDFNAKITMQDLAEYYLKPFQQCARDSKVGSIMCAYNAVNGV 263

Query: 122 PTCADPDILKNTIHGQWRL---DGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
           P+CA+  +L+  +   W+    + Y+ SDC++V  +    HY  T     A   +AG+D 
Sbjct: 264 PSCANKYLLQTILRDHWKWTEHNNYVTSDCEAVLDVSANHHYAPTNAAGTAICFEAGMDT 323

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
            C    +    GA   GLL+EE V+ AL       +R G FDGE +   + +L  + V +
Sbjct: 324 SCEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDGEEAM--YADLDWQHVNS 381

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
              Q LALQAA +G+VLLKN+  TLPL     H +A+IG  +D    + G Y+G A    
Sbjct: 382 AEAQSLALQAAVEGMVLLKNNG-TLPLDLDPSHKIAMIGFWADAPEKLQGGYSGRAHHLY 440

Query: 299 TPLQGISRYAKTIHQAGCFGVACNG---NQLIGAAEVAARQADATVLVMGLDQSIEAEFI 355
           +P     +    I  A    +  N    N    A E AA  AD  +   GLD S   E +
Sbjct: 441 SPAFAARQLGLDITVASGPVLQDNNASDNWTTNALE-AASGADYILYFGGLDTSAAGETL 499

Query: 356 DRAGLLLPGRQQELV 370
           DR  L  P  Q  LV
Sbjct: 500 DRTDLDWPEAQLTLV 514


>gi|53712125|ref|YP_098117.1| beta-xylosidase [Bacteroides fragilis YCH46]
 gi|52214990|dbj|BAD47583.1| beta-xylosidase [Bacteroides fragilis YCH46]
          Length = 722

 Score =  237 bits (604), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 144/426 (33%), Positives = 228/426 (53%), Gaps = 22/426 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYWSP +N+ RDPRWGR +ET GEDP LT +   ++V+GLQG+  + LK  A  KH+ 
Sbjct: 148 GLTYWSPTINMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFV 207

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           A + +N    +R+  ++++  + L + Y   ++ACV E    SVM +YN  NG P     
Sbjct: 208 ANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L + +  +W  DG++VSDC ++GV+        + EEAAA  + +G DL+CG      
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEK 323

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              AV  GL+ E  ++ AL   +T + +LG FD      P+ +   + +      +LA +
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFD-PMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG---- 303
           AA + +VLLKN A  LPL+  +  +VAV+GP +D     +G Y+G      + L+G    
Sbjct: 383 AAVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKEL 439

Query: 304 ISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLP 363
           I +  K  +  G    A +  Q++       + AD  ++ +G D+ +  E  D   + LP
Sbjct: 440 IGKKGKVTYLNGMGTSADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLP 492

Query: 364 GRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVL 423
             Q++L+  + + +   +VLV   G P+   +A  D  I AI+   YPGQ  G A+A++L
Sbjct: 493 EGQEKLLKEIYQVN-PRIVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLL 549

Query: 424 FGRANP 429
           FG  NP
Sbjct: 550 FGNENP 555


>gi|374313710|ref|YP_005060140.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
 gi|358755720|gb|AEU39110.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
          Length = 883

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 122/314 (38%), Positives = 182/314 (57%), Gaps = 11/314 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT WSPN+NIFRDPRWGRGQET GEDP LT +   ++V GLQG+  +  +     KH+ 
Sbjct: 142 GLTIWSPNINIFRDPRWGRGQETYGEDPFLTARLGTAFVEGLQGDDPNYYRAIGTPKHFA 201

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +   +    +R+ FNA  S  DL DTY   F+A +VEGK  S+MC+YN + GKP CA  
Sbjct: 202 VH---SGPESERHRFNADPSPHDLWDTYLPAFRATIVEGKAGSIMCAYNAIEGKPACASD 258

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNT--QHYTRTPEEAAADAIKAGLDLDCGPFLA 185
            +L   +   W   G++ SDC ++   +     HY++  E+A+ D I+AG D +CG    
Sbjct: 259 LLLDEVLRKDWAFKGFVTSDCGAIDNFFEKDGHHYSKDAEQASVDGIRAGTDTNCGGTYR 318

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
            +   AVR G+++E ++++ L      + +LG+FD  PS   + ++   +  + +H +LA
Sbjct: 319 -NLASAVRKGMIQESELDVPLRRLFLARFKLGLFD-PPSQVKYASMPITENMSSSHTELA 376

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           LQAA + +VLLKN   TLPL   R  T+AVIGPN+   +++ GNY  +       + GI+
Sbjct: 377 LQAAREAVVLLKNEHHTLPLDA-RVKTIAVIGPNASSLISLEGNYNAIPKNPVMQVDGIA 435

Query: 306 RY---AKTIHQAGC 316
           R    AK ++  G 
Sbjct: 436 REFRDAKVLYAQGS 449



 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 47/114 (41%), Positives = 63/114 (55%), Gaps = 13/114 (11%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFID----------RAGLLLPGRQQELVSRVAK 375
           L   A  A +QADA V  +GL   +E E +D          R  L+LP  QQ+L+   AK
Sbjct: 602 LRAQAMEAVKQADAVVAFVGLSPELEGEEMDVHIPGFSGGDRTDLVLPAAQQQLL-EAAK 660

Query: 376 ASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           AS  P+V+VL+ G  + V++A+      AIL   YPGQAG  AIA+ L G+ NP
Sbjct: 661 ASGKPLVVVLLNGSALAVNWAQE--HADAILEAWYPGQAGAQAIAETLSGKNNP 712


>gi|423281966|ref|ZP_17260851.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
           615]
 gi|404582453|gb|EKA87147.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
           615]
          Length = 722

 Score =  236 bits (603), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 143/426 (33%), Positives = 227/426 (53%), Gaps = 22/426 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYWSP +N+ RDPRWGR +ET GEDP LT +   ++V+GLQG+  + LK  A  KH+ 
Sbjct: 148 GLTYWSPTINMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFV 207

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           A + +N    +R+  ++++  + L + Y   ++ACV E    SVM +YN  NG P     
Sbjct: 208 ANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L + +  +W  DG++VSDC ++GV+        + EEAAA  + +G DL+CG      
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEK 323

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              AV  GL+ E  ++ AL   +T + +LG FD      P+ +   + +      +LA +
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFD-PMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG---- 303
           AA + +VLLKN A  LPL+  +  +VAV+GP +D     +G Y+G      + L+G    
Sbjct: 383 AAVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKEL 439

Query: 304 ISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLP 363
           I +  K  +  G    A +  Q++       + AD  ++ +G D+ +  E  D   + LP
Sbjct: 440 IGKKGKVTYLNGMGTSADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLP 492

Query: 364 GRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVL 423
             Q++L+  + + +   + LV   G P+   +A  D  I AI+   YPGQ  G A+A++L
Sbjct: 493 EEQEKLLKEIYQVNPR-IALVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLL 549

Query: 424 FGRANP 429
           FG  NP
Sbjct: 550 FGNENP 555


>gi|197106390|ref|YP_002131767.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
 gi|196479810|gb|ACG79338.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
          Length = 888

 Score =  236 bits (603), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 144/356 (40%), Positives = 193/356 (54%), Gaps = 25/356 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT WSPN+NIFRDPRWGRGQET GEDP LTG+   +++RGLQG   +  K  A  KHY 
Sbjct: 144 GLTVWSPNINIFRDPRWGRGQETYGEDPYLTGRMGVAFIRGLQGQDPNFFKTIATAKHYA 203

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +     N   R+  +   S  DLEDTY   F+A V EGKV +VMC+YN V+G P CA  
Sbjct: 204 VHSGPESN---RHREDVHPSAYDLEDTYLPAFRAAVTEGKVQAVMCAYNAVDGVPACASE 260

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLY--NTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
           D++   +   W   G++VSDC +   +Y  ++  Y +TPEE    A+ AG+DL CG + A
Sbjct: 261 DLMDQRLRRDWGFSGHVVSDCGAAANIYREDSLAYVKTPEEGITRALNAGMDLVCGDYRA 320

Query: 186 -IHTEG-----AVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
             +TE      AVR G+L E  ++ AL      ++RLG+FD  P+  PF  +      TP
Sbjct: 321 DWNTEAEATVSAVRKGMLDETVLDGALVRLFADRIRLGLFD-PPAEVPFSKITAAQNDTP 379

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
            H+ ++L+ A   + LLKN    LPL       +AV+GPN+D    +IGNY G      T
Sbjct: 380 EHRAMSLEMAKASMTLLKNDG-VLPLKG-EPRRIAVVGPNADSVDALIGNYYGTPSNPVT 437

Query: 300 PLQGI-SRYAKTIHQAGCFGVACNGNQLIGAAEV----AARQADATVLVMGLDQSI 350
            L GI +R+ K         V   G  L+G A +    A   ADA     GL Q +
Sbjct: 438 VLAGIRARFPK------AEVVYAEGTGLVGPASLPVPDAVLCADAACRTKGLKQEV 487



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 54/102 (52%), Gaps = 13/102 (12%)

Query: 338 DATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMC 387
           D  V V GL   +E E +          DR  L LP  QQ+L+ R+  A+  PVVLVLM 
Sbjct: 613 DLVVFVGGLTARVEGEEMKLQVPGFAGGDRTSLDLPAPQQDLLRRL-HATGKPVVLVLMN 671

Query: 388 GGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           G  + V++A  D  + AI+   YPG  GG A+A +L G  +P
Sbjct: 672 GSALSVNWA--DANLPAIVEAWYPGGEGGHAVAQLLAGDYSP 711


>gi|381170979|ref|ZP_09880130.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
 gi|380688543|emb|CCG36617.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
          Length = 901

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 132/306 (43%), Positives = 181/306 (59%), Gaps = 15/306 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA------- 60
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQG   +  K A       
Sbjct: 139 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPKNAQGEPYRK 198

Query: 61  --ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
             A  KH+  +        DR+HF+AR S++DL +TY   F+A V EGKV +VM +YN+V
Sbjct: 199 LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRV 255

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            G+   A   +L++ +  QW   GY+VSDC ++  ++       T E+AAA A+K G +L
Sbjct: 256 YGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTEL 315

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG   A     AVR GL+ E  ++ AL   +T +MRLGMFD  P   P+  +      +
Sbjct: 316 ECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLPWSTIPASVNQS 373

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PAH  LA + A + +VLLKN    LPLS  +   +AVIGP +D T+ ++GNY G      
Sbjct: 374 PAHDALARRTARESLVLLKNDG-LLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPV 432

Query: 299 TPLQGI 304
           T LQGI
Sbjct: 433 TVLQGI 438



 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/107 (41%), Positives = 56/107 (52%), Gaps = 13/107 (12%)

Query: 334 ARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVL 383
           AR AD  V V GL   +E E +          DR  L LP  Q++L+  +    R PVV 
Sbjct: 632 ARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGR-PVVA 690

Query: 384 VLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           VL  G  + + +A+    + AIL   YPGQ GG A+AD LFG ANPG
Sbjct: 691 VLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPG 735


>gi|289668505|ref|ZP_06489580.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
           NCPPB 4381]
          Length = 902

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 130/306 (42%), Positives = 181/306 (59%), Gaps = 15/306 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG---------SRLK 58
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++VRGLQG  G         S  K
Sbjct: 140 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESYRK 199

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           + A  KH+  +        DR+HF+AR S++DL +TY   F+A V +GKV +VM +YN+V
Sbjct: 200 LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRV 256

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            G+   A   +L++ +  QW   GY+VSDC ++  ++       T E+AAA A+K G +L
Sbjct: 257 YGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTEL 316

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG   +     AV  GL+ E  ++ +L   +T +MRLGMFD  P   P+  +      +
Sbjct: 317 ECGEEYST-LPAAVHQGLIEEAQIDTSLQTLMTARMRLGMFD-PPGQLPWSKIPASVNQS 374

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PAH  LA + A + +VLLKN    LPLS  +   +AVIGP +D T+ ++GNY G      
Sbjct: 375 PAHDALARRTARESLVLLKNDG-LLPLSRTKLKRIAVIGPTADDTMALLGNYYGTPAAPV 433

Query: 299 TPLQGI 304
           T LQGI
Sbjct: 434 TVLQGI 439



 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 44/107 (41%), Positives = 58/107 (54%), Gaps = 13/107 (12%)

Query: 334 ARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVL 383
           AR A+  V V GL   +E E +          DR  L LP  Q+EL+  + +A+  PVV 
Sbjct: 633 ARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEAL-QATGKPVVA 691

Query: 384 VLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           VL  G  + + +A+    + AIL   YPGQ GG A+AD LFG ANPG
Sbjct: 692 VLTAGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVADTLFGDANPG 736


>gi|340368019|ref|XP_003382550.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 742

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 162/441 (36%), Positives = 232/441 (52%), Gaps = 22/441 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL+ WSP +NI RDPRWGR QET GEDP L+G    +YV GLQGN    +   A CKH+ 
Sbjct: 138 GLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAYVNGLQGNNSRYIIANAGCKHFD 197

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +         R+ F+A+VS +D   T+   FKACV  G + S+MCSYN++NG P CA+ 
Sbjct: 198 VHGGPENIPTSRFSFDAKVSMRDWRMTFLPQFKACVEAGAL-SLMCSYNRINGVPACANK 256

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG----PF 183
            +L + +  +W   GY+VSD  ++  +    HY     +AAADA  AG  L+ G     F
Sbjct: 257 ALLTDILRNEWDFKGYVVSDQGALEFIVIEHHYAPDFMKAAADAANAGTCLEDGNIGRKF 316

Query: 184 LAI--HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAH 241
             +  H   AV+  L+  + +  A++    V+M+LG FD  P   P+ N+    + + AH
Sbjct: 317 FNVFEHLVDAVKNNLVSVDTLKNAVSRLFYVRMKLGEFD-PPDNNPYANIPLSVIQSDAH 375

Query: 242 QQLALQAAHQGIVLLKNS----ARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG- 296
             L+LQAA + IVL+KN     +  LP+ T       ++GP SD    + G+Y+      
Sbjct: 376 INLSLQAAMESIVLMKNDDGFRSPFLPI-TNEVKKACMVGPFSDDPEVLFGDYSPTLMRD 434

Query: 297 -YTTPLQGISRY---AKTIHQA-GCF-GVACNGNQLIGAAEVAARQADATVLVMGLDQSI 350
              T L G+        T++ A GC  G AC  N        A    +  ++  GL + +
Sbjct: 435 YVITSLAGLKNANIGTDTLNYAVGCEDGPACR-NYDSAKVRSACDGVELIIVTAGLSKHL 493

Query: 351 EAEFIDRAGLLLPGRQQELVSRVAKASR-GPVVLVLMCGGPVDVSFAKNDPRIGAILWVG 409
           E+E  D + + LPG Q +L+     AS+   V+L+L    P+D+ +AK DPRI  IL   
Sbjct: 494 ESEGKDLSDINLPGHQLDLMQDAEAASKNASVILILFNASPLDIRYAKTDPRIVGILEAY 553

Query: 410 YPGQAGGAAIADVLFGRANPG 430
           YPGQ  G AIA+VL G  NP 
Sbjct: 554 YPGQTAGKAIANVLTGEYNPS 574


>gi|418518550|ref|ZP_13084692.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
 gi|418522850|ref|ZP_13088880.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
 gi|410700720|gb|EKQ59264.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
 gi|410703176|gb|EKQ61671.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
          Length = 901

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 132/306 (43%), Positives = 182/306 (59%), Gaps = 15/306 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN--------TGSRL-K 58
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQG          G R  K
Sbjct: 139 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGERYRK 198

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           + A  KH+  +        DR+HF+AR S++DL +TY   F+A V +GKV +VM +YN+V
Sbjct: 199 LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRV 255

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            G+   A   +L++ +  QW   GY+VSDC ++  ++       T E+AAA A+K G +L
Sbjct: 256 YGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTEL 315

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG   A     AVR GL+ E  ++ AL   +T +MRLGMFD  P   P+  +      +
Sbjct: 316 ECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLPWSTIPASVNQS 373

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PAH  LA + A + +VLLKN    LPLS  +   +AVIGP +D T+ ++GNY G      
Sbjct: 374 PAHDALARRTARESLVLLKNDG-LLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPV 432

Query: 299 TPLQGI 304
           T LQGI
Sbjct: 433 TVLQGI 438



 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/107 (41%), Positives = 58/107 (54%), Gaps = 13/107 (12%)

Query: 334 ARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVL 383
           AR AD  V V GL   +E E +          DR  L LP  Q++L+  + +A+  PVV 
Sbjct: 632 ARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QATGKPVVA 690

Query: 384 VLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           VL  G  + + +A+    + AIL   YPGQ GG A+AD LFG ANPG
Sbjct: 691 VLTAGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPG 735


>gi|386718620|ref|YP_006184946.1| glucan 1,4-beta-glucosidase [Stenotrophomonas maltophilia D457]
 gi|384078182|emb|CCH12773.1| Glucan 1,4-beta-glucosidase [Stenotrophomonas maltophilia D457]
          Length = 897

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 135/317 (42%), Positives = 183/317 (57%), Gaps = 7/317 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           +G   GLT+WSPNVNIFRDPRWGRGQET GEDP LT +   ++VRGLQG+     K+ A 
Sbjct: 138 HGRYQGLTFWSPNVNIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQGDDPVYRKLDAT 197

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
            KH   +        DR+HF+AR S++DL DTY   F+A V EG V +VM +YN+V G+ 
Sbjct: 198 AKHLAVHSGPE---ADRHHFDARPSRRDLYDTYLPAFEALVKEGDVDAVMGAYNRVYGES 254

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP 182
             A   +L++ +   W   GY+VSDC ++  ++   H   T E AAA A++ G +L+CG 
Sbjct: 255 ASASRFLLRDVLRRDWGFKGYVVSDCWAIVDIWKHHHIVTTREAAAALAVRNGTELECGQ 314

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
             A     AVR GL+ E +++ A+    T +MRLGMFD  P    +  +       P+H 
Sbjct: 315 EYAT-LPSAVRQGLISEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQAPSHD 372

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
            LAL+AA   +VLLKN    LPLS      +AV+GP +D T+ ++GNY G      T LQ
Sbjct: 373 ALALKAAQASLVLLKNDG-ILPLSR-DIKRIAVVGPTADDTMALLGNYFGTPAAPVTILQ 430

Query: 303 GISRYAKTIHQAGCFGV 319
           GI   AK +      GV
Sbjct: 431 GIREAAKGVEVRYARGV 447



 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/107 (39%), Positives = 56/107 (52%), Gaps = 13/107 (12%)

Query: 333 AARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVV 382
           AAR+AD  V V GL   +E E +          DR  L LP  Q+ L+  +  A+  PVV
Sbjct: 625 AAREADVVVFVGGLTGDVEGEEMTVNYPGFAGGDRTDLRLPAPQRTLLEAL-HATGKPVV 683

Query: 383 LVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +VL  G  + V +A++   + AIL   YPGQ GG A+   LFG  NP
Sbjct: 684 MVLTGGSAIAVDWAQS--HLPAILMSWYPGQRGGTAVGQALFGDVNP 728


>gi|289666226|ref|ZP_06487807.1| beta-glucosidase precursor [Xanthomonas campestris pv. vasculorum
           NCPPB 702]
          Length = 902

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 130/306 (42%), Positives = 181/306 (59%), Gaps = 15/306 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG---------SRLK 58
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++VRGLQG  G         S  K
Sbjct: 140 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESYRK 199

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           + A  KH+  +        DR+HF+AR S++DL +TY   F+A V +GKV +VM +YN+V
Sbjct: 200 LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRV 256

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            G+   A   +L++ +  QW   GY+VSDC ++  ++       T E+AAA A+K G +L
Sbjct: 257 YGESASASKFLLQDLLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTEL 316

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG   +     AV  GL+ E  ++ +L   +T +MRLGMFD  P   P+  +      +
Sbjct: 317 ECGEEYST-LPAAVHQGLIEEAQIDTSLQTLMTARMRLGMFD-PPGQLPWSKIPASVNQS 374

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PAH  LA + A + +VLLKN    LPLS  +   +AVIGP +D T+ ++GNY G      
Sbjct: 375 PAHDALARRTARESLVLLKNDG-LLPLSRTKLKRIAVIGPTADDTMALLGNYYGTPAAPV 433

Query: 299 TPLQGI 304
           T LQGI
Sbjct: 434 TVLQGI 439



 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 44/107 (41%), Positives = 58/107 (54%), Gaps = 13/107 (12%)

Query: 334 ARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVL 383
           AR A+  V V GL   +E E +          DR  L LP  Q+EL+  + +A+  PVV 
Sbjct: 633 ARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEAL-QATGKPVVA 691

Query: 384 VLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           VL  G  + + +A+    + AIL   YPGQ GG A+AD LFG ANPG
Sbjct: 692 VLTAGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVADTLFGDANPG 736


>gi|389794400|ref|ZP_10197553.1| beta-glucosidase-related glycosidase [Rhodanobacter fulvus Jip2]
 gi|388432423|gb|EIL89432.1| beta-glucosidase-related glycosidase [Rhodanobacter fulvus Jip2]
          Length = 902

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 130/306 (42%), Positives = 180/306 (58%), Gaps = 7/306 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           +G   GLTYWSPN+NIFRDPRWGRGQET GEDP LT +   ++V GLQG+  +  K+ A 
Sbjct: 147 HGRYEGLTYWSPNINIFRDPRWGRGQETYGEDPYLTERMGVAFVTGLQGDNPTYRKLDAT 206

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
            KH+  +   +    DR+HF+   S++DL +TY   F+  V E  V +VM +YN+VNG+P
Sbjct: 207 AKHFAVH---SGPEADRHHFDVHPSERDLYETYLPAFQTLVQEADVDAVMSAYNRVNGEP 263

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP 182
               P +L   +   W   GY+VSDC +V  +Y       T E A+A A+K G+DLDCG 
Sbjct: 264 ATGSPRLLGQILRKDWGFKGYVVSDCGAVEDIYKHHKVVDTVEAASALAVKNGVDLDCGT 323

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
             A   + AV  GL++E +++ AL   +  +MRLGMFD   S  P+ ++      +P H 
Sbjct: 324 EYAALVK-AVHDGLIKESEIDAALTRLMQARMRLGMFD-PASKVPWSDVPYSVNQSPQHD 381

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
            LA +AA + +VLLKN    LPLS    H +AVIGP +D  + ++GNY G      T L+
Sbjct: 382 ALARRAARESMVLLKNDG-VLPLSKDIKH-IAVIGPTADDVMALVGNYHGTPADPVTILR 439

Query: 303 GISRYA 308
           GI   A
Sbjct: 440 GIREAA 445



 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 39/99 (39%), Positives = 54/99 (54%), Gaps = 13/99 (13%)

Query: 341 VLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGP 390
           V   GL   +E E +          DR  L LP  Q++L+  + +A+  PVVLVL  G  
Sbjct: 642 VFAGGLTSDVEGEEMKVNYPGFAGGDRTDLRLPATQRKLLEAL-QATGKPVVLVLTSGSA 700

Query: 391 VDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           + V +A     + A+L   YPGQ GG A+ADVLFG+A+P
Sbjct: 701 LAVDWANQ--HLPAVLLAWYPGQRGGNAVADVLFGKADP 737


>gi|423269271|ref|ZP_17248243.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
           CL05T00C42]
 gi|423273165|ref|ZP_17252112.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
           CL05T12C13]
 gi|392701693|gb|EIY94850.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
           CL05T00C42]
 gi|392708197|gb|EIZ01305.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
           CL05T12C13]
          Length = 722

 Score =  235 bits (600), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 143/426 (33%), Positives = 228/426 (53%), Gaps = 22/426 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYWSP +N+ RDPRWGR +ET GE+P LT +   ++V+GLQG+  + LK  A  KH+ 
Sbjct: 148 GLTYWSPTINMARDPRWGRNEETYGEEPHLTSRLGVAFVKGLQGDHPTYLKTVATIKHFV 207

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           A + +N    +R+  ++++  + L + Y   ++ACV E    SVM +YN  NG P     
Sbjct: 208 ANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L + +  +W  DG++VSDC ++GV+        + EEAAA  + +G DL+CG      
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEK 323

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              AV  GL+ E  ++ AL   +T + +LG FD      P+ +   + +      +LA +
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFD-PMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG---- 303
           AA + +VLLKN A  LPL+  +  +VAV+GP +D     +G Y+G      + L+G    
Sbjct: 383 AAVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQPPYSVSLLKGVKEL 439

Query: 304 ISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLP 363
           I +  K  +  G    A +  Q++       + AD  ++ +G D+ +  E  D   + LP
Sbjct: 440 IGKKGKVTYLNGMGTSADSIAQVV-------KGADIVLVALGSDEKMARENHDMPSIYLP 492

Query: 364 GRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVL 423
             Q++L+  + + +   +VLV   G P+   +A  D  I AI+   YPGQ  G A+A++L
Sbjct: 493 EGQEKLLKEIYQVN-PRIVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLL 549

Query: 424 FGRANP 429
           FG  NP
Sbjct: 550 FGNENP 555


>gi|167537541|ref|XP_001750439.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163771117|gb|EDQ84789.1| predicted protein [Monosiga brevicollis MX1]
          Length = 834

 Score =  235 bits (600), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 146/426 (34%), Positives = 224/426 (52%), Gaps = 12/426 (2%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ-GNTGSRLKVAACCKHY 66
           GL  WSP +N  RDPRWGR  E+PGE P + G+Y A+Y  GLQ G+     +     KH+
Sbjct: 214 GLDTWSPTINPSRDPRWGRNVESPGESPFVCGQYGAAYTEGLQNGDDKDYTQAVVTLKHW 273

Query: 67  TAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
            AY +++++ V RY +NA VS+ DL DTY   ++  V   K   VMCSYN +NG PTC +
Sbjct: 274 VAYSVEDYDNVTRYEYNAIVSEYDLMDTYFPGWEYVVKNAKPLGVMCSYNSLNGVPTCGN 333

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAI 186
           P  L   +   W  +GYI SD DS+  ++   HY      A  D +  G D+D G   A 
Sbjct: 334 P-ALTAYLREDWGFEGYITSDSDSIHCIWADHHYESNAVLATRDGLLGGCDIDSGDTYAD 392

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLAL 246
           + E AV   L+    V+ AL  +  ++  LG+FD   +   +  +   +V   + Q+ +L
Sbjct: 393 NLEAAVNQSLVNRSAVDAALTNSYRMRFNLGLFDPNVT-NAYDRISADEVGMSSSQETSL 451

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC--GYTTPLQGI 304
            AA + + LLKN  +TLP +T +   VAVIG +S+    ++GNY G  C  G    +Q +
Sbjct: 452 LAARKSMTLLKNDGQTLPFATGKK--VAVIGKSSNSAEDILGNYVGPICPSGAFDCVQTL 509

Query: 305 SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPG 364
            +     +Q G   ++ +   +  A ++A       + +    Q+ E +  DR  + L  
Sbjct: 510 YQGVAAANQGGATTLSDDVADINTAIQLAMDADQVVLTISNYGQAGEGK--DRTYIGLDT 567

Query: 365 RQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLF 424
            QQELV+ V K  + P  +V++ GG + + + K++ +  AIL    PG  GG A+A+ +F
Sbjct: 568 DQQELVAAVLKVGK-PTAIVMLNGGLISLDWIKDEAQ--AILVAFAPGVHGGQAVAETIF 624

Query: 425 GRANPG 430
           G  NPG
Sbjct: 625 GANNPG 630


>gi|21244948|ref|NP_644530.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
 gi|21110666|gb|AAM39066.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
          Length = 901

 Score =  235 bits (600), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 132/306 (43%), Positives = 180/306 (58%), Gaps = 15/306 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA------- 60
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQG   +  K A       
Sbjct: 139 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPKNAQGEPYRK 198

Query: 61  --ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
             A  KH   +        DR+HF+AR S++DL +TY   F+A V EGKV +VM +YN+V
Sbjct: 199 LDATAKHLAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRV 255

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            G+   A   +L++ +  QW   GY+VSDC ++  ++       T E+AAA A+K G +L
Sbjct: 256 YGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTEL 315

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG   A     AVR GL+ E  ++ AL   +T +MRLGMFD  P   P+  +      +
Sbjct: 316 ECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLPWSTIPASVNQS 373

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PAH  LA + A + +VLLKN    LPLS  +   +AVIGP +D T+ ++GNY G      
Sbjct: 374 PAHDALARRTARESLVLLKNDG-LLPLSRAKFKRIAVIGPTADDTMALLGNYYGTPAAPV 432

Query: 299 TPLQGI 304
           T LQGI
Sbjct: 433 TVLQGI 438



 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/107 (41%), Positives = 56/107 (52%), Gaps = 13/107 (12%)

Query: 334 ARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVL 383
           AR AD  V V GL   +E E +          DR  L LP  Q++L+  +    R PVV 
Sbjct: 632 ARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGR-PVVA 690

Query: 384 VLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           VL  G  + + +A+    + AIL   YPGQ GG A+AD LFG ANPG
Sbjct: 691 VLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPG 735


>gi|300777563|ref|ZP_07087421.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
 gi|300503073|gb|EFK34213.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
          Length = 896

 Score =  235 bits (599), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 126/319 (39%), Positives = 182/319 (57%), Gaps = 13/319 (4%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G   GLT+W+PN+NIFRDPRWGRGQET GEDP LT     + V+GLQGN     K  AC 
Sbjct: 148 GRYEGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQGNDPKFFKTHACA 207

Query: 64  KHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
           KH+  +    WN   R+ +NA +SK+DL +TY   FKA V EG V  VMC+YN  +G+P 
Sbjct: 208 KHFAVHSGPEWN---RHSYNAEISKRDLYETYLPAFKALVQEGNVREVMCAYNAFDGQPC 264

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEE--AAADAIKAGLDLDCG 181
           CA+  +L   + G+W+ DG +VSDC ++   +  +++   P+E   AADA+K   DL+CG
Sbjct: 265 CANNTLLTEILRGKWKYDGMVVSDCWALADFFQKKYHGTHPDEKTTAADALKHSTDLECG 324

Query: 182 PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC-TPA 240
                +   ++  GL+ E+D++ ++   +     LGM D  P +    N  P  V  +  
Sbjct: 325 DTYN-NLNKSLASGLITEKDIDESMRRILKGWFELGMLD--PKSSVHWNTIPYSVVDSEE 381

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H++ AL+ A + IVL+KN    LPL+      +AV+GPN+D  +  +GNY G      T 
Sbjct: 382 HKKQALKMAQKSIVLMKNEKNILPLNR-NIKKIAVVGPNADDGLMQLGNYNGTPSSIVTI 440

Query: 301 LQGISRY---AKTIHQAGC 316
           L GI      A+ I++ G 
Sbjct: 441 LDGIKTKFPNAEIIYEKGS 459



 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 54/106 (50%), Gaps = 13/106 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  V   GL  S+E E +          D+  + LP  Q++L++ + K  + PVV V
Sbjct: 621 KNADVIVFAGGLSPSLEGEEMMVNAEGFKGGDKTSIALPKVQRDLLAELRKTGK-PVVFV 679

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           L  G  + +   +++    A+L   Y GQ+GG A+ADVL G  NP 
Sbjct: 680 LCTGSALGLE--QDEKNYDALLNAWYGGQSGGTAVADVLAGDYNPS 723


>gi|405968899|gb|EKC33925.1| Putative beta-D-xylosidase 5 [Crassostrea gigas]
          Length = 748

 Score =  235 bits (599), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 152/439 (34%), Positives = 235/439 (53%), Gaps = 21/439 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL+ +SP VNI R P WGR QET GEDP L+G YA+ +V+GLQG+    ++  A CKH+ 
Sbjct: 139 GLSCFSPVVNIMRHPLWGRNQETYGEDPYLSGTYASYFVQGLQGDHDRYIQANAGCKHFD 198

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           A+         R  F+A+VS +DL  T+   F+ CV  G   S+MCSYN +NG P C++ 
Sbjct: 199 AHGGPEDIPESRMGFDAKVSMRDLRLTFLPAFQKCVQAG-AYSLMCSYNSINGVPACSNK 257

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA-- 185
            ++ + + G+W   GY+VSD  ++    +  HY    E+AAA ++ AG +L+    L   
Sbjct: 258 LLMMDILRGEWNFTGYVVSDEGAIENQISFHHYYNNSEDAAAGSVNAGCNLELSGNLTEP 317

Query: 186 --IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
             +    AV+ G L E  V   +      +MRLG FD  P   P+ ++    + +  H+ 
Sbjct: 318 VFMKIGDAVKSGKLEESVVRNRVKPLFYTRMRLGEFD-PPEMNPYSSVNLSVIQSEEHRN 376

Query: 244 LALQAAHQGIVLLKN----SARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA--CGY 297
           L+L AA + +VLLK     S R L +       +AVIGP ++ T  + G+Y+        
Sbjct: 377 LSLTAAAKSLVLLKRPSKFSKRHL-IGGFPSERMAVIGPMANNTDQIFGDYSPTTDPRFV 435

Query: 298 TTPLQGISRYAKTI-HQAGCF-GVAC-NGNQLIGAAEVAARQADATVLVMGLDQSIEAEF 354
            TPL+G++    ++ + AGC  G  C N +Q     + A   AD  V+ +G  + +E+E 
Sbjct: 436 KTPLKGLTELNFSMNYAAGCVDGTRCLNYSQ--DDVKTALVGADLVVVCLGTGKDLESEN 493

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
           +DR  ++LPG+Q +L+  V   +   V L++   GPV++++A+   R+  IL   YP Q+
Sbjct: 494 VDRKDMMLPGKQLQLLQDVVSMTNKAVYLLVFSAGPVNITWAQESERVLIILQCFYPAQS 553

Query: 415 GGAAIADVLF---GRANPG 430
            G AI   L    GR NP 
Sbjct: 554 AGDAITQALIMRDGRFNPA 572


>gi|408824590|ref|ZP_11209480.1| Glucan 1,4-beta-glucosidase [Pseudomonas geniculata N1]
          Length = 897

 Score =  234 bits (598), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 135/317 (42%), Positives = 182/317 (57%), Gaps = 7/317 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           +G   GLT+WSPNVNIFRDPRWGRGQET GEDP LT +   ++VRGLQG+     K+ A 
Sbjct: 138 HGRYQGLTFWSPNVNIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQGDDPVYRKLDAT 197

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
            KH   +        DR+HF+AR S++DL DTY   F+A V EG V +VM +YN+V G+ 
Sbjct: 198 AKHLAVHSGPE---ADRHHFDARPSRRDLYDTYLPAFEALVKEGDVDAVMGAYNRVYGES 254

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP 182
             A   +L++ +   W   GY+VSDC ++  ++       T E AAA A++ G +L+CG 
Sbjct: 255 ASASRFLLRDVLRRDWGFKGYVVSDCWAIVDIWKHHRIVTTREAAAALAVRNGTELECGQ 314

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
             A     AVR GL+ E +++ A+    T +MRLGMFD  P    +  +       PAH 
Sbjct: 315 EYAT-LPSAVRQGLISEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQAPAHD 372

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
            LAL+AA   +VLLKN    LPLS      +AV+GP +D T+ ++GNY G      T LQ
Sbjct: 373 ALALKAAQASLVLLKNDG-ILPLSR-NTRRIAVVGPTADDTMALLGNYFGTPAAPVTILQ 430

Query: 303 GISRYAKTIHQAGCFGV 319
           GI   AK +      GV
Sbjct: 431 GIREAAKGVEVRYARGV 447



 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 36/99 (36%), Positives = 48/99 (48%), Gaps = 13/99 (13%)

Query: 341 VLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGP 390
           V V GL   +E E +          DR  L LP  Q+ L+  +    + PVV+VL  G  
Sbjct: 633 VFVGGLTGDVEGEEMTVNYPGFAGGDRTDLRLPAPQRTLLEALHGTGK-PVVMVLTGGSA 691

Query: 391 VDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           + V +A+    + AIL   YPGQ GG A+   LFG  NP
Sbjct: 692 IAVDWAQA--HLPAILMSWYPGQRGGTAVGQALFGDVNP 728


>gi|390991557|ref|ZP_10261819.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
 gi|372553724|emb|CCF68794.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
          Length = 901

 Score =  234 bits (598), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 131/306 (42%), Positives = 181/306 (59%), Gaps = 15/306 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA------- 60
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQG   +  K A       
Sbjct: 139 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPKNAQGEPYRK 198

Query: 61  --ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
             A  KH+  +        DR+HF+AR S++DL +TY   F+A V +GKV +VM +YN+V
Sbjct: 199 LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRV 255

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            G+   A   +L++ +  QW   GY+VSDC ++  ++       T E+AAA A+K G +L
Sbjct: 256 YGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTEL 315

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG   A     AVR GL+ E  ++ AL   +T +MRLGMFD  P   P+  +      +
Sbjct: 316 ECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLPWSTIPASVNQS 373

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PAH  LA + A + +VLLKN    LPLS  +   +AVIGP +D T+ ++GNY G      
Sbjct: 374 PAHDALARRTARESLVLLKNDG-LLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPV 432

Query: 299 TPLQGI 304
           T LQGI
Sbjct: 433 TVLQGI 438



 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/107 (41%), Positives = 56/107 (52%), Gaps = 13/107 (12%)

Query: 334 ARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVL 383
           AR AD  V V GL   +E E +          DR  L LP  Q++L+  +    R PVV 
Sbjct: 632 ARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGR-PVVA 690

Query: 384 VLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           VL  G  + + +A+    + AIL   YPGQ GG A+AD LFG ANPG
Sbjct: 691 VLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPG 735


>gi|315499711|ref|YP_004088514.1| beta-glucosidase [Asticcacaulis excentricus CB 48]
 gi|315417723|gb|ADU14363.1| Beta-glucosidase [Asticcacaulis excentricus CB 48]
          Length = 869

 Score =  234 bits (597), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 136/297 (45%), Positives = 175/297 (58%), Gaps = 10/297 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT WSPN+NIFRDPRWGRGQET GEDP LT + A  +V GLQG      KV A  KH  
Sbjct: 130 GLTLWSPNINIFRDPRWGRGQETYGEDPFLTSRLAEGFVTGLQGPDPQHPKVVASVKHLA 189

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +         R+ F A VS  DLE TY   F+  V+  K  SVMC+YN V G P CA  
Sbjct: 190 VHSGPE---AGRHGFAASVSPYDLEMTYLPAFRYSVMTTKAQSVMCAYNAVGGVPACASD 246

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +LK  +   W   GY+V+DCD++  +     Y     E++A+++KAG+DL+CG   A  
Sbjct: 247 LLLKTYVREAWGFKGYVVTDCDAIYDMTRFHFYRLNDAESSAESLKAGVDLNCGNAYAAL 306

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
            E AV+ GL+ E  ++ +L   + V+ RLG+ DG PS  P+  + P  + TP  Q LALQ
Sbjct: 307 PE-AVQKGLIPESLMDQSLNRLLDVRKRLGI-DGAPS--PWARISPEAINTPQAQGLALQ 362

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           AA Q +VLLKN+   LPL      TVAVIGPN+D   T+ GNY G+A    TPL G+
Sbjct: 363 AAEQSLVLLKNNG-VLPLKP--GQTVAVIGPNADTEETLRGNYNGIARQPVTPLTGL 416



 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 36/89 (40%), Positives = 49/89 (55%), Gaps = 9/89 (10%)

Query: 341 VLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDP 400
           +LV G D+       DR  L LP  Q++L+  V KA+  P+V+VL+ G  V +++A    
Sbjct: 620 ILVPGFDRG------DRTDLGLPRTQEDLLKAV-KATGKPLVVVLLSGSAVALNWADAHA 672

Query: 401 RIGAILWVGYPGQAGGAAIADVLFGRANP 429
                 W  YPG+AGG AIA  L G ANP
Sbjct: 673 DAVVAAW--YPGEAGGTAIARTLTGEANP 699


>gi|157676888|emb|CAP07659.1| beta-xylosidase [uncultured rumen bacterium]
          Length = 761

 Score =  234 bits (597), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 172/486 (35%), Positives = 239/486 (49%), Gaps = 77/486 (15%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR-LKVAACCKH 65
           AGL++W+PN+NIFRDPRWGRG ET GEDP L G+   + VRGLQG+  +  LK  AC KH
Sbjct: 135 AGLSFWTPNINIFRDPRWGRGMETYGEDPYLMGQLGMAVVRGLQGDPDADVLKTHACAKH 194

Query: 66  YTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
           Y  +     N   R+ F+A+VS++DL +TY   FK  V +  V  VM +YN+  G P  A
Sbjct: 195 YAVHSGLESN---RHRFDAQVSERDLRETYLPAFKDLVTKAGVKEVMTAYNRFRGYPCAA 251

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNT-QH-YTRTPEEAAADAIKAGLDLDCGPF 183
              +++  +  +W   G +VSDC ++   +   +H +  T EEAAA A+  GLD++CG  
Sbjct: 252 SEYLVQKILREEWGYKGLVVSDCWAIPDFFEPGRHGFVATGEEAAALAVANGLDVECGST 311

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            +     A+  GLL+EED++  L   +T + RLG  DGE    P+ +L P  V  P H+ 
Sbjct: 312 FS-KIPAAIDQGLLKEEDLDRNLLRVLTERFRLGEMDGE---SPWDDLDPAIVEGPEHRA 367

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           L+L  A + +VLL+N+   LPL       +A+IGPN+D      GNY  V     T LQ 
Sbjct: 368 LSLDIARETMVLLRNNG-VLPLKA--GEKIALIGPNADDAQMQWGNYNPVPKSTITLLQA 424

Query: 304 I-SRYAKTIHQAGCFGV-----ACNGN---QLIGAA----EVAARQ-------------- 336
           + +R    ++   C G+     A  G+    LIGA+    E AAR+              
Sbjct: 425 MQARVPGLVYDRAC-GILDAEYAPQGSAYANLIGASEAQLEAAARRYAVSVNDIKNYIRR 483

Query: 337 -----------------------ADATVLVMGLDQSIEAEFI----------DRAGLLLP 363
                                   D  V   G+   +E E +          DR  + LP
Sbjct: 484 DEEQRRSFMPALDEAAVLKKLEGVDVVVFAGGISPRLEGEEMRVQVPGFSGGDRTDIELP 543

Query: 364 GRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVL 423
           G Q+ L+  +  A +  VVLV   G    +          AIL   YPGQ GG AIADVL
Sbjct: 544 GVQRRLLKALHDAGK-KVVLVNFSG--CAIGLVPETESCDAILQAWYPGQEGGTAIADVL 600

Query: 424 FGRANP 429
           FG  NP
Sbjct: 601 FGDVNP 606


>gi|294667502|ref|ZP_06732718.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292602731|gb|EFF46166.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 901

 Score =  234 bits (596), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 131/306 (42%), Positives = 180/306 (58%), Gaps = 15/306 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA------- 60
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQG  G   K A       
Sbjct: 139 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGGDAPKNAQGERYRK 198

Query: 61  --ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
             A  KH+  +        DR+HF+A  S++DL +TY   F+A V +GKV +VM +YN+V
Sbjct: 199 LDATAKHFAVHSGPE---ADRHHFDAHPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRV 255

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            G+   A   +L++ +  QW   GY+VSDC ++  ++       T E+AAA A+K G +L
Sbjct: 256 YGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTEL 315

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG   +     AVR GL+ E  ++ AL   +T +MRLGMFD  P   P+  +      +
Sbjct: 316 ECGEEYST-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLPWSQIPASVNQS 373

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PAH  LA + A + +VLLKN    LPLS  R   +AVIGP +D T+ ++GNY G      
Sbjct: 374 PAHDALARRTARESLVLLKNDG-LLPLSRARLKRIAVIGPTADDTMALLGNYYGTPAAPV 432

Query: 299 TPLQGI 304
           T LQGI
Sbjct: 433 TVLQGI 438



 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 43/107 (40%), Positives = 57/107 (53%), Gaps = 13/107 (12%)

Query: 334 ARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVL 383
           AR A+  V V GL   +E E +          DR  L LP  Q++L+  +  A+  PVV 
Sbjct: 632 ARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-HATGKPVVA 690

Query: 384 VLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           VL  G  + + +A+    + AIL   YPGQ GG A+AD LFG ANPG
Sbjct: 691 VLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPG 735


>gi|116181370|ref|XP_001220534.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
 gi|88185610|gb|EAQ93078.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
          Length = 549

 Score =  234 bits (596), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 153/396 (38%), Positives = 215/396 (54%), Gaps = 22/396 (5%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N G+AGL YW+PN+N ++DPRWGRG ETPGEDPV    Y  + + GL+G+  S  KV A 
Sbjct: 143 NAGLAGLDYWTPNINPYKDPRWGRGHETPGEDPVRIKGYVKALLAGLEGDDPSIRKVVAT 202

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           CKHY AYDL+ W G  R+ F+A VS QDL + Y  PF+ C  + KV S MCSYN +NG P
Sbjct: 203 CKHYAAYDLERWQGTTRHRFDAVVSLQDLSEYYLPPFQQCARDSKVGSFMCSYNALNGTP 262

Query: 123 TCADPDILKNTIHGQW---RLDGYIVSDCDSV-----GVLYNTQHYTRTPEEAAADAIKA 174
            CA   ++ + +   W     + YI SDC+++     G  ++    T+T  EAAA A +A
Sbjct: 263 ACASTYLMDDILRKHWGWTEHNNYITSDCNAIQDFLPGPKWHNFSSTQTEAEAAAVAYQA 322

Query: 175 GLDLDCG-PFLAIHTE--GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNL 231
           G D  C  P    +T+  GA    LL EE ++ AL       +R+G FD   S  P+ ++
Sbjct: 323 GTDTVCEVPGWPPYTDVIGAYNQTLLSEEVIDTALKRLYEGLVRVGYFD-PASGSPYRSI 381

Query: 232 GPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIG--PNSDVTVTMIGN 289
           G  DV TP  Q+LALQ+   G+VLLKN   TLPL+ L   TVA+IG   NS     ++G 
Sbjct: 382 GWEDVNTPEAQELALQSGTDGLVLLKNDG-TLPLN-LEDKTVALIGFWANSTNGGRILGG 439

Query: 290 YAGVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQ-----LIGAAEVAARQADATVLVM 344
           Y+G      +P+    +   T H A    +A N  Q      +  A   A++++  +   
Sbjct: 440 YSGFPPYIHSPVDAAEKLNLTYHYASG-PLAENITQAAIDDWVAKALEPAKKSNVILYFG 498

Query: 345 GLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGP 380
           G D SI AE +DR  +  P  Q  ++  ++   + P
Sbjct: 499 GTDTSIAAEDLDRDSIAWPEIQLAVIEALSALRQAP 534


>gi|329850151|ref|ZP_08264997.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
 gi|328842062|gb|EGF91632.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
          Length = 877

 Score =  233 bits (595), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 143/335 (42%), Positives = 185/335 (55%), Gaps = 25/335 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT WSPN+NIFRDPRWGRGQET GEDP LT +    Y+ GLQGN     K  A  KH+ 
Sbjct: 129 GLTVWSPNINIFRDPRWGRGQETYGEDPYLTSRIGIGYIHGLQGNDPKFFKTVATSKHFA 188

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +     N   R+  +   SK DLEDTY   F+A V EGK  SVMC YN V G P CA  
Sbjct: 189 VHSGPESN---RHKEDVYPSKFDLEDTYLPAFRATVTEGKAYSVMCVYNAVYGVPGCASD 245

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLY--NTQHYTRTPEEAAADAIKAGLDLDCGPFL- 184
            +++  +   W   G++VSDC +   ++  +  HYT+T EE  A  +KAG+DL CG +  
Sbjct: 246 FLMEEKLRQNWGFPGFVVSDCGAAANIFREDALHYTKTAEEGVAVGLKAGMDLICGDYRN 305

Query: 185 AIHTE-----GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
            + TE      AV+ G L    V+ AL      ++RLGMFD  P++ PF ++   D  TP
Sbjct: 306 KMSTEVQPIINAVKAGQLPIAVVDQALVRLFEGRIRLGMFD-PPASLPFAHITADDSDTP 364

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           AH  +AL  A + +VLLKN    LPL      T+AVIGPN+D    ++GNY G      T
Sbjct: 365 AHHAVALDMAKKSMVLLKNDG-LLPLKA-EPKTIAVIGPNADSLDALVGNYYGKPSKPVT 422

Query: 300 PLQGI-SRY--AKTIHQAGCFGVACNGNQLIGAAE 331
            L GI +R+  AK ++          G  LIG AE
Sbjct: 423 VLDGIRARFPTAKIVY--------AEGTGLIGPAE 449



 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 40/114 (35%), Positives = 58/114 (50%), Gaps = 13/114 (11%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAK 375
           + G A   A+ AD  V V GL   +E E +          DR  + LP  QQ+L+ +V  
Sbjct: 587 MAGQAVDVAKTADFVVFVGGLSARVEGEEMKVEAEGFAGGDRTSIDLPKPQQQLLEKVIG 646

Query: 376 ASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
             + P VLVLM G  + V++A  D  + AI+   YPG  GG A+A ++ G  +P
Sbjct: 647 TGK-PTVLVLMSGSALGVNWA--DKHVPAIIEAWYPGGEGGHAVAQLIAGDYSP 697


>gi|325929067|ref|ZP_08190221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
 gi|325540562|gb|EGD12150.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
          Length = 850

 Score =  233 bits (595), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 130/306 (42%), Positives = 180/306 (58%), Gaps = 15/306 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA------- 60
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQG      K A       
Sbjct: 88  GLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEPYRK 147

Query: 61  --ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
             A  KH+  +        DR+HF+AR S++DL +TY   F+A V +GKV +VM +YN+V
Sbjct: 148 LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRV 204

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            G+   A   +L++ +  QW   GY+VSDC ++  ++       T E+AAA A+K G +L
Sbjct: 205 YGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTEL 264

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG   +     AVR GL+ E  ++ AL   +T +MRLGMFD  P   P+  +      +
Sbjct: 265 ECGEEYST-LPAAVRQGLIDEAQIDTALTTLMTARMRLGMFD-PPGQLPWSTIPASVNQS 322

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PAH  LA + A + +VLLKN    LPLS  +   +AVIGP +D T+ ++GNY G      
Sbjct: 323 PAHDALARRTARESLVLLKNDG-LLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPV 381

Query: 299 TPLQGI 304
           T LQGI
Sbjct: 382 TVLQGI 387



 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/107 (41%), Positives = 58/107 (54%), Gaps = 13/107 (12%)

Query: 334 ARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVL 383
           AR AD  V V GL   +E E +          DR  L LP  Q++L+  + +A+  PVV 
Sbjct: 581 ARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QATGKPVVA 639

Query: 384 VLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           VL  G  + + +A+    + AIL   YPGQ GG A+AD LFG ANPG
Sbjct: 640 VLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPG 684


>gi|94970273|ref|YP_592321.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
 gi|94552323|gb|ABF42247.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
          Length = 881

 Score =  233 bits (593), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 120/306 (39%), Positives = 184/306 (60%), Gaps = 17/306 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL +W+PN+NIFRDPRWGRGQET GEDP LT +   +YV+GLQG+        +  KHY 
Sbjct: 134 GLDFWAPNINIFRDPRWGRGQETYGEDPFLTARMGVAYVKGLQGDDPKYYLAISTPKHYA 193

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +   +     R+  + +VSK D  DTY   F+A V E K  SVMC+YN +NG+P C + 
Sbjct: 194 VH---SGPETTRHFADVKVSKHDELDTYLPAFRATVTEAKAGSVMCAYNSINGQPACVNE 250

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L++ + G+W   GY+VSDC+++  +Y    +T+T  EA+A A++ G+D +C  F    
Sbjct: 251 FLLQDQLRGKWNFQGYVVSDCEAIINIYRDHKFTKTQAEASALAVQRGMDNECVDFGKQK 310

Query: 188 TE-------GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
            +        A + G+L+E +++ AL    T +M+LGMFD  P   P+  + P+++ +  
Sbjct: 311 DDHDYRPYFDAYKQGILKESEIDTALVRLFTARMKLGMFD-PPEMVPYSKIDPKELESAE 369

Query: 241 HQQLALQAAHQGIVLLKNSARTLPL--STLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           H++LA   A++ +VLLKN   TLPL  S L+   +AVIGP ++ T  ++GNY G      
Sbjct: 370 HRELARTLANESMVLLKNDG-TLPLKKSGLK---IAVIGPLAEQTRYLLGNYNGTPSHTV 425

Query: 299 TPLQGI 304
           + L+G+
Sbjct: 426 SVLEGL 431



 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 42/111 (37%), Positives = 60/111 (54%), Gaps = 13/111 (11%)

Query: 329 AAEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASR 378
           AA  AA+ AD  + V+G+   +E E +          DR  L LP  +Q+L+  ++ A +
Sbjct: 602 AAVTAAKNADVVIAVLGITSDLEGEEMPVSEEGFNGGDRTSLDLPKPEQQLLESISAAGK 661

Query: 379 GPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            PVVLVL  G  + V++A+      AIL   YPG+ GG AIA  L G+ NP
Sbjct: 662 -PVVLVLSNGSALSVNWAQQ--HANAILEGWYPGEEGGTAIAQTLSGKNNP 709


>gi|167524198|ref|XP_001746435.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775197|gb|EDQ88822.1| predicted protein [Monosiga brevicollis MX1]
          Length = 834

 Score =  233 bits (593), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 154/440 (35%), Positives = 221/440 (50%), Gaps = 25/440 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL+ WSP VNI R P WGR  ET GE PVL+   A S+V G+QGN       AA CKH  
Sbjct: 151 GLSCWSPVVNINRHPLWGRNDETFGECPVLSSFMARSFVEGIQGNHTRYYAAAAACKH-- 208

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
              LD + G D  RY F+A VS+ DL  T+ + F+ C   G V   MCSYN + G P CA
Sbjct: 209 ---LDVYGGPDNLRYVFDADVSQADLTGTFLMAFEECAAAG-VMGYMCSYNSIRGVPACA 264

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL----DCG 181
           +   +      QW  +GY+VSD  +V  +  + +YT      A  A+ AG D+    D  
Sbjct: 265 NYRTMTFFAREQWGFEGYVVSDQGAVFRITESHNYTANQTLGAVAALNAGCDMEDSDDAQ 324

Query: 182 PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAH 241
                +   A+   L     ++ +++    V+MRLG FD  P   P+ +L    V +PAH
Sbjct: 325 HVAYYNLSLALDLKLTDMATIDASVSRLFYVRMRLGEFD-PPENDPWRSLNMSIVSSPAH 383

Query: 242 QQLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYA--------- 291
            ++A   A   IVLLKN   TLPLS   ++ +  ++GP +D    M+G Y+         
Sbjct: 384 VEMARDVATASIVLLKNQNETLPLSAAAKNASYCLLGPFADNADLMMGKYSPHGSTNVTV 443

Query: 292 GVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIE 351
               G    LQ  S+ A   +  GC G  C+G           +  D  +L +G    +E
Sbjct: 444 TYRAGLAAALQNASQTASFQYLEGCTGPFCDGLDTAAVTTFIQQGCDTVLLAVGTSYHVE 503

Query: 352 AEFIDRAGLLLPGRQQELVSRVAKA--SRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVG 409
           +E +DR+ +  PG Q  LV  V +A  ++  +VL++   GPVD++  + D R+ AIL + 
Sbjct: 504 SESLDRSNMSFPGAQPTLVQTVLEALGTKQRLVLLVSTAGPVDLAALEQDTRVAAILDLI 563

Query: 410 YPGQAGGAAIADVLFGRANP 429
           Y GQ  G A+AD+L G  +P
Sbjct: 564 YLGQTAGTALADILLGETSP 583


>gi|325916103|ref|ZP_08178390.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325537647|gb|EGD09356.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 896

 Score =  232 bits (592), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 130/298 (43%), Positives = 178/298 (59%), Gaps = 10/298 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQ   G   K+ A  KHY 
Sbjct: 145 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKLDATAKHYA 204

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +   +    DR+HF+   S++DL +TY   F+A V EG VA+VM +YN+VNG+   A  
Sbjct: 205 VH---SGPEADRHHFDVHPSERDLHETYLPAFQALVQEGHVAAVMGAYNRVNGESASAS- 260

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
             L+  +   W  DGYIVSDC ++  ++       TPE AAA  +K G DLDCG   A  
Sbjct: 261 TRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYAAL 320

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
            + AVR GL+ E  ++ +L   +T +MRLGMFD  P+   +  +      +P H  LA +
Sbjct: 321 PK-AVRAGLIDEATIDTSLKRLMTTRMRLGMFD-PPAKVAWAQIPASVNQSPQHDALARR 378

Query: 248 AAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
            A + +VLLKN    LPL  TL+   +AV+GP +D  ++++GNY G      T LQGI
Sbjct: 379 TARESLVLLKNDG-LLPLKPTLKR--IAVVGPTADDPMSLLGNYYGTPAAPVTILQGI 433



 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 44/108 (40%), Positives = 60/108 (55%), Gaps = 13/108 (12%)

Query: 333 AARQADATVLVMGLDQSIEAEFID----------RAGLLLPGRQQELVSRVAKASRGPVV 382
           AAR A+  V V GL   +E E +D          R    LP  Q+EL+  + +A+  PVV
Sbjct: 626 AARNAEVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQAL-QATGTPVV 684

Query: 383 LVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            VL  G  + V +A+    + AIL   YPGQ GG+A+ DVLFG+A+PG
Sbjct: 685 AVLTTGSALAVDWAQQ--HVPAILLAWYPGQRGGSAVGDVLFGQASPG 730


>gi|392537607|ref|ZP_10284744.1| Beta-glucosidase [Pseudoalteromonas marina mano4]
          Length = 870

 Score =  232 bits (592), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 126/298 (42%), Positives = 178/298 (59%), Gaps = 12/298 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYWSPN+NIFRDPRWGRGQET GEDP LT + A +++ GLQG+    LK  A  KHY 
Sbjct: 139 GLTYWSPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQGDNTEYLKSVATLKHYA 198

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +   +   V R+  +   SK+DL +TY   FK  + + KVASVMC+YN VNG P C + 
Sbjct: 199 VH---SGPEVSRHSDDYTASKKDLAETYLPAFKDVIAQTKVASVMCAYNSVNGTPACGND 255

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYT--RTPEEAAADAIKAGLDLDCGPF-- 183
           ++++N +  ++  DGYIVSDC ++   Y+ + +    T  +AAA A+K G DL+CG    
Sbjct: 256 ELIQNKLRDEFNFDGYIVSDCGAIADFYDVKSHNIVNTEAKAAAMALKTGTDLNCGDHHG 315

Query: 184 -LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
               +   AV+ GL+ E+DV+ AL   +  + +LGMFD  P   P+ +     V +  H 
Sbjct: 316 NTYSYLSQAVKEGLVEEKDVDKALKRLMYARFKLGMFDN-PENVPYSDTSIDIVGSNKHL 374

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
            L  +AA + +VLLKN  + LPL    +  VA+IGPN+D    ++GNY G+     TP
Sbjct: 375 ALTQEAAKKSLVLLKNE-QVLPLKG--NEKVALIGPNADNEAILLGNYNGMPIVPITP 429



 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 36/134 (26%), Positives = 63/134 (47%), Gaps = 13/134 (9%)

Query: 307 YAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI----------D 356
           ++  I+         N   L   A   A +AD  V V G+  ++E E +          D
Sbjct: 576 HSNVINPTASLTWLKNPQSLTQQALNNANEADVIVFVGGISANLEGEEMPLQIDGFSHGD 635

Query: 357 RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
           R  + LP  Q  L+ ++ +  + P+VLV M G  + +++   +  I AI+   YPG+A G
Sbjct: 636 RTNINLPKSQLNLLKKLKQTGK-PIVLVNMSGSAMALNWENEN--IDAIIQGFYPGEAAG 692

Query: 417 AAIADVLFGRANPG 430
           +A+  +L+G  +P 
Sbjct: 693 SALVSLLYGEYSPS 706


>gi|399030621|ref|ZP_10730998.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
 gi|398071229|gb|EJL62496.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
          Length = 876

 Score =  232 bits (591), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 124/307 (40%), Positives = 173/307 (56%), Gaps = 11/307 (3%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N    GLTYW+PN+NIFRDPRWGRGQET GEDP LTG    S+V+GLQG+    LK AAC
Sbjct: 133 NERYLGLTYWTPNINIFRDPRWGRGQETYGEDPYLTGVLGDSFVKGLQGDDPKYLKAAAC 192

Query: 63  CKHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
            KHY  +     +G +  R+ F+  V+  +L DTY   F+  V E KVA VMC+YN    
Sbjct: 193 AKHYAVH-----SGPEPLRHTFDVDVTPYELWDTYLPAFQKLVTESKVAGVMCAYNAFRT 247

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
           +P CA   ++ + +  QW+ +GY+ SDC ++   +         E A+ADA+  G D+DC
Sbjct: 248 QPCCASDILMTDILRNQWKFEGYVTSDCWAIDDFFKNHKTHPDAESASADAVFHGTDIDC 307

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC-TP 239
           G         AV+ G + E+ +++++     ++ RLGMFD  P         P  V    
Sbjct: 308 GTDAYKALVQAVKDGKISEKQIDISVKRLFMIRFRLGMFD--PVEMVKYAQTPTSVLEND 365

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
            H+  AL+ A Q IVLL+N  +TLPLS  +   + V+GPN D  + ++GNY G     TT
Sbjct: 366 EHKAHALKMARQSIVLLRNENKTLPLSK-KLKKIVVLGPNVDNAIAILGNYNGTPSKLTT 424

Query: 300 PLQGISR 306
            L+GI  
Sbjct: 425 VLEGIKE 431



 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 41/107 (38%), Positives = 57/107 (53%), Gaps = 15/107 (14%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + ADA V V G+   +E E +          DR  +LLP  Q +L+ +  K +  P+V V
Sbjct: 606 KDADAFVFVGGISPQLEGEEMKVNFPGFKGGDRTSILLPKIQTDLM-KALKTTGKPIVFV 664

Query: 385 LMCGGPVDVSF-AKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +M G  + + + A+N P I A  W G  GQA G A+ADVLFG  NP 
Sbjct: 665 MMTGSAIAIPWEAENIPAI-ANAWYG--GQAAGTAVADVLFGNYNPA 708


>gi|188993706|ref|YP_001905716.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
 gi|167735466|emb|CAP53681.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
          Length = 896

 Score =  232 bits (591), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 129/298 (43%), Positives = 177/298 (59%), Gaps = 10/298 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQ   G   K+ A  KHY 
Sbjct: 145 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKLDATAKHYA 204

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +   +    DR+HF+   S++DL +TY   F+A V EG VA+VM +YN+VNG+   A  
Sbjct: 205 VH---SGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNRVNGESASAS- 260

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
             L+  +   W  DGYIVSDC ++  ++       TPE AAA  +K G DLDCG   A  
Sbjct: 261 TRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYAA- 319

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              AVR GL+ E  ++ +L   +  ++RLGMFD  P+  P+  +      +P H  LA +
Sbjct: 320 LPAAVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQIPASANQSPQHDALARR 378

Query: 248 AAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
            A + +VLLKN    LPL  TL+   +AV+GP +D  ++++GNY G      T LQGI
Sbjct: 379 TARESLVLLKNDG-LLPLKPTLKR--IAVVGPTADDPMSLLGNYYGTPAAPVTILQGI 433



 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 44/108 (40%), Positives = 59/108 (54%), Gaps = 13/108 (12%)

Query: 333 AARQADATVLVMGLDQSIEAEFID----------RAGLLLPGRQQELVSRVAKASRGPVV 382
           AAR AD  V V GL   +E E +D          R    LP  Q+EL+  + +A+  PVV
Sbjct: 626 AARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQAL-QATGTPVV 684

Query: 383 LVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            VL  G  + + +A+    + AIL   YPGQ GG A+ DVLFG+A+PG
Sbjct: 685 AVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVGDVLFGQASPG 730


>gi|78049893|ref|YP_366068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
 gi|78038323|emb|CAJ26068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
          Length = 902

 Score =  232 bits (591), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 129/306 (42%), Positives = 180/306 (58%), Gaps = 15/306 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA------- 60
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GL+G      K A       
Sbjct: 140 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLRGEGADAPKNAQGEPYRK 199

Query: 61  --ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
             A  KH+  +        DR+HF+AR S++DL +TY   F+A V +GKV +VM +YN+V
Sbjct: 200 LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRV 256

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            G+   A   +L++ +  QW   GY+VSDC ++  ++       T E+AAA A+K G +L
Sbjct: 257 YGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTEL 316

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG   +     AVR GL+ E  ++ AL   +T +MRLGMFD  P   P+  +      +
Sbjct: 317 ECGEEYST-LPAAVRQGLIDEAQIDTALTTLMTARMRLGMFD-PPGQLPWSTIPASVNQS 374

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PAH  LA + A + +VLLKN    LPLS  +   +AVIGP +D T+ ++GNY G      
Sbjct: 375 PAHDALARRTARESLVLLKNDG-LLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPV 433

Query: 299 TPLQGI 304
           T LQGI
Sbjct: 434 TVLQGI 439



 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 43/107 (40%), Positives = 57/107 (53%), Gaps = 13/107 (12%)

Query: 334 ARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVL 383
           A  AD  V V GL   +E E +          DR  L LP  Q++L+  + +A+  PVV 
Sbjct: 633 ASSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QATGKPVVA 691

Query: 384 VLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           VL  G  + + +A+    + AIL   YPGQ GG A+AD LFG ANPG
Sbjct: 692 VLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPG 736


>gi|440731995|ref|ZP_20911965.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
 gi|440370332|gb|ELQ07251.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
          Length = 913

 Score =  232 bits (591), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 129/310 (41%), Positives = 181/310 (58%), Gaps = 15/310 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG---------SRLK 58
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQG            +  K
Sbjct: 142 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEAYRK 201

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           + A  KH+  +   +    DR+HF+A  S++DL +TY   F+A V EGKV +VM +YN+V
Sbjct: 202 LDATAKHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRV 258

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            G+   A   +L++ +  +W  DGY+VSDC ++  ++       T EEAAA A+K G +L
Sbjct: 259 YGESASASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVATREEAAALAVKHGTEL 318

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG   +     AVR GL+ E DV+ AL   +  +MRLGMFD  P    +  +      +
Sbjct: 319 ECGAEYST-LPSAVRKGLISEADVDKALQKLMYSRMRLGMFD-PPEKLAWAQIPLSANQS 376

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           P H  LA + A + +VLLKN    LPLS  +   +AV+GP +D T+ ++GNY G      
Sbjct: 377 PEHDALARRTARESLVLLKNDG-VLPLSRAKIKRIAVVGPTADDTMALLGNYYGTPAAPV 435

Query: 299 TPLQGISRYA 308
           T LQGI   A
Sbjct: 436 TVLQGIREAA 445



 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/108 (41%), Positives = 59/108 (54%), Gaps = 13/108 (12%)

Query: 333 AARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVV 382
           AAR+AD  V V GL   +E E +          DR  L LP  Q+ L+  +    + PVV
Sbjct: 634 AARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEALHGTGK-PVV 692

Query: 383 LVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            VL  G  + + +A+    + AIL   YPGQ GG+A+ADVLFG ANPG
Sbjct: 693 AVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGSAVADVLFGDANPG 738


>gi|340369765|ref|XP_003383418.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 748

 Score =  231 bits (589), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 158/446 (35%), Positives = 236/446 (52%), Gaps = 33/446 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL+ WSP +NI RDPRWGR QET GEDP L+G    ++V GLQG+  + +   A CKH+ 
Sbjct: 141 GLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAFVTGLQGDDPTYVIANAGCKHFD 200

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +       + R  F+A V+  D   T+   FKACV  G + S+MCSYN++NG P CA+ 
Sbjct: 201 VHGGPEDTPLPRASFDANVTMIDWRMTFLPQFKACVEAGAL-SLMCSYNRINGVPACANK 259

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L + +  +W   GY+VSD  ++  +    HY      AAADA  AG  L+ G     +
Sbjct: 260 KLLTDILRNEWNFKGYVVSDQGALENIVTQHHYAPDFVTAAADAANAGTCLEDG-----N 314

Query: 188 TEG-----------AVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDV 236
           +EG           AV  GL+  + +  A++    V+ +LG FD   +  P+ N+    +
Sbjct: 315 SEGKGGNVFDNLDDAVEKGLVSVDTLKDAVSRLFYVRTKLGEFDPPDNNNPYANIPLSII 374

Query: 237 CTPAHQQLALQAAHQGIVLLKNS---ARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
            +  H +L++QAA + IVL+KN    +  LPL+        V+GP  +   TM G+Y+  
Sbjct: 375 QSDEHIKLSIQAAMETIVLMKNDNDGSPFLPLAADDFKKACVVGPFIENADTMFGDYSPT 434

Query: 294 ACG--YTTPLQGI--SRYAKTI--HQAGCF-GVACNGNQLIGAAEV--AARQADATVLVM 344
                  TPL GI  ++    +  ++ GC  G AC   ++    +V  A    D  ++  
Sbjct: 435 MMTDYIVTPLAGIKTTQIGSDLLNYEDGCTDGPAC---EIYDGYKVRTACEGVDLVIVTA 491

Query: 345 GLDQSIEAEFIDRAGLLLPGRQQELVSRVAKAS-RGPVVLVLMCGGPVDVSFAKNDPRIG 403
           GL + +E E  D + + LPG Q  L++    AS   P++L+L    P+D+S+AK++PR  
Sbjct: 492 GLSRYLEHEGHDISDIYLPGHQMSLLTDAESASGSAPIILLLFNANPLDISYAKSNPRFA 551

Query: 404 AILWVGYPGQAGGAAIADVLFGRANP 429
           AIL   YPGQ  G AIA+VL G  NP
Sbjct: 552 AILEAYYPGQEAGVAIANVLTGSYNP 577


>gi|346726970|ref|YP_004853639.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346651717|gb|AEO44341.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 902

 Score =  231 bits (589), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 129/306 (42%), Positives = 179/306 (58%), Gaps = 15/306 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA------- 60
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQG      K A       
Sbjct: 140 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEPYRK 199

Query: 61  --ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
             A  KH+  +        DR+HF+AR S++DL +TY   F+A V +GKV +VM +YN+V
Sbjct: 200 LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRV 256

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            G+   A   +L++ +  QW   GY+VSDC ++  ++       T E+AAA A+K G +L
Sbjct: 257 YGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTEL 316

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG   +     AV  GL+ E  ++ AL   +T +MRLGMFD  P   P+  +      +
Sbjct: 317 ECGEEYST-LPAAVHQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLPWSTIPASVNQS 374

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PAH  LA + A + +VLLKN    LPLS  +   +AVIGP +D T+ ++GNY G      
Sbjct: 375 PAHDALARRTARESLVLLKNDG-LLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPV 433

Query: 299 TPLQGI 304
           T LQGI
Sbjct: 434 TVLQGI 439



 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/107 (41%), Positives = 58/107 (54%), Gaps = 13/107 (12%)

Query: 334 ARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVL 383
           AR AD  V V GL   +E E +          DR  L LP  Q++L+  + +A+  PVV 
Sbjct: 633 ARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QATGKPVVA 691

Query: 384 VLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           VL  G  + + +A+    + AIL   YPGQ GG A+AD LFG ANPG
Sbjct: 692 VLTTGSALAIDWAQQ--HLPAILLAWYPGQRGGTAVADTLFGDANPG 736


>gi|21233528|ref|NP_639445.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|66770493|ref|YP_245255.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
 gi|21115383|gb|AAM43327.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|66575825|gb|AAY51235.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
          Length = 896

 Score =  231 bits (588), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 129/298 (43%), Positives = 176/298 (59%), Gaps = 10/298 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQ   G   K+ A  KHY 
Sbjct: 145 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKLDATAKHYA 204

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +   +    DR+HF+   S++DL +TY   F+A V EG VA+VM +YN+VNG+   A  
Sbjct: 205 VH---SGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNRVNGESASAS- 260

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
             L+  +   W  DGYIVSDC ++  ++       TPE AAA  +K G DLDCG   A  
Sbjct: 261 TRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYAA- 319

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              AVR GL+ E  ++ +L   +  ++RLGMFD  P+  P+         +P H  LA +
Sbjct: 320 LPAAVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQTPASANQSPQHDALARR 378

Query: 248 AAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
            A + +VLLKN    LPL  TL+   +AV+GP +D  ++++GNY G      T LQGI
Sbjct: 379 TARESLVLLKNDG-LLPLKPTLKR--IAVVGPTADDPMSLLGNYYGTPAAPVTILQGI 433



 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 44/108 (40%), Positives = 59/108 (54%), Gaps = 13/108 (12%)

Query: 333 AARQADATVLVMGLDQSIEAEFID----------RAGLLLPGRQQELVSRVAKASRGPVV 382
           AAR AD  V V GL   +E E +D          R    LP  Q+EL+  + +A+  PVV
Sbjct: 626 AARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQAL-QATGTPVV 684

Query: 383 LVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            VL  G  + + +A+    + AIL   YPGQ GG A+ DVLFG+A+PG
Sbjct: 685 AVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVGDVLFGQASPG 730


>gi|384430040|ref|YP_005639401.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
           756C]
 gi|341939144|gb|AEL09283.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
           756C]
          Length = 896

 Score =  231 bits (588), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 129/298 (43%), Positives = 175/298 (58%), Gaps = 10/298 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQ   G   K+ A  KHY 
Sbjct: 145 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKLDATAKHYA 204

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +        DR+HF+   S++DL +TY   F+A V EG VA+VM +YN+VNG+   A  
Sbjct: 205 VHSGPE---ADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNRVNGESASAS- 260

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
             L+  +   W  DGYIVSDC ++  ++       TPE AAA  +K G DLDCG   A  
Sbjct: 261 TRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYAA- 319

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              AVR GL+ E  ++ +L   +  ++RLGMFD  P+  P+         +P H  LA +
Sbjct: 320 LPAAVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQTPASANQSPQHDALARR 378

Query: 248 AAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
            A + +VLLKN    LPL  TL+   +AV+GP +D  ++++GNY G      T LQGI
Sbjct: 379 TARESLVLLKNDG-LLPLKPTLKR--IAVVGPTADDPMSLLGNYYGTPAAPVTILQGI 433



 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 44/108 (40%), Positives = 59/108 (54%), Gaps = 13/108 (12%)

Query: 333 AARQADATVLVMGLDQSIEAEFID----------RAGLLLPGRQQELVSRVAKASRGPVV 382
           AAR AD  V V GL   +E E +D          R    LP  Q+EL+  + +A+  PVV
Sbjct: 626 AARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQAL-QATGTPVV 684

Query: 383 LVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            VL  G  + + +A+    + AIL   YPGQ GG A+ DVLFG+A+PG
Sbjct: 685 AVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVGDVLFGQASPG 730


>gi|320105647|ref|YP_004181237.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
 gi|319924168|gb|ADV81243.1| glycoside hydrolase family 3 domain protein [Terriglobus saanensis
           SP1PR4]
          Length = 885

 Score =  230 bits (587), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 123/301 (40%), Positives = 188/301 (62%), Gaps = 11/301 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT WSPN+NIFRDPRWGRGQET GEDP LT +   ++VRG+QG+  +  +  A  KH+ 
Sbjct: 135 GLTIWSPNINIFRDPRWGRGQETYGEDPFLTARMGTAFVRGIQGDDPNYFRTIATPKHFA 194

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +         R+ FN  VS+ DL DTY   F++ ++EGK  S+MC+YN+++G+P CA  
Sbjct: 195 VHSGPE---STRHTFNVDVSQHDLWDTYLPAFRSTIIEGKADSIMCAYNRIDGQPACASD 251

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNT--QHYTRTPEEAAADAIKAGLDLDCGP-FL 184
            +LK  + G W   G++ SDC ++   Y     H+++  E+A+A  +KAG D  CG  +L
Sbjct: 252 LLLKQILRGDWGFRGFVTSDCGAIDDFYTKIGHHFSKEKEDASAAGVKAGTDTACGKTYL 311

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
            + +  AV+ GL+ E +++++L      ++RLG+FD +P+  P+  L   +V +PAH+ L
Sbjct: 312 GLTS--AVKSGLITEHEMDISLERLFEARIRLGLFD-DPARMPYARLTMAEVNSPAHRAL 368

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           AL+AA + IVLLKN+   LPL  +++  +AVIGPN+     + GNY  +A     P+ GI
Sbjct: 369 ALRAARESIVLLKNANNLLPLHGVKN--IAVIGPNAASLDALEGNYNAIARDPAMPVDGI 426

Query: 305 S 305
           +
Sbjct: 427 A 427



 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 37/102 (36%), Positives = 51/102 (50%), Gaps = 15/102 (14%)

Query: 338 DATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMC 387
           D  V  +GL   +E E +          DR  + LP  Q EL+ R  KA+  P+++VLM 
Sbjct: 620 DVVVAFVGLSPELEGEEMPIKVKGFAGGDRTDIELPQTQLELL-RAVKATGKPLIVVLMN 678

Query: 388 GGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           G  +    A  D    A+L   YPG+AG  AIA+ L G+ NP
Sbjct: 679 GSAI----ALKDSETDALLEAWYPGEAGAQAIAETLAGKNNP 716


>gi|433677589|ref|ZP_20509555.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
 gi|430817300|emb|CCP39963.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
          Length = 913

 Score =  230 bits (587), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 129/310 (41%), Positives = 181/310 (58%), Gaps = 15/310 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT---------GSRLK 58
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQG            +  K
Sbjct: 142 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEDVDVPKNAQGEAYRK 201

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           + A  KH+  +   +    DR+HF+A  S++DL +TY   F+A V EGKV +VM +YN+V
Sbjct: 202 LDATAKHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRV 258

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            G+   A   +L++ +  +W  DGY+VSDC ++  ++       T EEAAA A+K G +L
Sbjct: 259 YGESASASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVATREEAAALAVKHGTEL 318

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG   +     AVR GL+ E DV+ AL   +  +MRLGMFD  P    +  +      +
Sbjct: 319 ECGAEYST-LPTAVRKGLISEADVDNALQKLMYSRMRLGMFD-PPEKLAWAQIPLSANQS 376

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           P H  LA + A + +VLLKN    LPLS  +   +AV+GP +D T+ ++GNY G      
Sbjct: 377 PEHDALARRTARESLVLLKNDG-VLPLSRAKIKRIAVVGPTADDTMALLGNYYGTPAAPV 435

Query: 299 TPLQGISRYA 308
           T LQGI   A
Sbjct: 436 TVLQGIREAA 445



 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/108 (41%), Positives = 59/108 (54%), Gaps = 13/108 (12%)

Query: 333 AARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVV 382
           AAR+AD  V V GL   +E E +          DR  L LP  Q+ L+  +    + PVV
Sbjct: 634 AARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEALHGTGK-PVV 692

Query: 383 LVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            VL  G  + + +A+    + AIL   YPGQ GG+A+ADVLFG ANPG
Sbjct: 693 AVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGSAVADVLFGDANPG 738


>gi|146301622|ref|YP_001196213.1| glycoside hydrolase family 3 protein [Flavobacterium johnsoniae
           UW101]
 gi|146156040|gb|ABQ06894.1| Candidate beta-xylosidase; Glycoside hydrolase family 3
           [Flavobacterium johnsoniae UW101]
          Length = 875

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 122/307 (39%), Positives = 172/307 (56%), Gaps = 11/307 (3%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N    GLTYW+PN+NIFRDPRWGRGQET GEDP LT     ++V+GLQG+    LK AAC
Sbjct: 131 NERYLGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQGDDPKYLKAAAC 190

Query: 63  CKHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
            KHY  +     +G +  R+ F+  V+  +L DTY   F+  + E  VA VMC+YN    
Sbjct: 191 AKHYAVH-----SGPESLRHTFDVDVTPYELWDTYLPAFRKLITESNVAGVMCAYNAFRT 245

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
           +P CA   ++ + +  +W+ DGY+ SDC ++   +         E AAADA+  G D+DC
Sbjct: 246 QPCCASDILMNDILRKEWKFDGYVTSDCWAIDDFFKNHKTHPDAESAAADAVFHGTDIDC 305

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC-TP 239
           G         AV+ G + E+ +++++     ++ RLGMFD  P +       P  V  + 
Sbjct: 306 GTDAYKALVQAVKNGKISEKQIDISVKRLFMIRFRLGMFD--PVSMVKYAQTPSSVLESK 363

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
            HQ  AL+ A Q IVLLKN    LPL+      + V+GPN+D  ++++GNY G     TT
Sbjct: 364 EHQLHALKMARQSIVLLKNEKNILPLNK-NLKKIVVLGPNADNAISILGNYNGTPSKLTT 422

Query: 300 PLQGISR 306
            LQGI  
Sbjct: 423 VLQGIKE 429



 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 55/107 (51%), Gaps = 15/107 (14%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + ADA +   G+   +E E +          DR  +L P  Q +L+ +  ++S  PVV  
Sbjct: 604 KNADAFIFAGGISPQLEGEEMPVDFPGFKGGDRTSILFPEVQTKLL-KALQSSGKPVVFA 662

Query: 385 LMCGGPVDVSF-AKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +M G  + + + A+N P   AIL + Y GQ+ G A ADV+FG  NP 
Sbjct: 663 MMTGSAIAIPWEAENIP---AILNIWYGGQSAGTAAADVIFGDYNPA 706


>gi|359450637|ref|ZP_09240068.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
 gi|358043611|dbj|GAA76317.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
          Length = 468

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 124/298 (41%), Positives = 178/298 (59%), Gaps = 12/298 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYWSPN+NIFRDPRWGRGQET GEDP LT + A +++ GLQG+    LK  A  KHY 
Sbjct: 139 GLTYWSPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQGDNAEYLKSVATLKHYA 198

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +   +   V R+  +   S++DL +TY   FK  + + KVASVMC+YN VNG P C + 
Sbjct: 199 VH---SGPEVSRHSDDYTASEKDLAETYLPAFKDVIAQTKVASVMCAYNSVNGTPACGND 255

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYT--RTPEEAAADAIKAGLDLDCGPF-- 183
           ++++N +  ++  DGYIVSDC ++   Y+ + +    T  +AAA A+K G DL+CG    
Sbjct: 256 ELIQNKLRDEFNFDGYIVSDCGAIADFYDVKSHNIVNTGAKAAAMALKTGTDLNCGDHHG 315

Query: 184 -LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
               +   AV+ GL+ E+DV+ AL   +  + +LGMFD  P   P+ +     V +  H 
Sbjct: 316 NTYSYLTQAVKEGLVEEKDVDKALKRLMYARFKLGMFDN-PENVPYSDTSIDVVGSNKHL 374

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
            L  +AA + +VLLKN  + LPL    +  +A+IGPN+D    ++GNY G+     TP
Sbjct: 375 ALTQEAAQKSLVLLKNE-QVLPLKG--NEKIALIGPNADNEAILLGNYNGMPIVPITP 429


>gi|325925754|ref|ZP_08187127.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
 gi|325543811|gb|EGD15221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
          Length = 874

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 127/303 (41%), Positives = 177/303 (58%), Gaps = 13/303 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++RGLQG   +  +  A  KH 
Sbjct: 120 AGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIATPKHI 179

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS +D+E TY   F+A +VEG+  SVMC+YN ++G P C
Sbjct: 180 AVH-----SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPAC 234

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PF 183
           A   +L   + G W   G++VSDCD+V  +    ++      ++A A+KAG DL+CG  +
Sbjct: 235 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY 294

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
             + T  A+  G + E  ++ +L      + RLG  +  P   P+  LG +DV   AH+ 
Sbjct: 295 RELGT--AIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRA 351

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LALQAA + IVLLKN A TLPL       +AVIGPN+D    +  NY G +    TPL G
Sbjct: 352 LALQAAAESIVLLKNDANTLPLKA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLG 409

Query: 304 ISR 306
           + +
Sbjct: 410 LRQ 412



 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 37/75 (49%), Positives = 47/75 (62%), Gaps = 3/75 (4%)

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  + LP  QQ L+ R AKAS  P+V+VLM G  V +++AK      AI+   YPGQ+G
Sbjct: 632 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKT--HADAIVAAWYPGQSG 688

Query: 416 GAAIADVLFGRANPG 430
           G AIA +L G  NPG
Sbjct: 689 GTAIARMLAGDDNPG 703


>gi|285016879|ref|YP_003374590.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
 gi|283472097|emb|CBA14604.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
           PC73]
          Length = 914

 Score =  229 bits (583), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 128/311 (41%), Positives = 183/311 (58%), Gaps = 15/311 (4%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG-------- 54
           +G   GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+G+QG           
Sbjct: 134 HGRYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGMQGEGADAPKNAQG 193

Query: 55  -SRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
            +  K+ A  KH+  +   +    +R+HF+AR S++DL +TY   F+A V EGKV +VM 
Sbjct: 194 ETYRKLDATAKHFAVH---SGPESERHHFDARPSQRDLYETYLPAFEALVKEGKVDAVMG 250

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIK 173
           +YN++ G+   A   +L++ +  +W   GY+VSDC ++  ++       T E+AAA A+K
Sbjct: 251 AYNRLFGESASASKFLLRDVLRERWGFHGYVVSDCWAIVDIWKNHKIVATREQAAALAVK 310

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
            G  L+CG   A     AV+ GL+ E D++ AL   +T +MRLGMFD  P    +  L  
Sbjct: 311 NGTQLECGQEYAT-LPAAVQQGLIGETDIDAALRTLMTARMRLGMFD-PPGQLRWAQLPI 368

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
               +P H  LA + A + +VLLKN    LPLS  +H  +AVIGP +D T+ ++GNY G 
Sbjct: 369 SVNQSPEHDALARRTARESLVLLKNDG-LLPLSRAKHKRIAVIGPTADDTMALLGNYYGT 427

Query: 294 ACGYTTPLQGI 304
                T LQGI
Sbjct: 428 PATPVTILQGI 438



 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 45/107 (42%), Positives = 60/107 (56%), Gaps = 13/107 (12%)

Query: 334 ARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVL 383
           AR+AD  V V GL   +E E +          DR  L LP  Q+EL+  ++ A+  PVV 
Sbjct: 632 ARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLQALS-ATGKPVVA 690

Query: 384 VLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           VL  G  + + +A+    + AIL   YPGQ GG+A+ADVLFG  NPG
Sbjct: 691 VLTTGSALAIDWAQE--HVPAILLAWYPGQRGGSAVADVLFGDTNPG 735


>gi|346725879|ref|YP_004852548.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346650626|gb|AEO43250.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 889

 Score =  228 bits (582), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 127/301 (42%), Positives = 176/301 (58%), Gaps = 13/301 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++RGLQG   +  +  A  KH 
Sbjct: 135 AGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIATPKHI 194

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS +D+E TY   F+A +VEG+  SVMC+YN ++G P C
Sbjct: 195 AVH-----SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPAC 249

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PF 183
           A   +L   + G W   G++VSDCD+V  +    ++      ++A A+KAG DL+CG  +
Sbjct: 250 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY 309

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
             + T  A+  G + E  ++ +L      + RLG  +  P   P+  LG +DV   AH+ 
Sbjct: 310 RELGT--AIARGEVDEALLDQSLVRLFATRYRLGELEA-PRKDPYARLGAKDVDNAAHRA 366

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LALQAA + IVLLKN A TLPL       +AVIGPN+D    +  NY G +    TPL G
Sbjct: 367 LALQAAAESIVLLKNDANTLPLKA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLG 424

Query: 304 I 304
           +
Sbjct: 425 L 425



 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 37/75 (49%), Positives = 47/75 (62%), Gaps = 3/75 (4%)

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  + LP  QQ L+ R AKAS  P+V+VLM G  V +++AK      AI+   YPGQ+G
Sbjct: 647 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKT--HADAIVAAWYPGQSG 703

Query: 416 GAAIADVLFGRANPG 430
           G AIA +L G  NPG
Sbjct: 704 GTAIARMLAGDDNPG 718


>gi|78048767|ref|YP_364942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
 gi|78037197|emb|CAJ24942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
          Length = 889

 Score =  228 bits (582), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 127/301 (42%), Positives = 176/301 (58%), Gaps = 13/301 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++RGLQG   +  +  A  KH 
Sbjct: 135 AGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIATPKHI 194

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS +D+E TY   F+A +VEG+  SVMC+YN ++G P C
Sbjct: 195 AVH-----SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPAC 249

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PF 183
           A   +L   + G W   G++VSDCD+V  +    ++      ++A A+KAG DL+CG  +
Sbjct: 250 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY 309

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
             + T  A+  G + E  ++ +L      + RLG  +  P   P+  LG +DV   AH+ 
Sbjct: 310 RELGT--AIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRA 366

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LALQAA + IVLLKN A TLPL       +AVIGPN+D    +  NY G +    TPL G
Sbjct: 367 LALQAAAESIVLLKNDANTLPLKA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLG 424

Query: 304 I 304
           +
Sbjct: 425 L 425



 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 37/75 (49%), Positives = 47/75 (62%), Gaps = 3/75 (4%)

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  + LP  QQ L+ R AKAS  P+V+VLM G  V +++AK      AI+   YPGQ+G
Sbjct: 647 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKT--HADAIVAAWYPGQSG 703

Query: 416 GAAIADVLFGRANPG 430
           G AIA +L G  NPG
Sbjct: 704 GTAIARMLAGDDNPG 718


>gi|58584046|ref|YP_203062.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|84625823|ref|YP_453195.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|58428640|gb|AAW77677.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|84369763|dbj|BAE70921.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
          Length = 904

 Score =  228 bits (582), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 129/306 (42%), Positives = 179/306 (58%), Gaps = 15/306 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN--------TGSRL-K 58
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQG          G R  K
Sbjct: 142 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGSDAPKNAQGERYRK 201

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           + A  KH+  +        DR+HF+AR S++DL +TY   F+A V +GKV +VM +YN+V
Sbjct: 202 LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRV 258

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            G+   A   +L++ +  QW   GY+VSDC ++  ++       T E+AAA A+  G +L
Sbjct: 259 YGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATREQAAALAVTHGTEL 318

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG   +     AV  GL+ E  ++ AL   +T +MRLGMFD  P   P+  +      +
Sbjct: 319 ECGEEYST-LPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD-PPGQLPWSKIPASVNQS 376

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PAH  LA + A + +VLLKN    LPLS      +AVIGP +D T+ ++GNY G      
Sbjct: 377 PAHDALARRTARESLVLLKNDG-LLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPV 435

Query: 299 TPLQGI 304
           T LQGI
Sbjct: 436 TVLQGI 441



 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 46/107 (42%), Positives = 58/107 (54%), Gaps = 13/107 (12%)

Query: 334 ARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVL 383
           AR AD  V V GL   +E E +          DR  L LP  Q+EL+  + +A+  PVV 
Sbjct: 635 ARSADVVVFVGGLTGDVEGEEMKVSYPGFAGGDRTDLRLPKPQRELLEAL-QATGKPVVA 693

Query: 384 VLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           VL  G  + V +A+    + AIL   YPGQ GG A+AD LFG ANPG
Sbjct: 694 VLTAGSALAVDWAQQ--HVPAILLAWYPGQRGGTAVADTLFGDANPG 738


>gi|188574621|ref|YP_001911550.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
 gi|188519073|gb|ACD57018.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
          Length = 904

 Score =  228 bits (581), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 129/306 (42%), Positives = 179/306 (58%), Gaps = 15/306 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN--------TGSRL-K 58
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQG          G R  K
Sbjct: 142 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGSDAPKNAQGERYRK 201

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           + A  KH+  +        DR+HF+AR S++DL +TY   F+A V +GKV +VM +YN+V
Sbjct: 202 LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRV 258

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            G+   A   +L++ +  QW   GY+VSDC ++  ++       T E+AAA A+  G +L
Sbjct: 259 YGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATREQAAALAVTHGTEL 318

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG   +     AV  GL+ E  ++ AL   +T +MRLGMFD  P   P+  +      +
Sbjct: 319 ECGEEYST-LPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD-PPGQLPWSKIPASVNQS 376

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PAH  LA + A + +VLLKN    LPLS      +AVIGP +D T+ ++GNY G      
Sbjct: 377 PAHDALARRTARESLVLLKNDG-LLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPV 435

Query: 299 TPLQGI 304
           T LQGI
Sbjct: 436 TVLQGI 441



 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 45/107 (42%), Positives = 58/107 (54%), Gaps = 13/107 (12%)

Query: 334 ARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVL 383
           AR AD  V V GL   +E E +          DR  L LP  Q+EL+  + +A+  PVV 
Sbjct: 635 ARSADVVVFVGGLTGDVEGEEMKVSYPGFAGGDRTDLRLPKPQRELLEAL-QATGKPVVA 693

Query: 384 VLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           VL  G  + + +A+    + AIL   YPGQ GG A+AD LFG ANPG
Sbjct: 694 VLTAGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVADTLFGDANPG 738


>gi|225873993|ref|YP_002755452.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
 gi|225791521|gb|ACO31611.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
          Length = 894

 Score =  228 bits (581), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 120/304 (39%), Positives = 181/304 (59%), Gaps = 12/304 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL +W+PN+NIFRDPRWGRGQET GEDP LTGK   ++V G+QG+     +V A  KH+ 
Sbjct: 143 GLDFWAPNLNIFRDPRWGRGQETYGEDPFLTGKMGVAFVSGMQGDNPKYYRVIATPKHF- 201

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
             D+ +     R+  +  VS  D  DTY   F+A +++G   SVMCSYN +NG+P CA+ 
Sbjct: 202 --DVHSGPEPTRHFADVDVSLHDQLDTYEPAFRAAIMQGHADSVMCSYNAINGQPACANQ 259

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
             L++ + G W   GY+VSDCD+V  +Y+   Y  T  +AAA +++ G+D DC  F    
Sbjct: 260 FTLQHQLRGAWGFKGYVVSDCDAVHDIYSGHKYRPTLAQAAAISMERGMDNDCADFAQPK 319

Query: 188 TE-------GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
            +        AV+ G L ++ ++ AL    T +++LG+FD +    P+ +    ++ +PA
Sbjct: 320 GDDDYKAYIDAVQQGYLSQQAMDTALVRLFTARIKLGLFDPK-GMDPYADTPHSELNSPA 378

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H+  A + A + +VLLKN   TLPL     H++AV+GP +D T  ++GNY GV     + 
Sbjct: 379 HRAYARKLADESMVLLKNDG-TLPLKPGSVHSIAVVGPLADQTAVLLGNYNGVPTHTVSF 437

Query: 301 LQGI 304
           L+G+
Sbjct: 438 LEGL 441



 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 52/99 (52%), Gaps = 13/99 (13%)

Query: 341 VLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGP 390
           + V+G+   +E E +          DR  L +P  ++ LV  VAK  + PVV+VLM G  
Sbjct: 627 IAVVGITSKLEGEEMPVDQPGFLGGDRTNLQMPEPEEALVEAVAKTGK-PVVVVLMNGSA 685

Query: 391 VDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           + V++        A+L   Y G+ GGAAIAD L G+ +P
Sbjct: 686 LAVNWISQ--HANAVLEAWYSGEEGGAAIADTLSGKNDP 722


>gi|386819249|ref|ZP_10106465.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
           19592]
 gi|386424355|gb|EIJ38185.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
           19592]
          Length = 878

 Score =  228 bits (580), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 124/300 (41%), Positives = 179/300 (59%), Gaps = 11/300 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPN+NIFRDPRWGRG ET GEDP LTG     YV+GLQGN    LKV A  KH+ 
Sbjct: 131 GLTFWSPNINIFRDPRWGRGHETYGEDPYLTGVLGTEYVKGLQGNNAKYLKVVATAKHFA 190

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+   S++DL +TY   F+  V +G V S+M +YN++ G+   A
Sbjct: 191 VH-----SGPEPLRHEFDVAPSQRDLWETYLPAFRTLVKDGNVYSIMTAYNRIYGEAASA 245

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
             + L + +  +W  +GY+VSDC ++  ++ T H  +   EA+A A+K G DL+CG    
Sbjct: 246 S-NSLYSILRDKWGFNGYVVSDCGAIADMWKTHHVAKDAAEASAMAVKEGCDLNCGNSYE 304

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
             T+ A++ GL+ E D+++AL   +  + +LGMFD +    P+  +       P H+ LA
Sbjct: 305 KLTD-ALQDGLITEADLDVALHRLMRARFKLGMFDSDEKV-PYAKIPFSVNNNPKHKVLA 362

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           L+AA + IVLLKN    LPLS      +AVIGPN+D   ++ GNY G+     T L+GI 
Sbjct: 363 LKAAQKSIVLLKNENAILPLSK-NLKNIAVIGPNADNIQSLWGNYNGMPKNPVTVLEGIK 421



 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 50/117 (42%), Positives = 69/117 (58%), Gaps = 14/117 (11%)

Query: 324 NQLIGAAEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRV 373
           NQL  A  +AA ++D  VL +GL++ +E E +          DR  L LP +Q EL+  V
Sbjct: 587 NQLEKAV-LAANKSDVVVLALGLNERLEGEEMKVEVEGFADGDRTSLNLPKKQVELMKEV 645

Query: 374 AKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
             A+  PVVLVL+ G  + +++A  +  I AI+  GYPGQ GG AIA+VLFG  NP 
Sbjct: 646 V-ATGKPVVLVLLNGSALSINWASEN--IPAIISAGYPGQEGGNAIANVLFGDYNPA 699


>gi|325914134|ref|ZP_08176487.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325539637|gb|EGD11280.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 874

 Score =  228 bits (580), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 126/304 (41%), Positives = 180/304 (59%), Gaps = 13/304 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++RGLQG+  +  +  A  KH 
Sbjct: 120 AGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLNHPRTIATPKHI 179

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS +D+E TY   F+A +V+G+  SVMC+YN ++G P C
Sbjct: 180 AVH-----SGPEPGRHGFDVDVSPRDMEATYTPAFRAALVDGQAWSVMCAYNSLHGTPAC 234

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PF 183
           A   +L   + G W   G++VSDCD+V  +    ++      ++A A+KAG DL+CG  +
Sbjct: 235 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY 294

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
             + T  A+  G + E  ++ +L      + RLG  +  P   P+  LG +DV   AH+ 
Sbjct: 295 RELGT--AIERGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRA 351

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LALQAA + IVLLKN+A TLPL       +AVIGPN+D    +  NY G +    TPL G
Sbjct: 352 LALQAAAESIVLLKNTATTLPLKA--GTRLAVIGPNADALAALEANYQGTSATPITPLLG 409

Query: 304 ISRY 307
           + ++
Sbjct: 410 LRQH 413



 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 45/105 (42%), Positives = 59/105 (56%), Gaps = 13/105 (12%)

Query: 336 QADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           Q+DA V  +GL   +E E +          DR  + LP  QQ L+ R AKAS  P+V+VL
Sbjct: 602 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVL 660

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           M G  V +++AK +    AI+   YPGQ+GG AIA  L G  NPG
Sbjct: 661 MSGSAVALNWAKAN--ADAIVAAWYPGQSGGTAIARALAGDDNPG 703


>gi|390992294|ref|ZP_10262532.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
 gi|372552957|emb|CCF69507.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
          Length = 886

 Score =  227 bits (579), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 127/303 (41%), Positives = 176/303 (58%), Gaps = 13/303 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++RGLQG   +  +  A  KH 
Sbjct: 132 AGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIATPKHI 191

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS  D+E TY   F+A +VEG+  SVMC+YN ++G P C
Sbjct: 192 AVH-----SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVC 246

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PF 183
           A   +L   + G W   G++VSDCD+V  +    ++      ++A A+KAG DL+CG  +
Sbjct: 247 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY 306

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
             + T  A+  G + E  ++ +L      + RLG  +  P   P+  LG +DV   AH+ 
Sbjct: 307 RELGT--AIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRA 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LALQAA + IVLLKN A TLPL       +AVIGPN+D    +  NY G +    TPL G
Sbjct: 364 LALQAAAESIVLLKNDANTLPLRA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLG 421

Query: 304 ISR 306
           + +
Sbjct: 422 LRQ 424



 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 41/97 (42%), Positives = 54/97 (55%), Gaps = 13/97 (13%)

Query: 344 MGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDV 393
           +GL   +E E +          DR  + LP  QQ L+ R AKAS  P+V+VLM G  V +
Sbjct: 622 VGLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQTLLER-AKASGKPLVVVLMSGSAVAL 680

Query: 394 SFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           ++AK      AI+   YPGQ+GG AIA +L G  NPG
Sbjct: 681 NWAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPG 715


>gi|418519424|ref|ZP_13085476.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
 gi|410704868|gb|EKQ63347.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
          Length = 886

 Score =  227 bits (578), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 127/303 (41%), Positives = 176/303 (58%), Gaps = 13/303 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++RGLQG   +  +  A  KH 
Sbjct: 132 AGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIATPKHI 191

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS  D+E TY   F+A +VEG+  SVMC+YN ++G P C
Sbjct: 192 AVH-----SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVC 246

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PF 183
           A   +L   + G W   G++VSDCD+V  +    ++      ++A A+KAG DL+CG  +
Sbjct: 247 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY 306

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
             + T  A+  G + E  ++ +L      + RLG  +  P   P+  LG +DV   AH+ 
Sbjct: 307 RELGT--AIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRA 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LALQAA + IVLLKN A TLPL       +AVIGPN+D    +  NY G +    TPL G
Sbjct: 364 LALQAAAESIVLLKNDANTLPLRA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLG 421

Query: 304 ISR 306
           + +
Sbjct: 422 LRQ 424



 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 41/97 (42%), Positives = 54/97 (55%), Gaps = 13/97 (13%)

Query: 344 MGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDV 393
           +GL   +E E +          DR  + LP  QQ L+ R AKAS  P+V+VLM G  V +
Sbjct: 622 VGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQTLLER-AKASGKPLVVVLMSGSAVAL 680

Query: 394 SFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           ++AK      AI+   YPGQ+GG AIA +L G  NPG
Sbjct: 681 NWAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPG 715


>gi|294627323|ref|ZP_06705909.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
 gi|292598405|gb|EFF42556.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
          Length = 886

 Score =  227 bits (578), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 127/303 (41%), Positives = 175/303 (57%), Gaps = 13/303 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++RGLQG      +  A  KH 
Sbjct: 132 AGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLDHPRTIATPKHI 191

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS  D+E TY   F+A +VEG+  SVMC+YN ++G P C
Sbjct: 192 AVH-----SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPAC 246

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PF 183
           A   +L   + G W   G++VSDCD+V  +    ++      ++A A+KAG DL+CG  +
Sbjct: 247 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY 306

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
             + T  A+  G + E  ++ +L      + RLG  +  P   P+  LG +DV   AH+ 
Sbjct: 307 RDLGT--AIERGDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRA 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LALQAA + IVLLKN A TLPL       +AVIGPN+D    +  NY G +    TPL G
Sbjct: 364 LALQAAAESIVLLKNDANTLPLKA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLG 421

Query: 304 ISR 306
           + +
Sbjct: 422 LRQ 424



 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 41/97 (42%), Positives = 54/97 (55%), Gaps = 13/97 (13%)

Query: 344 MGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDV 393
           +GL   +E E +          DR  + LP  QQ L+ R AKAS  P+V+VLM G  V +
Sbjct: 622 VGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVAL 680

Query: 394 SFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           ++AK      AI+   YPGQ+GG AIA +L G  NPG
Sbjct: 681 NWAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPG 715


>gi|289670678|ref|ZP_06491753.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
           NCPPB 4381]
          Length = 886

 Score =  227 bits (578), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 126/303 (41%), Positives = 177/303 (58%), Gaps = 13/303 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++RGLQG   +  +  A  KH 
Sbjct: 132 AGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIATPKHL 191

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS  D+E TY   F+A +V+G+  SVMC+YN ++G P C
Sbjct: 192 AVH-----SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCAYNSLHGTPAC 246

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PF 183
           A   +L   + G W   G++VSDCD+V  +    ++      ++A A+KAG DL+CG  +
Sbjct: 247 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY 306

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
             + T  A+  G + E  ++ +L      + RLG  +  P   P+  LG +DV   AH+ 
Sbjct: 307 RELGT--AIERGDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRA 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LALQAA + IVLLKN A TLPL+      +AVIGPN+D    +  NY G +    TPL G
Sbjct: 364 LALQAAAESIVLLKNDANTLPLNA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLG 421

Query: 304 ISR 306
           + +
Sbjct: 422 LRQ 424



 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 45/105 (42%), Positives = 59/105 (56%), Gaps = 13/105 (12%)

Query: 336 QADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           Q+DA V  +GL   +E E +          DR  + LP  QQ L+ R AKAS  P+V+VL
Sbjct: 614 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVL 672

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           M G  V +++AK      AI+   YPGQ+GG AIA +L G  NPG
Sbjct: 673 MSGSAVALNWAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPG 715


>gi|21243803|ref|NP_643385.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
 gi|21109396|gb|AAM37921.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
          Length = 886

 Score =  227 bits (578), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 127/303 (41%), Positives = 176/303 (58%), Gaps = 13/303 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++RGLQG   +  +  A  KH 
Sbjct: 132 AGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIATPKHI 191

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS  D+E TY   F+A +VEG+  SVMC+YN ++G P C
Sbjct: 192 AVH-----SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVC 246

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PF 183
           A   +L   + G W   G++VSDCD+V  +    ++      ++A A+KAG DL+CG  +
Sbjct: 247 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY 306

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
             + T  A+  G + E  ++ +L      + RLG  +  P   P+  LG +DV   AH+ 
Sbjct: 307 RELGT--AIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRA 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LALQAA + IVLLKN A TLPL       +AVIGPN+D    +  NY G +    TPL G
Sbjct: 364 LALQAAAESIVLLKNDANTLPLRA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLG 421

Query: 304 ISR 306
           + +
Sbjct: 422 LRQ 424



 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 41/97 (42%), Positives = 54/97 (55%), Gaps = 13/97 (13%)

Query: 344 MGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDV 393
           +GL   +E E +          DR  + LP  QQ L+ R AKAS  P+V+VLM G  V +
Sbjct: 622 VGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVAL 680

Query: 394 SFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           ++AK      AI+   YPGQ+GG AIA +L G  NPG
Sbjct: 681 NWAKM--HADAIVAAWYPGQSGGTAIARMLAGDDNPG 715


>gi|294665226|ref|ZP_06730524.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292605014|gb|EFF48367.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 886

 Score =  227 bits (578), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 127/303 (41%), Positives = 175/303 (57%), Gaps = 13/303 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++RGLQG      +  A  KH 
Sbjct: 132 AGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLDHPRTIATPKHI 191

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS  D+E TY   F+A +VEG+  SVMC+YN ++G P C
Sbjct: 192 AVH-----SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPAC 246

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PF 183
           A   +L   + G W   G++VSDCD+V  +    ++      ++A A+KAG DL+CG  +
Sbjct: 247 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY 306

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
             + T  A+  G + E  ++ +L      + RLG  +  P   P+  LG +DV   AH+ 
Sbjct: 307 RDLGT--AIERGDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRA 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LALQAA + IVLLKN A TLPL       +AVIGPN+D    +  NY G +    TPL G
Sbjct: 364 LALQAAAESIVLLKNDANTLPLKA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLG 421

Query: 304 ISR 306
           + +
Sbjct: 422 LRQ 424



 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/97 (41%), Positives = 54/97 (55%), Gaps = 13/97 (13%)

Query: 344 MGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDV 393
           +GL   +E E +          DR  + LP  QQ L+ R AKAS  P+V+VLM G  V +
Sbjct: 622 VGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVAL 680

Query: 394 SFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           ++AK      AI+   YPGQ+GG A+A +L G  NPG
Sbjct: 681 NWAKT--HADAIVAAWYPGQSGGTAMARMLAGDDNPG 715


>gi|289664871|ref|ZP_06486452.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. vasculorum
           NCPPB 702]
          Length = 886

 Score =  227 bits (578), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 126/303 (41%), Positives = 177/303 (58%), Gaps = 13/303 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++RGLQG   +  +  A  KH 
Sbjct: 132 AGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIATPKHL 191

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS  D+E TY   F+A +V+G+  SVMC+YN ++G P C
Sbjct: 192 AVH-----SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCAYNSLHGTPAC 246

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PF 183
           A   +L   + G W   G++VSDCD+V  +    ++      ++A A+KAG DL+CG  +
Sbjct: 247 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY 306

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
             + T  A+  G + E  ++ +L      + RLG  +  P   P+  LG +DV   AH+ 
Sbjct: 307 RELGT--AIERGDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRA 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LALQAA + IVLLKN A TLPL+      +AVIGPN+D    +  NY G +    TPL G
Sbjct: 364 LALQAAAESIVLLKNDANTLPLNA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLG 421

Query: 304 ISR 306
           + +
Sbjct: 422 LRQ 424



 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 45/105 (42%), Positives = 59/105 (56%), Gaps = 13/105 (12%)

Query: 336 QADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           Q+DA V  +GL   +E E +          DR  + LP  QQ L+ R AKAS  P+V+VL
Sbjct: 614 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVL 672

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           M G  V +++AK      AI+   YPGQ+GG AIA +L G  NPG
Sbjct: 673 MSGSAVALNWAKT--HADAIVAAWYPGQSGGTAIARMLAGDDNPG 715


>gi|393786911|ref|ZP_10375043.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
           CL02T12C05]
 gi|392658146|gb|EIY51776.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
           CL02T12C05]
          Length = 863

 Score =  226 bits (577), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 129/317 (40%), Positives = 183/317 (57%), Gaps = 15/317 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT+W+PNVNIFRDPRWGRGQET GEDP LT +   + VRGLQG + S+  K+ AC KHY
Sbjct: 129 GLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQGPSESKYDKLHACAKHY 188

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ F+   +S +DL +TY   FKA V +G V  VMC+YN+  G+P C 
Sbjct: 189 ALHSGPEWN---RHRFDVENISPRDLWETYLPAFKALVQQGGVKEVMCAYNRFEGEPCCG 245

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYT--RTPEEAAADAIKAGLDLDCGPF 183
              +L N +  +W  DG +VSDC ++   Y   H+    T E A A A+KAG DLDCG  
Sbjct: 246 SNRLLYNILREEWGFDGLVVSDCGAISDFYLKGHHETHSTKESAVAAAVKAGTDLDCGVD 305

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                E AV  G++ E+ ++++L+  +  +  LG+ D E     + ++    V +  H+ 
Sbjct: 306 YQ-SLEKAVEKGIITEKQIDVSLSRLLKARFELGLMD-EEHLVSWSDIPYTVVDSEKHRA 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRH-HTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
            AL+ A + + LLKN   TLPLS  +H   + VIGPN++ ++ M GNY G      T L+
Sbjct: 364 KALEVARKSMTLLKNKNGTLPLS--KHCGKIVVIGPNANDSIMMWGNYNGFPSHTVTILE 421

Query: 303 GISRY---AKTIHQAGC 316
           GI+      + I+  GC
Sbjct: 422 GITHKLDAGQVIYDKGC 438



 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 42/111 (37%), Positives = 61/111 (54%), Gaps = 15/111 (13%)

Query: 331 EVAA--RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASR 378
           E+AA    A+A V V G+   +E E +          DR  + LP  Q++L+  + K  +
Sbjct: 592 EIAATVSDAEAIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELYKTGK 651

Query: 379 GPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            P++L+L  G  + +S A+ D    AI+   YPGQAGG A+ADVLFG  NP
Sbjct: 652 -PIILILCSGSAIGLS-AEVD-LADAIIQAWYPGQAGGTAVADVLFGDYNP 699


>gi|413925165|gb|AFW65097.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 412

 Score =  226 bits (577), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 104/184 (56%), Positives = 133/184 (72%), Gaps = 8/184 (4%)

Query: 1   MYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR--- 56
            YN G A GLT WSPNVNIFRDPRWGRGQETPGEDP +  +YAA++VRGLQG++ +    
Sbjct: 199 FYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQGSSSNTKSV 258

Query: 57  ---LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC 113
              L  +ACCKH TAYDL++W GV RY F A V+ QDL DT+N PF++CVV+GK + VMC
Sbjct: 259 PPVLLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMC 318

Query: 114 SYNQVNGKPTCADPDILKNTIHGQWRLDG-YIVSDCDSVGVLYNTQHYTRTPEEAAADAI 172
           +Y  VNG P+CA+ D+L  T  G W LDG Y+ +DCD+V ++ N+Q Y  T E+  A  +
Sbjct: 319 AYTSVNGVPSCANADLLTKTFRGSWGLDGRYVAADCDAVSIMRNSQFYRPTAEDTVATTL 378

Query: 173 KAGL 176
           KAG+
Sbjct: 379 KAGM 382


>gi|381169747|ref|ZP_09878910.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
 gi|380689765|emb|CCG35397.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
          Length = 874

 Score =  226 bits (577), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 127/303 (41%), Positives = 176/303 (58%), Gaps = 13/303 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++RGLQG   +  +  A  KH 
Sbjct: 120 AGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIATPKHI 179

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS  D+E TY   F+A +VEG+  SVMC+YN ++G P C
Sbjct: 180 AVH-----SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVC 234

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PF 183
           A   +L   + G W   G++VSDCD+V  +    ++      ++A A+KAG DL+CG  +
Sbjct: 235 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY 294

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
             + T  A+  G + E  ++ +L      + RLG  +  P   P+  LG +DV   AH+ 
Sbjct: 295 RELGT--AIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRA 351

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LALQAA + IVLLKN A TLPL       +AVIGPN+D    +  NY G +    TPL G
Sbjct: 352 LALQAAAESIVLLKNDANTLPLRA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLG 409

Query: 304 ISR 306
           + +
Sbjct: 410 LRQ 412



 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 41/97 (42%), Positives = 54/97 (55%), Gaps = 13/97 (13%)

Query: 344 MGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDV 393
           +GL   +E E +          DR  + LP  QQ L+ R AKAS  P+V+VLM G  V +
Sbjct: 610 VGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVAL 668

Query: 394 SFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           ++AK      AI+   YPGQ+GG AIA +L G  NPG
Sbjct: 669 NWAKM--HADAIVAAWYPGQSGGTAIARMLAGDDNPG 703


>gi|424796589|ref|ZP_18222299.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
 gi|422794891|gb|EKU23686.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
          Length = 913

 Score =  226 bits (576), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 128/310 (41%), Positives = 179/310 (57%), Gaps = 15/310 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTG---------SRLK 58
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQG            +  K
Sbjct: 142 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGDAYRK 201

Query: 59  VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
           + A  KH+  +   +    DR+HF+A  S++DL +TY   F+A V EGKV +VM +YN+V
Sbjct: 202 LDATAKHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRV 258

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            G+   A   +L++ +   W  DGY+VSDC ++  ++       T E+AAA A+  G +L
Sbjct: 259 YGESASASKFLLRDVLRDTWGFDGYVVSDCWAIVDIWKNHKIVATREQAAALAVNNGTEL 318

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG   +     AVR GL+ E DV+ AL   +  +MRLGMFD  P    +  +      +
Sbjct: 319 ECGEEYST-LPAAVRKGLISEADVDKALQKLMYSRMRLGMFD-PPDTLRWAQIPLSANQS 376

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           P H  LA + A + +VLLKN    LPLS  +   +AVIGP +D T+ ++GNY G      
Sbjct: 377 PEHDALARRTARESLVLLKNDG-VLPLSRGKIKRIAVIGPTADDTMALLGNYYGTPAAPV 435

Query: 299 TPLQGISRYA 308
           T LQGI   A
Sbjct: 436 TVLQGIREAA 445



 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 46/108 (42%), Positives = 60/108 (55%), Gaps = 13/108 (12%)

Query: 333 AARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVV 382
           AAR+AD  V V GL   +E E +          DR  L LP  Q+EL+  +    + PVV
Sbjct: 634 AARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQGTGK-PVV 692

Query: 383 LVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            VL  G  + + +A+    + AIL   YPGQ GG+A+ADVLFG ANPG
Sbjct: 693 AVLTTGSALAIDWAQQ--HVPAILLAWYPGQRGGSAVADVLFGDANPG 738


>gi|423290405|ref|ZP_17269254.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
           CL02T12C04]
 gi|392665792|gb|EIY59315.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
           CL02T12C04]
          Length = 861

 Score =  226 bits (575), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 124/316 (39%), Positives = 184/316 (58%), Gaps = 16/316 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT+W+PNVNIFRDPRWGRGQET GEDP LTG+   + VRGLQG   +R  K+ AC KH+
Sbjct: 131 GLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAKHF 190

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ F+A  +  +DL +TY   FK  V +  V  VMC+YN+  G+P C 
Sbjct: 191 AVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCG 247

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +  +W   G +VSDC ++   Y    +   P  E A+ADA++AG DL+CG  
Sbjct: 248 SNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPDKEHASADAVRAGTDLECGSE 307

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A   + AV+ GL+ E++++++L   +T +  LG  D +P+   +  +    + +  HQ 
Sbjct: 308 YASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WAEIPTSVLNSKEHQA 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ A + +VLL+N    LPL+T  H  VAV+GPN++ +V   GNY G+     T L+ 
Sbjct: 364 LALRMARESLVLLQNKNNILPLNT--HLKVAVMGPNANDSVMQWGNYNGIPAHTVTLLEA 421

Query: 304 ISRY---AKTIHQAGC 316
           +       + I++ GC
Sbjct: 422 VRAKLPEGQIIYEPGC 437



 Score = 40.4 bits (93), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 13/104 (12%)

Query: 337 ADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLM 386
           AD  +   G+  S+E E +          DR  + LP  Q++L+  + KA +  VV +  
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK-VVFINY 656

Query: 387 CGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            G  +            AIL   YPGQAGG AI D L+G  NPG
Sbjct: 657 SGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|384421334|ref|YP_005630694.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
 gi|353464247|gb|AEQ98526.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
          Length = 904

 Score =  225 bits (574), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 127/306 (41%), Positives = 178/306 (58%), Gaps = 15/306 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA------- 60
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+GLQG   +  K A       
Sbjct: 142 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPKNAQGEPYRK 201

Query: 61  --ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
             A  KH+  +        +R+HF+AR S++DL +TY   F+A V +GKV +VM +YN+V
Sbjct: 202 LDATAKHFAVHSGPE---AERHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRV 258

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL 178
            G+   A   +L++ +  QW   GY+VSDC ++  ++       T E+AAA A+  G +L
Sbjct: 259 YGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATREQAAALAVTHGTEL 318

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG   +     AV  GL+ E  ++ AL   +T +MRLGMFD  P   P+  +      +
Sbjct: 319 ECGEEYST-LPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD-PPGQLPWSKIPASVNQS 376

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           PAH  LA + A + +VLLKN    LPLS      +AVIGP +D T+ ++GNY G      
Sbjct: 377 PAHDALARRTARESLVLLKNDG-LLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPV 435

Query: 299 TPLQGI 304
           T LQGI
Sbjct: 436 TVLQGI 441



 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 45/107 (42%), Positives = 58/107 (54%), Gaps = 13/107 (12%)

Query: 334 ARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVL 383
           AR AD  V V GL   +E E +          DR  L LP  Q+EL+  + +A+  PVV 
Sbjct: 635 ARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEAL-QATGKPVVA 693

Query: 384 VLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           VL  G  + + +A+    + AIL   YPGQ GG A+AD LFG ANPG
Sbjct: 694 VLTAGSALAIDWAQQ--HVPAILLAWYPGQRGGTAVADTLFGDANPG 738


>gi|389737578|ref|ZP_10190998.1| beta-glucosidase [Rhodanobacter sp. 115]
 gi|388434298|gb|EIL91245.1| beta-glucosidase [Rhodanobacter sp. 115]
          Length = 898

 Score =  225 bits (574), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 127/303 (41%), Positives = 176/303 (58%), Gaps = 9/303 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
            G   GLT+WSPN NIFRDPRWGRGQET GEDP LT +   ++VRGL+G+  +  K+ A 
Sbjct: 143 RGRYEGLTFWSPNTNIFRDPRWGRGQETYGEDPYLTSRMGVAFVRGLEGDDPTYQKLDAT 202

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
            KH+  +        +R+ F+   S++DL +TY   F+A V +G V +VM +YN+V+G P
Sbjct: 203 AKHFAVHSGPE---SERHRFDVHPSERDLHETYLPAFQALVQQGGVDAVMGAYNRVDGVP 259

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP 182
             A   +L++ +   W   GY+VSDCD+V  +Y       T E+AAA A+  G DL+CG 
Sbjct: 260 ATASHRLLQDILRRDWGFKGYVVSDCDAVADIYQFHKVVPTAEQAAALAVNNGDDLNCGT 319

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
             A   + AV  GL+ E  ++ A+   +  + RLGMFD  P   P+  L    V +P H 
Sbjct: 320 TYATLVK-AVHDGLVNEHTIDTAVTRLMLARFRLGMFD-PPGRVPWSTLPMSVVQSPQHD 377

Query: 243 QLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
            LAL+ A + +VLLKN    LPLS  +R   +AVIGP +D    ++GNY G      T L
Sbjct: 378 ALALRTAQESMVLLKNDG-LLPLSHNVRR--IAVIGPTADNVTALLGNYHGTPKAPVTIL 434

Query: 302 QGI 304
           QGI
Sbjct: 435 QGI 437



 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/111 (38%), Positives = 58/111 (52%), Gaps = 13/111 (11%)

Query: 329 AAEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASR 378
           AA  AAR AD  +   GL   +E E +          DR  L LP  Q++L+  + + + 
Sbjct: 625 AALDAARHADVVIFAGGLSSDLEGEEMPVDYPGFAGGDRTTLALPATQRKLLQAL-QVTG 683

Query: 379 GPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            PVVLVL  G  + + +AK    + AIL   YPGQ GG A+AD LFG  +P
Sbjct: 684 KPVVLVLTTGSALAIDWAKQ--HLPAILLAWYPGQDGGHAVADALFGNVDP 732


>gi|423215029|ref|ZP_17201557.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692292|gb|EIY85530.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 861

 Score =  225 bits (574), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 124/316 (39%), Positives = 184/316 (58%), Gaps = 16/316 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT+W+PNVNIFRDPRWGRGQET GEDP LTG+   + VRGLQG   +R  K+ AC KH+
Sbjct: 131 GLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAKHF 190

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ F+A  +  +DL +TY   FK  V +  V  VMC+YN+  G+P C 
Sbjct: 191 AVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCG 247

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +  +W   G +VSDC ++   Y    +   P  E A+ADA++AG DL+CG  
Sbjct: 248 SNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASADAVRAGTDLECGSE 307

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A   + AV+ GL+ E++++++L   +T +  LG  D +P+   +  +    + +  HQ 
Sbjct: 308 YASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPTSVLNSKEHQA 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ A + +VLL+N    LPL+T  H  VAV+GPN++ +V   GNY G+     T L+ 
Sbjct: 364 LALRMARESLVLLQNKNNILPLNT--HLKVAVMGPNANDSVMQWGNYNGIPAHTVTLLEA 421

Query: 304 ISRY---AKTIHQAGC 316
           +       + I++ GC
Sbjct: 422 VRAKLPEGQIIYEPGC 437



 Score = 40.4 bits (93), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 13/104 (12%)

Query: 337 ADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLM 386
           AD  +   G+  S+E E +          DR  + LP  Q++L+  + KA +  VV +  
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK-VVFINY 656

Query: 387 CGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            G  +            AIL   YPGQAGG AI D L+G  NPG
Sbjct: 657 SGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|418518029|ref|ZP_13084183.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
 gi|410705279|gb|EKQ63755.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
          Length = 886

 Score =  225 bits (573), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 129/305 (42%), Positives = 178/305 (58%), Gaps = 17/305 (5%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++RGLQG   +  +  A  KH 
Sbjct: 132 AGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLNHPRTIATPKHI 191

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS  D+E TY   F+A +VEG+  SVMC+YN ++G P C
Sbjct: 192 AVH-----SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVC 246

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAAD--AIKAGLDLDCG- 181
           A   +L   + G W   G++VSDCD++  +  TQ +   P+ A +   A+KAG DL+CG 
Sbjct: 247 AADWLLNGRVRGDWGFKGFVVSDCDAIDDM--TQFHYFRPDNAGSSVAALKAGHDLNCGH 304

Query: 182 PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAH 241
            +  + T  A+  G + E  ++ +L      + RLG  +  P   P+  LG +DV   AH
Sbjct: 305 AYRELGT--AIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNVAH 361

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
           + LALQAA + IVLLKN A TLPL       +AVIGPN+D    +  NY G +    TPL
Sbjct: 362 RALALQAAAESIVLLKNDANTLPLRA--GTRLAVIGPNADALAALEANYQGTSSAPVTPL 419

Query: 302 QGISR 306
            G+ +
Sbjct: 420 LGLRQ 424



 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 37/75 (49%), Positives = 47/75 (62%), Gaps = 3/75 (4%)

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  + LP  QQ L+ R AKAS  P+V+VLM G  V +++AK      AI+   YPGQ+G
Sbjct: 644 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKT--HADAIMAAWYPGQSG 700

Query: 416 GAAIADVLFGRANPG 430
           G AIA +L G  NPG
Sbjct: 701 GTAIARMLAGDDNPG 715


>gi|299147288|ref|ZP_07040353.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298514566|gb|EFI38450.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 861

 Score =  225 bits (573), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 124/316 (39%), Positives = 184/316 (58%), Gaps = 16/316 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT+W+PNVNIFRDPRWGRGQET GEDP LTG+   + VRGLQG   +R  K+ AC KH+
Sbjct: 131 GLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAKHF 190

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ F+A  +  +DL +TY   FK  V +  V  VMC+YN+  G+P C 
Sbjct: 191 AVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCG 247

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +  +W   G +VSDC ++   Y    +   P  E A+ADA++AG DL+CG  
Sbjct: 248 SNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASADAVRAGTDLECGSE 307

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A   + AV+ GL+ E++++++L   +T +  LG  D +P+   +  +    + +  HQ 
Sbjct: 308 YASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPTSVLNSKEHQA 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ A + +VLL+N    LPL+T  H  VAV+GPN++ +V   GNY G+     T L+ 
Sbjct: 364 LALRMARESLVLLQNKNNILPLNT--HLKVAVMGPNANDSVMQWGNYNGIPAHTVTLLEA 421

Query: 304 ISRY---AKTIHQAGC 316
           +       + I++ GC
Sbjct: 422 VRAKLPEGQIIYEPGC 437



 Score = 39.7 bits (91), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 47/104 (45%), Gaps = 13/104 (12%)

Query: 337 ADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLM 386
           AD  +   G+  S+E E +          DR  + LP  Q+ L+  + KA +  VV +  
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKAGKK-VVFINY 656

Query: 387 CGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            G  +            AIL   YPGQAGG AI D L+G  NPG
Sbjct: 657 SGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|298481648|ref|ZP_06999839.1| beta-glucosidase [Bacteroides sp. D22]
 gi|298272189|gb|EFI13759.1| beta-glucosidase [Bacteroides sp. D22]
          Length = 861

 Score =  225 bits (573), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 124/316 (39%), Positives = 184/316 (58%), Gaps = 16/316 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT+W+PNVNIFRDPRWGRGQET GEDP LTG+   + VRGLQG   +R  K+ AC KH+
Sbjct: 131 GLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAKHF 190

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ F+A  +  +DL +TY   FK  V +  V  VMC+YN+  G+P C 
Sbjct: 191 AVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCG 247

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +  +W   G +VSDC ++   Y    +   P  E A+ADA++AG DL+CG  
Sbjct: 248 SNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASADAVRAGTDLECGSE 307

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A   + AV+ GL+ E++++++L   +T +  LG  D +P+   +  +    + +  HQ 
Sbjct: 308 YASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPTSVLNSKEHQA 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ A + +VLL+N    LPL+T  H  VAV+GPN++ +V   GNY G+     T L+ 
Sbjct: 364 LALRMARESLVLLQNKNNILPLNT--HLKVAVMGPNANDSVMQWGNYNGIPAHTVTLLEA 421

Query: 304 ISRY---AKTIHQAGC 316
           +       + I++ GC
Sbjct: 422 VRAKLPEGQIIYEPGC 437



 Score = 39.7 bits (91), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 47/104 (45%), Gaps = 13/104 (12%)

Query: 337 ADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLM 386
           AD  +   G+  S+E E +          DR  + LP  Q+ L+  + KA +  VV +  
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKAGKK-VVFINY 656

Query: 387 CGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            G  +            AIL   YPGQAGG AI D L+G  NPG
Sbjct: 657 SGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|325922365|ref|ZP_08184139.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
 gi|325547147|gb|EGD18227.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
          Length = 889

 Score =  224 bits (572), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 126/310 (40%), Positives = 181/310 (58%), Gaps = 14/310 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++RGLQG+     +  A  KH 
Sbjct: 135 AGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLDHPRTIATPKHI 194

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS +D+E TY   F+A +++G+  SVMC+YN ++G P C
Sbjct: 195 AVH-----SGPEPGRHSFDVDVSPRDVEATYTPAFRAALIDGQAGSVMCAYNSLHGTPAC 249

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PF 183
           A   +L   + G W   G++VSDCD+V  +    ++      ++A ++KAG DL+CG  +
Sbjct: 250 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGYAY 309

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A+ T  A+  G + E  ++ +L      + RLG  +  P   P+  LG +D+   A++ 
Sbjct: 310 RALGT--AIERGEVDEALLDQSLVRLFAARYRLGELEA-PHKDPYATLGAKDIDNTANRA 366

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+AA Q IVLLKN A TLPL       +AVIGPN+D    +  NY G +    TPL G
Sbjct: 367 LALKAAAQSIVLLKNDANTLPLKA--GARLAVIGPNADALAALEANYQGTSSTPVTPLLG 424

Query: 304 ISRYAKTIHQ 313
           + R    +HQ
Sbjct: 425 L-RQRFGVHQ 433



 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 37/75 (49%), Positives = 47/75 (62%), Gaps = 3/75 (4%)

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  + LP  QQ L+ R AKAS  P+V+VLM G  V +++AK      AI+   YPGQ+G
Sbjct: 647 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKT--HADAIVAAWYPGQSG 703

Query: 416 GAAIADVLFGRANPG 430
           G AIA +L G  NPG
Sbjct: 704 GTAIARMLAGDDNPG 718


>gi|189468358|ref|ZP_03017143.1| hypothetical protein BACINT_04755 [Bacteroides intestinalis DSM
           17393]
 gi|189436622|gb|EDV05607.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 865

 Score =  224 bits (571), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 124/306 (40%), Positives = 181/306 (59%), Gaps = 12/306 (3%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN-TGSRLKVAAC 62
           GG  GLT+W+PN+NI+RDPRWGRG ET GEDP LT     + V+GLQGN  G   K  AC
Sbjct: 126 GGYKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGNGAGKYDKAHAC 185

Query: 63  CKHYTAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            KHY  +    WN   R+ F+++ +S++DL +TY   FK  V EGKV  VMC+YN+  G+
Sbjct: 186 AKHYAVHSGPEWN---RHSFDSKNISQRDLWETYLPAFKTLVTEGKVKEVMCAYNRFEGE 242

Query: 122 PTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLD 179
           P C++  +L   +   W  D  +VSDC ++G  Y   H+   P  E A+ADA+ +G DL+
Sbjct: 243 PCCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPSAEAASADAVVSGTDLE 302

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG   +   E AV+ GL+ E+ +N ++   +  + +LGMFD + +   +  +    V + 
Sbjct: 303 CGGSYSSLNE-AVKKGLITEDKINESVFRLLRARFQLGMFDDD-TLVSWSEIPYSVVESK 360

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
            H   AL+ A + +VLL N   +LPLS ++R   VAV+GPN++ +V +  NY G      
Sbjct: 361 EHVDKALEMARKSMVLLTNKNNSLPLSKSIRK--VAVLGPNANDSVMLWANYNGFPTKSV 418

Query: 299 TPLQGI 304
           T L+GI
Sbjct: 419 TILEGI 424



 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 33/99 (33%), Positives = 52/99 (52%), Gaps = 13/99 (13%)

Query: 341 VLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGP 390
           + V GL  ++E E +          DR  + LP  Q+E++  + K  + PV+ V+  G  
Sbjct: 605 IFVGGLSSALEGEEMPVDLPGFKKGDRTNIDLPRVQEEMLKALKKTGK-PVIFVVCSGST 663

Query: 391 VDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           + + +   +  + A+L   YPGQ GG A+ADVLFG  NP
Sbjct: 664 LALPWEAEN--LDAMLEAWYPGQQGGTAVADVLFGDYNP 700


>gi|383114360|ref|ZP_09935124.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
 gi|313693934|gb|EFS30769.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
          Length = 863

 Score =  224 bits (570), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 121/301 (40%), Positives = 175/301 (58%), Gaps = 13/301 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT W+PNVNIFRDPRWGRGQET GEDP L+G+   + VRGLQG   +   K+ AC KH+
Sbjct: 133 GLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLHACAKHF 192

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  ++ +DL +TY   FK  V +  V  VMC+YN+  G P C 
Sbjct: 193 AVHSGPEWN---RHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCG 249

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE--EAAADAIKAGLDLDCGPF 183
              +L   +   W   G +V+DC ++G  +  + +   P+   A+ADA+ +G DL+CG  
Sbjct: 250 SNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLECGGN 309

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
               T+ AV+ GL+ EE +N ++   +  +  LG  +   S  P+ N+    +  P H++
Sbjct: 310 FKSITD-AVKKGLISEEKINTSVKRLLKARFELGEMN---STHPWSNIPFSVIDCPKHKE 365

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ AH+ +VLL+N+   LPL+  R   VAVIGPN++ +V   GNY G      T L+G
Sbjct: 366 LALKMAHESLVLLQNNNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGFPSHTVTLLEG 423

Query: 304 I 304
           I
Sbjct: 424 I 424



 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 49/105 (46%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  +   G+   +E E +          DR  + LP  Q+E+++ + K  +  V + 
Sbjct: 598 QSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKKTVFVN 657

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
                   V   +N     AIL   YPGQAGG A+ADVLFG  NP
Sbjct: 658 FSGSAMAIVPETQN---CDAILQAWYPGQAGGTAVADVLFGDYNP 699


>gi|431798021|ref|YP_007224925.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
           DSM 17526]
 gi|430788786|gb|AGA78915.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
           DSM 17526]
          Length = 906

 Score =  224 bits (570), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 134/330 (40%), Positives = 186/330 (56%), Gaps = 28/330 (8%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G   GL +WSPN+NIFRDPRWGRG ET GEDP LTG+ A+ ++ GLQ + G  LK  A  
Sbjct: 153 GIYTGLDFWSPNINIFRDPRWGRGHETYGEDPYLTGELASQFIEGLQDSDGKYLKTIATS 212

Query: 64  KHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
           KH+  +     +G +  R+ F+  VS +DL +TY   F+  V E KV S+M +YN+  G+
Sbjct: 213 KHFAVH-----SGPEPLRHTFDVDVSDRDLYETYLPAFRKTVKEAKVYSIMGAYNRFRGE 267

Query: 122 PTCADPDILKNT-IHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
            +C+  D L N  +  QW  +GY+VSDC ++  ++       T  EAAA  +  G DL+C
Sbjct: 268 -SCSGHDFLLNQLLREQWGFEGYVVSDCGAIQDIHTGHKIASTAAEAAAIGVSGGCDLNC 326

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQ----PFGNLGPRDV 236
           G +    TE AV  GL+ EE++++A+      + RLGMFD E +      PFG      V
Sbjct: 327 GNYYTHLTE-AVAEGLISEEEIDIAVKRLFLARFRLGMFDPEEAVSYAQIPFGI-----V 380

Query: 237 CTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
           C+ AH  LA QAA + +VLLKN    LPLS  +   +AVIGPN+D   +++GNY G+   
Sbjct: 381 CSEAHNTLARQAAQKSMVLLKNQKNLLPLSVDKIKRIAVIGPNADNVESLLGNYHGIPKK 440

Query: 297 YTTPLQGISR---------YAKTIHQAGCF 317
             T L GI           Y + +H A  F
Sbjct: 441 PVTFLDGIKHKVGPKAEVLYTEGVHPAEGF 470



 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 43/113 (38%), Positives = 65/113 (57%), Gaps = 13/113 (11%)

Query: 327 IGAAEVAARQADATVLVMGLDQSIEAEFID----------RAGLLLPGRQQELVSRVAKA 376
           I  A   A+ AD  V+V+GL Q +E E +D          R  + LP +Q+ L+  V + 
Sbjct: 617 IDEAVAMAKSADLAVVVLGLSQRLEGESMDVVTPGFDRGDRTAITLPAQQEALLKAVKET 676

Query: 377 SRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            + PV+LVL  G  + +++AK +  + AI+  GYPG+ GG A+ADV+FG  NP
Sbjct: 677 GK-PVILVLNAGSAMAINWAKEN--VDAIISAGYPGEEGGNALADVVFGDYNP 726


>gi|408369545|ref|ZP_11167326.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
 gi|407745291|gb|EKF56857.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
          Length = 881

 Score =  224 bits (570), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 130/304 (42%), Positives = 182/304 (59%), Gaps = 13/304 (4%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G   GLT+WSPN+NIFRDPRWGRG ET GEDP LTG+   +YV+GLQGN  + LK+ A  
Sbjct: 130 GIYQGLTFWSPNINIFRDPRWGRGHETYGEDPYLTGQLGIAYVKGLQGNDPNYLKLVATA 189

Query: 64  KHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
           KH+  +     +G +  R+ F+   SK+DL +TY   F+  V +G V SVM +YN+V G+
Sbjct: 190 KHFAVH-----SGPEPLRHEFDVSPSKRDLWETYLPAFRYLVKQGDVKSVMTAYNRVYGE 244

Query: 122 PTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG 181
              A  D L   +   W  DGY+VSDC ++  ++      +   EA+A A+  G DL+CG
Sbjct: 245 AASAS-DTLFTILRDYWDFDGYVVSDCFAISDIWKYHKIAKDAAEASAMAVIEGCDLNCG 303

Query: 182 PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT-PA 240
                  + A + G++ E+D+++AL+  +  +++LGMFD E    P+  + P +V T   
Sbjct: 304 DSYEKLNQ-AYQQGMVTEKDIDIALSRLMEARIKLGMFDPE-QLVPYAQI-PFNVNTSEK 360

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H QLAL+AA + IVLLKN    LPLS     +VAVIGPN+D   ++ GNY G      T 
Sbjct: 361 HNQLALKAAKESIVLLKNQGDLLPLSK-DLKSVAVIGPNADNIQSLWGNYNGNPKDPITV 419

Query: 301 LQGI 304
           LQGI
Sbjct: 420 LQGI 423



 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 39/98 (39%), Positives = 58/98 (59%), Gaps = 13/98 (13%)

Query: 342 LVMGLDQSIEAEFID----------RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPV 391
           +V+GL++ +E E +D          R  L LP  Q+ L+  VAK  + P+VLVL+ G  +
Sbjct: 607 MVLGLNERLEGEEMDVVVEGFAGGDRTALDLPASQRTLLKEVAKTGK-PIVLVLLNGSAL 665

Query: 392 DVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            +++A  +  I AI+  GY GQ GG A+A+VLFG  NP
Sbjct: 666 SINWAAEN--IPAIMTAGYAGQQGGNAVAEVLFGDYNP 701


>gi|160886913|ref|ZP_02067916.1| hypothetical protein BACOVA_04927 [Bacteroides ovatus ATCC 8483]
 gi|423288977|ref|ZP_17267828.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
           CL02T12C04]
 gi|423294866|ref|ZP_17272993.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
           CL03T12C18]
 gi|156107324|gb|EDO09069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
 gi|392668741|gb|EIY62235.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
           CL02T12C04]
 gi|392676057|gb|EIY69498.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
           CL03T12C18]
          Length = 863

 Score =  224 bits (570), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 121/301 (40%), Positives = 175/301 (58%), Gaps = 13/301 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT W+PNVNIFRDPRWGRGQET GEDP L+G+   + VRGLQG   +   K+ AC KH+
Sbjct: 133 GLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLHACAKHF 192

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  ++ +DL +TY   FK  V +  V  VMC+YN+  G P C 
Sbjct: 193 AVHSGPEWN---RHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCG 249

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE--EAAADAIKAGLDLDCGPF 183
              +L   +   W   G +V+DC ++G  +  + +   P+   A+ADA+ +G DL+CG  
Sbjct: 250 SNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLECGGN 309

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
               T+ AV+ GL+ EE +N ++   +  +  LG  +   S  P+ N+    +  P H++
Sbjct: 310 FKSITD-AVKKGLISEEKINTSVKRLLKARFELGEMN---STHPWSNIPFSVIDCPKHKE 365

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ AH+ +VLL+N+   LPL+  R   VAVIGPN++ +V   GNY G      T L+G
Sbjct: 366 LALKMAHESLVLLQNNNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGFPSHTVTLLEG 423

Query: 304 I 304
           I
Sbjct: 424 I 424



 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 49/105 (46%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  +   G+   +E E +          DR  + LP  Q+E+++ + K  +  V + 
Sbjct: 598 QSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKKTVFVN 657

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
                   V   +N     AIL   YPGQAGG A+ADVLFG  NP
Sbjct: 658 FSGSAMAIVPETQN---CDAILQAWYPGQAGGTAVADVLFGDYNP 699


>gi|15837447|ref|NP_298135.1| family 3 glycoside hydrolase [Xylella fastidiosa 9a5c]
 gi|9105751|gb|AAF83655.1|AE003924_1 family 3 glycoside hydrolase [Xylella fastidiosa 9a5c]
          Length = 882

 Score =  223 bits (569), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 129/312 (41%), Positives = 181/312 (58%), Gaps = 13/312 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A S++RGLQGN     +  A  KH+
Sbjct: 128 AGLTLWSPNINIFRDPRWGRGMETYGEDPYLTGQLAVSFIRGLQGNIPDHPRTIATPKHF 187

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS  DLE TY   F+A +V+G   SVMC+YN ++G P C
Sbjct: 188 AVH-----SGPEPGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPAC 242

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFL 184
           A   +L   +   W  +G++VSDCD++  +     + +    A+A A+K+G DL+CG   
Sbjct: 243 ASDWLLNTRLRNDWGFNGFVVSDCDAIDDMTRFHFFRQDNASASAAALKSGNDLNCGNTY 302

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
               +   RG  + E  ++ AL      + RLG         P+  +G + + TPAH+ L
Sbjct: 303 RDLNQAIARGD-IDEALLDQALIRLFAARQRLGTLQPR-EHDPYATIGIKHIDTPAHRAL 360

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           ALQAA Q +VLLKNS  TLPL+     T+AV+GP++D    +  NY G +    TPL G+
Sbjct: 361 ALQAAVQSLVLLKNSGNTLPLTPGT--TLAVLGPDADSLTALEANYQGTSSTPVTPLTGL 418

Query: 305 -SRY-AKTIHQA 314
            +R+ A  IH A
Sbjct: 419 RTRFGAAKIHYA 430



 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 60/114 (52%), Gaps = 13/114 (11%)

Query: 327 IGAAEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKA 376
           +  AE A   ADA V  +GL   +E E +          DR  + LP  Q+ L+  V K 
Sbjct: 601 LAEAERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHV-KT 659

Query: 377 SRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +  P+++VLM G  V +++A++     AIL   YPGQ+GG AIA  L G  NPG
Sbjct: 660 TGKPLIVVLMSGSAVALNWAQH--HANAILAAWYPGQSGGTAIAQALAGDVNPG 711


>gi|323451996|gb|EGB07871.1| hypothetical protein AURANDRAFT_71699 [Aureococcus anophagefferens]
          Length = 1202

 Score =  223 bits (569), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 172/464 (37%), Positives = 242/464 (52%), Gaps = 46/464 (9%)

Query: 5   GMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           G  GLT+++PNVN+ RDPRWGR +E P EDP+L G Y A +VRG QG+   R+   A  K
Sbjct: 455 GCLGLTFYAPNVNLARDPRWGRIEEVPSEDPLLNGVYGAEFVRGFQGDGAYRV-ANAVVK 513

Query: 65  HYTAYDLD---------NWNGV-------DRYHFNARVSKQDLEDTYNVPFKACVVEGKV 108
           H+  Y+L+         +W G        DR+ F+ARVS +D E+TY  PF A  V    
Sbjct: 514 HFAVYNLEVDVEDTPPADWCGSAACAPPNDRHSFDARVSPRDFEETYVGPFVA-PVAAGA 572

Query: 109 ASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAA 168
           A+ MCSYN VNG+P C D  +L+  + G     G + +DC ++           T  EAA
Sbjct: 573 AAAMCSYNAVNGEPACTDGALLRGALRGALNFTGVLATDCGALEDAVARHKRYATEAEAA 632

Query: 169 ADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF 228
           A AI AG+D +CG  L      A+  GL+R + +   L   +  ++RLG+ D   +  P 
Sbjct: 633 AAAIAAGVDSNCGKVLTSALPEALAAGLVRPDALRPPLERLLEARLRLGLLDDWDADAPV 692

Query: 229 GNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIG 288
                  V +PAH+ LAL+AA +G+VLL+N  + LPL      T+AVIGPN++ ++ ++ 
Sbjct: 693 PRPDVDAVDSPAHRALALRAAREGLVLLQNPNQILPLDG--RGTLAVIGPNANASMNLLS 750

Query: 289 NYAGVACG--YTTPLQGIS---RYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLV 343
            Y G        +PLQ +    R  K ++  GC         L  A ++ A+ AD  VL 
Sbjct: 751 GYHGTPPPDLLRSPLQELEARWRGGKVVYAVGCNASGAATAALDEAVDL-AKTADVVVLG 809

Query: 344 MGL------------DQSI----EAEFIDRAGLLLPGRQQELVSRV-AKASRGPVVLVLM 386
           +GL            D +     EAE +DR  L LPG Q+ L S++ A      V + L+
Sbjct: 810 LGLCGDNYGGGPPKEDATCFSIDEAESVDRTSLKLPGAQEALFSKIWALGKPVAVAVFLV 869

Query: 387 CGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
             G VD SFAK+     A+L  GY G+ GG A+AD L G  NPG
Sbjct: 870 SAGAVDASFAKDK---AALLLAGYGGEFGGVAVADALLGAYNPG 910


>gi|383649536|ref|ZP_09959942.1| beta-glucosidase [Sphingomonas elodea ATCC 31461]
          Length = 853

 Score =  223 bits (569), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 135/299 (45%), Positives = 174/299 (58%), Gaps = 8/299 (2%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT WSPN+NIFRDPRWGRGQET GEDP LTG  A  +++GLQG   +  KV A  KH  
Sbjct: 110 GLTIWSPNINIFRDPRWGRGQETYGEDPFLTGHLAVGFIQGLQGPDPAHPKVLATPKHLA 169

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +         R  F+   S QDLE TY   F+  + EGK  SVMC+YN ++G P CA  
Sbjct: 170 VHSGPE---AGRDGFDVDPSPQDLESTYTPAFRLALTEGKAQSVMCAYNSIHGTPACASG 226

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L + +   W   G IVSDCD+V  ++   HY     EA+A AIKAG+DL+CG   A  
Sbjct: 227 GLLNDRLRKDWGFTGLIVSDCDAVANIHLFHHYRLDAAEASAAAIKAGMDLNCGTTYAAL 286

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
            E AV+ GL+ E +V++AL   +  +  LG+  G   A P+  + P +  TPA + LAL+
Sbjct: 287 PE-AVKRGLVSEAEVDVALKRALDARRALGIAFG--GANPWSRIKPGERGTPAQRALALE 343

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISR 306
           AA + IVLLKN    LPL+      +AVIG N+D    + GNY G A    TPL GI R
Sbjct: 344 AARKAIVLLKNDGDRLPLAP--GGRIAVIGANADDLGVLEGNYHGTARDPVTPLDGIRR 400



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/103 (38%), Positives = 52/103 (50%), Gaps = 17/103 (16%)

Query: 338 DATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMC 387
           D  V V+GL   +E E +          DR  + LP  Q EL+  + K  + P+VLVL  
Sbjct: 580 DTIVAVLGLSPDLEGEALSVSIPGFVGGDRTDIALPRPQLELLKALRKTGK-PLVLVLTS 638

Query: 388 GGPVDVSFAKNDPRIG-AILWVGYPGQAGGAAIADVLFGRANP 429
           G  V V     DP +  AIL   YPG+ GG AIA+ L G+ NP
Sbjct: 639 GSAVAV-----DPSLADAILEAWYPGEEGGTAIAETLAGKNNP 676


>gi|90021134|ref|YP_526961.1| Beta-glucosidase [Saccharophagus degradans 2-40]
 gi|89950734|gb|ABD80749.1| b-xylosidase-like protein [Saccharophagus degradans 2-40]
          Length = 893

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 124/307 (40%), Positives = 183/307 (59%), Gaps = 20/307 (6%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG--NTGSRLKVAACCKH 65
           GLT+WSPN+NIFRDPRWGRGQET GEDP LTG+ A ++V+G+QG  +    LK  A  KH
Sbjct: 150 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTGRMAINFVKGIQGENDNSDYLKAVATIKH 209

Query: 66  YTAYD-LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           Y  +   +     D YH     +++DL +TY   F+  + E  V S+MC+YN+V+G P C
Sbjct: 210 YAVHSGPEKTRHSDDYH----PTRKDLFETYLPAFRMAIAETNVQSLMCAYNRVDGAPAC 265

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQ--HYTRTPEEAAADAIKAGLDLDCG- 181
            + ++++  + G    +GY+VSDC ++   Y ++  H   +P EAAA A+K+G DL+CG 
Sbjct: 266 GNNELMQEILRGDMGFNGYVVSDCGAIADFYESRSHHVVDSPAEAAAWAVKSGTDLNCGD 325

Query: 182 ----PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
                +  +H   A++ GL+ E+ +++A+      +++LGMFD E    P+  +G   V 
Sbjct: 326 SHGNTYTNLHY--ALQQGLITEDYIDIAVKRLFKARIKLGMFD-EQDRVPYSEIGMDVVG 382

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           +P H  L  +AA + IVLLKN+   LPL       VAVIGPN+     ++GNY GV    
Sbjct: 383 SPKHLALTQEAAEKSIVLLKNNG-VLPLKA--GVKVAVIGPNAVDEDVLVGNYHGVPVKP 439

Query: 298 TTPLQGI 304
             PL+GI
Sbjct: 440 VLPLEGI 446



 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 64/129 (49%), Gaps = 13/129 (10%)

Query: 311 IHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI----------DRAGL 360
           IH  G          L   A  AAR+AD  + + G+D  +E E +          DR  +
Sbjct: 601 IHPYGKLTWLDESRDLEEEALAAARKADVIIFMGGIDAHLEGEEMPLELDGFTHGDRTHI 660

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
            LP  Q  L+ ++ KA+  PVV+V   G  + +++     ++ AIL   YPG+A G A+A
Sbjct: 661 NLPKVQTNLLKQL-KATGKPVVMVNFSGSAMALNWESE--KLDAILQAFYPGEATGTALA 717

Query: 421 DVLFGRANP 429
           ++L+G  +P
Sbjct: 718 NILWGDVSP 726


>gi|225873995|ref|YP_002755454.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
 gi|225792796|gb|ACO32886.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
          Length = 896

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 127/302 (42%), Positives = 177/302 (58%), Gaps = 16/302 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT W+PN+NI RDPRWGRGQET GEDP LTG  AA YV GLQGN    LK  A  KH++
Sbjct: 144 GLTLWAPNINIVRDPRWGRGQETYGEDPFLTGTMAAEYVSGLQGNNPKYLKTVATPKHFS 203

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            Y     NG +  R+  NA  S  D++DTY   F+  + +G   S+MCSYN V G P+CA
Sbjct: 204 VY-----NGPESMRHKINANPSAHDMQDTYLAAFRMAITKGHADSMMCSYNAVYGVPSCA 258

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE--EAAADAIKAGLDLDCGPF 183
           +  +L + + G+W  DGYI SDC ++   Y    +  +P+   AAA A+ AG D DCG  
Sbjct: 259 N-KLLADVVRGKWGFDGYITSDCGAISDFYRPGAHGYSPDAVHAAASAVLAGTDTDCGTG 317

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQ-PFGNLGPRDVCTPAHQ 242
             +  + +V+ GL+ +  ++ A+    T + RLGMFD  P A  P+ ++    V + AH+
Sbjct: 318 YKVLPQ-SVQQGLISKAAIDRAVERLFTARFRLGMFD--PKADVPYNSIPYSVVDSAAHR 374

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
             AL+ A + +VLLKN    LPL   R  T+AV+GPN+    ++ GNY  +    + P+ 
Sbjct: 375 AQALEDASKSMVLLKNEGGILPLRNAR--TIAVVGPNAANLNSIEGNYNAIPSHPSLPVD 432

Query: 303 GI 304
           GI
Sbjct: 433 GI 434



 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 32/75 (42%), Positives = 45/75 (60%), Gaps = 3/75 (4%)

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  L LP  QQ+L+  +  A+  PVVLVL+ G  + + +AK    +  IL   YPG+AG
Sbjct: 655 DRTRLSLPQTQQDLLHALV-ATGKPVVLVLLNGSALSIDWAKQ--HVQGILEAWYPGEAG 711

Query: 416 GAAIADVLFGRANPG 430
           G AI + L G+ +PG
Sbjct: 712 GEAIGETLSGQNDPG 726


>gi|395803818|ref|ZP_10483061.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
 gi|395434089|gb|EJG00040.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
          Length = 875

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 122/302 (40%), Positives = 173/302 (57%), Gaps = 11/302 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYW+PN+NIFRDPRWGRGQET GEDP LT     ++V+GLQG+    LK AAC KHY 
Sbjct: 136 GLTYWTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQGDDPKYLKAAACAKHYA 195

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+  V+  +L DTY   FK  V   KVA VMC+YN    +P CA
Sbjct: 196 VH-----SGPESLRHTFDVDVTPYELWDTYLPAFKKLVTNSKVAGVMCAYNAFRTQPCCA 250

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
              ++ + +  QW+  GY+ SDC ++   +           A+ADA+  G D+DCG    
Sbjct: 251 SDILMNDILRNQWKFTGYVTSDCWAIDDFFKNHKTHPDAASASADAVLHGTDIDCGTDAY 310

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC-TPAHQQL 244
                AV+ G + E+ +++++     ++ RLGMFD  P +       P  V  +  H++ 
Sbjct: 311 KSLVQAVKNGQITEKQIDVSVKRLFMIRFRLGMFD--PVSMVKYAQTPSSVLESEEHKEH 368

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           AL+ A Q IVLLKN   TLPLS  +   + V+GPN+D +++++GNY G     TT LQGI
Sbjct: 369 ALKMARQSIVLLKNEKNTLPLSK-KLKKIVVLGPNADNSISILGNYNGTPSKLTTVLQGI 427

Query: 305 SR 306
             
Sbjct: 428 KE 429



 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 38/107 (35%), Positives = 56/107 (52%), Gaps = 15/107 (14%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + ADA +   G+   +E E +          DR  +LLP  Q  L+ +  ++S  PVV +
Sbjct: 604 KNADAFIFAGGISPQLEGEEMPVDAPGFNGGDRTSILLPEVQTRLL-KALQSSGKPVVFL 662

Query: 385 LMCGGPVDVSF-AKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +M G  + V + A+N P   AIL + Y GQ+ G A ADV+FG  NP 
Sbjct: 663 IMTGSAIAVPWEAENIP---AILNIWYGGQSAGTASADVIFGDYNPA 706


>gi|384420163|ref|YP_005629523.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
 gi|353463076|gb|AEQ97355.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
          Length = 889

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 123/303 (40%), Positives = 176/303 (58%), Gaps = 13/303 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++ GLQG+     +  A  KH 
Sbjct: 135 AGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIHGLQGDDLDHPRTIATPKHL 194

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS +D+E TY   F+A +VEG+  +VMC+YN ++G P C
Sbjct: 195 AVH-----SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPAC 249

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PF 183
           A   ++   + G W   G++VSDCD+V  +    ++      ++A A+KAG DL+CG  +
Sbjct: 250 AADWLINGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY 309

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
             + T  A+  G + E  ++ +L      + RLG  +  P   P+  LG +DV    H+ 
Sbjct: 310 RELGT--AIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAQHRA 366

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LALQAA + IVLLKN+A TLPL       +AVIGPN+D    +  NY G +    TPL G
Sbjct: 367 LALQAAAESIVLLKNNANTLPLKA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLG 424

Query: 304 ISR 306
           + +
Sbjct: 425 LRQ 427



 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 37/75 (49%), Positives = 47/75 (62%), Gaps = 3/75 (4%)

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  + LP  QQ L+ R AKAS  P+V+VLM G  V +++AK      AI+   YPGQ+G
Sbjct: 647 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKT--HADAIVAAWYPGQSG 703

Query: 416 GAAIADVLFGRANPG 430
           G AIA +L G  NPG
Sbjct: 704 GTAIARMLAGDDNPG 718


>gi|84623339|ref|YP_450711.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|188577358|ref|YP_001914287.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
 gi|84367279|dbj|BAE68437.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|188521810|gb|ACD59755.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
          Length = 889

 Score =  223 bits (568), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 123/303 (40%), Positives = 177/303 (58%), Gaps = 13/303 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++ GLQG+     +  A  KH 
Sbjct: 135 AGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIHGLQGDDLDHPRTIATPKHL 194

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS +D+E TY   F+A +VEG+  +VMC+YN ++G P C
Sbjct: 195 AVH-----SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPAC 249

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PF 183
           A   ++   + G W   G++VSDCD+V  +    ++      ++A A+KAG DL+CG  +
Sbjct: 250 AADWLINGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY 309

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
             + T  A+  G + E  ++ +L      + RLG  +  P   P+  LG +DV    H+ 
Sbjct: 310 RELGT--AIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAQHRA 366

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LALQAA + IVLLKN+A TLPL+      +AVIGPN+D    +  NY G +    TPL G
Sbjct: 367 LALQAAAESIVLLKNNANTLPLNA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLG 424

Query: 304 ISR 306
           + +
Sbjct: 425 LRQ 427



 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 37/75 (49%), Positives = 47/75 (62%), Gaps = 3/75 (4%)

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  + LP  QQ L+ R AKAS  P+V+VLM G  V +++AK      AI+   YPGQ+G
Sbjct: 647 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKT--HADAIVAAWYPGQSG 703

Query: 416 GAAIADVLFGRANPG 430
           G AIA +L G  NPG
Sbjct: 704 GTAIARMLAGDDNPG 718


>gi|322437617|ref|YP_004219707.1| glycoside hydrolase family protein [Granulicella tundricola
           MP5ACTX9]
 gi|321165510|gb|ADW71213.1| glycoside hydrolase family 3 domain protein [Granulicella
           tundricola MP5ACTX9]
          Length = 892

 Score =  223 bits (567), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 127/304 (41%), Positives = 179/304 (58%), Gaps = 16/304 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT WSPN+NIFRDPRWGRGQET GEDP LTG+   ++V+G+QG   +  +  A  KH+ 
Sbjct: 139 GLTIWSPNINIFRDPRWGRGQETYGEDPFLTGRLGVAFVKGIQGPDPNYFRAIATPKHFA 198

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +   +     R+  N   +  DL DTY   F+A + E    S+MC+YN V G P CA  
Sbjct: 199 VH---SGPESTRHSANIEPTPHDLHDTYLPAFRATITEAHADSIMCAYNAVEGSPACASK 255

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNT----QHYTRTPEEAAADAIKAGLDLDCG-P 182
            +L++T+   W   G++ SDC ++   Y T     H +   E AAA  IKAG D +CG  
Sbjct: 256 LLLQDTLRRDWGFKGFVTSDCGAIDDFYATDYPSHHTSPDKEAAAAAGIKAGTDSNCGQT 315

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP-RDVCTPAH 241
           +L + +  AV+ GL+ E +++ AL +  T + +LG+FD  P+A+   N  P  +V +PAH
Sbjct: 316 YLTLGS--AVKKGLVTEAEIDTALKHLFTARFQLGLFD--PAAKVAFNAIPFSEVNSPAH 371

Query: 242 QQLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           Q LAL+AA + IVLLKN A TLP   ++R  T+AVIGP++     + GNY  +      P
Sbjct: 372 QALALKAAEESIVLLKNDAHTLPFKPSVR--TIAVIGPSAATLNNLEGNYNAIPLHPVLP 429

Query: 301 LQGI 304
           L GI
Sbjct: 430 LDGI 433



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/98 (37%), Positives = 54/98 (55%), Gaps = 13/98 (13%)

Query: 343 VMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            +GL   +E E +          DR  + LP  QQ+++  VA A+  P+V+VL+ G  + 
Sbjct: 629 FLGLSPELEGEEMPIHIEGFAGGDRTDIKLPAAQQQMLEAVA-ATGKPLVVVLLNGSALA 687

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           V++A +     AIL   YPGQAGG AIA+ L G+ NP 
Sbjct: 688 VNWAND--HAAAILEAWYPGQAGGTAIAETLAGKNNPA 723


>gi|423226625|ref|ZP_17213090.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392628884|gb|EIY22909.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 863

 Score =  223 bits (567), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 124/316 (39%), Positives = 184/316 (58%), Gaps = 14/316 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT W+PN+NIFRDPRWGRGQET GEDP LTG+   + VRGLQG  G +  K+ AC KHY
Sbjct: 131 GLTMWTPNINIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEGEKYDKLHACAKHY 190

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  +  +DL +TY   FK  V +  V  VMC+YN+  G+P C 
Sbjct: 191 AVHSGPEWN---RHSFNAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCG 247

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEE--AAADAIKAGLDLDCGPF 183
              +L   +  +W     +VSDC ++   YN   +   P++  A+A A+ +G D++CG  
Sbjct: 248 SNRLLMQILRDEWGYKEIVVSDCWAISDFYNKDAHETDPDKQHASAKAVLSGTDVECGDS 307

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A   E AV+ GL+ E+ ++++L   +  +  LG  D EPS   +  +    V +  H++
Sbjct: 308 YASLPE-AVKEGLIDEKQIDISLKRLMKARFELGEMD-EPSQVSWAQIPYSVVDSKEHRE 365

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ A + +VLL+N+   LPL+  ++  VAV+GPN++ +V   GNY G      T L+G
Sbjct: 366 LALRMARESLVLLQNNQSLLPLN--KNLKVAVVGPNANDSVMQWGNYNGFPSHTITLLEG 423

Query: 304 ISRY---AKTIHQAGC 316
           I  Y   ++ I++ GC
Sbjct: 424 IREYLPESQIIYEPGC 439



 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 51/105 (48%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           ++AD  +   G+  ++E E +          DR  + LP  Q  L++ + KA +  +V V
Sbjct: 598 KEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKAGK-KIVFV 656

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G  +  +         AIL   YPGQAGG AIA+VLFG  NP
Sbjct: 657 NFSGSAI--ALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNP 699


>gi|399029285|ref|ZP_10730258.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
 gi|398072895|gb|EJL64089.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
          Length = 871

 Score =  223 bits (567), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 134/304 (44%), Positives = 171/304 (56%), Gaps = 11/304 (3%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           +G   GLT+WSPNVNIFRDPRWGRG ET GEDP LTG+    YV GLQG     LKV A 
Sbjct: 126 HGMYQGLTFWSPNVNIFRDPRWGRGHETYGEDPFLTGQLGLKYVNGLQGTNEKYLKVIAT 185

Query: 63  CKHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
            KHY  +     +G +  R+ FNA  S  DL +TY   F+  V EG V SVM +YN+  G
Sbjct: 186 AKHYAVH-----SGPEPSRHLFNAETSDIDLYETYLPAFRTLVKEGHVYSVMGAYNRFRG 240

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
           +   A P  L N +   W  DGYIVSDC +V  ++     T     A+A A+K GLDL+C
Sbjct: 241 ESCSASP-FLFNILRNVWGFDGYIVSDCGAVTDIWKYHKITGDAATASALALKDGLDLEC 299

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           G       E A+   L+ E D+++A+    T + +LGMFD E     +  +        A
Sbjct: 300 GSSFKSLKE-AIDRKLISEADIDIAVKRLFTARFKLGMFDPEEIVS-YAQIPYSVNNNSA 357

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H  LA  A+ + IVLLKN   TLPLS     TVAVIGPN++   ++ GNY+GV     T 
Sbjct: 358 HDWLARVASQKSIVLLKNQNNTLPLSR-DIKTVAVIGPNANDVQSLWGNYSGVPSNPITV 416

Query: 301 LQGI 304
           L+GI
Sbjct: 417 LKGI 420



 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 59/147 (40%), Positives = 80/147 (54%), Gaps = 22/147 (14%)

Query: 301 LQGISRYAKTIHQAGCFGVAC--------NGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
           L+   +Y  T+     +G A           N L  A +VA  QADA VLV+GL++ +E 
Sbjct: 556 LEAEKKYKITVKYQNFYGDAIAQLLWAEPQENVLQEAVQVAG-QADAIVLVLGLNERLEG 614

Query: 353 EFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           E +          DR  L LP  Q+EL+ +   A+  PV+LVL+ G  + +++A ND  +
Sbjct: 615 EEMKVEADGFEGGDRTSLDLPSNQEELM-KAMTATGKPVILVLINGSALSINWA-ND-HV 671

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANP 429
            AIL  GYPGQ GG AIADVLFG  NP
Sbjct: 672 PAILTAGYPGQQGGNAIADVLFGDYNP 698


>gi|87200432|ref|YP_497689.1| beta-glucosidase [Novosphingobium aromaticivorans DSM 12444]
 gi|87136113|gb|ABD26855.1| Beta-glucosidase [Novosphingobium aromaticivorans DSM 12444]
          Length = 849

 Score =  223 bits (567), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 137/311 (44%), Positives = 176/311 (56%), Gaps = 10/311 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT WSPN+NIFRDPRWGRGQET GEDPVLTG  A +YVRGLQG      +V A  KH  
Sbjct: 111 GLTIWSPNINIFRDPRWGRGQETYGEDPVLTGTLATAYVRGLQGPDLDHPRVIATPKHLV 170

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           A+         R  FN + S  D+E TY   F+  + EGK  SVMCSYN V+G P C   
Sbjct: 171 AHSGPE---AGRDSFNVQSSAYDMEATYLPAFRRALTEGKALSVMCSYNSVHGVPVCGAD 227

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L   +   W  DG +VSDCD++G + + Q Y +T   A+A AI AG+DL+CG   A  
Sbjct: 228 WLLNQRVRKDWGFDGLVVSDCDAIGNINHYQRYRQTNAAASAAAINAGMDLNCGRTYAAL 287

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
            E A+  GL   E V+ ALA T   + +LG  D   +  P+  +    V T  H+ LAL+
Sbjct: 288 PE-ALAQGLTTREVVDRALARTFAARRKLG--DAFGATSPWATIPANTVDTAEHRALALE 344

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI-SR 306
           AA + +VLLKN+   LPL   +   +AV+GPN+D   T+  NY G A    TPL G  +R
Sbjct: 345 AARKSLVLLKNNG-VLPLR--KGARIAVVGPNADSLDTLEANYHGTAAQPVTPLDGFAAR 401

Query: 307 YAKTIHQAGCF 317
           Y  +  Q    
Sbjct: 402 YRMSYAQGASL 412



 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 34/96 (35%), Positives = 54/96 (56%), Gaps = 13/96 (13%)

Query: 344 MGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDV 393
           +GL  ++E E +          DR  + LP  QQ+L+  + KA+  P+V+VL+ G  V +
Sbjct: 579 VGLSPAVEGEALQIEVPGFSGGDRTDIALPRAQQDLLETL-KATGKPLVVVLLSGSAVAM 637

Query: 394 SFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            + K +    A++   YPGQ+GG AIAD++ G  NP
Sbjct: 638 PWVKEN--ADAVVAAWYPGQSGGTAIADLVDGTLNP 671


>gi|218262493|ref|ZP_03476939.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
           DSM 18315]
 gi|218223341|gb|EEC95991.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
           DSM 18315]
          Length = 868

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 127/323 (39%), Positives = 181/323 (56%), Gaps = 25/323 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+W+PN+NIFRDPRWGRG ET GEDP LT +   + V+GLQG+     K  AC KHY 
Sbjct: 133 GLTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDDPKYFKTHACAKHYA 192

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +    WN   R+ F+  V+ +DL  TY   F+A V EG V  VMC+YN+  GKP C+  
Sbjct: 193 VHSGPEWN---RHEFDVTVTPRDLWQTYLPAFEALVKEGNVQEVMCAYNRYQGKPCCSSD 249

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--------EEAAADAIKAGLDLD 179
            +L + +   W  +  I+SDC ++   +      RTP        E A+ADA+  G DL+
Sbjct: 250 KLLIDILRNSWGYENIILSDCGAINDFWERDE--RTPRHETHPDAESASADAVLNGTDLE 307

Query: 180 CG-PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQ-PFGNLGPRDVC 237
           CG  + A+    A++ G + E D++++L   +  +  LGMFD  P  Q P+  +    V 
Sbjct: 308 CGNSYRALVK--ALKDGKISENDLDVSLRRLLKGRFELGMFD--PDEQVPYAQIPYNVVE 363

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
           +P H   AL+ AH+ +VLLKN   TLPLS T+R   +AV+GPN+  +  +  NY G    
Sbjct: 364 SPEHVAQALEMAHKSMVLLKNKNNTLPLSKTIRK--IAVVGPNAADSTMLWANYNGFPTH 421

Query: 297 YTTPLQGISRY---AKTIHQAGC 316
             T L+GI       + I++ GC
Sbjct: 422 TVTILEGIRNKVPDTEVIYELGC 444



 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/113 (35%), Positives = 55/113 (48%), Gaps = 13/113 (11%)

Query: 328 GAAEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKAS 377
            A     + AD  V V G+   +E E +          DR  + LP  QQE+V +  KA+
Sbjct: 596 AATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIELPKVQQEMV-KALKAT 654

Query: 378 RGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
             PVV VL  G  + +++   +  I AIL   Y GQ  G A+AD+LFG  NP 
Sbjct: 655 GKPVVYVLCTGSALALNW--EEANIDAILNAWYGGQEAGTAVADILFGDYNPS 705


>gi|319788503|ref|YP_004147978.1| glycoside hydrolase [Pseudoxanthomonas suwonensis 11-1]
 gi|317467015|gb|ADV28747.1| glycoside hydrolase family 3 domain protein [Pseudoxanthomonas
           suwonensis 11-1]
          Length = 916

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 136/327 (41%), Positives = 183/327 (55%), Gaps = 18/327 (5%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG---NTGSRL-- 57
           +G   GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   S+VRGLQG    TG  L  
Sbjct: 150 HGRYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTTRMGVSFVRGLQGMDPQTGQPLDP 209

Query: 58  ---KVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCS 114
              K+ A  KH+  +   +    DR+ F+   SKQDL DTY   F++ V E  V +VM +
Sbjct: 210 KYRKLDATAKHFAVH---SGPEADRHTFDVHPSKQDLYDTYLPAFESLVKEADVYAVMGA 266

Query: 115 YNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKA 174
           YN+V G+       +L +T+   W  DGY++SDC ++  ++       TPEEAAA A+K 
Sbjct: 267 YNRVYGESASGSKFLLLDTLRRDWGFDGYVMSDCWAIVDIWKNHKIVETPEEAAALAVKN 326

Query: 175 GLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPR 234
           G +L+CG   A H   AV+ GL+ E +++ AL      +M LGMFD  P  Q      P 
Sbjct: 327 GTELNCGSTYADHLPVAVKKGLISEAELDDALTRLFVARMELGMFD--PPEQVRWAQVPY 384

Query: 235 DVCTPA-HQQLALQAAHQGIVLLKNSARTLPLST-LRHHTVAVIGPNSDVTVTMIGNYAG 292
            V   A H  LA + A + +VLLKN    LPLS  +R   +AV+GP +D T+ ++GNY G
Sbjct: 385 SVNQSAEHDALARKMAQESLVLLKNDG-VLPLSKDIRR--LAVVGPTADDTMALLGNYYG 441

Query: 293 VACGYTTPLQGISRYAKTIHQAGCFGV 319
                 T L+GI   A  +      GV
Sbjct: 442 TPADPVTILRGIREAAPGVDVVYARGV 468



 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 42/108 (38%), Positives = 57/108 (52%), Gaps = 13/108 (12%)

Query: 333 AARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVV 382
           AA  ADA V V GL   +E E +          DR  + LP  QQ+L+  V  A+  PVV
Sbjct: 646 AANSADAVVFVGGLTGDVEGEEMKVDYPGFAGGDRTDIRLPATQQKLLEAV-HATGKPVV 704

Query: 383 LVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +VL  G  + + +A+ +  +  IL   YPGQ GG A+ + LFG  NPG
Sbjct: 705 MVLTTGSALGIDWARRN--VPGILVAWYPGQRGGTAVGEALFGDYNPG 750


>gi|58581402|ref|YP_200418.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|58425996|gb|AAW75033.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
          Length = 889

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 123/303 (40%), Positives = 176/303 (58%), Gaps = 13/303 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++ GLQG      +  A  KH 
Sbjct: 135 AGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIHGLQGEDLDHPRTIATPKHL 194

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS +D+E TY   F+A +VEG+  +VMC+YN ++G P C
Sbjct: 195 AVH-----SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPAC 249

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PF 183
           A   ++   + G W   G++VSDCD+V  +    ++      ++A A+KAG DL+CG  +
Sbjct: 250 AADWLINGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY 309

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
             + T  A+  G + E  ++ +L      + RLG  +  P   P+  LG +DV    H+ 
Sbjct: 310 RELGT--AIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAQHRA 366

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LALQAA + IVLLKN+A TLPL+      +AVIGPN+D    +  NY G +    TPL G
Sbjct: 367 LALQAAAESIVLLKNNANTLPLNA--GTRLAVIGPNADALAALEANYQGTSSAPVTPLLG 424

Query: 304 ISR 306
           + +
Sbjct: 425 LRQ 427



 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 37/75 (49%), Positives = 47/75 (62%), Gaps = 3/75 (4%)

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  + LP  QQ L+ R AKAS  P+V+VLM G  V +++AK      AI+   YPGQ+G
Sbjct: 647 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKT--HADAIVAAWYPGQSG 703

Query: 416 GAAIADVLFGRANPG 430
           G AIA +L G  NPG
Sbjct: 704 GTAIARMLAGDDNPG 718


>gi|224537384|ref|ZP_03677923.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521009|gb|EEF90114.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 863

 Score =  222 bits (566), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 124/316 (39%), Positives = 184/316 (58%), Gaps = 14/316 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT W+PN+NIFRDPRWGRGQET GEDP LTG+   + VRGLQG  G +  K+ AC KHY
Sbjct: 131 GLTMWTPNINIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEGEKYDKLHACAKHY 190

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  +  +DL +TY   FK  V +  V  VMC+YN+  G+P C 
Sbjct: 191 AVHSGPEWN---RHSFNAENIDPRDLWETYLPAFKNLVQKAHVKEVMCAYNRFEGEPCCG 247

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEE--AAADAIKAGLDLDCGPF 183
              +L   +  +W     +VSDC ++   YN   +   P++  A+A A+ +G D++CG  
Sbjct: 248 SNRLLMQILRDEWGYKEIVVSDCWAISDFYNKGAHETDPDKQHASAKAVLSGTDVECGDS 307

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A   E AV+ GL+ E+ ++++L   +  +  LG  D EPS   +  +    V +  H++
Sbjct: 308 YASLPE-AVKEGLIDEKQIDISLKRLMKARFELGEMD-EPSQVSWAQIPYSVVDSKEHRE 365

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ A + +VLL+N+   LPL+  ++  VAV+GPN++ +V   GNY G      T L+G
Sbjct: 366 LALRMARESLVLLQNNQSLLPLN--KNLKVAVVGPNANDSVMQWGNYNGFPSHTITLLEG 423

Query: 304 ISRY---AKTIHQAGC 316
           I  Y   ++ I++ GC
Sbjct: 424 IREYLPESQIIYEPGC 439



 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 51/105 (48%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           ++AD  +   G+  ++E E +          DR  + LP  Q  L++ + KA +  +V V
Sbjct: 598 KEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKAGK-KIVFV 656

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G  +  +         AIL   YPGQAGG AIA+VLFG  NP
Sbjct: 657 NFSGSAI--ALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNP 699


>gi|393720890|ref|ZP_10340817.1| beta-glucosidase [Sphingomonas echinoides ATCC 14820]
          Length = 563

 Score =  222 bits (565), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 124/299 (41%), Positives = 171/299 (57%), Gaps = 8/299 (2%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT WSPN+NIFRDPRWGRGQET GEDP LTG    +++ GLQG      KV A  KH+ 
Sbjct: 107 GLTIWSPNINIFRDPRWGRGQETYGEDPYLTGHLGVAFITGLQGPDPKHPKVIATPKHFV 166

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +         R  F+   S QDLE TY   F+  V EGK  S+MC+YN ++G P CA+ 
Sbjct: 167 VHSGPE---AGRDGFDVDPSPQDLEATYLPAFRLAVTEGKAQSLMCAYNAIHGVPACANA 223

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            ++ + +   W   G+ VSDCD+V  ++   HY      AAA ++K G DL+CG   A  
Sbjct: 224 PLMVDRLRHDWGFTGFTVSDCDAVANVHLFHHYRLDAAGAAAASLKGGTDLNCGTTYAAL 283

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
            E A++ GL+  +D++ A    +  +  LG+  G  S  P+  + PRD+ TP H  +AL+
Sbjct: 284 PE-ALQRGLVTSQDIDTAFGRAMAARKALGIAYGATS--PWARISPRDIGTPEHHAVALE 340

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISR 306
           AA + IVLLKN    LPL T     +A+IG ++D    + GNY G A    TPL+G+ +
Sbjct: 341 AARKAIVLLKNEGDRLPLKTGTR--IALIGADADDIGVLQGNYHGTAIAPVTPLEGLRQ 397


>gi|393773722|ref|ZP_10362116.1| glycosyl hydrolase family 3 N terminal domain protein
           [Novosphingobium sp. Rr 2-17]
 gi|392720897|gb|EIZ78368.1| glycosyl hydrolase family 3 N terminal domain protein
           [Novosphingobium sp. Rr 2-17]
          Length = 846

 Score =  221 bits (564), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 142/363 (39%), Positives = 196/363 (53%), Gaps = 16/363 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT WSPN+NIFRDPRWGRGQET GEDP L+G  A  +++GLQG   +  +V A  KH+ 
Sbjct: 107 GLTIWSPNINIFRDPRWGRGQETYGEDPFLSGTLAVGFIKGLQGTDPAHPRVIATPKHFA 166

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +         R  F+  VS QD E TY   F+  V EG+  SVMC+YN  +G P CAD 
Sbjct: 167 VHSGPE---AGRDSFDVDVSPQDREATYLPAFRKAVTEGRPLSVMCAYNSTHGTPVCADD 223

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            ++   +   W   G++VSDCD++  ++   HY     EA+A +IKAG D DCG   A  
Sbjct: 224 ALINQRLRKDWGFKGFVVSDCDAIANVWMFHHYAADAAEASAASIKAGTDFDCGNTYAAL 283

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
            + AV  GL+ E  ++ ALA ++  +  LG+  G  +A P+G + P  V T A + LAL+
Sbjct: 284 PQ-AVERGLVDEATIDRALARSLQARHSLGIAFG--AANPWGRIKPSQVHTLASRALALE 340

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRY 307
           AA + IVLL+N    LPL       +AV+G N+D    +  +Y G A    TPLQGI R 
Sbjct: 341 AARKSIVLLQNDNARLPLKP--GTKLAVVGANADDLSVIEASYHGTAADPITPLQGIRR- 397

Query: 308 AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQ 367
                + G   VA     ++  AE AA     T L       ++ E++D  G ++  RQ 
Sbjct: 398 -----RFGAANVAYAQGSVL--AEGAAVIVPETALASQGKPGLKGEYLDATGKVVLTRQD 450

Query: 368 ELV 370
             V
Sbjct: 451 RRV 453



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/111 (35%), Positives = 58/111 (52%), Gaps = 17/111 (15%)

Query: 330 AEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRG 379
           A  A R ADA V V+GL   +E E +          DR+ + LP  Q +L+  +  A+  
Sbjct: 565 AIAATRNADAIVAVLGLSPDLEGEALQVQVPGFVGGDRSDIALPRPQAQLLEAL-HATGK 623

Query: 380 PVVLVLMCGGPVDVSFAKNDPRIG-AILWVGYPGQAGGAAIADVLFGRANP 429
           PV++VL  G  + +     DP++  A+L   YPG+ GG A+A+ L G  NP
Sbjct: 624 PVIVVLTSGSAIAI-----DPKLADAVLAAWYPGEEGGTALAETLGGINNP 669


>gi|433679952|ref|ZP_20511614.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
 gi|430814928|emb|CCP42243.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
          Length = 909

 Score =  221 bits (564), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 125/301 (41%), Positives = 174/301 (57%), Gaps = 13/301 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++RGLQG+  +  +  A  KH 
Sbjct: 155 AGLTIWSPNINIFRDPRWGRGMETYGEDPYLTGQLAVGFIRGLQGDDLTHPRTIATPKHL 214

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS  DLE TY   F+A +V+G+  +VMC+YN ++G P C
Sbjct: 215 AVH-----SGPEPGRHGFDVDVSPHDLEATYTPAFRAAIVDGRAGAVMCAYNSLHGTPAC 269

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFL 184
           A   +L   + G W   G++VSDCD+V  +    ++      ++A A+KAG DL+CG + 
Sbjct: 270 AADWLLNGRLRGDWGFTGFVVSDCDAVDDMTQFHYFRADNAGSSAAALKAGHDLNCG-YA 328

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
                 A+  G   E  ++ +L      + RLG    +    P+  LG +DV + AH+ L
Sbjct: 329 YRDLGKAIARGDADEAVLDQSLVRLFAARYRLGELQPQ-RKDPYARLGAKDVDSAAHRAL 387

Query: 245 ALQAAHQGIVLLKNSARTLPLST-LRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           ALQAA Q IVLL+N   TLPL   LR   +AVIGPN+D    +  NY G +    TPL G
Sbjct: 388 ALQAAQQSIVLLQNRNATLPLRPGLR---LAVIGPNADALAALEANYQGTSAAPVTPLLG 444

Query: 304 I 304
           +
Sbjct: 445 L 445



 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 39/75 (52%), Positives = 47/75 (62%), Gaps = 3/75 (4%)

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  L LP  QQ L+ R AKAS  P+V+VLM G  V +++AK      AI+   YPGQ+G
Sbjct: 667 DRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKQ--HADAIVAAWYPGQSG 723

Query: 416 GAAIADVLFGRANPG 430
           G AIA VL G  NPG
Sbjct: 724 GTAIAQVLAGDVNPG 738


>gi|293370402|ref|ZP_06616956.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292634550|gb|EFF53085.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 863

 Score =  221 bits (564), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 120/301 (39%), Positives = 174/301 (57%), Gaps = 13/301 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT W+PNVNIFRDPRWGRGQET GEDP L+G+   + VRGLQG   +   K+ AC KH+
Sbjct: 133 GLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLHACAKHF 192

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  ++ +DL +TY   FK  V +  V  VMC+YN+  G P C 
Sbjct: 193 AVHSGPEWN---RHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCG 249

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE--EAAADAIKAGLDLDCGPF 183
              +L   +   W   G +V+DC ++G  +  + +   P+   A+ADA+ +G DL+CG  
Sbjct: 250 SNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLECGGN 309

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
               T+ AV+  L+ EE +N ++   +  +  LG  +   S  P+ N+    +  P H++
Sbjct: 310 FKSITD-AVKKDLISEEKINTSVKRVLKARFELGEMN---STHPWSNIPFSVIDCPKHKE 365

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ AH+ +VLL+N+   LPL+  R   VAVIGPN++ +V   GNY G      T L+G
Sbjct: 366 LALKMAHESLVLLQNNNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGFPSHTVTLLEG 423

Query: 304 I 304
           I
Sbjct: 424 I 424



 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 49/106 (46%), Gaps = 13/106 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  +   G+   +E E +          DR  + LP  Q+E+++ + K  +  V + 
Sbjct: 598 QSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKKTVFVN 657

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
                   V   +N     AIL   YPGQAGG A+ADVLFG  NP 
Sbjct: 658 FSGSAMAIVPETQN---CDAILQAWYPGQAGGTAVADVLFGDYNPA 700


>gi|423342048|ref|ZP_17319763.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409219455|gb|EKN12417.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 868

 Score =  221 bits (564), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 125/322 (38%), Positives = 180/322 (55%), Gaps = 23/322 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+W+PN+NIFRDPRWGRG ET GEDP LT +   + V+GLQG+     K  AC KHY 
Sbjct: 133 GLTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDDPKYFKTHACAKHYA 192

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +    WN   R+ F+  V+ +DL  TY   F+A V EG V  VMC+YN+  GKP C+  
Sbjct: 193 VHSGPEWN---RHEFDVTVTPRDLWQTYLPAFEALVKEGNVQEVMCAYNRYQGKPCCSSD 249

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--------EEAAADAIKAGLDLD 179
            +L + +   W  +  I+SDC ++   +      RTP        E A+ADA+  G DL+
Sbjct: 250 KLLIDILRNSWGYENIILSDCGAINDFWERDE--RTPRHETHPDAESASADAVLNGTDLE 307

Query: 180 CG-PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           CG  + A+    A++ G + E D++++L   +  +  LGMFD +    P+  +    V +
Sbjct: 308 CGNSYRALVK--ALKDGKISENDLDVSLRRLLKGRFELGMFDPDERV-PYAQIPYNVVES 364

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           P H   AL+ AH+ +VLLKN   TLPLS T+R   +AV+GPN+  +  +  NY G     
Sbjct: 365 PEHVAQALEMAHKSMVLLKNKNNTLPLSKTIRK--IAVVGPNAADSTMLWANYNGFPTHT 422

Query: 298 TTPLQGISRY---AKTIHQAGC 316
            T L+GI       + I++ GC
Sbjct: 423 VTILEGIRNKVPDTEVIYELGC 444



 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/113 (35%), Positives = 55/113 (48%), Gaps = 13/113 (11%)

Query: 328 GAAEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKAS 377
            A     + AD  V V G+   +E E +          DR  + LP  QQE+V +  KA+
Sbjct: 596 AATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIELPKVQQEMV-KALKAT 654

Query: 378 RGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
             PVV VL  G  + +++   +  I AIL   Y GQ  G A+AD+LFG  NP 
Sbjct: 655 GKPVVYVLCTGSALALNW--EEANIDAILNAWYGGQEAGTAVADILFGDYNPS 705


>gi|393786908|ref|ZP_10375040.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
           CL02T12C05]
 gi|392658143|gb|EIY51773.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
           CL02T12C05]
          Length = 854

 Score =  221 bits (564), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 117/302 (38%), Positives = 180/302 (59%), Gaps = 13/302 (4%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+GK   S+V+GLQG+    LK+ +  KH+ A
Sbjct: 136 LTFWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVSFVKGLQGDDPRYLKIVSTPKHFAA 195

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R+  N  +S++DL + Y   F+ C++EGK AS+M +YN +N  P   +  
Sbjct: 196 NNEEH----NRFECNPIISEKDLREYYLPAFEKCIIEGKAASIMTAYNAINDVPCTLNNW 251

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHT 188
           +LK  +   W  DGY+VSDC     L     Y +T E AAA +I+AGLDL+CG    ++ 
Sbjct: 252 LLKKVLRHDWGFDGYVVSDCGGPSFLVTHHKYVKTLEAAAALSIQAGLDLECGD--EVYM 309

Query: 189 E---GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           E    A +  ++ E +++ A  + +  +MRLG+FD +P+  P+  + P  V    H +LA
Sbjct: 310 EPLLNAYKQYMVSEAEIDSAAYHVLRARMRLGLFD-DPALNPYNKISPSIVGCEKHSKLA 368

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY-TTPLQGI 304
           L+AA Q IVLLKN  + LPL + +  ++AV+G N+    +  G+Y+G       + L+GI
Sbjct: 369 LEAARQSIVLLKNEKKFLPLDSKKIKSIAVVGINA--GNSEFGDYSGTPVNQPVSILEGI 426

Query: 305 SR 306
            +
Sbjct: 427 KK 428



 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 44/105 (41%), Positives = 61/105 (58%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           L G A    R+ D TV V+G+++SIE E  DR  + LP  QQ  +    K +   VV VL
Sbjct: 591 LYGEAGDIMRKCDLTVAVLGINKSIEREGQDRYSIELPKDQQIFIEEAYKINPNTVV-VL 649

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + +++   D  I AI+   YPG+AGG A+A+VLFG  NPG
Sbjct: 650 VAGSSLAINWM--DEHIPAIVNAWYPGEAGGTAVAEVLFGDYNPG 692


>gi|332665860|ref|YP_004448648.1| beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
 gi|332334674|gb|AEE51775.1| Beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
          Length = 887

 Score =  221 bits (563), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 120/304 (39%), Positives = 165/304 (54%), Gaps = 7/304 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N    GLTYW+PN+NIFRDPRWGRGQET GEDP LT     ++VRGLQG+    LK AAC
Sbjct: 144 NERYLGLTYWTPNINIFRDPRWGRGQETYGEDPFLTSMLGRAFVRGLQGDDPKYLKAAAC 203

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
            KH+  +         R+  N   S  DL DTY   FK  V + KV  VMC+YN  +G+P
Sbjct: 204 AKHFAVHSGPE---PSRHSDNFSPSNYDLWDTYLPAFKELVTKAKVEGVMCAYNAFHGQP 260

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP 182
            C    ++ + +  QW+  GY+ SDC ++   +           A+ DA+  G D++CG 
Sbjct: 261 CCGSDVLMNDILRKQWQFKGYVTSDCWAIDDFFKFHKTHPDATSASVDAVLHGTDVECGT 320

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC-TPAH 241
            +       V+ G++ E  ++++L    T + RLGMFD  P +       P  +  T  H
Sbjct: 321 DVYKSLLDGVKKGMIAEAQLDISLIRLFTTRYRLGMFD--PVSMVKYAQTPESILETAEH 378

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
           +  +L+ A Q IVLLKN   TLPLS      +AV+GPN+D  + ++GNY G      T L
Sbjct: 379 KAHSLKMAQQSIVLLKNEGNTLPLSK-NIKKIAVLGPNADNRIVVLGNYNGQPSEIITAL 437

Query: 302 QGIS 305
           QGI 
Sbjct: 438 QGIK 441



 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 40/106 (37%), Positives = 56/106 (52%), Gaps = 13/106 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + ADA V V G+   +E E +          DR  +LLP  Q EL+ ++ K +  P+V V
Sbjct: 617 KDADAIVYVGGISPQLEGEEMRVDFPGFNGGDRTSILLPAVQTELL-KMLKGTGKPLVFV 675

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +M G  + + +   D  I AI+   Y GQ+ G AIADVLFG  NP 
Sbjct: 676 VMTGSAIALPY--EDQNIPAIVNAWYGGQSAGTAIADVLFGDYNPA 719


>gi|380512525|ref|ZP_09855932.1| beta-glucosidase [Xanthomonas sacchari NCPPB 4393]
          Length = 885

 Score =  221 bits (563), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 125/302 (41%), Positives = 175/302 (57%), Gaps = 15/302 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++ GLQG+  +  +  A  KH 
Sbjct: 131 AGLTIWSPNINIFRDPRWGRGMETYGEDPYLTGRLAVGFIHGLQGDDPAHPRTIATPKHL 190

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS  D E TY+  F+A +V+G+  SVMC+YN ++G P C
Sbjct: 191 AVH-----SGPEPGRHGFDVDVSPHDFEATYSPAFRAAIVDGQAGSVMCAYNSLHGTPAC 245

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFL 184
           A   ++   + G W   G++VSDCD++  +    +Y      ++A A+KAG DL+CG   
Sbjct: 246 AADWLIDGRVRGDWGFKGFVVSDCDAIDDMTQFHYYRPDNAGSSAAALKAGHDLNCG--T 303

Query: 185 AIHTEG-AVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
           A    G A   G   E  ++ +L      + RLG      +  P+  LG RD+ + AH+ 
Sbjct: 304 AYRELGIAFDRGEADEALLDRSLVRLFAARYRLGELQPRRN-DPYARLGARDIDSAAHRA 362

Query: 244 LALQAAHQGIVLLKNSARTLPLST-LRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
           LALQAA Q +VLLKN+  TLPL   LR   +AV+GPN+D    +  NY G +    TPLQ
Sbjct: 363 LALQAAQQSLVLLKNANATLPLRPGLR---LAVLGPNADALAALEANYQGTSVQPVTPLQ 419

Query: 303 GI 304
           G+
Sbjct: 420 GL 421



 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 37/75 (49%), Positives = 46/75 (61%), Gaps = 3/75 (4%)

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  L LP  QQ L+ R AKAS  P+V+VLM G  V +++A+      AI+   YPGQ+G
Sbjct: 643 DRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAEQ--HADAIIAAWYPGQSG 699

Query: 416 GAAIADVLFGRANPG 430
           G AIA  L G  NPG
Sbjct: 700 GTAIAQALAGDINPG 714


>gi|427385138|ref|ZP_18881643.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727306|gb|EKU90166.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
           12058]
          Length = 863

 Score =  221 bits (563), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 125/316 (39%), Positives = 182/316 (57%), Gaps = 14/316 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT W+PNVNIFRDPRWGRGQET GEDP LTG+   + VRGLQG  G +  K+ AC KH+
Sbjct: 131 GLTMWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEGGKYDKLHACAKHF 190

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ F+A  V  +DL +TY   FK  V +  V  VMC+YN+  G+P C 
Sbjct: 191 AVHSGPEWN---RHSFDAENVDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCG 247

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +  +W  DG IVSDC ++   +N   +   P  E A+A A+  G D++CG  
Sbjct: 248 SNRLLVQILRDEWAYDGIIVSDCWAINDFFNKGAHETEPDKEHASAKAVLTGTDVECGES 307

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A   + AV+ GL+ E+ ++++L   +  +  LG  D  P    +  +    V +  H++
Sbjct: 308 YASLPQ-AVKAGLIDEKKIDISLKRLMKARFELGEMDN-PELVSWAQIPYSVVDSKEHRE 365

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ A + +VLL+N+   LPL+  +   VAV+GPN++ +V   GNY G      T L+G
Sbjct: 366 LALRMARESLVLLQNNQNVLPLN--KSLKVAVVGPNANDSVMQWGNYNGFPGHTVTLLEG 423

Query: 304 ISRY---AKTIHQAGC 316
           I +Y   A+ I++ GC
Sbjct: 424 IRQYLPEAQLIYEPGC 439



 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 50/105 (47%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  V   G+  ++E E +          DR  + LP  Q  L++ + KA +  VV V
Sbjct: 598 KDADIIVFAGGISPAVEGEEMRVTIPGFKGGDRETIELPSIQSRLLAELKKAGK-KVVFV 656

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G  +  +         AIL   YPGQAGG AIA+VLFG  NP
Sbjct: 657 NFSGSAI--ALTPETKTCDAILQAWYPGQAGGTAIANVLFGDYNP 699


>gi|154493680|ref|ZP_02033000.1| hypothetical protein PARMER_03021 [Parabacteroides merdae ATCC
           43184]
 gi|423723902|ref|ZP_17698051.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
           CL09T00C40]
 gi|154086890|gb|EDN85935.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Parabacteroides merdae ATCC 43184]
 gi|409240709|gb|EKN33484.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
           CL09T00C40]
          Length = 868

 Score =  221 bits (563), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 128/340 (37%), Positives = 185/340 (54%), Gaps = 26/340 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+W+PN+NIFRDPRWGRG ET GEDP LT +   + V+GLQG+     K  AC KHY 
Sbjct: 133 GLTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDDPKYFKTHACAKHYA 192

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +    WN   R+ F+  V+ +DL  TY   F+A V +G V  VMC+YN+  GKP C+  
Sbjct: 193 VHSGPEWN---RHEFDVTVTPRDLWQTYLPAFEALVKKGNVQEVMCAYNRYQGKPCCSSD 249

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--------EEAAADAIKAGLDLD 179
            +L + +   W  +  I+SDC ++   +  Q   RTP        E A+ADA+  G DL+
Sbjct: 250 KLLIDILRNSWGYENIILSDCGAINDFW--QRDERTPRHETHPDAESASADAVLNGTDLE 307

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG       + A++ G + E D++++L   +  +  LGMFD +    P+  +    V +P
Sbjct: 308 CGNSYKALIK-ALKEGKISENDLDVSLRRLLKGRFELGMFDPDERV-PYAQIPYNVVESP 365

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
            H   AL+ AH+ +VLLKN   TLPLS T+R   +AV+GPN+  +  +  NY G      
Sbjct: 366 EHVAQALEMAHKSMVLLKNKNNTLPLSKTIRK--IAVVGPNAADSTMLWANYNGFPTHTV 423

Query: 299 TPLQGISRY---AKTIHQAGC-----FGVACNGNQLIGAA 330
           T L+GI       + I++ GC     F +   GN +   A
Sbjct: 424 TILEGIRNKVPDTEVIYELGCNHAADFVIQDLGNHITSPA 463



 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 40/113 (35%), Positives = 55/113 (48%), Gaps = 13/113 (11%)

Query: 328 GAAEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKAS 377
            A     + AD  V V G+   +E E +          DR  + +P  QQE+V +  KA+
Sbjct: 596 AATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIEIPKVQQEMV-KALKAT 654

Query: 378 RGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
             PVV VL  G  + +++   D  I AIL   Y GQ  G A+AD+LFG  NP 
Sbjct: 655 GKPVVYVLCTGSALALNW--EDANIDAILNAWYGGQEAGTAVADILFGDYNPS 705


>gi|299148437|ref|ZP_07041499.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298513198|gb|EFI37085.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 863

 Score =  221 bits (562), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 120/301 (39%), Positives = 172/301 (57%), Gaps = 13/301 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT W+PNVNIFRDPRWGRGQET GEDP L+G+   + VRGLQG   +   K+ AC KH+
Sbjct: 133 GLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLHACAKHF 192

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  ++ +DL +TY   FK  V +  V  VMC+YN+  G P C 
Sbjct: 193 AVHSGPEWN---RHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCG 249

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE--EAAADAIKAGLDLDCGPF 183
              +L   +   W   G +V+DC ++G  +  + +   P+   A+ADA+  G DL+CG  
Sbjct: 250 SNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLNGTDLECGGN 309

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
               T+ AV+ GL+ EE +N ++   +  +  LG  +      P+ N+    +  P H++
Sbjct: 310 FKSITD-AVKKGLISEEKINTSVKRLLKARFELGEMN---PTHPWSNIPYSVINCPKHKE 365

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ AH+ +VLL+N    LPL+  R   VAVIGPN++ +V   GNY G      T L+G
Sbjct: 366 LALKMAHESLVLLQNKNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGFPSHTVTLLEG 423

Query: 304 I 304
           I
Sbjct: 424 I 424



 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 49/105 (46%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  +   G+   +E E +          DR  + LP  Q+E+++ + K  +   V V
Sbjct: 598 KNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-KTVFV 656

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G  + +          AIL   YPGQAGG A+ADVLFG  NP
Sbjct: 657 NFSGSAMAI--VPETQSCDAILQAWYPGQAGGTAVADVLFGDYNP 699


>gi|298482082|ref|ZP_07000270.1| beta-glucosidase [Bacteroides sp. D22]
 gi|298271639|gb|EFI13212.1| beta-glucosidase [Bacteroides sp. D22]
          Length = 863

 Score =  221 bits (562), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 120/301 (39%), Positives = 172/301 (57%), Gaps = 13/301 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT W+PNVNIFRDPRWGRGQET GEDP L+G+   + VRGLQG   +   K+ AC KH+
Sbjct: 133 GLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAVVRGLQGPEDAEYDKLHACAKHF 192

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  ++ +DL +TY   FK  V +  V  VMC+YN+  G P C 
Sbjct: 193 AVHSGPEWN---RHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCG 249

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE--EAAADAIKAGLDLDCGPF 183
              +L   +   W   G +V+DC ++G  +  + +   P+   A+ADA+  G DL+CG  
Sbjct: 250 SNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAVHASADAVLNGTDLECGGN 309

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
               T+ AV+ GL+ EE +N ++   +  +  LG  +      P+ N+    +  P H++
Sbjct: 310 FKSITD-AVKKGLISEEKINTSVKRLLKARFELGEMN---PTHPWSNIPYSVIDCPKHKE 365

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ AH+ +VLL+N    LPL+  R   VAVIGPN++ +V   GNY G      T L+G
Sbjct: 366 LALKMAHESLVLLQNKNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGFPSHTVTLLEG 423

Query: 304 I 304
           I
Sbjct: 424 I 424



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 49/105 (46%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  +   G+   +E E +          DR  + LP  Q+E+++ + K  +   V V
Sbjct: 598 KNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-KTVFV 656

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G  + +          AIL   YPGQAGG A+ADVLFG  NP
Sbjct: 657 NFSGSAMAI--VPETQSCDAILQAWYPGQAGGTAVADVLFGDYNP 699


>gi|336415490|ref|ZP_08595829.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
           3_8_47FAA]
 gi|335940369|gb|EGN02236.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
           3_8_47FAA]
          Length = 863

 Score =  221 bits (562), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 120/301 (39%), Positives = 172/301 (57%), Gaps = 13/301 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT W+PNVNIFRDPRWGRGQET GEDP L+G+   + VRGLQG   +   K+ AC KH+
Sbjct: 133 GLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLHACAKHF 192

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  ++ +DL +TY   FK  V +  V  VMC+YN+  G P C 
Sbjct: 193 AVHSGPEWN---RHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCG 249

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE--EAAADAIKAGLDLDCGPF 183
              +L   +   W   G +V+DC ++G  +  + +   P+   A+ADA+  G DL+CG  
Sbjct: 250 SNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLNGTDLECGGN 309

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
               T+ AV+ GL+ EE +N ++   +  +  LG  +      P+ N+    +  P H++
Sbjct: 310 FKSITD-AVKKGLISEEKINTSVKRLLKARFELGEMN---PTHPWSNIPYSVINCPKHKE 365

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ AH+ +VLL+N    LPL+  R   VAVIGPN++ +V   GNY G      T L+G
Sbjct: 366 LALKMAHESLVLLQNKNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGFPSHTVTLLEG 423

Query: 304 I 304
           I
Sbjct: 424 I 424



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 49/105 (46%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  +   G+   +E E +          DR  + LP  Q+E+++ + K  +   V V
Sbjct: 598 KNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK-KTVFV 656

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G  + +          AIL   YPGQAGG A+ADVLFG  NP
Sbjct: 657 NFSGSAMAI--VPETQSCDAILQAWYPGQAGGTAVADVLFGNYNP 699


>gi|423344787|ref|ZP_17322476.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
           CL03T12C32]
 gi|409224378|gb|EKN17311.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
           CL03T12C32]
          Length = 866

 Score =  220 bits (561), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 128/340 (37%), Positives = 185/340 (54%), Gaps = 26/340 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+W+PN+NIFRDPRWGRG ET GEDP LT +   + V+GLQG+     K  AC KHY 
Sbjct: 131 GLTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGLAVVKGLQGDDPKYFKTHACAKHYA 190

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +    WN   R+ F+  V+ +DL  TY   F+A V +G V  VMC+YN+  GKP C+  
Sbjct: 191 VHSGPEWN---RHEFDVTVTPRDLWQTYLPAFEALVKKGNVQEVMCAYNRYQGKPCCSSD 247

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--------EEAAADAIKAGLDLD 179
            +L + +   W  +  I+SDC ++   +  Q   RTP        E A+ADA+  G DL+
Sbjct: 248 KLLIDILRNSWGYENIILSDCGAINDFW--QRDERTPRHETHPDAESASADAVLNGTDLE 305

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG       + A++ G + E D++++L   +  +  LGMFD +    P+  +    V +P
Sbjct: 306 CGNSYKALIK-ALKEGKISENDLDVSLRRLLKGRFELGMFDPDERV-PYAQIPYNVVESP 363

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
            H   AL+ AH+ +VLLKN   TLPLS T+R   +AV+GPN+  +  +  NY G      
Sbjct: 364 EHVAQALEMAHKSMVLLKNKNNTLPLSKTIRK--IAVVGPNAADSTMLWANYNGFPTHTV 421

Query: 299 TPLQGISRY---AKTIHQAGC-----FGVACNGNQLIGAA 330
           T L+GI       + I++ GC     F +   GN +   A
Sbjct: 422 TILEGIRNKVPDTEVIYELGCNHAADFVIQDLGNHITSPA 461



 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 40/113 (35%), Positives = 55/113 (48%), Gaps = 13/113 (11%)

Query: 328 GAAEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKAS 377
            A     + AD  V V G+   +E E +          DR  + +P  QQE+V +  KA+
Sbjct: 594 AATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIEIPKVQQEMV-KALKAT 652

Query: 378 RGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
             PVV VL  G  + +++   D  I AIL   Y GQ  G A+AD+LFG  NP 
Sbjct: 653 GKPVVYVLCTGSALALNW--EDANIDAILNAWYGGQEAGTAVADILFGDYNPS 703


>gi|365118446|ref|ZP_09337032.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363649697|gb|EHL88801.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 1283

 Score =  220 bits (561), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 146/433 (33%), Positives = 227/433 (52%), Gaps = 31/433 (7%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL 71
           WSP +++ RD RWGR +ET GEDP L G+   ++++G QG       + AC KH+  +  
Sbjct: 169 WSPVLDVARDARWGRVEETYGEDPYLVGRIGVAWIKGFQGE-----HMFACPKHFAGHG- 222

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
               G D + +   +S + + + +  PF+  + E     VM +Y   NG P     ++L+
Sbjct: 223 QPVGGRDSHDYG--LSDRVMRNIHLAPFRDVIKEANAFGVMAAYGLWNGVPDNGSKELLQ 280

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEGA 191
             +  +W  +G++VSDC     +   Q    T EEAAA A++AG+D++CG         A
Sbjct: 281 KILREEWGFEGFVVSDCSGPENIQRKQSVVGTMEEAAAMAVRAGVDIECGSAYKKALASA 340

Query: 192 VRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQ 251
           V+ G+++E +++  L      +MRLG+FD         N  P +  TP H+ LA + A +
Sbjct: 341 VKKGIIKESELDANLRRVFRAKMRLGLFDRPSIENMVWNKLP-EYDTPEHRALARKVAVK 399

Query: 252 GIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY-AGVACG-YTTPLQGISRY-- 307
             VLLKN    LPL      T+AVIGPN+D   T  G+Y A  A G   + L+G+  +  
Sbjct: 400 STVLLKNENNLLPLDK-NIKTIAVIGPNADQGQT--GDYSAKYAPGQIISVLEGVKNHVS 456

Query: 308 --AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLD---------QSIEAEFID 356
              K ++  GC  +  +      A  + A+QADA +LV+G +         +S   E +D
Sbjct: 457 PSTKVLYAQGCTQLDMDTTGFAEAVNI-AKQADAVILVVGDNSNRHENGNKKSTTGENVD 515

Query: 357 RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGG 416
            A L +PG Q++L+  V +A+  PVVLVL+ G P  +++   D  I +IL   YPG+ GG
Sbjct: 516 GATLEIPGVQRQLIKAV-EATGKPVVLVLVNGKPFTLTW--EDENIESILETWYPGEEGG 572

Query: 417 AAIADVLFGRANP 429
            A AD++FG  NP
Sbjct: 573 NATADIIFGDENP 585


>gi|160885419|ref|ZP_02066422.1| hypothetical protein BACOVA_03419 [Bacteroides ovatus ATCC 8483]
 gi|156109041|gb|EDO10786.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
          Length = 861

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 125/316 (39%), Positives = 184/316 (58%), Gaps = 16/316 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT+W+PNVNIFRDPRWGRGQET GEDP LTG+   + VRGLQG   +R  K+ AC KH+
Sbjct: 131 GLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAKHF 190

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ F+A  +  +DL +TY   FK  V +  V  VMC+YN+  G+P C 
Sbjct: 191 AVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCG 247

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +  +W  +G +VSDC ++   Y    +   P  E A+A A++AG DL+CG  
Sbjct: 248 SNRLLMQILRDEWGYEGIVVSDCGAISDFYRPGTHGTHPDKEHASAGAVRAGTDLECGSE 307

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A   + AV+ GL+ E++++++L   +T +  LG  D E SA  +  +    + +  HQ 
Sbjct: 308 YASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD-EQSA--WSEIPTSVLNSKEHQA 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ A + +VLL+N    LPL+T  H  VAV+GPN++ +V   GNY G+     T L+ 
Sbjct: 364 LALRMARESLVLLQNKNNILPLNT--HLKVAVMGPNANDSVMQWGNYNGIPAHTVTLLEA 421

Query: 304 ISRY---AKTIHQAGC 316
           +       + I++ GC
Sbjct: 422 VRAKLPEGQIIYEPGC 437



 Score = 40.4 bits (93), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 13/104 (12%)

Query: 337 ADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLM 386
           AD  +   G+  S+E E +          DR  + LP  Q++L+  + KA +  VV +  
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK-VVFINY 656

Query: 387 CGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            G  +            AIL   YPGQAGG AI D L+G  NPG
Sbjct: 657 SGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|298386950|ref|ZP_06996504.1| beta-glucosidase [Bacteroides sp. 1_1_14]
 gi|298260100|gb|EFI02970.1| beta-glucosidase [Bacteroides sp. 1_1_14]
          Length = 846

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 114/285 (40%), Positives = 172/285 (60%), Gaps = 8/285 (2%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+G    ++V+GLQG+    LK  +  KH+ A
Sbjct: 131 LTFWSPTVNMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQGDHPRYLKSVSTPKHFAA 190

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R++ +A +++ D+ + Y   F+ C+ EGK  S+M +YN +NG P  A+  
Sbjct: 191 NNEEH----NRFYCDAAITETDMREYYLPAFEKCIREGKAESIMTAYNAINGVPCTANNW 246

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPF-LAIH 187
           +L   +   W  +GYIVSDC + G+L     Y +TPE AA  AIKAGLDL+CG +     
Sbjct: 247 LLNKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVKTPEAAAMIAIKAGLDLECGDYVFGAP 306

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              A +  ++   +++ A  + +  +MRLGMFD +P   P+ +L P  V    H++LAL+
Sbjct: 307 LLNAYKQYMVSTAEIDSAAYHVLRARMRLGMFD-DPEKNPYNHLSPEIVGCEKHKELALE 365

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAG 292
           AA Q IVLLKN   TLPL+  +  ++AV+G N+       G+Y+G
Sbjct: 366 AARQSIVLLKNQKNTLPLNAKKIKSIAVVGINA--ANCEFGDYSG 408



 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 43/105 (40%), Positives = 60/105 (57%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           + G A    R++D  + VMG++QSIE E  DR+ + LP  QQ  +    KA+   +V VL
Sbjct: 586 MYGDASKVIRESDVVIAVMGINQSIEREGQDRSSIELPKDQQIFIREAYKANPNTIV-VL 644

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + V +   D  I AI+   YPG+ GG AIA+VLFG  NP 
Sbjct: 645 VAGSSMAVGWM--DQNIPAIIDAWYPGEQGGTAIAEVLFGDYNPA 687


>gi|372209036|ref|ZP_09496838.1| glycoside hydrolase [Flavobacteriaceae bacterium S85]
          Length = 859

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 148/440 (33%), Positives = 231/440 (52%), Gaps = 23/440 (5%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVA 60
           + N G   +  +SP VN+ RDPRWGR  E   EDP L  + A  YVRG+QGN    +K  
Sbjct: 114 LKNAGKKEVMMFSPTVNMARDPRWGRNGECYAEDPHLMSEMARMYVRGMQGNDPKYVKTV 173

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
              KHY A +++      R   ++ + K+DL + Y   +K C+V+ +   +M + N +NG
Sbjct: 174 TTVKHYVANNVE----TKREWIHSNIGKKDLYEYYFPAYKTCIVDEEATGIMTALNGLNG 229

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
            P  A   ++   +  +W   GY+++D  +V  L     Y  +  +AAA AIKAG+D +C
Sbjct: 230 IPCSAHDWLVNGVLRNEWGFKGYVIADWAAVQGLEKRMKYASSQAQAAAMAIKAGVDQEC 289

Query: 181 -------GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
                   P +    + A++ GL+ E+++++ +   + ++   G FD +PS  P+  +  
Sbjct: 290 FRNKVRQAPMVQALPD-ALQQGLITEKELDVTVKRLLRLRFMTGDFD-DPSLNPYSAIPT 347

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
             +   AH+QLAL+AA Q IVLLKN A  LPL      ++A+IGP +D     +G Y+G 
Sbjct: 348 SVLECDAHKQLALKAAEQSIVLLKNDA-VLPLKK-DLKSIAMIGPFAD--RCWMGIYSGH 403

Query: 294 ACGYTTPLQGISRY--AKTIHQAGCFGVAC-NGNQLIGAAEVAARQADATVLVMGLDQSI 350
                +PL GI  Y  AK     GC   A  +  Q I  A   A++++  +LV+G D++ 
Sbjct: 404 PKSKVSPLDGIKAYTNAKVSFAQGCEVTAKEDDEQKIAEAVALAKKSEQVILVVGNDETT 463

Query: 351 EAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGY 410
             E  DR  + LPG Q +L+  V   ++  V+LVL+  GP  V++ + +  I  I+    
Sbjct: 464 STENTDRKSIKLPGNQHQLIKAVQAVNKN-VILVLVPSGPTAVTWEQKN--IPGIVCAWP 520

Query: 411 PGQAGGAAIADVLFGRANPG 430
            GQ  G A+A VLFG  NPG
Sbjct: 521 NGQEQGTALAKVLFGDVNPG 540


>gi|440733337|ref|ZP_20913088.1| beta-glucosidase [Xanthomonas translucens DAR61454]
 gi|440362904|gb|ELQ00083.1| beta-glucosidase [Xanthomonas translucens DAR61454]
          Length = 895

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 125/301 (41%), Positives = 173/301 (57%), Gaps = 13/301 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++ GLQG+  +  +  A  KH 
Sbjct: 141 AGLTIWSPNINIFRDPRWGRGMETYGEDPYLTGQLAVGFIHGLQGDDLTHPRTIATPKHL 200

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS  DLE TY   F+A +V+G+  SVMC+YN ++G P C
Sbjct: 201 AVH-----SGPEPGRHGFDVDVSPHDLEATYTPAFRAAIVDGRAGSVMCAYNALHGTPAC 255

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFL 184
           A   +L   + G W   G++VSDCD+V  +    ++      ++A A+KAG DL+CG + 
Sbjct: 256 AADWLLNGRLRGDWGFTGFVVSDCDAVDDMTQFHYFRADNAGSSAAALKAGHDLNCG-YA 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
                 A+  G   E  ++ +L      + RLG    +    P+  LG +DV + AH+ L
Sbjct: 315 YRDLGKAIARGDADEALLDQSLVRLFAARYRLGELQPQ-RKDPYAQLGAKDVDSAAHRAL 373

Query: 245 ALQAAHQGIVLLKNSARTLPLST-LRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           ALQAA Q IVLL+N   TLPL   LR   +AVIGPN+D    +  NY G +    TPL G
Sbjct: 374 ALQAAQQSIVLLQNRNATLPLRPGLR---LAVIGPNADALAALEANYQGTSAAPVTPLLG 430

Query: 304 I 304
           +
Sbjct: 431 L 431



 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 39/75 (52%), Positives = 47/75 (62%), Gaps = 3/75 (4%)

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  L LP  QQ L+ R AKAS  P+V+VLM G  V +++AK      AI+   YPGQ+G
Sbjct: 653 DRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKQ--HADAIVAAWYPGQSG 709

Query: 416 GAAIADVLFGRANPG 430
           G AIA VL G  NPG
Sbjct: 710 GTAIAQVLAGDVNPG 724


>gi|330996729|ref|ZP_08320604.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
           11841]
 gi|329572574|gb|EGG54217.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
           11841]
          Length = 852

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 124/305 (40%), Positives = 179/305 (58%), Gaps = 13/305 (4%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+G    ++V+GLQGN    LK  +  KH+  
Sbjct: 137 LTFWSPTVNMARDPRWGRTPETYGEDPYLSGVLGVAFVKGLQGNHPRYLKTVSTPKHFAV 196

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R   NA+VS++DL + Y   F+ C+ EGK  S+M +YN VN  P   +  
Sbjct: 197 NNEEH----NRSSCNAKVSERDLREYYLPSFERCITEGKAQSIMMAYNAVNDVPCTVNTY 252

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHT 188
           ++KN + G W  +GYIVSDC +   +    HY +T E AA  A+KAGLDL+CG    ++ 
Sbjct: 253 LIKNVLRGDWGFNGYIVSDCSAPEWMITKHHYVKTREAAATLAVKAGLDLECGN--QVYG 310

Query: 189 EG---AVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           EG   A R  ++ E D++ A    +  +M LG+FD +PS  P+  + P  V   AHQ LA
Sbjct: 311 EGLLKAYRQYMVSEADIDSAAYRILRGRMMLGLFD-DPSQNPYNQIEPSVVGCKAHQDLA 369

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY-TTPLQGI 304
           L+AA Q +VLLKN    LPL+  +  ++AV+G ++       G+Y+G       T L GI
Sbjct: 370 LEAARQSMVLLKNKDNFLPLNPQKVKSIAVVGISAG--HCEFGDYSGTPKNEPVTILDGI 427

Query: 305 SRYAK 309
            +YA+
Sbjct: 428 KQYAE 432



 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 45/103 (43%), Positives = 61/103 (59%), Gaps = 3/103 (2%)

Query: 328 GAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMC 387
           G A   A + D TV V+G+++SIE E  DR  L LP  QQE +  + K +   VV VL+ 
Sbjct: 595 GDAGKVAAECDVTVAVLGINKSIEREGQDRFTLELPIDQQEFIKELYKVNPNTVV-VLVA 653

Query: 388 GGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           G  + V++   D  + AIL   YPG+ GG A+A+VLFG  NPG
Sbjct: 654 GSSLAVNWM--DENVPAILNAWYPGEQGGNAVAEVLFGDYNPG 694


>gi|424792251|ref|ZP_18218496.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
 gi|422797157|gb|EKU25539.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
          Length = 909

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 125/301 (41%), Positives = 173/301 (57%), Gaps = 13/301 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++ GLQG+  +  +  A  KH 
Sbjct: 155 AGLTIWSPNINIFRDPRWGRGMETYGEDPYLTGQLAVGFIHGLQGDDLTHPRTIATPKHL 214

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS  DLE TY   F+A +V+G+  SVMC+YN ++G P C
Sbjct: 215 AVH-----SGPEPGRHGFDVDVSPHDLEATYTPAFRAAIVDGRAGSVMCAYNALHGTPAC 269

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFL 184
           A   +L   + G W   G++VSDCD+V  +    ++      ++A A+KAG DL+CG + 
Sbjct: 270 AADWLLNGRLRGDWGFTGFVVSDCDAVDDMTQFHYFRADNAGSSAAALKAGHDLNCG-YA 328

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
                 A+  G   E  ++ +L      + RLG    +    P+  LG +DV + AH+ L
Sbjct: 329 YRDLGKAIARGDADEALLDKSLVRLFAARYRLGELQPQ-RKDPYARLGAKDVDSAAHRAL 387

Query: 245 ALQAAHQGIVLLKNSARTLPLST-LRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           ALQAA Q IVLL+N   TLPL   LR   +AVIGPN+D    +  NY G +    TPL G
Sbjct: 388 ALQAAQQSIVLLQNRNATLPLRPGLR---LAVIGPNADALAALEANYQGTSAAPVTPLLG 444

Query: 304 I 304
           +
Sbjct: 445 L 445



 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 39/75 (52%), Positives = 47/75 (62%), Gaps = 3/75 (4%)

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  L LP  QQ L+ R AKAS  P+V+VLM G  V +++AK      AI+   YPGQ+G
Sbjct: 667 DRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKQ--HADAIVAAWYPGQSG 723

Query: 416 GAAIADVLFGRANPG 430
           G AIA VL G  NPG
Sbjct: 724 GTAIAQVLAGDVNPG 738


>gi|295086418|emb|CBK67941.1| Beta-glucosidase-related glycosidases [Bacteroides xylanisolvens
           XB1A]
          Length = 861

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 125/316 (39%), Positives = 183/316 (57%), Gaps = 16/316 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT+W+PNVNIFRDPRWGRGQET GEDP LTG+   + VRGLQG   +R  K+ AC KH+
Sbjct: 131 GLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAKHF 190

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ F+A  +  +DL +TY   FK  V +  V  VMC+YN+  G+P C 
Sbjct: 191 AVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCG 247

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +  +W   G +VSDC ++   Y    +   P  E A+A A++AG DL+CG  
Sbjct: 248 SNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPDKEHASAGAVRAGTDLECGSE 307

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A   + AV+ GL+ E++++++L   +T +  LG  D E SA  +  +    + +  HQ 
Sbjct: 308 YASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD-EQSA--WSEIPTSVLNSKEHQA 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ A + +VLL+N    LPL+T  H  VAV+GPN++ +V   GNY G+     T L+ 
Sbjct: 364 LALRMARESLVLLQNKNNILPLNT--HLKVAVMGPNANDSVMQWGNYNGIPAHTVTLLEA 421

Query: 304 ISRY---AKTIHQAGC 316
           +       + I++ GC
Sbjct: 422 VRAKLPEGQIIYEPGC 437



 Score = 40.4 bits (93), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 13/104 (12%)

Query: 337 ADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLM 386
           AD  +   G+  S+E E +          DR  + LP  Q++L+  + KA +  VV +  
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK-VVFINY 656

Query: 387 CGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            G  +            AIL   YPGQAGG AI D L+G  NPG
Sbjct: 657 SGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|393781488|ref|ZP_10369683.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
           CL02T12C01]
 gi|392676551|gb|EIY69983.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
           CL02T12C01]
          Length = 850

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 123/317 (38%), Positives = 179/317 (56%), Gaps = 19/317 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT W+PNVNIFRDPRWGRGQET GEDP LT +   + V+GLQG   +R  K+ AC KH+
Sbjct: 121 GLTMWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPDSARYDKLHACAKHF 180

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  ++ +DL +TY   FK  V E  V  VMC+YN+  G P C 
Sbjct: 181 AVHSGPEWN---RHSFNAENIAPRDLWETYLPAFKTLVQEADVKEVMCAYNRFEGDPCCG 237

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE--EAAADAIKAGLDLDCGPF 183
              +L   +  +W  +G +VSDC ++   +  + +   P+   A+ADA+ +G DL+CG  
Sbjct: 238 SNRLLTQILRDEWGFNGIVVSDCGAISDFWGAKKHNTHPDAAHASADAVLSGTDLECGSN 297

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLG-MFDGEPSAQPFGNLGPRDVCTPAHQ 242
               T+ AV+ G++ EE +++++   +  +  LG M +  P A P+       V  P H+
Sbjct: 298 YRKLTD-AVKAGIISEEQIDISVKRLLKARFELGEMEESHPWALPYS-----IVDCPEHR 351

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
            LALQ AH+ + LL+N    LPL   +H  VAVIGPN++ +V   GNY G     +T L 
Sbjct: 352 HLALQIAHETMTLLQNKENILPLD--KHAKVAVIGPNANDSVMQWGNYNGTPSHTSTLLS 409

Query: 303 GISRY---AKTIHQAGC 316
            +      A+ I++  C
Sbjct: 410 ALRSKLPAAQLIYEPVC 426



 Score = 47.4 bits (111), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 3/74 (4%)

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  + LP  Q+ +++ + KA +  V+ V   G  +  +         AIL   YPGQ G
Sbjct: 616 DRTDIELPAVQRNVLAALKKAGKK-VIFVNFSGSAM--ALTPETENCDAILQAWYPGQEG 672

Query: 416 GAAIADVLFGRANP 429
           G A+ADVLFG  NP
Sbjct: 673 GTAVADVLFGDYNP 686


>gi|237719778|ref|ZP_04550259.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
 gi|229451047|gb|EEO56838.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
          Length = 861

 Score =  219 bits (559), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 125/316 (39%), Positives = 183/316 (57%), Gaps = 16/316 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT+W+PNVNIFRDPRWGRGQET GEDP LTG+   + VRGLQG   +R  K+ AC KH+
Sbjct: 131 GLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAKHF 190

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ F+A  +  +DL +TY   FK  V +  V  VMC+YN+  G+P C 
Sbjct: 191 AVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCG 247

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +  +W   G +VSDC ++   Y    +   P  E A+A A++AG DL+CG  
Sbjct: 248 SNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETYPDKEHASAGAVRAGTDLECGSE 307

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A   + AV+ GL+ E++++++L   +T +  LG  D E SA  +  +    + +  HQ 
Sbjct: 308 YASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD-EQSA--WSEIPTSVLNSKEHQA 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ A + +VLL+N    LPL+T  H  VAV+GPN++ +V   GNY G+     T L+ 
Sbjct: 364 LALRMARESLVLLQNKNNILPLNT--HLKVAVMGPNANDSVMQWGNYNGIPAHTVTLLEA 421

Query: 304 ISRY---AKTIHQAGC 316
           +       + I++ GC
Sbjct: 422 VRAKLPEGQIIYEPGC 437



 Score = 40.4 bits (93), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 13/104 (12%)

Query: 337 ADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLM 386
           AD  +   G+  S+E E +          DR  + LP  Q++L+  + KA +  VV +  
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK-VVFINY 656

Query: 387 CGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            G  +            AIL   YPGQAGG AI D L+G  NPG
Sbjct: 657 SGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|150007848|ref|YP_001302591.1| glycoside hydrolase family protein [Parabacteroides distasonis ATCC
           8503]
 gi|301310124|ref|ZP_07216063.1| beta-glucosidase [Bacteroides sp. 20_3]
 gi|423336365|ref|ZP_17314112.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
           CL09T03C24]
 gi|149936272|gb|ABR42969.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Parabacteroides distasonis ATCC 8503]
 gi|300831698|gb|EFK62329.1| beta-glucosidase [Bacteroides sp. 20_3]
 gi|409240840|gb|EKN33614.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
           CL09T03C24]
          Length = 868

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 124/319 (38%), Positives = 177/319 (55%), Gaps = 17/319 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+W+PN+NIFRDPRWGRG ET GEDP LT K   +  RGLQG+  +  K  AC KHY 
Sbjct: 133 GLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPNYYKTHACAKHYA 192

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +    WN   R+ FNA  + +DL +TY   F+A V EG V  VMC+YN+  GKP C+  
Sbjct: 193 VHSGPEWN---RHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSD 249

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLY----NTQHYTRTP--EEAAADAIKAGLDLDC- 180
            +L + +   W  D  I+SDC ++   +    NT  +   P  E A+ADA+  G DL+C 
Sbjct: 250 KLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECG 309

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           G + A++   A+  G + E+D++++L   +  +  LGMFD +    P+  +    V +P 
Sbjct: 310 GSYRALNK--ALADGKISEKDLDVSLRRLLKGRFELGMFDPDERV-PYSKIPYSVVESPE 366

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H   AL  A + IVLLKN    LPL       +AV+GPN+  +  +  NY G      T 
Sbjct: 367 HIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYNGFPTKTVTI 425

Query: 301 LQGISRY---AKTIHQAGC 316
           ++GI      A+ I++ GC
Sbjct: 426 VEGIRNKVPNAEVIYELGC 444



 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 56/106 (52%), Gaps = 13/106 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  V V G+   +E E +          DR  + +P  Q+E+V  +  A+  PVV V
Sbjct: 603 KDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV-ATGKPVVYV 661

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +  G  + +++ +ND  + AIL   Y GQ GG A+ADVLFG  NP 
Sbjct: 662 VCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPA 705


>gi|262381651|ref|ZP_06074789.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           2_1_33B]
 gi|262296828|gb|EEY84758.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           2_1_33B]
          Length = 868

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 124/319 (38%), Positives = 177/319 (55%), Gaps = 17/319 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+W+PN+NIFRDPRWGRG ET GEDP LT K   +  RGLQG+  +  K  AC KHY 
Sbjct: 133 GLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPNYYKTHACAKHYA 192

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +    WN   R+ FNA  + +DL +TY   F+A V EG V  VMC+YN+  GKP C+  
Sbjct: 193 VHSGPEWN---RHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSD 249

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLY----NTQHYTRTP--EEAAADAIKAGLDLDC- 180
            +L + +   W  D  I+SDC ++   +    NT  +   P  E A+ADA+  G DL+C 
Sbjct: 250 KLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECG 309

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           G + A++   A+  G + E+D++++L   +  +  LGMFD +    P+  +    V +P 
Sbjct: 310 GSYRALNK--ALADGKISEKDLDVSLRRLLKGRFELGMFDPDERV-PYSKIPYSVVESPE 366

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H   AL  A + IVLLKN    LPL       +AV+GPN+  +  +  NY G      T 
Sbjct: 367 HIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYNGFPSKTVTI 425

Query: 301 LQGISRY---AKTIHQAGC 316
           ++GI      A+ I++ GC
Sbjct: 426 VEGIRNKVPNAEVIYELGC 444



 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 56/106 (52%), Gaps = 13/106 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  V V G+   +E E +          DR  + +P  Q+E+V  +  A+  PVV V
Sbjct: 603 KDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV-ATGKPVVYV 661

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +  G  + +++ +ND  + AIL   Y GQ GG A+ADVLFG  NP 
Sbjct: 662 VCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPA 705


>gi|256840106|ref|ZP_05545615.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
           D13]
 gi|256739036|gb|EEU52361.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
           D13]
          Length = 868

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 124/319 (38%), Positives = 177/319 (55%), Gaps = 17/319 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+W+PN+NIFRDPRWGRG ET GEDP LT K   +  RGLQG+  +  K  AC KHY 
Sbjct: 133 GLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPNYYKTHACAKHYA 192

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +    WN   R+ FNA  + +DL +TY   F+A V EG V  VMC+YN+  GKP C+  
Sbjct: 193 VHSGPEWN---RHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSD 249

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLY----NTQHYTRTP--EEAAADAIKAGLDLDC- 180
            +L + +   W  D  I+SDC ++   +    NT  +   P  E A+ADA+  G DL+C 
Sbjct: 250 KLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECG 309

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           G + A++   A+  G + E+D++++L   +  +  LGMFD +    P+  +    V +P 
Sbjct: 310 GSYRALNK--ALADGKISEKDLDVSLRRLLKGRFELGMFDPDERV-PYSKIPYSVVESPE 366

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H   AL  A + IVLLKN    LPL       +AV+GPN+  +  +  NY G      T 
Sbjct: 367 HIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYNGFPSKTVTI 425

Query: 301 LQGISRY---AKTIHQAGC 316
           ++GI      A+ I++ GC
Sbjct: 426 VEGIRNKVPNAEVIYELGC 444



 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 56/106 (52%), Gaps = 13/106 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  V V G+   +E E +          DR  + +P  Q+E+V  +  A+  PVV V
Sbjct: 603 KDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV-ATGKPVVYV 661

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +  G  + +++ +ND  + AIL   Y GQ GG A+ADVLFG  NP 
Sbjct: 662 VCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPA 705


>gi|298376791|ref|ZP_06986746.1| beta-glucosidase [Bacteroides sp. 3_1_19]
 gi|298266669|gb|EFI08327.1| beta-glucosidase [Bacteroides sp. 3_1_19]
          Length = 868

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 124/319 (38%), Positives = 177/319 (55%), Gaps = 17/319 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+W+PN+NIFRDPRWGRG ET GEDP LT K   +  RGLQG+  +  K  AC KHY 
Sbjct: 133 GLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPNYYKTHACAKHYA 192

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +    WN   R+ FNA  + +DL +TY   F+A V EG V  VMC+YN+  GKP C+  
Sbjct: 193 VHSGPEWN---RHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSD 249

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLY----NTQHYTRTP--EEAAADAIKAGLDLDC- 180
            +L + +   W  D  I+SDC ++   +    NT  +   P  E A+ADA+  G DL+C 
Sbjct: 250 KLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECG 309

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           G + A++   A+  G + E+D++++L   +  +  LGMFD +    P+  +    V +P 
Sbjct: 310 GSYRALNK--ALADGKISEKDLDVSLRRLLKGRFELGMFDPDERV-PYSKIPYSVVESPE 366

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H   AL  A + IVLLKN    LPL       +AV+GPN+  +  +  NY G      T 
Sbjct: 367 HIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYNGFPSKTVTI 425

Query: 301 LQGISRY---AKTIHQAGC 316
           ++GI      A+ I++ GC
Sbjct: 426 VEGIRNKVPNAEVIYELGC 444



 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 56/106 (52%), Gaps = 13/106 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  V V G+   +E E +          DR  + +P  Q+E+V  +  A+  PVV V
Sbjct: 603 KDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV-ATGKPVVYV 661

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +  G  + +++ +ND  + AIL   Y GQ GG A+ADVLFG  NP 
Sbjct: 662 VCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPA 705


>gi|254786805|ref|YP_003074234.1| glycoside hydrolase family 3 domain-containing protein
           [Teredinibacter turnerae T7901]
 gi|237686035|gb|ACR13299.1| glycoside hydrolase family 3 domain protein [Teredinibacter
           turnerae T7901]
          Length = 888

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 129/309 (41%), Positives = 180/309 (58%), Gaps = 20/309 (6%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPN+NIFRDPRWGRGQET GEDP LTG+ A  Y+ GLQG     LK AA  KH+ 
Sbjct: 152 GLTFWSPNINIFRDPRWGRGQETYGEDPYLTGELALPYISGLQGENPKYLKTAAMAKHFA 211

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +         R+  N   S +DL +TY   F+  VVEG V SVMC+YN+VN +P C + 
Sbjct: 212 VHSGPE---KSRHSDNYIASPKDLNETYLPAFEKAVVEGDVESVMCAYNRVNDEPACGND 268

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLY--NTQHYTRTPEEAAADAIKAGLDLDCG---- 181
            +LK T+ G+W   G++VSDC ++   Y     H    P  AAA A+++G DL+CG    
Sbjct: 269 MLLKETLRGKWGFKGHVVSDCGAIADFYAPEAHHVVMAPAAAAAWAVRSGTDLNCGTDRL 328

Query: 182 -PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQ-PFGNLGPRDVC-T 238
             F  +H   A++  ++ +++++ ++   +  + +LGMFD  P  Q P+  + P DV  +
Sbjct: 329 STFANLHF--ALQREMITQDEIDQSVKRLMKTRFKLGMFD--PDDQVPYSKI-PMDVVGS 383

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
            AH  L  +AA +  VLLKNS   LPL   +   VA+IGPN+     ++GNY G      
Sbjct: 384 QAHLALTQKAAEKSFVLLKNSG-ILPLK--KSSKVAIIGPNATNPTVLVGNYFGDPIKPV 440

Query: 299 TPLQGISRY 307
           TPL GI +Y
Sbjct: 441 TPLDGIQQY 449



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 53/88 (60%), Gaps = 6/88 (6%)

Query: 345 GLDQSIEAEFID---RAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPR 401
           G + S+E E  D   R  + LP  Q++L++ + K ++ P+VLV   G  + +++A N+  
Sbjct: 636 GEEMSVEIEGFDHGDRTDIRLPEPQRKLLATLKKLNK-PIVLVNFSGSAIALNWANNN-- 692

Query: 402 IGAILWVGYPGQAGGAAIADVLFGRANP 429
           + AIL   YPG+A G A+A +L+G  +P
Sbjct: 693 VDAILQGFYPGEATGTALARILWGEVSP 720


>gi|288927072|ref|ZP_06420962.1| beta-glucosidase [Prevotella buccae D17]
 gi|288336152|gb|EFC74543.1| beta-glucosidase [Prevotella buccae D17]
          Length = 866

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 121/313 (38%), Positives = 175/313 (55%), Gaps = 24/313 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR----------- 56
           G++ W+PN+NIFRDPRWGRGQET GEDP LT +   + V GLQG    R           
Sbjct: 130 GVSIWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDMRPFTERPRY 189

Query: 57  LKVAACCKHYTAYDLDNWNGVDRYHFNA-RVSKQDLEDTYNVPFKACVVEGKVASVMCSY 115
            K  AC KHY  +    WN   R+ F+  R+ ++DL +TY   FK+ V EG V  VMC+Y
Sbjct: 190 YKTLACAKHYAVHSGPEWN---RHVFDVERLPERDLWETYLPAFKSLVQEGNVREVMCAY 246

Query: 116 NQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHY--TRTPEEAAADAIK 173
            +++G P C +   L   + G+W  +G +VSDC ++   Y   H+    TP EA+A  ++
Sbjct: 247 QRIDGSPCCGNTRYLHQILRGEWEYNGLVVSDCGAISDFYREGHHHVVETPAEASAMGVR 306

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AG D++CG   A     AV  GL+  E ++ ++   +  +  +G FD E    P+   GP
Sbjct: 307 AGTDVECGAVYATLPR-AVEQGLISREAIDTSVVRLLKARFEVGDFDSEKLV-PWKLTGP 364

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLST--LRHHTVAVIGPNSDVTVTMIGNYA 291
             + +  H++LAL  A + + LL+N  R LPLS   LR   +AV+GPN++ +V + GNY 
Sbjct: 365 EVIASETHRRLALDMARESMTLLQNRNRLLPLSKNGLR---IAVMGPNANDSVMLWGNYT 421

Query: 292 GVACGYTTPLQGI 304
           G     TT L+GI
Sbjct: 422 GYPISTTTILKGI 434



 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 40/115 (34%), Positives = 60/115 (52%), Gaps = 15/115 (13%)

Query: 327 IGAAEVAARQADATVLVM--GLDQSIEAEFI----------DRAGLLLPGRQQELVSRVA 374
           I A+E+AA+  DA V+V   G+   +E E +          DR  + LP  Q+E++  + 
Sbjct: 599 ITASEIAAQAGDADVVVFVGGISPRLEGEEMKVDAPGFKGGDRTSIELPEAQREVIRLLR 658

Query: 375 KASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +A  G +V+ + C G   V+         A+L   Y G+AGG A+ADVLFG  NP
Sbjct: 659 QA--GKLVVFVNCSGGA-VALVPETEACDAVLQAWYAGEAGGQAVADVLFGDYNP 710


>gi|423294294|ref|ZP_17272421.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
           CL03T12C18]
 gi|392675485|gb|EIY68926.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
           CL03T12C18]
          Length = 861

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 121/316 (38%), Positives = 182/316 (57%), Gaps = 16/316 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT+W+PNVNIFRDPRWGRGQET GEDP LTG+   + VRGLQG   ++  K+ AC KH+
Sbjct: 131 GLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDTKYDKLHACAKHF 190

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ F+A  +  +DL +TY   FK  V +  V  VMC+YN+  G+P C 
Sbjct: 191 AVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCG 247

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +  +W   G +VSDC ++   Y    +   P  E A+A A++ G DL+CG  
Sbjct: 248 SNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAAAVRTGTDLECGSE 307

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A   + AV+ GL+ E++++++L   +T +  LG  D +P+   +  +    + +  HQ 
Sbjct: 308 YASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPASVLNSKEHQA 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ A + +VLL+N    LPL+T  H  VAV+GPN++ +V   GNY G+     T L+ 
Sbjct: 364 LALRMARESLVLLQNKNNILPLNT--HLKVAVMGPNANDSVMQWGNYNGIPAHTVTLLEA 421

Query: 304 ISRY---AKTIHQAGC 316
           +       + I++ GC
Sbjct: 422 VRAKLPEGQIIYEPGC 437



 Score = 39.7 bits (91), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 47/104 (45%), Gaps = 13/104 (12%)

Query: 337 ADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLM 386
           AD  +   G+  S+E E +          DR  + LP  Q+ L+  + KA +  VV +  
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKAGKK-VVFINY 656

Query: 387 CGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            G  +            AIL   YPGQAGG AI D L+G  NPG
Sbjct: 657 SGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|384428895|ref|YP_005638255.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
 gi|341937998|gb|AEL08137.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
          Length = 888

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 123/303 (40%), Positives = 175/303 (57%), Gaps = 13/303 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++RGLQG+     +  A  KH 
Sbjct: 134 AGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLEHPRTIATPKHI 193

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS +D+E TY   F+A +VEG+  SVMC+YN ++G P C
Sbjct: 194 AVH-----SGPEPGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPAC 248

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PF 183
           A   +L   + G W   G++VSDCD+V  +    ++      ++A A+KAG DL+CG  +
Sbjct: 249 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGTAY 308

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A+ T  A+  G + E  ++ +L      + RLG     P    +  LG +D+    ++ 
Sbjct: 309 RALGT--AIERGEVDEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRA 365

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LALQAA + IVLLKN+  TLPL       +AVIGPN+D    +  NY G +    TPL G
Sbjct: 366 LALQAAAESIVLLKNANATLPLKA--STRLAVIGPNADALAALEANYQGTSSQPVTPLLG 423

Query: 304 ISR 306
           + +
Sbjct: 424 LRQ 426



 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 45/105 (42%), Positives = 58/105 (55%), Gaps = 13/105 (12%)

Query: 336 QADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           Q+DA V  +GL   +E E +          DR  + LP  QQ L+ R AKAS  P+V+VL
Sbjct: 616 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVL 674

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           M G  V +++AK      AI+   YPGQ+GG AIA  L G  NPG
Sbjct: 675 MSGSAVALNWAKT--HADAIVAAWYPGQSGGTAIARALAGDDNPG 717


>gi|409195436|ref|ZP_11224099.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
           21150]
          Length = 867

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 118/305 (38%), Positives = 174/305 (57%), Gaps = 10/305 (3%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN-TGSRLKVAAC 62
           G   GLT W+PN+N+FRDPRWGRG E  GEDP + G    + V+GLQG+ +G   K+ AC
Sbjct: 127 GRYQGLTMWTPNINVFRDPRWGRGMEAYGEDPFMNGVLGTAVVKGLQGDRSGKYDKLHAC 186

Query: 63  CKHYTAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            KHY  +    WN   R+ FNA  +  +DL +TY   FK  V++G V  VMC+YN+  G+
Sbjct: 187 AKHYAVHSGPEWN---RHSFNAENIRPRDLHETYLPAFKKLVIDGDVRMVMCAYNRFEGE 243

Query: 122 PTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE--EAAADAIKAGLDLD 179
           P C +  +L++ +  +W  DG +VSDC ++   +N   +   P+   A+ DA+ AG DL+
Sbjct: 244 PCCGNNQLLRDILRNEWGFDGVVVSDCWAINDFFNKDAHAMYPDAKTASTDAVLAGTDLN 303

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG       E AV  GL+ EE ++++L   +  +  LG  D +   + +  +    V +P
Sbjct: 304 CGDSYPSLVE-AVEQGLITEEQLDISLRRLLIARFELGEMDPDEEVE-WSKIPHSVVSSP 361

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
            H ++AL+AA + + LL N    LPL      TVAV+GPN++ ++   GNY G     TT
Sbjct: 362 THSEMALEAARKSMTLLMNKNGALPLKK-EGLTVAVMGPNANDSLMQWGNYNGTPATTTT 420

Query: 300 PLQGI 304
            LQGI
Sbjct: 421 ILQGI 425



 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 39/108 (36%), Positives = 57/108 (52%), Gaps = 15/108 (13%)

Query: 334 ARQADATVLVM--GLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPV 381
           A+ ADA V+V   G+   +E E +          DR  + LP  Q+E++  + KA  G  
Sbjct: 598 AKVADADVVVFASGISPFLEGEEMGVDLPGFKGGDRTDIALPAIQKEMLKALHKA--GKE 655

Query: 382 VLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++++ C G   + F +      AIL   YPGQAGG A+A+VLFG  NP
Sbjct: 656 IILVNCSGSA-IGFEEATDYSSAILQAWYPGQAGGQAVAEVLFGDYNP 702


>gi|336415363|ref|ZP_08595703.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
           3_8_47FAA]
 gi|335940959|gb|EGN02821.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
           3_8_47FAA]
          Length = 861

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 125/316 (39%), Positives = 183/316 (57%), Gaps = 16/316 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT+W+PNVNIFRDPRWGRGQET GEDP LTG+   + VRGLQG   +R  K+ AC KH+
Sbjct: 131 GLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAKHF 190

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ F+A  +  +DL +TY   FK  V +  V  VMC+YN+  G+P C 
Sbjct: 191 AVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCG 247

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +  +W   G +VSDC ++   Y    +   P  E A+A A++AG DL+CG  
Sbjct: 248 SNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAGAVRAGTDLECGSE 307

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A   + AV+ GL+ E++++++L   +T +  LG  D E SA  +  +    + +  HQ 
Sbjct: 308 YASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD-EQSA--WSEIPTSVLNSKEHQA 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ A + +VLL+N    LPL+T  H  VAV+GPN++ +V   GNY G+     T L+ 
Sbjct: 364 LALRMARESLVLLQNKNNILPLNT--HLKVAVMGPNANDSVMQWGNYNGIPAHTVTLLEA 421

Query: 304 ISRY---AKTIHQAGC 316
           +       + I++ GC
Sbjct: 422 VRAKLPEGQIIYEPGC 437



 Score = 47.8 bits (112), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 13/104 (12%)

Query: 337 ADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLM 386
           AD  +   G+  S+E E +          DR  + LP  Q++L+  + K  +  VV +  
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKVGK-KVVFINY 656

Query: 387 CGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            G  +            AIL   YPGQAGG AI D L+G  NPG
Sbjct: 657 SGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|262405256|ref|ZP_06081806.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294644754|ref|ZP_06722499.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294810589|ref|ZP_06769241.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
 gi|345508031|ref|ZP_08787672.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
 gi|229444722|gb|EEO50513.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
 gi|262356131|gb|EEZ05221.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|292639876|gb|EFF58149.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294442250|gb|EFG11065.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
          Length = 861

 Score =  219 bits (557), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 125/316 (39%), Positives = 183/316 (57%), Gaps = 16/316 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT+W+PNVNIFRDPRWGRGQET GEDP LTG+   + VRGLQG   +R  K+ AC KH+
Sbjct: 131 GLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLHACAKHF 190

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ F+A  +  +DL +TY   FK  V +  V  VMC+YN+  G+P C 
Sbjct: 191 AVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCG 247

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +  +W   G +VSDC ++   Y    +   P  E A+A A++AG DL+CG  
Sbjct: 248 SNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAGAVRAGTDLECGSE 307

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A   + AV+ GL+ E++++++L   +T +  LG  D E SA  +  +    + +  HQ 
Sbjct: 308 YASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD-EQSA--WSEIPTSVLNSKEHQA 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ A + +VLL+N    LPL+T  H  VAV+GPN++ +V   GNY G+     T L+ 
Sbjct: 364 LALRMARESLVLLQNKNNILPLNT--HLKVAVMGPNANDSVMQWGNYNGIPAHTVTLLEA 421

Query: 304 ISRY---AKTIHQAGC 316
           +       + I++ GC
Sbjct: 422 VRAKLPEGQIIYEPGC 437



 Score = 40.8 bits (94), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 13/104 (12%)

Query: 337 ADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLM 386
           AD  +   G+  S+E E +          DR  + LP  Q++L+  + KA +  VV +  
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK-VVFINY 656

Query: 387 CGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            G  +            AIL   YPGQAGG AI D L+G  NPG
Sbjct: 657 SGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|189464583|ref|ZP_03013368.1| hypothetical protein BACINT_00926 [Bacteroides intestinalis DSM
           17393]
 gi|189436857|gb|EDV05842.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 879

 Score =  219 bits (557), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 122/316 (38%), Positives = 183/316 (57%), Gaps = 14/316 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT W+PN+NIFRDPRWGRGQET GEDP LT +   + VRGLQG  G +  K+ AC KHY
Sbjct: 147 GLTMWTPNINIFRDPRWGRGQETYGEDPYLTSQMGMAVVRGLQGPEGEKYDKLHACAKHY 206

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  +  +DL +TY   FK  V +  V  VMC+YN+  G+P C 
Sbjct: 207 AVHSGPEWN---RHSFNAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCG 263

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEE--AAADAIKAGLDLDCGPF 183
              +L + +  +W     +VSDC ++   YN   +   P++  A+A A+ +G D++CG  
Sbjct: 264 SNRLLMHILRDEWGYKEIVVSDCWAISDFYNKGAHETDPDKQHASAKAVLSGTDIECGDS 323

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                E AV+ GL+ E+ ++++L   +  +  LG  D EPS   +  +    V +  H++
Sbjct: 324 YGSLPE-AVKEGLIDEKQIDISLKRLMKARFELGEMD-EPSQVSWAQIPYSVVDSKEHRE 381

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ A + +VLL+N+   LPL+  ++  VAV+GPN++ +V   GNY G      T L+G
Sbjct: 382 LALRMARESLVLLQNNQSLLPLN--KNLKVAVVGPNANDSVMQWGNYNGFPSHTITLLEG 439

Query: 304 ISRY---AKTIHQAGC 316
           I  Y   ++ I++ GC
Sbjct: 440 IREYLPESQIIYEPGC 455



 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 51/105 (48%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           ++AD  +   G+  ++E E +          DR  + LP  Q  L++ + KA +  +V V
Sbjct: 614 KEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKAGK-KIVFV 672

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G  +  +         AIL   YPGQAGG AIA+VLFG  NP
Sbjct: 673 NFSGSAI--ALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNP 715


>gi|380692997|ref|ZP_09857856.1| beta-glucosidase [Bacteroides faecis MAJ27]
          Length = 837

 Score =  219 bits (557), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 116/286 (40%), Positives = 170/286 (59%), Gaps = 8/286 (2%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+G    ++V+GLQG+    LK  A  KH+ A
Sbjct: 122 LTFWSPTVNMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQGDHPRYLKAVATPKHFAA 181

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R++ +A +++ DL + Y   F+ C+ EGK  S+M +YN +NG P  A+  
Sbjct: 182 NNEEH----NRFYCDAAITETDLREYYFPAFEKCIREGKAESIMTAYNAINGVPCTANNW 237

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPF-LAIH 187
           +L   +   W  +GYIVSDC + G+L     Y +TPE AA  AIKAGLD++CG +  A  
Sbjct: 238 LLNKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVKTPEAAAMIAIKAGLDVECGDYVFANP 297

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              A +  ++   +++ A    +  +MRLGMFD +P   P+ +L P  V    H  LAL+
Sbjct: 298 LLNAYKQYMVSAAEIDSAAYRVLRARMRLGMFD-DPEKNPYNHLSPEIVGCKKHHDLALE 356

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
           AA Q IVLLKN   TLPL+  +  ++AV+G N+       G+Y+G 
Sbjct: 357 AARQSIVLLKNQQNTLPLNAQKIKSIAVVGINA--ANCEFGDYSGT 400



 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 42/105 (40%), Positives = 59/105 (56%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           + G A    R++D  + VMG++QSIE E  DR  + LP  QQ  +    KA+   +V VL
Sbjct: 577 MYGDASKIIRESDVVIAVMGINQSIEREGQDRNSIELPKDQQIFIREAYKANPNTIV-VL 635

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + + +   D  I AI+   YPG+ GG AIA+VLFG  NP 
Sbjct: 636 VAGSSMAIGWM--DQHIPAIIDAWYPGEQGGTAIAEVLFGDYNPA 678


>gi|229580225|ref|YP_002838625.1| glycoside hydrolase family protein [Sulfolobus islandicus
           Y.G.57.14]
 gi|229581131|ref|YP_002839530.1| glycoside hydrolase family protein [Sulfolobus islandicus
           Y.N.15.51]
 gi|228010941|gb|ACP46703.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           Y.G.57.14]
 gi|228011847|gb|ACP47608.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           Y.N.15.51]
          Length = 754

 Score =  219 bits (557), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 149/453 (32%), Positives = 236/453 (52%), Gaps = 54/453 (11%)

Query: 13  SPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL- 71
           SP +++ +DPRWGR +ET GEDP L      +Y+ GLQG+     ++ A  KH+ A+   
Sbjct: 140 SPVLDVCKDPRWGRCEETYGEDPYLVASMGLAYITGLQGDN----QLVATAKHFAAHGFP 195

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
           +    + + H    V  ++L +T+  PF+  V  GKV S+M +Y++++G P   +P +L 
Sbjct: 196 EGGRNIAQVH----VGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLT 251

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTE-- 189
           N +  +W  DG +VSD D +  L           EAA  A+++G+D++  P +  + E  
Sbjct: 252 NILRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEF-PTIDCYGEPL 310

Query: 190 -GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLG--PRDVCTPAHQQLAL 246
             A++ GL+ E  ++ A+   + ++ RLG+ D      PF N    P  +     ++LAL
Sbjct: 311 VNALKEGLVPESLIDRAVERVLRIKDRLGLLDN-----PFVNENSVPEKLDDHKSRELAL 365

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA-----GVACG--YTT 299
           + A + IVLLKN    LPLS    + +AVIGPN++    M+G+Y       +  G    T
Sbjct: 366 KTARESIVLLKNENNILPLSK-NVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVT 424

Query: 300 PLQGISRY---AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG---------LD 347
            LQGI +    +K ++  GC  +A    +    A   ARQAD  + +MG         +D
Sbjct: 425 VLQGIVKKVGESKVLYAKGC-DIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMD 483

Query: 348 QSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAK 397
              E EF           DR+ L LPG Q+EL+  + K  + P++LVL+ G P+ +S   
Sbjct: 484 IPSEEEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSSII 542

Query: 398 NDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           N   + A++   +PG+ GG AIADV+FG  NPG
Sbjct: 543 N--YVKAVIEAWFPGEEGGNAIADVIFGDYNPG 573


>gi|393781366|ref|ZP_10369565.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
           CL02T12C01]
 gi|392676859|gb|EIY70281.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
           CL02T12C01]
          Length = 854

 Score =  219 bits (557), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 116/300 (38%), Positives = 177/300 (59%), Gaps = 9/300 (3%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+GK   S+V+GLQG+    LK+ +  KH+ A
Sbjct: 136 LTFWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVSFVKGLQGDDPRYLKIVSTPKHFAA 195

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R+  N  +S++DL + Y   F+ C++EGK AS+M +YN +N  P   +  
Sbjct: 196 NNEEH----NRFECNPIISEKDLREYYLPAFEKCIIEGKAASIMTAYNAINDVPCTLNNW 251

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH- 187
           +LK  +   W  DGY+VSDC +   L     Y +T E AA  +I+AGLDL+CG  + +  
Sbjct: 252 LLKKVLRHDWGFDGYVVSDCGAPDFLVTHHKYVKTLEAAATLSIQAGLDLECGDNVYMEP 311

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              A +  ++ E +++ A  + +  +MRLG+FD +P+  P+  + P  V    H QLAL+
Sbjct: 312 LLNAYKQYMVTEAEIDSAAYHILRARMRLGLFD-DPNLNPYNKISPSVVGCEKHSQLALE 370

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY-TTPLQGISR 306
           AA Q IVLLKN  + LPL   +  ++AV+G N+       G+Y+G       + L+GI +
Sbjct: 371 AARQSIVLLKNEKKFLPLDLKKIKSIAVVGINA--GNCEFGDYSGTPVNQPVSILEGIKK 428



 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 45/105 (42%), Positives = 63/105 (60%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           L GAA  A R+ D T+ V+G+++SIE E  DR  + LP  QQ  +    K +   VV VL
Sbjct: 591 LYGAAGDAMRKCDLTIAVVGINKSIEREGQDRYSIELPKDQQIFIEEAYKINPNTVV-VL 649

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + +++   D  I AI+   YPG+AGG A+A+VLFG  NPG
Sbjct: 650 VAGSSLAINWM--DEHIPAIVNAWYPGEAGGTAVAEVLFGDYNPG 692


>gi|315607027|ref|ZP_07882031.1| beta-glucosidase [Prevotella buccae ATCC 33574]
 gi|315251081|gb|EFU31066.1| beta-glucosidase [Prevotella buccae ATCC 33574]
          Length = 866

 Score =  218 bits (556), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 121/313 (38%), Positives = 175/313 (55%), Gaps = 24/313 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR----------- 56
           G++ W+PN+NIFRDPRWGRGQET GEDP LT +   + V GLQG    R           
Sbjct: 130 GVSIWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDMRPFTERPRY 189

Query: 57  LKVAACCKHYTAYDLDNWNGVDRYHFNA-RVSKQDLEDTYNVPFKACVVEGKVASVMCSY 115
            K  AC KHY  +    WN   R+ F+  R+ ++DL +TY   FK+ V EG V  VMC+Y
Sbjct: 190 YKTLACAKHYAVHSGPEWN---RHVFDVERLPERDLWETYLPAFKSLVQEGNVREVMCAY 246

Query: 116 NQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHY--TRTPEEAAADAIK 173
            +++G P C +   L   + G+W  +G +VSDC ++   Y   H+    TP EA+A  ++
Sbjct: 247 QRIDGSPCCGNTRYLHQILRGEWGYNGLVVSDCGAISDFYREGHHHVVETPAEASAMGVR 306

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AG D++CG   A     AV  GL+  E ++ ++   +  +  +G FD E    P+   GP
Sbjct: 307 AGTDVECGAVYATLPR-AVEQGLISREAIDTSVVRLLKARFEVGDFDSEKLV-PWKLTGP 364

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLST--LRHHTVAVIGPNSDVTVTMIGNYA 291
             + +  H++LAL  A + + LL+N  R LPLS   LR   +AV+GPN++ +V + GNY 
Sbjct: 365 EVIASETHRRLALDMARESMTLLQNRNRLLPLSKNGLR---IAVMGPNANDSVMLWGNYT 421

Query: 292 GVACGYTTPLQGI 304
           G     TT L+GI
Sbjct: 422 GYPISTTTILKGI 434



 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 40/115 (34%), Positives = 60/115 (52%), Gaps = 15/115 (13%)

Query: 327 IGAAEVAARQADATVLVM--GLDQSIEAEFI----------DRAGLLLPGRQQELVSRVA 374
           I A+E+AA+  DA V+V   G+   +E E +          DR  + LP  Q+E++  + 
Sbjct: 599 ITASEIAAQAGDADVVVFVGGISPRLEGEEMKVDAPGFNGGDRTSIELPEAQREVIRLLR 658

Query: 375 KASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +A  G +V+ + C G   V+         A+L   Y G+AGG A+ADVLFG  NP
Sbjct: 659 QA--GKLVVFVNCSGGA-VALVPEAEACDAVLQAWYAGEAGGQAVADVLFGDYNP 710


>gi|256394020|ref|YP_003115584.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
 gi|256360246|gb|ACU73743.1| glycoside hydrolase family 3 domain protein [Catenulispora
           acidiphila DSM 44928]
          Length = 1212

 Score =  218 bits (556), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 165/470 (35%), Positives = 230/470 (48%), Gaps = 59/470 (12%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN------TGSRLKVAAC 62
           LTYW+P VN+ RDPRWGR  E  GEDP L GK A ++V G QG       T   LKVAA 
Sbjct: 171 LTYWAPTVNMDRDPRWGRTDEAFGEDPYLVGKMAGAFVAGYQGETIDGTPTSPYLKVAAT 230

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
            KH+    L+N N  DR+  +A  S+ D+ D Y   F++ V +  VA +M SYN +NG P
Sbjct: 231 AKHFA---LNN-NENDRHADSADASESDIRDYYTAQFRSLVEDSHVAGLMTSYNAINGTP 286

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVG------------------VLYNTQHYTRT- 163
           + AD           W  DGYI SDC +VG                  V+  T  +T T 
Sbjct: 287 SPADTYTTDALAQRTWGFDGYITSDCGAVGDVTASSSHDWAPPGWTVSVVNGTSTWTNTA 346

Query: 164 -----PEEAA--ADAIKAGLDLDC--GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQM 214
                P +A   A A++AG D +C  G     + E A++  +L E  ++ AL    TV+M
Sbjct: 347 TGVQVPADAGGQAYALRAGTDANCTGGDATLGNIEAAIKAEILSEGVIDHALVQLFTVRM 406

Query: 215 RLGMFDGEPSAQ-PFGNLGPRDVCTPAHQQLALQAAHQGIVLLKN------SARTLPLST 267
           + G FD  P+ +  +  +    + +P HQ LA + A   +VLLKN      +A+ LP + 
Sbjct: 407 QTGEFD--PANKVAYTRITKAQIQSPEHQALAEKVAANSLVLLKNDPMPGSAAKVLPANP 464

Query: 268 LRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRYAK--------TIHQAGCFGV 319
              + V V+G  ++ TVT+ G Y+G        +QGI+   K        T    G    
Sbjct: 465 ASLNNVVVVGDLAN-TVTL-GGYSGDPTLQVNAVQGITSAVKAANPNATVTFDACGTSTT 522

Query: 320 ACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRG 379
           A        A + A + AD  V+ +G D S   E  DRA L +PG    L+S+VA     
Sbjct: 523 ATAAASCSAATQAAIKTADLVVVFVGTDGSTAGESNDRASLAMPGNYDSLISQVAALGNP 582

Query: 380 PVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
             VL +   GPVD+   K D    AI++  Y G++ G A+ADVLFG+ NP
Sbjct: 583 RTVLSMQTDGPVDIENVKGD--FPAIVYSAYNGESQGTALADVLFGKQNP 630


>gi|336275603|ref|XP_003352555.1| hypothetical protein SMAC_01389 [Sordaria macrospora k-hell]
 gi|380094444|emb|CCC07823.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 833

 Score =  218 bits (555), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 156/465 (33%), Positives = 229/465 (49%), Gaps = 73/465 (15%)

Query: 5   GMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           G  GL YW+PN+N ++DPRWGRG ETPGEDP+    Y  + V GL+GN G+  KV A CK
Sbjct: 142 GFGGLDYWTPNINPYKDPRWGRGAETPGEDPLRIKGYVKAMVAGLEGN-GTVRKVIATCK 200

Query: 65  HYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMC----------- 113
           H+ AYDL+ W G+ RY F+A VS QDL + Y  PF+ C  + +V S+MC           
Sbjct: 201 HFAAYDLERWRGLTRYDFDAVVSLQDLSEYYLPPFQQCARDSRVGSIMCRYVSFFLPPFP 260

Query: 114 ---------------------SYNQVNGKPTCADPDILKNTIHGQWRL---DGYIVSDCD 149
                                SYN +NG P CA   ++ N +   W     + YI SDC+
Sbjct: 261 SFPRLVTRQSGNQVDIVDNFRSYNALNGTPACASTYLMTNILRDHWNWTNHNNYITSDCN 320

Query: 150 SV-GVLYNTQHYTRTPEEAAADAIKAGLDLDCG-----PFLAIHTEGAVRGGLLREEDVN 203
           ++   L +  ++++TP EAAA A  AG D  C      P+  +   GA    LL E  ++
Sbjct: 321 AIQDFLPDNHNFSQTPAEAAAAAYIAGTDTVCEVSGWPPYTDV--VGAYNQSLLSESVID 378

Query: 204 LALAYTITVQMRLGMFD-GEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSART 262
            AL       +R G  D G P++            +P     +             S   
Sbjct: 379 TALRRLYEGLIRAGYLDHGRPASS-----------SPDKAPFS-------------SPDF 414

Query: 263 LPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRYAKTIHQA-GCFGVAC 321
           LPL  L   TVA+IG  ++ T T+ G Y+G+   Y  P+  + +   + + A G    + 
Sbjct: 415 LPLD-LTGKTVALIGHWANATRTIRGPYSGLPPFYHNPMYAVRQLKLSFYYANGPVVNST 473

Query: 322 NGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPV 381
           + +    AA +AA  AD  +   G D ++ +E +DR  +  P  Q  L+ ++A+   G  
Sbjct: 474 DADTWTAAAMLAAESADVVLYFGGTDTTVASEDLDRESIAWPKTQLTLIEKLAQV--GKP 531

Query: 382 VLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGR 426
           ++V+  G  VD +   N+  I +ILWVGYPGQ+GG A+ DVL G+
Sbjct: 532 MVVIQLGDQVDDTPLLNNKNISSILWVGYPGQSGGTAVFDVLTGK 576


>gi|365122193|ref|ZP_09339098.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363642907|gb|EHL82241.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 853

 Score =  218 bits (555), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 109/273 (39%), Positives = 165/273 (60%), Gaps = 6/273 (2%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+GK   ++V+GLQGN    LK+ +  KH+ A
Sbjct: 138 LTFWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVAFVKGLQGNDPRYLKIVSTPKHFAA 197

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R+  N  +S+++L + Y   F++C+ EGK  S+M +YN +N  P   +P 
Sbjct: 198 NNEEH----NRFECNPHISERNLREYYLPAFESCIKEGKAQSIMSAYNAINDVPCTLNPW 253

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH- 187
           +L   +  +W  +GY+VSDC   G L     Y +TPE AA  +IKAGLDL+CG  + I  
Sbjct: 254 LLTQVLRKEWGFNGYVVSDCGGPGFLVTHHKYVKTPEAAATLSIKAGLDLECGDNVYIEP 313

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              A +  ++ + D++ A    +  +M LG+FD +P   P+  + P  V    H+QLAL+
Sbjct: 314 LMNAYKQCMVTDADIDTAAYRILRARMMLGLFD-DPEKNPYNAISPSIVGCEKHRQLALE 372

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNS 280
           AA Q +VLLKN    LPL+  +  ++AV+G N+
Sbjct: 373 AARQSLVLLKNEKNFLPLNPKKVKSIAVVGINA 405



 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 45/105 (42%), Positives = 61/105 (58%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           L G A  A R+ D TV V+G+++SIE E  DR  + LP  QQ  +    KA+   VV VL
Sbjct: 593 LYGEAGRAIRECDVTVAVLGINKSIEREGQDRYTIELPADQQLFIKEAYKANPNTVV-VL 651

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + +++   D  I AIL   YPG+ GG A+A+ LFG  NPG
Sbjct: 652 VAGSSLAINWI--DENIPAILNAWYPGEQGGTAVAEALFGDYNPG 694


>gi|402304900|ref|ZP_10823963.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
 gi|400380686|gb|EJP33499.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
          Length = 866

 Score =  218 bits (555), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 121/313 (38%), Positives = 175/313 (55%), Gaps = 24/313 (7%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR----------- 56
           G++ W+PN+NIFRDPRWGRGQET GEDP LT +   + V GLQG    R           
Sbjct: 130 GVSIWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDMRPFTERPRY 189

Query: 57  LKVAACCKHYTAYDLDNWNGVDRYHFNA-RVSKQDLEDTYNVPFKACVVEGKVASVMCSY 115
            K  AC KHY  +    WN   R+ F+  R+ ++DL +TY   FK+ V EG V  VMC+Y
Sbjct: 190 YKTLACAKHYAVHSGPEWN---RHVFDVERLPERDLWETYLPAFKSLVQEGNVREVMCAY 246

Query: 116 NQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHY--TRTPEEAAADAIK 173
            +++G P C +   L   + G+W  +G +VSDC ++   Y   H+    TP EA+A  ++
Sbjct: 247 QRIDGSPCCGNTRYLHQILRGEWGYNGLVVSDCGAISDFYREGHHHVVETPAEASAMGVR 306

Query: 174 AGLDLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGP 233
           AG D++CG   A     AV  GL+  E ++ ++   +  +  +G FD E    P+   GP
Sbjct: 307 AGTDVECGAVYATLPR-AVEQGLISREAIDTSVVRLLKARFEVGDFDSEKLV-PWKLTGP 364

Query: 234 RDVCTPAHQQLALQAAHQGIVLLKNSARTLPLST--LRHHTVAVIGPNSDVTVTMIGNYA 291
             + +  H++LAL  A + + LL+N  R LPLS   LR   +AV+GPN++ +V + GNY 
Sbjct: 365 EVIASETHRRLALDMARESMTLLQNRNRLLPLSKNGLR---IAVMGPNANDSVMLWGNYT 421

Query: 292 GVACGYTTPLQGI 304
           G     TT L+GI
Sbjct: 422 GYPISTTTILKGI 434



 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 40/115 (34%), Positives = 60/115 (52%), Gaps = 15/115 (13%)

Query: 327 IGAAEVAARQADATVLVM--GLDQSIEAEFI----------DRAGLLLPGRQQELVSRVA 374
           I A+E+AA+  DA V+V   G+   +E E +          DR  + LP  Q+E++  + 
Sbjct: 599 ITASEIAAQAGDADVVVFVGGISPRLEGEEMKVDAPGFKGGDRTSIELPEAQREVIRLLR 658

Query: 375 KASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +A  G +V+ + C G   V+         A+L   Y G+AGG A+ADVLFG  NP
Sbjct: 659 QA--GKLVVFVNCSGGA-VALVPEAEACDAVLQAWYAGEAGGQAVADVLFGDYNP 710


>gi|333380551|ref|ZP_08472242.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332826546|gb|EGJ99375.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 854

 Score =  218 bits (554), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 124/327 (37%), Positives = 191/327 (58%), Gaps = 15/327 (4%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+G    ++VRGLQG+    LK+ +  KH+ A
Sbjct: 138 LTFWSPTVNMARDPRWGRTPETYGEDPYLSGILGTAFVRGLQGDDPRYLKIVSTPKHFAA 197

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R+  N ++S++ L + Y   F+ CV +GK AS+M +YN +N  P  A+P 
Sbjct: 198 NNEEH----NRFVCNPQISERQLREYYFPAFEMCVKDGKSASIMSAYNAINDVPCTANPW 253

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH- 187
           +L   +   W  +GY+VSDC    +L +   Y +T E AA  +IKAGLDL+CG  + +  
Sbjct: 254 LLTKVLRHDWGFNGYVVSDCGGPSLLVSAMKYVKTKEAAATLSIKAGLDLECGDDVYMQP 313

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              A    ++   D++ A    +  +M LG+FD +P   P+  + P  V +  H+QLAL+
Sbjct: 314 LLNAYNQYMVSRADIDTAAYRVLRARMHLGLFD-DPDLNPYNKISPSVVGSAEHKQLALE 372

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV-ACGYTTPLQGISR 306
           AA Q IVLLKN+ RTLPL+  +  ++AV+G N+    +  G+Y+G+ A    + LQGI  
Sbjct: 373 AARQSIVLLKNNNRTLPLNPKKVKSIAVVGINAG--NSEFGDYSGIPANAPVSILQGIKD 430

Query: 307 Y----AKTIHQAGCFGVACNGNQLIGA 329
                AK ++    +  A +G ++I A
Sbjct: 431 KVGDNAKILY--APWKSAMDGKEMISA 455



 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 63/105 (60%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           + G A  A R+ +  + V+G++++IE E  DR  + LP  Q+E +  + K +   +V+VL
Sbjct: 593 MYGEAGKAVRECEQVIAVLGINKTIEREGQDRYDIHLPADQEEFIREIYKVNPN-IVVVL 651

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + +++   D  + AI+   YPG+ GG A+A+VLFG  NPG
Sbjct: 652 VAGSSLAINWM--DEHVPAIVNAWYPGEQGGTAVAEVLFGEYNPG 694


>gi|375149998|ref|YP_005012439.1| Beta-glucosidase [Niastella koreensis GR20-10]
 gi|361064044|gb|AEW03036.1| Beta-glucosidase [Niastella koreensis GR20-10]
          Length = 875

 Score =  218 bits (554), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 123/304 (40%), Positives = 169/304 (55%), Gaps = 11/304 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYW+PN+NIFRDPRWGRGQET GEDP LT   A ++VRGLQGN    LK AAC KHY 
Sbjct: 137 GLTYWTPNINIFRDPRWGRGQETYGEDPYLTAALADAFVRGLQGNDPKYLKAAACAKHYA 196

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+  V+  DL DTY   FK  V    VA VMC+YN    +P CA
Sbjct: 197 VH-----SGPEPSRHVFDVDVTPYDLWDTYLPSFKKLVTVSNVAGVMCAYNAFRKQPCCA 251

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
              ++ + +  QW   GY+ SDC ++   Y           A+ADA+  G D+DCG    
Sbjct: 252 SDVLMTDILRNQWSFKGYVTSDCGAIDDFYRNHKTHPDAAAASADAVFHGTDIDCGNEAY 311

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
                AV+   + E+ +++++     ++ RLGMFD  PS   +      ++ + AH + A
Sbjct: 312 RALVQAVKENKITEKQIDISVKRLFMIRFRLGMFD-PPSMVKYAQTPATELESAAHAKHA 370

Query: 246 LQAAHQGIVLLKNSARTLPLST-LRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           L  AH+ IVLLKN+  TLPL   L+   + V+GPN+   +  +GNY+G      T  QGI
Sbjct: 371 LLMAHESIVLLKNANNTLPLKKGLKK--IVVLGPNATNVIAPLGNYSGTPSKLITLFQGI 428

Query: 305 SRYA 308
              A
Sbjct: 429 KEKA 432



 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/104 (37%), Positives = 53/104 (50%), Gaps = 13/104 (12%)

Query: 337 ADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLM 386
           ADA +   G+   +E E +          DR  +LLP  Q EL+ +  +AS  PVV V+M
Sbjct: 607 ADAFIFAGGISPQLEGEEMKVSDPGFKGGDRTTILLPAIQTELM-KALQASGKPVVFVMM 665

Query: 387 CGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            G  +   +   +  I AI+   Y GQA G A+ADVLFG  NP 
Sbjct: 666 TGSALATPWESEN--IPAIVNAWYGGQAAGTALADVLFGDYNPS 707


>gi|365121645|ref|ZP_09338561.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363645135|gb|EHL84409.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 868

 Score =  218 bits (554), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 133/324 (41%), Positives = 188/324 (58%), Gaps = 16/324 (4%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN-TGSRLKV 59
           MYNG   GLT+W+PN+NIFRDPRWGRG ET GEDP LT K   + V+GLQG+ T    K 
Sbjct: 125 MYNG-YKGLTFWTPNINIFRDPRWGRGMETYGEDPFLTTKMGLAVVKGLQGDGTQKYDKA 183

Query: 60  AACCKHYTAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 118
            AC KHY  +    WN   R+ +NA  +S +DL +TY   FKA V EGKV  VMC+YN+ 
Sbjct: 184 HACAKHYAVHSGPEWN---RHSYNAENISIRDLRETYLPAFKALVTEGKVKEVMCAYNRF 240

Query: 119 NGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNT-QHYTR-TPEEAAADAIKAGL 176
            G+P C++  +L N +  +W  D  IVSDC ++   Y   +H T  +  +A+ADA+ +G 
Sbjct: 241 EGEPCCSNKTLLINILKDEWGFDDVIVSDCGAIADFYTKGRHETHASAADASADAVISGT 300

Query: 177 DLDC-GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD 235
           DL+C G + A+  + A+  GL+ E  +N ++   +  +  LGMFD + S   + ++    
Sbjct: 301 DLECGGSYWAL--DEALEKGLITETKINESVFRLLRARFELGMFDDD-SLVSWSSIPYSV 357

Query: 236 VCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC 295
           VC   H+  AL+ A + +VLL N   TLPLS      VAV+GPN++ +V +  NY G   
Sbjct: 358 VCCDKHKAKALEMARKSMVLLSNKNNTLPLSK-SIKKVAVMGPNANDSVMLWANYNGTPD 416

Query: 296 GYTTPLQGISRY---AKTIHQAGC 316
              T L+GI         I++ GC
Sbjct: 417 RSVTILEGIKAKLPEGSVIYEKGC 440



 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 39/105 (37%), Positives = 56/105 (53%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + ADA + V G+  S+E E +          DR  + LP  Q+ ++ +  K +  PV+ V
Sbjct: 600 KDADAIIFVGGISSSLEGEEMGVKYPGFRNGDRTNIDLPQVQKNMM-KALKETGKPVIFV 658

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           L  G  + +S+   D  + AIL   YPGQ GG A+ADVLFG  NP
Sbjct: 659 LCSGSTMALSW--EDKNMDAILQAWYPGQEGGTAVADVLFGDYNP 701


>gi|255572557|ref|XP_002527212.1| beta-glucosidase, putative [Ricinus communis]
 gi|223533388|gb|EEF35138.1| beta-glucosidase, putative [Ricinus communis]
          Length = 349

 Score =  217 bits (553), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 99/202 (49%), Positives = 130/202 (64%), Gaps = 27/202 (13%)

Query: 1   MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKV- 59
           M+N G +GLT+W+PNVNIFRDPRWGRGQETPGEDP+LT  YA  +V+G QG         
Sbjct: 143 MHNVGQSGLTFWAPNVNIFRDPRWGRGQETPGEDPMLTSAYAIEFVKGFQGGNWKSGVSG 202

Query: 60  -------------------------AACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDT 94
                                    +ACCKH TAYDL+ W    RY FNA V++QDLEDT
Sbjct: 203 SGSGRYGFGEKRMLRDDDGDDGLMLSACCKHLTAYDLEKWGNFSRYSFNAVVTEQDLEDT 262

Query: 95  YNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVL 154
           Y  PF++C+ EGK + +MCSYN+VNG P CA  D+L+     +W  +GYIVSDCD+V  +
Sbjct: 263 YQPPFRSCIEEGKASCLMCSYNEVNGVPACAREDLLQKA-REEWGFEGYIVSDCDAVATI 321

Query: 155 YNTQHYTRTPEEAAADAIKAGL 176
           +  Q+Y+++ E+A A A+KAG+
Sbjct: 322 FEYQNYSKSAEDAVAIALKAGM 343


>gi|393781363|ref|ZP_10369562.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
           CL02T12C01]
 gi|392676856|gb|EIY70278.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
           CL02T12C01]
          Length = 863

 Score =  217 bits (552), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 131/318 (41%), Positives = 187/318 (58%), Gaps = 17/318 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT+W+PNVNIFRDPRWGRGQET GEDP LT +   + VRGLQG + S+  K+ AC KHY
Sbjct: 129 GLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGLAVVRGLQGPSESKYDKLHACAKHY 188

Query: 67  TAYDLDNWNGVDRYHFNA-RVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ F+   +S +DL +TY   FKA V +G V  VMC+YN+  G+P C 
Sbjct: 189 ALHSGPEWN---RHSFDVDSISPRDLWETYLPAFKALVQQGGVKEVMCAYNRFEGEPCCG 245

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCG-P 182
              +L N +  +W  DG +VSDC ++   Y   H+   P  E A A A+KAG DLDCG  
Sbjct: 246 SNRLLYNILREEWGFDGLVVSDCGAISDFYLKGHHETHPTKEAAVAAAVKAGTDLDCGVD 305

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
           + A+  + AV  G++ E+ ++++L   +  +  LG+ D E     + ++    V +  H+
Sbjct: 306 YYAL--QKAVEEGIITEKQIDVSLFRLLKARFELGLMD-EEHLVSWSDIPYTVVDSEKHR 362

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRH-HTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
           + AL+ A + + LLKN   TLPLS  +H   +AVIGPN++ +V M GNY G      T L
Sbjct: 363 EKALEMARKSMTLLKNDHGTLPLS--KHCGKIAVIGPNANDSVMMWGNYNGFPSHTVTIL 420

Query: 302 QGISRY---AKTIHQAGC 316
           +GI+      + I+  GC
Sbjct: 421 EGITHKLGAEQIIYDKGC 438



 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 55/171 (32%), Positives = 79/171 (46%), Gaps = 20/171 (11%)

Query: 276 IGPNSDVTVTMIGNYAGVACGYTTPLQGISRYAKTIH--QAGCFGVA---CNGNQLIGAA 330
           I  N +  +   G +   +  YT   Q   +Y   I   QAG  G         +     
Sbjct: 532 IFVNDEKVIDYWGQHGAESRFYTLDAQAGMKYEIRIEYMQAGGEGTLQFDLGYTKRFNPN 591

Query: 331 EVAARQADATVLVM--GLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASR 378
           E+AAR  DA V+V   G+   +E E +          DR  + LP  Q++L+  + K  +
Sbjct: 592 EIAARVGDAEVIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELHKTGK 651

Query: 379 GPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            PV+L+L  G  + +S A+ D    AI+   Y GQAGG A+ADVLFG  NP
Sbjct: 652 -PVILILCSGSAIGLS-AEVD-LADAIIQAWYLGQAGGTAVADVLFGDYNP 699


>gi|21232323|ref|NP_638240.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|21114093|gb|AAM42164.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
          Length = 888

 Score =  217 bits (552), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 122/303 (40%), Positives = 175/303 (57%), Gaps = 13/303 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++RGLQG+     +  A  KH 
Sbjct: 134 AGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLEHPRTIATPKHI 193

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS +D+E TY   F+A +VEG+  SVMC+YN ++G P C
Sbjct: 194 AVH-----SGPEPGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPAC 248

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PF 183
           A   +L   + G W   G++VSDCD+V  +    ++      ++A ++KAG DL+CG  +
Sbjct: 249 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGTAY 308

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A+ T  A+  G + E  ++ +L      + RLG     P    +  LG +D+    ++ 
Sbjct: 309 RALGT--AIERGEVDEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRA 365

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LALQAA + IVLLKN+  TLPL       +AVIGPN+D    +  NY G +    TPL G
Sbjct: 366 LALQAAAESIVLLKNANATLPLKA--GTRLAVIGPNADALAALEANYQGTSSQPVTPLLG 423

Query: 304 ISR 306
           + +
Sbjct: 424 LRQ 426



 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/75 (49%), Positives = 46/75 (61%), Gaps = 3/75 (4%)

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  + LP  QQ L+ R AKAS  P+V+VLM G  V +++AK      AI+   YPGQ+G
Sbjct: 646 DRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKT--HADAIVAAWYPGQSG 702

Query: 416 GAAIADVLFGRANPG 430
           G AIA  L G  NPG
Sbjct: 703 GTAIARALAGDDNPG 717


>gi|94969405|ref|YP_591453.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
 gi|94551455|gb|ABF41379.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
          Length = 902

 Score =  217 bits (552), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 127/309 (41%), Positives = 178/309 (57%), Gaps = 25/309 (8%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPN+NIFRDPRWGRGQET GEDP LTGK   +++ G+QG   +  K  A  KH+ 
Sbjct: 143 GLTFWSPNINIFRDPRWGRGQETYGEDPFLTGKMGIAFIDGVQGPDAAHPKAVATSKHFA 202

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+ +VS +DLE+TY   F+A V +G V SVMC+YN V+G   CA
Sbjct: 203 VH-----SGPESLRHGFDVKVSPRDLEETYLAAFRATVTDGHVKSVMCAYNAVDGMGACA 257

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE--EAAADAIKAGLDLDCGPF 183
           +  +L+  +   W   G++VSDC +  ++  TQ +   P+   AAA ++ AG DL C   
Sbjct: 258 NKMLLEEHLKQAWGFKGFVVSDCGA--IMDVTQGHKNAPDIVHAAAISLAAGTDLSC--- 312

Query: 184 LAIHTEG------AVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
            +I   G      AVR GL+ E+ V  A       +  LGMFD EP + P   +    V 
Sbjct: 313 -SIWEPGFNTLADAVRKGLVTEDMVTRAAERLYAARFELGMFD-EPGSNPNDKIDMSQVA 370

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           +  H+  AL+AA + IVLLKN    LPL   +  T+AVIGP +++  ++ GNY G     
Sbjct: 371 SEEHRAEALKAAEESIVLLKNDG-LLPLKNAK--TIAVIGPTAELLASLEGNYNGQPVRP 427

Query: 298 TTPLQGISR 306
            TPL GI +
Sbjct: 428 VTPLDGIVK 436



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/74 (40%), Positives = 42/74 (56%), Gaps = 3/74 (4%)

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  + LP  Q++L+  +  A + PVV+V + G  V +++A      GAIL   YPG  G
Sbjct: 661 DRTSIDLPATQEKLLEALGAAGK-PVVVVNLSGSAVALNWANQ--HAGAILQAWYPGVEG 717

Query: 416 GAAIADVLFGRANP 429
           G AIA  L G +NP
Sbjct: 718 GTAIAKTLAGESNP 731


>gi|188990656|ref|YP_001902666.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
 gi|167732416|emb|CAP50610.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
          Length = 888

 Score =  217 bits (552), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 122/303 (40%), Positives = 175/303 (57%), Gaps = 13/303 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++RGLQG+     +  A  KH 
Sbjct: 134 AGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLEHPRTIATPKHI 193

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS +D+E TY   F+A +VEG+  SVMC+YN ++G P C
Sbjct: 194 AVH-----SGPEPGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPAC 248

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PF 183
           A   +L   + G W   G++VSDCD+V  +    ++      ++A ++KAG DL+CG  +
Sbjct: 249 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGTAY 308

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A+ T  A+  G + E  ++ +L      + RLG     P    +  LG +D+    ++ 
Sbjct: 309 RALGT--AIERGEVDEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRA 365

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LALQAA + IVLLKN+  TLPL       +AVIGPN+D    +  NY G +    TPL G
Sbjct: 366 LALQAAAESIVLLKNANATLPLKA--GTRLAVIGPNADALAALEANYQGTSSQPVTPLLG 423

Query: 304 ISR 306
           + +
Sbjct: 424 LRQ 426



 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/75 (49%), Positives = 46/75 (61%), Gaps = 3/75 (4%)

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  + LP  QQ L+ R AKAS  P+V+VLM G  V +++AK      AI+   YPGQ+G
Sbjct: 646 DRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKT--HADAIVAAWYPGQSG 702

Query: 416 GAAIADVLFGRANPG 430
           G AIA  L G  NPG
Sbjct: 703 GTAIARALAGDDNPG 717


>gi|66767544|ref|YP_242306.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
 gi|66572876|gb|AAY48286.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
          Length = 888

 Score =  217 bits (552), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 122/303 (40%), Positives = 175/303 (57%), Gaps = 13/303 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++RGLQG+     +  A  KH 
Sbjct: 134 AGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLEHPRTIATPKHI 193

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS +D+E TY   F+A +VEG+  SVMC+YN ++G P C
Sbjct: 194 AVH-----SGPEPGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPAC 248

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PF 183
           A   +L   + G W   G++VSDCD+V  +    ++      ++A ++KAG DL+CG  +
Sbjct: 249 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGTAY 308

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A+ T  A+  G + E  ++ +L      + RLG     P    +  LG +D+    ++ 
Sbjct: 309 RALGT--AIERGEVDEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRA 365

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LALQAA + IVLLKN+  TLPL       +AVIGPN+D    +  NY G +    TPL G
Sbjct: 366 LALQAAAESIVLLKNANATLPLKA--GTRLAVIGPNADALAALEANYQGTSSQPVTPLLG 423

Query: 304 ISR 306
           + +
Sbjct: 424 LRQ 426



 Score = 58.2 bits (139), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 37/75 (49%), Positives = 46/75 (61%), Gaps = 3/75 (4%)

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  + LP  QQ L+ R AKAS  P+V+VLM G  V +++AK      AI+   YPGQ+G
Sbjct: 646 DRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKT--HADAIVAAWYPGQSG 702

Query: 416 GAAIADVLFGRANPG 430
           G AIA  L G  NPG
Sbjct: 703 GTAIARALAGDDNPG 717


>gi|255013451|ref|ZP_05285577.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. 2_1_7]
 gi|410103695|ref|ZP_11298616.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
 gi|409236424|gb|EKN29231.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
          Length = 868

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 122/319 (38%), Positives = 176/319 (55%), Gaps = 17/319 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+W+PN+NIFRDPRWGRG ET GEDP LT K   +  RGLQG+  +  K  AC KHY 
Sbjct: 133 GLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPNYYKTHACAKHYA 192

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +    WN   R+ F+   + +DL +TY   F+A V EG V  VMC+YN+  GKP C+  
Sbjct: 193 VHSGPEWN---RHEFDVEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSD 249

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLY----NTQHYTRTP--EEAAADAIKAGLDLDC- 180
            +L + +   W  D  I+SDC ++   +    NT  +   P  E A+ADA+  G DL+C 
Sbjct: 250 KLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECG 309

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           G + A++   A+  G + E+D++++L   +  +  LGMFD +    P+  +    V +P 
Sbjct: 310 GSYRALNK--ALADGKISEKDLDVSLRRLLKGRFELGMFDPDERV-PYSKIPYSVVESPE 366

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H   AL  A + IVLLKN    LPL       +AV+GPN+  +  +  NY G      T 
Sbjct: 367 HIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYNGFPTKTVTI 425

Query: 301 LQGISRY---AKTIHQAGC 316
           ++GI      A+ I++ GC
Sbjct: 426 VEGIRNKVPNAEVIYELGC 444



 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 56/106 (52%), Gaps = 13/106 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  V V G+   +E E +          DR  + +P  Q+E+V  +  A+  PVV V
Sbjct: 603 KDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV-ATGKPVVYV 661

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +  G  + +++ +ND  + AIL   Y GQ GG A+ADVLFG  NP 
Sbjct: 662 VCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPA 705


>gi|284998833|ref|YP_003420601.1| glycoside hydrolase family protein [Sulfolobus islandicus L.D.8.5]
 gi|284446729|gb|ADB88231.1| glycoside hydrolase, family 3 domain protein [Sulfolobus islandicus
           L.D.8.5]
          Length = 754

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 148/453 (32%), Positives = 235/453 (51%), Gaps = 54/453 (11%)

Query: 13  SPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL- 71
           SP +++ +DPRWGR +ET GEDP L      +Y+ GLQG+     ++ A  KH+ A+   
Sbjct: 140 SPVLDVCKDPRWGRCEETYGEDPYLVASMGLAYITGLQGDN----QLVATAKHFAAHGFP 195

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
           +    + + H    V  ++L +T+  PF+  V  GKV S+M +Y++++G P   +P +L 
Sbjct: 196 EGGRNIAQVH----VGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLT 251

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTE-- 189
           N +  +W  DG +VSD D +  L           EAA  A+++G+D++  P +  + E  
Sbjct: 252 NILRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEF-PTIDCYGEPL 310

Query: 190 -GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLG--PRDVCTPAHQQLAL 246
             A++ GL+ E  ++ A+   + ++ RLG+ D      PF N    P  +     ++LAL
Sbjct: 311 VNALKEGLVPESLIDRAVERVLRIKDRLGLLDN-----PFVNENSVPEKLDDHKSRELAL 365

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA-----GVACG--YTT 299
           + A + IVLLKN    LPLS    + +AVIGPN++    M+G+Y       +  G    T
Sbjct: 366 KTARESIVLLKNENNILPLSK-NVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVT 424

Query: 300 PLQGISRY---AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG---------LD 347
            LQGI +    +K ++  GC  +A    +    A   ARQAD  + +MG         +D
Sbjct: 425 VLQGIVKKVGESKVLYAKGC-DIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMD 483

Query: 348 QSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAK 397
              E EF           DR+ L LPG Q+EL+  + K  + P++LVL+ G P+ +S   
Sbjct: 484 IPSEEEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSSII 542

Query: 398 NDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           N   + A++   +PG+ GG AIADV+FG  NP 
Sbjct: 543 N--YVKAVIEAWFPGEEGGNAIADVIFGDYNPS 573


>gi|423331656|ref|ZP_17309440.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
           CL03T12C09]
 gi|409230226|gb|EKN23094.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
           CL03T12C09]
          Length = 868

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 122/319 (38%), Positives = 176/319 (55%), Gaps = 17/319 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+W+PN+NIFRDPRWGRG ET GEDP LT K   +  RGLQG+  +  K  AC KHY 
Sbjct: 133 GLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPNYYKTHACAKHYA 192

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +    WN   R+ F+   + +DL +TY   F+A V EG V  VMC+YN+  GKP C+  
Sbjct: 193 VHSGPEWN---RHEFDVEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKPCCSSD 249

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLY----NTQHYTRTP--EEAAADAIKAGLDLDC- 180
            +L + +   W  D  I+SDC ++   +    NT  +   P  E A+ADA+  G DL+C 
Sbjct: 250 KLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGTDLECG 309

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           G + A++   A+  G + E+D++++L   +  +  LGMFD +    P+  +    V +P 
Sbjct: 310 GSYRALNK--ALADGKISEKDLDVSLRRLLKGRFELGMFDPDERV-PYSKIPYSVVESPE 366

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H   AL  A + IVLLKN    LPL       +AV+GPN+  +  +  NY G      T 
Sbjct: 367 HIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYNGFPTKTVTI 425

Query: 301 LQGISRY---AKTIHQAGC 316
           ++GI      A+ I++ GC
Sbjct: 426 VEGIRNKVPNAEVIYELGC 444



 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 56/106 (52%), Gaps = 13/106 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  V V G+   +E E +          DR  + +P  Q+E+V  +  A+  PVV V
Sbjct: 603 KDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV-ATGKPVVYV 661

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +  G  + +++ +ND  + AIL   Y GQ GG A+ADVLFG  NP 
Sbjct: 662 VCTGSALALNW-END-HVNAILNAWYGGQEGGTAVADVLFGDYNPA 705


>gi|198425898|ref|XP_002119549.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 754

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 153/447 (34%), Positives = 231/447 (51%), Gaps = 34/447 (7%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G   GL+ +SP +NI R P WGR QET GEDPVLT   A +YV GLQG+    L   A C
Sbjct: 135 GDHTGLSCFSPVINILRHPLWGRNQETYGEDPVLTSLMARAYVTGLQGDE-IYLPATAVC 193

Query: 64  KHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
           KH+ AY         R+ F+A VS  D+  T+   F+ CV  G    VMCSYN +NG P+
Sbjct: 194 KHFVAYGGPENIPTTRFSFSANVSDHDIGTTFYPAFRECVHAG-AQGVMCSYNAINGVPS 252

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPF 183
           CA+P +L+ T+  ++  DGY+VSD +++  +    ++T++  E AA A+ AG+DL+   F
Sbjct: 253 CANP-MLETTLRKKFHFDGYVVSDENALENIDLYFNFTKSKLETAAVALNAGVDLELTGF 311

Query: 184 LAIH----TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
              +       AV  GL+ E  +  +       +M LG FD  P    + N+    V + 
Sbjct: 312 GKTNRYSLLNQAVEQGLVTEAALRRSAKRLFRTRMALGEFD-PPEFNHWLNVPIDVVQSL 370

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY-AGVACGY- 297
           AH++ A++ A +  VLLKN    LPL  L +  V+++GP  + +  + G+Y A     Y 
Sbjct: 371 AHRKQAVEVAAKSFVLLKNDG-ILPLKQL-YDKVSIVGPFINNSEALTGDYPAEFNLKYF 428

Query: 298 TTPL--------QGISRYAKTIHQAGCFGV------ACNGNQLIGAAEVAARQADATVLV 343
           ++PL         G++R+       GC G        C         EV    +D  ++ 
Sbjct: 429 SSPLFAANSLSSSGVARFT-----TGCVGTNNQNLPICATYNSTNVKEVVT-GSDIVLVT 482

Query: 344 MGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIG 403
           +G  + +EAE  DR  + LPG+Q +L+  V K + GPV++VL   GP+DVS+   +    
Sbjct: 483 LGTGRGVEAESNDRRDINLPGKQLQLIQDVVKYANGPVIVVLFNAGPLDVSWVMGN--TA 540

Query: 404 AILWVGYPGQAGGAAIADVLFGRANPG 430
           A++   +  Q  G A+ +VL G  NP 
Sbjct: 541 AVIACHFSAQMTGEAMLEVLTGVVNPA 567


>gi|385774250|ref|YP_005646817.1| glycoside hydrolase family protein [Sulfolobus islandicus HVE10/4]
 gi|323478365|gb|ADX83603.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           HVE10/4]
          Length = 754

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 149/453 (32%), Positives = 235/453 (51%), Gaps = 54/453 (11%)

Query: 13  SPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL- 71
           SP +++ +DPRWGR +ET GEDP L      +Y+ GLQG+     ++ A  KH+ A+   
Sbjct: 140 SPVLDVCKDPRWGRCEETYGEDPYLVASMGLAYITGLQGDN----QLVATAKHFAAHGFP 195

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
           +    + + H    V  ++L +T+  PF+  V  GKV S+M +Y++++G P   +P +L 
Sbjct: 196 EGGRNIAQVH----VGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLT 251

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTE-- 189
           N +  +W  DG +VSD D +  L           EAA  A+++G+D++  P +  ++E  
Sbjct: 252 NILRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEF-PTIDCYSEPL 310

Query: 190 -GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLG--PRDVCTPAHQQLAL 246
             A+  GL+ E  ++ A+   + ++ RLG+ D      PF N    P  +     ++LAL
Sbjct: 311 VNALTEGLVPESLIDRAVERVLRIKDRLGLLDN-----PFVNENSVPEKLDDHKSRELAL 365

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA-----GVACG--YTT 299
           + A + IVLLKN    LPLS    + +AVIGPN++    M+G+Y       +  G    T
Sbjct: 366 KTARESIVLLKNENNILPLSK-NVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVT 424

Query: 300 PLQGISRY---AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG---------LD 347
            LQG+ +    +K ++  GC  +A    +    A   ARQAD  + VMG          D
Sbjct: 425 VLQGVVKKVGESKVLYAKGC-DIASESKEGFAEAIEIARQADVIIAVMGEKSGLPLSWTD 483

Query: 348 QSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAK 397
              E EF           DR+ L LPG Q+EL+  + K  + P++LVL+ G P+ +S   
Sbjct: 484 IPSEEEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSPII 542

Query: 398 NDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           N   + A++   +PG+ GG AIADV+FG  NPG
Sbjct: 543 N--YVKAVIEAWFPGEEGGNAIADVIFGDYNPG 573


>gi|261880245|ref|ZP_06006672.1| beta-glucosidase [Prevotella bergensis DSM 17361]
 gi|270333079|gb|EFA43865.1| beta-glucosidase [Prevotella bergensis DSM 17361]
          Length = 854

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 130/359 (36%), Positives = 197/359 (54%), Gaps = 37/359 (10%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GL++W+PN+NIFRDPRWGRGQET GEDP LT +   + VRGLQG + S+  K+ AC KH+
Sbjct: 130 GLSFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQGPSDSKYRKLLACAKHF 189

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FN   + ++DL +TY   FKA V +G VA VMC+Y +++G+P C 
Sbjct: 190 AVHSGPEWN---RHTFNVEDLPERDLWETYLPAFKALVQQGDVAEVMCAYQRIDGQPCCG 246

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE--EAAADAIKAGLDLDCGPF 183
           +   LK+ +  +W   G +VSDC +V   +   H+  +P+   A+A A+ +G D++CG  
Sbjct: 247 NNRFLKSILRNEWNYQGMVVSDCWAVPDFWKKGHHEVSPDATHASAKAVLSGTDVECGSD 306

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            +   E AVR G+++E DV++++   +  +  LG FD +    P+  +    V + AH+Q
Sbjct: 307 YSNLPE-AVRAGIIKEADVDVSVRRLLEARFALGDFDPDELV-PWTKISESVVASKAHKQ 364

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL  A + +VLL+N+   LPL       + V+G N+  +  M GNY+G      T LQG
Sbjct: 365 LALDMARKSMVLLQNND-ILPLKR-SGQKIVVVGANAIDSTMMWGNYSGYPTQTVTILQG 422

Query: 304 I----------------------SRYAKTIH--QAGCFGVACNGNQLIG--AAEVAARQ 336
           +                      SRY +  H  Q G      N   +IG  AA++  R+
Sbjct: 423 LQTKSDQVTFIPGCGLTRNELRESRYNEISHNGQPGMQATYWNNENMIGTPAAQMVHRE 481



 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 53/103 (51%), Gaps = 13/103 (12%)

Query: 337 ADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLM 386
           AD  + V G+   +E E +          DR  + LP  Q+E++  +++A R   ++ + 
Sbjct: 599 ADVVIFVGGISPRLEGEEMEVSDPGFKGGDRTTIELPQAQREVIKALSEAGRR--IVFVN 656

Query: 387 CGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           C G   ++      R+ AIL   YPG+ GG A+ADVLFG  NP
Sbjct: 657 CSGSA-IALTPESQRVDAILQAWYPGEQGGTAVADVLFGDYNP 698


>gi|270296098|ref|ZP_06202298.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
 gi|270273502|gb|EFA19364.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
          Length = 798

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 147/445 (33%), Positives = 224/445 (50%), Gaps = 42/445 (9%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL 71
           +SP ++I +DPRWGR  ET GEDP   G+     +  LQ N     K+ +  KH+  Y +
Sbjct: 206 YSPILDIAQDPRWGRCVETYGEDPYHAGQMGKQMILSLQKN-----KLVSTPKHFAVYSI 260

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
                  +   +  V+ +++   Y  PF+    E     VM SYN  +G+P       L 
Sbjct: 261 PVGGRDGKTRTDPHVAPREMRTLYLDPFRVAFHEAGALGVMSSYNDYDGEPITGSYHFLT 320

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC-----GPFLAI 186
             +  +W   GY+VSD ++V  +          E+A A A+ AGL++         F+ +
Sbjct: 321 EILRQEWGFKGYVVSDSEAVEFISTKHQVANGYEDAVAQAVNAGLNIRTHFTPPADFI-L 379

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPAHQQ 243
               AV+ G + +E +N  +A  + V+  LG+FD      P+     R    V +P HQQ
Sbjct: 380 PLRSAVKKGKISQETLNQRVAEILRVKFWLGLFDN-----PYRGDEKRAGQIVHSPEHQQ 434

Query: 244 LALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
           LAL+AA Q +VLLKN  +TLPLS ++R  +VAVIGPN+D    +I  Y       TT  +
Sbjct: 435 LALEAARQSLVLLKNEHQTLPLSKSIR--SVAVIGPNADERQQLICRYGPANAHITTIYE 492

Query: 303 GISRY---AKTIHQAGC--------------FGVACNGNQLIGAAEVAARQADATVLVMG 345
           GI +    A  +++ GC              F  A    Q++  A  AA+ A+ TV+V+G
Sbjct: 493 GIKKMLPQADVVYKKGCDIIDPHFPESEVLEFPKAAQEAQMMEEAIEAAKGAEVTVMVLG 552

Query: 346 LDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAI 405
            ++    E   R  L LPGRQ+EL+ ++ +  + PVVLV++ G    ++FA     + AI
Sbjct: 553 GNELTVREDRSRTSLDLPGRQKELLKKICQLGK-PVVLVMIDGRASSINFAAT--HVPAI 609

Query: 406 LWVGYPGQAGGAAIADVLFGRANPG 430
           +   +PG+ GG AIA+ LFG  NPG
Sbjct: 610 IHAWFPGEFGGQAIAEALFGDYNPG 634


>gi|285018984|ref|YP_003376695.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
 gi|283474202|emb|CBA16703.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
           PC73]
          Length = 904

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 125/305 (40%), Positives = 173/305 (56%), Gaps = 10/305 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++ GLQG+  +  +  A  KH  
Sbjct: 151 GLTIWSPNINIFRDPRWGRGMETYGEDPYLTGQLAVGFIHGLQGDDPTHPRTIATPKHLA 210

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +         R+ F+  VS  D E TY+  F+A +VEG   SVMC+YN ++G P CA  
Sbjct: 211 VHSGPE---SGRHGFDVDVSPHDFEATYSPAFRAAIVEGHAGSVMCAYNALHGIPACAAD 267

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            ++   + G W   G++VSDCD++  +    +Y      +AA A+KAG DL+CG +    
Sbjct: 268 WLIDGRVRGNWGFKGFVVSDCDAIDDMTQFHYYRADNAGSAAAALKAGHDLNCG-YAYRD 326

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              A+  G   E  ++ +L      + RLG      S  P+  LG +D+ +P H+ LALQ
Sbjct: 327 LGTALDRGEAEEAMLDRSLVRLFAARYRLGELQPR-SKDPYARLGAKDIDSPTHRALALQ 385

Query: 248 AAHQGIVLLKNSARTLPLST-LRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI-S 305
           AA Q +VLL+N   TLPL   LR   +AVIGPN+D    +  NY G +    TPLQG+ +
Sbjct: 386 AAQQSLVLLQNRNDTLPLRPGLR---LAVIGPNADALAALEANYQGTSVAPVTPLQGLRA 442

Query: 306 RYAKT 310
           R+  T
Sbjct: 443 RFGTT 447



 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/75 (50%), Positives = 46/75 (61%), Gaps = 3/75 (4%)

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  L LP  QQ L+ R AKAS  P+++VLM G  V +++AK      AIL   YPGQ+G
Sbjct: 662 DRNDLSLPAAQQALLER-AKASGKPLIVVLMSGSAVALNWAKQ--HADAILAAWYPGQSG 718

Query: 416 GAAIADVLFGRANPG 430
           G AIA  L G  NPG
Sbjct: 719 GTAIAQALAGDINPG 733


>gi|332881172|ref|ZP_08448831.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           329 str. F0087]
 gi|357047867|ref|ZP_09109460.1| glycosyl hydrolase family 3 protein [Paraprevotella clara YIT
           11840]
 gi|332680886|gb|EGJ53824.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           329 str. F0087]
 gi|355529206|gb|EHG98645.1| glycosyl hydrolase family 3 protein [Paraprevotella clara YIT
           11840]
          Length = 851

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 122/305 (40%), Positives = 177/305 (58%), Gaps = 13/305 (4%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+G    ++V+GLQG+    LK  +  KH+  
Sbjct: 137 LTFWSPTVNMARDPRWGRTPETYGEDPYLSGVLGVAFVKGLQGDHPRYLKTVSTPKHFAV 196

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R   NA+VS++DL + Y   F+ C+ EGK  S+M +YN VN  P   +  
Sbjct: 197 NNEEH----NRSSCNAKVSERDLREYYLPSFERCITEGKAQSIMMAYNAVNDVPCTVNTY 252

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHT 188
           ++KN + G W  +GYIVSDC +   +    HY +T E AA  A+K GLDL+CG    ++ 
Sbjct: 253 LIKNVLRGDWGFNGYIVSDCSAPEWMITKHHYVKTREAAATLAVKVGLDLECGN--QVYG 310

Query: 189 EG---AVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           EG   A R  ++ E D++ A    +  +M LG+FD  PS  P+  + P  V   AHQ LA
Sbjct: 311 EGLLKAYRQYMVSEADIDSAAYRILRGRMMLGLFDA-PSQNPYNQIEPSVVGCKAHQDLA 369

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY-TTPLQGI 304
           L+AA Q +VLLKN    LPL+  +  ++AV+G ++       G+Y+G       T L GI
Sbjct: 370 LEAARQSMVLLKNKDNFLPLNPKKVKSIAVVGISAG--HCEFGDYSGTPKNEPVTILDGI 427

Query: 305 SRYAK 309
            +YA+
Sbjct: 428 KQYAE 432



 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 61/104 (58%), Gaps = 3/104 (2%)

Query: 327 IGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLM 386
            G A   A + D TV V+G+++SIE E  DR  L LP  QQE +  + K +   VV VL+
Sbjct: 593 FGDAGKVAAECDVTVAVLGINKSIEREGQDRFSLELPVDQQEFIKELYKVNPNTVV-VLV 651

Query: 387 CGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            G  + V++   D  + AIL   YPG+ GG A+A+VLFG  NPG
Sbjct: 652 AGSSMAVNWM--DENVPAILNAWYPGEQGGNAVAEVLFGDYNPG 693


>gi|423303655|ref|ZP_17281654.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
           CL03T00C23]
 gi|423307623|ref|ZP_17285613.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
           CL03T12C37]
 gi|392688019|gb|EIY81310.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
           CL03T00C23]
 gi|392689492|gb|EIY82769.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
           CL03T12C37]
          Length = 801

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 147/445 (33%), Positives = 224/445 (50%), Gaps = 42/445 (9%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL 71
           +SP ++I +DPRWGR  ET GEDP   G+     +  LQ N     K+ +  KH+  Y +
Sbjct: 209 YSPILDIAQDPRWGRCVETYGEDPYHAGQMGKQMILSLQKN-----KLVSTPKHFAVYSI 263

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
                  +   +  V+ +++   Y  PF+    E     VM SYN  +G+P       L 
Sbjct: 264 PVGGRDGKTRTDPHVAPREMRTLYLDPFRVAFHEAGALGVMSSYNDYDGEPITGSYHFLT 323

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC-----GPFLAI 186
             +  +W   GY+VSD ++V  +          E+A A A+ AGL++         F+ +
Sbjct: 324 EILRQEWGFKGYVVSDSEAVEFISTKHQVANGYEDAVAQAVNAGLNIRTHFTPPADFI-L 382

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPAHQQ 243
               AV+ G + +E +N  +A  + V+  LG+FD      P+     R    V +P HQQ
Sbjct: 383 PLRSAVKKGKISQETLNQRVAEILRVKFWLGLFDN-----PYRGDEKRAGQIVHSPEHQQ 437

Query: 244 LALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
           LAL+AA Q +VLLKN  +TLPLS ++R  +VAVIGPN+D    +I  Y       TT  +
Sbjct: 438 LALEAARQSLVLLKNEHQTLPLSKSIR--SVAVIGPNADERQQLICRYGPANAHITTIYE 495

Query: 303 GISRY---AKTIHQAGC--------------FGVACNGNQLIGAAEVAARQADATVLVMG 345
           GI +    A  +++ GC              F  A    Q++  A  AA+ A+ TV+V+G
Sbjct: 496 GIKKMLPQADVVYKKGCDIIDPHFPESEVLEFPKAAQEAQMMEEAIEAAKGAEVTVMVLG 555

Query: 346 LDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAI 405
            ++    E   R  L LPGRQ+EL+ ++ +  + PVVLV++ G    ++FA     + AI
Sbjct: 556 GNELTVREDRSRTSLDLPGRQEELLKKICQLGK-PVVLVMIDGRASSINFAAT--HVPAI 612

Query: 406 LWVGYPGQAGGAAIADVLFGRANPG 430
           +   +PG+ GG AIA+ LFG  NPG
Sbjct: 613 IHAWFPGEFGGQAIAEALFGDYNPG 637


>gi|332184951|ref|ZP_08386700.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
           sp. S17]
 gi|332014675|gb|EGI56731.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
           sp. S17]
          Length = 886

 Score =  215 bits (548), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 122/292 (41%), Positives = 166/292 (56%), Gaps = 18/292 (6%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL  WSPN+NIFRDPRWGRGQET GEDP LT +   ++VRG+QG   +R+ V A  KHY 
Sbjct: 129 GLDSWSPNINIFRDPRWGRGQETYGEDPFLTARMGVAFVRGMQGTDPNRIDVVATPKHYA 188

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +         R+  N  VS+ DL DTY   F+A +VEG   S+MC+YN+++G+P CA  
Sbjct: 189 VHSGPEST---RHEANVYVSQHDLVDTYLPAFRAAIVEGGAGSIMCAYNRIDGQPACASD 245

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +LK+ + G+W   GY+VSDCD+V  +    HY   P  A A A++AG+D +C       
Sbjct: 246 LLLKDYLRGRWGFRGYVVSDCDAVKDIDANHHYAPDPATAVAAAMRAGVDNECNNATLTD 305

Query: 188 TEG-------AVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFG--NLGPRDVCT 238
           T+G       A+  GL+   D++ +L      + R G     P  +P    +  P D+ T
Sbjct: 306 TDGLTNPYRDALARGLISVADIDRSLTRLFAARYRTGDL---PGVRPLSMQSSSPADIGT 362

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY 290
           PAHQ LAL+ A + +VLLKN    LPL       +AVIGP  D T  + GNY
Sbjct: 363 PAHQALALETAEKSLVLLKNKG-ILPLKA--QARIAVIGPLGDATRVLRGNY 411



 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 48/113 (42%), Positives = 69/113 (61%), Gaps = 15/113 (13%)

Query: 329 AAEV--AARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKA 376
           AAE+  AA+ +DA V V+GL   +EAE            D+  L LP  QQ L+ + AKA
Sbjct: 605 AAELRTAAKDSDALVAVVGLTSDLEAEEAPISVPGFKGGDKTTLDLPADQQALLEQ-AKA 663

Query: 377 SRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +  P+++VLM G PV++++AK +    AI+   YPGQAGG A+ +VL G+ NP
Sbjct: 664 TGKPLIVVLMNGSPVNLAWAKAN--ADAIVEAWYPGQAGGLAVGNVLSGKTNP 714


>gi|227831319|ref|YP_002833099.1| glycoside hydrolase family protein [Sulfolobus islandicus L.S.2.15]
 gi|227457767|gb|ACP36454.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           L.S.2.15]
          Length = 754

 Score =  215 bits (547), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 147/453 (32%), Positives = 236/453 (52%), Gaps = 54/453 (11%)

Query: 13  SPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL- 71
           SP +++ +DPRWGR +ET GEDP L      +Y+ GLQG+     ++ A  KH+ A+   
Sbjct: 140 SPVLDVCKDPRWGRCEETYGEDPYLVASMGLAYITGLQGDN----QLVATAKHFAAHGFP 195

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
           +    + + H    V  ++L +T+  PF+  V  GKV S+M +Y++++G P   +P +L 
Sbjct: 196 EGGRNIAQVH----VGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLT 251

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTE-- 189
           N +  +W  DG +VSD D +  L           EAA  A+++G+D++  P +  + E  
Sbjct: 252 NILRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEF-PTIDCYGEPL 310

Query: 190 -GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLG--PRDVCTPAHQQLAL 246
             A++ GL+ E  ++ A+   + ++ RLG+ D      PF N    P  +     ++LAL
Sbjct: 311 VNALKEGLVPESLIDRAVERVLRIKDRLGLLDN-----PFVNENSVPEKLDDHKSRELAL 365

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA-----GVACG--YTT 299
           + A + IVLLKN    LPLS    + +AVIGPN++    M+G+Y       +  G    T
Sbjct: 366 KTARESIVLLKNENNILPLSK-NVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVT 424

Query: 300 PLQGISRY---AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG---------LD 347
            LQGI +    +K ++  GC  +A    +    A   ARQAD  + +MG         +D
Sbjct: 425 VLQGIVKKVGESKVLYAKGC-DIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMD 483

Query: 348 ----------QSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAK 397
                     Q++  E  DR+ L LPG Q+EL+  + K  + P++LVL+ G P+ +S   
Sbjct: 484 IPSKEEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSSII 542

Query: 398 NDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           N   + A++   +PG+ GG AIADV+FG  NP 
Sbjct: 543 N--YVKAVIEAWFPGEEGGNAIADVIFGDYNPS 573


>gi|336404627|ref|ZP_08585320.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
 gi|335941531|gb|EGN03384.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
          Length = 861

 Score =  215 bits (547), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 119/316 (37%), Positives = 182/316 (57%), Gaps = 16/316 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT+W+PNVNIFRDPRWGRGQET GEDP LTG+   + VRGLQG   +   K+ AC KH+
Sbjct: 131 GLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDAGYDKLHACAKHF 190

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ F+A  ++ +DL +TY   FK  V +  V  VMC+YN+  G+P C 
Sbjct: 191 AVHSGPEWN---RHSFDAENIAPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCG 247

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +  +W   G +VSDC ++   Y    +   P  E A+A A++ G DL+CG  
Sbjct: 248 SNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPDKEHASAAAVRTGTDLECGSE 307

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A   + AV+ GL+ E++++++L   +T +  LG  D +P+   +  +    + +  HQ 
Sbjct: 308 YASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WAEIPTSVLNSKEHQA 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ A + +VLL+N    LPL+T  +  +AV+GPN++ +V   GNY G+     T L+ 
Sbjct: 364 LALRMARESLVLLQNKNNILPLNT--NLKIAVMGPNANDSVMQWGNYNGIPAHTVTLLEA 421

Query: 304 ISRY---AKTIHQAGC 316
           +       + I++ GC
Sbjct: 422 VRAKLPEGQIIYEPGC 437



 Score = 40.4 bits (93), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 13/104 (12%)

Query: 337 ADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLM 386
           AD  +   G+  S+E E +          DR  + LP  Q++L+  + KA +  VV +  
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK-VVFINY 656

Query: 387 CGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            G  +            AIL   YPGQAGG AI D L+G  NPG
Sbjct: 657 SGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|423223593|ref|ZP_17210062.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638218|gb|EIY32065.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 863

 Score =  215 bits (547), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 126/317 (39%), Positives = 173/317 (54%), Gaps = 14/317 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHY 66
           GLT W+P VNI+RDPRWGRG ET GEDP LT +     V+GLQG   G   K+ AC KH+
Sbjct: 128 GLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPADGKYDKLHACAKHF 187

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  +  +DL +TY  PF+A V EGKV  VMC+YN+  G P C 
Sbjct: 188 AVHSGPEWN---RHSFNAENIKPRDLYETYLPPFEALVKEGKVEEVMCAYNRFEGDPCCG 244

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   + G+W  DG +VSDC ++   YN + +   P  E A+A A+ +G DL+CG  
Sbjct: 245 SDRLLMQILRGEWGFDGIVVSDCGAIADFYNDRGHHTHPDAESASAAAVISGTDLECGSS 304

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                E +V+ GL+ EE V+ ++   +  +  LG  D EP    +  +    V + AH  
Sbjct: 305 YKALIE-SVKKGLISEETVDTSVKRLMKARFALGEMD-EPEKVSWTKIPFSVVASAAHDS 362

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL  A + + LL N    LPL      TVAV+GPN++ +V   GNY G+     T L G
Sbjct: 363 LALNMARESMTLLMNKDNFLPLKR-GGLTVAVMGPNANDSVMQWGNYNGMPAHTVTILDG 421

Query: 304 ISRYA----KTIHQAGC 316
           +        K I++ GC
Sbjct: 422 VRNLLGTDDKLIYEQGC 438



 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  +   G+  S+E E +          DR  + LP  Q+EL+  + +A +  ++LV
Sbjct: 598 KDADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQRELIDALHRAGK-KIILV 656

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G P+ +       +  AIL   YPGQ GG A+A+VLFG  NP
Sbjct: 657 NCSGSPIGLE--PETQKCEAILQAWYPGQQGGKAVAEVLFGDYNP 699


>gi|329963634|ref|ZP_08301109.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
 gi|328528179|gb|EGF55158.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
          Length = 863

 Score =  214 bits (546), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 125/317 (39%), Positives = 175/317 (55%), Gaps = 14/317 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAACCKHY 66
           GLT W+P VNI+RDPRWGRG ET GEDP LT +     V+GLQG N G   K+ AC KH+
Sbjct: 128 GLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVEVVKGLQGGNDGRYDKLHACAKHF 187

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  +  +DL +TY  PF+A V EGKV  VMC+YN+  G P C 
Sbjct: 188 AVHSGPEWN---RHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCG 244

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +  +W  DG ++SDC ++   YN + +   P  E A+A A+ +G DL+CG  
Sbjct: 245 SNRLLMQILRDEWGFDGIVLSDCGAIADFYNDRGHKTHPDAESASAAAVLSGTDLECGSS 304

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                E AV+ G + E+ V++A+   +T +  LG  D EP    +  +    V +  H  
Sbjct: 305 YKALVE-AVKLGKIEEKAVDVAVKRLLTARFALGEMD-EPDKVSWTKIPFSVVASAKHDS 362

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL  A + + LL N    LPL      TVAV+GPN++ +V   GNY G+     T L+G
Sbjct: 363 LALDIARKSMTLLMNKNDILPLKQ-GGLTVAVMGPNANDSVMQWGNYNGMPSHTVTILEG 421

Query: 304 ISRYA----KTIHQAGC 316
           + +      K I++ GC
Sbjct: 422 VRKALGADDKLIYEQGC 438



 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 33/99 (33%), Positives = 52/99 (52%), Gaps = 13/99 (13%)

Query: 341 VLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGP 390
           + V G+  S+E E +          DR  + LP  Q+E+++ + ++ +  V+ V   G P
Sbjct: 604 IFVGGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQREMIAALHRSGK-KVIFVNCSGSP 662

Query: 391 VDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           + +          AIL   YPGQAGG A+A+VLFG+ NP
Sbjct: 663 IGLE--PETKECEAILQAWYPGQAGGTAVAEVLFGKYNP 699


>gi|217967241|ref|YP_002352747.1| glycoside hydrolase family 3 [Dictyoglomus turgidum DSM 6724]
 gi|217336340|gb|ACK42133.1| glycoside hydrolase family 3 domain protein [Dictyoglomus turgidum
           DSM 6724]
          Length = 762

 Score =  214 bits (546), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 152/448 (33%), Positives = 234/448 (52%), Gaps = 44/448 (9%)

Query: 13  SPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDLD 72
           SP ++I RDPRWGR +ET GEDP L  + AA YV+GLQG    R  + A  KH+TAY + 
Sbjct: 146 SPVLDIPRDPRWGRTEETFGEDPYLVSRMAAEYVKGLQGEDW-REGIIATVKHFTAYGIS 204

Query: 73  NWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILKN 132
                 R    A+V +++L + +  PF+  + EG+  S+M +Y++++G P  +   +L  
Sbjct: 205 EGA---RNLGPAKVGERELREVFLFPFEVAIKEGQAGSLMNAYHEIDGVPCASSKFLLTK 261

Query: 133 TIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTE--- 189
            +  +W   GY+VSD  ++ +L N     +  +EAA  A++AG+D++  P +  + E   
Sbjct: 262 ILRWEWGFKGYVVSDYIAIRMLENFHRVAKDAKEAAVLALEAGIDIEL-PSVDCYGEPLI 320

Query: 190 GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAA 249
            AV+ GL+ EE +N ++   +  +  LG+FDG+    P       D   P  ++L+ + A
Sbjct: 321 QAVKEGLISEEVINASVERVLRAKFMLGLFDGDLEKDPKKVYDIFD--KPEFRELSREVA 378

Query: 250 HQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA--------------- 294
            + IVLLKN    LPLS     TVAVIGPN+D    + G+Y+  A               
Sbjct: 379 RRSIVLLKNDG-ILPLSK-NIRTVAVIGPNADNPRNLHGDYSYTAHIPSVSETLEGVKIP 436

Query: 295 --CGYTTP--LQGI----SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG- 345
             C   T   L+GI    S   + ++  GC  ++ +      A E+A R AD  + VMG 
Sbjct: 437 EECAVRTVSILEGIKNKVSAETQVLYAKGCEILSDSKEGFDEAIEIAKR-ADVIIAVMGE 495

Query: 346 ----LDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPR 401
                 + I  E  DR  L L G Q++L+  + K  + P+VLVL+ G P  + +   +  
Sbjct: 496 ESGLFHRGISGEGNDRTTLELFGIQRDLLRELHKLGK-PIVLVLVNGRPQALKWEHEN-- 552

Query: 402 IGAILWVGYPGQAGGAAIADVLFGRANP 429
           + AIL   YPG+ GG A+ADV+FG  NP
Sbjct: 553 LNAILEAWYPGEEGGDAVADVIFGDYNP 580


>gi|224536538|ref|ZP_03677077.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521794|gb|EEF90899.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 863

 Score =  214 bits (546), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 126/317 (39%), Positives = 173/317 (54%), Gaps = 14/317 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHY 66
           GLT W+P VNI+RDPRWGRG ET GEDP LT +     V+GLQG   G   K+ AC KH+
Sbjct: 128 GLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPADGKYDKLHACAKHF 187

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  +  +DL +TY  PF+A V EGKV  VMC+YN+  G P C 
Sbjct: 188 AVHSGPEWN---RHSFNAENIKPRDLYETYLPPFEALVKEGKVEEVMCAYNRFEGDPCCG 244

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   + G+W  DG +VSDC ++   YN + +   P  E A+A A+ +G DL+CG  
Sbjct: 245 SDRLLMQILRGEWGFDGIVVSDCGAIADFYNDRGHHTHPDAESASAAAVISGTDLECGSS 304

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                E +V+ GL+ EE V+ ++   +  +  LG  D EP    +  +    V + AH  
Sbjct: 305 YKALIE-SVKKGLISEETVDTSVKRLMKARFALGEMD-EPEKVSWTKIPFSVVASAAHDS 362

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL  A + + LL N    LPL      TVAV+GPN++ +V   GNY G+     T L G
Sbjct: 363 LALNMARESMTLLMNKDNFLPLKR-GGLTVAVMGPNANDSVMQWGNYNGMPAHTVTILDG 421

Query: 304 ISRYA----KTIHQAGC 316
           +        K I++ GC
Sbjct: 422 VRNLLGTDDKLIYEQGC 438



 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 52/105 (49%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  +   G+  S+E E +          DR  + LP  Q+EL+  + +A +  ++LV
Sbjct: 598 KDADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQRELIDALHRAGK-KIILV 656

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G P+         +  AIL   YPGQ GG A+A+VLFG  NP
Sbjct: 657 NCSGSPI--GLEPETQKCEAILQAWYPGQQGGKAVAEVLFGDYNP 699


>gi|325105296|ref|YP_004274950.1| beta-glucosidase [Pedobacter saltans DSM 12145]
 gi|324974144|gb|ADY53128.1| Beta-glucosidase [Pedobacter saltans DSM 12145]
          Length = 884

 Score =  214 bits (545), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 125/319 (39%), Positives = 177/319 (55%), Gaps = 13/319 (4%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G   GL++W+PN+NIFRDPRWGRGQET GEDP LT +   + VRGLQG+     K  AC 
Sbjct: 133 GRYYGLSFWTPNINIFRDPRWGRGQETYGEDPYLTARLGVAAVRGLQGDDPKYFKTHACA 192

Query: 64  KHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
           KH+  +    WN   R+ ++A  S +DL +TY   FKA V E  V  VMC+YN   G+P 
Sbjct: 193 KHFAVHSGPEWN---RHSYDATASGRDLWETYLPAFKALVKEANVQEVMCAYNAYEGQPC 249

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYT--RTPEEAAADAIKAGLDLDCG 181
           C    +L + +  +W   G +VSDC ++   +   H+   +    AAADA+    DL+CG
Sbjct: 250 CGSDRLLTDILRNRWEYKGIVVSDCWAIDDFFRKGHHETHKDAAAAAADAVIHSTDLECG 309

Query: 182 PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQ-PFGNLGPRDVCTPA 240
                  E AVR GL+ ++ ++++L   +     LGM D  P+ + P+  L  + V +  
Sbjct: 310 SAYTNLLE-AVRQGLISQQQIDISLRRVLRGWFELGMLD--PAERLPWSQLPYQIVASKE 366

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H Q AL+ A + + LLKN+   LPLS      +AVIGPN+  +V + GNY G      T 
Sbjct: 367 HVQQALKVARESMTLLKNNGSILPLSK-SIKKIAVIGPNAADSVMLWGNYNGTPNSTVTI 425

Query: 301 LQGISR---YAKTIHQAGC 316
           LQGI     +A+ I+  GC
Sbjct: 426 LQGIKNKLPHAEIIYDKGC 444



 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 54/105 (51%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           ++ DA V   GL   +E E +          D+  + LP  Q+EL+S + K++  PVV V
Sbjct: 606 KEVDAIVYAGGLSPQLEGEEMPVNADGFRGGDKISIDLPKIQRELLSSL-KSTGKPVVFV 664

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           L  G    ++  +++    A+L   Y GQ  G A+ADVLFG  NP
Sbjct: 665 LCTGS--SLALEQDEKNYNALLCAWYGGQEAGTAVADVLFGDYNP 707


>gi|385776908|ref|YP_005649476.1| glycoside hydrolase family protein [Sulfolobus islandicus REY15A]
 gi|323475656|gb|ADX86262.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           REY15A]
          Length = 754

 Score =  214 bits (545), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 148/453 (32%), Positives = 234/453 (51%), Gaps = 54/453 (11%)

Query: 13  SPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL- 71
           SP +++ +DPRWGR +ET GEDP L      +Y+ GLQG+     ++ A  KH+ A+   
Sbjct: 140 SPVLDVCKDPRWGRCEETYGEDPYLVASMGLAYITGLQGDN----QLVATAKHFAAHGFP 195

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
           +    + + H    V  ++L +T+  PF+  V  GKV S+M +Y++++G P   +P +L 
Sbjct: 196 EGGRNIAQVH----VGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLT 251

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTE-- 189
           N +  +W  DG +VSD D +  L           EAA  A+++G+D++  P +  ++E  
Sbjct: 252 NILRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEF-PTIDCYSEPL 310

Query: 190 -GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLG--PRDVCTPAHQQLAL 246
             A+  GL+ E  ++ A+   + ++ RLG+ D      PF N    P  +     ++LAL
Sbjct: 311 VNALTEGLVPESLIDRAVERVLRIKDRLGLLDN-----PFVNENSVPEKLDDHKSRELAL 365

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA-----GVACG--YTT 299
           + A + IVLLKN    LPLS    + +AVIGPN++    M+G+Y       +  G    T
Sbjct: 366 KTARESIVLLKNENNILPLSK-NVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVT 424

Query: 300 PLQGISRY---AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG---------LD 347
            LQG+ +    +K ++  GC  +A    +    A   ARQAD  + VMG          D
Sbjct: 425 VLQGVVKKVGESKVLYAKGC-DIASESKEGFAEAIEIARQADVIIAVMGEKSGLPLSWTD 483

Query: 348 QSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAK 397
              E EF           DR+ L LPG Q+EL+  + K  + P++LVL+ G P+ +S   
Sbjct: 484 IPSEEEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSPII 542

Query: 398 NDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           N   + A++   +PG+ GG AIADV+FG  NP 
Sbjct: 543 N--YVKAVIEAWFPGEEGGNAIADVIFGDYNPS 573


>gi|293370605|ref|ZP_06617157.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292634339|gb|EFF52876.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 861

 Score =  214 bits (545), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 123/316 (38%), Positives = 180/316 (56%), Gaps = 16/316 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAACCKHY 66
           GLT+W+PNVNIFRDPRWGRGQET GEDP LTG+   + VRGLQG       K+ AC KH+
Sbjct: 131 GLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDAGYDKLHACAKHF 190

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ F+A  +  +DL +TY   FK  V +  V  VMC+YN+  G+P C 
Sbjct: 191 AVHSGPEWN---RHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCG 247

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +  +W   G +VSDC ++   Y    +   P  E A+A A++ G DL+CG  
Sbjct: 248 SNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAAAVRTGTDLECGSE 307

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A   + AV+ GL+ E++++++L   +T +  LG  D E SA  +  +    + +  HQ 
Sbjct: 308 YASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD-EQSA--WSEIPTSVLNSKEHQA 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ A + +VLL+N    LPL+T  H  VAV+GPN++ +V   GNY G+     T L+ 
Sbjct: 364 LALRMARESLVLLQNKNNILPLNT--HLKVAVMGPNANDSVMQWGNYNGIPAHTVTLLEA 421

Query: 304 ISRY---AKTIHQAGC 316
           +       + I++ GC
Sbjct: 422 VRAKLPEGQIIYEPGC 437



 Score = 40.4 bits (93), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 13/104 (12%)

Query: 337 ADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLM 386
           AD  +   G+  S+E E +          DR  + LP  Q++L+  + KA +  VV +  
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK-VVFINY 656

Query: 387 CGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            G  +            AIL   YPGQAGG AI D L+G  NPG
Sbjct: 657 SGSAI--GLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|294675412|ref|YP_003576028.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
 gi|294472176|gb|ADE81565.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
          Length = 875

 Score =  214 bits (544), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 125/319 (39%), Positives = 179/319 (56%), Gaps = 21/319 (6%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ--GNTGSRL------KVA 60
           L++W+PN+NIFRDPRWGRGQET GEDP LT K   + VRGLQ  G  G  L      K+ 
Sbjct: 131 LSFWTPNINIFRDPRWGRGQETYGEDPYLTAKMGLAVVRGLQGVGYNGEDLGVSKYRKLL 190

Query: 61  ACCKHYTAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           AC KH+  +    WN   R+ FN   + ++DL +TY   FKA V EGKVA VMC+Y +++
Sbjct: 191 ACAKHFAVHSGPEWN---RHEFNIENLPERDLWETYLPAFKALVQEGKVAEVMCAYQRID 247

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSV-GVLYNTQHYTRTPEEAAADAIKAGLDL 178
           G+  CA     +  +  +W  DG I SDC ++   L    + ++   EA+A A+ AG D+
Sbjct: 248 GQACCAQTRYEQQILRDEWGFDGLITSDCGAIRDFLPRWHNVSKDGAEASAKAVLAGTDV 307

Query: 179 DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
           +CG     H   AVR G ++E D++ +L   +  +  LG  D +     +  +    V +
Sbjct: 308 ECGSEYK-HLPEAVRRGDVKEADIDRSLRRLLIARFELGDMDSD-DLNAWTKIPETVVAS 365

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRH------HTVAVIGPNSDVTVTMIGNYAG 292
            AH+ LAL+ A + IVLL+N  + LPL    +        + V+GPN++ +V M GNYAG
Sbjct: 366 QAHKDLALKMALKSIVLLQNKIKVLPLGNPLNAGAGSDKDIVVMGPNANDSVMMWGNYAG 425

Query: 293 VACGYTTPLQGISRYAKTI 311
                 T L GI+R AKT+
Sbjct: 426 YPTHTVTALDGITRMAKTL 444



 Score = 39.7 bits (91), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 31/99 (31%), Positives = 48/99 (48%), Gaps = 13/99 (13%)

Query: 341 VLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGP 390
           + V G+  ++E E +          DR  + LP  Q++L++ + KA  G  V+ + C G 
Sbjct: 624 IFVGGISPNLEGEEMRVNEPGFKGGDRTSIELPQAQRDLLAVLHKA--GKKVIFVNCSGS 681

Query: 391 VDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
             ++ A       AIL   Y G+ GGAA+A  LFG   P
Sbjct: 682 A-MALAPELETCDAILQWWYGGEQGGAALATTLFGMVAP 719


>gi|365121873|ref|ZP_09338785.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363644185|gb|EHL83481.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 850

 Score =  214 bits (544), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 107/273 (39%), Positives = 166/273 (60%), Gaps = 6/273 (2%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+GK   ++V+GLQGN    LKV +  KH+ A
Sbjct: 135 LTFWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVAFVKGLQGNDPRYLKVVSTPKHFAA 194

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R+  N ++S++DL + Y   F+ C+++GK  S+M +YN +N  P   +  
Sbjct: 195 NNEEH----NRFECNPQISERDLREYYLPAFERCIIDGKAQSIMTAYNAINDVPCTLNTW 250

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH- 187
           +LK  +   W  +GY+VSDC +  +L     Y +TPE AA  A+KAGLDL+CG  + I  
Sbjct: 251 LLKKVLRTDWGFNGYVVSDCGAPSLLVTHHKYVKTPEAAATLALKAGLDLECGDNVYIEP 310

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              A +  ++ E +++ A    +  +M LG+FD +P+  P+  L P  V    H+ +AL+
Sbjct: 311 LMNAYKQYMVSEAEIDTAAYRILRARMMLGLFD-DPAKNPYNALSPSIVGCEKHKNMALE 369

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNS 280
           AA Q +VLLKN    LP++  +  ++AV+G N+
Sbjct: 370 AARQSLVLLKNENNFLPINPKKIKSIAVVGINA 402



 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 41/105 (39%), Positives = 60/105 (57%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           L G A+ A ++ D T+ VMG+++SIE E  DR  + LP + QEL    A      + +VL
Sbjct: 590 LYGDAKKAIQECDMTIAVMGINKSIEREGRDRDHIELP-KDQELFIEEAYKLNPKMAVVL 648

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + V++   D  + AIL   YPG+ GG A+A+ LFG  NP 
Sbjct: 649 VAGSSLAVNWM--DEHVPAILNAWYPGEQGGTAVAEALFGDYNPA 691


>gi|365121914|ref|ZP_09338824.1| hypothetical protein HMPREF1033_02170 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363643627|gb|EHL82934.1| hypothetical protein HMPREF1033_02170 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 1073

 Score =  214 bits (544), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 120/301 (39%), Positives = 175/301 (58%), Gaps = 13/301 (4%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP +N+ RDPRWGR  ET GEDP LTG    ++VRGLQGN    +KV +  KH+ A
Sbjct: 137 LTFWSPTINMARDPRWGRTPETYGEDPFLTGTLGTAFVRGLQGNDPKYIKVVSTPKHFAA 196

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R   NA +S++DL + Y   F+ C+ EG+  SVM +YN VNG P   +  
Sbjct: 197 NNEEH----NRASGNAVISERDLREYYFPAFEKCIKEGQAQSVMSAYNAVNGIPCTLNKW 252

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH- 187
           +L + +   W  DGY+VSDC +   + +  HY  T EEAA+  IKAGLDL+CG  + I  
Sbjct: 253 LLTDVLRDDWGFDGYVVSDCSAPEYIVSQHHYVDTYEEAASLCIKAGLDLECGDNVYITP 312

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              A   G++   +++ A    +  +MRLG+FD +P+  P+  + P  V    H++LAL+
Sbjct: 313 LLNAYNRGMVTMSEIDSAAYRVLRGRMRLGLFD-DPNENPYNKISPSIVGCEKHRELALE 371

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP---LQGI 304
           AA Q +VLLKN    LP+ T    ++AV+G N+       G+Y+G      TP   L+GI
Sbjct: 372 AARQSLVLLKNDKDMLPIQTDNIKSIAVVGINA--ANCEFGDYSGTPVN--TPISVLEGI 427

Query: 305 S 305
            
Sbjct: 428 K 428



 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 41/103 (39%), Positives = 60/103 (58%), Gaps = 3/103 (2%)

Query: 328 GAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMC 387
           G A    R +D T+ V+G+D++IE E  DR+ + LP  QQ  +    KA+   VV VL+ 
Sbjct: 734 GDAGEIIRGSDLTIAVLGIDRTIEREGQDRSTIELPEDQQIFIEEAYKANPNTVV-VLVA 792

Query: 388 GGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           G  + +++   D  I A+L   YPG+ GG A+A+ LFG  NPG
Sbjct: 793 GSSLAINWI--DQNIPAVLDAWYPGEQGGTAVAEALFGDYNPG 833


>gi|340347926|ref|ZP_08671025.1| beta-glucosidase [Prevotella dentalis DSM 3688]
 gi|433652779|ref|YP_007296633.1| beta-glucosidase-like glycosyl hydrolase [Prevotella dentalis DSM
           3688]
 gi|339608382|gb|EGQ13286.1| beta-glucosidase [Prevotella dentalis DSM 3688]
 gi|433303312|gb|AGB29127.1| beta-glucosidase-like glycosyl hydrolase [Prevotella dentalis DSM
           3688]
          Length = 849

 Score =  214 bits (544), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 124/318 (38%), Positives = 189/318 (59%), Gaps = 20/318 (6%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAACCKHY 66
           GL++W+PN+NIFRDPRWGRGQET GEDP LTG+   + VRGLQG +T S  K  AC KH+
Sbjct: 123 GLSFWTPNINIFRDPRWGRGQETYGEDPYLTGRMGLAVVRGLQGPDTASYYKTLACAKHF 182

Query: 67  TAYDLDNWNGVD--RYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
             +     +G +  R+  N   +S +DL +TY   FK+ V +G+VA VMC+Y + +G+P 
Sbjct: 183 AVH-----SGPESLRHEMNIENLSPRDLWETYLPAFKSLVTKGQVAEVMCAYQRFDGQPC 237

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHY--TRTPEEAAADAIKAGLDLDCG 181
           C +  +L++ +  +W   G +VSDC ++   +    +   +   EA+A A +AG D++CG
Sbjct: 238 CGNSRLLQHILRDEWNFKGLVVSDCGAISDFWIPGRHGVAKDAVEASAQAQRAGTDVECG 297

Query: 182 PFLAIHT-EGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQ-PFGNLGPRDVCTP 239
                H+ E AV+ G+L+E D++ ++   +  ++ LG  D  PSA  P+  +    V  P
Sbjct: 298 S--NYHSLEEAVKAGVLKESDIDRSVIRVLEARLALG--DISPSADVPWKTIPYAVVDCP 353

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           AH+QLAL+ A + +VLL+N   TLPLST  H  + V+G N+  ++ +  NY G      T
Sbjct: 354 AHRQLALRMARESMVLLQNKHHTLPLST--HDKILVVGENAVDSMMLWANYNGTPSHTMT 411

Query: 300 PLQGISRYAKTIH-QAGC 316
            L GI + A+ +    GC
Sbjct: 412 ILDGIRQLAEHVEFMPGC 429



 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 42/106 (39%), Positives = 56/106 (52%), Gaps = 13/106 (12%)

Query: 334 ARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVL 383
           ARQADA V V GL   +E E +          DR  + LP  Q+E +  ++ A R P+V+
Sbjct: 590 ARQADAVVFVGGLSPRLEGEEMKVDLPGFSGGDRTSIELPVAQREAIQALSVA-RKPIVM 648

Query: 384 VLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           V  C G   ++         AIL   YPG+ GG A+ADVLFG+ NP
Sbjct: 649 V-NCSGSA-IALEPETKNCDAILQAWYPGEEGGRAVADVLFGKVNP 692


>gi|313205375|ref|YP_004044032.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312444691|gb|ADQ81047.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 858

 Score =  213 bits (543), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 121/316 (38%), Positives = 177/316 (56%), Gaps = 17/316 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLTYW+PNVNIFRDPRWGRGQET GEDP LT     + V+GLQG   +   K+ AC KH+
Sbjct: 129 GLTYWTPNVNIFRDPRWGRGQETYGEDPYLTSLMGVAVVKGLQGPDNAEYDKLHACAKHF 188

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  ++ +DL +TY   FKA V +  V  VMC+YN+   +P C 
Sbjct: 189 AVHSGPEWN---RHSFNAENINPRDLWETYLPAFKALVQKADVKEVMCAYNRFEDEPCCG 245

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE--EAAADAIKAGLDLDCGPF 183
              +L   +   W+ DG +VSDC ++   Y    +   P+   AAA+A+  G DL+CG  
Sbjct: 246 SNRLLTQILRNDWKFDGLVVSDCWAISDFYKPNAHATQPDATHAAANAVLNGTDLECGSD 305

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                E AV+ GL+ E+ ++++L   +  +  L    GE ++     +    V +  HQ 
Sbjct: 306 FRNLPE-AVKAGLIEEKRIDVSLKRLLKARFEL----GEMNSDQVWPISYSVVNSEKHQN 360

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ A + IVLL+N+   LPLS  +   +AV+GPN++ +V   GNY G      T L+ 
Sbjct: 361 LALRMAEESIVLLQNNNNILPLS--KKLKIAVMGPNANDSVMQWGNYNGFPAHTVTLLEA 418

Query: 304 ISRY---AKTIHQAGC 316
           + +    A+ I++ GC
Sbjct: 419 MRKSFPGAQLIYEPGC 434



 Score = 47.4 bits (111), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 36/114 (31%), Positives = 50/114 (43%), Gaps = 13/114 (11%)

Query: 327 IGAAEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKA 376
           + A+    + AD  V   G+  S+E E +          DR  + LP  Q+ L+  +  A
Sbjct: 585 LSASIAKVKDADVVVFAGGIAPSLEGEEMRVTVPGFKGGDRTDIELPAIQRRLLQALKDA 644

Query: 377 SRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            +  VV V   G  +            AIL   YPGQAGG A+A+VL G  NP 
Sbjct: 645 GK-KVVFVNFSGSAM--GLVPETQSCEAILQAWYPGQAGGTAVANVLLGNYNPS 695


>gi|284174578|ref|ZP_06388547.1| Beta-xylosidase [Sulfolobus solfataricus 98/2]
 gi|356934752|gb|AET42953.1| beta-xylosidase-like protein [Sulfolobus solfataricus 98/2]
          Length = 754

 Score =  213 bits (543), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 150/454 (33%), Positives = 235/454 (51%), Gaps = 58/454 (12%)

Query: 13  SPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL- 71
           SP +++ RDPRWGR +ET GEDP L      +Y+ GLQG T    ++ A  KH+ A+   
Sbjct: 140 SPVLDVCRDPRWGRCEETYGEDPYLVASMGLAYITGLQGET----QLVATAKHFAAHGFP 195

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
           +    + + H    V  ++L +T+  PF+  V  GKV S+M +Y++++G P   +P +L 
Sbjct: 196 EGGRNIAQVH----VGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGVPCHGNPQLLT 251

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTE-- 189
           N +  +W  DG +VSD D +  L           EAA  A+++G+D++  P +  + E  
Sbjct: 252 NILRQEWGFDGIVVSDYDGIRQLEAIHKVASNKMEAAILALESGVDIEF-PTIDCYGEPL 310

Query: 190 -GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLG--PRDVCTPAHQQLAL 246
             A++ GL+ E  ++ A+   + ++ RLG+ D      PF +    P  +     ++LAL
Sbjct: 311 VTAIKEGLVSEAIIDRAVERVLRIKERLGLLD-----NPFVDESAVPERLDDRKSRELAL 365

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY---------AGVACGY 297
           +AA + IVLLKN    LPLS    + +AVIGPN++    M+G+Y         +G+    
Sbjct: 366 KAARESIVLLKNENNMLPLSK-NINKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEI-- 422

Query: 298 TTPLQGISRYA---KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG--------- 345
            T LQGI++     K ++  GC  +A    +    A   A+QAD  + VMG         
Sbjct: 423 VTVLQGIAKKVGEGKVLYAKGC-DIAGESKEGFSEAIEIAKQADVIIAVMGEKSGLPLSW 481

Query: 346 LDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSF 395
            D   E EF           DRA L L G Q+EL+  + K  + P++LVL+ G P+ +S 
Sbjct: 482 TDIPSEEEFKKYQAVTGEGNDRASLRLLGVQEELLKELYKTGK-PIILVLINGRPLVLSP 540

Query: 396 AKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
             N   + AI+   +PG+ GG AIAD++FG  NP
Sbjct: 541 IIN--YVKAIIEAWFPGEEGGNAIADIIFGDYNP 572


>gi|395492941|ref|ZP_10424520.1| glycoside hydrolase family protein [Sphingomonas sp. PAMC 26617]
          Length = 865

 Score =  213 bits (543), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 126/302 (41%), Positives = 180/302 (59%), Gaps = 13/302 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPN+NIFRDPRWGRGQET GEDP LTG  A  +V G+QG   + LK  A  KH+ 
Sbjct: 119 GLTFWSPNINIFRDPRWGRGQETLGEDPYLTGTMAVPFVHGVQGTDANYLKAIATPKHFA 178

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ FN   S +DL +TY   F+  +V+G+  S+MC+YN V+ K  CA
Sbjct: 179 VH-----SGPEQLRHQFNVDPSPRDLSETYLPAFRRAIVDGRAESLMCAYNAVDTKAACA 233

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PFL 184
           +  +LK+T+ G W   G++ SDC ++  +    H + T  E AA A+KAG D  C     
Sbjct: 234 NTMLLKDTLRGAWGFKGFVTSDCGAIDDITTGHHNSPTNPEGAALAVKAGTDTGCDFKDE 293

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQ-PFGNLGPRDVCTPAHQQ 243
            +    AV+ G L E D+++AL    T +M+LGMFD  P+A+ PF  +   +  +PAH+ 
Sbjct: 294 MLDLPRAVKAGYLTEGDMDVALRRLFTARMKLGMFD--PAARVPFSTISIAENHSPAHRA 351

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+AA + IVLLKN    LPL+      +AV+GP +   + + GNY G   G   P+ G
Sbjct: 352 LALRAARESIVLLKNDG-VLPLAAGARR-IAVVGPTAASLIALEGNYNGTPVGAVLPVDG 409

Query: 304 IS 305
           ++
Sbjct: 410 MT 411



 Score = 47.4 bits (111), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 42/74 (56%), Gaps = 3/74 (4%)

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  + LP  Q +L+  +  A+  P+V+VL  G  +  +    + +  A+L   YPG+AG
Sbjct: 623 DRTAIALPAAQSQLLDALF-ATGKPLVIVLQSGSAI--ALGAQEAKARAVLEAWYPGEAG 679

Query: 416 GAAIADVLFGRANP 429
           G AIA+VL G  NP
Sbjct: 680 GQAIAEVLSGTVNP 693


>gi|224535242|ref|ZP_03675781.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224523140|gb|EEF92245.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 864

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 126/306 (41%), Positives = 179/306 (58%), Gaps = 12/306 (3%)

Query: 5   GMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN-TGSRLKVAACC 63
           G  GLT+W+PN+NI+RDPRWGRG ET GEDP LT     + V+GLQG  TG   K  AC 
Sbjct: 126 GYKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGGGTGKYDKAHACA 185

Query: 64  KHYTAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           KHY  +    WN   R+ F+A+ +S++DL +TY   FK  V EGKV  VMC+YN+  G+P
Sbjct: 186 KHYAVHSGPEWN---RHSFDAKNISQRDLWETYLSAFKTLVKEGKVKEVMCAYNRFEGEP 242

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDC 180
            C++  +L   +   W  D  +VSDC ++G  Y   H+   P    A+ADA+ +G DL+C
Sbjct: 243 CCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVVSGTDLEC 302

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           G   +   E AVR GL+ EE +N ++   +  + +LGMFD + +   +  +    V +  
Sbjct: 303 GGSYSSLNE-AVRKGLISEEKINESVFRLLRARFQLGMFDDD-ALVSWSEIPYSVVESKE 360

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           H   AL+ A + +VLL N   TLPLS ++R   VAV+GPN++ +V +  NY G      T
Sbjct: 361 HVTKALEMARKSMVLLTNKNHTLPLSKSIRK--VAVLGPNANDSVMLWANYNGFPTKSVT 418

Query: 300 PLQGIS 305
            L+GI 
Sbjct: 419 ILEGIK 424



 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 43/115 (37%), Positives = 60/115 (52%), Gaps = 15/115 (13%)

Query: 327 IGAAEVA--ARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVA 374
           I   EVA  A +ADA + V GL  ++E E +          DR  + LP  Q E++  + 
Sbjct: 588 INYKEVADKAAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDLPHVQAEMLKALK 647

Query: 375 KASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           K  + PV+ VL  G  + + +   +  + AIL   YPGQ GG A+ADVLFG  NP
Sbjct: 648 KTGK-PVIFVLCSGSTLALPWEAEN--LDAILEAWYPGQQGGTAVADVLFGDYNP 699


>gi|393782348|ref|ZP_10370533.1| hypothetical protein HMPREF1071_01401 [Bacteroides salyersiae
           CL02T12C01]
 gi|392673619|gb|EIY67078.1| hypothetical protein HMPREF1071_01401 [Bacteroides salyersiae
           CL02T12C01]
          Length = 852

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 120/308 (38%), Positives = 175/308 (56%), Gaps = 19/308 (6%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP LTG     +V+GLQG+    LK  +  KH+ A
Sbjct: 138 LTFWSPTVNMARDPRWGRTPETYGEDPFLTGVIGCEFVKGLQGDHPRYLKTVSTPKHFAA 197

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R   NAR+S++DL + Y   F+ C+V+ K  S+M +YN VNG P   +  
Sbjct: 198 NNEEH----NRSSCNARMSERDLREFYLPSFERCIVDAKAQSIMMAYNAVNGVPCTVNTY 253

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG------P 182
           ++KN + G W  +GYIVSDC +   +     Y R  + AA  AIKAGLDL+CG      P
Sbjct: 254 LIKNVLRGDWGFNGYIVSDCSAPEWMVTKHKYVRDLDAAATLAIKAGLDLECGDRVYTAP 313

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
            L  + E  V      + D++ A    +  +M LG+FD +PS  P+  + P  +    HQ
Sbjct: 314 LLKAYNESMV-----SKADIDSAAYRVLRGRMLLGLFD-DPSQNPYNQIEPSVIGCKKHQ 367

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY-TTPL 301
           +LAL+ A Q +VLLKN    LPL+  +  ++AV+G N+       G+Y+G+      + L
Sbjct: 368 ELALETARQSMVLLKNQKNFLPLNLKKVKSIAVVGINA--GHCEFGDYSGIPKNAPVSVL 425

Query: 302 QGISRYAK 309
            GI +YA+
Sbjct: 426 DGIRKYAE 433



 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           L G A  AA++ D TV V+G+++SIE E  DR  L LP  QQE +  + K +   VV VL
Sbjct: 593 LFGDAGKAAKECDVTVAVLGINKSIEREGQDRYSLELPTDQQEFIRELYKVNPNTVV-VL 651

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + +++   D  + AIL   YPG+ GG AIA+VLFG  NPG
Sbjct: 652 VAGSSLAINWI--DENVPAILNAWYPGEQGGTAIAEVLFGDYNPG 694


>gi|15899739|ref|NP_344344.1| Beta-xylosidase [Sulfolobus solfataricus P2]
 gi|13816430|gb|AAK43134.1| Beta-xylosidase [Sulfolobus solfataricus P2]
          Length = 754

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 150/454 (33%), Positives = 235/454 (51%), Gaps = 58/454 (12%)

Query: 13  SPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL- 71
           SP +++ RDPRWGR +ET GEDP L      +Y+ GLQG T    ++ A  KH+ A+   
Sbjct: 140 SPVLDVCRDPRWGRCEETYGEDPYLVASMGLAYITGLQGET----QLVATAKHFAAHGFP 195

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
           +    + + H    V  ++L +T+  PF+  V  GKV S+M +Y++++G P   +P +L 
Sbjct: 196 EGGRNIAQVH----VGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGVPCHGNPQLLT 251

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTE-- 189
           N +  +W  DG +VSD D +  L           EAA  A+++G+D++  P +  + E  
Sbjct: 252 NILRQEWGFDGIVVSDYDGIRQLEAIHKVASNKMEAAILALESGVDIEF-PTIDCYGEPL 310

Query: 190 -GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLG--PRDVCTPAHQQLAL 246
             A++ GL+ E  ++ A+   + ++ RLG+ D      PF +    P  +     ++LAL
Sbjct: 311 VTAIKEGLVSEAIIDRAVERVLRIKERLGLLD-----NPFVDESAVPERLDDRKSRELAL 365

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY---------AGVACGY 297
           +AA + IVLLKN    LPLS    + +AVIGPN++    M+G+Y         +G+    
Sbjct: 366 KAARESIVLLKNENNMLPLSK-NINKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEI-- 422

Query: 298 TTPLQGISRYA---KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG--------- 345
            T LQGI++     K ++  GC  +A    +    A   A+QAD  + VMG         
Sbjct: 423 VTVLQGIAKKVGEGKVLYAKGC-DIAGESKEGFSEAIEIAKQADVIIAVMGEKSGLPLSW 481

Query: 346 LDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSF 395
            D   E EF           DRA L L G Q+EL+  + K  + P++LVL+ G P+ +S 
Sbjct: 482 TDIPSEEEFKKYQAVTGEGNDRASLRLLGVQEELLKELYKTGK-PIILVLINGRPLVLSP 540

Query: 396 AKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
             N   + AI+   +PG+ GG AIAD++FG  NP
Sbjct: 541 IIN--YVKAIIEAWFPGEEGGNAIADIIFGDYNP 572


>gi|423227459|ref|ZP_17213920.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392623089|gb|EIY17195.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 864

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 126/306 (41%), Positives = 179/306 (58%), Gaps = 12/306 (3%)

Query: 5   GMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN-TGSRLKVAACC 63
           G  GLT+W+PN+NI+RDPRWGRG ET GEDP LT     + V+GLQG  TG   K  AC 
Sbjct: 126 GYKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGGGTGKYDKAHACA 185

Query: 64  KHYTAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           KHY  +    WN   R+ F+A+ +S++DL +TY   FK  V EGKV  VMC+YN+  G+P
Sbjct: 186 KHYAVHSGPEWN---RHSFDAKNISQRDLWETYLPAFKTLVKEGKVKEVMCAYNRFEGEP 242

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDC 180
            C++  +L   +   W  D  +VSDC ++G  Y   H+   P    A+ADA+ +G DL+C
Sbjct: 243 CCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVVSGTDLEC 302

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           G   +   E AVR GL+ EE +N ++   +  + +LGMFD + +   +  +    V +  
Sbjct: 303 GGSYSSLNE-AVRKGLISEEKINESVFRLLRARFQLGMFDDD-ALVSWSEIPYSVVESKE 360

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           H   AL+ A + +VLL N   TLPLS ++R   VAV+GPN++ +V +  NY G      T
Sbjct: 361 HVAKALEMARKSMVLLTNKNHTLPLSKSIRK--VAVLGPNANDSVMLWANYNGFPTKSVT 418

Query: 300 PLQGIS 305
            L+GI 
Sbjct: 419 ILEGIK 424



 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 43/115 (37%), Positives = 60/115 (52%), Gaps = 15/115 (13%)

Query: 327 IGAAEVA--ARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVA 374
           I   EVA  A +ADA + V GL  ++E E +          DR  + LP  Q E++  + 
Sbjct: 588 INYKEVADKAAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDLPHVQAEMLKALK 647

Query: 375 KASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           K  + PV+ VL  G  + + +   +  + AIL   YPGQ GG A+ADVLFG  NP
Sbjct: 648 KTGK-PVIFVLCSGSTLALPWEAEN--LDAILEAWYPGQQGGTAVADVLFGDYNP 699


>gi|373461705|ref|ZP_09553443.1| hypothetical protein HMPREF9944_01707 [Prevotella maculosa OT 289]
 gi|371951597|gb|EHO69442.1| hypothetical protein HMPREF9944_01707 [Prevotella maculosa OT 289]
          Length = 662

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 123/318 (38%), Positives = 189/318 (59%), Gaps = 20/318 (6%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAACCKHY 66
           GL++W+PN+NIFRDPRWGRGQET GEDP LTG+   + VRGLQG +T    K  AC KH+
Sbjct: 114 GLSFWTPNINIFRDPRWGRGQETYGEDPYLTGRMGLAVVRGLQGPDTACYYKTLACAKHF 173

Query: 67  TAYDLDNWNGVD--RYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
             +     +G +  R+  +   +S +DL +TY   FK+ V +G+VA VMC+Y + +G+P 
Sbjct: 174 AVH-----SGPESLRHEMDIENLSPRDLWETYLPAFKSLVTKGQVAEVMCAYQRFDGQPC 228

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHY--TRTPEEAAADAIKAGLDLDCG 181
           C +  +L++ +  +W   G +VSDC ++   +    +   +   EA+A A  AG D++CG
Sbjct: 229 CGNSRLLQHILRDEWNFKGLVVSDCGAISDFWIPGRHGVAKDAVEASAQAQSAGTDVECG 288

Query: 182 PFLAIHT-EGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSA-QPFGNLGPRDVCTP 239
                H+ E AV+ G+L+E D++ ++   +  ++ LG  D  PSA  P+  +    V  P
Sbjct: 289 S--NYHSLEEAVKAGVLKESDIDRSVIRVLEARLALG--DISPSAVVPWKTIPYAVVDCP 344

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
           AH+QLAL+ AH+ +VLL+N   TLPLST  H  + V+G N+  ++ +  NY G+     T
Sbjct: 345 AHRQLALRMAHESMVLLQNRHHTLPLST--HDKILVVGENAVDSMMLWANYNGIPSHTVT 402

Query: 300 PLQGISRYAKTIH-QAGC 316
            L GI + A+ +    GC
Sbjct: 403 VLDGIRQLAEHVEFMPGC 420


>gi|323451833|gb|EGB07709.1| hypothetical protein AURANDRAFT_64764 [Aureococcus anophagefferens]
          Length = 819

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 181/460 (39%), Positives = 234/460 (50%), Gaps = 52/460 (11%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT--------GSRLKV 59
           G+T ++PNVN+ RDPRWGR +E  GEDP LT + A   V GLQGN         G  L  
Sbjct: 187 GITLYAPNVNLVRDPRWGRAEEVYGEDPHLTAELAVGMVTGLQGNAEGSTSGPGGGPLVT 246

Query: 60  AACCKHYTAYDLDNWNG---VDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYN 116
            ACCKH+ A+     N     DR   +A VS +DL +TY    KACVV  K         
Sbjct: 247 GACCKHFAAHFAVYQNEDLPADRMVLDANVSSRDLWETYLPVMKACVVRAKA-------T 299

Query: 117 QVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGL 176
            VNGKPTCA P++L + +   W  DG++VSD D+   L  T  Y  T EEAAA  I AG+
Sbjct: 300 HVNGKPTCAHPELLNDVLRESWGFDGFVVSDYDAWSNLVTTHKYVSTWEEAAAAGINAGM 359

Query: 177 DLDCG-----PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNL 231
           D + G     P  A+    AVR G +    V  +    + V++RLGMFD   S   +G  
Sbjct: 360 DQEGGFGDYSPVDAL--PDAVRNGTVAAATVRRSFERLMRVRLRLGMFDPPASTAVYGEA 417

Query: 232 GPRDV-C-TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIG- 288
              D  C T A   LA +AA +GIVL KN+   LPL+  +   +A++GP  D    ++G 
Sbjct: 418 YQCDYQCETAAKLALAREAAREGIVLFKNAGGALPLA--KGARIALVGPQVDDWRVLLGA 475

Query: 289 -NYA---GVACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVM 344
            NYA   G      T  +G+   A     AGC  VAC     +  A+  A  ADATV+V+
Sbjct: 476 VNYAFEDGPDVAPVTIQKGLEAVANVSVAAGCDSVACAALVDVDGAKRLAAAADATVVVL 535

Query: 345 G---------------LDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGG 389
           G                D   E+E  DRA + LPG Q  LV+ +  AS   +V VL+ GG
Sbjct: 536 GDSFGATDGWPLCRGTRDDGCESESHDRATIELPGEQVALVAALRAASS-RLVCVLVHGG 594

Query: 390 PVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            V +  A +D      LWV  PGQ GGAA+ADVLFG  +P
Sbjct: 595 AVALGAAADDCDAVLDLWV--PGQMGGAALADVLFGDYSP 632


>gi|71731103|gb|EAO33170.1| Beta-glucosidase [Xylella fastidiosa subsp. sandyi Ann-1]
          Length = 882

 Score =  212 bits (540), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 124/298 (41%), Positives = 172/298 (57%), Gaps = 7/298 (2%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A S++RGLQG+T    +  A  KH+
Sbjct: 128 AGLTLWSPNINIFRDPRWGRGMETYGEDPYLTGQLAVSFIRGLQGDTPDHPRTIATPKHF 187

Query: 67  TAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
             +         R+ F+  VS  DLE TY   F+A +V+G   SVMC+YN ++G P CA 
Sbjct: 188 AVHSGPEQG---RHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACAS 244

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAI 186
             +L   +   W  +G++VSDCD++  +     + +    A+A A+K+G DL+CG     
Sbjct: 245 DWLLNTRLRNDWGFNGFVVSDCDAIEDMTRFHFFRQDNASASAAALKSGDDLNCGNTYRD 304

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLAL 246
             +   RG  + E  ++ AL    T + RLG         P+  +G + + TPAH+ LAL
Sbjct: 305 LNQAIARGD-IDESTLDQALIRLFTARQRLGTLQPR-EHDPYAAIGIKHIDTPAHRALAL 362

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           QAA Q +VLLKNS  TLPL      T+AV+GP++D    +  NY G +    TPL G+
Sbjct: 363 QAAAQSLVLLKNSGNTLPLPP--ETTLAVLGPDADSLTALEANYQGTSSTPVTPLTGL 418



 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 60/114 (52%), Gaps = 13/114 (11%)

Query: 327 IGAAEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKA 376
           +  AE A   ADA V  +GL   +E E +          DR  + LP  Q+ L+  V K 
Sbjct: 601 LAEAERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHV-KT 659

Query: 377 SRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +  P+++VLM G  V +++A++     AIL   YPGQ+GG AIA  L G  NPG
Sbjct: 660 TGKPLIVVLMSGSAVALNWAQH--HADAILAAWYPGQSGGTAIAQALAGDVNPG 711


>gi|270294390|ref|ZP_06200592.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|270275857|gb|EFA21717.1| conserved hypothetical protein [Bacteroides sp. D20]
          Length = 864

 Score =  211 bits (538), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 126/317 (39%), Positives = 173/317 (54%), Gaps = 14/317 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAACCKHY 66
           GLT W+P VNI+RDPRWGRG ET GEDP LT +     V+GLQG N G   K+ AC KH+
Sbjct: 127 GLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTNDGKYDKLHACAKHF 186

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  +  +DL +TY  PF+A V EGKV  VMC+YN+  G P C 
Sbjct: 187 AVHSGPEWN---RHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCG 243

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +  +W  DG ++SDC ++   Y    +   P  E A+A A+ +G DL+CG  
Sbjct: 244 SDRLLMQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHPDAESASAVAVLSGTDLECGSS 303

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                E AV+ G + E+ V++A+   +T +  LG  D EP    +  +    V +  H  
Sbjct: 304 YEALVE-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDS 361

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL  A + + LL N   TLPL      TVAV+GPN++ +V   GNY G+     T L G
Sbjct: 362 LALDMARKSMTLLMNKDNTLPLKR-GGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILDG 420

Query: 304 ISRYAKT----IHQAGC 316
           I +   T    I++ GC
Sbjct: 421 IRKALGTDDRLIYEQGC 437



 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 41/105 (39%), Positives = 57/105 (54%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  V V G+  ++E E +          DR  + LP  Q+EL++ + +A +  VVLV
Sbjct: 598 KDADIVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAVQRELIAALHRAGK-KVVLV 656

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G P+ +       R GAIL   YPGQAGG A+A+VLFG  NP
Sbjct: 657 NCSGSPIGLE--PETGRCGAILQAWYPGQAGGTAVAEVLFGDYNP 699


>gi|317480996|ref|ZP_07940076.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides sp. 4_1_36]
 gi|316902889|gb|EFV24763.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides sp. 4_1_36]
          Length = 864

 Score =  211 bits (537), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 126/317 (39%), Positives = 173/317 (54%), Gaps = 14/317 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAACCKHY 66
           GLT W+P VNI+RDPRWGRG ET GEDP LT +     V+GLQG N G   K+ AC KH+
Sbjct: 127 GLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTNDGKYDKLHACAKHF 186

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  +  +DL +TY  PF+A V EGKV  VMC+YN+  G P C 
Sbjct: 187 AVHSGPEWN---RHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCG 243

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +  +W  DG ++SDC ++   Y    +   P  E A+A A+ +G DL+CG  
Sbjct: 244 SDRLLMQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHPDAESASAAAVLSGTDLECGSS 303

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                E AV+ G + E+ V++A+   +T +  LG  D EP    +  +    V +  H  
Sbjct: 304 YEALVE-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDS 361

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL  A + + LL N   TLPL      TVAV+GPN++ +V   GNY G+     T L G
Sbjct: 362 LALDMARKSMTLLMNKDNTLPLKR-GGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILDG 420

Query: 304 ISRYAKT----IHQAGC 316
           I +   T    I++ GC
Sbjct: 421 IRKALGTDDRLIYEQGC 437



 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 41/105 (39%), Positives = 56/105 (53%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  V V G+  ++E E +          DR  + LP  Q+EL++ +  A +  VVLV
Sbjct: 598 KDADVVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAVQRELIAALHHAGK-KVVLV 656

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G P+ +       R GAIL   YPGQAGG A+A+VLFG  NP
Sbjct: 657 NCSGSPIGLE--PETGRCGAILQAWYPGQAGGTAVAEVLFGDYNP 699


>gi|332185848|ref|ZP_08387595.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
           sp. S17]
 gi|332014206|gb|EGI56264.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
           sp. S17]
          Length = 838

 Score =  211 bits (537), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 127/314 (40%), Positives = 178/314 (56%), Gaps = 18/314 (5%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G   GLT WSPN+NIFRDPRWGRGQET GEDP LT +   ++VRGLQG      KV A  
Sbjct: 108 GLFEGLTIWSPNINIFRDPRWGRGQETYGEDPFLTSRIGIAFVRGLQGPDLKHPKVIATV 167

Query: 64  KHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
           KH   +     +G +  R  F+  VS +DLE TY   F+A V EGK  S+MC+YN ++G 
Sbjct: 168 KHLAVH-----SGPEGGRDSFDVMVSPRDLEATYLPAFRATVTEGKALSLMCAYNAIHGT 222

Query: 122 PTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG 181
           P CA+P ++   +   W   G  VSDCD+VG ++   H       AAA A++AG DL+CG
Sbjct: 223 PVCANPMLMTERLRTDWGFKGLTVSDCDAVGNIWMFHHAQPDAPAAAAAALRAGTDLNCG 282

Query: 182 PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAH 241
                  E A++ GL+ E +++ ALA  + V+  L +        P+  + P  + TPA+
Sbjct: 283 NTYRALPE-ALKRGLITEGEIDTALARALAVRRMLTV------DSPWNRIKPSQLGTPAN 335

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
           + LAL+ A +GI+LL N    LP   L+   +AV G ++D    + GNY G A    TPL
Sbjct: 336 RALALETARKGIILLDNPNDLLP---LKGKKLAVFGADADDLGVLEGNYHGSAIDPVTPL 392

Query: 302 QGI-SRYAKTIHQA 314
            GI +++ + +  A
Sbjct: 393 DGIRAKFGRDVRYA 406



 Score = 46.2 bits (108), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 53/103 (51%), Gaps = 15/103 (14%)

Query: 337 ADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLM 386
           AD  V ++GL   +E E +          DR  ++LP  Q+ L++ + +A+  PVV V++
Sbjct: 563 ADVLVAIVGLSPDLEGEALGVSVPGFAGGDRTEVVLPEPQRNLLAAL-QATGKPVVAVIV 621

Query: 387 CGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            G  V +     D +  A L   YPG  GG A+A++L G  NP
Sbjct: 622 SGSAVSL----GDIKPAATLAAFYPGAEGGTALAEILSGDVNP 660


>gi|206901280|ref|YP_002250567.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
 gi|206740383|gb|ACI19441.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
          Length = 762

 Score =  211 bits (537), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 150/448 (33%), Positives = 233/448 (52%), Gaps = 44/448 (9%)

Query: 13  SPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDLD 72
           SP ++I RDPRWGR +ET GEDP L  + A  YV+GLQG    R  + A  KH+TAY + 
Sbjct: 146 SPVLDIPRDPRWGRTEETFGEDPYLVSRMATEYVKGLQGEDW-REGIVATVKHFTAYGIS 204

Query: 73  NWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILKN 132
                 R    A+V +++L + +  PF+  + EG+  S+M +Y++++G P  +   +L  
Sbjct: 205 EGA---RNLGPAKVGERELREVFLFPFEVAIKEGQAGSLMNAYHEIDGVPCASSKFLLTK 261

Query: 133 TIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTE--- 189
            +  +W   GY+VSD  +V +L N     R  +EAA  A++AG+D++  P +  + E   
Sbjct: 262 ILRWEWGFKGYVVSDYIAVRMLENFHKVARDAKEAAVLALEAGIDIEL-PSVDCYGEPLI 320

Query: 190 GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAA 249
            AV+ GL+ EE +N ++   +  +  LG+FD      P       D   P  + L+ + A
Sbjct: 321 QAVKEGLISEEVINASVERVLRAKFMLGLFDDNLEKDPKKVYEVFD--KPEFRDLSREVA 378

Query: 250 HQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA------GVACGY------ 297
            + IVLLKN   TLPLS      VAVIGPN+D    + G+Y+       +A G       
Sbjct: 379 RRSIVLLKNDG-TLPLSK-NLKKVAVIGPNADNPRNLHGDYSYTAHIPSIAEGLEGVKVE 436

Query: 298 -------TTPLQGI----SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG- 345
                   + L+GI    S   + ++  GC  ++ + +    A E+ A++AD  + VMG 
Sbjct: 437 EKCVVRTVSILEGIRNKVSPETEVLYAKGCDIISDSKDGFAEAIEM-AKEADVIIAVMGE 495

Query: 346 ----LDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPR 401
                 + I  E  DR  L L G Q++L+  + K  + P+VLVL+ G P  + +   +  
Sbjct: 496 ESGLFHRGISGEGNDRTTLELFGVQRDLLKELHKLGK-PIVLVLINGRPQALKWEHEN-- 552

Query: 402 IGAILWVGYPGQAGGAAIADVLFGRANP 429
           + AIL   YPG+ GG A+ADV+FG  NP
Sbjct: 553 LNAILEAWYPGEEGGNAVADVIFGDYNP 580


>gi|86143269|ref|ZP_01061671.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
 gi|85830174|gb|EAQ48634.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
          Length = 873

 Score =  211 bits (536), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 124/299 (41%), Positives = 175/299 (58%), Gaps = 11/299 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT WSPN+NIFRDPRWGRG ET GEDP LTG   A YV+GLQG+    LKV A  KH+ 
Sbjct: 131 GLTMWSPNINIFRDPRWGRGHETYGEDPFLTGTLGAQYVKGLQGDDPEYLKVVATAKHFA 190

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +         R++F+A  S++DL +TY   F+  V + +V SVM +YN+  G+   ++ 
Sbjct: 191 VHSGPE---ESRHYFDANTSERDLWETYLPAFRMLVKDAQVQSVMTAYNRFRGEAASSN- 246

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L + +  +W  DGY+VSDC ++  ++     T     A+A A++ G DL+CG      
Sbjct: 247 KLLFDILRNKWGFDGYVVSDCGAINDIWEDHKITADAASASALALETGTDLNCGATYKSL 306

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA-HQQLAL 246
            E A+  GL+ EE +N+A+      +++LGMFD E +   +  + P  V T A H  LA 
Sbjct: 307 KE-AIANGLITEEKINIAIERLFRARLKLGMFDTEENLS-YATI-PFSVNTNASHTALAR 363

Query: 247 QAAHQGIVLLKNSARTLPLST-LRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           +AA + IVLLKN A  LPLS  L+   +AVIGPN+    ++ GNY G      T +QGI
Sbjct: 364 KAAQESIVLLKNEAHMLPLSKDLKQ--IAVIGPNAHNVQSLWGNYNGTPKNPVTVVQGI 420



 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 56/148 (37%), Positives = 79/148 (53%), Gaps = 22/148 (14%)

Query: 301 LQGISRYAKTIHQAGCFGVA--------CNGNQLIGAAEVAARQADATVLVMGLDQSIEA 352
           L+    YA TI  +  +G A           N+L  A  +A   +D T+LV+GL++ +E 
Sbjct: 556 LEAGKNYAITIKYSNLYGNAEARMLWSTPEKNKLERAVNLA-EDSDVTILVLGLNERLEG 614

Query: 353 EFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           E +          DR  L LP  Q+EL+ R   A+  P+VLVL+ G  + +++A+    +
Sbjct: 615 EEMRIDVEGFSKGDRTALDLPLEQRELM-RALVATGKPIVLVLLNGSALAINYAQE--HV 671

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            AIL  GYPGQ GG AIADVLFG  NP 
Sbjct: 672 PAILSAGYPGQEGGNAIADVLFGDYNPA 699


>gi|160891087|ref|ZP_02072090.1| hypothetical protein BACUNI_03534 [Bacteroides uniformis ATCC 8492]
 gi|156859308|gb|EDO52739.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           uniformis ATCC 8492]
          Length = 865

 Score =  211 bits (536), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 124/317 (39%), Positives = 173/317 (54%), Gaps = 14/317 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAACCKHY 66
           GLT W+P VNI+RDPRWGRG ET GEDP LT +     V+GLQG N G   K+ AC KH+
Sbjct: 128 GLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTNDGKYDKLHACAKHF 187

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  +  +DL +TY  PF+A V EGKV  VMC+YN+  G P C 
Sbjct: 188 AVHSGPEWN---RHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCG 244

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +  +W  DG ++SDC ++   Y    +   P  E A+A A+ +G DL+CG  
Sbjct: 245 SDRLLMQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHPDAESASAAAVLSGTDLECGSS 304

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                E AV+ G + E+ V++A+   +T +  LG  D EP    +  +    V +  H  
Sbjct: 305 YEALVE-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDS 362

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL  A + + LL N   TLPL      T+AV+GPN++ +V   GNY G+     T L G
Sbjct: 363 LALDMARKSMTLLMNKDNTLPLKR-GGLTIAVMGPNANDSVMQWGNYNGMPPHTVTILDG 421

Query: 304 ISRYA----KTIHQAGC 316
           I +      + I++ GC
Sbjct: 422 IRKALGSDDRLIYEQGC 438



 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 41/105 (39%), Positives = 56/105 (53%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  V V G+  ++E E +          DR  + LP  Q+EL++ +  A +  VVLV
Sbjct: 599 KDADIVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAVQRELIAALHHAGK-KVVLV 657

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G P+ +       R GAIL   YPGQAGG A+A+VLFG  NP
Sbjct: 658 NCSGSPIGLE--PETGRCGAILQAWYPGQAGGTAVAEVLFGDYNP 700


>gi|253574420|ref|ZP_04851761.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
 gi|251846125|gb|EES74132.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
          Length = 782

 Score =  211 bits (536), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 155/458 (33%), Positives = 228/458 (49%), Gaps = 50/458 (10%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHY 66
           G   +SP +++ RDPRWGR +E  GED  L  + A + V GLQG +      VAA  KH+
Sbjct: 174 GAVTYSPVLDVVRDPRWGRTEECFGEDAYLISEMAVASVEGLQGESLDGEDSVAATLKHF 233

Query: 67  TAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
             Y         R      + +++L +   +PF+  V E   AS+M +YN+++G P   +
Sbjct: 234 VGYGSSEGG---RNAGPVHMGRRELLEVDLLPFRKAV-EAGAASIMPAYNEIDGVPCTTN 289

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD-CGPFLA 185
            ++L   + G+W  DG +++DC ++ +L +         +AA  AI+AG+D++  G    
Sbjct: 290 EELLDGVLRGEWGFDGMVITDCGAIDMLASGHDVAEDGRDAAIQAIRAGIDMEMSGVMFG 349

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
            H   AVR G L EE ++ A+   +T++ RLG+F+  P A P      R + +  H +LA
Sbjct: 350 KHLVEAVRSGQLEEEVLDRAVRRVLTLKFRLGLFE-RPYADP--ERAERVIGSAEHVELA 406

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA--CGYTTPLQG 303
            Q A +G+VLLKN    LPLS     T+AVIGPN+D     +G+Y         TT L G
Sbjct: 407 RQLASEGVVLLKNKDGVLPLSA-DAGTIAVIGPNADAGYNQLGDYTSPQPRSKVTTVLGG 465

Query: 304 I-SRYAKT----IHQAGCFGVACNGNQLIG--AAEVAARQADATVLVMGLDQSIE----- 351
           I S+ A+T    ++  GC     NGN   G   A   A +AD  V+V+G   + +     
Sbjct: 466 IRSKLAETPERVLYAPGC---RINGNSREGFDVALSCAEKADTVVMVVGGSSARDFGEGT 522

Query: 352 --------------------AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPV 391
                                E IDR  L L G Q EL+  + K  + P+V+V + G P+
Sbjct: 523 IDLRTGASKVTDNAESDMDCGEGIDRMNLSLSGVQLELIQEIHKLGK-PLVVVYINGRPI 581

Query: 392 DVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              +   D    AIL   YPGQ GG AIAD+LFG  NP
Sbjct: 582 AEPWI--DEHADAILEAWYPGQEGGHAIADILFGDVNP 617


>gi|393786770|ref|ZP_10374902.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
           CL02T12C05]
 gi|392658005|gb|EIY51635.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
           CL02T12C05]
          Length = 864

 Score =  211 bits (536), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 115/310 (37%), Positives = 172/310 (55%), Gaps = 16/310 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT W+PN+NIFRDPRWGRGQET GEDP LT +   + V+GLQG   +R  K+ AC KH+
Sbjct: 135 GLTMWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPDSARYNKLHACAKHF 194

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  +  +DL +TY   FK  V E  V  VMC+YN+  G P C 
Sbjct: 195 AVHSGPEWN---RHSFNAENIIPRDLWETYLPAFKTLVQEADVKEVMCAYNRFEGDPCCG 251

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE--EAAADAIKAGLDLDCGPF 183
              +L   +  +W   G +VSDC ++   + T+ +   P+   A+A+A+  G DL+CG  
Sbjct: 252 SNRLLTQILRNEWGFKGIVVSDCGAISDFWGTKKHNTHPDAAHASAEAVLNGTDLECGSN 311

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLG-MFDGEPSAQPFGNLGPRDVCTPAHQ 242
               TE A++ G++ E+ +N+++   +  +  LG M +  P   P+       V +P H+
Sbjct: 312 YRKLTE-AIKAGIISEKQINVSVKRLLKARFELGEMENIHPWTLPYS-----IVDSPKHR 365

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
            LAL+ AH+ + LL+N  + LPL   +   +A+IGPN++ +V   GNY G     +T L 
Sbjct: 366 CLALKMAHETMTLLQNKGKVLPLD--KQARIAIIGPNANDSVMQWGNYNGTPSHTSTLLS 423

Query: 303 GISRYAKTIH 312
              +     H
Sbjct: 424 AFRKRLPISH 433



 Score = 44.3 bits (103), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 30/106 (28%), Positives = 49/106 (46%), Gaps = 13/106 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           +  D  +   G+  S+E E +          DR  +  P  Q+++++ + +A +  V+LV
Sbjct: 599 KDIDIIIFAGGISPSLEGEEMNVSATGFKGGDRTDIEFPAVQRKVLAALKEAGKK-VILV 657

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
              G  +  +         AIL   YPG+ GG AI +VLFG  NP 
Sbjct: 658 NFSGSAM--ALTPETKSCDAILQAWYPGEEGGMAIVNVLFGDYNPA 701


>gi|393789624|ref|ZP_10377744.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
           CL02T12C05]
 gi|392650340|gb|EIY44009.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
           CL02T12C05]
          Length = 855

 Score =  211 bits (536), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 120/315 (38%), Positives = 178/315 (56%), Gaps = 23/315 (7%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           GG   LT+WSP VN+ RDPRWGR  ET GEDP L+G+    +V+GLQG+    LK  +  
Sbjct: 132 GGSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGRIGCEFVKGLQGDNPRYLKTVSTP 191

Query: 64  KHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
           KH+ A + ++    +R   NAR+S++DL + Y   F+ C+V+GK  S+M +YN VN  P 
Sbjct: 192 KHFAANNEEH----NRSSCNARMSERDLREYYLPAFERCIVDGKAQSIMMAYNAVNDVPC 247

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-- 181
             +  ++K  + G W  +GYIVSDC +   +     Y +  E AA  A+KAGLDL+CG  
Sbjct: 248 TVNIYLIKKVLRGDWNFNGYIVSDCSAPEWMVTKHKYVKNLEAAATLALKAGLDLECGDR 307

Query: 182 ----PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
               P L  + E  V      E +++ A  + +  +M LG+FD +PS  P+  + P  + 
Sbjct: 308 VYTAPLLKAYNEYMV-----SEAEIDSAAYHILRGRMLLGLFD-DPSQNPYNKIEPSVIG 361

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
              HQ+LAL+ A Q +VLLKN    LPL+  +  ++AV+G ++       G+Y+G     
Sbjct: 362 CKEHQELALETARQSMVLLKNQKNFLPLNRKKIRSIAVVGISA--AHCEFGDYSGNPK-- 417

Query: 298 TTP---LQGISRYAK 309
            TP   L GI +YA+
Sbjct: 418 NTPVSVLDGIKKYAE 432



 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 45/105 (42%), Positives = 63/105 (60%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           L G A   A++ D TV V+G+++SIE E  DR  L LP  QQE +  + K +   VV VL
Sbjct: 592 LFGDAGKVAKECDVTVAVLGINKSIEREGQDRYSLELPIDQQEFIKELYKVNPNTVV-VL 650

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + +++   D  + AIL   YPG+ GG A+A+VLFG  NPG
Sbjct: 651 VAGSSMAINWM--DENVPAILNAWYPGEQGGNAVAEVLFGDYNPG 693


>gi|383110854|ref|ZP_09931672.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
 gi|313694427|gb|EFS31262.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
          Length = 861

 Score =  211 bits (536), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 119/316 (37%), Positives = 179/316 (56%), Gaps = 16/316 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAACCKHY 66
           GLT+W+PNVNIFRDPRWGRGQET GEDP LTG+   + VRGLQG   G   K+ AC KH+
Sbjct: 131 GLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQLGMAVVRGLQGPENGKYDKLHACAKHF 190

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ F+A  ++ +DL +TY   FK  V +  V  VMC+YN+  G+P C 
Sbjct: 191 AVHSGPEWN---RHSFDAENITPRDLWETYLPAFKDLVQKADVKEVMCAYNRFEGEPCCG 247

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +  +W   G +VSDC ++   Y    +   P  E A+A A+ +G DL+CG  
Sbjct: 248 SNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAGAVLSGTDLECGGE 307

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                + AV+ GL+ E+ ++++L   +T +  LG  D +P+   +  +    + +  HQ 
Sbjct: 308 YGSLAD-AVKAGLIDEKQIDVSLKRLLTARFELGEMDEQPA---WAEIPASTLNSKEHQD 363

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ A + +VLL+N    LPL+T     VAV+GPN++ +V   GNY G+     T L+ 
Sbjct: 364 LALRMARESLVLLQNKNDILPLNT--DLKVAVMGPNANDSVMQWGNYNGIPGHTVTLLEA 421

Query: 304 ISRY---AKTIHQAGC 316
           +       + +++ GC
Sbjct: 422 VRSKLPEGQVMYEPGC 437



 Score = 43.1 bits (100), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 51/105 (48%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  +   G+  S+E E +          DR  + LP  Q++L+  + KA +  VV +
Sbjct: 596 KDADVVLFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPAVQRDLLKALKKAGKK-VVFI 654

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G  + +    N     AIL   YPGQAGG AI DVLFG  NP
Sbjct: 655 NYSGSAIGLVPESN--TCEAILQGWYPGQAGGTAIVDVLFGDYNP 697


>gi|254295141|ref|YP_003061164.1| glycoside hydrolase [Hirschia baltica ATCC 49814]
 gi|254043672|gb|ACT60467.1| glycoside hydrolase family 3 domain protein [Hirschia baltica ATCC
           49814]
          Length = 897

 Score =  211 bits (536), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 122/315 (38%), Positives = 174/315 (55%), Gaps = 27/315 (8%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPN+NIFRDPRWGRGQET GEDP LTG+ A +++ GLQG+     K  A  KHY 
Sbjct: 149 GLTFWSPNINIFRDPRWGRGQETYGEDPYLTGRMAVNFINGLQGDDDKYFKSVATVKHYA 208

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+  N   +  DL +TY   FK    E +VASVMC+YN V G P C 
Sbjct: 209 VH-----SGPEPSRHRDNYIATDADLYETYLPAFKTAFDETEVASVMCAYNAVWGDPACG 263

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQH-------------YTRTPEEAAADAI 172
              ++K+ +  +   DGY+VSDC ++G  Y  +              +  T  +AAA ++
Sbjct: 264 SERLMKDLLREELGFDGYVVSDCGAIGDFYYDEEKKAEGTAPYAAHDHVDTRAQAAALSV 323

Query: 173 KAGLDLDCGPFLAIHTEG---AVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFG 229
             G DL+CG       +    AV+ GL+ EE ++ ++    +   +LGM+D +PS  P+ 
Sbjct: 324 NMGTDLNCGDGEGNKMDALPQAVKEGLITEETIDQSVVRLYSALFKLGMYD-DPSLVPWS 382

Query: 230 NLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGN 289
           N+    V +P+H + + +AA   +VLLKN    LPL       VAVIGPN+D   T++ N
Sbjct: 383 NISIDTVASPSHLEKSEEAARASLVLLKNDG-ILPLKP--DTKVAVIGPNADNWWTLVAN 439

Query: 290 YAGVACGYTTPLQGI 304
           Y G      T L+GI
Sbjct: 440 YYGQPTAPVTALKGI 454



 Score = 48.1 bits (113), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 31/95 (32%), Positives = 51/95 (53%), Gaps = 13/95 (13%)

Query: 345 GLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVS 394
           G+D ++E E +          DR  + LP  Q++L+  +  A+  PVVLV   G  + ++
Sbjct: 638 GIDANLEGEEMGVELDGFLGGDRTHINLPAPQEKLLKEL-HATGKPVVLVNFSGSAMALN 696

Query: 395 FAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +   D  + AI+   YPG+  G AIAD+L+G  +P
Sbjct: 697 W--EDENLPAIVQAFYPGEKSGTAIADLLWGEFSP 729


>gi|423212159|ref|ZP_17198688.1| hypothetical protein HMPREF1074_00220 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392695047|gb|EIY88272.1| hypothetical protein HMPREF1074_00220 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 768

 Score =  211 bits (536), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 149/449 (33%), Positives = 227/449 (50%), Gaps = 48/449 (10%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL 71
           +SP V++ RD RWGR  E  GEDP L G+ A +YVRG QG+  +   + AC KH+  Y  
Sbjct: 162 FSPMVDVTRDARWGRIAEGAGEDPWLGGEIAKAYVRGYQGDLSANTNILACVKHFALYGA 221

Query: 72  ----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
                ++N VD       +S+Q + + Y +P+ A  VE  V S M ++N+V G P  A+ 
Sbjct: 222 PDAGKDYNTVD-------MSRQRMFNEYMLPYLAA-VEAGVGSAMSAFNEVEGIPAAANR 273

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L + + GQW  DG++VSD D+V  L  T+H     +E +A A+ AGLD+D      + 
Sbjct: 274 WLLTDLLRGQWGFDGFVVSDWDAVREL--TEHGIGNMQEVSARALIAGLDMDMASEGLVS 331

Query: 188 T-EGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLAL 246
           T + +   G + EE +N A    +  + +LG+F  +P          +D+ TP H++ A 
Sbjct: 332 TLKKSFDEGKVTEEQINTACRRILVAKYKLGLFK-DPFKYCNEERAKKDLFTPEHRRAAR 390

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP---LQG 303
           + A +  VLLKN  R LPL   +   +AVIGP +D    M+G +  V+    TP   L+G
Sbjct: 391 EMAAESFVLLKNKERLLPLQ--KSGKIAVIGPLADNRRNMLGTWT-VSADLDTPVSVLEG 447

Query: 304 ISR-----------------YAKTIHQAGCF---GVACNG---NQLIGAAEVAARQADAT 340
           I                   Y + + +       G   +G   +QL   A   A Q+D  
Sbjct: 448 IREAVGDKAEINYARGSNLTYDEELEKRASHWGKGFPRDGRTDSQLQREALDIANQSDII 507

Query: 341 VLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDP 400
           + VMG    +  E   R  L +P  Q++L+ ++    + PVVLVL  G P+ + + + + 
Sbjct: 508 IAVMGEAAEMSGESTSRVDLNIPDAQKDLLKKLVDTGK-PVVLVLFAGRPLTLVWEEEN- 565

Query: 401 RIGAILWVGYPGQAGGAAIADVLFGRANP 429
            + AIL V +PG   G A+ADVLFG  NP
Sbjct: 566 -VPAILNVWFPGTEAGNAVADVLFGDVNP 593


>gi|256393789|ref|YP_003115353.1| beta-glucosidase [Catenulispora acidiphila DSM 44928]
 gi|256360015|gb|ACU73512.1| Beta-glucosidase [Catenulispora acidiphila DSM 44928]
          Length = 1548

 Score =  211 bits (536), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 151/492 (30%), Positives = 223/492 (45%), Gaps = 82/492 (16%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS------RLKVAAC 62
           LT+W+P VN+ RDPRWGR  E  GEDP LT   A ++V G +GNT +       LKVAA 
Sbjct: 190 LTFWAPTVNMDRDPRWGRTDEAFGEDPYLTSTMAGAFVNGYEGNTPTGQSKTGTLKVAAT 249

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
            KHY   D++     DR   ++ VS  DL D Y   F + +    V+ +M SYN +NG P
Sbjct: 250 AKHYALNDVEQ----DRTGISSNVSDTDLHDYYTKQFASLIENAHVSGLMTSYNAINGTP 305

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLY--------------------------N 156
           + AD          Q+  +GY+ SDC ++G  Y                          N
Sbjct: 306 SVADTYTANQLAQRQFGFNGYVTSDCGAIGTAYQSFPSGHDWAPPGWTTDGKSSTGTWTN 365

Query: 157 TQHYTRTPEEAA--ADAIKAGLDLDC--GPFLAIHTEGAVRGGLLREEDVNLALAYTITV 212
           T      P +A   A A++AG DL+C  G         A+  G+L E  ++ AL    TV
Sbjct: 366 TATGATVPAQAGGQAYALRAGTDLNCAGGENTYAQITAAISAGVLSEGVIDNALVKIFTV 425

Query: 213 QMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSART---------- 262
           ++  G FD    + P+  +    + +PAHQ LA + A   +VLLKN              
Sbjct: 426 RVETGEFD-PAGSNPYTGITKAQIQSPAHQALATKVADNSLVLLKNQPPAASGTSTTPPA 484

Query: 263 ---------------LPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRY 307
                          LPLS      + ++G  ++     +GNY+       +P+QGI+  
Sbjct: 485 ASSAASSAAAAAKPLLPLSAAATAKIVIVGDMANAVT--LGNYSSDPALKVSPVQGITAA 542

Query: 308 AKTIHQAGCFGV-ACNGNQLIGA---------AEVAARQADATVLVMGLDQSIEAEFIDR 357
            +  +        AC  +    A         A+VA   ADA ++ +G +Q I  E  DR
Sbjct: 543 VRKANPGASVTFDACGTSTTASAAASCSAQTLADVAG--ADAVIVFVGTNQQIADEGKDR 600

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             + +PG    L+S+VA      +VL +  GGPV +   + D    +I++ G+ G++ G 
Sbjct: 601 TSIAMPGNYDSLISQVAAVGNPRMVLAVQSGGPVRIDDVQKD--FASIVFSGFNGESQGT 658

Query: 418 AIADVLFGRANP 429
           A+ADVLFG  NP
Sbjct: 659 ALADVLFGAQNP 670


>gi|212692496|ref|ZP_03300624.1| hypothetical protein BACDOR_01992 [Bacteroides dorei DSM 17855]
 gi|212664971|gb|EEB25543.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           dorei DSM 17855]
          Length = 864

 Score =  211 bits (536), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 119/318 (37%), Positives = 175/318 (55%), Gaps = 15/318 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL--KVAACCKH 65
           GLT W+P VNI+RDPRWGRG ET GEDP LT     + V+GLQ    ++   K+ AC KH
Sbjct: 129 GLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDKIHACAKH 188

Query: 66  YTAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           +  +    WN   R+ FNA  +  +DL +TY VPF+A V EGKV  VMC+YN++ G P C
Sbjct: 189 FAVHSGPEWN---RHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCC 245

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGP 182
               +L   +  +W  +G ++SDC ++   Y  + +   P  E A+A A+ +G DL+CG 
Sbjct: 246 GSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGS 305

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
                 E A + GL+ E+D+++++   +  +  LG  D +PS   +  +    VC+  H 
Sbjct: 306 SYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHD 363

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
            L+L  A + + LL N    LPL      T+AV+GPN++ +V   GNY G      T L+
Sbjct: 364 SLSLDIARKSMTLLLNKNNILPLKR-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLE 422

Query: 303 GIS----RYAKTIHQAGC 316
           GI        K I++ GC
Sbjct: 423 GIRSAMGENDKLIYEQGC 440



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 50/105 (47%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  +   G+  S+E E +          DR  + LP  Q+EL+  +  A +  V+ V
Sbjct: 600 KDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G P+  +         AIL   YPGQ+GG A A+VLFG  NP
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNP 701


>gi|237709184|ref|ZP_04539665.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
 gi|229456880|gb|EEO62601.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
          Length = 864

 Score =  211 bits (536), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 119/318 (37%), Positives = 175/318 (55%), Gaps = 15/318 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL--KVAACCKH 65
           GLT W+P VNI+RDPRWGRG ET GEDP LT     + V+GLQ    ++   K+ AC KH
Sbjct: 129 GLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDKIHACAKH 188

Query: 66  YTAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           +  +    WN   R+ FNA  +  +DL +TY VPF+A V EGKV  VMC+YN++ G P C
Sbjct: 189 FAVHSGPEWN---RHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCC 245

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGP 182
               +L   +  +W  +G ++SDC ++   Y  + +   P  E A+A A+ +G DL+CG 
Sbjct: 246 GSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGS 305

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
                 E A + GL+ E+D+++++   +  +  LG  D +PS   +  +    VC+  H 
Sbjct: 306 SYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHD 363

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
            L+L  A + + LL N    LPL      T+AV+GPN++ +V   GNY G      T L+
Sbjct: 364 SLSLDIARKSMTLLLNKNNILPLKR-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLE 422

Query: 303 GIS----RYAKTIHQAGC 316
           GI        K I++ GC
Sbjct: 423 GIRSAMGENDKLIYEQGC 440



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 50/105 (47%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  +   G+  S+E E +          DR  + LP  Q+EL+  +  A +  V+ V
Sbjct: 600 KDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G P+  +         AIL   YPGQ+GG A A+VLFG  NP
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNP 701


>gi|423240769|ref|ZP_17221883.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
           CL03T12C01]
 gi|392643731|gb|EIY37480.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
           CL03T12C01]
          Length = 864

 Score =  211 bits (536), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 119/318 (37%), Positives = 175/318 (55%), Gaps = 15/318 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL--KVAACCKH 65
           GLT W+P VNI+RDPRWGRG ET GEDP LT     + V+GLQ    ++   K+ AC KH
Sbjct: 129 GLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDKIHACAKH 188

Query: 66  YTAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           +  +    WN   R+ FNA  +  +DL +TY VPF+A V EGKV  VMC+YN++ G P C
Sbjct: 189 FAVHSGPEWN---RHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCC 245

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGP 182
               +L   +  +W  +G ++SDC ++   Y  + +   P  E A+A A+ +G DL+CG 
Sbjct: 246 GSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPNAESASAAAVLSGTDLECGS 305

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
                 E A + GL+ E+D+++++   +  +  LG  D +PS   +  +    VC+  H 
Sbjct: 306 SYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHD 363

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
            L+L  A + + LL N    LPL      T+AV+GPN++ +V   GNY G      T L+
Sbjct: 364 SLSLDIARKSMTLLLNKNNILPLKR-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLE 422

Query: 303 GIS----RYAKTIHQAGC 316
           GI        K I++ GC
Sbjct: 423 GIRSAMGENDKLIYEQGC 440



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 50/105 (47%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  +   G+  S+E E +          DR  + LP  Q+EL+  +  A +  V+ V
Sbjct: 600 KDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G P+  +         AIL   YPGQ+GG A A+VLFG  NP
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNP 701


>gi|345514226|ref|ZP_08793739.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
 gi|229437207|gb|EEO47284.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
          Length = 864

 Score =  211 bits (536), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 119/318 (37%), Positives = 175/318 (55%), Gaps = 15/318 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL--KVAACCKH 65
           GLT W+P VNI+RDPRWGRG ET GEDP LT     + V+GLQ    ++   K+ AC KH
Sbjct: 129 GLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDKIHACAKH 188

Query: 66  YTAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           +  +    WN   R+ FNA  +  +DL +TY VPF+A V EGKV  VMC+YN++ G P C
Sbjct: 189 FAVHSGPEWN---RHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCC 245

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGP 182
               +L   +  +W  +G ++SDC ++   Y  + +   P  E A+A A+ +G DL+CG 
Sbjct: 246 GSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGS 305

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
                 E A + GL+ E+D+++++   +  +  LG  D +PS   +  +    VC+  H 
Sbjct: 306 SYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHD 363

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
            L+L  A + + LL N    LPL      T+AV+GPN++ +V   GNY G      T L+
Sbjct: 364 SLSLDIARKSMTLLLNKNNILPLKR-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLE 422

Query: 303 GIS----RYAKTIHQAGC 316
           GI        K I++ GC
Sbjct: 423 GIRSAMGENDKLIYEQGC 440



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 50/105 (47%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  +   G+  S+E E +          DR  + LP  Q+EL+  +  A +  V+ V
Sbjct: 600 KDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G P+  +         AIL   YPGQ+GG A A+VLFG  NP
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNP 701


>gi|298480647|ref|ZP_06998843.1| beta-glucosidase [Bacteroides sp. D22]
 gi|298273081|gb|EFI14646.1| beta-glucosidase [Bacteroides sp. D22]
          Length = 862

 Score =  210 bits (535), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 121/317 (38%), Positives = 172/317 (54%), Gaps = 14/317 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAACCKHY 66
           GLT W+P VN++RDPRWGRG ET GEDP LT +     V+GLQG N G   K+ AC KH+
Sbjct: 127 GLTMWTPTVNVYRDPRWGRGIETYGEDPYLTSRMGVEVVKGLQGGNDGKYDKLHACAKHF 186

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FN   +  +DL +TY  PF+A V E KV  VMC+YN+  G P C 
Sbjct: 187 AVHSGPEWN---RHEFNVENIKPRDLYETYLPPFEALVKEAKVKEVMCAYNRFEGDPCCG 243

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +  +W  DG ++SDC ++   YN + +   P  E A+A A+ +G DL+CG  
Sbjct: 244 SDRLLMQILRNEWGFDGIVLSDCGAIADFYNDRGHKTHPDAESASAAAVLSGTDLECGSS 303

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                E AV+ G + E+DV++A+   +  +  LG  D EP    +  +    V +  H  
Sbjct: 304 YKALIE-AVKLGKIDEKDVDVAVKRLLAARFALGEMD-EPDKVSWTKIPFSVVASEKHDS 361

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL  A + + LL N    LPL      T+AV+GPN++ +V   GNY G+     T L G
Sbjct: 362 LALDIARKSMTLLMNKNDILPLKR-GGLTIAVMGPNANDSVMQWGNYNGMPSHTVTVLDG 420

Query: 304 ISRYA----KTIHQAGC 316
           + +      K I++ GC
Sbjct: 421 VRKALDANDKLIYEQGC 437



 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 55/105 (52%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  + V G+  S+E E +          DR  + LP  Q+EL++ +  + +  V+LV
Sbjct: 597 KGADVVIFVGGISPSLEGEEMGVNLPGFKRGDRTDIELPAVQRELIAALHHSGK-KVILV 655

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G P+         +  AIL   YPGQAGG A+A+VLFG+ NP
Sbjct: 656 NCSGSPI--GLEPETKKCEAILQAWYPGQAGGTAVAEVLFGKYNP 698


>gi|265752711|ref|ZP_06088280.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           3_1_33FAA]
 gi|263235897|gb|EEZ21392.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           3_1_33FAA]
          Length = 864

 Score =  210 bits (535), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 119/318 (37%), Positives = 175/318 (55%), Gaps = 15/318 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL--KVAACCKH 65
           GLT W+P VNI+RDPRWGRG ET GEDP LT     + V+GLQ    ++   K+ AC KH
Sbjct: 129 GLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDKIHACAKH 188

Query: 66  YTAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           +  +    WN   R+ FNA  +  +DL +TY VPF+A V EGKV  VMC+YN++ G P C
Sbjct: 189 FAVHSGPEWN---RHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCC 245

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGP 182
               +L   +  +W  +G ++SDC ++   Y  + +   P  E A+A A+ +G DL+CG 
Sbjct: 246 GSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGS 305

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
                 E A + GL+ E+D+++++   +  +  LG  D +PS   +  +    VC+  H 
Sbjct: 306 SYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHD 363

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
            L+L  A + + LL N    LPL      T+AV+GPN++ +V   GNY G      T L+
Sbjct: 364 SLSLDIARKSMTLLLNKNNILPLKR-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLE 422

Query: 303 GIS----RYAKTIHQAGC 316
           GI        K I++ GC
Sbjct: 423 GIRSAMGENDKLIYEQGC 440



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 50/105 (47%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  +   G+  S+E E +          DR  + LP  Q+EL+  +  A +  V+ V
Sbjct: 600 KDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G P+  +         AIL   YPGQ+GG A A+VLFG  NP
Sbjct: 659 NFSGSPI--AMEPETQYCQAILQAWYPGQSGGKAAAEVLFGDYNP 701


>gi|423230604|ref|ZP_17217008.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
           CL02T00C15]
 gi|423244313|ref|ZP_17225388.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
           CL02T12C06]
 gi|392630748|gb|EIY24734.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
           CL02T00C15]
 gi|392642494|gb|EIY36260.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
           CL02T12C06]
          Length = 864

 Score =  210 bits (535), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 119/318 (37%), Positives = 175/318 (55%), Gaps = 15/318 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL--KVAACCKH 65
           GLT W+P VNI+RDPRWGRG ET GEDP LT     + V+GLQ    ++   K+ AC KH
Sbjct: 129 GLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDKIHACAKH 188

Query: 66  YTAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           +  +    WN   R+ FNA  +  +DL +TY VPF+A V EGKV  VMC+YN++ G P C
Sbjct: 189 FAVHSGPEWN---RHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCC 245

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGP 182
               +L   +  +W  +G ++SDC ++   Y  + +   P  E A+A A+ +G DL+CG 
Sbjct: 246 GSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGS 305

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
                 E A + GL+ E+D+++++   +  +  LG  D +PS   +  +    VC+  H 
Sbjct: 306 SYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHD 363

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
            L+L  A + + LL N    LPL      T+AV+GPN++ +V   GNY G      T L+
Sbjct: 364 SLSLDIARKSMTLLLNKNNILPLKR-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLE 422

Query: 303 GIS----RYAKTIHQAGC 316
           GI        K I++ GC
Sbjct: 423 GIRSAMGENDKLIYEQGC 440



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 50/105 (47%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  +   G+  S+E E +          DR  + LP  Q+EL+  +  A +  V+ V
Sbjct: 600 KDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G P+  +         AIL   YPGQ+GG A A+VLFG  NP
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNP 701


>gi|28199699|ref|NP_780013.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
 gi|182682443|ref|YP_001830603.1| beta-glucosidase [Xylella fastidiosa M23]
 gi|417557804|ref|ZP_12208815.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
 gi|28057820|gb|AAO29662.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
 gi|182632553|gb|ACB93329.1| Beta-glucosidase [Xylella fastidiosa M23]
 gi|338179587|gb|EGO82522.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
          Length = 882

 Score =  210 bits (535), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 123/298 (41%), Positives = 171/298 (57%), Gaps = 7/298 (2%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LT + A S++RGLQG+T    +  A  KH+
Sbjct: 128 AGLTLWSPNINIFRDPRWGRGMETYGEDPYLTSQLAVSFIRGLQGDTPDHPRTIATPKHF 187

Query: 67  TAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
             +         R+ F+  VS  DLE TY   F+A +V+G   SVMC+YN ++G P CA 
Sbjct: 188 AVHSGPEQG---RHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACAS 244

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAI 186
             +L   +   W  +G++VSDCD++  +     + +    A+A A+K+G DL+CG     
Sbjct: 245 DWLLNTRLRNDWGFNGFVVSDCDAIEDMTRFHFFRQDNASASAAALKSGNDLNCGNTYRD 304

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLAL 246
             +   RG  + E  ++ AL    T + RLG         P+  +G + + TPAH+ LAL
Sbjct: 305 LNQAIARGD-IDESTLDQALIRLFTARQRLGTLQPR-EHDPYAAIGIKHIDTPAHRALAL 362

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           QAA Q +VLLKNS  TLPL      T+AV+GP++D    +  NY G +    TPL G+
Sbjct: 363 QAAAQSLVLLKNSGNTLPLPP--ETTLAVLGPDADSLTALEANYQGTSSTPVTPLTGL 418



 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 60/114 (52%), Gaps = 13/114 (11%)

Query: 327 IGAAEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKA 376
           +  AE A   ADA V  +GL   +E E +          DR  + LP  Q+ L+  V K 
Sbjct: 601 LAEAERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHV-KT 659

Query: 377 SRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +  P+++VLM G  V +++A++     AIL   YPGQ+GG AIA  L G  NPG
Sbjct: 660 TGKPLIVVLMSGSAVALNWAQH--HADAILAAWYPGQSGGTAIAQALAGDVNPG 711


>gi|427387362|ref|ZP_18883418.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
           12058]
 gi|425725523|gb|EKU88394.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
           12058]
          Length = 865

 Score =  210 bits (535), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 123/304 (40%), Positives = 176/304 (57%), Gaps = 10/304 (3%)

Query: 5   GMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN-TGSRLKVAACC 63
           G  GLT+W+PN+NI+RDPRWGRG ET GEDP LT     + V+GLQG+ TG   K  AC 
Sbjct: 127 GYKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGDGTGKYDKTHACA 186

Query: 64  KHYTAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           KHY  +    WN   R+ F+A+ +S++DL +TY   FK  V EGKV  VMC+YN+  G+P
Sbjct: 187 KHYAVHSGPEWN---RHSFDAKNISQRDLWETYLPAFKTLVTEGKVKEVMCAYNRYEGEP 243

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDC 180
            C++  +L   +   W  D  +VSDC ++G  Y   H+   P    A+ADA+ +G DL+C
Sbjct: 244 CCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVVSGTDLEC 303

Query: 181 GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           G   +   E AVR GL+ E+ +N ++   +  + +LGMFD + +   +  +    V +  
Sbjct: 304 GGSYSSLNE-AVRKGLISEDKINESVFRLLRARFQLGMFD-DNTLVSWSEIPYSVVESKE 361

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           H   AL+ A + +VLL N    LPLS      VAV+GPN++ +V +  NY G      T 
Sbjct: 362 HVAKALEMARKSMVLLTNKNNILPLSK-SVRKVAVLGPNANDSVMLWANYNGFPTKSVTI 420

Query: 301 LQGI 304
           L+GI
Sbjct: 421 LEGI 424



 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 39/106 (36%), Positives = 56/106 (52%), Gaps = 13/106 (12%)

Query: 334 ARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVL 383
           A +AD  + V GL  S+E E +          DR  + LP  Q+E++  + K  + PVV 
Sbjct: 598 AAEADVIIFVGGLSSSLEGEEMPVDLPGFRKGDRTNIDLPQVQEEMLKALKKTGK-PVVF 656

Query: 384 VLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           VL  G  + + +   +  + AI+   YPGQ GG A+ADVLFG  NP
Sbjct: 657 VLCSGSTLALPWEAEN--LDAIIEAWYPGQQGGTAVADVLFGDYNP 700


>gi|256393466|ref|YP_003115030.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
 gi|256359692|gb|ACU73189.1| glycoside hydrolase family 3 domain protein [Catenulispora
           acidiphila DSM 44928]
          Length = 1343

 Score =  210 bits (535), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 152/470 (32%), Positives = 225/470 (47%), Gaps = 63/470 (13%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN--TGSR----LKVAAC 62
           LT+W+PNVN+ RDP WGR  E+ GEDP LT   A ++V G QG   TG +    LKVAA 
Sbjct: 175 LTFWAPNVNMDRDPLWGRTNESFGEDPYLTSTMAGAFVDGYQGQSMTGQQQTPYLKVAAT 234

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
            KHY+  ++++     R+  ++  +  ++ D Y   F + V +  V+ +M SYN VNG P
Sbjct: 235 AKHYSLNNIED----SRHTGSSDTTDANIRDYYTKQFASLVRDAHVSGIMTSYNAVNGTP 290

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE----------------- 165
           + AD   +   +   +   GY  SDC ++G +Y    +   P                  
Sbjct: 291 SPADTYTVDELLQATYGFAGYTTSDCGAIGDVYGAASHGWAPPGWTSNGTSWTNNATGRQ 350

Query: 166 -EAAAD----AIKAGLDLDC--GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGM 218
             AAA     AI+AG  L+C  G   A +   A+  GLL    V+  L    TV+M  G 
Sbjct: 351 ISAAAGGQAFAIRAGTQLNCAGGEMTAQNISAAIDLGLLSNGVVDATLTRLFTVRMETGE 410

Query: 219 FDGEPSAQPFGNLGPRDVC-----TPAHQQLALQAAHQGIVLLKN------SARTLPLST 267
           FD      P G +G   +      +PAHQ LA Q A   IVLL+N      SA+ LP+  
Sbjct: 411 FD------PAGKVGYTKITKDQIESPAHQALAEQVAANDIVLLQNGAVSGTSAKLLPVDP 464

Query: 268 LRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRYAKTIHQAG--CFGVACNGNQ 325
            +  +V ++G  ++     +G Y+G        +QGI+   +  + +    F     G Q
Sbjct: 465 AKTDSVVIVGDLANKVT--LGGYSGEPTHEVNAVQGITAAVQAANPSATVTFDACGTGTQ 522

Query: 326 LI------GAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRG 379
           +        A + A + A   ++V G D S+  E  DR+ L LPG    L+S+V+     
Sbjct: 523 ITTPASCSAATQAAIKSASLVLVVAGSDLSVADEANDRSTLALPGNYDSLISQVSALGNP 582

Query: 380 PVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              LV+   GP D+  A+ D    AI++ GY GQ+ G A+A VLFG+ NP
Sbjct: 583 RTALVMQADGPYDIQDAQKD--FPAIVFSGYNGQSQGTALAQVLFGQQNP 630


>gi|397691065|ref|YP_006528319.1| glycoside hydrolase family 3 protein [Melioribacter roseus P3M]
 gi|395812557|gb|AFN75306.1| glycoside hydrolase family 3 protein [Melioribacter roseus P3M]
          Length = 769

 Score =  210 bits (534), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 157/460 (34%), Positives = 234/460 (50%), Gaps = 65/460 (14%)

Query: 13  SPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN--TGSRLKVAACCKHYTAYD 70
           +P V++ RDPRWGR +ET GED  L  +   + V+GLQG+    +  KV A  KH+ A+ 
Sbjct: 169 TPVVDVVRDPRWGRVEETFGEDTYLVSQMGIASVKGLQGDGSLNNNNKVIATLKHFAAH- 227

Query: 71  LDNWNGVDRYHFN---ARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
                G      N   A  S++ L DT+ +PFK  + +  V SVM SYN+++G P+ A+ 
Sbjct: 228 -----GQPESGTNCAPANFSERFLRDTFLMPFKEAIDKAGVISVMASYNEIDGIPSHANK 282

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTR-----TPEEAAADAIKAGLDL---- 178
            +L+  +  +W   G++VSD  ++  L++ +            EAA  A++AG+++    
Sbjct: 283 WLLRKVLRDEWNFKGFVVSDYYAITELFHKEETVSHGVAANKVEAAKLALEAGVNIEFPN 342

Query: 179 -DCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDG---EPSAQPFGNLGPR 234
            DC P L   TE  V+GGL  E D++  +   +  +  LG+FD    E     F N   +
Sbjct: 343 PDCYPNL---TE-MVKGGLADESDIDALVLPMLKYKFELGLFDNPYVEAEPGQFENKLEQ 398

Query: 235 DVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA 294
           D      ++LALQAA + I LLKN    LPL   +   +AVIGPN+D   T++G Y G  
Sbjct: 399 D------RELALQAARETITLLKNEGNLLPLKDFKK--IAVIGPNAD--RTLLGGYHGTP 448

Query: 295 CGYTTPLQGIS----RYAKTIHQAGC---FGVACNGNQ-----------LIGAAEVAARQ 336
             YT+  QGI     +  +  +  GC    G + N ++           LI  A   A++
Sbjct: 449 KYYTSVYQGIKDKVGKNGEVFYSEGCKITVGGSWNDDEVILPDPAEDEKLINEAVAVAQK 508

Query: 337 ADATVLVMGLDQSIEAE------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGP 390
           +D  VLV+G ++    E        DR  L L GRQ +LV  + K  + PVV++L  G P
Sbjct: 509 SDVAVLVLGGNEQTSREAWNKKHLGDRPSLELVGRQNKLVEEILKTGK-PVVVLLFNGRP 567

Query: 391 VDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
             + F K++  + AIL   Y GQ  G A+ADVLFG  NP 
Sbjct: 568 NSIGFIKDN--VPAILECWYLGQETGRAVADVLFGDYNPS 605


>gi|224538282|ref|ZP_03678821.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224520107|gb|EEF89212.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 864

 Score =  210 bits (534), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 123/316 (38%), Positives = 176/316 (55%), Gaps = 17/316 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
            LT W+PNVNIFRDPRWGRGQET GEDP LT +   + V GLQG   +R  K+ AC KHY
Sbjct: 134 ALTVWTPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQGPDTARYNKLHACAKHY 193

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  +S +DL +TY   FK  V E KV  VMC+YN+  G+P C 
Sbjct: 194 AVHSGPEWN---RHSFNAENISPRDLWETYLPAFKTLVQEAKVKEVMCAYNRFEGEPCCG 250

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +  +W  DG +VSDC +V   +  + +   P    A+ADA+  G D++CG  
Sbjct: 251 SNRLLTQILRDEWGFDGVVVSDCGAVSDFWQKRKHETHPDAASASADAVLNGTDVECGNS 310

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                  AV+ GL+ E  +++++   +  +  LG  D       +  +    V +P H+Q
Sbjct: 311 YK-SLPDAVKAGLITENQIDISVKRLLKARFELGEMD----ENVWTGISSDVVDSPKHRQ 365

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LALQ A + + LL+N+   LPLS  +   +A+IGPN++ +V   GNY G+     T L+G
Sbjct: 366 LALQMARETMTLLQNNNNILPLS--KQAKIALIGPNANDSVMQWGNYNGLPSHTITLLEG 423

Query: 304 ISRYAKT---IHQAGC 316
           + RY  T   I++  C
Sbjct: 424 MQRYLPTSNLIYEPVC 439



 Score = 47.0 bits (110), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 13/106 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           +  D  +   G+  ++E E +          DR  + LP  Q+ +V  +  A +  +V V
Sbjct: 599 KDVDVVIFAGGISPALEGEEMPVDAAGFRGGDRTEIELPAVQRRVVEALKTAGKR-IVFV 657

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
              G  +  +         AIL   YPGQAGG A+A+VLFG  NP 
Sbjct: 658 NFSGAAI--ALEPESQNCEAILQAWYPGQAGGQAVAEVLFGDYNPA 701


>gi|149280000|ref|ZP_01886125.1| putative beta-glucosidase [Pedobacter sp. BAL39]
 gi|149229197|gb|EDM34591.1| putative beta-glucosidase [Pedobacter sp. BAL39]
          Length = 793

 Score =  210 bits (534), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 158/458 (34%), Positives = 228/458 (49%), Gaps = 59/458 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAACCKHYTAYD 70
           + P +++ R+PRW R +ET GEDPVL  +   + V G QG N GS + + +  KH+TAY 
Sbjct: 191 YGPVLDLAREPRWSRVEETYGEDPVLNSRMGEAMVSGFQGTNIGSGVNILSTLKHFTAY- 249

Query: 71  LDNWNGVDRYHFN---ARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
                GV     N     V  ++L  +Y  PFKA V  G + SVM +YN V+G P  ++ 
Sbjct: 250 -----GVPEGGHNGGSVTVGNRELFQSYLPPFKAAVKAGAL-SVMTAYNSVDGIPCSSNR 303

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC-----GP 182
            +L + + GQW  +G++VSD +S+  L    H   +  EAAA A+ AGLD D      GP
Sbjct: 304 YLLTDILRGQWGFNGFVVSDLNSISGLEGNHHVASSATEAAALAMNAGLDADLSGYGYGP 363

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
            L      AV GGL++   V+ ALA  + ++  +G+F+  P   P      + V    H 
Sbjct: 364 ALV----KAVNGGLVKMATVDTALARVLRLKFNMGLFEN-PYVNP--KQAEKQVMNAKHV 416

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC--GYTTP 300
            LA + A + +VLLKN    LPLS    + +AVIGPN+D     +G+Y          T 
Sbjct: 417 TLARKVAQESVVLLKNEKNILPLSKALKN-IAVIGPNADNVYNQLGDYTAPQADGKVITV 475

Query: 301 LQGI----SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG----------- 345
           L GI    S+     +Q GC  +    +  I AA   A ++D  ++V+G           
Sbjct: 476 LNGIRAKVSKETGVFYQKGC-AIRDTASAGIAAAVALASKSDVAIVVLGGSSARDFKTEY 534

Query: 346 --------------LDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPV 391
                         +      E  DR+ L L GRQ EL+  V K    PVV+VL+ G P+
Sbjct: 535 QNTGAAEVKASAVAVSDMESGEGFDRSTLDLMGRQMELLRAVVKTGT-PVVVVLIKGRPL 593

Query: 392 DVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            +++A  +  + A++   YPGQ GG AIADVLFG  NP
Sbjct: 594 TLNWAAEN--VAAMVDAWYPGQEGGNAIADVLFGDYNP 629


>gi|423303577|ref|ZP_17281576.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
           CL03T00C23]
 gi|423307700|ref|ZP_17285690.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
           CL03T12C37]
 gi|392687941|gb|EIY81232.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
           CL03T00C23]
 gi|392689569|gb|EIY82846.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
           CL03T12C37]
          Length = 942

 Score =  210 bits (534), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 146/448 (32%), Positives = 223/448 (49%), Gaps = 32/448 (7%)

Query: 6   MAGLT-YWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           M G T  ++P +++ RD RWGR +E  GE P L  +     VRGLQ N     +VAA  K
Sbjct: 204 MLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQHNH----QVAATGK 259

Query: 65  HYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           H+ AY  +          + ++S +++E+ +  PFK  + E  +  VM SYN  +G P  
Sbjct: 260 HFAAYSNNKGAREGMARVDPQMSPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQ 319

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG--- 181
                L   + G+    GY+VSD D+V  LY      +  +EA   +++AGL++ C    
Sbjct: 320 GSYYWLTTRLRGEMGFRGYVVSDSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRS 379

Query: 182 -PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
                +     V+ G L EE +N  +   + V+  +G+FD        G    R+V    
Sbjct: 380 PDSFVLPLRELVKEGGLSEEVINDRVRDILRVKFLIGLFDAPYQTDLAG--ADREVEKEE 437

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           ++ +ALQA+H+ +VLLKN+   LPL       +AV GPN++     + +Y  +A   TT 
Sbjct: 438 NEAIALQASHESVVLLKNADELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTV 497

Query: 301 LQGISRYAKT----IHQAGC--------------FGVACNGNQLIGAAEVAARQADATVL 342
           L+GI    K+    ++  GC              + +  +    I  A   ARQAD  V+
Sbjct: 498 LEGIQEKTKSKAEVLYTKGCDLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAVV 557

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           V+G  Q    E   R  L LPGRQ +L+  + +A+  PVVL+L+ G P+ +++A  D  +
Sbjct: 558 VLGGGQRTCGENKSRTSLDLPGRQLQLLQAI-QATGKPVVLILINGRPLSINWA--DKFV 614

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            AIL   YPG  GG A+AD+LFG  NPG
Sbjct: 615 PAILEAWYPGSKGGTALADILFGDYNPG 642


>gi|423221630|ref|ZP_17208100.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392645869|gb|EIY39591.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 864

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 123/316 (38%), Positives = 176/316 (55%), Gaps = 17/316 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
            LT W+PNVNIFRDPRWGRGQET GEDP LT +   + V GLQG   +R  K+ AC KHY
Sbjct: 134 ALTVWTPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQGPDTARYNKLHACAKHY 193

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  +S +DL +TY   FK  V E KV  VMC+YN+  G+P C 
Sbjct: 194 AVHSGPEWN---RHSFNAENISPRDLWETYLPAFKTLVQEAKVKEVMCAYNRFEGEPCCG 250

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +  +W  DG +VSDC +V   +  + +   P    A+ADA+  G D++CG  
Sbjct: 251 SNRLLTQILRDEWGFDGVVVSDCGAVSDFWQKRKHETHPDAASASADAVLNGTDVECGNS 310

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                  AV+ GL+ E  +++++   +  +  LG  D       +  +    V +P H+Q
Sbjct: 311 YK-SLPDAVKAGLITENQIDISVKRLLKARFELGEMD----ENVWTGISSDVVDSPKHRQ 365

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LALQ A + + LL+N+   LPLS  +   +A+IGPN++ +V   GNY G+     T L+G
Sbjct: 366 LALQMARETMTLLQNNNNILPLS--KQAKIALIGPNANDSVMQWGNYNGLPSHTITLLEG 423

Query: 304 ISRYAKT---IHQAGC 316
           + RY  T   I++  C
Sbjct: 424 MQRYLPTSNLIYEPVC 439



 Score = 46.6 bits (109), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 13/106 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           +  D  +   G+  ++E E +          DR  + LP  Q+ +V  +  A +  +V V
Sbjct: 599 KDVDVVIFAGGISPALEGEEMPVDAAGFRGGDRTEIELPAVQRRVVEALKTAGKR-IVFV 657

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
              G  +  +         AIL   YPGQAGG A+A+VLFG  NP 
Sbjct: 658 NFSGAAI--ALEPESLNCEAILQAWYPGQAGGQAVAEVLFGDYNPA 701


>gi|189467715|ref|ZP_03016500.1| hypothetical protein BACINT_04107 [Bacteroides intestinalis DSM
           17393]
 gi|189435979|gb|EDV04964.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 943

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 147/441 (33%), Positives = 219/441 (49%), Gaps = 31/441 (7%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL 71
           ++P +++ RD RWGR +E  GE P L  +     VRG+Q N     +VAA  KH+ AY  
Sbjct: 211 YAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH----QVAATGKHFVAYSN 266

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
           +          + ++S +++E  +  PFK  + E  +  VM SYN  +G P       L 
Sbjct: 267 NKGAREGMARVDPQMSPREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSYYWLT 326

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG----PFLAIH 187
             + G+    GY+VSD D+V  LY      +  +EA   +++AGL++ C         + 
Sbjct: 327 TRLRGEMGFRGYVVSDSDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLP 386

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
               V+ G L EE +N  +   + V+  +G+FD        G    R+V    ++ LALQ
Sbjct: 387 LRELVKEGGLSEEVINDRVRDILRVKFLIGLFDAPYQTDLAG--ADREVEKAENESLALQ 444

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISR- 306
           A+ + +VLLKN    LPL       +AV GPN+D     + +Y  +A   TT L+GI + 
Sbjct: 445 ASRESLVLLKNENNVLPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQK 504

Query: 307 ---YAKTIHQAGC--------------FGVACNGNQLIGAAEVAARQADATVLVMGLDQS 349
               A+ ++  GC              + +  N    I  A   ARQAD  V+V+G  Q 
Sbjct: 505 SEGKAEVLYTKGCDLVDANWPESELIDYPMTDNEQAEIDKAVENARQADVAVVVLGGGQR 564

Query: 350 IEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVG 409
              E   R+ L LPGRQ +L+  V +A+  PVVLVL+ G P+ +++A  D  + AIL   
Sbjct: 565 TCGENKSRSSLDLPGRQLKLLQAV-QATGKPVVLVLINGRPLSINWA--DKFVPAILEAW 621

Query: 410 YPGQAGGAAIADVLFGRANPG 430
           YPG  GG A+ADVLFG  NPG
Sbjct: 622 YPGSKGGTAVADVLFGDYNPG 642


>gi|317474225|ref|ZP_07933501.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316909535|gb|EFV31213.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 858

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 130/382 (34%), Positives = 203/382 (53%), Gaps = 36/382 (9%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+G    ++V+GLQGN    LK+ +  KH+ A
Sbjct: 137 LTFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGNDSRYLKIVSTPKHFAA 196

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R+  N ++S++ L + Y   F+ACV EGK AS+M +YN +N  P   +  
Sbjct: 197 NNEEH----NRFVCNPQISEKQLREYYLPAFEACVKEGKSASIMSAYNALNDVPCTLNAW 252

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG------P 182
           +L   +   W   GY+VSDC    +L N   Y +T E AA  +IKAGLDL+CG      P
Sbjct: 253 LLTKVLREDWGFKGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDAP 312

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
            L      A R  ++ + D++ A    +  +M+LG+FD   +  P+  + P+ + +  HQ
Sbjct: 313 LL-----NAYRQYMVTDADIDSAAYRVLRARMQLGLFDSGEN-NPYTKISPKVIGSKEHQ 366

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV-ACGYTTPL 301
           ++AL AA + IVLLKN  + LPL   +  ++AV+G N+    +  G+Y+G+      + L
Sbjct: 367 KVALDAARECIVLLKNQNKMLPLDAKKIKSIAVVGINA--GRSEFGDYSGLPVIAPVSIL 424

Query: 302 QGIS-RYAKTIHQA-GCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           QGI  R   T+      +  A +G +LI  A                 + ++AE+ D   
Sbjct: 425 QGIKDRVGDTVKVVYAPWKSAVDGMELIQGA--------------SFPEGLKAEYFDNTN 470

Query: 360 LL-LPGRQQELVSRVAKASRGP 380
           L  +P  ++E       A++ P
Sbjct: 471 LQGIPKVRKEEWINFEPANQAP 492



 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/105 (37%), Positives = 64/105 (60%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           L G A    R+ +  V V+G++++IE E  DR+ + LP  Q+E +  + K +   +V+VL
Sbjct: 592 LYGEAGRVVRECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKVNPN-IVVVL 650

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + +++   D  I AI+   YPG++GG A+A+VLFG  NPG
Sbjct: 651 VAGSSLSINWM--DEHIPAIINAWYPGESGGKAVAEVLFGDYNPG 693


>gi|294674604|ref|YP_003575220.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
 gi|294474051|gb|ADE83440.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
          Length = 869

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 125/324 (38%), Positives = 174/324 (53%), Gaps = 17/324 (5%)

Query: 2   YNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAA 61
           Y  G   LT+W PNVNIFRDPRWGRGQET GEDP L        V G+QGN     K  A
Sbjct: 134 YYPGNPELTFWCPNVNIFRDPRWGRGQETCGEDPYLNAVLGVQTVLGMQGNNDKYFKTHA 193

Query: 62  CCKHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           C KHY  +     +G +  R+  N   + +DL +TY   FKA V +G V  VMC+Y +  
Sbjct: 194 CAKHYAVH-----SGPEPLRHSMNVEPTNRDLWETYLPAFKALVKKGNVREVMCAYQRFE 248

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNT-QHYT-RTPEEAAADAIKAGLD 177
           GKP C    +L + +  +W  D  +++DCD++   +N  QH T +    A+ DA+  G D
Sbjct: 249 GKPCCTSDRLLIDILRNKWGYDAIVLTDCDAINNFFNRGQHETHKDGLSASVDAVLNGTD 308

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC 237
           L+CG       EG ++ GL++E D++  L  T+  +  LGMFD      P+  LG   + 
Sbjct: 309 LECGKVFMSLVEG-LKKGLIKESDLDNHLRKTLMGRFELGMFD-PADMLPWAKLGADVIS 366

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAG--VAC 295
           +  +  +A+QAA + +VLL+N A  LPLS     T+AV+GPN+D    + GNY G   A 
Sbjct: 367 SEKNDAMAVQAARESMVLLENKAGILPLSK-SIKTLAVLGPNADDVNMLNGNYGGTPTAA 425

Query: 296 GYTTPLQGISRY---AKTIHQAGC 316
              + L GI      AK I+   C
Sbjct: 426 HQHSLLSGIKAAVPGAKIIYNKAC 449



 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 36/96 (37%), Positives = 53/96 (55%), Gaps = 3/96 (3%)

Query: 335 RQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVS 394
           ++ADA V++ G+   +E E  D+  + LP  QQ LV  + K  + PV+ V  C G   ++
Sbjct: 614 KEADAIVIIGGISAQMEGEGGDKQDIELPKVQQMLVKAMHKTGK-PVIFV-NCSGSA-IA 670

Query: 395 FAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           F   +    A+L   Y GQ G  A+A+VLFG  NPG
Sbjct: 671 FGSVEGEYDALLQAWYAGQGGAKALAEVLFGDYNPG 706


>gi|218131278|ref|ZP_03460082.1| hypothetical protein BACEGG_02889 [Bacteroides eggerthii DSM 20697]
 gi|217986495|gb|EEC52831.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           eggerthii DSM 20697]
          Length = 862

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 124/317 (39%), Positives = 171/317 (53%), Gaps = 14/317 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAACCKHY 66
           GLT W+P VNI+RDPRWGRG ET GEDP LT +     V+GLQG N G   K+ AC KH+
Sbjct: 127 GLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTNDGKYDKLHACAKHF 186

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FN   +S +DL +TY  PF+A V E KV  VMC+YN   G+P C 
Sbjct: 187 AVHSGPEWN---RHSFNVEDLSARDLYETYLPPFEALVKEAKVKEVMCAYNSFEGEPCCG 243

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +   W  DG I+SDC ++   YN   +   P  E A+A A+  G DL+CG  
Sbjct: 244 SNRLLMQILRNDWGFDGIILSDCGAIADFYNEHGHKAYPDAESASAAAVLNGTDLECGSS 303

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                + AVR G + E+D++ A+   +  +  LG  D +P    +  +    V +  H  
Sbjct: 304 YKALVK-AVRAGQINEKDIDKAVMRLLEARFALGEMD-DPDNVSWTKIPFSVVASAEHDS 361

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL  A + + LL+N+   LPL      TVAV+GPN++ +V   GNY G+     T L G
Sbjct: 362 LALNMARKSMTLLQNTGNILPLKR-GGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILGG 420

Query: 304 ISRYA----KTIHQAGC 316
           I +      K I++ GC
Sbjct: 421 IRKALGADDKLIYEQGC 437



 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 56/114 (49%), Gaps = 13/114 (11%)

Query: 327 IGAAEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKA 376
           I A+    + AD  V   G+   +E E +          DR  + LP  Q+EL+S + +A
Sbjct: 589 INASVARVKDADIVVFASGVSPVLEGEEMGVNLPGFKGGDRTDIELPAIQRELISALHRA 648

Query: 377 SRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            +  ++LV   G P+  +         AIL   YPGQ GG A+A+VLFG  NPG
Sbjct: 649 GK-KIILVNCSGSPI--ALEPETKNCEAILQAWYPGQQGGTAVAEVLFGDYNPG 699


>gi|218130696|ref|ZP_03459500.1| hypothetical protein BACEGG_02285 [Bacteroides eggerthii DSM 20697]
 gi|217987040|gb|EEC53371.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           eggerthii DSM 20697]
          Length = 858

 Score =  209 bits (531), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 130/382 (34%), Positives = 203/382 (53%), Gaps = 36/382 (9%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+G    ++V+GLQGN    LK+ +  KH+ A
Sbjct: 137 LTFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGNDSRYLKIVSTPKHFAA 196

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R+  N ++S++ L + Y   F+ACV EGK AS+M +YN +N  P   +  
Sbjct: 197 NNEEH----NRFVCNPQISEKQLREYYLPAFEACVKEGKSASIMSAYNALNDVPCTLNAW 252

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG------P 182
           +L   +   W   GY+VSDC    +L N   Y +T E AA  +IKAGLDL+CG      P
Sbjct: 253 LLTKVLREDWGFKGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDAP 312

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
            L      A R  ++ + D++ A    +  +M+LG+FD   +  P+  + P+ + +  HQ
Sbjct: 313 LL-----NAYRQYMVTDADIDSAAYRVLRARMQLGLFDSGEN-NPYTKISPKVIGSKEHQ 366

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV-ACGYTTPL 301
           ++AL AA + IVLLKN  + LPL   +  ++AV+G N+    +  G+Y+G+      + L
Sbjct: 367 KVALDAARECIVLLKNQNKMLPLDAKKIKSIAVVGINA--GRSEFGDYSGLPVIAPVSIL 424

Query: 302 QGIS-RYAKTIHQA-GCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
           QGI  R   T+      +  A +G +LI  A                 + ++AE+ D   
Sbjct: 425 QGIKDRVGDTVKVVYAPWKSAVDGMELIQGA--------------SFPEGLKAEYFDNTN 470

Query: 360 LL-LPGRQQELVSRVAKASRGP 380
           L  +P  ++E       A++ P
Sbjct: 471 LQGIPKVRKEEWINFEPANQAP 492



 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/105 (37%), Positives = 64/105 (60%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           L G A    R+ +  V V+G++++IE E  DR+ + LP  Q+E +  + K +   +V+VL
Sbjct: 592 LYGEAGRVVRECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKVNPN-IVVVL 650

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + +++   D  I AI+   YPG++GG A+A+VLFG  NPG
Sbjct: 651 VAGSSLSINWM--DEHIPAIINAWYPGESGGKAVAEVLFGDYNPG 693


>gi|423305378|ref|ZP_17283377.1| hypothetical protein HMPREF1072_02317 [Bacteroides uniformis
           CL03T00C23]
 gi|423311197|ref|ZP_17289166.1| hypothetical protein HMPREF1073_03916 [Bacteroides uniformis
           CL03T12C37]
 gi|392679729|gb|EIY73108.1| hypothetical protein HMPREF1073_03916 [Bacteroides uniformis
           CL03T12C37]
 gi|392681368|gb|EIY74727.1| hypothetical protein HMPREF1072_02317 [Bacteroides uniformis
           CL03T00C23]
          Length = 864

 Score =  209 bits (531), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 125/317 (39%), Positives = 174/317 (54%), Gaps = 14/317 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAACCKHY 66
           GLT W+P VNI+RDPRWGRG ET GEDP LT +     V+GLQG N G   K+ AC KH+
Sbjct: 127 GLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTNDGKYDKLHACAKHF 186

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  +  +DL +TY  PF+A V EGKV  VMC+YN+  G P C 
Sbjct: 187 AVHSGPEWN---RHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCG 243

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLY-NTQHYTR-TPEEAAADAIKAGLDLDCGPF 183
              +L   +  +W  DG ++SDC ++   Y +  H T    E A+A A+ +G DL+CG  
Sbjct: 244 SDRLLMQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHLDAESASAAAVLSGTDLECGSS 303

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                E AV+ G + E+ V++A+   +T +  LG  D EP    +  +    V +  H  
Sbjct: 304 YEALVE-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDS 361

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL  A + + LL N   TLPL      T+AV+GPN++ +V   GNY G+     T L G
Sbjct: 362 LALDMARKSMTLLMNKDNTLPLKR-GGLTIAVMGPNANDSVMQWGNYNGMPPHTVTILDG 420

Query: 304 ISRYA----KTIHQAGC 316
           I +      + I++ GC
Sbjct: 421 IRKALGSDDRLIYEQGC 437



 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 41/105 (39%), Positives = 57/105 (54%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  V V G+  ++E E +          DR  + LP  Q+EL++ + +A +  VVLV
Sbjct: 598 KDADIVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAVQRELIAALHRAGK-KVVLV 656

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G P+ +       R GAIL   YPGQAGG A+A+VLFG  NP
Sbjct: 657 NCSGSPIGLE--PETGRCGAILQAWYPGQAGGTAVAEVLFGDYNP 699


>gi|334144838|ref|YP_004538047.1| beta-glucosidase [Novosphingobium sp. PP1Y]
 gi|333936721|emb|CCA90080.1| beta-glucosidase [Novosphingobium sp. PP1Y]
          Length = 889

 Score =  209 bits (531), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 122/291 (41%), Positives = 163/291 (56%), Gaps = 13/291 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL  WSPN+NIFRDPRWGRGQET GEDP LT +   S+V G+QG       V A  KH+ 
Sbjct: 129 GLDTWSPNINIFRDPRWGRGQETYGEDPYLTARMGVSFVEGMQGPDPDLPDVIATPKHFA 188

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +   N     R+H N  VS+ DLEDTY   F+A +VEG+  SVMC+YN+V+G+P CA  
Sbjct: 189 VH---NGPESTRHHANVFVSRHDLEDTYLPAFRAAIVEGRAGSVMCAYNRVDGQPACASQ 245

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
           ++L+  +   W   GY+VSDCD+V  + +   Y      A A A++ G+D +C  +    
Sbjct: 246 ELLQEHLVDAWGFQGYVVSDCDAVKDISDNHKYAPDGAAAVAAAMRMGVDSECHTWTLSD 305

Query: 188 TEG-------AVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           T+G       A+  GL+   DV+  L    + ++R G   G      F +    DV TPA
Sbjct: 306 TDGLTDRYREALERGLITVSDVDRTLIRLFSARLRNGDLPGVRKLSTFTSSA-ADVGTPA 364

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA 291
           H  LAL+AA + +VLLKN    LP  T     VAVIGP  D T  + GNY+
Sbjct: 365 HGALALKAAEESLVLLKNDG-ILPFQTA-GMKVAVIGPFGDATRVLRGNYS 413



 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 46/107 (42%), Positives = 65/107 (60%), Gaps = 13/107 (12%)

Query: 333 AARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVV 382
           AA+ AD  V V+GL   +EAE            D+  L +P  QQEL+ + AKA+  P++
Sbjct: 614 AAQAADVLVAVVGLTSDLEAEESPIEIPGFKGGDKTTLDIPADQQELLEQ-AKATGKPLI 672

Query: 383 LVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +V M G P+++ +AK +    AIL   YPGQ+GG AIA+VL G+ANP
Sbjct: 673 VVAMNGSPINLHWAKEN--ADAILEAWYPGQSGGLAIANVLTGKANP 717


>gi|317476310|ref|ZP_07935559.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316907336|gb|EFV29041.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 862

 Score =  209 bits (531), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 124/317 (39%), Positives = 171/317 (53%), Gaps = 14/317 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAACCKHY 66
           GLT W+P VNI+RDPRWGRG ET GEDP LT +     V+GLQG N G   K+ AC KH+
Sbjct: 127 GLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTNDGKYDKLHACAKHF 186

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FN   +S +DL +TY  PF+A V E KV  VMC+YN   G+P C 
Sbjct: 187 AVHSGPEWN---RHSFNVEDLSARDLYETYLPPFEALVKEAKVKEVMCAYNSFEGEPCCG 243

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +   W  DG I+SDC ++   YN   +   P  E A+A A+  G DL+CG  
Sbjct: 244 SNRLLMQILRNDWGFDGIILSDCGAIADFYNEHGHKAYPDAESASAAAVLNGTDLECGSS 303

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                + AVR G + E+D++ A+   +  +  LG  D +P    +  +    V +  H  
Sbjct: 304 YKALVK-AVRAGQINEKDIDKAVMRLLEARFALGEMD-DPDNVSWTKIPFSVVASAEHDS 361

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL  A + + LL+N+   LPL      TVAV+GPN++ +V   GNY G+     T L G
Sbjct: 362 LALDMARKSMTLLQNTGNILPLKR-GGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILGG 420

Query: 304 ISRYA----KTIHQAGC 316
           I +      K I++ GC
Sbjct: 421 IRKALGADDKLIYEQGC 437



 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 56/114 (49%), Gaps = 13/114 (11%)

Query: 327 IGAAEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKA 376
           I A+    + AD  V   G+   +E E +          DR  + LP  Q+EL+S + +A
Sbjct: 589 INASVARVKDADIVVFASGVSPVLEGEEMGVNLPGFKGGDRTDIELPAIQRELISALHRA 648

Query: 377 SRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            +  ++LV   G P+  +         AIL   YPGQ GG A+A+VLFG  NPG
Sbjct: 649 GK-KIILVNCSGSPI--ALEPETKNCEAILQAWYPGQQGGTAVAEVLFGDYNPG 699


>gi|170731072|ref|YP_001776505.1| beta-glucosidase [Xylella fastidiosa M12]
 gi|167965865|gb|ACA12875.1| Beta-glucosidase [Xylella fastidiosa M12]
          Length = 882

 Score =  209 bits (531), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 124/300 (41%), Positives = 174/300 (58%), Gaps = 11/300 (3%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LT + A S++RGLQGN     +  A  KH+
Sbjct: 128 AGLTLWSPNINIFRDPRWGRGMETYGEDPYLTSQLAVSFIRGLQGNIPDHPRTIATPKHF 187

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS  DLE TY   F+A +V+G   SVMC+YN ++G P C
Sbjct: 188 AVH-----SGPEPGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPAC 242

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFL 184
           A   +L   +   W  +G++VSDCD++  +     + +    A+A A+K+G DL+CG   
Sbjct: 243 ASDWLLNTRLRNDWGFNGFVVSDCDAIEDMTRFHFFRQDNASASAAALKSGDDLNCGNTY 302

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
               +   RG  + E  ++ AL    T + RLG         P+  +G + + TPAH+ L
Sbjct: 303 RDLNQAIARGD-IDESTLDQALIRLFTARQRLGTLQPR-EHDPYAAIGIKHIDTPAHRAL 360

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           ALQAA Q +VLLKNS  TLPL+     T+AV+GP++D    +  NY G +    TPL G+
Sbjct: 361 ALQAAAQSLVLLKNSGNTLPLTPGT--TLAVLGPDADSLTALEANYQGTSSTPVTPLIGL 418



 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 60/114 (52%), Gaps = 13/114 (11%)

Query: 327 IGAAEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKA 376
           +  AE A   ADA V  +GL   +E E +          DR  + LP  Q+ L+  V K 
Sbjct: 601 LAEAERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHV-KT 659

Query: 377 SRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +  P+++VLM G  V +++A++     AIL   YPGQ+GG AIA  L G  NPG
Sbjct: 660 TGKPLIVVLMSGSAVALNWAQH--HADAILAAWYPGQSGGTAIAQALAGDVNPG 711


>gi|354580734|ref|ZP_08999639.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
 gi|353203165|gb|EHB68614.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
          Length = 766

 Score =  209 bits (531), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 148/461 (32%), Positives = 231/461 (50%), Gaps = 55/461 (11%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHY 66
           G   +SP +++ RDPRWGR +ET GEDP L  ++A + V+GLQG+   +   + A  KH+
Sbjct: 158 GAATYSPVLDVVRDPRWGRTEETFGEDPHLVAEFAVAAVQGLQGDRLDAEDSLLATLKHF 217

Query: 67  TAYDLDN--WNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             Y       NG    H   R    +L +   +PF+  V E    SVM +YN+++G P  
Sbjct: 218 AGYGASEGGRNGAP-VHMGLR----ELHEIDLLPFRKAV-EAGAQSVMTAYNEIDGVPCT 271

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD-CGPF 183
           +   +L + +   W  DG++++DC ++ +L +  +   + EEAAA A+ AG+D++  G  
Sbjct: 272 SSRYLLHDVLREAWGFDGFVITDCGAIDMLKSGHNTAASGEEAAAQALTAGVDMEMSGSM 331

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
             ++   A+  G + E+D+N A+   + ++ RLG+FD  P   P      + +    H +
Sbjct: 332 FRVYLRQALEQGHITEDDLNTAVGRVLAMKFRLGLFD-RPYTDP--ERAEKVIGCEEHIE 388

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC--GYTTPL 301
           LA + A +GIVLLKN    LPL+  +   +AVIGPN++     +G+Y          T L
Sbjct: 389 LARRVAAEGIVLLKNEGNVLPLNP-KTGKIAVIGPNANAPYNQLGDYTSPQPPGQIITVL 447

Query: 302 QGISRY------AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEF- 354
           +GI R+       + ++  GC      G+   G +   A  A+A V+VM +  S   +F 
Sbjct: 448 EGIRRHIGEDADTRVLYAPGC---RIQGDSREGLSHALACAAEADVIVMAIGGSSARDFG 504

Query: 355 --------------------------IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCG 388
                                     IDR+ L L G Q EL+  + K  + PVV+V + G
Sbjct: 505 EGTIDLRTGASVVTGLAQSDMECGEGIDRSTLHLMGVQLELLQEIHKLGK-PVVVVYING 563

Query: 389 GPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            P+   +   D  I AIL   YPGQ GG+AIAD+LFG  NP
Sbjct: 564 RPITEPWI--DEHIPAILEAWYPGQEGGSAIADILFGDVNP 602


>gi|71275088|ref|ZP_00651375.1| Beta-glucosidase [Xylella fastidiosa Dixon]
 gi|71163897|gb|EAO13612.1| Beta-glucosidase [Xylella fastidiosa Dixon]
 gi|71730562|gb|EAO32640.1| Beta-glucosidase [Xylella fastidiosa Ann-1]
          Length = 815

 Score =  208 bits (530), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 124/300 (41%), Positives = 174/300 (58%), Gaps = 11/300 (3%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LT + A S++RGLQGN     +  A  KH+
Sbjct: 61  AGLTLWSPNINIFRDPRWGRGMETYGEDPYLTSQLAVSFIRGLQGNIPDHPRTIATPKHF 120

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS  DLE TY   F+A +V+G   SVMC+YN ++G P C
Sbjct: 121 AVH-----SGPEPGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPAC 175

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFL 184
           A   +L   +   W  +G++VSDCD++  +     + +    A+A A+K+G DL+CG   
Sbjct: 176 ASDWLLNTRLRNDWGFNGFVVSDCDAIEDMTRFHFFRQDNASASAAALKSGDDLNCGNTY 235

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
               +   RG  + E  ++ AL    T + RLG         P+  +G + + TPAH+ L
Sbjct: 236 RDLNQAIARGD-IDESTLDQALIRLFTARQRLGTLQPR-EHDPYAAIGIKHIDTPAHRAL 293

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           ALQAA Q +VLLKNS  TLPL+     T+AV+GP++D    +  NY G +    TPL G+
Sbjct: 294 ALQAAAQSLVLLKNSGNTLPLTPGT--TLAVLGPDADSLTALEANYQGTSSTPVTPLIGL 351



 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 60/114 (52%), Gaps = 13/114 (11%)

Query: 327 IGAAEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKA 376
           +  AE A   ADA V  +GL   +E E +          DR  + LP  Q+ L+  V K 
Sbjct: 534 LAEAERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHV-KT 592

Query: 377 SRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +  P+++VLM G  V +++A++     AIL   YPGQ+GG AIA  L G  NPG
Sbjct: 593 TGKPLIVVLMSGSAVALNWAQH--HADAILAAWYPGQSGGTAIAQALAGDVNPG 644


>gi|114568800|ref|YP_755480.1| glycoside hydrolase family protein [Maricaulis maris MCS10]
 gi|114339262|gb|ABI64542.1| glycoside hydrolase, family 3 domain protein [Maricaulis maris
           MCS10]
          Length = 750

 Score =  208 bits (530), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 150/430 (34%), Positives = 221/430 (51%), Gaps = 30/430 (6%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAACCKHYTAYD 70
           ++P +++ RDPRWGR  E  GEDPVLT    A+ VRG QG +      +AAC KH+  Y 
Sbjct: 144 FAPMIDVCRDPRWGRIAECLGEDPVLTSVLGAAMVRGFQGASLDDPSSLAACAKHFAGYG 203

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
                   R +    + + +L + +  PF+A  VE  VAS+M S++ ++G P  A+  +L
Sbjct: 204 ASESG---RDYNTTNLPENELRNVHFPPFRAA-VEAGVASLMTSFSDIDGVPATANSFLL 259

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD--CGPFLAIHT 188
           ++ +  +WR DG +VSD D++  L      T T +EAA  A  AG+D+D   G +L  H 
Sbjct: 260 RDVLREEWRYDGLVVSDWDAIQQLC-VHGLTETRDEAAFQAASAGVDMDMVAGAYLQ-HL 317

Query: 189 EGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEP--SAQPFGNLGPRDVCTPAHQQLAL 246
            G V  G +  E V+  +A  + ++ RLG+FD  P  + +P              + LA 
Sbjct: 318 AGLVASGRIELETVDRMVANVLRLKFRLGLFDSRPVLADEP---------ARMTSRSLAK 368

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA--GVACGYTTPLQGI 304
           +AA Q  VLLKN  R LPL       +AVIGP ++     +G +   G      TPL  I
Sbjct: 369 EAALQSCVLLKNEGRALPLDPACLDHLAVIGPLANEPAEQLGTWVFDGDPERSVTPLAAI 428

Query: 305 SRYAK----TIHQAGCFGVACNGNQL-IGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
              A     ++  A       + ++     AE  AR AD  V+ +G +  +  E   RA 
Sbjct: 429 ESLAADAGMSVSHARAMPTTRSLDETAFAEAEAIARNADVVVVFLGEEAILSGEAHCRAD 488

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           + LPG Q  LV R+ KA   PV+ V+  G P+ ++   +D  + AIL+  +PG  GGAAI
Sbjct: 489 IDLPGAQVSLVKRL-KAVGKPVIAVIQAGRPLTLTSVIDD--LDAILFAWHPGSLGGAAI 545

Query: 420 ADVLFGRANP 429
           AD+LFGRA P
Sbjct: 546 ADLLFGRACP 555


>gi|383125190|ref|ZP_09945844.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
 gi|251838523|gb|EES66609.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
          Length = 853

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 118/304 (38%), Positives = 175/304 (57%), Gaps = 21/304 (6%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+G    ++V GLQG+    LK+ +  KH+ A
Sbjct: 138 LTFWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQGDDPHYLKIVSTPKHFAA 197

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R+  N ++S++ L + Y   F+ CV EGK AS+M +YN +N  P   +P 
Sbjct: 198 NNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNPW 253

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC------GP 182
           +L+  +   W   GY+VSDC    +L N   Y +T E AA  +IKAGLDL+C      GP
Sbjct: 254 LLQKVLRQDWGFQGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDGP 313

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFD-GEPSAQPFGNLGPRDVCTPAH 241
            L      A +  ++ + D++ A  + +T +M+LG+FD GE    P+  + P  + +  H
Sbjct: 314 LL-----NAYKQYMVSDADIDSAAYHVLTARMKLGLFDSGE--RNPYTKISPSVIGSKEH 366

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV-ACGYTTP 300
           QQ+AL AA Q IVLLKN    LPL+  +  ++AV+G N+       G+Y+G       + 
Sbjct: 367 QQIALDAARQCIVLLKNQKNRLPLNADKLKSIAVVGINA--GKCEFGDYSGAPVVEPVSI 424

Query: 301 LQGI 304
           LQGI
Sbjct: 425 LQGI 428



 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 40/105 (38%), Positives = 62/105 (59%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           L G A  A R+ +  V VMG+++SIE E  DR  + LP  Q+E +  + K +   +++VL
Sbjct: 593 LYGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVL 651

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + +++   D  I AI+   YPG+ GG A+A+VLFG  NP 
Sbjct: 652 VAGSSLAINWM--DEHIPAIVNAWYPGEQGGTAVAEVLFGDYNPA 694


>gi|329956938|ref|ZP_08297506.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
 gi|328523695|gb|EGF50787.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
          Length = 944

 Score =  208 bits (529), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 147/448 (32%), Positives = 224/448 (50%), Gaps = 32/448 (7%)

Query: 6   MAGLT-YWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           M G T  ++P +++ RD RWGR +E  GE P L  +     VRGLQ N     +VAA  K
Sbjct: 204 MLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQHNH----QVAATAK 259

Query: 65  HYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           H+ AY  +          + ++S +++E+ +  PFK  + E  +  +M SYN  +G P  
Sbjct: 260 HFAAYSNNKGAREGMSRVDPQMSPREVENIHIYPFKRVIRETGLLGIMSSYNDYDGIPVQ 319

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG--- 181
                L   +  +    GY+VSD D+V  LY   +  +  +EA   +++AGL++ C    
Sbjct: 320 GSYYWLTTRLRQEMGFRGYVVSDSDAVEYLYTKHNTAKDMKEAVRQSVEAGLNVRCTFRS 379

Query: 182 -PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
                +     V+ G L EE +N  +   + V+  +G+FD        G     +V   A
Sbjct: 380 PDSFVLPLRELVKEGGLSEEVINDRVRDILRVKFLIGLFDSPYQTDLAG--ADNEVEKAA 437

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           ++ +ALQA+ + +VLLKN+  TLPL+  +   +AV GPN+D     + +Y  +A   TT 
Sbjct: 438 NEAVALQASRESVVLLKNADNTLPLNIDKIKKIAVCGPNADEEGYALTHYGPLAVEVTTV 497

Query: 301 LQGISRYAK----TIHQAGC--------------FGVACNGNQLIGAAEVAARQADATVL 342
           L+GI   A+     ++  GC              + +  +    I  A   ARQAD  V+
Sbjct: 498 LEGIREKAQGKAEVLYTKGCDLVDAHWPESEIIEYPLTPDEQAEIDRAAANARQADVAVV 557

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           V+G  Q    E   R  L LPG Q +L+  V +A+  PVVLVL+ G P+ V++A  D  +
Sbjct: 558 VLGGGQRTCGENKSRTSLDLPGHQLKLLQAV-QATGKPVVLVLINGRPLSVNWA--DKFV 614

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            AIL   YPG  GG A+AD+LFG  NPG
Sbjct: 615 PAILEAWYPGSKGGTAVADILFGDYNPG 642


>gi|380696428|ref|ZP_09861287.1| beta-glucosidase [Bacteroides faecis MAJ27]
          Length = 851

 Score =  207 bits (528), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 115/303 (37%), Positives = 174/303 (57%), Gaps = 19/303 (6%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+G    ++V+GLQG+    LK+ +  KH+ A
Sbjct: 136 LTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPHYLKIVSTPKHFAA 195

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R+  N ++S++ L + Y   F+ CV EGK AS+M +YN +N  P   +  
Sbjct: 196 NNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNAW 251

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC------GP 182
           +L+  +   W   GY+VSDC    +L N   Y +T E AA  ++KAGLDL+C      GP
Sbjct: 252 LLQKVLRKDWGFQGYVVSDCGGPALLVNAHKYLKTKEAAATLSLKAGLDLECGDDVYDGP 311

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
            L      A +  ++ + D++ A  + +T +M+LG+FDG     P+  + P  + +  HQ
Sbjct: 312 LL-----NAYKQYMVSDADIDSAAYHVLTARMKLGLFDG-VERNPYTKISPSVIGSKEHQ 365

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV-ACGYTTPL 301
           Q+AL AA Q IVLLKN    LPL+  +  ++AV+G N+       G+Y+G       + L
Sbjct: 366 QIALDAARQCIVLLKNQKNMLPLNASKLKSIAVVGINA--GKCEFGDYSGAPVVEPVSIL 423

Query: 302 QGI 304
           QGI
Sbjct: 424 QGI 426



 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 62/105 (59%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           L G A  A R+ +  + VMG+++SIE E  DR  + LP  Q+E +  + K +   ++++L
Sbjct: 591 LYGEAGKAVRECETVIAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNSN-MIVIL 649

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + +++   D  + AI+   YPG+ GG A+A+VLFG  NP 
Sbjct: 650 VAGSSLAINWM--DEHVPAIVNAWYPGEQGGTAVAEVLFGDYNPA 692


>gi|336417083|ref|ZP_08597412.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
           3_8_47FAA]
 gi|335936708|gb|EGM98626.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
           3_8_47FAA]
          Length = 850

 Score =  207 bits (528), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 115/304 (37%), Positives = 176/304 (57%), Gaps = 13/304 (4%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+G    ++V+GLQG+    LK+ +  KH+ A
Sbjct: 135 LTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPRYLKIVSTPKHFAA 194

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R+  N ++S++ L + Y   F+ CV EGK AS+M +YN +N  P   +  
Sbjct: 195 NNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNDVPCTLNAW 250

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHT 188
           +LK  +   W   GY+VSDC    +L N   Y +T E AA  +I+AGLDL+CG    ++ 
Sbjct: 251 LLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIQAGLDLECGD--DVYD 308

Query: 189 E---GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           E    A +  ++ + D++ A  + +T +M+LG+FDG     P+  + P  + +  HQQ+A
Sbjct: 309 EYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDGT-ERNPYTRISPSVIGSKEHQQIA 367

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV-ACGYTTPLQGI 304
           L AA + IVLLKN    LPL+  +  ++AV+G N+       G+Y+G       + LQGI
Sbjct: 368 LDAARECIVLLKNKNNMLPLNVNKVKSIAVVGINA--GKCEFGDYSGAPVVDPVSILQGI 425

Query: 305 SRYA 308
              A
Sbjct: 426 KDRA 429



 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 42/105 (40%), Positives = 62/105 (59%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           L G A  A R+ +  V VMG+++SIE E  DR  + LP  Q+E +  + K +   +++VL
Sbjct: 590 LYGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVL 648

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + V++   D  I AI+   YPG+ GG A+ADVLFG  NP 
Sbjct: 649 VAGSSLAVNWM--DEHIPAIVNAWYPGEQGGTAVADVLFGDYNPA 691


>gi|189461690|ref|ZP_03010475.1| hypothetical protein BACCOP_02354 [Bacteroides coprocola DSM 17136]
 gi|189431577|gb|EDV00562.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           coprocola DSM 17136]
          Length = 499

 Score =  207 bits (528), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 122/328 (37%), Positives = 183/328 (55%), Gaps = 21/328 (6%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+G    ++VRGLQG+    LKV +  KH+ A
Sbjct: 136 LTFWSPTVNMARDPRWGRTPETYGEDPYLSGTMGTAFVRGLQGDDARYLKVVSTPKHFAA 195

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R+  N ++S++ L + Y   F+AC+ +GK AS+M +YN +N  P   +  
Sbjct: 196 NNEEH----NRFECNPQISEKQLREYYLPAFEACIKDGKAASIMSAYNAINNVPCTLNSW 251

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG------P 182
           +L   +   W   GY+VSDC    +L N   Y +T E AA  +IKAGLDL+CG      P
Sbjct: 252 LLTKVLRHDWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYYEP 311

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
            L      A +  ++ + D++    + +  +MRLG+FD   +  P+  + P  + +  HQ
Sbjct: 312 LL-----NAYKQYMVSDADIDSTAYHVLKARMRLGLFDNGKN-NPYTKISPSIIGSKLHQ 365

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV-ACGYTTPL 301
           ++AL+AA Q IVLLKN    LPL T +  ++AV+G N+       G+Y+G       + L
Sbjct: 366 RVALEAARQCIVLLKNHNWVLPLDTKKLKSIAVVGINAG--NCEFGDYSGSPVIAPISIL 423

Query: 302 QGI-SRYAKTIHQA-GCFGVACNGNQLI 327
           QGI  R  K I      +  A +G +LI
Sbjct: 424 QGIKERVGKDIKVVYAPWRSAVDGMELI 451


>gi|299149391|ref|ZP_07042448.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298512578|gb|EFI36470.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 853

 Score =  207 bits (528), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 115/304 (37%), Positives = 176/304 (57%), Gaps = 13/304 (4%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+G    ++V+GLQG+    LK+ +  KH+ A
Sbjct: 138 LTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPRYLKIVSTPKHFAA 197

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R+  N ++S++ L + Y   F+ CV EGK AS+M +YN +N  P   +  
Sbjct: 198 NNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNDVPCTLNAW 253

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHT 188
           +LK  +   W   GY+VSDC    +L N   Y +T E AA  +I+AGLDL+CG    ++ 
Sbjct: 254 LLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIQAGLDLECGD--DVYD 311

Query: 189 E---GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           E    A +  ++ + D++ A  + +T +M+LG+FDG     P+  + P  + +  HQQ+A
Sbjct: 312 EYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDGT-ERNPYTRISPSVIGSKEHQQIA 370

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV-ACGYTTPLQGI 304
           L AA + IVLLKN    LPL+  +  ++AV+G N+       G+Y+G       + LQGI
Sbjct: 371 LDAARECIVLLKNKNNMLPLNVNKVKSIAVVGINA--GKCEFGDYSGAPVVDPVSILQGI 428

Query: 305 SRYA 308
              A
Sbjct: 429 KDRA 432



 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 42/105 (40%), Positives = 62/105 (59%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           L G A  A R+ +  V VMG+++SIE E  DR  + LP  Q+E +  + K +   +++VL
Sbjct: 593 LYGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVL 651

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + V++   D  I AI+   YPG+ GG A+ADVLFG  NP 
Sbjct: 652 VAGSSLAVNWM--DEHIPAIVNAWYPGEQGGTAVADVLFGDYNPA 694


>gi|383113364|ref|ZP_09934136.1| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
 gi|382948729|gb|EFS32368.2| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
          Length = 850

 Score =  207 bits (528), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 114/301 (37%), Positives = 175/301 (58%), Gaps = 13/301 (4%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+G    ++V+GLQG+    LK+ +  KH+ A
Sbjct: 135 LTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPRYLKIVSTPKHFAA 194

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R+  N ++S++ L + Y   F+ CV EGK AS+M +YN +N  P   +  
Sbjct: 195 NNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNDVPCTLNAW 250

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHT 188
           +LK  +   W   GY+VSDC    +L N   Y +T E AA  +I+AGLDL+CG    ++ 
Sbjct: 251 LLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIQAGLDLECGD--DVYD 308

Query: 189 E---GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           E    A +  ++ + D++ A  + +T +M+LG+FDG     P+  + P  + +  HQQ+A
Sbjct: 309 EYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDGT-ERNPYTRISPSVIGSKEHQQIA 367

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV-ACGYTTPLQGI 304
           L AA + IVLLKN    LPL+  +  ++AV+G N+       G+Y+G       + LQGI
Sbjct: 368 LDAARECIVLLKNKNNMLPLNVNKVKSIAVVGINA--GKCEFGDYSGAPVVDPVSILQGI 425

Query: 305 S 305
            
Sbjct: 426 K 426



 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 42/105 (40%), Positives = 62/105 (59%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           L G A  A R+ +  V VMG+++SIE E  DR  + LP  Q+E +  + K +   +++VL
Sbjct: 590 LYGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVL 648

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + V++   D  I AI+   YPG+ GG A+ADVLFG  NP 
Sbjct: 649 VAGSSLAVNWM--DEHIPAIVNAWYPGEQGGTAVADVLFGDYNPA 691


>gi|325103214|ref|YP_004272868.1| glycoside hydrolase family protein [Pedobacter saltans DSM 12145]
 gi|324972062|gb|ADY51046.1| glycoside hydrolase family 3 domain protein [Pedobacter saltans DSM
           12145]
          Length = 866

 Score =  207 bits (528), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 121/323 (37%), Positives = 178/323 (55%), Gaps = 18/323 (5%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAAC 62
           G   GLT W+P +NIFRDPRWGRG ET GEDP LT       V+GLQG + G   K+ AC
Sbjct: 127 GRYEGLTMWTPTINIFRDPRWGRGIETYGEDPYLTAVMGVQAVKGLQGPSNGKYDKLHAC 186

Query: 63  CKHYTAYDLDNWNGVDRYHFN-ARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            KH+  +    WN   R+ F+ A + ++DL +TY   F+A V E KV  VMC+YN+  G 
Sbjct: 187 AKHFAVHSGPEWN---RHSFDAANIKQRDLYETYLPAFEALVKEAKVQEVMCAYNRFEGD 243

Query: 122 PTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE--EAAADAIKAGLDLD 179
           P C    +L+  +  +W  +G +V+DC ++   +    +   P+   A+A A+ +G DLD
Sbjct: 244 PCCGSDRLLQQILRKKWGFEGIVVADCGAIADFFKENAHKTHPDAASASAAAVYSGTDLD 303

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG      TE AV+ GL+ E+D+++++   +  + RLG  D + S  P+  +    V + 
Sbjct: 304 CGSSYKALTE-AVKKGLIEEKDIDVSVRRLLMARFRLGEMDDQ-SLVPWSKISYNVVASK 361

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLST--LRHHTVAVIGPNSDVTVTMIGNYAGVACGY 297
           AH Q+AL  A + I LL+N    LPL +  L+   +AV+GPN+  +V   GNY G     
Sbjct: 362 AHNQIALDMARKSITLLQNKNNILPLKSGGLK---IAVMGPNAQDSVMQWGNYNGTPANT 418

Query: 298 TTPLQGISR----YAKTIHQAGC 316
            T L+GI        K I++  C
Sbjct: 419 ITILEGIKAALGPKDKLIYEQAC 441



 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 38/103 (36%), Positives = 52/103 (50%), Gaps = 13/103 (12%)

Query: 337 ADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLM 386
           AD  V V G+  S+E E +          DR  + LP  Q++ V  + +A  G  V+ + 
Sbjct: 603 ADLVVFVGGISPSLEGEEMGVKLPGFRGGDRTDIQLPTIQRQFVKALKEA--GKRVIFIN 660

Query: 387 CGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           C G   +  A       AI+   YPGQAGG A+ADVLFG+ NP
Sbjct: 661 CSGS-PIGLADEMANSEAIVQAWYPGQAGGQAVADVLFGKYNP 702


>gi|29347188|ref|NP_810691.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
 gi|29339087|gb|AAO76885.1| beta-glucosidase (gentiobiase) [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 853

 Score =  207 bits (528), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 117/304 (38%), Positives = 175/304 (57%), Gaps = 21/304 (6%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+G    ++V GLQG+    LK+ +  KH+ A
Sbjct: 138 LTFWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQGDDPHYLKIVSTPKHFAA 197

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R+  N ++S++ L + Y   F+ CV EGK AS+M +YN +N  P   +P 
Sbjct: 198 NNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNPW 253

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC------GP 182
           +L+  +   W   GY+VSDC    +L N   Y +T E AA  +IKAGLDL+C      GP
Sbjct: 254 LLQKVLRQDWGFQGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDGP 313

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFD-GEPSAQPFGNLGPRDVCTPAH 241
            L      A +  ++ + D++ A  + +T +M+LG+FD GE    P+  + P  + +  H
Sbjct: 314 LL-----NAYKQYMVSDADIDSAAYHVLTARMKLGLFDSGE--RNPYTKISPSVIGSKEH 366

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV-ACGYTTP 300
           QQ+AL AA Q +VLLKN    LPL+  +  ++AV+G N+       G+Y+G       + 
Sbjct: 367 QQIALDAARQCVVLLKNQKNRLPLNADKLKSIAVVGINA--GKCEFGDYSGAPVVEPVSI 424

Query: 301 LQGI 304
           LQGI
Sbjct: 425 LQGI 428



 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 40/105 (38%), Positives = 62/105 (59%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           L G A  A R+ +  V VMG+++SIE E  DR  + LP  Q+E +  + K +   +++VL
Sbjct: 593 LYGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVL 651

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + +++   D  I AI+   YPG+ GG A+A+VLFG  NP 
Sbjct: 652 VAGSSLAINWM--DEHIPAIVNAWYPGEQGGTAVAEVLFGDYNPA 694


>gi|224538725|ref|ZP_03679264.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224519667|gb|EEF88772.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 942

 Score =  207 bits (527), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 152/448 (33%), Positives = 226/448 (50%), Gaps = 32/448 (7%)

Query: 6   MAGLT-YWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           M G T  ++P +++ RD RWGR +E  GE P L  +     VRG+Q N     +VAA  K
Sbjct: 204 MLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH----QVAATGK 259

Query: 65  HYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           H+ AY  +          + ++S +++E  +  PFK  + E  +  VM SYN  +G P  
Sbjct: 260 HFVAYSNNKGAREGMARVDPQMSPREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQ 319

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG--- 181
                L   + G+    GY+VSD D+V  LY      +  +EA   +++AGL++ C    
Sbjct: 320 GSYYWLTTRLRGEMGFRGYVVSDSDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRCTFRS 379

Query: 182 -PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
                +     V+ G L EE +N  +   + V+  +G+FD        G    ++V    
Sbjct: 380 PDSYVLPLRELVKEGGLSEEVINDRVRDILRVKFLVGLFDTPYQTDLAG--ADKEVEKAE 437

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           ++ LALQA+ + +VLLKN    LPL       +AV GPN+D     + +Y  +A   TT 
Sbjct: 438 NESLALQASRESLVLLKNENNVLPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTV 497

Query: 301 LQGISR----YAKTIHQAGCFGVACN--GNQLIG-------AAEV-----AARQADATVL 342
           L+GI +     A+ ++  GC  V  N   ++LI         AE+      ARQAD  V+
Sbjct: 498 LEGIRQKAEGKAEVLYTKGCDLVDANWPESELIDYPMTDSEQAEIDKAVENARQADVAVV 557

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           V+G  Q    E   R+ L LPGRQ +L+  V +A+  PVVLVL+ G P+ +++A  D  +
Sbjct: 558 VLGGGQRTCGENKSRSSLELPGRQLKLLQAV-QATGKPVVLVLINGRPLSINWA--DKFV 614

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            AIL   YPG  GG A+ADVLFG  NPG
Sbjct: 615 PAILEAWYPGSKGGTAVADVLFGDYNPG 642


>gi|189463167|ref|ZP_03011952.1| hypothetical protein BACCOP_03878 [Bacteroides coprocola DSM 17136]
 gi|189430146|gb|EDU99130.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           coprocola DSM 17136]
          Length = 865

 Score =  207 bits (526), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 119/316 (37%), Positives = 175/316 (55%), Gaps = 16/316 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT+W+PN+NIFRDPRWGRGQET GEDP LT +   + VRGLQG    +  K+ AC KHY
Sbjct: 132 GLTFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVRGLQGPDNMKYDKLHACAKHY 191

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  ++ +DL +TY   FKA V E  V  VMC+YN+  G+P C 
Sbjct: 192 AVHSGPEWN---RHSFNAENIAPRDLWETYLPAFKALVQEADVKEVMCAYNRFEGEPCCG 248

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGPF 183
              +L   +  +W+  G IVSDC ++   +    +   P  E A+A A+ +G DL+CG  
Sbjct: 249 SNRLLMQILRDEWKYKGIIVSDCGAISDFWRKGDHETHPDKETASAGAVLSGTDLECGNN 308

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                E AV+ GL+ E+ +++++   +T +  LG  D       + ++    V + AH+ 
Sbjct: 309 YKSLPE-AVQKGLIDEKQIDISVKRLLTARFELGEMDEHVC---WDSIPYSVVDSKAHKD 364

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ A + IVLL+N    LPL       +A+IGPN++ +V   GNY G     +T  + 
Sbjct: 365 LALEIARKSIVLLQNRNNILPLK--EDMKIALIGPNANDSVMQWGNYNGFPSHTSTLYEA 422

Query: 304 ISRYA---KTIHQAGC 316
           +       + I+  GC
Sbjct: 423 LKERIPANQLIYDFGC 438



 Score = 58.2 bits (139), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 39/106 (36%), Positives = 55/106 (51%), Gaps = 15/106 (14%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  V   G+  S+E E +          DR  + LP  Q+ L+S + K  + P++ V
Sbjct: 599 KAADVIVFAGGISPSLEGEEMPVNAEGFKGGDRTTIELPAIQRRLISELKKLGK-PIIFV 657

Query: 385 LMCGGPVDVSFAKNDPRI-GAILWVGYPGQAGGAAIADVLFGRANP 429
              G  V +   + + +I  AIL   YPGQAGG A+ADVLFG  NP
Sbjct: 658 NYSGSAVGL---EPESKICDAILQAWYPGQAGGTAVADVLFGDYNP 700


>gi|317474349|ref|ZP_07933623.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316909030|gb|EFV30710.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 877

 Score =  207 bits (526), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 120/309 (38%), Positives = 176/309 (56%), Gaps = 14/309 (4%)

Query: 2   YNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAA 61
           Y   +  LT+W+PN+NIFRDPRWGRGQET GEDP L  +   + V G+QG+     K  A
Sbjct: 121 YVSAIPNLTFWAPNINIFRDPRWGRGQETYGEDPFLMSRMGLNVVLGMQGDDEHYYKTHA 180

Query: 62  CCKHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           C KHY  +     +G +  R+ FNA VS +DL +TY   F+  VV+G V  VMC+Y+   
Sbjct: 181 CAKHYGVH-----SGPEPLRHEFNAVVSMRDLWETYLPAFETLVVKGNVREVMCAYSAYE 235

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLD 177
           G+P CA   +L + +  +W  DG +VSDCD++   Y    +   P    A+ADA+  G D
Sbjct: 236 GEPCCASNRLLVDILRNRWGFDGMVVSDCDAINDFYVKGRHETHPDAAAASADAVLTGTD 295

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQ-PFGNLGPRDV 236
           L+CG       E AV  G+++E+D++++L   +T + RLG+ D  P+   P+  +    +
Sbjct: 296 LECGRSYNALIE-AVEKGIIKEQDLDVSLRRILTERFRLGLLD--PAKYVPYSTIPGSVI 352

Query: 237 CTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
               H+  AL+ AH+  VLLKN    LPL      ++AV+GPN + ++ M GNY+G    
Sbjct: 353 DCQEHRDHALKMAHESQVLLKNEGNILPLDK-NIKSIAVVGPNINDSIMMRGNYSGSPTH 411

Query: 297 YTTPLQGIS 305
             T LQG+ 
Sbjct: 412 CITILQGLK 420



 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 49/95 (51%), Gaps = 3/95 (3%)

Query: 335 RQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVS 394
           +  +A + V G+  + E E  +RA + LP  Q+  +   A    G  V+ + C G   ++
Sbjct: 618 KDVEAIIYVGGITPTQEGEGHERAKIELPDVQKRFLK--AMHETGKPVIYVNCSGSA-IA 674

Query: 395 FAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            A  D    A+L   YPGQ GG A+ADVLFG  NP
Sbjct: 675 LADIDYAYDALLQAWYPGQEGGTAVADVLFGDYNP 709


>gi|270296173|ref|ZP_06202373.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|270273577|gb|EFA19439.1| conserved hypothetical protein [Bacteroides sp. D20]
          Length = 942

 Score =  207 bits (526), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 145/448 (32%), Positives = 221/448 (49%), Gaps = 32/448 (7%)

Query: 6   MAGLT-YWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           M G T  ++P +++ RD RWGR +E  GE P L  +     VRGLQ N     +VAA  K
Sbjct: 204 MLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQHNH----QVAATGK 259

Query: 65  HYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           H+ AY  +          + ++S +++E+ +  PFK  + E  +  VM SYN  +G P  
Sbjct: 260 HFAAYSNNKGAREGMARVDPQMSPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQ 319

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG--- 181
                L   + G+    GY+VSD D+V  LY      +  +EA   +++AGL++ C    
Sbjct: 320 GSYYWLTTRLRGEMGFRGYVVSDSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRS 379

Query: 182 -PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
                +     V+ G L EE +N  +   + V+  +G+FD        G    R+V    
Sbjct: 380 PDSFVLPLRELVKEGGLSEEVINDRVRDILRVKFLIGLFDAPYQTDLAG--ADREVEKEE 437

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           ++ +ALQA+ + IVLLKN+   LPL       +AV GPN++     + +Y  +A   TT 
Sbjct: 438 NEAIALQASRESIVLLKNAGELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTV 497

Query: 301 LQGISRYAK----TIHQAGC--------------FGVACNGNQLIGAAEVAARQADATVL 342
           L+GI    K     ++  GC              + +  +    I  A   ARQAD  ++
Sbjct: 498 LEGIQEKTKGKAEVLYTKGCDLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIV 557

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           V+G  Q    E   R  L LPGRQ +L+  + +A+  PVVL+L+ G P+ +++A  D  +
Sbjct: 558 VLGGGQRTCGENKSRTSLDLPGRQLQLLQAI-QATGKPVVLILINGRPLSINWA--DKFV 614

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            AIL   YPG  GG A+AD+LFG  NPG
Sbjct: 615 PAILEAWYPGSKGGTALADILFGDYNPG 642


>gi|371777646|ref|ZP_09483968.1| beta-glucosidase [Anaerophaga sp. HS1]
          Length = 865

 Score =  207 bits (526), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 114/307 (37%), Positives = 169/307 (55%), Gaps = 8/307 (2%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAA 61
           N     L+ W+PNVNIFRDPRWGRGQET GEDP LT +     V+GLQG   ++  K+ A
Sbjct: 125 NRRFLSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGPEDAKYRKLLA 184

Query: 62  CCKHYTAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           C KHYT +    W+   R+  N   VS ++  +TY   FKA V +  V  VMC+Y++++ 
Sbjct: 185 CAKHYTVHSGPEWS---RHELNINDVSPREFYETYMPAFKALVQKADVRQVMCAYHRLDD 241

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
           +P C++  IL+  +  +W  +  +V+DC ++   Y T   + TP  AAA  + AG DL+C
Sbjct: 242 EPCCSNTRILQRILRDEWGYEHMVVADCGAISDFYTTHGISSTPVHAAATGLLAGTDLEC 301

Query: 181 --GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
               +       A+   L+ E+D++ +L   +  +  LG  D + S  P+  + P  +  
Sbjct: 302 IWDNYHYKMLPEALEKDLITEKDIDRSLMRVLKGRFDLGEMD-DNSLVPWAQIPPSVLNC 360

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
             H+QLA + A Q IVLL+N  + LPL     + +AV+GPN+D  V + GNY G      
Sbjct: 361 EKHRQLAYKMAQQSIVLLQNKNKVLPLDKSSINKIAVVGPNADDEVVLWGNYNGTPIRTI 420

Query: 299 TPLQGIS 305
           T L GI 
Sbjct: 421 TVLDGIK 427



 Score = 43.5 bits (101), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 38/75 (50%), Gaps = 3/75 (4%)

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  + LP  Q+  +  + +A  G  V+ + C G   ++         AI+   Y G++G
Sbjct: 632 DRTNIELPSVQRNCLKALKEA--GKTVIFVNCSGSA-IALEPETESCDAIIQAWYGGESG 688

Query: 416 GAAIADVLFGRANPG 430
           G AIADVLFG  NP 
Sbjct: 689 GQAIADVLFGDYNPS 703


>gi|294777452|ref|ZP_06742903.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
 gi|294448520|gb|EFG17069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
          Length = 864

 Score =  207 bits (526), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 118/318 (37%), Positives = 172/318 (54%), Gaps = 15/318 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL--KVAACCKH 65
           GLT W+P VNI+RDPRWGRG ET GEDP LT     + V+GLQ    ++   K+ AC KH
Sbjct: 129 GLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCMDANQKYDKIHACAKH 188

Query: 66  YTAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           +  +    WN   R+ FNA  +  +DL +TY VPF+A V E KV  VMC+YN++ G P C
Sbjct: 189 FAVHSGPEWN---RHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAYNRLEGDPCC 245

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGP 182
               +L   +   W  DG ++SDC ++   Y  + +   P  E A+A A+ +G DL+CG 
Sbjct: 246 GSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGS 305

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
                 E A + GL+ E+D+++++   +  +  LG  D +P    +  +    VC+  H 
Sbjct: 306 SYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPDKVEWTKIPYSVVCSAEHD 363

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
            L+L  A + + LL N    LPL      T+AV+GPN++ +V   GNY G      T L+
Sbjct: 364 SLSLDIARKSMTLLLNKNNILPLKR-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLE 422

Query: 303 GIS----RYAKTIHQAGC 316
           GI        K I++ GC
Sbjct: 423 GIRSAMGENDKLIYEQGC 440



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 50/105 (47%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  +   G+  S+E E +          DR  + LP  Q+EL+  +  A +  V+ V
Sbjct: 600 KDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G P+  +         AIL   YPGQ+GG A A+VLFG  NP
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNP 701


>gi|427384377|ref|ZP_18880882.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727638|gb|EKU90497.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
           12058]
          Length = 1050

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 116/305 (38%), Positives = 178/305 (58%), Gaps = 21/305 (6%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+G    ++V+GLQGN    LK+ +  KH+ A
Sbjct: 137 LTFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGNDSRYLKIVSTPKHFAA 196

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R+  N ++S++ L + Y   F+ACV +GK AS+M +YN +N  P   +  
Sbjct: 197 NNEEH----NRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAW 252

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG------P 182
           +L   +   W   GY+VSDC    +L N   Y +T E AA  +IKAGLDL+CG      P
Sbjct: 253 LLTKVLRNDWGFKGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDEP 312

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFD-GEPSAQPFGNLGPRDVCTPAH 241
            L+     A R  ++ + D++ A    +  +M+LG+FD GE +  P+  + P  + +  H
Sbjct: 313 LLS-----AYRQYMVTDADIDSAAYRVLRARMQLGLFDSGEKN--PYTKISPAVIGSKEH 365

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV-ACGYTTP 300
           Q++AL AA + IVLLKN  + LPL+  +  ++AV+G N+    +  G+Y+G+      + 
Sbjct: 366 QEVALNAARECIVLLKNQKKMLPLNAKKIKSIAVVGINA--GSSEFGDYSGLPVIAPVSV 423

Query: 301 LQGIS 305
           LQGI 
Sbjct: 424 LQGIK 428



 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 41/105 (39%), Positives = 64/105 (60%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           L G A  A R+ +  V V+G+++SIE E  DR  + LP  Q+E +  + K +   +V+VL
Sbjct: 592 LYGEAGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IVVVL 650

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + V++   D  + AI+   YPG++GG A+A+VLFG  NPG
Sbjct: 651 VAGSSLAVNWM--DEHVPAIVNAWYPGESGGKAVAEVLFGDYNPG 693


>gi|333379224|ref|ZP_08470948.1| hypothetical protein HMPREF9456_02543 [Dysgonomonas mossii DSM
           22836]
 gi|332885492|gb|EGK05741.1| hypothetical protein HMPREF9456_02543 [Dysgonomonas mossii DSM
           22836]
          Length = 745

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 143/430 (33%), Positives = 223/430 (51%), Gaps = 28/430 (6%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLK-VAACCKHYTAYD 70
           ++P V+I RDPRWGR  E  GED  L    AA+ V+G QG+  S +  V AC KHY AY 
Sbjct: 157 YAPMVDISRDPRWGRVAEGAGEDVYLGSLIAAARVKGFQGDNLSAVNTVVACVKHYAAYG 216

Query: 71  LD----NWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                 ++N VD       +S  +L +TY  PFKA +  G   ++M S+N +NG P   +
Sbjct: 217 ATMAGRDYNTVD-------MSLNELWNTYLPPFKAALDAG-CGTIMTSFNDLNGIPATGN 268

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC-GPFLA 185
             +LK+ +  +W  +G++V+D  S+  +     Y    + +A  A+ AG+D+D  G    
Sbjct: 269 KYLLKDILRDKWNFNGFVVTDYTSINEMI-PHGYANDEKHSAEIAMNAGVDMDMQGGVYM 327

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
            H +  +  G + E+DV  A    + ++ +LG+F+ +P      N    D+ TPA+++ A
Sbjct: 328 NHLKTLIEEGKVSEKDVTEAARAILKIKYKLGLFE-DPYRYCDANREKTDILTPANKEAA 386

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
              A + +VLLKN  +TLPL    +  VA+IGP       ++G ++ +    T P+    
Sbjct: 387 RDMARKSMVLLKNDKQTLPLK--ENKRVALIGPLVKDKYEILGCWSAMGNRDTIPVSVYD 444

Query: 306 RYAKTI------HQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
              + I      +  GC  +     +    A   A  +D  V+VMG   ++  E   R  
Sbjct: 445 GLVEAIGKDKISYAKGC-DIQSEDTKGFAEAVRVASASDVVVMVMGEFHNMSGENNSRTN 503

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           L LPG Q +L+  + K  + PVVLVLM G P+ +++ K++  + AIL   +PG  GGAAI
Sbjct: 504 LSLPGVQVDLLKAIKKTGK-PVVLVLMNGRPLTINWEKDN--LDAILEAWFPGTMGGAAI 560

Query: 420 ADVLFGRANP 429
           ADVL G+ NP
Sbjct: 561 ADVLTGKYNP 570


>gi|218132025|ref|ZP_03460829.1| hypothetical protein BACEGG_03650 [Bacteroides eggerthii DSM 20697]
 gi|217985785|gb|EEC52125.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           eggerthii DSM 20697]
          Length = 888

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 119/309 (38%), Positives = 176/309 (56%), Gaps = 14/309 (4%)

Query: 2   YNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAA 61
           Y   +  LT+W+PN+NIFRDPRWGRGQET GEDP L  +   + V G+QG+     K  A
Sbjct: 132 YVSAIPNLTFWAPNINIFRDPRWGRGQETYGEDPFLMSRMGLNVVLGMQGDDEHYYKTHA 191

Query: 62  CCKHYTAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVN 119
           C KHY  +     +G +  R+ FNA VS +DL +TY   F+  VV+G V  VMC+Y+   
Sbjct: 192 CAKHYGVH-----SGPEPLRHEFNAVVSMRDLWETYLPAFETLVVKGNVREVMCAYSAYE 246

Query: 120 GKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLD 177
           G+P CA   +L + +  +W  DG +VSDCD++   Y    +   P    A+ADA+  G D
Sbjct: 247 GEPCCASNRLLVDILRNRWGFDGMVVSDCDAINDFYVKGRHETHPDAAAASADAVLTGTD 306

Query: 178 LDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQ-PFGNLGPRDV 236
           L+CG       E AV  G+++E+D++++L   +T + RLG+ D  P+   P+  +    +
Sbjct: 307 LECGRSYNALIE-AVEKGIIKEQDLDVSLRRILTERFRLGLLD--PAKYVPYSTIPGSVI 363

Query: 237 CTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
               H+  AL+ AH+  VLLKN    LPL      ++A++GPN + ++ M GNY+G    
Sbjct: 364 DCQEHRDHALKMAHESQVLLKNEGNILPLDK-NIKSIAIVGPNINDSIMMRGNYSGSPTH 422

Query: 297 YTTPLQGIS 305
             T LQG+ 
Sbjct: 423 CITILQGLK 431



 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 49/95 (51%), Gaps = 3/95 (3%)

Query: 335 RQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVS 394
           +  +A + V G+  + E E  +RA + LP  Q+  +   A    G  V+ + C G   ++
Sbjct: 629 KDVEAIIYVGGITPTQEGEGHERAKIELPDVQKRFLK--AMHETGKPVIYVNCSGSA-IA 685

Query: 395 FAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            A  D    A+L   YPGQ GG A+ADVLFG  NP
Sbjct: 686 LADIDYAYDALLQAWYPGQEGGTAVADVLFGDYNP 720


>gi|255690202|ref|ZP_05413877.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
 gi|260624221|gb|EEX47092.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 853

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 114/301 (37%), Positives = 173/301 (57%), Gaps = 13/301 (4%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+G    ++V+GLQG+    LK+ +  KH+ A
Sbjct: 137 LTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPHYLKIVSTPKHFAA 196

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R+  N ++S++ L + Y   F+ CV EGK AS+M +YN +N  P   +  
Sbjct: 197 NNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNNVPCTLNSW 252

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHT 188
           +L+  +   W   GY+VSDC    +L N   Y +T E AA  +IKAGLDL+CG    ++ 
Sbjct: 253 LLQKVLRRDWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIKAGLDLECGD--DVYD 310

Query: 189 E---GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           E    A +  +  E D++ A  + +T +M+LG+FDG     P+  + P  + +  HQ +A
Sbjct: 311 EYLLNAYKQYMASEADIDSAAYHVLTARMKLGLFDG-VERNPYAKISPSVIGSKEHQTVA 369

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV-ACGYTTPLQGI 304
           L AA + IVLLKN    LPL+  +  ++AV+G N+       G+Y+G       + LQGI
Sbjct: 370 LNAARECIVLLKNQKNMLPLNVKKLKSIAVVGINAG--KCEFGDYSGAPVVEPVSILQGI 427

Query: 305 S 305
            
Sbjct: 428 K 428



 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 40/105 (38%), Positives = 62/105 (59%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           L G A  A R+ +  V VMG+++SIE E  DR  + LP  Q+E +  + K +   ++LVL
Sbjct: 592 LYGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IILVL 650

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + V++   +  + AI+   YPG+ GG A+A+VLFG  NP 
Sbjct: 651 VAGSSLAVNW--ENEHLPAIVNAWYPGEQGGTAVAEVLFGDYNPA 693


>gi|423313129|ref|ZP_17291065.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
           CL09T03C04]
 gi|392686343|gb|EIY79649.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
           CL09T03C04]
          Length = 864

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 118/318 (37%), Positives = 172/318 (54%), Gaps = 15/318 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL--KVAACCKH 65
           GLT W+P VNI+RDPRWGRG ET GEDP LT     + V+GLQ    ++   K+ AC KH
Sbjct: 129 GLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCMDANQKYDKIHACAKH 188

Query: 66  YTAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           +  +    WN   R+ FNA  +  +DL +TY VPF+A V E KV  VMC+YN++ G P C
Sbjct: 189 FAVHSGPEWN---RHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAYNRLEGDPCC 245

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGP 182
               +L   +   W  DG ++SDC ++   Y  + +   P  E A+A A+ +G DL+CG 
Sbjct: 246 GSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGS 305

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
                 E A + GL+ E+D+++++   +  +  LG  D +P    +  +    VC+  H 
Sbjct: 306 SYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPDKVEWTKIPYSVVCSAEHD 363

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
            L+L  A + + LL N    LPL      T+AV+GPN++ +V   GNY G      T L+
Sbjct: 364 SLSLDIARKSMTLLLNKNNILPLKR-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLE 422

Query: 303 GIS----RYAKTIHQAGC 316
           GI        K I++ GC
Sbjct: 423 GIRSAMGENDKLIYEQGC 440



 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 51/105 (48%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  +   G+  S+E E +          DR  + LP  Q+EL+  +  A +  V+ V
Sbjct: 600 KDADVVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G P+  +         AIL   YPGQ+GG A+A+VLFG  NP
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAVAEVLFGDYNP 701


>gi|150003731|ref|YP_001298475.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
 gi|319640047|ref|ZP_07994774.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
 gi|345517061|ref|ZP_08796539.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|149932155|gb|ABR38853.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Bacteroides vulgatus ATCC 8482]
 gi|254833833|gb|EET14142.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|317388325|gb|EFV69177.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
          Length = 864

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 118/318 (37%), Positives = 172/318 (54%), Gaps = 15/318 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL--KVAACCKH 65
           GLT W+P VNI+RDPRWGRG ET GEDP LT     + V+GLQ    ++   K+ AC KH
Sbjct: 129 GLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCMDANQKYDKIHACAKH 188

Query: 66  YTAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           +  +    WN   R+ FNA  +  +DL +TY VPF+A V E KV  VMC+YN++ G P C
Sbjct: 189 FAVHSGPEWN---RHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAYNRLEGDPCC 245

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTP--EEAAADAIKAGLDLDCGP 182
               +L   +   W  DG ++SDC ++   Y  + +   P  E A+A A+ +G DL+CG 
Sbjct: 246 GSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGS 305

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
                 E A + GL+ E+D+++++   +  +  LG  D +P    +  +    VC+  H 
Sbjct: 306 SYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPDKVEWTKIPYSVVCSAEHD 363

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
            L+L  A + + LL N    LPL      T+AV+GPN++ +V   GNY G      T L+
Sbjct: 364 SLSLDIARKSMTLLLNKNNILPLKR-GGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLE 422

Query: 303 GIS----RYAKTIHQAGC 316
           GI        K I++ GC
Sbjct: 423 GIRSAMGENDKLIYEQGC 440



 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 50/105 (47%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  +   G+  S+E E +          DR  + LP  Q+EL+  +  A +  V+ V
Sbjct: 600 KDADVVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDAGK-KVIFV 658

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G P+  +         AIL   YPGQ+GG A A+VLFG  NP
Sbjct: 659 NFSGSPI--AMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNP 701


>gi|317480750|ref|ZP_07939836.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides sp. 4_1_36]
 gi|316903091|gb|EFV24959.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides sp. 4_1_36]
          Length = 942

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 145/448 (32%), Positives = 222/448 (49%), Gaps = 32/448 (7%)

Query: 6   MAGLT-YWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           M G T  ++P +++ RD RWGR +E  GE P L  +     VRGLQ N     +VAA  K
Sbjct: 204 MLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQHNH----QVAATGK 259

Query: 65  HYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           H+ AY  +          + ++S +++E+ +  PFK  + E  +  VM SYN  +G P  
Sbjct: 260 HFAAYSNNKGAREGMARVDPQMSPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQ 319

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG--- 181
                L   + G+    GY+VSD D+V  LY      +  +EA   +++AGL++ C    
Sbjct: 320 GSYYWLTTRLRGEMGFRGYVVSDSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRS 379

Query: 182 -PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
                +     V+ G L EE +N  +   + V+  +G+FD  P      +   R+V    
Sbjct: 380 PDSFVLPLRELVKEGGLSEEVINDRVRDILRVKFLIGLFDA-PYQTDLAD-ADREVEKEE 437

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           ++ +ALQA+ + IVLLKN+   LPL       +AV GPN++     + +Y  +A   TT 
Sbjct: 438 NEAIALQASRESIVLLKNAGELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTV 497

Query: 301 LQGISRYAK----TIHQAGC--------------FGVACNGNQLIGAAEVAARQADATVL 342
           L+GI    K     ++  GC              + +  +    I  A   ARQAD  ++
Sbjct: 498 LEGIQEKTKGKAEVLYTKGCDLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIV 557

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           V+G  Q    E   R  L LPGRQ +L+  + +A+  PVVL+L+ G P+ +++A  D  +
Sbjct: 558 VLGGGQRTCGENKSRTSLDLPGRQLQLLQAI-QATGKPVVLILINGRPLSINWA--DKFV 614

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            AIL   YPG  GG A+AD+LFG  NPG
Sbjct: 615 PAILEAWYPGSKGGTALADILFGDYNPG 642


>gi|160892207|ref|ZP_02073210.1| hypothetical protein BACUNI_04671 [Bacteroides uniformis ATCC 8492]
 gi|156858685|gb|EDO52116.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           uniformis ATCC 8492]
          Length = 990

 Score =  206 bits (525), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 145/448 (32%), Positives = 222/448 (49%), Gaps = 32/448 (7%)

Query: 6   MAGLT-YWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           M G T  ++P +++ RD RWGR +E  GE P L  +     VRGLQ N     +VAA  K
Sbjct: 252 MLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQHNH----QVAATGK 307

Query: 65  HYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           H+ AY  +          + ++S +++E+ +  PFK  + E  +  VM SYN  +G P  
Sbjct: 308 HFAAYSNNKGAREGMARVDPQMSPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQ 367

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG--- 181
                L   + G+    GY+VSD D+V  LY      +  +EA   +++AGL++ C    
Sbjct: 368 GSYYWLTTRLRGEMGFRGYVVSDSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRS 427

Query: 182 -PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
                +     V+ G L EE +N  +   + V+  +G+FD  P      +   R+V    
Sbjct: 428 PDSFVLPLRELVKEGGLSEEVINDRVRDILRVKFLIGLFDA-PYQTDLAD-ADREVEKEE 485

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           ++ +ALQA+ + IVLLKN+   LPL       +AV GPN++     + +Y  +A   TT 
Sbjct: 486 NEAIALQASRESIVLLKNAGELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTV 545

Query: 301 LQGISRYAK----TIHQAGC--------------FGVACNGNQLIGAAEVAARQADATVL 342
           L+GI    K     ++  GC              + +  +    I  A   ARQAD  ++
Sbjct: 546 LEGIQEKTKGKAEVLYTKGCDLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIV 605

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           V+G  Q    E   R  L LPGRQ +L+  + +A+  PVVL+L+ G P+ +++A  D  +
Sbjct: 606 VLGGGQRTCGENKSRTSLDLPGRQLQLLQAI-QATGKPVVLILINGRPLSINWA--DKFV 662

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            AIL   YPG  GG A+AD+LFG  NPG
Sbjct: 663 PAILEAWYPGSKGGTALADILFGDYNPG 690


>gi|317477144|ref|ZP_07936385.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316906687|gb|EFV28400.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 814

 Score =  206 bits (525), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 153/453 (33%), Positives = 226/453 (49%), Gaps = 52/453 (11%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           + P +++ RDPRW R +ET GED  L G   A+ V+G QG    ++ KV A  KH+ AY 
Sbjct: 207 YGPVLDLARDPRWSRVEETYGEDAYLNGVMGAALVKGFQGEFPRTKGKVIATLKHFAAY- 265

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
              W        +A V  +++E+    PF+  V  G + SVM SYN+++G P  A+ ++L
Sbjct: 266 --GWTEGGHNGGSAHVGNREMEEAIYPPFREAVAAGAL-SVMSSYNEIDGIPCTANSNLL 322

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP-FLAIHTE 189
              +  +W+  G++VSD  ++G L        T  EAA  A+ AG+D D G    A    
Sbjct: 323 TGLLKKRWQFKGFVVSDLYAIGGL-REHGVADTDYEAAVKAVNAGVDSDLGTNVYAGQLV 381

Query: 190 GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD----VCTPAHQQLA 245
            AV+ G ++E  +N A++  + ++  +G+FD      PF  +  R+    V +  H +LA
Sbjct: 382 NAVKRGDVQEVVINKAVSRILALKFHMGLFD-----HPF--VDEREPEQVVASTEHLELA 434

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAG--VACGYTTPLQG 303
            + A Q I+LLKN    LPL+  +  T+AVIGPN+D    M+G+Y          T L G
Sbjct: 435 REVARQSIILLKNKNELLPLNK-KMKTIAVIGPNADNIYNMLGDYTAPQSESSVVTVLDG 493

Query: 304 I----SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG-------------- 345
           I    S     I+  GC  V  +       A  AARQ+D  V+VMG              
Sbjct: 494 IRQKVSNDTHIIYAKGC-AVRDSSKSGFQEAIEAARQSDVVVMVMGGSSARDFSSKYEET 552

Query: 346 ---------LDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFA 396
                    +      E  DR+ L L GRQ+EL+  V K ++ P+VLVL+ G P+ +   
Sbjct: 553 GAAKVSDSHISDMESGEGYDRSTLELLGRQRELIREVGKLNK-PIVLVLIKGRPLLLEGI 611

Query: 397 KNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
             +  + AI+   YPG  GG A+ADVLFG  NP
Sbjct: 612 --EAEVDAIVDAWYPGMQGGNAVADVLFGDYNP 642


>gi|423302093|ref|ZP_17280116.1| hypothetical protein HMPREF1057_03257 [Bacteroides finegoldii
           CL09T03C10]
 gi|408471184|gb|EKJ89716.1| hypothetical protein HMPREF1057_03257 [Bacteroides finegoldii
           CL09T03C10]
          Length = 1039

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 145/448 (32%), Positives = 223/448 (49%), Gaps = 32/448 (7%)

Query: 6   MAGLT-YWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           M G T  ++P +++ RD RWGR +E  GE P L  +     V+G+Q N     +VAA  K
Sbjct: 296 MLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVKGMQHNH----QVAATGK 351

Query: 65  HYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           H+ AY  +          + ++S +++E  +  PFK  + E  +  VM SYN  +G P  
Sbjct: 352 HFIAYSNNKGAREGMARVDPQMSPREVEMIHVYPFKRVIREAGLLGVMSSYNDYDGFPIQ 411

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG--- 181
           +    L   + G     GY+VSD D+V  LY      +  +EA   +++AGL++ C    
Sbjct: 412 SSYYWLTTRLRGDMGFRGYVVSDSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNIRCTFRS 471

Query: 182 -PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
                +     V+ G L EE +N  +   + V+  +G+FD        G     +V   +
Sbjct: 472 PDSYVLPLRELVKEGELSEEIINDRVRDILRVKFLVGLFDHPYQTDLKG--ADEEVEKAS 529

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           ++++ALQA+ + IVLLKN    LPL+      +AV GPN+D     + +Y  +A   T+ 
Sbjct: 530 NEEIALQASRESIVLLKNDKNVLPLNASTIKKIAVCGPNADEHSYALTHYGPLAVEVTSV 589

Query: 301 LQGISR----YAKTIHQAGC--------------FGVACNGNQLIGAAEVAARQADATVL 342
           L+GI       A+ ++  GC              + ++ N  + I  A    +QAD  V+
Sbjct: 590 LKGIQEKLGGKAEVLYTKGCELVDANWPESELMEYPLSENEQEEIEKAVSQTKQADVAVV 649

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           V+G  Q    E   R+ L LPGRQ +L+  V  A+  PVVLVL+ G P+ +++A  D  +
Sbjct: 650 VLGGGQRTCGENKSRSSLALPGRQLDLLKAVV-ATGKPVVLVLINGRPLSINWA--DKFV 706

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            AIL   YPG  GG A+ADVLFG  NPG
Sbjct: 707 PAILEAWYPGSKGGKAVADVLFGDYNPG 734


>gi|189464211|ref|ZP_03012996.1| hypothetical protein BACINT_00548 [Bacteroides intestinalis DSM
           17393]
 gi|189438001|gb|EDV06986.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 814

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 153/453 (33%), Positives = 226/453 (49%), Gaps = 52/453 (11%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           + P +++ RDPRW R +ET GED  L G   A+ V+G QG    ++ KV A  KH+ AY 
Sbjct: 207 YGPVLDLARDPRWSRVEETYGEDAYLNGVMGAALVKGFQGEFPRTKGKVIATLKHFAAY- 265

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
              W        +A V  +++E+    PF+  V  G + SVM SYN+++G P  A+ ++L
Sbjct: 266 --GWTEGGHNGGSAHVGNREMEEAIYPPFREAVAAGAL-SVMSSYNEIDGIPCTANSNLL 322

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP-FLAIHTE 189
              +  +W+  G++VSD  ++G L        T  EAA  A+ AG+D D G    A    
Sbjct: 323 TGLLKERWQFKGFVVSDLYAIGGL-REHGVADTDYEAAVKAVNAGVDSDLGTNVYAGQLV 381

Query: 190 GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD----VCTPAHQQLA 245
            AV+ G ++E  +N A++  + ++  +G+FD      PF  +  R+    V +  H +LA
Sbjct: 382 NAVKRGDVQEVVINKAVSRILALKFHMGLFD-----HPF--VDEREPEQVVASTEHLELA 434

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAG--VACGYTTPLQG 303
            + A Q I+LLKN    LPL+  +  T+AVIGPN+D    M+G+Y          T L G
Sbjct: 435 REVARQSIILLKNKNELLPLNK-KTKTIAVIGPNADNIYNMLGDYTAPQSESSVVTVLDG 493

Query: 304 I----SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG-------------- 345
           I    S     I+  GC  V  +       A  AARQ+D  V+VMG              
Sbjct: 494 IRQKVSNDTHIIYAKGC-AVRDSSKSGFQEAIEAARQSDVVVMVMGGSSARDFSSKYEET 552

Query: 346 ---------LDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFA 396
                    +      E  DR+ L L GRQ+EL+  V K ++ P+VLVL+ G P+ +   
Sbjct: 553 GAAKVSDSHISDMESGEGYDRSTLELLGRQRELIREVGKLNK-PIVLVLIKGRPLLLEGI 611

Query: 397 KNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
             +  + AI+   YPG  GG A+ADVLFG  NP
Sbjct: 612 --EAEVDAIVDAWYPGMQGGNAVADVLFGDYNP 642


>gi|325918730|ref|ZP_08180824.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325535054|gb|EGD06956.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 391

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 112/263 (42%), Positives = 159/263 (60%), Gaps = 11/263 (4%)

Query: 7   AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHY 66
           AGLT WSPN+NIFRDPRWGRG ET GEDP LTG+ A  ++RGLQG+  +  +  A  KH 
Sbjct: 130 AGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLNHPRTIATPKHI 189

Query: 67  TAYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
             +     +G +  R+ F+  VS +D+E TY   F+A +V+G+  SVMC+YN ++G P C
Sbjct: 190 AVH-----SGPEPGRHGFDVDVSPRDMEATYTPAFRAALVDGQAWSVMCAYNSLHGTPAC 244

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PF 183
           A   +L   + G W   G++VSDCD+V  +    ++      ++A A+KAG DL+CG  +
Sbjct: 245 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY 304

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
             + T  A+  G + E  ++ +L      + RLG  +  P   P+  LG +DV   AH+ 
Sbjct: 305 RELGT--AIERGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRA 361

Query: 244 LALQAAHQGIVLLKNSARTLPLS 266
           LALQAA + IVLLKN+A TLPL 
Sbjct: 362 LALQAAAESIVLLKNTATTLPLK 384


>gi|319787180|ref|YP_004146655.1| glycoside hydrolase [Pseudoxanthomonas suwonensis 11-1]
 gi|317465692|gb|ADV27424.1| glycoside hydrolase family 3 domain protein [Pseudoxanthomonas
           suwonensis 11-1]
          Length = 903

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 120/298 (40%), Positives = 161/298 (54%), Gaps = 18/298 (6%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL  WSPN+NIFRDPRWGRGQET GEDP L  +   +YV+G+QG       + A  KHY 
Sbjct: 146 GLNTWSPNINIFRDPRWGRGQETYGEDPFLAARMGVAYVKGVQGEDPRWYDIIATPKHYA 205

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +         R+  N  VS+ DLEDTY   F+A +VEG   SVMC+YN+V+G+P CA+ 
Sbjct: 206 VHSGPEST---RHSANVYVSRHDLEDTYLPAFRAAIVEGGAGSVMCAYNRVDGQPACAND 262

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +LK+ + G W   GY+VSDCD+V  +    HY      A A A+++G+D +C     + 
Sbjct: 263 LLLKDYLRGAWDFKGYVVSDCDAVTDINRHHHYAPDAASAVAAAMRSGVDNECNGATLVD 322

Query: 188 TEG-------AVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFG--NLGPRDVCT 238
             G       A+  GL+   DV+ AL      + R G     P  +P       P DV  
Sbjct: 323 MAGLARPYQEALERGLISMADVDRALVRLFAARYRNGDL---PGLRPLSTETASPADVGK 379

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
           P H  LAL+ A + +VLLKNS   LPL       +AV+GP +D T  + GNY+    G
Sbjct: 380 PEHAALALEVAEKSLVLLKNSG-VLPLRP--QAKLAVVGPLADATRVLRGNYSSPQSG 434



 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 45/113 (39%), Positives = 63/113 (55%), Gaps = 13/113 (11%)

Query: 327 IGAAEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKA 376
           IG    AA Q+D  V V+GL   +EAE            D+  L L   QQ L+   A+A
Sbjct: 622 IGQMRAAAAQSDVVVAVVGLTSDLEAEEAPVEVPGFKGGDKTSLDLLADQQALLE-AARA 680

Query: 377 SRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +  P+V+V+M G PV++++A+      AI+   YPGQ+GG AI +VL G ANP
Sbjct: 681 TGKPLVVVVMNGSPVNLAWARQ--HAAAIIEAWYPGQSGGLAIGNVLAGHANP 731


>gi|402826673|ref|ZP_10875843.1| glycoside hydrolase family protein [Sphingomonas sp. LH128]
 gi|402259784|gb|EJU09977.1| glycoside hydrolase family protein [Sphingomonas sp. LH128]
          Length = 737

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 154/451 (34%), Positives = 229/451 (50%), Gaps = 48/451 (10%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           ++P V+I  DPRWGR  E  GEDPVL    AA+ VRG QG   G+   V +  KH+  Y 
Sbjct: 134 FAPMVDITMDPRWGRMVEGAGEDPVLGAAMAAAQVRGFQGGAIGTPGHVLSGPKHFVGYG 193

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
             ++ G D    +  +S+  L +TY  PFKA +  G   ++M +Y  +NG P  A+  +L
Sbjct: 194 A-SFGGRDYDEVD--LSESQLRNTYLPPFKAALDAG-AGNIMSAYMGLNGVPAAANTWLL 249

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL------DCGPFL 184
            + +  +W   G++VSD + V  L   Q    +P++AA  A++AG+DL      +  P L
Sbjct: 250 TDVLRKEWGFKGFVVSDANGVDSL-EKQGMAGSPKQAAVRALEAGMDLAMTVPSNPSPML 308

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
           A+    AV+GG + E  +   +   +  + RLG+FD +P   P     PR    PAH+++
Sbjct: 309 ALVD--AVKGGEVAESALEGPVLRLLEAKYRLGLFD-KPYVDP--KAVPRTFDDPAHREV 363

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY-------------- 290
           A  AA +  VLL N+   LPL   +  +VAVIGP  D    M+G +              
Sbjct: 364 ARIAAERSAVLLANAGGLLPLDRAKLKSVAVIGPLGDAGHDMLGPWVFSSNKPEGVSVLA 423

Query: 291 -------AGVACGYTT----PLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADA 339
                  AGV   Y T    P +    +   +++ G    A +    +  A   A Q+D 
Sbjct: 424 GLRAKLGAGVKVEYATGTAWPTRKNPSFFDAMNKPGEH-PAIDEKAELAKAITLANQSDV 482

Query: 340 TVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKND 399
            V+V+G  Q++  EF  R+ L LPGRQQEL+  V  A+  PVV+VL+ G P+ +     D
Sbjct: 483 AVMVLGEAQNMAGEFASRSDLKLPGRQQELLDAVI-ATGKPVVVVLVNGRPLSL----GD 537

Query: 400 PRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            + GA+L   YPG  GG A+A++L G  NPG
Sbjct: 538 AKPGAVLEAWYPGSEGGNAVANLLLGDVNPG 568


>gi|427384502|ref|ZP_18881007.1| hypothetical protein HMPREF9447_02040 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727763|gb|EKU90622.1| hypothetical protein HMPREF9447_02040 [Bacteroides oleiciplenus YIT
           12058]
          Length = 862

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 124/317 (39%), Positives = 176/317 (55%), Gaps = 14/317 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHY 66
           GLT W+P VNI+RDPRWGRG ET GEDP LT +     V+GLQG   G   K+ AC KH+
Sbjct: 127 GLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPADGKYDKLHACAKHF 186

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  ++ +DL +TY  PF+A V EG+V  VMC+YN+  G+P C 
Sbjct: 187 AVHSGPEWN---RHSFNAENINPRDLYETYLPPFEALVKEGEVKEVMCAYNRFEGEPCCG 243

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE--EAAADAIKAGLDLDCGPF 183
              +L   + G+W  DG +V+DC ++   YN + +   P    A+A A+ +G DLDCG  
Sbjct: 244 SDRLLMQILRGEWGFDGIVVADCGAIADFYNDRGHHTHPNAESASAAAVVSGTDLDCGSS 303

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                E +V+ GL+ EE V++++   +  +  LG  D EP    +  +    V + AH  
Sbjct: 304 YKALVE-SVKKGLIAEETVDVSVKRLLKARFELGEMD-EPEKVSWTKIPFSVVASAAHDS 361

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ A + + LL N    LPL      TVAV+GPN++ +V   GNY G+     T L G
Sbjct: 362 LALEIARKSMTLLMNKDNFLPLKR-GGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILDG 420

Query: 304 ISRYA----KTIHQAGC 316
           +        K I++ GC
Sbjct: 421 VRNILGAEDKLIYEQGC 437



 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 52/105 (49%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  +   G+  S+E E +          DR  + LP  Q+EL+  + +A +  ++LV
Sbjct: 597 KDADVVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQRELIDALHRAGK-KIILV 655

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G P+         +  A+L   YPGQ GG A+A+VLFG  NP
Sbjct: 656 NCSGSPI--GLEPETKKCEALLQAWYPGQQGGTAVAEVLFGTYNP 698


>gi|224537102|ref|ZP_03677641.1| hypothetical protein BACCELL_01979 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521279|gb|EEF90384.1| hypothetical protein BACCELL_01979 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 769

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 143/436 (32%), Positives = 230/436 (52%), Gaps = 32/436 (7%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLK-VAACCKHYT--A 68
           ++PNV + RDPRWGR  ET GEDP L G   A+ VRGLQGN  S  + V AC KH+   +
Sbjct: 184 FTPNVEVARDPRWGRTGETFGEDPHLVGVMGAATVRGLQGNDFSNPENVIACPKHFIGGS 243

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
             ++  NG         VS++ + + +  PFKAC ++    + M ++N+VNG P+ ++  
Sbjct: 244 QSINGINGAP-----CDVSERTIREIFLPPFKAC-LDANAYTFMMAHNEVNGIPSHSNKY 297

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC-GPFLAIH 187
           ++ + +  +W+ DGYIVSD   +  L++    T +  +A   ++++G+D+   GP     
Sbjct: 298 LMTDLLRDEWKFDGYIVSDWMDIERLHDYHRITESYTDAFVLSVQSGMDMHMHGPDFMEA 357

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC-TPAHQQLAL 246
              AV+ G L E+ ++ ++   +T + +LG+F+       F     +D+    AHQQ AL
Sbjct: 358 LLEAVKDGRLTEKRIDQSVRRILTAKFKLGLFENP----YFDEAKSKDLLFNKAHQQTAL 413

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA--GVACGYTTPLQGI 304
           + AH+ IVLLKN    LPL   ++  + V GPN+D  V ++G++A         T L+G+
Sbjct: 414 EIAHKSIVLLKNDG-ILPLDVSKYKKIFVTGPNADTHV-ILGDWAVPQPEGNVVTVLKGL 471

Query: 305 SRYA-KTIHQAGCFG--VACNGNQLIGAAEVAARQADATVLVMGLDQSIE-------AEF 354
              A  T      FG  +       +  A   ARQAD  ++V+G +   E        E 
Sbjct: 472 KDAAPNTTFSFLDFGWNIRTMDPAKVKQAAGMARQADLAIVVVGENSMREHWSEKTCGEN 531

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
            DR+ + LPG QQELV  +      P +++L+ G P+ V +  +   + A++    PG  
Sbjct: 532 TDRSDINLPGLQQELVETIQNTGV-PTIVILVNGRPLGVEWIAD--HVAALIEAWEPGSF 588

Query: 415 GGAAIADVLFGRANPG 430
           GG AIAD+L+G+ NP 
Sbjct: 589 GGQAIADILYGKVNPS 604


>gi|380694149|ref|ZP_09859008.1| glycoside hydrolase 3 [Bacteroides faecis MAJ27]
          Length = 946

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 148/448 (33%), Positives = 223/448 (49%), Gaps = 32/448 (7%)

Query: 6   MAGLT-YWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           M G T  ++P +++ RD RWGR +E  GE P L  +     VRG+Q N     ++AA  K
Sbjct: 204 MLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH----QIAATGK 259

Query: 65  HYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           H+ AY  +          + ++S +++E T+  PFK  + E  +  VM SYN  +G P  
Sbjct: 260 HFIAYSNNKGAREGMARVDPQMSPREVEMTHVYPFKRVIREAGLLGVMSSYNDYDGFPIQ 319

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG--- 181
           +    L   + G+    GY+VSD D+V  LY      +  +EA   +++AGL++ C    
Sbjct: 320 SSYYWLTTRLRGEMGFRGYVVSDSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRS 379

Query: 182 -PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
                +     V+ G L EE +N  +   + V+  +G+FD        G     +V   A
Sbjct: 380 PDSYVLPLRELVKEGGLSEEVINDRVRDILRVKFLVGLFDHPYQIDLKG--ADEEVEKAA 437

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           ++++ALQA+ + IVLLKN    LPL       +AV GPN+D     + +Y  +A   T+ 
Sbjct: 438 NEEIALQASRESIVLLKNDKNILPLDASGIQKIAVCGPNADEHSYALTHYGPLAVEVTSV 497

Query: 301 LQGISR----YAKTIHQAGCFGVACN--------------GNQLIGAAEVAARQADATVL 342
           L+GI       A+ ++  GC  V  N                + I  A    +QAD  V+
Sbjct: 498 LKGIQEKMKGKAEVLYTKGCDLVDANWPESELIDYPLTDEEQKEIEKAVDQTKQADVAVV 557

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           V+G  Q    E   R+ L LPGRQ +L+  VA A+  PVVLVL+ G P+ +++A  D  +
Sbjct: 558 VLGGGQRTCGENKSRSSLDLPGRQLDLLKAVA-ATGKPVVLVLINGRPLSINWA--DKFV 614

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            AI+   YPG  GG A+ADVLFG  NPG
Sbjct: 615 PAIVEAWYPGSKGGKAVADVLFGEYNPG 642


>gi|399070782|ref|ZP_10749848.1| beta-glucosidase-like glycosyl hydrolase [Caulobacter sp. AP07]
 gi|398043993|gb|EJL36849.1| beta-glucosidase-like glycosyl hydrolase [Caulobacter sp. AP07]
          Length = 900

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 127/324 (39%), Positives = 179/324 (55%), Gaps = 21/324 (6%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL  WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+G+QG      +V A  KH+ 
Sbjct: 144 GLDTWSPNINIFRDPRWGRGQETYGEDPFLTARVGVAFVQGVQGPNPDLPQVIATPKHFA 203

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +         R+  N  VS+ DLEDTY   F+A + EG+  SVMC+YN+++G+P CA  
Sbjct: 204 VHSGPEST---RHSANVFVSRHDLEDTYLPAFRAAITEGRAGSVMCAYNRIDGQPACASD 260

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +LK+ +   WR DGY+VSDCD+V  + +   Y      A A A++AG+D +C       
Sbjct: 261 LLLKDHLRQAWRFDGYVVSDCDAVKDISDHHRYAPDAAAAVAAAMRAGVDNECNGATLTD 320

Query: 188 TEG-------AVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           T+G       A+R  L+   DV+ AL    + ++R G   G    +P   +    V TP 
Sbjct: 321 TDGLADRYREALRRDLISVADVDRALVRLFSARLRTGDLPG--VRKPETPVALASVMTPE 378

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY-AGVACGYTT 299
           H  LAL AA + +VLLKN    LPL       +AVIGP  D T  + GNY + ++    +
Sbjct: 379 HAVLALAAAEKSLVLLKNDG-VLPLKA--DTRIAVIGPLGDATRVLRGNYSSALSAPPIS 435

Query: 300 PLQGISR---YAKTIHQAGCFGVA 320
            L+G+ R    A+ +H A  FG +
Sbjct: 436 VLEGLRRAMPSARIVHAA--FGAS 457



 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 47/112 (41%), Positives = 66/112 (58%), Gaps = 13/112 (11%)

Query: 328 GAAEVAARQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKAS 377
           G  + AA QAD  V V+GL   +EAE            D+  L +P  QQ L+ + AKA 
Sbjct: 620 GDLKAAAAQADVLVAVVGLTSDLEAEESPVEIPGFKGGDKTTLDIPSDQQALLEQ-AKAL 678

Query: 378 RGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
             P+V+V M G P+++S+AK++    AIL   YPGQ+GG A+A+VL G+ NP
Sbjct: 679 GKPLVVVTMNGSPLNLSWAKDN--ASAILEAWYPGQSGGLAVANVLSGKTNP 728


>gi|333381842|ref|ZP_08473521.1| hypothetical protein HMPREF9455_01687 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332829771|gb|EGK02417.1| hypothetical protein HMPREF9455_01687 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 861

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 117/314 (37%), Positives = 170/314 (54%), Gaps = 11/314 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GL+ W+PNVNIFRDPRWGRGQET GEDP LT +   S V GLQG   ++  K+ AC KHY
Sbjct: 129 GLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGISVVNGLQGPKDAKYKKLLACAKHY 188

Query: 67  TAYDLDNWNGVDRYHFNA-RVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+  N   +  + L +TY   F+  V +  V+ VMC+Y++ +  P C 
Sbjct: 189 AVHSGPEWN---RHVLNLNNLDNRHLWETYMPAFQVLVQKADVSQVMCAYHRQDDDPCCG 245

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
           +  +LK  +  +W     +VSDC ++   Y +   +     +A   + AG D++CG    
Sbjct: 246 NNHLLKRILRDEWGFKRMVVSDCGAIADFYTSHKVSSDALHSAVKGVLAGTDVECGFGYT 305

Query: 186 IHT-EGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
            H    AV  GL+ E D++ ++   +T + RLG FD + S  P+ N+    +    HQ L
Sbjct: 306 YHELVDAVSRGLIYEADIDKSVLRLLTERFRLGDFD-DNSIVPWANIPDTIINCKKHQAL 364

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           AL+ A Q + LL+N    LPLS+ +   +AVIGPN+D    M GNY G+     T L+GI
Sbjct: 365 ALEMARQSMTLLQNKNNILPLSSKKK--IAVIGPNADDAKLMWGNYNGIPVKTVTILEGI 422

Query: 305 SRYA--KTIHQAGC 316
              A     ++ GC
Sbjct: 423 KSIAGKDIFYEKGC 436



 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 48/106 (45%), Gaps = 13/106 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           +  D  V   G+   +E E +          DR  + LP  Q+  +  + KA  G  V++
Sbjct: 597 KDIDVVVFAGGISGELEGEEMPIEMPGFKGGDRTDIELPASQRNCIKALKKA--GKRVIM 654

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + C G   +          AIL   Y GQ+GG AIA+VLFG+ NP 
Sbjct: 655 VNCSGSA-IGLMPESESCEAILQAWYGGQSGGQAIAEVLFGKYNPS 699


>gi|261408260|ref|YP_003244501.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
 gi|261284723|gb|ACX66694.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
           Y412MC10]
          Length = 763

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 148/454 (32%), Positives = 228/454 (50%), Gaps = 44/454 (9%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHY 66
           G   +SP +++ RDPRWGR +ET GEDP L  ++A + V+GLQG    S   + A  KH+
Sbjct: 158 GSATYSPVLDVVRDPRWGRTEETFGEDPHLVTEFAVAAVQGLQGERLDSHTSLLATLKHF 217

Query: 67  TAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
             Y         R      +  ++L +   +PF+  V  G + SVM +YN+++G P  + 
Sbjct: 218 AGYGASEGG---RNGAPVHMGLRELHEVDLLPFRKAVEAGAL-SVMTAYNEIDGVPCTSS 273

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD-CGPFLA 185
             +L++ +   W  DG++++DC ++ +L    +   +  EAAA ++KAG+D++  G    
Sbjct: 274 GYLLQDVLREAWGFDGFVITDCGAIHMLACGHNTAGSGVEAAAQSLKAGVDMEMSGTMFR 333

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
            H   A+  GL+ EED+N A    + ++ RLG+FD  P   P      + +    H  LA
Sbjct: 334 AHLHQALEQGLITEEDLNRAAGRVLELKFRLGLFD-RPYVDP--AWAEQVIGCKEHIALA 390

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA--CGYTTPLQG 303
            QAA +GIVLLKN    LPL +    T+AVIGPN+      +G+Y          T L G
Sbjct: 391 YQAAAEGIVLLKNEGNLLPLDS-SSGTIAVIGPNAHAPYHQLGDYTSPQPPGQIVTVLDG 449

Query: 304 ISRY---AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG--------------- 345
           I R    ++ ++  GC  +  +  +    A   A QAD  V+V+G               
Sbjct: 450 IRRRLGDSRVLYAPGCR-IQGDSREGFPRALACAEQADVIVMVLGGSSARDFGEGTIDLR 508

Query: 346 ---------LDQSIE-AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSF 395
                     +  +E  E IDR+ L L G Q EL+  + K  + PV++V + G P+   +
Sbjct: 509 TGASVVTGHAESDMECGEGIDRSTLTLMGVQLELLQELHKLGK-PVIVVYINGRPITEPW 567

Query: 396 AKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              D  I +I+   YPGQ GG+AIAD+LFG  NP
Sbjct: 568 I--DEHIPSIVEAWYPGQEGGSAIADMLFGDINP 599


>gi|423223731|ref|ZP_17210200.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638106|gb|EIY31959.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 854

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 116/305 (38%), Positives = 178/305 (58%), Gaps = 21/305 (6%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+G    ++V+GLQG+    LK+ +  KH+ A
Sbjct: 136 LTFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGDDDRYLKIVSTPKHFAA 195

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R+  N ++S++ L + Y   F+ACV +GK AS+M +YN +N  P   +  
Sbjct: 196 NNEEH----NRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAW 251

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG------P 182
           +L   +   W   GY+VSDC    +L N   Y +T E AAA +IKAGLDL+CG      P
Sbjct: 252 LLTKVLRKDWGFKGYVVSDCGGPSLLVNAHKYVKTKEAAAALSIKAGLDLECGDDVYDQP 311

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFD-GEPSAQPFGNLGPRDVCTPAH 241
            L+     A R  ++ + D++ A    +  +M LG+FD GE +  P+  + P  + +  H
Sbjct: 312 LLS-----AYRQYMVTDADIDSAAYRVLRARMELGLFDSGEQN--PYTKISPAVIGSAEH 364

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV-ACGYTTP 300
           Q++AL AA + IVLLKN  + LPL+  +  ++AV+G N+    +  G+Y+G+      + 
Sbjct: 365 QEVALNAARECIVLLKNQKKMLPLNAKKVKSIAVVGINA--GSSEFGDYSGLPVIAPISV 422

Query: 301 LQGIS 305
           LQGI 
Sbjct: 423 LQGIK 427



 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 42/105 (40%), Positives = 64/105 (60%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           L G A  A R+ +  V V+G+++SIE E  DR  + LP  QQE +  + K +   +V+VL
Sbjct: 591 LYGEAGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQQEFLQEIYKVNPN-IVVVL 649

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + +++   D  I AI+   YPG++GG A+A+VLFG  NPG
Sbjct: 650 VAGSSLAINWM--DEHIPAIVNAWYPGESGGKAVAEVLFGDYNPG 692


>gi|298387490|ref|ZP_06997042.1| beta-glucosidase [Bacteroides sp. 1_1_14]
 gi|298259697|gb|EFI02569.1| beta-glucosidase [Bacteroides sp. 1_1_14]
          Length = 853

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 117/304 (38%), Positives = 174/304 (57%), Gaps = 21/304 (6%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+G    ++V GLQG+    LK+ +  KH+ A
Sbjct: 138 LTFWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQGDDPHYLKIVSTPKHFAA 197

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R+  N ++S++ L + Y   F+ CV EGK AS+M +YN +N  P   +  
Sbjct: 198 NNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNSW 253

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC------GP 182
           +L+  +   W   GY+VSDC    +L N   Y +T E AA  +IKAGLDL+C      GP
Sbjct: 254 LLQKVLRQDWGFQGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDGP 313

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFD-GEPSAQPFGNLGPRDVCTPAH 241
            L      A +  ++ + D++ A  + +T +M+LG+FD GE    P+  + P  + +  H
Sbjct: 314 LL-----NAYKQYMVSDADIDSAACHVLTARMKLGLFDSGE--RNPYTKISPSVIGSKEH 366

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV-ACGYTTP 300
           QQ+AL AA Q IVLLKN    LPL+  +  ++AV+G N+       G+Y+G       + 
Sbjct: 367 QQIALDAARQCIVLLKNQKNRLPLNADKLKSIAVVGINA--GKCEFGDYSGAPVVEPVSI 424

Query: 301 LQGI 304
           LQGI
Sbjct: 425 LQGI 428



 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 40/105 (38%), Positives = 62/105 (59%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           L G A  A R+ +  V VMG+++SIE E  DR  + LP  Q+E +  + K +   +++VL
Sbjct: 593 LYGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVL 651

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + V++   D  + AI+   YPG+ GG A+A+VLFG  NP 
Sbjct: 652 VAGSSLAVNWM--DEHVPAIVNAWYPGEQGGTAVAEVLFGDYNPA 694


>gi|300773468|ref|ZP_07083337.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
           33861]
 gi|300759639|gb|EFK56466.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
           33861]
          Length = 777

 Score =  205 bits (521), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 148/452 (32%), Positives = 234/452 (51%), Gaps = 50/452 (11%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGL-QGNTGSRLKVAACCKHYTAYD 70
           + P +++ RDPRW R +E+ GEDPVLTG  AA+ VRGL  GN           KH+ AY 
Sbjct: 184 YGPVLDLSRDPRWSRVEESYGEDPVLTGTLAAAIVRGLGSGNLSDPFATIPTLKHFVAYG 243

Query: 71  LDN--WNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
           +     NG       A V +++L + +  PF++ V  G   SVM +YN V+G P  ++  
Sbjct: 244 IPEGGHNGS-----AASVGERELREYFLPPFQSAVAAG-AKSVMAAYNSVDGIPCSSNKF 297

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHT 188
           +L + +  +W  +G+ VSD  S+  +  +    +  ++AA  AI+AGLD D G    +  
Sbjct: 298 LLTDILRKEWSFNGFTVSDLGSIEGIKGSHRVAKDHKQAAILAIEAGLDADLGGNAYVRL 357

Query: 189 EGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNL--GPRDVCTPAHQQLAL 246
             AV+ G ++E  ++ A++  + ++  +G+F+     +PF ++    ++V T ++  L+ 
Sbjct: 358 IEAVKQGEVQENSIDQAVSRILALKFEMGLFE-----KPFVDVKTAKKEVKTESNIALSR 412

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC--GYTTPLQGI 304
           Q A + IVLL+N    LPL   +   +A++GPN+D    M+G+Y         TT  Q I
Sbjct: 413 QVARESIVLLENKNNILPLR--KDVKIAIVGPNADNVYNMLGDYTAPQPDGAVTTVRQAI 470

Query: 305 SR---YAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG---------------- 345
           S     A+  +  GC  +    N  I AA  AARQ+D  V V+G                
Sbjct: 471 SARLPKAQVSYVKGC-AIRDTTNSDIPAAVTAARQSDIIVAVVGGSSARDFKTEYISTGA 529

Query: 346 ---LDQSI----EAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKN 398
               D+S+      E  DR+ L L GRQ EL+ +  K +  P+V++ + G P+++++A  
Sbjct: 530 AVASDKSVSDMESGEGFDRSTLDLLGRQMELL-KALKQTGKPLVVIYIQGRPLNMNWAAT 588

Query: 399 DPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
             +  A+L   YPGQ GG AIADVLFG  NP 
Sbjct: 589 --QADALLCAWYPGQEGGHAIADVLFGDYNPA 618


>gi|224536364|ref|ZP_03676903.1| hypothetical protein BACCELL_01238, partial [Bacteroides
           cellulosilyticus DSM 14838]
 gi|224522024|gb|EEF91129.1| hypothetical protein BACCELL_01238 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 808

 Score =  205 bits (521), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 116/305 (38%), Positives = 177/305 (58%), Gaps = 21/305 (6%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+G    ++V+GLQG+    LK+ +  KH+ A
Sbjct: 90  LTFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGDDDRYLKIVSTPKHFAA 149

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R+  N ++S++ L + Y   F+ACV +GK AS+M +YN +N  P   +  
Sbjct: 150 NNEEH----NRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAW 205

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG------P 182
           +L   +   W   GY+VSDC    +L N   Y +T E AAA +IKAGLDL+CG      P
Sbjct: 206 LLTKVLRKDWGFKGYVVSDCGGPSLLVNAHKYVKTKEAAAALSIKAGLDLECGDDVYDQP 265

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFD-GEPSAQPFGNLGPRDVCTPAH 241
            L+     A R  ++ + D++ A    +  +M LG+FD GE    P+  + P  + +  H
Sbjct: 266 LLS-----AYRQYMVTDADIDSAAYRVLRARMELGLFDSGE--QNPYTKISPAVIGSAEH 318

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV-ACGYTTP 300
           Q++AL AA + IVLLKN  + LPL+  +  ++AV+G N+    +  G+Y+G+      + 
Sbjct: 319 QEVALNAARECIVLLKNQKKMLPLNARKVKSIAVVGINA--GSSEFGDYSGLPVIAPISV 376

Query: 301 LQGIS 305
           LQGI 
Sbjct: 377 LQGIK 381



 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 42/105 (40%), Positives = 64/105 (60%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           L G A  A R+ +  V V+G+++SIE E  DR  + LP  QQE +  + K +   +V+VL
Sbjct: 545 LYGEAGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQQEFLQEIYKVNPN-IVVVL 603

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + +++   D  I AI+   YPG++GG A+A+VLFG  NPG
Sbjct: 604 VAGSSLAINWM--DEHIPAIVNAWYPGESGGKAVAEVLFGDYNPG 646


>gi|167765093|ref|ZP_02437206.1| hypothetical protein BACSTE_03479 [Bacteroides stercoris ATCC
           43183]
 gi|167696721|gb|EDS13300.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           stercoris ATCC 43183]
          Length = 944

 Score =  205 bits (521), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 145/448 (32%), Positives = 222/448 (49%), Gaps = 32/448 (7%)

Query: 6   MAGLT-YWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           M G T  ++P +++ RD RWGR +E  GE P L  +     VRGLQ N     +VAA  K
Sbjct: 204 MLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQHNH----QVAATAK 259

Query: 65  HYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           H+ AY  +          + ++  +++E+ +  PFK  + E  +  VM SYN  +G P  
Sbjct: 260 HFAAYSNNKGAREGMARVDPQMPPREVENIHIYPFKRVIREAGLLGVMSSYNDYDGIPIQ 319

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG--- 181
                L   +  +    GY+VSD D+V  LY   +  +  +EA   +++AGL++ C    
Sbjct: 320 GSYYWLTTRLRKEMGFRGYVVSDSDAVEYLYTKHNTAKDMKEAVRQSVEAGLNVRCTFRS 379

Query: 182 -PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
                +     V+ G L EE +N  +   + V+  +G+FD        G     +V   A
Sbjct: 380 PDSFVLPLRELVKEGGLSEEVINDRVRDILRVKFLIGLFDAPYQTDLAG--ADDEVEKEA 437

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           ++ +ALQA+ + IVLLKN+  TLPL+  +   +AV GPN+D     + +Y  +A   TT 
Sbjct: 438 NEAVALQASRESIVLLKNTDNTLPLNIDKIKKIAVCGPNADEEGYALTHYGPLAVEVTTV 497

Query: 301 LQGISRYAK----TIHQAGC--------------FGVACNGNQLIGAAEVAARQADATVL 342
           L+GI   A+     ++  GC              + +  +    I  A   ARQAD  V+
Sbjct: 498 LEGIREKAQGKAEVLYTKGCDLVDAHWPESEIMEYPLTPDEQAEIDRAVANARQADVAVV 557

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           V+G  Q    E   R  L LPG Q +L+  V +A+  PV+L+L+ G P+ V++A  D  +
Sbjct: 558 VLGGGQRTCGENKSRTSLELPGHQLKLLQAV-QATGKPVILILINGRPLSVNWA--DKFV 614

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            AIL   YPG  GG  +AD+LFG  NPG
Sbjct: 615 PAILEAWYPGSKGGTVVADILFGDYNPG 642


>gi|423223135|ref|ZP_17209604.1| hypothetical protein HMPREF1062_01790 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392639742|gb|EIY33554.1| hypothetical protein HMPREF1062_01790 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 769

 Score =  205 bits (521), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 143/436 (32%), Positives = 229/436 (52%), Gaps = 32/436 (7%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLK-VAACCKHYT--A 68
           ++PNV + RDPRWGR  ET GEDP L G   A+ VRGLQGN  S  + V AC KH+   +
Sbjct: 184 FTPNVEVARDPRWGRTGETFGEDPHLVGVMGAATVRGLQGNDFSNPENVIACPKHFIGGS 243

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
             ++  NG         VS++ + + +  PFKAC ++    + M ++N+VNG P+ ++  
Sbjct: 244 QSINGINGAP-----CDVSERTIREIFLPPFKAC-LDANAYTFMMAHNEVNGIPSHSNKY 297

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC-GPFLAIH 187
           ++ + +  +W+ DGYIVSD   +  L++    T +   A   ++++G+D+   GP     
Sbjct: 298 LMTDLLRDEWKFDGYIVSDWMDIERLHDYHRVTESYANAFVLSVQSGMDMHMHGPDFMEA 357

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC-TPAHQQLAL 246
              AV+ G L E+ ++ ++   +T + +LG+F+       F     +D+    AHQQ AL
Sbjct: 358 LLEAVKDGRLTEKRIDQSVRRILTAKFKLGLFENP----YFDEAKSKDLLFNKAHQQTAL 413

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA--GVACGYTTPLQGI 304
           + AH+ IVLLKN    LPL   ++  + V GPN+D  V ++G++A         T L+G+
Sbjct: 414 EIAHKSIVLLKNDG-ILPLDASKYKKIFVTGPNADTHV-ILGDWAVPQPEGNVVTVLKGL 471

Query: 305 SRYA-KTIHQAGCFG--VACNGNQLIGAAEVAARQADATVLVMGLDQSIE-------AEF 354
              A  T      FG  +       +  A   ARQAD  ++V+G +   E        E 
Sbjct: 472 KDAAPNTTFSFLDFGWNIRTMDPAKVKQAAGMARQADLAIVVVGENSMREHWSEKTCGEN 531

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
            DR+ + LPG QQELV  +      P +++L+ G P+ V +  +   + A++    PG  
Sbjct: 532 TDRSDINLPGLQQELVETIQNTGV-PTIVILVNGRPLGVEWIAD--HVAALIEAWEPGSF 588

Query: 415 GGAAIADVLFGRANPG 430
           GG AIAD+L+G+ NP 
Sbjct: 589 GGQAIADILYGKVNPS 604


>gi|390957160|ref|YP_006420917.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
           18391]
 gi|390412078|gb|AFL87582.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
           18391]
          Length = 908

 Score =  205 bits (521), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 112/301 (37%), Positives = 174/301 (57%), Gaps = 12/301 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT+WSPN+NIFRDPRWGRGQET GEDP LT +   +++ GLQG      KV A  KH+ 
Sbjct: 129 GLTFWSPNINIFRDPRWGRGQETYGEDPHLTTQLGVNFIEGLQGTDPKFYKVIATPKHFA 188

Query: 68  AYDLDNWNGVD--RYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
            +     +G +  R+ F+   +  DL DTY   F+A +V+ K  S+MC+YN+++G+P C 
Sbjct: 189 VH-----SGPEEGRHKFDVEPTPHDLWDTYLPQFRAAIVDAKADSIMCAYNRIDGQPACG 243

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAAD--AIKAGLDLDCGPF 183
              +L + +   W+  G++ SDC ++   +    +   P+   AD  A+ AG D +CG  
Sbjct: 244 SKLLLVDILRNDWKFQGFVTSDCGAIDDFFRPNTHQTEPDAEHADKAALLAGTDTNCGST 303

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                + AV+ GL++E D++++L      ++RLG+FD   S  P+  +    V +PA+  
Sbjct: 304 YRKLGD-AVKSGLIKESDIDVSLRRLFEARVRLGLFDPAGSV-PYAQIPFSQVNSPANAA 361

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           +A +AA + +VLLKN    LPL   ++ T+AVIGPN     ++ GNY G+A     P+  
Sbjct: 362 VAKRAAEESMVLLKNDG-ILPLKAGKYKTIAVIGPNGASLSSLEGNYNGMAHDPRMPVDA 420

Query: 304 I 304
           +
Sbjct: 421 L 421



 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 41/114 (35%), Positives = 61/114 (53%), Gaps = 13/114 (11%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAE--------FI--DRAGLLLPGRQQELVSRVAK 375
           L+  A  AA ++D  V ++GL   +E E        F+  DR  + LP  QQ L+  +  
Sbjct: 620 LLPEALEAANKSDLVVAMLGLSPDLEGEEMPVKLPGFVGGDRTDISLPASQQALLQGLI- 678

Query: 376 ASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           A+  P ++VL+ G  + ++ A  D +  AIL   YPG+AG  A+AD L GR NP
Sbjct: 679 ATGKPTIVVLLNGSALAINLA--DEKANAILESWYPGEAGSTALADTLVGRNNP 730


>gi|237721201|ref|ZP_04551682.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
 gi|229448997|gb|EEO54788.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
          Length = 863

 Score =  205 bits (521), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 119/301 (39%), Positives = 172/301 (57%), Gaps = 13/301 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT W+PNVNIFRDPRWGRGQET GEDP L+G+   + VRGLQG   +   K+ AC KH+
Sbjct: 133 GLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKLHACAKHF 192

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  ++ +DL +TY   FK  V +  V  VMC+YN+  G P C 
Sbjct: 193 AVHSGPEWN---RHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDPCCG 249

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE--EAAADAIKAGLDLDCGPF 183
              +L   +   W   G +V+DC ++G  +  + +   P+   A+ADA+ +G DL+CG  
Sbjct: 250 SNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLECGGN 309

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
               T+ AV+ GL+ EE +N ++   +  +  LG  +   S  P+ N+    +  P H++
Sbjct: 310 FKSITD-AVKKGLISEEKINTSVKRLLKARFELGEMN---STHPWSNIPFSVIDCPKHKE 365

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ AH+ +VLL+N+     L   R   VAVIGPN++ +V   GNY G      T L+G
Sbjct: 366 LALKMAHESLVLLQNNNNL--LPLNRQMKVAVIGPNANDSVMQWGNYNGFPSHTVTLLEG 423

Query: 304 I 304
           I
Sbjct: 424 I 424



 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 49/105 (46%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  +   G+   +E E +          DR  + LP  Q+E+++ + K  +  V + 
Sbjct: 598 QSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKKTVFVN 657

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
                   V   +N     AIL   YPGQAGG A+ADVLFG  NP
Sbjct: 658 FSGSAMAIVPETQN---CDAILQAWYPGQAGGTAVADVLFGDYNP 699


>gi|423289663|ref|ZP_17268513.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
           CL02T12C04]
 gi|423298156|ref|ZP_17276215.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
           CL03T12C18]
 gi|392663697|gb|EIY57244.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
           CL03T12C18]
 gi|392667374|gb|EIY60884.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
           CL02T12C04]
          Length = 850

 Score =  204 bits (519), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 112/301 (37%), Positives = 173/301 (57%), Gaps = 13/301 (4%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+G    ++V+GLQG     LK+ +  KH+ A
Sbjct: 135 LTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGEDPRYLKIVSTPKHFVA 194

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R+  N ++S++ L + Y   F+ CV +GK AS+M +YN +N  P   +  
Sbjct: 195 NNEEH----NRFICNPQISEKQLREYYFPAFEMCVKKGKAASIMTAYNALNDVPCTLNAW 250

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHT 188
           +L+  +   W   GY+VSDC    +L N   Y +T E AA  +IKAGLDL+CG    ++ 
Sbjct: 251 LLQKVLRQDWGFRGYVVSDCGGPSLLVNAHKYVKTKETAATLSIKAGLDLECGD--DVYD 308

Query: 189 E---GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
           E    A +  ++ + D++ A  + +  +M+LGMFD +    P+  + P  + +  HQQ+A
Sbjct: 309 EYLLNAYKQYMVSDADIDSAACHVLAARMKLGMFDSK-ERNPYARISPSVIGSKDHQQVA 367

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV-ACGYTTPLQGI 304
           L AA + IVLLKN    LPL+  +  ++AV+G N+       G+Y+G       + LQGI
Sbjct: 368 LDAARECIVLLKNQKNMLPLNVDKLKSIAVVGINA--GTCEFGDYSGAPVIEPVSVLQGI 425

Query: 305 S 305
            
Sbjct: 426 K 426



 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/105 (39%), Positives = 61/105 (58%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           L G A  A  + +  V VMG+++SIE E  DR  + LP  Q+E +  + K +   +++VL
Sbjct: 590 LYGEAGKAVSECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVL 648

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + V++   D  I AI+   YPG+ GG A+ADVLFG  NP 
Sbjct: 649 VAGSSLAVNWM--DEHIPAIVNAWYPGEQGGTAVADVLFGDYNPA 691


>gi|319901412|ref|YP_004161140.1| glycoside hydrolase 3 [Bacteroides helcogenes P 36-108]
 gi|319416443|gb|ADV43554.1| glycoside hydrolase family 3 domain protein [Bacteroides helcogenes
           P 36-108]
          Length = 944

 Score =  204 bits (519), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 150/448 (33%), Positives = 223/448 (49%), Gaps = 32/448 (7%)

Query: 6   MAGLT-YWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           M G T  ++P +++ RD RWGR +E  GE P L  +     VRG+Q N     +VAA  K
Sbjct: 204 MLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQYNH----QVAATGK 259

Query: 65  HYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           H+ AY  +          + ++S +++E+ +  PF+  + E  +  VM SYN  +G P  
Sbjct: 260 HFAAYSNNKGAREGMSRVDPQISPREVENIHIYPFRRVIREAGLLGVMSSYNDYDGIPIQ 319

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG--- 181
                L   + G+    GY+VSD D+V  LY      +  +EA   +++AGL++ C    
Sbjct: 320 GSHYWLTTRLRGEIGFRGYVVSDSDAVEYLYTKHGTAKDMKEAIRQSVEAGLNIRCTFRS 379

Query: 182 -PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
                +     V+ G L EE +N  +   + V+   G+FD    +   G    R+V    
Sbjct: 380 PDSFVLPLRELVKEGGLSEEIINDRVRDILRVKFLTGLFDTPYQSDLAG--ADREVEKEE 437

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           +  +ALQA+ + IVLLKN    LPL       +AV GPN+D     + +Y  +A    T 
Sbjct: 438 NGSIALQASRESIVLLKNENNMLPLDLSTVKRIAVCGPNADEKNYALTHYGPLAVEVITV 497

Query: 301 LQGI----SRYAKTIHQAGCFGVACNG------NQLIGAAEVA--------ARQADATVL 342
           L+GI    S  A+ ++  GC  V  N       N  + A E A        ARQ+D  V+
Sbjct: 498 LKGIQDKVSGKAEVLYTKGCDLVDANWPESEIINHPLTADEQAEINKAAENARQSDVAVV 557

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           V+G  Q    E   R+ L LPGRQ +L+  + +A+  PV+LVL+ G P+ V++A  D  +
Sbjct: 558 VLGGGQRTCGENKSRSSLDLPGRQLQLLQAI-QATGKPVILVLINGRPLSVNWA--DKYV 614

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            AIL   YPG  GG A+ADVLFG  NPG
Sbjct: 615 PAILEAWYPGAKGGIALADVLFGDYNPG 642


>gi|317477153|ref|ZP_07936394.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316906696|gb|EFV28409.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 863

 Score =  204 bits (518), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 127/403 (31%), Positives = 207/403 (51%), Gaps = 28/403 (6%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYW+P +N+ RDPRWGR +ET GEDP LT +   ++V+GLQG+  + LK  A  KH+ 
Sbjct: 146 GLTYWAPTINMARDPRWGRNEETYGEDPYLTSRLGVAFVKGLQGDHPNYLKTVATVKHFV 205

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           A + +N    DR+  ++++  + L + Y   ++ACV E  V S+M +YN  NG P     
Sbjct: 206 ANNQEN----DRFSSSSQIPTKQLYEYYFPAYEACVKEANVQSIMTAYNAFNGIPPSGST 261

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L++ +  +W  DG++VSDC ++GV+        + EEAAA  I +G DL+CG     +
Sbjct: 262 WLLEDVLRKEWGFDGFVVSDCGAIGVMNWQHRIVNSLEEAAALGINSGCDLECGGTYREN 321

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              AV+ GL+ E  ++ AL   +T++ +LG FD      P+ +   + +     ++LA +
Sbjct: 322 LVAAVQRGLVSEYAIDRALTRVLTMRFKLGEFD-PIELVPYNHYDKKLLAGEQFRRLAYE 380

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRY 307
           AA + I+LLKN    LP+      ++A++GP +D     +G Y+G      + LQG+ + 
Sbjct: 381 AAVKSIILLKNEDNFLPIDKKDVRSIAIVGPFAD--NNYLGGYSGKPVHNISLLQGVKK- 437

Query: 308 AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQ 367
                     G     + + G + V+   +   +   G++  + A++ID   L  P    
Sbjct: 438 --------MVGEEVEISYIEGTSVVSPVDSSYLLASDGVNNGLTADYIDGHDLNSPF--- 486

Query: 368 ELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGY 410
            L SR+ K      V      G  D    KND    ++ W GY
Sbjct: 487 -LFSRIDKT-----VGFDWGDGTPDQRLTKND---FSVRWSGY 520



 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 41/126 (32%), Positives = 63/126 (50%), Gaps = 6/126 (4%)

Query: 306 RYAKTIHQAGCFGVACNGN--QLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLP 363
            Y   I  A C  V+  GN  Q+    ++ +R AD  ++ +G D  +  E  D   + LP
Sbjct: 577 EYINRIGAATCILVSDFGNINQIDKVKKIVSR-ADLVLVALGNDGKLARENRDLPSIYLP 635

Query: 364 GRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVL 423
             Q+ L+  + K +   + L+L  G P+   +A     + +IL   YPGQ GGAA+A +L
Sbjct: 636 MTQELLLKEIYKVN-PRIALILQTGNPLTSQWAAE--HVPSILQAWYPGQEGGAALAGIL 692

Query: 424 FGRANP 429
           FG  NP
Sbjct: 693 FGLENP 698


>gi|256394357|ref|YP_003115921.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
 gi|256360583|gb|ACU74080.1| glycoside hydrolase family 3 domain protein [Catenulispora
           acidiphila DSM 44928]
          Length = 1357

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 143/471 (30%), Positives = 218/471 (46%), Gaps = 59/471 (12%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN--TGSR----LKVAAC 62
           LT+W+P VN+ RDPRWGR  E  GEDP L G+ A ++V G QGN  TG      LK AA 
Sbjct: 176 LTFWAPTVNLDRDPRWGRTDEAFGEDPYLVGQMAGAFVNGFQGNSMTGQSLDGYLKAAAT 235

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
            KHY   D++     +R   ++ VS  DL D Y   F   +    VA +M SYN +NG P
Sbjct: 236 AKHYALNDVEQ----NRTGISSNVSDTDLRDYYTKQFADLIENSHVAGLMTSYNAINGTP 291

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLY--------------------------N 156
           + AD           +  +GY+ SDC +VG  Y                          N
Sbjct: 292 SVADTYTANQLAQRTYGFNGYVTSDCGAVGTAYRNFPAGHAWAPPGWTTDGGDTNSIWTN 351

Query: 157 TQHYTRTPEEAAADA--IKAGLDLDCG--PFLAIHTEGAVRGGLLREEDVNLALAYTITV 212
           T    +    A  +A  ++AG  ++CG   F   + + A+  G+L E  ++  L    T+
Sbjct: 352 TSTGAKISGAAGGEAYSLRAGTQVNCGGDEFSLQNIQAAISAGILSEGVIDSDLTKLFTI 411

Query: 213 QMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKN------SARTLPLS 266
           +M  G FD   S  P+ ++    + +PAHQ LA   A   +VLLKN      SA  LP S
Sbjct: 412 RMETGEFD-PASKVPYTSITKAQIQSPAHQALATSVADNSLVLLKNANVSGTSAPLLPAS 470

Query: 267 TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRYAKTIHQAGCF--------G 318
             +   V ++G  ++     +G+Y+G        +QG++   K  + +            
Sbjct: 471 ASKLANVVILGDMANQVT--LGDYSGAPSLQVNAVQGLTTAIKAANPSANILFDAAGTSS 528

Query: 319 VACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASR 378
              +   L  A + A ++AD  V+ +G +Q+   E  DR  L +PG    L+++      
Sbjct: 529 TTTSAATLSSATQAAIKKADLVVMFVGTNQNNAQEGNDRTTLNMPGNYDSLITQTTALGN 588

Query: 379 GPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
               LV+   GPV +S  +    + A+++ GY G++ G A+ADVL G+ NP
Sbjct: 589 PKTALVVQSDGPVKISDVQG--SVPAVVFSGYNGESQGTALADVLLGKQNP 637


>gi|423313768|ref|ZP_17291703.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
           CL09T03C04]
 gi|392684303|gb|EIY77631.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
           CL09T03C04]
          Length = 788

 Score =  203 bits (517), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 139/442 (31%), Positives = 217/442 (49%), Gaps = 36/442 (8%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL 71
           +SP ++I +DPRWGR  ET GEDP L G+     +  LQ     +  + A  KH+  Y +
Sbjct: 196 YSPILDIAQDPRWGRCVETYGEDPYLVGELGKQMITSLQ-----KYNLVATPKHFAVYSI 250

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
                  +   +  V+ +++   Y  PF+    E     VM SYN  +G+P       L 
Sbjct: 251 PIGGRDGKTRTDPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLT 310

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC-----GPFLAI 186
             +  +W   GY+VSD ++V  + N      T E+  A A+ AGL++         F+ +
Sbjct: 311 EILRQEWGFKGYVVSDSEAVEFISNKHKVADTYEDGIAQAVNAGLNIRTHFTPPADFI-L 369

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLAL 246
               AV  G + +E ++  +A  + ++ RLG+FD        G    + V +  HQ ++L
Sbjct: 370 PLRKAVDNGKISQETLDKRVAEILRIKFRLGLFDNPYRGN--GKQAEQIVHSKEHQAVSL 427

Query: 247 QAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           +AA Q +VLLKN    LPLS ++R  ++AVIGPN++    +I  Y        T  QGI 
Sbjct: 428 EAARQSLVLLKNETNLLPLSKSIR--SIAVIGPNANEQTQLICRYGPANAPIKTVYQGIK 485

Query: 306 R---YAKTIHQAGC--------------FGVACNGNQLIGAAEVAARQADATVLVMGLDQ 348
               +A+ I++ GC              F       QL+  A  AA+QA+  V+V+G ++
Sbjct: 486 ELLPHAEVIYKKGCDIIDPHFPESEILDFPKTAEEVQLMEEAIRAAKQAEVVVMVLGGNE 545

Query: 349 SIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWV 408
               E   R  L LPGRQ+EL+  V  A+  PV+LV++ G    +++A     + AIL  
Sbjct: 546 LTVREDRSRTSLNLPGRQEELLKAVC-ATGKPVILVMLDGRASSINYAA--AHVPAILHA 602

Query: 409 GYPGQAGGAAIADVLFGRANPG 430
            +PG+  G A+A+ LFG  NPG
Sbjct: 603 WFPGEFCGQAVAEALFGDYNPG 624


>gi|423222018|ref|ZP_17208488.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392644204|gb|EIY37946.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 942

 Score =  203 bits (517), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 150/448 (33%), Positives = 225/448 (50%), Gaps = 32/448 (7%)

Query: 6   MAGLT-YWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           M G T  ++P +++ RD RWGR +E  GE P L  +     VRG+Q +     +VAA  K
Sbjct: 204 MLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHSH----QVAATGK 259

Query: 65  HYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           H+ AY  +          + ++S +++E  +  PFK  + E  +  VM SYN  +G P  
Sbjct: 260 HFVAYSNNKGAREGMARVDPQMSPREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQ 319

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG--- 181
                L   + G+    GY+VSD D+V  LY      +  +EA   +++AGL++ C    
Sbjct: 320 GSYYWLTTRLRGEMGFRGYVVSDSDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRCTFRS 379

Query: 182 -PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
                +     V+ G L EE +N  +   + V+  +G+FD        G    ++V    
Sbjct: 380 PDSYVLPLRELVKEGGLSEEVINDRVRDILRVKFLVGLFDTPYQTDLAG--ADKEVEKAE 437

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           ++ LALQA+ + +VLLKN    LPL       +AV GPN+D     + +Y  +A   TT 
Sbjct: 438 NESLALQASRESLVLLKNENNVLPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTV 497

Query: 301 LQGISR----YAKTIHQAGCFGVACN--GNQLIG-------AAEV-----AARQADATVL 342
           L+GI +     A+ ++  GC  V  N   ++LI         AE+      ARQAD  V+
Sbjct: 498 LEGIRQKAEGKAEVLYTKGCDLVDANWPESELIDYPMTDSEQAEIDKAVENARQADVAVV 557

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           V+G  Q    E   R+ L LPGRQ +L+  V +A+  PVVLVL+ G P+ +++A  D  +
Sbjct: 558 VLGGGQRTCGENKSRSSLDLPGRQLKLLQAV-QATGKPVVLVLINGRPLSINWA--DKFV 614

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
             IL   YPG  GG A+ADVLFG  NPG
Sbjct: 615 PVILEAWYPGSKGGTAVADVLFGDYNPG 642


>gi|334365132|ref|ZP_08514098.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
           sp. HGB5]
 gi|313158675|gb|EFR58064.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
           sp. HGB5]
          Length = 771

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 146/450 (32%), Positives = 225/450 (50%), Gaps = 49/450 (10%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGL-QGNTGSRLKVAACCKHYTAYD 70
           + P ++I RDPRW R +E+ GED  LT +   +YVRG   G+        +  KH+ AY 
Sbjct: 178 YGPVLDIVRDPRWSRTEESYGEDCYLTARIGEAYVRGTGSGDLSQSRHALSTLKHFIAYG 237

Query: 71  LDN--WNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
                 NG         + +++L +TY  PF+A V  G   SVM +YN V+G P  A+  
Sbjct: 238 ASEGGQNGGSNL-----LGERELRETYLPPFEAAVKAG-ARSVMTAYNSVDGIPCTANRR 291

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC--GPFLAI 186
           +L + + G+W  DG++VSD  S+  L+ T     +  EAA  A++AG+D D   G F ++
Sbjct: 292 MLTDILRGEWGFDGFVVSDLLSIEGLHETHGVAGSVREAAVQALRAGVDADLKGGAFASL 351

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF-GNLGPRDVCTPAHQQLA 245
               A   G + E +++ A+   + ++  +G+F+      P+       +V   AH +LA
Sbjct: 352 RE--AAEAGDVAEAEIDRAVERVLALKFEMGLFE-----NPYIDEAAAAEVGCAAHSELA 404

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           L+AA Q + LL+N + TLPL   R   VAVIGPN+D     +G+Y        T   G+ 
Sbjct: 405 LEAARQSVTLLENRSGTLPLDPRRLRRVAVIGPNADNIYNQLGDYTAQQTAANTVRDGLE 464

Query: 306 RYA---KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLD--QSIEAEFI----- 355
           +     + ++  GC  V       I AA  AAR  DA V+V+G    +  + EF+     
Sbjct: 465 KLLGRDRVVYSRGCT-VRGGDRSEIAAAVSAARGTDAAVVVIGGSSARDFDTEFLQTGAA 523

Query: 356 ----------------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKND 399
                           DRA L L G Q+EL+ R+ KA+  P+++V + G P+D+  A   
Sbjct: 524 KAAHDEVRDMECGEGFDRATLALLGEQEELLRRI-KATGTPLIVVCIAGRPLDLRRASE- 581

Query: 400 PRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            +  A+L   YPG  GG A+A+ + GR NP
Sbjct: 582 -QADALLMAWYPGARGGDAVAETILGRNNP 610


>gi|397691073|ref|YP_006528327.1| beta-glucosidase [Melioribacter roseus P3M]
 gi|395812565|gb|AFN75314.1| beta-glucosidase [Melioribacter roseus P3M]
          Length = 923

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 110/305 (36%), Positives = 178/305 (58%), Gaps = 10/305 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYWSP +NI RDPRWGR +E+  EDP L  +   ++++G+QG+    LK  A  KH+ 
Sbjct: 202 GLTYWSPTINIARDPRWGRNEESYSEDPYLLSRMGVAFIKGMQGDHPYYLKTVATPKHFI 261

Query: 68  AYDLDNWNGVDRYHF-NARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
           A      N  +R H  ++ V  ++L + Y   FK+ +VE +  S+M +YN++N  P+ A+
Sbjct: 262 AN-----NEEERRHTGSSDVDMRNLYEYYLPAFKSAIVEARAYSIMGAYNELNHVPSNAN 316

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAI 186
             ++ + +  QW  +GY+VSDC ++  +     + +T  EA A +I AG DL+CG     
Sbjct: 317 MFLMTDLLRRQWGFEGYVVSDCGAIHDMLYGHKFFKTGAEAVARSILAGCDLNCGQAYRE 376

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLAL 246
             + A+  GLLRE+D++ AL   ++ + RLG FD  P   P+ ++G   + +  +++LAL
Sbjct: 377 FIKDALDEGLLREKDIDSALFRVLSARFRLGEFD-PPELVPYSSIGKDKLDSKENRRLAL 435

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISR 306
            AA + IVLLKN+   LP+   +  ++AVIGPN+      +G Y+G      +PL+GI  
Sbjct: 436 DAARKSIVLLKNND-ILPIDKSKIKSIAVIGPNA--REAQLGIYSGFPNVLISPLEGIKN 492

Query: 307 YAKTI 311
            A ++
Sbjct: 493 KADSL 497



 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 42/108 (38%), Positives = 64/108 (59%), Gaps = 3/108 (2%)

Query: 323 GNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVV 382
           G +    A+  A + D  +LV+G+   I  E +DR  + LP  Q+ELV + A+ +   +V
Sbjct: 654 GQKDFEKAKKIAAENDLVILVLGITPGISQEELDRKEIELPSVQRELVKQTAEVNPN-IV 712

Query: 383 LVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +VL+ GGPV ++ A+   +  AI+   Y G+ GG A+ADVLFG  NPG
Sbjct: 713 IVLVNGGPVALAGAEKYAK--AIVENWYNGEFGGQALADVLFGDYNPG 758


>gi|333380553|ref|ZP_08472244.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332826548|gb|EGJ99377.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 957

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 142/437 (32%), Positives = 216/437 (49%), Gaps = 34/437 (7%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL 71
           WSP +++ +D RWGR +ET GEDPVL  +   ++++G Q        +    KH+ A+  
Sbjct: 267 WSPVLDVAQDARWGRCEETYGEDPVLVTEIGGAWIKGYQSKG-----LMTTPKHFAAHGA 321

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
               G D +     +S++++ + + VPF+    + K  S+M SY+   G P     ++LK
Sbjct: 322 P-LGGRDSHDIG--LSEREMREIHLVPFRDIYKKYKYQSIMMSYSDFLGVPVAKSKELLK 378

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE-EAAADAIKAGLDLDCG-PFLAIHTE 189
             +  +W  DG+IVSDC ++G L   +HYT   + EAA  A+ AG+  +CG  +      
Sbjct: 379 GILRDEWGFDGFIVSDCGAIGNLTARKHYTAVDKVEAARQALAAGIATNCGDTYNDPDVI 438

Query: 190 GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAA 249
            A + G L  +D++      +    R G+F+  P      N       +P HQ LA + A
Sbjct: 439 AAAKRGELNMDDLDFTCKTLLRTLFRNGLFENNPCKPLDWNKIYPGWNSPEHQALARKTA 498

Query: 250 HQGIVLLKNSARTLPLSTLRHHTVAVIGPNSD-------VTVTMIGNYAGVACGYTTPLQ 302
            + IVLL+N    LPLS     T+AVIGP +D        +    G    V  G      
Sbjct: 499 QESIVLLENKGNILPLSK-SLKTIAVIGPGADNLQPGDYTSKPQPGQLKSVLTGIKA--- 554

Query: 303 GISRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEA---------E 353
            ++   K +++ GC  +   G  +  A + AA  AD  VLV+G   + EA         E
Sbjct: 555 AVNSSTKVLYEEGCRFIGTEGTDIAKAVK-AAENADVAVLVLGDCSTSEALKGITNTSGE 613

Query: 354 FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 413
             D A L+LPG QQ+L+  V K  + PVVL+L  G P ++S+A  + +   + W+  PGQ
Sbjct: 614 NHDLATLILPGEQQKLLEAVCKTGK-PVVLILQAGRPYNLSYAAENCQAVLVNWL--PGQ 670

Query: 414 AGGAAIADVLFGRANPG 430
            GG A ADVLFG  NP 
Sbjct: 671 EGGYATADVLFGDYNPA 687


>gi|380694609|ref|ZP_09859468.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
           [Bacteroides faecis MAJ27]
          Length = 804

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 148/448 (33%), Positives = 227/448 (50%), Gaps = 41/448 (9%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGL-QGNTGSRLKVAACCKHYTAYD 70
           + P +++ RDPRW R +ET GEDPVL+G+  A+ V GL  G+        A  KH+ AY 
Sbjct: 200 YGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMVTGLGSGDLSREHATIATLKHFLAYA 259

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
           +        Y   A V  +DL + +  PF+  +  G + SVM SYN ++G P  A+  +L
Sbjct: 260 VPEGGQNGNY---ASVGARDLHENFLPPFREAIEAGAL-SVMTSYNSIDGIPCTANHYLL 315

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEG 190
              +  +W+  G++VSD  S+  ++ +     T EEAA  A+ AG+D+D G    ++   
Sbjct: 316 TQLLRNEWKFRGFVVSDLYSIEGIHESHFVASTMEEAAVQALSAGVDIDLGGDAFMNLLQ 375

Query: 191 AVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAH 250
           AVR G L E  +N A+   + ++  +G+F+  P   P      + V    H +LA + A 
Sbjct: 376 AVRSGKLDETQINAAVDRILRMKFEMGLFE-HPYVNP--KTTTKMVRNKEHVKLARKVAQ 432

Query: 251 QGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA--CGYTTPLQG-ISRY 307
             +VLL+N    LPLS  +   VAV+GPN+D    M+G+Y          T L G IS+ 
Sbjct: 433 SSVVLLENKNSILPLSK-KIKRVAVVGPNADNRYNMLGDYTAPQEDKDIRTVLDGVISKL 491

Query: 308 --AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG-------------------L 346
             ++  +  GC       N++  A E AA +++  + V+G                    
Sbjct: 492 SPSRVEYVRGCAIRDTTVNEIAEAVE-AAHRSEVIIAVVGGSSARDFKTSYQETGAAIAD 550

Query: 347 DQSIE----AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           ++SI      E  DRA L L G+QQ+L++ + K +  P+++V + G P+D  +A      
Sbjct: 551 EKSISDMECGEGFDRATLTLLGKQQDLLNAL-KTTGKPLIVVYIEGRPLDKVWASECA-- 607

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            A+L   YPGQAGG AIADVLFG  NP 
Sbjct: 608 DALLTASYPGQAGGDAIADVLFGDYNPA 635


>gi|150002739|ref|YP_001297483.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
 gi|294776994|ref|ZP_06742455.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
 gi|149931163|gb|ABR37861.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Bacteroides vulgatus ATCC 8482]
 gi|294449242|gb|EFG17781.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
          Length = 788

 Score =  202 bits (514), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 138/442 (31%), Positives = 216/442 (48%), Gaps = 36/442 (8%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL 71
           +SP ++I +DPRWGR  ET GEDP L G+     +  LQ     +  + A  KH+  Y +
Sbjct: 196 YSPILDIAQDPRWGRCVETYGEDPYLVGELGKQMITSLQ-----KYNLVATPKHFAVYSI 250

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
                  +   +  V+ +++   Y  PF+    E     VM SYN  +G+P       L 
Sbjct: 251 PIGGRDGKTRTDPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLT 310

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC-----GPFLAI 186
             +  +W   GY+VSD ++V  + N      T E+  A A+ AGL++         F+ +
Sbjct: 311 EILRQEWGFKGYVVSDSEAVEFISNKHKVADTYEDGIAQAVNAGLNIRTHFTPPADFI-L 369

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLAL 246
               AV  G + +E ++  +A  + ++ RLG+FD        G    + V +  HQ ++L
Sbjct: 370 PLRKAVDNGKISQETLDKRVAEILRIKFRLGLFDNPYRGN--GKQAEQIVHSKEHQAVSL 427

Query: 247 QAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           +AA Q +VLLKN    LPLS ++R  ++AVIGPN++    +I  Y        T  QGI 
Sbjct: 428 EAARQSLVLLKNETNLLPLSKSIR--SIAVIGPNANEQTQLICRYGPANAPIKTVYQGIK 485

Query: 306 R---YAKTIHQAGC--------------FGVACNGNQLIGAAEVAARQADATVLVMGLDQ 348
               + + I++ GC              F       QL+  A  AA+QA+  V+V+G ++
Sbjct: 486 ELLPHTEVIYKKGCDIIDPHFPESEILDFPKTAEEVQLMEEAIRAAKQAEVVVMVLGGNE 545

Query: 349 SIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWV 408
               E   R  L LPGRQ+EL+  V  A+  P++LV++ G    +++A     I AIL  
Sbjct: 546 LTVREDRSRTSLNLPGRQEELLKAVC-ATGKPIILVMLDGRASSINYAA--AHIPAILHA 602

Query: 409 GYPGQAGGAAIADVLFGRANPG 430
            +PG+  G A+A+ LFG  NPG
Sbjct: 603 WFPGEFCGQAVAEALFGDYNPG 624


>gi|387888690|ref|YP_006318988.1| periplasmic beta-glucosidase BglX [Escherichia blattae DSM 4481]
 gi|414592757|ref|ZP_11442406.1| beta-glucosidase BglX [Escherichia blattae NBRC 105725]
 gi|386923523|gb|AFJ46477.1| periplasmic beta-glucosidase BglX [Escherichia blattae DSM 4481]
 gi|403196238|dbj|GAB80058.1| beta-glucosidase BglX [Escherichia blattae NBRC 105725]
          Length = 766

 Score =  202 bits (514), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 150/457 (32%), Positives = 224/457 (49%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  L+     + V  +QG +   R  V    KH+ AY 
Sbjct: 156 WAPMVDVSRDPRWGRVSEGFGEDTFLSSAMGQALVTSMQGKSPAERYSVMTSVKHFAAYG 215

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G    VM + N VNG P+ AD
Sbjct: 216 AVEGGREYNTVD-------MSPQRLFNDYMPPYKAALDAGS-GGVMVALNSVNGTPSTAD 267

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PFLA 185
             +LK+ +  QW   G  +SD  ++  L       R PE+A   A+ AG+++     + +
Sbjct: 268 SWLLKDLLRDQWGFTGITISDHGAIKELIK-HGVARDPEDAVRVALNAGINMSMSDEYYS 326

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA----- 240
            +  G ++ G + ++ ++ A  + + V+  +G+F+      P+ +LGPR+   PA     
Sbjct: 327 KYLPGLLKSGAVSQQALDDATRHVLNVKYDMGLFN-----DPYSHLGPRE-SDPAETNAE 380

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A Q A + +VLLKN   TLPL   +  T+AVIGP +D    M+G++  AGVA 
Sbjct: 381 SRLHREAARQVARESLVLLKNRLNTLPLK--KSGTIAVIGPLADSKRDMMGSWSAAGVAE 438

Query: 296 GYTTPLQGIS---------RYAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T LQGI          RYA+  +     G+    NQ              ++  A  
Sbjct: 439 QSVTVLQGIKNALGEQATVRYARGANVTDDQGIVAFLNQYEPAVTIDKRTPQAMLDEAVK 498

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A Q+D  V V+G  Q +  E   R  + LP  QQ L++ + K  + P+VLVLM G P  
Sbjct: 499 TASQSDVIVAVVGEAQGMAHEASSRTDISLPASQQALIAALKKTGK-PLVLVLMNGRP-- 555

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  A+L   + G  GG AIADVLFG  NP
Sbjct: 556 LTLVKEDQQADALLETWFAGTEGGNAIADVLFGDYNP 592


>gi|329962030|ref|ZP_08300041.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
 gi|328530678|gb|EGF57536.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
          Length = 941

 Score =  201 bits (512), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 147/448 (32%), Positives = 220/448 (49%), Gaps = 34/448 (7%)

Query: 6   MAGLT-YWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           M G T  ++P +++ RD RWGR +E  GE P L  +     VRGLQ +      VAA  K
Sbjct: 204 MLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQQH------VAATGK 257

Query: 65  HYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           H+ AY  +          + + S  ++E+ +  PF+  + E  +  VM SYN  +G P  
Sbjct: 258 HFAAYSNNKGAREGMARVDPQTSPHEVENIHIYPFRRVIKEAGLLGVMSSYNDYDGIPIQ 317

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG--- 181
                L   +  +    GY+VSD D+V  LY      +  +EA   +++AGL++ C    
Sbjct: 318 GSYYWLTTRLRDEMGFRGYVVSDSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRS 377

Query: 182 -PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
                +     V+ G L EE VN  +   + V+  +G+FD        G    ++V    
Sbjct: 378 PDSFVLPLRELVKEGGLDEETVNDRVRDILRVKFLIGLFDAPYQTDLAG--ADKEVEKEE 435

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           ++ +ALQA+ + +VLLKN   TLPL+      +AV GPN+D     + +Y  +A   TT 
Sbjct: 436 NEAVALQASRESVVLLKNENSTLPLNINTVKKIAVCGPNADEDGYALTHYGPLAVEVTTV 495

Query: 301 LQGISR----YAKTIHQAGCFGVACNGNQL--------------IGAAEVAARQADATVL 342
           L+GI       A+ ++  GC  V  N  +               I  A   AR+AD  V+
Sbjct: 496 LKGIQDKVNGKAEVLYTKGCDLVDANWPESEIIDYPLTPDEQAEINKAVENARRADVAVV 555

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           V+G  Q    E   R+ L LPGRQ +L+  V +A+  PVVL+L+ G P+ V++A  D  +
Sbjct: 556 VLGGGQRTCGENKSRSSLDLPGRQLQLLQAV-QATGKPVVLILINGRPLSVNWA--DKYV 612

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            AIL   YPG  GG A+AD+LFG  NPG
Sbjct: 613 PAILEAWYPGSKGGVALADILFGDYNPG 640


>gi|167646366|ref|YP_001684029.1| beta-glucosidase [Caulobacter sp. K31]
 gi|167348796|gb|ABZ71531.1| Beta-glucosidase [Caulobacter sp. K31]
          Length = 898

 Score =  201 bits (512), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 119/293 (40%), Positives = 167/293 (56%), Gaps = 18/293 (6%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL  WSPN+NIFRDPRWGRGQET GEDP LT +   ++V G+QG+     ++ A  KH+ 
Sbjct: 141 GLNTWSPNINIFRDPRWGRGQETYGEDPFLTARMGVAFVEGIQGDDPDHPRIIATPKHFA 200

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +         R+  N  VS++DLEDTY   F+A VVEG+  S+MC+YN+++G+P CA  
Sbjct: 201 VHSGPEST---RHGANVFVSRRDLEDTYLPAFRAAVVEGRAGSIMCAYNRIDGQPACASD 257

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +LK  + G W+ DGY+VSDCD+V  + +   Y      A A A++AG+D +C       
Sbjct: 258 LLLKEHLRGAWKFDGYVVSDCDAVKDISDHHKYAPEAASAVAAALRAGVDNECNGATLTD 317

Query: 188 TEG-------AVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNL-GPRDVCTP 239
           T+G       A+  GL+    ++ AL    + + R G    +  +   G L GP  V T 
Sbjct: 318 TDGLAGRYREALDRGLISTAQIDTALVRLFSARFRNGDLPAKGGSD--GRLAGPSVVTTH 375

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLST-LRHHTVAVIGPNSDVTVTMIGNYA 291
            H+ LAL A+ + +VLLKN    LPL   LR   +AVIGP  D T  + GNY+
Sbjct: 376 EHEALALAASEKSLVLLKNDG-VLPLKPGLR---IAVIGPLGDATRVLRGNYS 424



 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 39/102 (38%), Positives = 59/102 (57%), Gaps = 13/102 (12%)

Query: 338 DATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMC 387
           D  V V+GL   +EAE            D+  L LP  Q+ ++ + AKA   P+++V M 
Sbjct: 628 DVLVAVVGLTSDMEAEEAPIEIPGFKGGDKTTLDLPADQRAMLEQ-AKALGKPLIVVAMN 686

Query: 388 GGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           G P++ ++AK++    A+L   YPGQ+GG AIA+VL G+ NP
Sbjct: 687 GSPLNFAWAKDN--ASALLEAWYPGQSGGLAIANVLTGKTNP 726


>gi|227536644|ref|ZP_03966693.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
           33300]
 gi|227243445|gb|EEI93460.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
           33300]
          Length = 777

 Score =  201 bits (511), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 147/452 (32%), Positives = 230/452 (50%), Gaps = 50/452 (11%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGL-QGNTGSRLKVAACCKHYTAYD 70
           + P +++ RDPRW R +E+ GEDPVLTG  AA+ V GL  GN           KH+ AY 
Sbjct: 184 YGPVLDLSRDPRWSRVEESYGEDPVLTGTLAAAIVTGLGSGNLSDPFATIPTLKHFVAYG 243

Query: 71  LDN--WNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
           +     NG       A + +++L + +  PF++ V  G   SVM +YN V+G P  ++  
Sbjct: 244 IPEGGHNGS-----AASIGERELREYFLPPFQSAVAAG-AKSVMAAYNSVDGIPCSSNKF 297

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHT 188
           +L + +  +W  +G+ VSD  S+  +  +    +  ++AA  AI+AGLD D G    +  
Sbjct: 298 LLTDILRKEWNFNGFTVSDLGSIEGIKGSHRVAKDHKQAAILAIEAGLDADLGGNAYVRL 357

Query: 189 EGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF--GNLGPRDVCTPAHQQLAL 246
             AV+ G ++E  ++ A++  + ++  +G+F+     +PF       ++V T A+  L+ 
Sbjct: 358 IEAVKQGEVQENSIDQAVSRVLALKFEMGLFE-----KPFVDAKTAKKEVKTEANIALSR 412

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC--GYTTPLQGI 304
           Q A + IVLL+N    LPL   +   +A+IGPN+D    M+G+Y         TT  Q I
Sbjct: 413 QVARESIVLLENKNNILPLR--KDVKIAIIGPNADNIYNMLGDYTAPQPDGAVTTVRQAI 470

Query: 305 SR---YAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG---------------- 345
           S     A+  +  GC  +    N  I AA  AA+Q+D  V V+G                
Sbjct: 471 SARLPKAQVSYVKGC-SIRDTTNSDIPAAVTAAQQSDIIVAVVGGSSARDFKTEYISTGA 529

Query: 346 ---LDQSI----EAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKN 398
               D+S+      E  DR+ L L GRQ EL+ +  K +  P+V++ + G P+++++A  
Sbjct: 530 AVASDKSVSDMESGEGFDRSTLDLLGRQMELL-KALKQTGKPLVVIYIQGRPLNMNWAAT 588

Query: 399 DPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
                A+L   YPGQ GG AIADVLFG  NP 
Sbjct: 589 --HADALLCAWYPGQEGGHAIADVLFGDYNPA 618


>gi|402772447|ref|YP_006591984.1| glycoside hydrolase family 3 domain-containing protein
           [Methylocystis sp. SC2]
 gi|401774467|emb|CCJ07333.1| Glycoside hydrolase family 3 domain protein [Methylocystis sp. SC2]
          Length = 709

 Score =  201 bits (511), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 157/430 (36%), Positives = 227/430 (52%), Gaps = 33/430 (7%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHYTAYD 70
           ++P +++ RDPRWGR  E PGEDP++  ++A + +RG QG+  SR+  +AA  KH+ A  
Sbjct: 125 FAPMLDVSRDPRWGRIAEGPGEDPLVGERFAQAKIRGFQGDDFSRIFPLAATAKHFCA-- 182

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
                   R +    VS++ L + Y  PF+A  VE   A+VM ++N VNG P  A   +L
Sbjct: 183 -GGAATAGRDYAAVDVSERALHEVYLPPFRAA-VEAGCAAVMTAFNNVNGVPMSAQGRLL 240

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE-EAAADAIKAGLDLD--CGPFLAIH 187
              +  + + DG I+SD  ++  L   +H       EAAA A+KAG+D+D   G +LA H
Sbjct: 241 NGYLRSRLKFDGVIMSDYTAIAEL--VEHGVAADSIEAAAIALKAGVDMDMVSGVYLA-H 297

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              A+  GL+ E D+++A+A  + ++ RLG+FD      P+      D     H  LAL+
Sbjct: 298 LPEALARGLVEERDIDVAVARVLQLKERLGLFD-----DPYRVARLDDSVKETHCILALE 352

Query: 248 AAHQGIVLLKNSARTLPLST-LRHHTVAVIGPNSDVTVTMIGNY--AGVACGYTTPLQGI 304
           AA +   LL N    LPLS  LR   +AV+GP +D    M+G +  AG A    T L+G+
Sbjct: 353 AARRSATLLINRG-LLPLSANLRR--IAVVGPLADAGADMLGPWSAAGGAENCVTILEGL 409

Query: 305 SR---YAKTIHQAGCFGVACNGNQL--IGAAEVAARQADATVLVMGLDQSIEAEFIDRAG 359
            R   Y +  H +   GVA  G+    I  A   A  ADA VL +G    +  E   RA 
Sbjct: 410 RRALPYCEIAHHS---GVAIEGDDESGIAPAFALAEGADAVVLCLGEAARMSGEAACRAS 466

Query: 360 LLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAI 419
           L LPG+Q++L  RV  A   P V +L CG P+ V +     R  A++   + G   G AI
Sbjct: 467 LGLPGKQRQLAERVMSAG-APTVALLSCGRPLAVPWLFK--RAQAVVAAWFLGHRAGEAI 523

Query: 420 ADVLFGRANP 429
           AD+L GR NP
Sbjct: 524 ADILTGRFNP 533


>gi|393781221|ref|ZP_10369422.1| hypothetical protein HMPREF1071_00290 [Bacteroides salyersiae
           CL02T12C01]
 gi|392677556|gb|EIY70973.1| hypothetical protein HMPREF1071_00290 [Bacteroides salyersiae
           CL02T12C01]
          Length = 946

 Score =  201 bits (511), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 146/448 (32%), Positives = 221/448 (49%), Gaps = 32/448 (7%)

Query: 6   MAGLT-YWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           M G T  ++P +++ RD RWGR +E  GE P L  +     VRG+Q N     +VAA  K
Sbjct: 204 MLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH----QVAATGK 259

Query: 65  HYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           H+ AY  +          + ++S +++E  +  PFK  + E  +  VM SYN  +G P  
Sbjct: 260 HFIAYSNNKGAREGMARVDPQMSPREVEMIHVYPFKRVIQEAGLLGVMSSYNDYDGFPIQ 319

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG--- 181
           +    L   + GQ    GY+VSD D+V  LY      +  +EA   +++AGL++ C    
Sbjct: 320 SSYYWLTTRLRGQMGFRGYVVSDSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRS 379

Query: 182 -PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
                +     V+ G L EE +N  +   + V+  +G+FD        G     +V    
Sbjct: 380 PDSYVLPLRELVQEGGLSEEVINDRVRDILRVKFLVGLFDAPYQTDLKG--ADDEVEKEE 437

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           ++ +ALQA+ + IVLLKN   TLPL       +AV GPN+      + +Y  +A   TT 
Sbjct: 438 NEAVALQASRESIVLLKNENNTLPLDITSVKKIAVCGPNAAEKAYALTHYGPLAVEVTTV 497

Query: 301 LQGISR----YAKTIHQAGC--------------FGVACNGNQLIGAAEVAARQADATVL 342
           + G+       A+ ++  GC              + ++ +    I  A   A++AD  V+
Sbjct: 498 VDGLREKLNGKAEVLYTKGCDLVDAHWPESEIIDYPLSKDEQSEIDKAVAQAQEADVAVV 557

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           V+G  Q    E   R+ L LPGRQ +L+  V +A+  PV+LVL+ G P+ V++A  D  +
Sbjct: 558 VLGGGQRTCGENKSRSSLDLPGRQLDLLKAV-QATGKPVILVLINGRPLSVNWA--DKFV 614

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            AIL   YPG  GG AIADVLFG  NPG
Sbjct: 615 PAILEAWYPGSKGGTAIADVLFGDYNPG 642


>gi|189464310|ref|ZP_03013095.1| hypothetical protein BACINT_00651 [Bacteroides intestinalis DSM
           17393]
 gi|189438100|gb|EDV07085.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 864

 Score =  201 bits (510), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 116/305 (38%), Positives = 175/305 (57%), Gaps = 21/305 (6%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LT+WSP VN+ RDPRWGR  ET GEDP L+G    ++V+GLQG+    LK+ +  KH+ A
Sbjct: 136 LTFWSPTVNMARDPRWGRTPETYGEDPYLSGVMGTAFVKGLQGDDDRYLKIVSTPKHFAA 195

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
            + ++    +R+  N ++S++ L + Y   F+ACV +GK AS+M +YN +N  P   +  
Sbjct: 196 NNEEH----NRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAW 251

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG------P 182
           +L   +   W   GY+VSDC    +L N   Y +T E AA  +IKAGLDL+CG      P
Sbjct: 252 LLTKVLREDWGFKGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIKAGLDLECGDDVFDEP 311

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFD-GEPSAQPFGNLGPRDVCTPAH 241
            L+     A R  ++   D++ A    +  +M+LG+FD GE +  P+  + P  V +  H
Sbjct: 312 LLS-----AYRQYMVTNADIDSAAYRVLRARMQLGLFDSGEKN--PYTKISPAVVGSAKH 364

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV-ACGYTTP 300
           Q++AL AA + IVLLKN  + LPL+  +  ++AV+G N+       G+Y+G       + 
Sbjct: 365 QEVALNAARECIVLLKNQKKMLPLNAKKVKSIAVVGINA--GNCEFGDYSGSPVIAPISV 422

Query: 301 LQGIS 305
           LQGI 
Sbjct: 423 LQGIK 427



 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 41/105 (39%), Positives = 63/105 (60%), Gaps = 3/105 (2%)

Query: 326 LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           L G A  A R+ +  V V+G+++SIE E  DR  + LP  Q E +  + K +   +V+VL
Sbjct: 591 LYGEAGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQMEFLQEIYKVNPN-IVVVL 649

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           + G  + V++   D  + AI+   YPG++GG A+A+VLFG  NPG
Sbjct: 650 VAGSSLAVNWM--DEHVPAIVNAWYPGESGGKAVAEVLFGDYNPG 692


>gi|404406439|ref|ZP_10998023.1| glycoside hydrolase 3 [Alistipes sp. JC136]
          Length = 925

 Score =  201 bits (510), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 140/448 (31%), Positives = 224/448 (50%), Gaps = 45/448 (10%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL 71
           ++P +++ RD RWGR +E  GE P L  +   +   G+Q +     +VA+  KH+ AY  
Sbjct: 218 YAPVLDVGRDQRWGRYEEVFGESPYLVAELGVAMASGMQTD----YQVASTAKHFAAYSN 273

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
           +          + ++  +++E+ + +PF+  +    +  VM SYN  +G P       L 
Sbjct: 274 NKGAREGMSRVDPQMPPREVENIHLMPFREVIRRAGILGVMSSYNDYDGVPIQGSRYWLT 333

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEGA 191
             + G+    GY+VSD  SV  L+N  H      +A   +I+AGL++ C  +   H E  
Sbjct: 334 ERLRGEMGFRGYVVSDSGSVEYLHNKHHTAVNQLDAVRQSIEAGLNVRCNFW---HPETY 390

Query: 192 V-------RGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGN---LGPRDVCTPAH 241
           V       R GL+ EE ++  +   + V+  +G+FD     +P+        R+V  P H
Sbjct: 391 VMPLRQLLREGLITEELLDSRVRDVLRVKFLVGLFD-----RPYQTDLAAADREVDGPEH 445

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
            ++ALQA+ + IVLLKN   TLPL   +   +AV+GPN+D     +G+Y  +A   T+ L
Sbjct: 446 NEVALQASRESIVLLKNENSTLPLDARKIRRIAVLGPNADARGFALGHYGPLAVEVTSVL 505

Query: 302 QGISR----YAKTIHQAGC------------FGVACNGNQLIG---AAEVAARQADATVL 342
            G+ R      + +++ GC            F       +  G   AAE AA ++D  V+
Sbjct: 506 DGLKRNLGARCEIVYEKGCELVDAAWPLSEIFREEMTPEEKAGIRRAAE-AASESDVAVV 564

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           V+G       E   R+ L LPGRQ+EL+ R  +A+  P VLV++ G P  +++A  D  +
Sbjct: 565 VLGGGSRTCGENCSRSSLDLPGRQEELL-RAVEATGKPTVLVMINGRPNSINWA--DAHV 621

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            AI+   YPG  GG A+ +VLFG  NPG
Sbjct: 622 DAIVEAWYPGAHGGQAVYEVLFGEYNPG 649


>gi|427383551|ref|ZP_18880271.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
           12058]
 gi|425728735|gb|EKU91590.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
           12058]
          Length = 939

 Score =  201 bits (510), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 145/448 (32%), Positives = 222/448 (49%), Gaps = 32/448 (7%)

Query: 6   MAGLT-YWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           M G T  ++P +++ RD RWGR +E  GE P L  +     VRG+Q N     +VAA  K
Sbjct: 201 MLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH----QVAATGK 256

Query: 65  HYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           H+ AY  +          + ++S +++E  +  PFK  + E  +  VM SYN  +G P  
Sbjct: 257 HFVAYSNNKGAREGMARVDPQMSPREVEMIHVYPFKRVIKEAGMLGVMSSYNDYDGIPIQ 316

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG--- 181
                L   + G+    GY+VSD D+V  LY      +  +EA   +++AGL++ C    
Sbjct: 317 GSYYWLTKRLRGEMGFRGYVVSDSDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRCTFRS 376

Query: 182 -PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
                +     V+ G L E+ +N  +   + V+  +G+FD        G    ++V    
Sbjct: 377 PDSYVLPLRELVKEGGLSEDIINDRVRDILRVKFLIGLFDAPYQTDLAG--ADKEVEKAE 434

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           ++ +ALQA+ + ++LLKN    LPL      T+AV GPN++     + +Y  +A    T 
Sbjct: 435 NEAVALQASRESLILLKNENNVLPLDINNIKTIAVCGPNANEEGYALTHYGPLAVEVITV 494

Query: 301 LQGISR----YAKTIHQAGCFGVACNGNQ--------------LIGAAEVAARQADATVL 342
           L+GI +     A+ ++  GC  V  N  +               I  A   AR+AD  V+
Sbjct: 495 LEGIRQKAEGKAEVLYAKGCDLVDANWPESELIEYPMTNEEQAEINKAVENARKADVAVV 554

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           V+G  Q    E   R+ L LPGRQ +L+  V +A+  PVVLVL+ G P+ +++A  D  +
Sbjct: 555 VLGGGQRTCGENKSRSSLDLPGRQLKLLQAV-QATGKPVVLVLINGRPLSINWA--DKFV 611

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            AIL   YPG  GG A+ADVLFG  NPG
Sbjct: 612 PAILETWYPGSKGGTAVADVLFGDYNPG 639


>gi|393787054|ref|ZP_10375186.1| hypothetical protein HMPREF1068_01466 [Bacteroides nordii
           CL02T12C05]
 gi|392658289|gb|EIY51919.1| hypothetical protein HMPREF1068_01466 [Bacteroides nordii
           CL02T12C05]
          Length = 958

 Score =  201 bits (510), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 149/448 (33%), Positives = 221/448 (49%), Gaps = 32/448 (7%)

Query: 6   MAGLT-YWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           M G T  ++P +++ RD RWGR +E  GE P L  +     V+G+Q N     +VAA  K
Sbjct: 216 MLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVKGMQHN----YQVAATGK 271

Query: 65  HYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           H+ AY  +          + ++S +++E  +  PFK  + E  +  VM SYN  +G P  
Sbjct: 272 HFIAYSNNKGAREGMARVDPQMSPREVEMIHVYPFKRVIQEAGLLGVMSSYNDYDGLPVQ 331

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG--- 181
           +    L   + GQ    GY+VSD D+V  LY      +  +EA   +++AGL++ C    
Sbjct: 332 SSYYWLMTRLRGQMGFRGYVVSDSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRS 391

Query: 182 -PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
                +     V+ G L EE +N  +   + V+  +G+FD        G     +V    
Sbjct: 392 PDSYVLPLRELVQEGGLSEEIINDRVRDILRVKFLVGLFDTPYQTDLKG--ADEEVEKEE 449

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           ++ +ALQA+ + IVLLKN    LPL       +AV GPN+D T   + +Y  +A   TT 
Sbjct: 450 NEIVALQASRESIVLLKNDKNALPLDVASIRKIAVCGPNADETAYALTHYGPLAVDVTTV 509

Query: 301 LQGISR----YAKTIHQAGCFGVACN--------------GNQLIGAAEVAARQADATVL 342
           L GI +     A+ ++  GC  V  N                  I  A   A++AD  V+
Sbjct: 510 LSGIRQKVDGKAEVLYTKGCELVDANWPESEIIDYPLTNDEQNKIDKAVAQAKEADVAVV 569

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           V+G  Q    E   R+ L LPGRQ +L+  V +A+  PVVLVL+ G P+ V++A  D  +
Sbjct: 570 VLGGGQRTCGENKSRSSLDLPGRQLDLLKAV-QATGKPVVLVLINGRPLSVNWA--DKFV 626

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            AI+   YPG  GG A+ADVLFG  NPG
Sbjct: 627 PAIIEAWYPGSKGGTAVADVLFGDYNPG 654


>gi|337749758|ref|YP_004643920.1| glycoside hydrolase family protein [Paenibacillus mucilaginosus
           KNP414]
 gi|336300947|gb|AEI44050.1| glycoside hydrolase family 3 domain protein [Paenibacillus
           mucilaginosus KNP414]
          Length = 937

 Score =  201 bits (510), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 122/319 (38%), Positives = 178/319 (55%), Gaps = 18/319 (5%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N  + GLT W+P V++ RDPRWGR +E  GEDP LTG+ +   V+G+QG+    LK AA 
Sbjct: 103 NPEINGLTIWAPTVDMERDPRWGRTEEAYGEDPYLTGRLSTELVKGMQGDHPFYLKTAAT 162

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
            KH+    L N N +DR   +A +  +++ + Y   F+    EG   S+M +YN VNG P
Sbjct: 163 LKHF----LGNNNEMDRGECSASIDPRNMREYYLKAFEPAFREGGAQSMMTAYNSVNGTP 218

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL---D 179
              +PD+    + G+W +DG++VSD   V    N   Y  +  EA A ++++G+D    D
Sbjct: 219 CNLNPDV-NAIVKGEWGMDGFVVSDAGDVLGTVNEHRYFASYAEAVAASVRSGIDSITDD 277

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
            G  L      A+ GGLL EED++ A+     V++RLG FDG P   P+ ++    +C P
Sbjct: 278 AGITLRA-VRDALEGGLLAEEDLDRAVGNAFRVRIRLGEFDG-PEENPYAHVPEAKLCAP 335

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
            H  L+LQAA +G VLLKN    LPL+  +  ++AVIGP +DV  T    Y+G      T
Sbjct: 336 EHAALSLQAAREGFVLLKNEG-LLPLA--KPASIAVIGPLADVVHTDW--YSGTPPYRIT 390

Query: 300 PLQGISRY---AKTIHQAG 315
           PL+G++        IH+ G
Sbjct: 391 PLEGVAERMAPGTVIHRTG 409



 Score = 45.4 bits (106), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 38/104 (36%), Positives = 55/104 (52%), Gaps = 4/104 (3%)

Query: 327 IGAAEVAARQADATVLVMGLDQSIEA-EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           + AA  AARQA+  ++ +G    I   E +DR  L L   Q+ L+  V  A+    V VL
Sbjct: 544 LQAAVEAARQAETAIVFLGNSPFINGKECVDRPDLTLAPAQEALLQAVL-AANPRTVAVL 602

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +   P  V++A+    + AIL+  + GQ  G A ADVLFG  +P
Sbjct: 603 VGSYPFAVNWAQ--AHVPAILYTSHAGQELGRAAADVLFGDCSP 644


>gi|119476117|ref|ZP_01616469.1| periplasmic beta-glucosidase [marine gamma proteobacterium
           HTCC2143]
 gi|119450744|gb|EAW31978.1| periplasmic beta-glucosidase [marine gamma proteobacterium
           HTCC2143]
          Length = 748

 Score =  201 bits (510), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 141/427 (33%), Positives = 230/427 (53%), Gaps = 22/427 (5%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHYTAYD 70
           ++P ++I RDPRWGR  E+ GEDP L G+  A+ VRG QG   S +  +AAC KH+  Y 
Sbjct: 138 FAPMIDITRDPRWGRIAESLGEDPYLCGELGAAMVRGFQGKDLSAIGSIAACAKHFAGYG 197

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
                GVD  +  A +++ +L + Y  PFKA +  G VAS M ++N +NG P   +  +L
Sbjct: 198 AAE-GGVD--YNTAIIAENELRNVYLPPFKAALDSG-VASFMTAFNDLNGVPASGNEFLL 253

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQH-YTRTPEEAAADAIKAGLDLD-CGPFLAIHT 188
           K  +  +W   G +VSD +S+  L  T+H +T   +EAA +A  AG+D++      + H 
Sbjct: 254 KQILREEWCYQGMVVSDWESIVQL--TEHGFTANDKEAAFEAANAGIDMEMVSNTYSQHL 311

Query: 189 EGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQA 248
           E  +  G +    V+  +   + ++ RLG+F+  P  QP  +  P  +    H+Q A + 
Sbjct: 312 ESLIIEGRISLAQVDEMVKNILRLKFRLGLFEN-PYPQP--DKLPA-LVNHDHRQAAKKL 367

Query: 249 AHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGN--YAGVACGYTTPLQGISR 306
           A + +VLLKNS ++LPL      ++A+IGP +D     +G   + G A    T LQ I+ 
Sbjct: 368 ALESVVLLKNSHQSLPLRLSALSSIALIGPLADDAYEQLGTWIFDGDADDSETVLQAINA 427

Query: 307 YA---KTIHQAGCFGVACNGNQL-IGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLL 362
           +A    T++         +   + I     AA+ +DA VL +G +  +  E   RA + L
Sbjct: 428 FAGDSLTVNVDRALETTRSNTFIDIDRTMAAAQSSDAIVLCLGEESILSGEAHSRADISL 487

Query: 363 PGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADV 422
           PG Q++L+  +AK ++ P++L++M G P+ +    +   + AIL+  +PG   G A+ D+
Sbjct: 488 PGAQEQLIHLLAKTAK-PMILIVMAGRPLTLEPIID--HVDAILYAWHPGTMAGTALTDL 544

Query: 423 LFGRANP 429
           LFG  +P
Sbjct: 545 LFGEVSP 551


>gi|313204103|ref|YP_004042760.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312443419|gb|ADQ79775.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 1278

 Score =  201 bits (510), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 111/307 (36%), Positives = 171/307 (55%), Gaps = 15/307 (4%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           LTYWSP +   RDPRWGR  ET GEDP L  +  + +V+GL G+  + LK   C KHY A
Sbjct: 142 LTYWSPVIEPARDPRWGRTAETFGEDPFLVSQIGSGFVQGLMGDDPTYLKTVPCGKHYFA 201

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
               N +  +R++ +A +  +D+ + Y  P++  + + K+ S+M +Y+ VNG P  A   
Sbjct: 202 ----NNSEFNRHNGSANMDDRDMREFYLTPYRTLIQKDKLPSIMTAYSAVNGVPMSASKF 257

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHT 188
           ++       + LDGY+  DCD+V  + N+  Y ++  EAAA  +K G+D DCG       
Sbjct: 258 LVDTIAKRTYGLDGYVTGDCDAVADVVNSHRYAKSKAEAAAMGLKTGVDSDCGGIYQTSA 317

Query: 189 EGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQA 248
             A++ GL+ E D++ AL    T++MRLG FD + +  P+  + P  +  P+H  LAL+ 
Sbjct: 318 LEALKQGLISEADMDKALVNIYTIRMRLGEFDPQ-NIVPYAGIKPSIINDPSHNDLALEI 376

Query: 249 AHQGIVLLKNS------ARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA--CGYTTP 300
           A +  VLLKN+       + LPL+      +AV+GP +D     +G+Y+G A      TP
Sbjct: 377 ATKSPVLLKNNLVGKSGKKALPLNAGTIKKIAVLGPQAD--KVELGDYSGEADPKYKITP 434

Query: 301 LQGISRY 307
           L+GI  Y
Sbjct: 435 LEGIKNY 441



 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 41/97 (42%), Positives = 56/97 (57%), Gaps = 1/97 (1%)

Query: 334 ARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDV 393
           A  AD  V+ +G DQ+   E  DR  + LPG Q EL+  +A  +   +V V+   G V+V
Sbjct: 612 AASADVAVVFVGTDQTTGREESDRFAITLPGNQNELIKSIAAVNPNTIV-VIQGMGMVEV 670

Query: 394 SFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
              KN+P +  I++ GY GQA G A+A VLFG  NPG
Sbjct: 671 EQFKNNPNVAGIIFTGYNGQAQGTAMAKVLFGDVNPG 707


>gi|301096878|ref|XP_002897535.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
 gi|262106995|gb|EEY65047.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
          Length = 537

 Score =  201 bits (510), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 137/388 (35%), Positives = 197/388 (50%), Gaps = 41/388 (10%)

Query: 57  LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYN 116
           L+     KHY AY  +N+ G +R  F+A VS  D  DTY   F++ +V+G    VMCSYN
Sbjct: 157 LQAVVTLKHYIAYSYENYGGGNRKEFDAIVSPYDFADTYFPAFRSSIVDGNATGVMCSYN 216

Query: 117 QVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGL 176
            +NG PTCA+                   +D  ++  + +  HY  T  EAA  AI AG 
Sbjct: 217 SINGVPTCANN------------------ADSGAIEAISDRHHYVATRCEAARIAILAGT 258

Query: 177 DLDCGPFLAIHT--EGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPR 234
           D++ G            VR   L E+ ++ AL +T+ ++  LG+FD     QP+ ++ P 
Sbjct: 259 DVNSGRLFGYMKCLPELVRSNQLEEKALDDALRHTLKLRFELGLFD-PIDDQPYWHVKPS 317

Query: 235 DVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA 294
           DV TP  +QLAL  A + IVLL+N+   LPL   +   +AV+GP+++    ++GNY G  
Sbjct: 318 DVNTPKSRQLALNLATKSIVLLQNNNSVLPLR--KGTKLAVLGPHAESKRGLLGNYLGQM 375

Query: 295 C--GYT------TPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEVAAR---QADATVLV 343
           C  GY+      TP++ +S            G   +GN   G  E   +   +ADA VL 
Sbjct: 376 CHGGYSEVGCIQTPMEAVSTTNGAATSTYALGCNISGNSTDGFEETKVKVTQEADAVVLF 435

Query: 344 MGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVS--FAKNDPR 401
           +G+D SIE E  DR  + LP  Q +L+ RV +A   P V+VL+ GG +       + D  
Sbjct: 436 LGIDTSIEEEVNDRNDIKLPSIQMQLLQRV-RAVGKPTVVVLINGGVLGAEDLILQTDAL 494

Query: 402 IGAILWVGYPGQAGGAAIADVLFGRANP 429
           + A     YPG  G  A+ADVLFG ANP
Sbjct: 495 VEAF----YPGFFGAQAMADVLFGDANP 518


>gi|390945417|ref|YP_006409177.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
           17242]
 gi|390421986|gb|AFL76492.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
           17242]
          Length = 771

 Score =  200 bits (509), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 145/450 (32%), Positives = 224/450 (49%), Gaps = 49/450 (10%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGL-QGNTGSRLKVAACCKHYTAYD 70
           + P ++I RDPRW R +E+ GED  LT +   +YVRG   G+        +  KH+ AY 
Sbjct: 178 YGPVLDIVRDPRWSRTEESYGEDCYLTARIGEAYVRGTGSGDLSQSRHALSTLKHFIAYG 237

Query: 71  LDN--WNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
                 NG         + +++L +TY  PF+A V  G   SVM +YN V+G P  A+  
Sbjct: 238 ASEGGQNGGSNL-----LGERELRETYLPPFEAAVKAG-ARSVMTAYNSVDGIPCTANRR 291

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC--GPFLAI 186
           +L + + G+W  DG++VSD  S+  L+ T     +  EAA  A++AG+D D   G F ++
Sbjct: 292 MLTDILRGEWGFDGFVVSDLLSIEGLHETHGVAGSVREAAVQALRAGVDADLKGGAFASL 351

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF-GNLGPRDVCTPAHQQLA 245
               A   G + E +++ A+   + ++  +G+F+      P+       +V   AH +LA
Sbjct: 352 RE--AAEAGDVAEAEIDRAVERVLALKFEMGLFE-----NPYIDEAAAAEVGCAAHSELA 404

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           L+AA Q + LL+N + TLPL   R   VAVIGPN+D     +G+Y        T   G+ 
Sbjct: 405 LEAARQSVTLLENRSGTLPLDPRRLRRVAVIGPNADNIYNQLGDYTAQQTAANTVRDGLE 464

Query: 306 RYA---KTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLD--QSIEAEFI----- 355
           +     + ++  GC  V       I AA  AAR  DA V+V+G    +  + EF+     
Sbjct: 465 KLLGRDRVVYSRGCT-VRGGDRSEIAAAVSAARGTDAAVVVIGGSSARDFDTEFLQTGAA 523

Query: 356 ----------------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKND 399
                           DRA L L G Q+EL+ R+ KA+  P+++V + G P+D+  A   
Sbjct: 524 KAAHDEVRDMECGEGFDRATLALLGEQEELLRRI-KATGTPLIVVCIAGRPLDLRRASE- 581

Query: 400 PRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            +  A+L   YPG  GG A+A+ + G  NP
Sbjct: 582 -QADALLMAWYPGARGGDAVAETILGHNNP 610


>gi|313203744|ref|YP_004042401.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312443060|gb|ADQ79416.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 1286

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 120/352 (34%), Positives = 182/352 (51%), Gaps = 24/352 (6%)

Query: 5   GMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           G  GLTYWSP V   RDPRWGR  E+ GEDP L  + A  +VRG+ GN  + LK   C K
Sbjct: 134 GTKGLTYWSPVVEPIRDPRWGRTGESYGEDPFLAAEIAGGFVRGMVGNDPTYLKSVPCAK 193

Query: 65  HYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           HY A    N +  DR+  ++ +  +D+ + Y  P+K  + +  + S+M SYN VNG PT 
Sbjct: 194 HYFA----NNSEFDRHVSSSNMDSRDMREFYLAPYKKLIEQDNLPSIMSSYNAVNGVPTS 249

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFL 184
           A    L       + L GYI  DC ++  +Y   +Y +T EEA A  +KAG+D DCG   
Sbjct: 250 ASQLYLDTIARRTYGLKGYITGDCAAIEDIYTGHYYVKTAEEATAKGLKAGVDSDCGSIY 309

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
             +   A++ GL+   D++ AL     V+MR G FD  P+   +    P  V +PA++ L
Sbjct: 310 QRYAIAALKKGLITMADIDRALLNIFIVRMRTGEFD-PPAKVLYAQFQPNIVNSPANKAL 368

Query: 245 ALQAAHQGIVLLKNSA------RTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC--G 296
           A + A +  VLLKN+       + LPL+      +A+IGP++D     +G Y+G      
Sbjct: 369 AKEIATKTPVLLKNNISLKTNRKALPLNPADLKKIALIGPHAD--KVELGPYSGRPAQEN 426

Query: 297 YTTPLQGISRY-------AKTIHQAGCFGVACNGNQLIGAAEVAARQADATV 341
             TP  GI +Y        + +H +G  G + + + L+       ++++ +V
Sbjct: 427 MITPFAGIKKYILERGLSTEVLHSSG--GNSTSKSNLLYVVAFELKKSNGSV 476



 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 42/86 (48%), Positives = 49/86 (56%), Gaps = 1/86 (1%)

Query: 345 GLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGA 404
           G D+    E  DR  LLLPG Q EL+  VA  +   +V V+   G V+V   KN   I  
Sbjct: 619 GTDEKTATEEADRLTLLLPGNQVELIKAVAAVNPNTIV-VMQTLGCVEVEEFKNLQNIPG 677

Query: 405 ILWVGYPGQAGGAAIADVLFGRANPG 430
           I+WVGY GQA G AIA VLFG  NPG
Sbjct: 678 IIWVGYNGQAQGDAIASVLFGEVNPG 703


>gi|373952814|ref|ZP_09612774.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373889414|gb|EHQ25311.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 862

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 113/313 (36%), Positives = 171/313 (54%), Gaps = 10/313 (3%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHYT 67
           L+ W+PNVNIFRDPRWGRGQET GEDP LT +   S V+GLQG   ++  K+ AC KHY 
Sbjct: 130 LSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGPADAKYRKLLACAKHYA 189

Query: 68  AYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
            +    W+   R+  N   V+ +DL +TY   FK+ V +  V  VMC+Y +++ +P C +
Sbjct: 190 VHSGPEWS---RHEMNVTDVTPRDLWETYLPAFKSLVQDADVREVMCAYQRLDDEPCCGN 246

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAI 186
             +L   +   W     +VSDC ++   YN+ H +     A+A A+ +G D++C  +   
Sbjct: 247 SRLLGQILREDWGFKYLVVSDCGAITDFYNSHHSSSDATHASAKAVLSGTDVECVGYAFD 306

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLAL 246
               AV  GL++E+D+N ++   +T +  LG  D +    P+  +    V +  HQ+LAL
Sbjct: 307 KIPDAVYRGLIKEKDINTSVVRLMTQRFELGEMDKD-ELVPWTKIPLSVVNSEDHQKLAL 365

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISR 306
             A + + LL+N+   LPLS      +AVIGPN++ +  + GNY G        L+GI  
Sbjct: 366 DMARETMTLLQNNNNILPLSK-SIGKLAVIGPNANDSQMLSGNYNGTPLRTINILEGIKT 424

Query: 307 YA---KTIHQAGC 316
                  I+ AGC
Sbjct: 425 KLGADHVIYDAGC 437



 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 49/105 (46%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  V V G+   +E E +          DR  + LP  Q+  +  + KA  G  ++ 
Sbjct: 597 KDADIVVFVGGISPKLEGEEMPVQLPGFKGGDRTDIELPAVQRNCIEALRKA--GKKIVF 654

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           + C G   ++         AIL   Y G++GG A+ADVLFG  NP
Sbjct: 655 VNCSGSA-IAMVPETQNCDAILQAWYAGESGGQAVADVLFGDYNP 698


>gi|295135996|ref|YP_003586672.1| beta-glucosidase [Zunongwangia profunda SM-A87]
 gi|294984011|gb|ADF54476.1| putative beta-glucosidase [Zunongwangia profunda SM-A87]
          Length = 796

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 155/467 (33%), Positives = 223/467 (47%), Gaps = 68/467 (14%)

Query: 10  TYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLK----VAACCKH 65
           T + P ++I R+PRW R +ET GEDP L  +   S V G QG+  S LK    VAA  KH
Sbjct: 182 TAYGPIIDIAREPRWSRVEETFGEDPYLIAEMGKSMVTGFQGSHESDLKSNEHVAATLKH 241

Query: 66  YTAYDLDNWNGVDRYHFN---ARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
           + AY      GV     N     + ++DL   Y  P K  V  G V SVM +Y+ ++G P
Sbjct: 242 FAAY------GVSEGGHNGAAVHIGQRDLFQNYMYPVKEAVDNG-VMSVMTAYSSIDGVP 294

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG- 181
           + A  ++L N +  +W   G+++SD  S+  L    H   T E+AAA A+ AG+D+D G 
Sbjct: 295 STAHKNLLTNILKEKWGFKGFVISDLASIEGLLGDHHIVDTEEDAAAMAMNAGVDVDLGG 354

Query: 182 -----PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGN--LGPR 234
                  +      AV  G + EE ++ A+   +TV+ +LG+F+      P+ N     +
Sbjct: 355 NGYDDALI-----DAVNAGKVAEERIDEAVRRILTVKFKLGLFEN-----PYANEKQAEK 404

Query: 235 DVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA 294
            V    H +LA + A Q I +LKN    LPL+      +AVIG N+D+    +G+Y    
Sbjct: 405 IVRNSEHIELAREVARQSITMLKNEDNILPLNK-ELQNIAVIGSNADMQYNQLGDYTAPQ 463

Query: 295 C--GYTTPLQGISRYAKTIHQAGCFGVACNGNQL--IGAAEVAARQADATVLVMG----- 345
                 T L+GI       +     G A        I AA  AA+ A+  ++V+G     
Sbjct: 464 SEENIITVLEGIQHKMPNANIEYVKGTAVRDTTQTNIPAAVEAAKNAEVAIVVLGGSSAR 523

Query: 346 ---------------------LDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLV 384
                                L      E  DR+ L L G+Q EL+  V  A+  P VLV
Sbjct: 524 DFKTEYLETGAATISSKEDQVLSDMESGEGYDRSTLNLMGKQLELLQAVV-ATGTPTVLV 582

Query: 385 LMCGGPVDVSF-AKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           L+ G P+ +++ A+N P    IL   YPGQ GG+AIADV+FG  NP 
Sbjct: 583 LIKGRPLLLNWPAENVP---VILDAWYPGQEGGSAIADVIFGDFNPA 626


>gi|299140913|ref|ZP_07034051.1| periplasmic beta-glucosidase [Prevotella oris C735]
 gi|298577879|gb|EFI49747.1| periplasmic beta-glucosidase [Prevotella oris C735]
          Length = 767

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 147/444 (33%), Positives = 219/444 (49%), Gaps = 40/444 (9%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL 71
           ++P +++ RDPRWGR  E   E P L G+     V GLQ N     ++ +  KH+  Y L
Sbjct: 157 YAPILDVSRDPRWGRVVECYSESPYLAGELGKQMVLGLQEN-----RIVSTPKHFAVYSL 211

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
                 +    +  V+ ++++     PF+  + EG    VM SYN  +G+P    P  L 
Sbjct: 212 PVGGRDEGTRTDPHVAPKEMKTLLLEPFRKAIQEGGALGVMSSYNDYDGEPITGSPYFLT 271

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG----PFLAIH 187
             +  QW   GY+VSD ++V  L +  H     EE AA AI AGLD+           + 
Sbjct: 272 ELLRHQWGFHGYVVSDSEAVEFLSSKHHVAANREEGAAMAINAGLDVRTNFSMPETFILP 331

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF-GNLGPRD--VCTPAHQQL 244
              A+  GL+  + ++  +   + V+  LG+FD      P+ GN+   D  V + AHQQL
Sbjct: 332 LRQALTDGLVSMQILDARVKDVLYVKFWLGLFD-----NPYRGNVNEVDQVVHSKAHQQL 386

Query: 245 ALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           +L+AA + IVLLKN    LPLS +L+   +AVIGPN+D T   +  Y        + L G
Sbjct: 387 SLRAALESIVLLKNENNLLPLSKSLKR--IAVIGPNADATTAHVCRYGPANAPIKSVLSG 444

Query: 304 IS--------RYA-------KTIHQAGCFGVACNGNQ--LIGAAEVAARQADATVLVMGL 346
           I         RYA       K   ++  + VA +  +  +I  A   ARQ+D  V+V+G 
Sbjct: 445 IRESMPGAEVRYAKGCSIVDKHFPESELYEVALDTTEQRMIDEAVGVARQSDVAVVVLGG 504

Query: 347 DQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
            +    E   R  L L GRQ++L+ R   A+  PVVLVL+ G    +++A     + AI+
Sbjct: 505 SEETVREEYSRTDLNLMGRQEQLL-RAVYATGKPVVLVLLDGRAATINWANQ--YVPAIV 561

Query: 407 WVGYPGQAGGAAIADVLFGRANPG 430
              +PG+  G A+A VLFG  NPG
Sbjct: 562 HGWFPGEFTGTAVAKVLFGDYNPG 585


>gi|358342292|dbj|GAA27551.2| probable beta-D-xylosidase 7 [Clonorchis sinensis]
          Length = 826

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 142/466 (30%), Positives = 227/466 (48%), Gaps = 52/466 (11%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG------------NTGS 55
           G+  ++P VN+ R P WGR QET GEDP + GK A ++VRGL G            N  S
Sbjct: 137 GIHLFAPVVNLLRHPLWGRNQETFGEDPFMIGKLARTFVRGLGGWKNAEPQSLDEQNLSS 196

Query: 56  R---LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVM 112
           +   L V A CKH+  +       V R  F A V+  DL  TY   F+AC+  G V SVM
Sbjct: 197 QPDVLLVGANCKHFAVHTGPEDFPVSRLSFEANVTDVDLWQTYLPAFRACLEAGAV-SVM 255

Query: 113 CSYNQVNGKPTCADPDILKNTIHGQWRLDGYIVSDCDSVG-VLYNTQHYTRTPEEAAADA 171
           C+Y+ +NG P C +  +L   +  +W+  G++V+DC ++  V++  Q +    E A A A
Sbjct: 256 CAYSGINGTPDCINHWLLTELLRQKWKFKGFVVTDCGALQFVIWKHQIFNHYNETAMA-A 314

Query: 172 IKAGLDLDCGPFLAIHTEGAV-----RGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQ 226
           ++AG++L+     A      +      G L R++ + +A    +T  M+ G F+      
Sbjct: 315 VRAGVNLENSVVYATEVFSTLPHLLASGSLSRDQLIEMARPLFLTRLMQ-GEFN-PVEMD 372

Query: 227 PFGNLGPRD-VCTPAHQQLALQAAHQGIVLLKNSARTLPLST--------LRHHTVAVIG 277
           P+  L P + +    H+++AL    + IVLL+N  R LPL          LRH  +A++G
Sbjct: 373 PYRLLAPEEAILNEDHRRVALATTARSIVLLQNRDRFLPLKNNMSDSGGPLRH--IAIVG 430

Query: 278 PNSDVTVTMIGNY-AGVACGYTTPL-QGISRYAKTIHQAGCF---GVACNGNQLIGAAEV 332
           P +     + G+Y          PL +G+S+ ++ +H +      G   + N     + +
Sbjct: 431 PFATSVTELYGHYRTAPEPEIEVPLSKGLSQLSRRMHASDICTDGGRCSSLNDDALHSTL 490

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRG-----------PV 381
                D  VL +G    +E E +DR  + LPG+Q EL+    K S G           P+
Sbjct: 491 GYDDLDLIVLSLGTGSEVEGENVDRQNITLPGKQPELLEETLKLSSGLGNSGLSKRTVPI 550

Query: 382 VLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRA 427
           +L++   GP+++S A  +  + AI W G+PG   G A+  +L G +
Sbjct: 551 ILLVFSAGPINISRAVENENVKAIFWCGFPGPLVGDAMRHLLLGSS 596


>gi|410634080|ref|ZP_11344720.1| beta-glucosidase [Glaciecola arctica BSs20135]
 gi|410146740|dbj|GAC21587.1| beta-glucosidase [Glaciecola arctica BSs20135]
          Length = 772

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 145/430 (33%), Positives = 221/430 (51%), Gaps = 22/430 (5%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR-LKVAACCKHYTAYD 70
           ++P V++ RDPRWGR  E  GED  LT   A + V+G QG+  S+   + A  KH+ AY 
Sbjct: 176 FAPMVDVARDPRWGRISEGSGEDVYLTTAIARARVQGFQGDDLSQPHTILATAKHFAAY- 234

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
                   R +    +S ++L DTY  PFKA V  G V S M S+N++NG P  A+  +L
Sbjct: 235 --GQGQAGRDYHTTDMSDRELRDTYLPPFKAAVDAG-VTSFMTSFNELNGVPASANKYLL 291

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQH-YTRTPEEAAADAIKAGLDLDC-GPFLAIHT 188
            + +  +W  +G++V+D  S+  +   +H + R  + A   A+KAG+D+D  G     + 
Sbjct: 292 TDILRDEWSFEGFVVTDYTSINEM--VKHGFARDNDHAGELAVKAGVDMDMQGSVYFDYL 349

Query: 189 EGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQA 248
              V  G +  + ++ A    + ++ RLG+F+ +P          +++    + Q A   
Sbjct: 350 ANQVTQGKVSPQQIDNAARRILEMKYRLGLFE-DPYRYSNEEREAQEIYKEYNLQAAQDV 408

Query: 249 AHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP---LQGIS 305
           A + +VLLKN  + LPLS     T+AVIGP +D    +IG+++     Y  P   L GI 
Sbjct: 409 ARKSMVLLKNENQQLPLSK-SDLTIAVIGPLADSKEDLIGSWSAAGDRYEKPITLLTGIK 467

Query: 306 RY----AKTIHQAGC-FGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGL 360
                 +K ++  G  +  +   N    AA   A++AD  VL MG    +  E   R  L
Sbjct: 468 AKVADPSKVLYAKGASYEFSHQDNSGFEAAIAIAKKADVIVLAMGEKWDMTGEATSRTSL 527

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
             PG Q  L+ ++ K ++ P+VLVLM G P+ + +A  D  + AIL   YPG  GG AIA
Sbjct: 528 DFPGNQLALMQQLKKLAK-PMVLVLMNGRPMTIEWA--DQNVDAILEAWYPGTMGGPAIA 584

Query: 421 DVLFGRANPG 430
           DVLFG  NP 
Sbjct: 585 DVLFGDYNPS 594


>gi|189464498|ref|ZP_03013283.1| hypothetical protein BACINT_00840 [Bacteroides intestinalis DSM
           17393]
 gi|189438288|gb|EDV07273.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 862

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 122/317 (38%), Positives = 174/317 (54%), Gaps = 14/317 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHY 66
           GLT W+P VNI+RDPRWGRG ET GEDP LT +     V+GLQG   G+  K+ AC KH+
Sbjct: 127 GLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPADGNYDKLHACAKHF 186

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  +  +DL +TY  PF+A V EG+V  VMC+YN+  G+P C 
Sbjct: 187 AVHSGPEWN---RHSFNAENIKPRDLYETYLPPFEALVKEGEVKEVMCAYNRFEGEPCCG 243

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE--EAAADAIKAGLDLDCGPF 183
              +L   +  +W  DG +VSDC ++   YN + +   P+   A+A A+ +G DL+CG  
Sbjct: 244 SNRLLMQILRDEWGFDGIVVSDCGAIADFYNDRGHHTHPDAESASAAAVVSGTDLECGSS 303

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
                E +V+ GL+ EE VN ++   +  +  LG  D +     +  +    V + AH  
Sbjct: 304 YKALVE-SVKKGLISEEKVNTSVRRLLKARFELGEMD-DLEKVSWAKIPFSVVASAAHDS 361

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           LAL+ A + + LL N    LPL      TVAV+GPN++ +V   GNY G+     T L G
Sbjct: 362 LALKIARKSMTLLMNKDNFLPLRR-GGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILDG 420

Query: 304 ISRYA----KTIHQAGC 316
           +        K I++ GC
Sbjct: 421 VRNLLGADDKLIYEQGC 437



 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           ++AD  +   G+  S+E E +          DR  + LP  Q+EL+  + +A +  +VLV
Sbjct: 597 KEADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQRELIDALYRAGK-KIVLV 655

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G P+         +  AIL   YPGQ GG A+A+VLFG  NP
Sbjct: 656 NCSGSPI--GLEPETKKCEAILQAWYPGQQGGTAVAEVLFGDYNP 698


>gi|260598593|ref|YP_003211164.1| beta-D-glucoside glucohydrolase [Cronobacter turicensis z3032]
 gi|260217770|emb|CBA32207.1| Periplasmic beta-glucosidase [Cronobacter turicensis z3032]
          Length = 765

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 223/457 (48%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGRG E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVTRDPRWGRGSEGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+K  +  G    VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ + G W   G  +SD  ++  L   +H T + PE+A   AIK+G+D+     + 
Sbjct: 267 GWLLKDLLRGDWGFKGITISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD-------VC 237
           + +    ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP+D         
Sbjct: 325 SKYLPNLIKSGKVSMEELDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAE 379

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
           +  H+  A   A + +VLLKN   TLPL   +  T+AV+GP +D    M+G++  AGVA 
Sbjct: 380 SRLHRDDARNVARESLVLLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVAD 437

Query: 296 GYTTPLQGISRYA----KTIHQAGCFGVACNG-------------------NQLIGAAEV 332
              T LQG+   A    K +   G       G                    ++I  A  
Sbjct: 438 QSVTLLQGMKNVAGDKAKILFAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVN 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  L LP  Q++L+S + KA+  P+VLVLM G P+ 
Sbjct: 498 VAKQSDVVVAVVGEAQGMAHEASSRTDLTLPQSQRDLISAL-KATGKPLVLVLMNGRPL- 555

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            +  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 556 -ALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|294146655|ref|YP_003559321.1| putative beta-glucosidase [Sphingobium japonicum UT26S]
 gi|292677072|dbj|BAI98589.1| putative beta-glucosidase [Sphingobium japonicum UT26S]
          Length = 874

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 118/319 (36%), Positives = 168/319 (52%), Gaps = 17/319 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL  WSPN+NIFRDPRWGRGQET GEDP LT +   ++V+G+QGN      V A  KH+ 
Sbjct: 132 GLDTWSPNINIFRDPRWGRGQETYGEDPYLTARIGVAFVQGMQGNDPDLPLVVATPKHFA 191

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +         R+  N   + +DLEDTY   F+A +VEGK  S+MC+YN+V+G+P C   
Sbjct: 192 VHSGPE---PSRHTDNIFATPRDLEDTYLPAFRAAIVEGKAGSIMCAYNRVDGQPACGSH 248

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG------ 181
            +LK+ + G W   GY+VSDCD+V  +Y    Y        A A++ G+D +C       
Sbjct: 249 MLLKDYLRGAWGFKGYVVSDCDAVVDIYEHHKYAPDAATGVAVALRHGVDSECNNATLGG 308

Query: 182 -PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
            P L    + ++  G +   D++ AL    + + R G   G  S +P   + P  + TP 
Sbjct: 309 RPDLGDRYKESLARGHIAMGDIDAALVRLFSARYRNGDLPGLSSRKP-NAIPPSAIGTPD 367

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           HQ LAL  A + +VLLKN    LPL       +A+ GP +D T  + GNY   +   + P
Sbjct: 368 HQALALSTAEKSLVLLKNDG-ILPLRP--GARIALAGPLADATRVLRGNY---SSAKSAP 421

Query: 301 LQGISRYAKTIHQAGCFGV 319
              ++   K +  AG   V
Sbjct: 422 PISVAEGLKQVMGAGAVTV 440



 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 33/75 (44%), Positives = 48/75 (64%), Gaps = 3/75 (4%)

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
           DR  L LP  Q+  + + A+A   P+V+V M G  +D+S+AK +    AI+   YPGQ+G
Sbjct: 632 DRTTLDLPADQRAFLEK-ARALGKPLVIVAMNGSAIDLSWAKEN--AAAIVEAWYPGQSG 688

Query: 416 GAAIADVLFGRANPG 430
           G A+ +VL GRA+PG
Sbjct: 689 GLAVGNVLSGRADPG 703


>gi|429087308|ref|ZP_19150040.1| Periplasmic beta-glucosidase [Cronobacter universalis NCTC 9529]
 gi|426507111|emb|CCK15152.1| Periplasmic beta-glucosidase [Cronobacter universalis NCTC 9529]
          Length = 765

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 224/457 (49%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGRG E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVTRDPRWGRGSEGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+K  +  G    VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ + G W   G  +SD  ++  L   +H T + PE+A   AIK+G+D+     + 
Sbjct: 267 GWLLKDLLRGDWGFKGITISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD-------VC 237
           + +    ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP+D         
Sbjct: 325 SKYLPDLIKSGKVSMEELDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAE 379

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
           +  H+  A   A + +VLLKN   TLPL   +  T+AV+GP +D    M+G++  AGVA 
Sbjct: 380 SRLHRDDARNVARESLVLLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVAD 437

Query: 296 GYTTPLQGISRY----AKTIHQAGCFGVACNG-------------------NQLIGAAEV 332
              T LQG+       AK ++  G       G                    ++I  A  
Sbjct: 438 QSVTLLQGMKNVAGDKAKILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVN 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  L +P  Q++L+S + KA+  P+VLVLM G P  
Sbjct: 498 VAKQSDVVVAVVGEAQGMAHEASSRTDLTIPQSQRDLISAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|295689400|ref|YP_003593093.1| glycosyl hydrolase family protein [Caulobacter segnis ATCC 21756]
 gi|295431303|gb|ADG10475.1| glycoside hydrolase family 3 domain protein [Caulobacter segnis
           ATCC 21756]
          Length = 895

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 121/293 (41%), Positives = 166/293 (56%), Gaps = 20/293 (6%)

Query: 9   LTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTA 68
           L  WSPN+NIFRDPRWGRGQET GEDP L  +   ++V G+QG      +V A  KHY  
Sbjct: 139 LNTWSPNINIFRDPRWGRGQETYGEDPHLAARMGVAFVEGVQGPDPDLPQVIATPKHYAV 198

Query: 69  YDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
           +         R+H N  VS++DLEDTY   F+A +VE K  S+MC+YN+++G+P CA   
Sbjct: 199 HSGPEST---RHHANVYVSRRDLEDTYLPAFRAAIVEAKAGSIMCAYNRIDGQPACASDM 255

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHT 188
           +LK+ +   W+ DGY+VSDCD+V  + +   Y      A A A++AG+D +C       T
Sbjct: 256 LLKDYLRTAWKFDGYVVSDCDAVKDINDNHKYAPDAATAVAAAVRAGVDNECNGATLTDT 315

Query: 189 EG-------AVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNL--GPRDVCTP 239
            G       A++ GL+   DV+ AL    + ++R+G     P  +P       P  V TP
Sbjct: 316 AGLSNRYRDALQQGLITIGDVDQALVRLYSARLRVGDL---PGVRPVDTAPKAPSAVMTP 372

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLST-LRHHTVAVIGPNSDVTVTMIGNYA 291
            H+ LAL A+ + +VLLKN    LPL   LR   VAVIGP  D T  + GNY+
Sbjct: 373 EHEALALAASEKSLVLLKNDG-LLPLKPGLR---VAVIGPLGDATRVLRGNYS 421



 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/102 (42%), Positives = 60/102 (58%), Gaps = 13/102 (12%)

Query: 338 DATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMC 387
           D  V V+GL   +EAE            D+  L +P  QQ L+ + AKA   P+V+V M 
Sbjct: 625 DVLVAVVGLTSDLEAEESPVEIPGFKGGDKTTLDIPADQQALLEQ-AKALGKPLVVVAMN 683

Query: 388 GGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           G P+++S+AK++    AIL   YPGQ+GG AIA+VL G+ NP
Sbjct: 684 GSPLNLSWAKDN--AAAILEAWYPGQSGGLAIANVLTGKTNP 723


>gi|429101023|ref|ZP_19162997.1| Periplasmic beta-glucosidase [Cronobacter turicensis 564]
 gi|426287672|emb|CCJ89110.1| Periplasmic beta-glucosidase [Cronobacter turicensis 564]
          Length = 759

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 223/457 (48%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGRG E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 149 WAPMVDVTRDPRWGRGSEGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYG 208

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+K  +  G    VM + N +NG P  +D
Sbjct: 209 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASD 260

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ + G W   G  +SD  ++  L   +H T + PE+A   AIK+G+D+     + 
Sbjct: 261 GWLLKDLLRGDWGFKGITISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYY 318

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD-------VC 237
           + +    ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP+D         
Sbjct: 319 SKYLPNLIKSGKVSMEELDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAE 373

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
           +  H+  A   A + +VLLKN   TLPL   +  T+AV+GP +D    M+G++  AGVA 
Sbjct: 374 SRLHRDDARNVARESLVLLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVAD 431

Query: 296 GYTTPLQGISRYA----KTIHQAGCFGVACNG-------------------NQLIGAAEV 332
              T LQG+   A    K +   G       G                    ++I  A  
Sbjct: 432 QSVTLLQGMKNVAGDKAKILFAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVN 491

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  L +P  Q++L+S + KA+  P+VLVLM G P+ 
Sbjct: 492 VAKQSDVVVAVVGEAQGMAHEASSRTDLTIPQSQRDLISAL-KATGKPLVLVLMNGRPL- 549

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            +  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 550 -ALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 585


>gi|29350122|ref|NP_813625.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
           [Bacteroides thetaiotaomicron VPI-5482]
 gi|29342034|gb|AAO79819.1| periplasmic beta-glucosidase precursor, xylosidase/arabinosidase
           [Bacteroides thetaiotaomicron VPI-5482]
          Length = 769

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 147/447 (32%), Positives = 223/447 (49%), Gaps = 39/447 (8%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGL-QGNTGSRLKVAACCKHYTAYD 70
           + P +++ RDPRW R +ET GEDPVL+G+  A+ + GL  G+        A  KH+ AY 
Sbjct: 172 YGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMILGLGSGDLSCEYATIATLKHFLAYA 231

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
           +        Y   A V  +DL + +  PF+  +  G + SVM SYN ++G P  A+  +L
Sbjct: 232 VPEGGQNGNY---ASVGTRDLHENFLPPFREAIDAGAL-SVMTSYNSIDGVPCTANHYLL 287

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEG 190
              +  +WR  G++VSD  S+  ++ +     T EEAA  A+ AG D+D G    ++   
Sbjct: 288 TQLLRNEWRFRGFVVSDLYSIEGVHESHFVAPTIEEAAMQAVSAGADIDLGGDAFMNLTH 347

Query: 191 AVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAH 250
           AV+ G + E  ++ A+   + ++  +G+F+  P   P      + V +  H +LA + A 
Sbjct: 348 AVQFGKISEAVIDTAVCRVLRMKFEIGLFE-HPYVNP--KTATKIVRSKDHIKLARKVAQ 404

Query: 251 QGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA--CGYTTPLQG-ISRY 307
             IVLLKN    LPL+  +   VAV+GPN+D    M+G+Y          T L G IS+ 
Sbjct: 405 SSIVLLKNENSILPLNK-KIKKVAVVGPNADNRYNMLGDYTAPQEDENIKTVLDGVISKL 463

Query: 308 --AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGL------------------D 347
             +K  +  GC       N++  A E A+R      +V G                   +
Sbjct: 464 SPSKVEYVRGCAIRDTTVNEIAEAVEAASRSEVIIAVVGGSSARDFKTSYQETGAAIADE 523

Query: 348 QSIE----AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIG 403
           +SI      E  DRA L L G+QQ+L+  + KA+  P+++V + G P+D  +A       
Sbjct: 524 KSISDMECGEGFDRATLTLLGKQQDLLIAL-KATGKPLIVVYIEGRPLDKVWASE--YAD 580

Query: 404 AILWVGYPGQAGGAAIADVLFGRANPG 430
           A+L   YPGQ GG AIADVLFG  NP 
Sbjct: 581 ALLTASYPGQEGGYAIADVLFGDYNPA 607


>gi|379722647|ref|YP_005314778.1| glycoside hydrolase family protein [Paenibacillus mucilaginosus
           3016]
 gi|378571319|gb|AFC31629.1| glycoside hydrolase family 3 domain-containing protein
           [Paenibacillus mucilaginosus 3016]
          Length = 937

 Score =  198 bits (504), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 123/320 (38%), Positives = 179/320 (55%), Gaps = 20/320 (6%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAAC 62
           N  + GLT W+P V++ RDPRWGR +E  GEDP LTG+ +   V+G+QG+    LK AA 
Sbjct: 103 NPEINGLTIWAPTVDMERDPRWGRTEEAYGEDPYLTGRLSTELVKGMQGDHPFYLKTAAT 162

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
            KH+    L N N +DR   +A +  +++ + Y   F+    EG   S+M +YN VNG P
Sbjct: 163 LKHF----LGNNNEMDRGECSASIDPRNMREYYLKAFEPAFREGGAQSMMTAYNSVNGTP 218

Query: 123 TCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL---D 179
              +PD+    + G+W +DG++VSD   V    N   Y  +  EA A ++++G+D    D
Sbjct: 219 CNLNPDV-NAIVKGEWGMDGFVVSDAGDVLGTVNEHRYFASYAEAVAASVRSGIDSITDD 277

Query: 180 CGPFL-AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCT 238
            G  L AI    A+ GGLL EED++ A+     V++RLG FD  P   P+ ++    +C 
Sbjct: 278 VGITLRAIRD--ALEGGLLAEEDLDRAVGNAFRVRIRLGEFDS-PEENPYAHVPEAKLCA 334

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT 298
           P H  L+LQAA +G VLLKN    LPL+  +  ++AVIGP +DV  T    Y+G      
Sbjct: 335 PEHAALSLQAAREGFVLLKNEG-LLPLA--KPASIAVIGPLADVVHT--DWYSGTPPYRI 389

Query: 299 TPLQGISRY---AKTIHQAG 315
           TPL+G++        IH+ G
Sbjct: 390 TPLEGVAERMAPGTVIHRTG 409



 Score = 44.3 bits (103), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 38/105 (36%), Positives = 56/105 (53%), Gaps = 6/105 (5%)

Query: 327 IGAAEVAARQADATVLVMGLDQSIEA-EFIDRAGLLL-PGRQQELVSRVAKASRGPVVLV 384
           + AA  AARQA+  ++ +G    I   E +DR  L L P ++  L + +A   R   V V
Sbjct: 544 LKAAVEAARQAETAIVFLGNSPFINGKECVDRPDLTLAPAKEALLQAVLAANPR--TVAV 601

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           L+   P  V++A+    + AIL+  + GQ  G A ADVLFG  +P
Sbjct: 602 LVGSYPFAVNWAQ--AHVPAILYTSHAGQELGRAAADVLFGDCSP 644


>gi|429110963|ref|ZP_19172733.1| Periplasmic beta-glucosidase [Cronobacter malonaticus 507]
 gi|426312120|emb|CCJ98846.1| Periplasmic beta-glucosidase [Cronobacter malonaticus 507]
          Length = 759

 Score =  198 bits (504), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 147/457 (32%), Positives = 225/457 (49%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGRG E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 149 WAPMVDVTRDPRWGRGSEGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYG 208

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+K  +  G    VM + N +NG P  +D
Sbjct: 209 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASD 260

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ + G W   G  +SD  ++  L   +H T + PE+A   AIK+G+D+     + 
Sbjct: 261 GWLLKDLLRGDWGFKGITISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYY 318

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD-------VC 237
           + +    ++ G +   +++ A  + + V+  +G+F+      P+ +LGP+D         
Sbjct: 319 SKYLPNLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPQDTNAE 373

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
           +  H+  A + A + +VLLKN   TLPL   +  T+AV+GP +D    M+G++  AGVA 
Sbjct: 374 SRLHRDDARKVARESLVLLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVAD 431

Query: 296 GYTTPLQGISRY----AKTIHQAGCFGVACNG-------------------NQLIGAAEV 332
              T LQG+ +     AK ++  G       G                    ++I  A  
Sbjct: 432 QSVTLLQGMKKVAGDKAKILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVN 491

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  L +P  Q++L+S + KA+  P+VLVLM G P  
Sbjct: 492 VAKQSDMVVAVVGEAQGMAHEASSRTDLTIPQSQRDLISAL-KATGKPLVLVLMNGRP-- 548

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 549 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 585


>gi|325299987|ref|YP_004259904.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
 gi|324319540|gb|ADY37431.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
          Length = 864

 Score =  198 bits (504), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 114/326 (34%), Positives = 177/326 (54%), Gaps = 36/326 (11%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT+W+PNVNIFRDPRWGRGQET GEDP LT +   + VRGLQG   +   K+ AC KH+
Sbjct: 132 GLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGMAVVRGLQGPEDAPYDKLHACAKHF 191

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +    WN   R+ FNA  ++ +DL +TY   FK  V +  V  VMC+YN++ G+P C 
Sbjct: 192 AVHSGPEWN---RHEFNAENIAPRDLWETYMPAFKDLVQKAHVKEVMCAYNRLEGEPCCG 248

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEE--AAADAIKAGLDLDCGPF 183
           +  +L + +  +W   G +VSDC ++   +    +   P++  A+A A+ +G DL+CG  
Sbjct: 249 NNRLLTHILRDEWGYQGIVVSDCGAISDFWRKGDHETHPDKAHASAGAVLSGTDLECGSN 308

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVC------ 237
                E AV+ GL+ E  +++++   +  +  LG  D             +DVC      
Sbjct: 309 YKSLPE-AVKAGLIAESQLDISVKRLLKARFELGEMD-------------KDVCWDTIPY 354

Query: 238 ----TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV 293
                 AH+ LAL+ A + IVLL+N    LPL   +   +A++GPN++ ++   GNY G 
Sbjct: 355 SVVDCQAHKDLALRMARESIVLLQNRNNILPLR--KDMKIALVGPNANDSIMHWGNYNGF 412

Query: 294 ACGYTTPLQGISRY---AKTIHQAGC 316
                T  + + +    ++ I++ GC
Sbjct: 413 PSHTETLYEALKKRLPASQLIYEFGC 438



 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 50/105 (47%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  +   G+  ++E E +          DR  + LP  Q++LV  + K  + P+V +
Sbjct: 599 KDADVILFAGGISPTLEGEEMPVDAEGFRGGDRTSIELPAIQRQLVGELKKLGK-PIVFI 657

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G  +    A        ++   YPGQAGG AIADVLFG  NP
Sbjct: 658 NYSGSAM--GLAPESEICDGMIQAWYPGQAGGTAIADVLFGDYNP 700


>gi|402299776|ref|ZP_10819350.1| glycoside hydrolase family protein [Bacillus alcalophilus ATCC
           27647]
 gi|401725066|gb|EJS98379.1| glycoside hydrolase family protein [Bacillus alcalophilus ATCC
           27647]
          Length = 937

 Score =  198 bits (504), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 118/313 (37%), Positives = 169/313 (53%), Gaps = 15/313 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLT W+P V++ RDPRWGR +E  GEDP LTG+     V+G+QGN    LK  A  KH+ 
Sbjct: 108 GLTLWAPTVDMERDPRWGRTEEAYGEDPELTGQLTTELVKGMQGNDPDYLKSVATLKHF- 166

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
              L N N ++R   +A +  +++ + Y   FK  +VEGK  S+M +YN +NG P    P
Sbjct: 167 ---LGNNNEINRDKCSASIDPRNMHEYYLKAFKPAIVEGKAKSIMTAYNSINGTPALLHP 223

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD--LDCGPFLA 185
             ++  +  +W +DG+IVSD   V  + N   Y  T  EA AD IK+G+D   D      
Sbjct: 224 -YVEEVVKQEWGMDGFIVSDAGDVVGIVNDHKYYETYSEALADTIKSGIDSITDDAEISL 282

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
                A+  GLL E D++ AL  T  V+ RLG FD E +  P+  +    +    H+QL+
Sbjct: 283 QAMRDALSKGLLVEADLDKALINTFKVRFRLGEFDEERN--PYAFVSESKLLAKEHKQLS 340

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
            QAA + +VLLKN    LPL+     ++AVIGP SD    ++  Y+G      T L+ I 
Sbjct: 341 YQAAQEQVVLLKNDG-LLPLNEGTTESIAVIGPLSD--DVLVDWYSGTPSYRITALEAIE 397

Query: 306 ---RYAKTIHQAG 315
              +  K IH++G
Sbjct: 398 HKLQNKKVIHKSG 410



 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 56/105 (53%), Gaps = 4/105 (3%)

Query: 327 IGAAEVAARQADATVLVMGLDQSIE-AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVL 385
           I  A  AA + D  V+V+G +  I   E IDR  ++LP  Q++L+  V + +   +  VL
Sbjct: 544 IEEAVKAAAEQDYAVVVLGNNPFINGKECIDREDIVLPPEQEKLLQAVKQVNPNTIA-VL 602

Query: 386 MCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +   P  +++A  +  +  I++  + G   G A+ADVLFG  NP 
Sbjct: 603 VSSYPYALNWANEN--LSGIMYTSHAGPELGKAVADVLFGDYNPA 645


>gi|261405721|ref|YP_003241962.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
 gi|261282184|gb|ACX64155.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
           Y412MC10]
          Length = 765

 Score =  198 bits (503), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 146/457 (31%), Positives = 219/457 (47%), Gaps = 48/457 (10%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHY 66
           G   +SP +++ RDPRWGR +E  GEDP L  +YA + V GLQG +  S   VAA  KH+
Sbjct: 157 GAVTYSPVLDVVRDPRWGRTEECFGEDPYLISEYAVASVEGLQGESLDSPSSVAATLKHF 216

Query: 67  TAY-DLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             Y   +        H   R    +L +   +PFK  V E   AS+M +YN+++G P   
Sbjct: 217 VGYGSSEGGRNAGPVHMGTR----ELMEVDMLPFKKAV-EAGAASIMPAYNEIDGVPCTV 271

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD-CGPFL 184
           + ++L   +  +W  DG +++DC ++ +L +         +AA  AI+AG+D++  G   
Sbjct: 272 NTELLDGILRKEWGFDGMVITDCGAIDMLASGHDTAEDGMDAAVQAIRAGIDMEMSGEMF 331

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
             H + AV    L    ++ A+   +T++ +LG+F+  P   P        + +  H  L
Sbjct: 332 GKHLQKAVESNKLEVSVLDEAVRRVLTLKFKLGLFEN-PYVDP--QTAENVIGSEQHVGL 388

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA--CGYTTPLQ 302
           A Q A +GIVLLKN A+ LPLS      +AVIGPN+D     +G+Y         TT L 
Sbjct: 389 ARQLAAEGIVLLKNEAKALPLSK-EGGVIAVIGPNADQGYNQLGDYTSPQPPAAVTTVLG 447

Query: 303 GI-----SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG------------ 345
           GI         + ++  GC  +  +  +    A   A QAD  V+V+G            
Sbjct: 448 GIRAKLGEEAQRVLYAPGCR-IKDDSREGFEFALTCAEQADTVVMVLGGSSARDFGEGTI 506

Query: 346 ---------LDQSIE----AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
                     D ++      E IDR  L L G Q ELV  + K  +  +++V + G P+ 
Sbjct: 507 DLRTGASKVTDDALSDMDCGEGIDRMTLQLSGVQLELVQEIHKLGK-RMIVVYINGRPIA 565

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
             +   D    AIL   YPGQ GG A+AD+LFG  NP
Sbjct: 566 EPWI--DEHADAILEAWYPGQEGGHAVADILFGDVNP 600


>gi|319643197|ref|ZP_07997825.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
 gi|345520511|ref|ZP_08799899.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|254835034|gb|EET15343.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|317385101|gb|EFV66052.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
          Length = 788

 Score =  198 bits (503), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 137/442 (30%), Positives = 215/442 (48%), Gaps = 36/442 (8%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL 71
           +SP ++I +DPRWGR  ET GEDP L G+     +  LQ     +  + A  KH+  Y +
Sbjct: 196 YSPILDIAQDPRWGRCVETYGEDPYLVGELGKQMITSLQ-----KYNLVATPKHFAVYSI 250

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
                  +   +  V+ +++   Y  PF+    E     VM SYN  +G+P       L 
Sbjct: 251 PIGGRDGKTRTDPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLT 310

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC-----GPFLAI 186
             +  +W   GY+VSD ++V  + N      T E+  A A+ AGL++         F+ +
Sbjct: 311 EILRQEWGFKGYVVSDSEAVEFISNKHKVADTYEDGIAQAVNAGLNIRTHFTPPADFI-L 369

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLAL 246
               AV  G + +E ++  +A  + ++  LG+FD        G    + V +  HQ ++L
Sbjct: 370 PLRKAVDDGKISQETLDKRVAEILRIKFWLGLFDNPYRGN--GKQAEQIVHSKEHQAVSL 427

Query: 247 QAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           +AA Q +VLLKN    LPLS ++R  ++AVIGPN+D    +I  Y        T  QGI 
Sbjct: 428 EAARQSLVLLKNETHLLPLSKSIR--SIAVIGPNADEQTQLICRYGPANAPIKTVYQGIK 485

Query: 306 R---YAKTIHQAGC--------------FGVACNGNQLIGAAEVAARQADATVLVMGLDQ 348
               +A+ I++ GC              F       +L+     AA+QA+  V+V+G ++
Sbjct: 486 ELLPHAEVIYKKGCDIIDPHFPESEILDFPKTAEEVRLMQEVIRAAKQAEVVVMVLGGNE 545

Query: 349 SIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWV 408
               E   R  L LPGRQ+EL+  V  A+  PV+LV++ G    +++A     + AIL  
Sbjct: 546 LTVREDRSRTSLNLPGRQEELLKAVC-ATGKPVILVMLDGRASSINYAA--AHVPAILHA 602

Query: 409 GYPGQAGGAAIADVLFGRANPG 430
            +PG+  G A+A+ LFG  NPG
Sbjct: 603 WFPGEFCGQAVAEALFGDYNPG 624


>gi|255693560|ref|ZP_05417235.1| periplasmic beta-glucosidase [Bacteroides finegoldii DSM 17565]
 gi|260620625|gb|EEX43496.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 770

 Score =  198 bits (503), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 142/438 (32%), Positives = 228/438 (52%), Gaps = 35/438 (7%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN--TGSRLKVAACCKHYT-- 67
           ++PNV + RD RWGR  ET GEDP L G+  A+ VRG Q    TG+  KV AC KH    
Sbjct: 184 FTPNVEVARDARWGRVGETFGEDPYLVGQMGAATVRGFQTKDFTGND-KVIACAKHLVGG 242

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           +   +  NG       A +S++ L++ +  PFK C +E  V +VM ++N++NG P   + 
Sbjct: 243 SQPANGINGAP-----AELSERTLQEVFFPPFKDC-LEAGVFTVMTAHNELNGIPCHGNK 296

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC-GPFLAI 186
            ++   +  QW+ DG++VSD   +  +++  +   T ++A   ++ AG+ +   GP    
Sbjct: 297 YLMTEVLRNQWKFDGFVVSDWMDIERMHDYHNVAETLKDAYQISVDAGMGMHMHGPEFYE 356

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD--VCTPAHQQL 244
                V+ G + E+ ++ A++  + V+ RLG+F+      PF +L  +D  V    HQQ 
Sbjct: 357 AIIECVKEGSIPEKQIDAAVSKILEVKFRLGLFEN-----PFIDLKKKDEIVFNEKHQQT 411

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA--GVACGYTTPLQ 302
           AL+ A + IVLLKN    LPL   ++  V V G N++   +++G++A        TT L+
Sbjct: 412 ALEGARKSIVLLKNEGNMLPLDASKYKKVFVTGHNAN-NQSILGDWAMEQPEEHVTTVLK 470

Query: 303 GISRYAKTIHQAGC---FGVACNGNQLIGAAEVAARQADATVLVMG-------LDQSIEA 352
           G+   +   +       + V    +  I  A   AR +D  +LV+G        ++    
Sbjct: 471 GLKAISPETNYNFLDLGWNVRLLSDNQIKEAVQQARSSDLAILVVGENSMRYHWNEKTCG 530

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E  DR  L LPGRQQELV  VA A+  P V++L+ G P+   +   D  +  I+    PG
Sbjct: 531 ENSDRYELSLPGRQQELVEAVA-ATGVPTVVILVNGRPLTTEWI--DENMPCIIEAWEPG 587

Query: 413 QAGGAAIADVLFGRANPG 430
            AGG A+A++L+G+ NP 
Sbjct: 588 VAGGQALAEILYGKVNPS 605


>gi|307109345|gb|EFN57583.1| hypothetical protein CHLNCDRAFT_34794, partial [Chlorella
           variabilis]
          Length = 377

 Score =  198 bits (503), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 116/309 (37%), Positives = 172/309 (55%), Gaps = 15/309 (4%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR--LKVAA 61
           G +A    + P+V+I RDPRWGR  ET GEDP+L    A ++VRGLQG  G+   +K  A
Sbjct: 55  GPLAYSNCFGPHVHIVRDPRWGRMAETFGEDPLLQSNMAVAHVRGLQGGAGTDTYIKTVA 114

Query: 62  CCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 121
            CKH+   DL+ W GV R+ F+A ++++DL D++  PF+ACV EG   +VMCSYN +NG 
Sbjct: 115 TCKHFIGNDLEGWEGVTRHTFDANITERDLRDSFLPPFEACVREGGALAVMCSYNSLNGL 174

Query: 122 PTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQ----HYT-RTPEEAAADAIKAGL 176
           P C +  +L   + G+    G +V+DC ++  +   +    H+       A+A A++AG 
Sbjct: 175 PACVNKPLLTGLLRGELGFAGMVVTDCTALTRIVKPRPKGPHFAGGDKRRASALAVQAGT 234

Query: 177 DLDCGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDV 236
           D+ C      H    ++ G + + D++ A+   +  ++R G F+   S  PF +L     
Sbjct: 235 DMAC------HLFDMLQPGDVSQRDLDAAVRRVLHNRVRQGHFN-PLSELPFDHLDGSVF 287

Query: 237 CTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
              AH+  A + A +G VLLKN+  TLPL       VAVIGP +D    ++G Y G   G
Sbjct: 288 GRAAHRDTAREIAAKGTVLLKNAEGTLPLRPAAMKQVAVIGPFADQPTYILGKYYGATAG 347

Query: 297 -YTTPLQGI 304
             TTPLQ I
Sbjct: 348 PITTPLQAI 356


>gi|329956868|ref|ZP_08297436.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
 gi|328523625|gb|EGF50717.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
          Length = 864

 Score =  198 bits (503), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 121/318 (38%), Positives = 172/318 (54%), Gaps = 15/318 (4%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQ--GNTGSRLKVAACCKH 65
           GLT W+P VNI+RDPRWGRG ET GEDP L      S V+GLQ         KV AC KH
Sbjct: 129 GLTIWTPTVNIYRDPRWGRGIETYGEDPYLASVMGVSVVKGLQCLDENEKYDKVHACAKH 188

Query: 66  YTAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           +  +    WN   R+ FNA  +S +DL +TY  PF+A V EGKV  VMC+YN+  G+P C
Sbjct: 189 FAVHSGPEWN---RHSFNAENISPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGEPCC 245

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQ-HYTR-TPEEAAADAIKAGLDLDCGP 182
               +L + +  +W  DG +V+DC ++   +N + H T      A++ A+ +G DL+CG 
Sbjct: 246 GSNRLLNHILRREWGYDGIVVADCSAISDFHNDKGHKTHADAASASSAAVLSGTDLECGS 305

Query: 183 FLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
                TEG V+ G + E D++ ++   +  +  LG  D EP    +  +    VC+  H 
Sbjct: 306 NYRSLTEG-VKKGFIDEADIDRSVKRLLQARFELGEMD-EPDQVRWAQIPYSVVCSDKHD 363

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQ 302
            L+L  A + + LL N    LPL      T+AV+GPN++ +V   GNY G+     T L 
Sbjct: 364 SLSLDMARKSMTLLLNKNNALPLER-GGTTIAVMGPNANDSVMQWGNYNGLPKRTITILD 422

Query: 303 GIS----RYAKTIHQAGC 316
           GI     +  K I++ GC
Sbjct: 423 GIRSAMGKDDKLIYEQGC 440



 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 50/105 (47%), Gaps = 13/105 (12%)

Query: 335 RQADATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLV 384
           + AD  +   G+   +E E +          DR  + LP  Q+E++  +  A +  V+ V
Sbjct: 600 KDADVVIFAGGISPQLEGEEMGVKLPGFRGGDRTDIELPAVQREMIKALHDAGK-KVIFV 658

Query: 385 LMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              G P+  +         AIL   YPGQ+GG A+A+VLFG  NP
Sbjct: 659 NCSGSPI--AMEPETEYCQAILQAWYPGQSGGKAVAEVLFGDYNP 701


>gi|237712573|ref|ZP_04543054.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
 gi|345512524|ref|ZP_08792050.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
 gi|423239901|ref|ZP_17221016.1| hypothetical protein HMPREF1065_01639 [Bacteroides dorei
           CL03T12C01]
 gi|229435409|gb|EEO45486.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
 gi|229453894|gb|EEO59615.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
 gi|392644890|gb|EIY38624.1| hypothetical protein HMPREF1065_01639 [Bacteroides dorei
           CL03T12C01]
          Length = 788

 Score =  198 bits (503), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 140/442 (31%), Positives = 215/442 (48%), Gaps = 36/442 (8%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL 71
           +SP ++I +DPRWGR  ET GEDP L G+     +  LQ +      + A  KH+  Y +
Sbjct: 196 YSPILDIAQDPRWGRCVETYGEDPYLVGELGKQMITSLQKHN-----LVATPKHFAVYSI 250

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
                  +   +  V+ +++   Y  PF+    E     VM SYN  +G+P       L 
Sbjct: 251 PVGGRDGKTRTDPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLT 310

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC-----GPFLAI 186
             +  +W   GY+VSD ++V  + +      T E+  A A+ AGL++         F+ +
Sbjct: 311 EILRQEWGFKGYVVSDSEAVEFISSKHKVANTYEDGIAQAVNAGLNIRTHFTPPADFI-L 369

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLAL 246
               AV  G + +E ++  +A  + V+  LG+FD        G    + V +  HQ ++L
Sbjct: 370 PLRKAVADGKISQETLDKRVAEILRVKFWLGLFDNPYRGN--GKQAEQIVHSKEHQAVSL 427

Query: 247 QAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           +AA Q +VLLKN    LPLS +LR  ++AVIGPN+D    +I  Y        T  QGI 
Sbjct: 428 EAARQSLVLLKNEMNLLPLSKSLR--SIAVIGPNADERTQLICRYGPANAPIKTVYQGIK 485

Query: 306 R---YAKTIHQAGC--------------FGVACNGNQLIGAAEVAARQADATVLVMGLDQ 348
               + + I++ GC              F       +L+  A  AA+QA+  V+V+G ++
Sbjct: 486 ERLPHTEVIYRKGCDIIDPHFPESEVLDFPKTTEEARLMEEAIHAAKQAEVVVMVLGGNE 545

Query: 349 SIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWV 408
               E   R  L LPGRQ+EL+  V  A+  PVVLVL+ G    +++A     + AIL  
Sbjct: 546 LTVREDRSRTSLNLPGRQEELLKAVC-ATGKPVVLVLLDGRASSINYAA--AHVPAILHA 602

Query: 409 GYPGQAGGAAIADVLFGRANPG 430
            +PG+  G A+A+ LFG  NPG
Sbjct: 603 WFPGEFCGQAVAEALFGDYNPG 624


>gi|423229063|ref|ZP_17215468.1| hypothetical protein HMPREF1063_01288 [Bacteroides dorei
           CL02T00C15]
 gi|423244903|ref|ZP_17225977.1| hypothetical protein HMPREF1064_02183 [Bacteroides dorei
           CL02T12C06]
 gi|392634816|gb|EIY28728.1| hypothetical protein HMPREF1063_01288 [Bacteroides dorei
           CL02T00C15]
 gi|392640944|gb|EIY34735.1| hypothetical protein HMPREF1064_02183 [Bacteroides dorei
           CL02T12C06]
          Length = 788

 Score =  198 bits (503), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 140/442 (31%), Positives = 215/442 (48%), Gaps = 36/442 (8%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL 71
           +SP ++I +DPRWGR  ET GEDP L G+     +  LQ +      + A  KH+  Y +
Sbjct: 196 YSPILDIAQDPRWGRCVETYGEDPYLVGELGKQMITSLQKHN-----LVATPKHFAVYSI 250

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
                  +   +  V+ +++   Y  PF+    E     VM SYN  +G+P       L 
Sbjct: 251 PVGGRDGKTRTDPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLT 310

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC-----GPFLAI 186
             +  +W   GY+VSD ++V  + +      T E+  A A+ AGL++         F+ +
Sbjct: 311 EILRQEWGFKGYVVSDSEAVEFISSKHKVANTYEDGIAQAVNAGLNIRTHFTPPADFI-L 369

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLAL 246
               AV  G + +E ++  +A  + V+  LG+FD        G    + V +  HQ ++L
Sbjct: 370 PLRKAVADGKISQETLDKRVAEILRVKFWLGLFDNPYRGN--GKQAEQIVHSKEHQAVSL 427

Query: 247 QAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           +AA Q +VLLKN    LPLS +LR  ++AVIGPN+D    +I  Y        T  QGI 
Sbjct: 428 EAARQSLVLLKNEMNLLPLSKSLR--SIAVIGPNADERTQLICRYGPANAPIKTVYQGIK 485

Query: 306 R---YAKTIHQAGC--------------FGVACNGNQLIGAAEVAARQADATVLVMGLDQ 348
               + + I++ GC              F       +L+  A  AA+QA+  V+V+G ++
Sbjct: 486 ERLPHTEVIYRKGCDIIDPHFPESEVLDFPKTTEEARLMEEAIHAAKQAEVVVMVLGGNE 545

Query: 349 SIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWV 408
               E   R  L LPGRQ+EL+  V  A+  PVVLVL+ G    +++A     + AIL  
Sbjct: 546 LTVREDRSRTSLNLPGRQEELLKAVC-ATGKPVVLVLLDGRASSINYAA--AHVPAILHA 602

Query: 409 GYPGQAGGAAIADVLFGRANPG 430
            +PG+  G A+A+ LFG  NPG
Sbjct: 603 WFPGEFCGQAVAEALFGDYNPG 624


>gi|383115540|ref|ZP_09936296.1| hypothetical protein BSGG_2590 [Bacteroides sp. D2]
 gi|313695055|gb|EFS31890.1| hypothetical protein BSGG_2590 [Bacteroides sp. D2]
          Length = 770

 Score =  198 bits (503), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 142/438 (32%), Positives = 228/438 (52%), Gaps = 35/438 (7%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGN--TGSRLKVAACCKHYT-- 67
           ++PNV + RD RWGR  ET GEDP L G+  A+ VRG Q    TG+  KV AC KH    
Sbjct: 184 FTPNVEVARDARWGRVGETFGEDPYLVGQMGAATVRGFQTKDFTGND-KVIACAKHLVGG 242

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           +   +  NG       A +S++ L++ +  PFK C +E  V +VM ++N++NG P   + 
Sbjct: 243 SQPANGINGAP-----AELSERTLQEVFFPPFKDC-LEAGVFTVMTAHNELNGIPCHGNK 296

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC-GPFLAI 186
            ++   +  QW+ DG++VSD   +  +++  +   T ++A   ++ AG+ +   GP    
Sbjct: 297 YLMTEVLRNQWKFDGFVVSDWMDIERMHDYHNVAETLKDAYRISVDAGMGMHMHGPEFYE 356

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD--VCTPAHQQL 244
                V+ G + E+ ++ A++  + V+ RLG+F+      PF +L  +D  V    HQQ 
Sbjct: 357 AIIECVKEGSIPEKQIDAAVSKILEVKFRLGLFEN-----PFIDLKKKDEIVFNEKHQQT 411

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA--GVACGYTTPLQ 302
           AL+ A + IVLLKN    LPL   ++  V V G N++   +++G++A        TT L+
Sbjct: 412 ALEGARKSIVLLKNEGNMLPLDASKYKKVFVTGHNAN-NQSILGDWAMEQPEEHVTTVLK 470

Query: 303 GISRYAKTIHQAGC---FGVACNGNQLIGAAEVAARQADATVLVMG-------LDQSIEA 352
           G+   +   +       + V    +  I  A   AR +D  +LV+G        ++    
Sbjct: 471 GLKAISPETNYNFLDLGWNVRLLSDNQIKEAVQQARNSDLAILVVGENSMRYHWNEKTCG 530

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E  DR  L LPGRQQELV  VA A+  P V++L+ G P+   +   D  +  I+    PG
Sbjct: 531 ENSDRYELSLPGRQQELVKAVA-ATGVPTVVILVNGRPLTTEWI--DENMPCIIEAWEPG 587

Query: 413 QAGGAAIADVLFGRANPG 430
            AGG A+A++L+G+ NP 
Sbjct: 588 VAGGQALAEILYGKVNPS 605


>gi|383123909|ref|ZP_09944579.1| hypothetical protein BSIG_4072 [Bacteroides sp. 1_1_6]
 gi|382983834|gb|EES66944.2| hypothetical protein BSIG_4072 [Bacteroides sp. 1_1_6]
          Length = 815

 Score =  198 bits (503), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 146/447 (32%), Positives = 223/447 (49%), Gaps = 39/447 (8%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGL-QGNTGSRLKVAACCKHYTAYD 70
           + P +++ RDPRW R +ET GEDPVL+G+  A+ V GL  G+        A  KH+ AY 
Sbjct: 218 YGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMVIGLGSGDLSREYATIATLKHFLAYA 277

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
           +        Y   A V  +DL + +  PF+  +  G + SVM SYN ++G P  A+  +L
Sbjct: 278 VPEGGQNGNY---ASVGTRDLHENFLPPFQEAIDAGAL-SVMTSYNSIDGIPCTANYYLL 333

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEG 190
              +  +WR  G++VSD  S+  ++ +     T EEAA   + AG+D+D G    ++   
Sbjct: 334 TQLLRNEWRFRGFVVSDLYSIEGVHESHFVAPTIEEAAMQVVSAGVDIDLGGNAFMNLTH 393

Query: 191 AVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAH 250
           AV+ G + E  ++ A+   + ++  +G+F+  P   P      + V +  H +LA + A 
Sbjct: 394 AVQSGKISEAVIDTAVCRVLRMKFEMGLFE-HPYVNP--KSATKVVRSEEHIRLAHKVAQ 450

Query: 251 QGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA--CGYTTPLQG-ISRY 307
             IVLLKN    LPL+  +   VAV+GPN+D    M+G+Y          T L G IS+ 
Sbjct: 451 SSIVLLKNKNSILPLNK-KIKKVAVVGPNADNRYNMLGDYTAPQEDENIKTVLDGVISKL 509

Query: 308 --AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGL------------------D 347
             +K  +  GC       N++    E A+R      +V G                   +
Sbjct: 510 SPSKVEYVRGCAIRDTTVNEIAEVVEAASRSEVIIAVVGGSSARDFKTSYQETGAAIADE 569

Query: 348 QSIE----AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIG 403
           +SI      E  DRA L L G+QQ+L++ + KA+  P+++V + G P+D  +A       
Sbjct: 570 KSISDMECGEGFDRATLTLLGKQQDLLNAL-KATGKPLIVVYIEGRPLDKVWASE--YAD 626

Query: 404 AILWVGYPGQAGGAAIADVLFGRANPG 430
           A+L   YPGQ GG AIADVLFG  NP 
Sbjct: 627 ALLTASYPGQEGGYAIADVLFGDYNPA 653


>gi|146301613|ref|YP_001196204.1| beta-glucosidase [Flavobacterium johnsoniae UW101]
 gi|146156031|gb|ABQ06885.1| Candidate beta-xylosidase; Glycoside hydrolase family 3
           [Flavobacterium johnsoniae UW101]
          Length = 864

 Score =  198 bits (503), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 114/309 (36%), Positives = 168/309 (54%), Gaps = 13/309 (4%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAA 61
           N     L+ W+PNVNIFRDPRWGRGQET GEDP LT +   S V+GLQG   ++  K+ A
Sbjct: 125 NKRFLSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGPADAKYRKLLA 184

Query: 62  CCKHYTAYDLDNWNGVDRYHFNA-RVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           C KH+  +    W+   R+  N   V  ++L +TY   FKA V E  V  VMC+Y +++ 
Sbjct: 185 CAKHFAVHSGPEWS---RHELNLNNVKPRELYETYLPAFKALVQEADVRQVMCAYQRLDD 241

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
           +P C++  +L+  +  +W     +VSDC +V   Y T   +     AA+ A+ AG D++C
Sbjct: 242 EPCCSNTRLLQRILRDEWGFQYLVVSDCGAVTDFYTTHKVSSDEVHAASKAVLAGTDVEC 301

Query: 181 ----GPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDV 236
                PF  +    AV   L++EE++N +L   +  +  LG  D + S  P+  +    +
Sbjct: 302 VWDKYPFKKLPE--AVEKDLIKEEEINKSLLRVLIGRFDLGEMDDD-SIVPWAQIPASVL 358

Query: 237 CTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
            +  HQQLAL+ A + + LL+N    LPL+    + VAVIGPN+D    + GNY G    
Sbjct: 359 NSKEHQQLALEMAQKSMTLLQNKNNILPLNK-NVNKVAVIGPNADNEPMLWGNYNGTPNK 417

Query: 297 YTTPLQGIS 305
             T  +GI 
Sbjct: 418 TITIKKGIE 426



 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 47/103 (45%), Gaps = 13/103 (12%)

Query: 338 DATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMC 387
           D  +   GL   +E E +          DR  + LP  Q++ +  +  A  G  V+ + C
Sbjct: 603 DTVIFAGGLSTLLEGEEMPVSFPGFKGGDRTNIELPAVQRKCLKELKAA--GKKVIFVNC 660

Query: 388 GGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            G   ++         AIL   YPG++GG A+ADVLFG  NP 
Sbjct: 661 SGSA-IALTPETESCDAILQAWYPGESGGQAVADVLFGDYNPA 702


>gi|393784569|ref|ZP_10372732.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
           CL02T12C01]
 gi|392665550|gb|EIY59074.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
           CL02T12C01]
          Length = 929

 Score =  198 bits (503), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 111/301 (36%), Positives = 169/301 (56%), Gaps = 11/301 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL YW P +N+ RDPRWGR +E  GEDP LTGK A  Y++G+QG+     K  A  KH+ 
Sbjct: 139 GLIYWCPTINMSRDPRWGRDEENYGEDPFLTGKIAVEYIKGMQGDDPKYYKTIATAKHFA 198

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           A + +      R+  ++ +  ++L + Y   F+  V EG V SVM +YN +NG P  A+ 
Sbjct: 199 ANNYEK----GRHSTSSDMDARNLREYYLPAFEMAVKEGNVRSVMSAYNALNGIPCGANH 254

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLY--NTQHYTRTPEEAAADAIKAGLDLDCGPFLA 185
           ++L + +  +W  +G++ SDC +V  +Y  N  H+  T  EA+A +I  G DL+CG    
Sbjct: 255 ELLIDILRTEWGFNGFVTSDCGAVDDVYQSNRHHFVNTAAEASAVSIVNGEDLNCGNTFQ 314

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
            + + A+  G ++E D++ AL      +  +G FD   S  P+ ++    +    H+QLA
Sbjct: 315 DYCKEAIEKGYMQEADLDTALVRVFEARFSVGEFDN-ASNVPWRSISDDVLDCEEHRQLA 373

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
            +AA + IVLLKN    LPL   +  +VAVIGP  + T+T+ G Y+G     TTP  GI+
Sbjct: 374 YKAAQEAIVLLKNDNNILPLDKTK--SVAVIGPFGN-TITL-GGYSGSPTALTTPFGGIA 429

Query: 306 R 306
            
Sbjct: 430 E 430



 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 44/116 (37%), Positives = 61/116 (52%), Gaps = 4/116 (3%)

Query: 315 GCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVA 374
           GC         L  A E+AA+ AD  +   G D ++  E  DR  L LPG QQ+L+  V 
Sbjct: 592 GCAVTGTAETNLERAKEIAAK-ADVVIFAAGTDLTVSDESHDRTNLNLPGDQQKLLEAVY 650

Query: 375 KASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
            A+   ++L+  C   V +++AK    + AI+   Y GQA G AIADVL+G  NP 
Sbjct: 651 SANPNVILLLQTCSS-VTINWAKE--HVPAIIEAWYGGQAQGKAIADVLYGDYNPS 703


>gi|308208211|gb|ADO20356.1| putative beta-D-xylosidase/alpha-L-arabinosidase [uncultured rumen
           bacterium]
          Length = 780

 Score =  197 bits (502), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 163/492 (33%), Positives = 222/492 (45%), Gaps = 79/492 (16%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSR-LKVAAC 62
           G   G+T+W+PN+NIFRDPRWGRG ET GEDP LTG+   + VRGLQG + S  LK  AC
Sbjct: 147 GQYQGVTFWTPNINIFRDPRWGRGMETYGEDPYLTGQMGMAVVRGLQGPSDSPVLKAHAC 206

Query: 63  CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKP 122
            KHY  +    WN   R+ ++A VS++DL +TY   FK  V +  V  VM +YN+  G+P
Sbjct: 207 AKHYAVHSGPEWN---RHSYDAEVSERDLRETYLPAFKDLVTKANVQEVMTAYNRFRGEP 263

Query: 123 TCADPDILKNTI-HGQWRLDGYIVSDCDSVGVLYNTQHYTRTPE--EAAADAIKAGLDLD 179
            C   D L NTI  G+W   G I SDC +V   Y    +  +P+   AAA A+ AG+D +
Sbjct: 264 -CGASDYLINTILRGEWGYKGLITSDCWAVEDFYVQGRHGYSPDVASAAAAAVHAGVDTE 322

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTP 239
           CG     H   AV  GLL E+D++  L    T + +LG  D       + +L    +  P
Sbjct: 323 CGQAYR-HIPEAVERGLLDEKDLDRNLIRLFTARYQLGEMD---DISLWDDLPASILEGP 378

Query: 240 AHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTT 299
            H  L+ + A + +VLL+N    LPL+      VA++GPN D      GNY  V     T
Sbjct: 379 EHLALSRKMAQESMVLLQNKGGILPLAP--DVRVALVGPNGDDREMQWGNYNPVPGRTVT 436

Query: 300 PLQGI-SRYAKTIHQAGC------FGVACNGNQLIGAA--------EVAARQ-------- 336
               +  R+    +  GC      F    + N  +  A        E  ARQ        
Sbjct: 437 LYDALKERFPGIKYVRGCGIVGAEFAPKPDPNNPLSQALGKSREEMEAIARQYAIGVQDI 496

Query: 337 -----------------------------ADATVLVMGLDQSIEAEFI----------DR 357
                                         D  +   G+    E E +          DR
Sbjct: 497 LNYVRRQERMQASFLPELDVQSVLKELEGIDVVIFAGGISPRFEGEEMPVNLPGFKGGDR 556

Query: 358 AGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGA 417
             + LP  Q++L+  +  A +  V+LV   G    +          AIL   YPG+ GG 
Sbjct: 557 TDIQLPQVQRDLMKALHDAGKK-VILVNFSG--CAIGLVPETESCDAILQAWYPGEEGGL 613

Query: 418 AIADVLFGRANP 429
           AI DVLFG  NP
Sbjct: 614 AITDVLFGDVNP 625


>gi|153809292|ref|ZP_01961960.1| hypothetical protein BACCAC_03604 [Bacteroides caccae ATCC 43185]
 gi|149128062|gb|EDM19283.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           caccae ATCC 43185]
          Length = 946

 Score =  197 bits (502), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 146/448 (32%), Positives = 221/448 (49%), Gaps = 32/448 (7%)

Query: 6   MAGLT-YWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           M G T  ++P +++ RD RWGR +E  GE P L  +     VRG+Q N     +VAA  K
Sbjct: 204 MLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH----QVAATGK 259

Query: 65  HYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           H+ AY  +          + ++S +++E  +  PFK  + E  +  VM SYN  +G P  
Sbjct: 260 HFIAYSNNKGAREGMARVDPQMSPREVEMLHAYPFKRVIREAGLLGVMSSYNDYDGFPIQ 319

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG--- 181
           +    L   + G+    GY+VSD D+V  LY      +  +EA   +++AGL++ C    
Sbjct: 320 SSYYWLTTRLRGEMGFRGYVVSDSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRS 379

Query: 182 -PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
                +     V+ G L EE +N  +   + V+  +G+FD        G     +V    
Sbjct: 380 PDSYVLPLRELVKEGGLSEEVINDRVRDILRVKFLVGLFDTPYQTDLKG--ADEEVEKKE 437

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           ++++ALQA+ + IVLLKN    LPL   +   +AV GPN+D     + +Y  +A   T+ 
Sbjct: 438 NEEVALQASRESIVLLKNEKNVLPLDPSKIRKIAVCGPNADEHSYALTHYGPLAVEVTSV 497

Query: 301 LQGISRYAK----TIHQAGCFGVACN--------------GNQLIGAAEVAARQADATVL 342
           L+GI    K     ++  GC  V  N                + I  A   A+QAD  ++
Sbjct: 498 LKGIQEKMKDKADVLYTKGCDLVDANWPESELIDYPLTDEEQKEIDKAVSQAKQADVAIV 557

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           V+G  Q    E   R+ L LPGRQ +L+  V  A+  PVVLVL+ G P+ +++A  D  +
Sbjct: 558 VLGGGQRTCGENKSRSSLDLPGRQLDLLKAVV-ATGKPVVLVLINGRPLSINWA--DKFV 614

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            AIL   YPG  GG A+AD+LFG  NPG
Sbjct: 615 PAILEAWYPGSKGGIAVADILFGDYNPG 642


>gi|389794138|ref|ZP_10197297.1| beta-glucosidase [Rhodanobacter fulvus Jip2]
 gi|388432924|gb|EIL89908.1| beta-glucosidase [Rhodanobacter fulvus Jip2]
          Length = 869

 Score =  197 bits (502), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 121/302 (40%), Positives = 167/302 (55%), Gaps = 17/302 (5%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL  WSPN+NIFRDPRWGRGQET GEDP LT +   ++V G+QG    R +V A  KH+ 
Sbjct: 117 GLDTWSPNLNIFRDPRWGRGQETYGEDPFLTARMGVAFVEGMQGPNADRPEVIATPKHFA 176

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +   N     R+  N  VS+ DL DTY   F+A + E K  S+MC+YN+++G+P CA+ 
Sbjct: 177 VH---NGPESTRHEANVFVSRHDLVDTYLPAFRAAITEAKAGSIMCAYNRIDGQPACAND 233

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +LK+ + G W   GY+VSDCD+V  + +  HY      A A A++AG+D +C       
Sbjct: 234 LLLKHYLRGAWGFTGYVVSDCDAVKDISDHHHYAPDAATAVAAAVRAGVDNECNGATLSD 293

Query: 188 TEG-------AVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
           T G       A++  L+   D++ AL      + R+G   G P   P   + P  + + A
Sbjct: 294 TAGLEDRYSEALQRELISVADIDRALVRLFAARYRVGDLAGLPGTAP--AVPPSAIGSAA 351

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           + QLAL AA + +VLLKN    LPL       VAVIGP  D T  + GNY+    G  TP
Sbjct: 352 NSQLALDAAQKTLVLLKNDG-VLPLKA--GVKVAVIGPLGDATRVLRGNYSSPLSG--TP 406

Query: 301 LQ 302
           + 
Sbjct: 407 IS 408



 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 52/150 (34%), Positives = 79/150 (52%), Gaps = 15/150 (10%)

Query: 293 VACGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEV--AARQADATVLVMGLDQSI 350
           +A   T  L+   RY  T+   G   +A         A++  AA QAD  V V+GL   +
Sbjct: 552 LATMKTVTLERGHRYPITVDATGGVDLAWKRISTTPEADLDRAAAQADVLVAVVGLTSDL 611

Query: 351 EAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDP 400
           EAE            D+  L LP  Q  ++ + A+A+  P+++V M G P+D++++K + 
Sbjct: 612 EAEETSVTVPGFAGGDKTTLDLPADQIAMLKQ-ARATGKPLIVVAMNGSPIDLAWSKQN- 669

Query: 401 RIGAILWVGYPGQAGGAAIADVLFGRANPG 430
              AIL   YPGQ+GG AIA+VL G+ +PG
Sbjct: 670 -AAAILEAWYPGQSGGLAIANVLSGKTDPG 698


>gi|189464219|ref|ZP_03013004.1| hypothetical protein BACINT_00556 [Bacteroides intestinalis DSM
           17393]
 gi|189438009|gb|EDV06994.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 865

 Score =  197 bits (502), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 115/352 (32%), Positives = 186/352 (52%), Gaps = 24/352 (6%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GLTYWSP +N+ RDPRWGR +ET GEDP LT +   ++V+GLQG+    LK  A  KH+ 
Sbjct: 146 GLTYWSPTINMARDPRWGRNEETYGEDPYLTSRLGVAFVKGLQGDHPDYLKTVATIKHFV 205

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
           A + +N    DR+  ++++  + L + Y   ++ACV E    SVM +YN  NG       
Sbjct: 206 ANNQEN----DRFSSSSQIPTKQLYEYYFPAYEACVKEADAQSVMTAYNAFNGVAPSGST 261

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L + +  +W  DG++VSDC ++GV+        + EEAAA  I +G DL+CG      
Sbjct: 262 WLLGDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGINSGCDLECGGTYREK 321

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              AV+ GL+ E+ ++ AL   +T + +LG FD      P+ +   + +      +LA +
Sbjct: 322 LVAAVKMGLVSEQAIDKALTRVLTARFKLGEFD-PIELVPYNHYDKKLLAGEKFGKLAYE 380

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS-- 305
           AA + IVLLKN    LP+   +  +VA++GP +D     +G Y+G      + LQG+   
Sbjct: 381 AAVKSIVLLKNDNDFLPVDKKKIRSVAIVGPFAD--NNYLGGYSGKPVHNVSLLQGVKDL 438

Query: 306 --RYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFI 355
             +  K  +  G   ++  G++ + A++             G+ Q + A++I
Sbjct: 439 VGKKVKVSYMEGTSVISPVGSEYLVASD-------------GVSQGLTADYI 477



 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 39/125 (31%), Positives = 58/125 (46%), Gaps = 4/125 (3%)

Query: 306 RYAKTIHQAGCFGVACNGNQ-LIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPG 364
            Y      A C  V+  GN   I   +     AD  ++ +G D+ +  E  D   + LP 
Sbjct: 577 EYINKTGAAACMLVSDFGNSDQIDKVKEFVSGADLVLVALGNDEKLARENRDLPSIYLPM 636

Query: 365 RQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLF 424
            Q+ L+  + K +     L+L  G P+   +A  +  + AIL   YPGQ GG A+A +LF
Sbjct: 637 TQELLLKEIYKVNP-RTALILHTGNPLTSKWAAEN--VPAILQAWYPGQEGGKALAGILF 693

Query: 425 GRANP 429
           G  NP
Sbjct: 694 GSENP 698


>gi|198274480|ref|ZP_03207012.1| hypothetical protein BACPLE_00628 [Bacteroides plebeius DSM 17135]
 gi|198272682|gb|EDY96951.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           plebeius DSM 17135]
          Length = 912

 Score =  197 bits (502), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 137/444 (30%), Positives = 219/444 (49%), Gaps = 37/444 (8%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL 71
           ++P +++ RD RWGR +E  GE P L  +   +  +GLQ +    ++VA+  KH+ AY  
Sbjct: 176 YAPILDVGRDQRWGRYEEVYGESPYLVAELGIAMGKGLQTD----MQVASTAKHFIAYSN 231

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
           +          + ++S +++E+ +  PF   + E  +  VM SYN  +G P  +    L 
Sbjct: 232 NKGAREGFARVDPQMSWREVENIHAYPFTRVIQEAGILGVMSSYNDYDGFPIQSSYYWLT 291

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG----PFLAIH 187
             + G     GY+VSD D+V  LY+     +  +EA   +++AGL++ C         + 
Sbjct: 292 QRLRGTMGFRGYVVSDSDAVEYLYSKHKTAKDMKEAVRQSVEAGLNVRCTFRSPESYVLP 351

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGN---LGPRDVCTPAHQQL 244
               ++ G L  E ++  +   + V+   G+FD      P+     L  ++V + AHQQ+
Sbjct: 352 LRELIQEGGLSMETIDNRVRDILRVKFLTGLFD-----TPYQTDLALADKEVNSEAHQQV 406

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           ALQA+ +G+VLLKN+   LPL   +   +AV GPN+D     + +Y  VA   TT L+GI
Sbjct: 407 ALQASREGLVLLKNANNLLPLDKSQIKRIAVCGPNADEASFALTHYGPVAVEVTTVLEGI 466

Query: 305 SRY----AKTIHQAGC--------------FGVACNGNQLIGAAEVAARQADATVLVMGL 346
            +      K  +  GC              + +       I  A    +++D  V+V+G 
Sbjct: 467 KQQVKEGTKVTYTKGCDLVDANWPESEIISYPLTAEEKTEIQKAVDNVKESDVAVVVLGG 526

Query: 347 DQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
                 E   R  L LPG QQ+L+  +  A+  PVVLVL+ G P+ +++A  D  + AIL
Sbjct: 527 GIRTCGENKSRTSLDLPGHQQQLLEAIV-ATGKPVVLVLINGRPLSINWA--DKFVPAIL 583

Query: 407 WVGYPGQAGGAAIADVLFGRANPG 430
              YPG  GG AIA+ LFG  NPG
Sbjct: 584 EAWYPGSQGGTAIAEALFGDYNPG 607


>gi|160882671|ref|ZP_02063674.1| hypothetical protein BACOVA_00625 [Bacteroides ovatus ATCC 8483]
 gi|423289150|ref|ZP_17268000.1| hypothetical protein HMPREF1069_03043 [Bacteroides ovatus
           CL02T12C04]
 gi|423298450|ref|ZP_17276507.1| hypothetical protein HMPREF1070_05172 [Bacteroides ovatus
           CL03T12C18]
 gi|156111986|gb|EDO13731.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
 gi|392662991|gb|EIY56545.1| hypothetical protein HMPREF1070_05172 [Bacteroides ovatus
           CL03T12C18]
 gi|392667846|gb|EIY61351.1| hypothetical protein HMPREF1069_03043 [Bacteroides ovatus
           CL02T12C04]
          Length = 1049

 Score =  197 bits (501), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 146/435 (33%), Positives = 219/435 (50%), Gaps = 24/435 (5%)

Query: 7   AGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKH 65
           AGL + ++P V+I RD RWGR  E  GED  L  + A + V G Q N      V AC KH
Sbjct: 456 AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQWNLWENNSVLACAKH 515

Query: 66  YTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
           + AY L       R +    +S++ L DTY  PFKAC+  G V + M ++N +NG P  A
Sbjct: 516 WVAYGLPQ---AGRDYAPVDMSERTLFDTYLPPFKACIDAG-VLTFMSAFNDINGIPASA 571

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFL- 184
            P +LK+ + GQW  +G++VSD ++V  L   Q      ++A   A  +G+D+D    L 
Sbjct: 572 HPFLLKDLLRGQWNFNGFVVSDWEAVKQLV-AQGVAEDDKDATRLAFNSGIDMDMTDGLY 630

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
             + +  +  G +  EDV+ +++  + ++  LG+F  +P          + +        
Sbjct: 631 NKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF-VDPYKFCNEEYESQTIMKKEFLDA 689

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVACGYTTPLQ 302
           AL  AH+  VLLKN   TLPL+     ++AV+GP +D    ++G++   G     TT LQ
Sbjct: 690 ALDMAHKSAVLLKNDNHTLPLAK-NVRSIAVVGPLADNQTELLGSWRARGEDRHVTTVLQ 748

Query: 303 GIS-----RYAKTIHQAGCFGVACNGNQLIGAAEVA--ARQADATVLVMGLDQSIEAEFI 355
           GI         K  +  GC     +G    G  E    A ++D  + V+G    +  E  
Sbjct: 749 GIKNKIGGNKTKVGYARGC---DFDGEDKSGFKEAVKLASKSDMVIAVVGEKALMSGESR 805

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
            RA L LPG Q+EL+  +  A+  PVV+VLM G P+ + +   D  + AIL   + G + 
Sbjct: 806 SRAQLDLPGVQEELIKELV-ATGKPVVVVLMNGRPLSIEWV--DKNVSAILETWFLGTSA 862

Query: 416 GAAIADVLFGRANPG 430
           G AIAD+LFG  NP 
Sbjct: 863 GTAIADILFGDYNPS 877


>gi|423215778|ref|ZP_17202304.1| hypothetical protein HMPREF1074_03836 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392691421|gb|EIY84666.1| hypothetical protein HMPREF1074_03836 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 1049

 Score =  197 bits (501), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 146/435 (33%), Positives = 219/435 (50%), Gaps = 24/435 (5%)

Query: 7   AGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKH 65
           AGL + ++P V+I RD RWGR  E  GED  L  + A + V G Q N      V AC KH
Sbjct: 456 AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQWNLWENNSVLACAKH 515

Query: 66  YTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
           + AY L       R +    +S++ L DTY  PFKAC+  G V + M ++N +NG P  A
Sbjct: 516 WVAYGLPQ---AGRDYAPVDMSERTLFDTYLPPFKACIDAG-VLTFMSAFNDINGIPASA 571

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFL- 184
            P +LK+ + GQW  +G++VSD ++V  L   Q      ++A   A  +G+D+D    L 
Sbjct: 572 HPFLLKDLLRGQWNFNGFVVSDWEAVKQLV-AQGVAEDDKDATRLAFNSGIDMDMTDGLY 630

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
             + +  +  G +  EDV+ +++  + ++  LG+F  +P          + +        
Sbjct: 631 NKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF-VDPYKFCNEEYESQTIMKKEFLDA 689

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVACGYTTPLQ 302
           AL  AH+  VLLKN   TLPL+     ++AV+GP +D    ++G++   G     TT LQ
Sbjct: 690 ALDMAHKSAVLLKNDNHTLPLAK-NVRSIAVVGPLADNQTELLGSWRARGEDRHVTTVLQ 748

Query: 303 GIS-----RYAKTIHQAGCFGVACNGNQLIGAAEVA--ARQADATVLVMGLDQSIEAEFI 355
           GI         K  +  GC     +G    G  E    A ++D  + V+G    +  E  
Sbjct: 749 GIKNKIGGNKTKVGYARGC---DFDGEDKSGFKEAVKLASKSDMVIAVVGEKALMSGESR 805

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
            RA L LPG Q+EL+  +  A+  PVV+VLM G P+ + +   D  + AIL   + G + 
Sbjct: 806 SRAQLDLPGVQEELIKELV-ATGKPVVVVLMNGRPLSIEWV--DKNVSAILETWFLGTSA 862

Query: 416 GAAIADVLFGRANPG 430
           G AIAD+LFG  NP 
Sbjct: 863 GTAIADILFGDYNPS 877


>gi|423217451|ref|ZP_17203947.1| hypothetical protein HMPREF1061_00720 [Bacteroides caccae
           CL03T12C61]
 gi|392628610|gb|EIY22636.1| hypothetical protein HMPREF1061_00720 [Bacteroides caccae
           CL03T12C61]
          Length = 946

 Score =  197 bits (501), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 146/448 (32%), Positives = 221/448 (49%), Gaps = 32/448 (7%)

Query: 6   MAGLT-YWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           M G T  ++P +++ RD RWGR +E  GE P L  +     VRG+Q N     +VAA  K
Sbjct: 204 MLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH----QVAATGK 259

Query: 65  HYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           H+ AY  +          + ++S +++E  +  PFK  + E  +  VM SYN  +G P  
Sbjct: 260 HFIAYSNNKGAREGMARVDPQMSPREVEMLHAYPFKRVIREAGLLGVMSSYNDYDGFPIQ 319

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG--- 181
           +    L   + G+    GY+VSD D+V  LY      +  +EA   +++AGL++ C    
Sbjct: 320 SSYYWLTTRLRGEMGFRGYVVSDSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRS 379

Query: 182 -PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
                +     V+ G L EE +N  +   + V+  +G+FD        G     +V    
Sbjct: 380 PDSYVLPLRELVKEGGLSEEVINDRVRDILRVKFLVGLFDTPYQTDLKG--ADEEVEKKE 437

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           ++++ALQA+ + IVLLKN    LPL   +   +AV GPN+D     + +Y  +A   T+ 
Sbjct: 438 NEEVALQASRESIVLLKNEKNVLPLDPSKIRKIAVCGPNADEHSYALTHYGPLAVEVTSV 497

Query: 301 LQGISRYAK----TIHQAGCFGVACN--------------GNQLIGAAEVAARQADATVL 342
           L+GI    K     ++  GC  V  N                + I  A   A+QAD  ++
Sbjct: 498 LKGIQEKMKDKADVLYTKGCDLVDANWPESELIDYPLTDEEQKEIDKAVSQAKQADVAIV 557

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           V+G  Q    E   R+ L LPGRQ +L+  V  A+  PVVLVL+ G P+ +++A  D  +
Sbjct: 558 VLGGGQRTCGENKSRSSLDLPGRQLDLLKAVV-ATGKPVVLVLINGRPLSINWA--DKFV 614

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            AIL   YPG  GG A+AD+LFG  NPG
Sbjct: 615 PAILEAWYPGSKGGIAVADILFGDYNPG 642


>gi|336412663|ref|ZP_08593016.1| hypothetical protein HMPREF1017_00124 [Bacteroides ovatus
           3_8_47FAA]
 gi|335942709|gb|EGN04551.1| hypothetical protein HMPREF1017_00124 [Bacteroides ovatus
           3_8_47FAA]
          Length = 735

 Score =  197 bits (501), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 147/435 (33%), Positives = 227/435 (52%), Gaps = 27/435 (6%)

Query: 6   MAGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAACC 63
           M+G+ + +SP +++ RDPRWGR  E  GEDP   G + A+ V+G QG +  +  ++AAC 
Sbjct: 154 MSGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFGAASVKGYQGDDLSAENRMAACL 213

Query: 64  KHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
           KHY  Y         R +    +SKQ L DTY +P++  V  G  A++M S+N ++G P 
Sbjct: 214 KHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYEMGVKAG-AATLMSSFNDISGVPG 269

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPF 183
            A+P I+   +  +W  DG+IVSD  ++  L N Q    T +EAA  A  AGL++D    
Sbjct: 270 SANPYIMTEILKKRWGHDGFIVSDWGAIEQLKN-QGLAATKKEAAWHAFTAGLEMDMMSH 328

Query: 184 -LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
               H +  V  G +    V+ A+   + ++ RLG+F  E    P  +   R    P   
Sbjct: 329 AYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLF--ERPYTPATSEKER-FFRPQSM 385

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAG------VACG 296
            +A + A + +VLLKN  +TLPL+  +   +AVIGP +     ++G++ G      VA  
Sbjct: 386 DIAARLAAESMVLLKNENKTLPLTDKKK--IAVIGPMAKNGWDLLGSWCGHGKDTDVAML 443

Query: 297 YTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEV--AARQADATVLVMGLDQSIEAEF 354
           Y       +  A+  + AGC   A  G+   G AE   AAR +D  VL +G   +   E 
Sbjct: 444 YNGLATEFAGKAELRYAAGC---ATKGDNREGFAEALEAARWSDVVVLCLGEMMTWSGEN 500

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
             R+ + LP  Q+EL + + KA + P+VLVL+ G P++++  + +P   AIL +  PG  
Sbjct: 501 ASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPLELN--RLEPISDAILEIWQPGVN 557

Query: 415 GGAAIADVLFGRANP 429
           G   +A +L GR NP
Sbjct: 558 GALPMAGILSGRINP 572


>gi|383115617|ref|ZP_09936373.1| hypothetical protein BSGG_2514 [Bacteroides sp. D2]
 gi|313694979|gb|EFS31814.1| hypothetical protein BSGG_2514 [Bacteroides sp. D2]
          Length = 946

 Score =  197 bits (501), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 145/448 (32%), Positives = 222/448 (49%), Gaps = 32/448 (7%)

Query: 6   MAGLT-YWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCK 64
           M G T  ++P +++ RD RWGR +E  GE P L  +     VRG+Q +     +VAA  K
Sbjct: 204 MLGYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQQD----YQVAATGK 259

Query: 65  HYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTC 124
           H+ AY  +          + ++S +++E  +  PFK  + E  +  VM SYN  +G P  
Sbjct: 260 HFIAYSNNKGGREGMSRVDPQMSPREVEMVHVYPFKRVIREAGLLGVMSSYNDYDGFPIQ 319

Query: 125 ADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG--- 181
           +    L   + G+    GY+VSD D+V  LY   +  +  +EA   +++AGL++ C    
Sbjct: 320 SSYYWLTTRLRGEMGFRGYVVSDSDAVEYLYTKHNTAKDMKEAVRQSVEAGLNVRCTFRS 379

Query: 182 -PFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA 240
                +     V+ G L EE +N  +   + V+  +G+FD        G     +V    
Sbjct: 380 PDSYVLPLRELVKEGGLSEEVINDRVRDILRVKFLVGLFDHPYQTDLKG--ADEEVEKAE 437

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTP 300
           ++++ALQA+ + IVLLKN    LPL       +AV GPN+D     +G+Y  +A   T+ 
Sbjct: 438 NEEVALQASRESIVLLKNDQDVLPLDISGIKKIAVCGPNADECSYALGHYGPLAVEVTSV 497

Query: 301 LQGISRYA----KTIHQAGC--------------FGVACNGNQLIGAAEVAARQADATVL 342
           L+GI        + ++  GC              F +     + I  A   A++AD  V+
Sbjct: 498 LKGIQEKTDGKVEVLYSKGCELVDANWPESELIDFPLTEEEQKEIDRAVSQAKEADVAVV 557

Query: 343 VMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRI 402
           V+G  Q    E   R+ L LPGRQ +L+  V  A+  PVVLVL+ G P+ +++A  D  +
Sbjct: 558 VLGGGQRTCGENKSRSSLDLPGRQLDLLKAVV-ATGKPVVLVLINGRPLSINWA--DKFV 614

Query: 403 GAILWVGYPGQAGGAAIADVLFGRANPG 430
            AIL   YPG  GG A+ADVLFG  NPG
Sbjct: 615 PAILEAWYPGAKGGKAVADVLFGDYNPG 642


>gi|237718444|ref|ZP_04548925.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
 gi|229452377|gb|EEO58168.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
          Length = 746

 Score =  197 bits (501), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 146/435 (33%), Positives = 219/435 (50%), Gaps = 24/435 (5%)

Query: 7   AGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKH 65
           AGL + ++P V+I RD RWGR  E  GED  L  + A + V G Q N      V AC KH
Sbjct: 153 AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQWNLWENNSVLACAKH 212

Query: 66  YTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
           + AY L       R +    +S++ L DTY  PFKAC+  G V + M ++N +NG P  A
Sbjct: 213 WVAYGLPQ---AGRDYAPVDMSERTLFDTYLPPFKACIDAG-VLTFMSAFNDINGIPASA 268

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFL- 184
            P +LK+ + GQW  +G++VSD ++V  L   Q      ++A   A  +G+D+D    L 
Sbjct: 269 HPFLLKDLLRGQWNFNGFVVSDWEAVKQLV-AQGVAEDDKDATRLAFNSGIDMDMTDGLY 327

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
             + +  +  G +  EDV+ +++  + ++  LG+F  +P          + +        
Sbjct: 328 NKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF-VDPYKFCNEEYESQTIMKKEFLDA 386

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVACGYTTPLQ 302
           AL  AH+  VLLKN   TLPL+     ++AV+GP +D    ++G++   G     TT LQ
Sbjct: 387 ALDMAHKSAVLLKNDNHTLPLAK-NVRSIAVVGPLADNQTELLGSWRARGEDRHVTTVLQ 445

Query: 303 GIS-----RYAKTIHQAGCFGVACNGNQLIGAAEVA--ARQADATVLVMGLDQSIEAEFI 355
           GI         K  +  GC     +G    G  E    A ++D  + V+G    +  E  
Sbjct: 446 GIKNKIGGNKTKVGYARGC---DFDGEDKSGFKEAVKLASKSDMVIAVVGEKALMSGESR 502

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
            RA L LPG Q+EL+  +  A+  PVV+VLM G P+ + +   D  + AIL   + G + 
Sbjct: 503 SRAQLDLPGVQEELIKELV-ATGKPVVVVLMNGRPLSIEWV--DKNVSAILETWFLGTSA 559

Query: 416 GAAIADVLFGRANPG 430
           G AIAD+LFG  NP 
Sbjct: 560 GTAIADILFGDYNPS 574


>gi|299149090|ref|ZP_07042152.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298513851|gb|EFI37738.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 1049

 Score =  197 bits (501), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 146/435 (33%), Positives = 219/435 (50%), Gaps = 24/435 (5%)

Query: 7   AGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKH 65
           AGL + ++P V+I RD RWGR  E  GED  L  + A + V G Q N      V AC KH
Sbjct: 456 AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQWNLWENNSVLACAKH 515

Query: 66  YTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
           + AY L       R +    +S++ L DTY  PFKAC+  G V + M ++N +NG P  A
Sbjct: 516 WVAYGLPQ---AGRDYAPVDMSERTLFDTYLPPFKACIDAG-VLTFMSAFNDINGIPASA 571

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFL- 184
            P +LK+ + GQW  +G++VSD ++V  L   Q      ++A   A  +G+D+D    L 
Sbjct: 572 HPFLLKDLLRGQWNFNGFVVSDWEAVKQLV-AQGVAEDDKDATRLAFNSGIDMDMTDGLY 630

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
             + +  +  G +  EDV+ +++  + ++  LG+F  +P          + +        
Sbjct: 631 NKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF-VDPYKFCNEEYESQTIMKKEFLDA 689

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVACGYTTPLQ 302
           AL  AH+  VLLKN   TLPL+     ++AV+GP +D    ++G++   G     TT LQ
Sbjct: 690 ALDMAHKSAVLLKNDNHTLPLAK-NVRSIAVVGPLADNQTELLGSWRARGEDRHVTTVLQ 748

Query: 303 GIS-----RYAKTIHQAGCFGVACNGNQLIGAAEVA--ARQADATVLVMGLDQSIEAEFI 355
           GI         K  +  GC     +G    G  E    A ++D  + V+G    +  E  
Sbjct: 749 GIKNKIGGNKTKVGYARGC---DFDGEDKSGFKEAVKLASKSDMVIAVVGEKALMSGESR 805

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
            RA L LPG Q+EL+  +  A+  PVV+VLM G P+ + +   D  + AIL   + G + 
Sbjct: 806 SRAQLDLPGVQEELIKELV-ATGKPVVVVLMNGRPLSIEWV--DKNVSAILETWFLGTSA 862

Query: 416 GAAIADVLFGRANPG 430
           G AIAD+LFG  NP 
Sbjct: 863 GTAIADILFGDYNPS 877


>gi|293371439|ref|ZP_06617870.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292633636|gb|EFF52194.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 1049

 Score =  197 bits (501), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 146/435 (33%), Positives = 219/435 (50%), Gaps = 24/435 (5%)

Query: 7   AGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKH 65
           AGL + ++P V+I RD RWGR  E  GED  L  + A + V G Q N      V AC KH
Sbjct: 456 AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQWNLWENNSVLACAKH 515

Query: 66  YTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
           + AY L       R +    +S++ L DTY  PFKAC+  G V + M ++N +NG P  A
Sbjct: 516 WVAYGLPQ---AGRDYAPVDMSERTLFDTYLPPFKACIDAG-VLTFMSAFNDINGIPASA 571

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFL- 184
            P +LK+ + GQW  +G++VSD ++V  L   Q      ++A   A  +G+D+D    L 
Sbjct: 572 HPFLLKDLLRGQWNFNGFVVSDWEAVKQLV-AQGVAEDDKDATRLAFNSGIDMDMTDGLY 630

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
             + +  +  G +  EDV+ +++  + ++  LG+F  +P          + +        
Sbjct: 631 NKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF-VDPYKFCNEEYESQTIMKKEFLDA 689

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVACGYTTPLQ 302
           AL  AH+  VLLKN   TLPL+     ++AV+GP +D    ++G++   G     TT LQ
Sbjct: 690 ALDMAHKSAVLLKNDNHTLPLAK-NVRSIAVVGPLADNQTELLGSWRARGEDRHVTTVLQ 748

Query: 303 GIS-----RYAKTIHQAGCFGVACNGNQLIGAAEVA--ARQADATVLVMGLDQSIEAEFI 355
           GI         K  +  GC     +G    G  E    A ++D  + V+G    +  E  
Sbjct: 749 GIKNKIGGNKTKVGYARGC---DFDGEDKSGFKEAVKLASKSDMVIAVVGEKALMSGESR 805

Query: 356 DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAG 415
            RA L LPG Q+EL+  +  A+  PVV+VLM G P+ + +   D  + AIL   + G + 
Sbjct: 806 SRAQLDLPGVQEELIKELV-ATGKPVVVVLMNGRPLSIEWV--DKNVSAILETWFLGTSA 862

Query: 416 GAAIADVLFGRANPG 430
           G AIAD+LFG  NP 
Sbjct: 863 GTAIADILFGDYNPS 877


>gi|431797765|ref|YP_007224669.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
           DSM 17526]
 gi|430788530|gb|AGA78659.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
           DSM 17526]
          Length = 799

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 150/455 (32%), Positives = 232/455 (50%), Gaps = 51/455 (11%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           + P +++ R+PRW R +ET GEDP +  +   + V G QG +  S   V +  KH+TAY 
Sbjct: 195 YGPVLDLAREPRWSRVEETYGEDPYINSQMGRAMVSGFQGESIASGKNVISTLKHFTAYG 254

Query: 71  LDN--WNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPD 128
           +     NG      +  V +++L ++Y  PFKA V EG + SVM +YN ++G P  ++  
Sbjct: 255 VPEGGHNGT-----SVSVGQRELHESYLPPFKAAVAEGAL-SVMTAYNSIDGVPCTSNGH 308

Query: 129 ILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPF-LAIH 187
           +L + +   W  +G++VSD  S+  L  + H T T E AA  AI AG+D D G +    +
Sbjct: 309 LLNDVLRDDWGFNGFVVSDLGSISGLRGSHHVTETAEGAAQLAINAGVDSDLGGYGFGKN 368

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
              AV+ G + +E ++ A+   + V+  +G+F+  P   P  +     V +  H  LA +
Sbjct: 369 LLAAVQAGGVSQEVLDEAVRRVLKVKFDMGLFEN-PYVDP--SKAESLVRSAKHIALARK 425

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC--GYTTPLQGIS 305
            A + +VLLKN    LPL   + +++AVIGPN+D T   +G+Y          T L+GI 
Sbjct: 426 VARESVVLLKNENDLLPLRK-KVNSIAVIGPNADNTYNQLGDYTAPQPNENVVTVLEGIK 484

Query: 306 ----RYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG------LDQSIE---- 351
               +  +  +  GC       +++  AA +AAR +D  V+V+G       D   E    
Sbjct: 485 NKVGKDVRVNYVKGCAIRDTTQSEIGKAASLAAR-SDVAVVVLGGSSARDFDTEYEETAA 543

Query: 352 -----------------AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVS 394
                             E  DR  L L G Q +LV  V +A+  PVV+VL+ G P++++
Sbjct: 544 AKVSEAEEGQVISDMESGEGFDRMTLDLLGDQLKLVQAV-QATGTPVVVVLIKGRPLNLN 602

Query: 395 FAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +   D  + AI+   YPGQ GG AIADVLFG  NP
Sbjct: 603 WI--DEHVPAIVDAWYPGQEGGNAIADVLFGDYNP 635


>gi|429105352|ref|ZP_19167221.1| Periplasmic beta-glucosidase [Cronobacter malonaticus 681]
 gi|426292075|emb|CCJ93334.1| Periplasmic beta-glucosidase [Cronobacter malonaticus 681]
          Length = 765

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 147/457 (32%), Positives = 224/457 (49%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGRG E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVTRDPRWGRGSEGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+K  +  G    VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ + G W   G  +SD  ++  L   +H T + PE+A   AIK+G+D+     + 
Sbjct: 267 GWLLKDLLRGDWGFKGITISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD-------VC 237
           + +    ++ G +   +++ A  + + V+  +G+F+      P+ +LGP+D         
Sbjct: 325 SKYLPNLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPQDTNAE 379

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
           +  H+  A + A + +VLLKN   TLPL   +  T+AV+GP +D    M+G++  AGVA 
Sbjct: 380 SRLHRDDARKVARESLVLLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVAD 437

Query: 296 GYTTPLQGISRY----AKTIHQAGCFGVACNG-------------------NQLIGAAEV 332
              T LQG+       AK ++  G       G                    ++I  A  
Sbjct: 438 QSVTLLQGMKNVAGDKAKILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVN 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  L +P  Q++L+S + KA+  P+VLVLM G P  
Sbjct: 498 VAKQSDVVVAVVGEAQGMAHEASSRTDLTIPQSQRDLISAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|417519247|ref|ZP_12181440.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Uganda str. R8-3404]
 gi|353647113|gb|EHC90324.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Uganda str. R8-3404]
          Length = 755

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 149/458 (32%), Positives = 228/458 (49%), Gaps = 61/458 (13%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 257 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA---- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   PA    
Sbjct: 315 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKE-SDPADTNA 368

Query: 241 ----HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVA 294
               H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA
Sbjct: 369 ESRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVA 426

Query: 295 CGYTTPLQGISR---------YAKTIHQAGCFGVACNGN--------------QLIGAAE 331
               T L GI           YAK  +     G+    N               +I  A 
Sbjct: 427 NQSVTVLAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAV 486

Query: 332 VAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPV 391
            AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P 
Sbjct: 487 QAAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP- 544

Query: 392 DVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 -LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|404254492|ref|ZP_10958460.1| glycosyl hydrolase family protein [Sphingomonas sp. PAMC 26621]
          Length = 898

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 122/318 (38%), Positives = 171/318 (53%), Gaps = 34/318 (10%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYT 67
           GL  WSPN+NIFRDPRWGRGQET GEDP L      ++V G+QG       V A  KH+ 
Sbjct: 143 GLDTWSPNINIFRDPRWGRGQETYGEDPYLAAHMGVAFVTGMQGPNPDLPDVIATPKHFA 202

Query: 68  AYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
            +         R+  N  VS  DLEDTY   F+A +VEG+  S+MC+YN+++G+P CA+ 
Sbjct: 203 VHSGPEST---RHGANVFVSAHDLEDTYLPAFRAAIVEGRAGSIMCAYNRIDGQPACAND 259

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +LK+ + G W   GY+VSDCD+V  + +   Y      A A A+KAG D +C      H
Sbjct: 260 LLLKDHLRGAWGFTGYVVSDCDAVKDIADGHKYAPDQATAIAAAMKAGTDNEC------H 313

Query: 188 TE-------------GAVRGGLLREEDVNLALAYTITVQMRLGMFD---GEPSAQPFGNL 231
           T+              ++R GL+ ++D++ +L    + ++R G      G P+A P    
Sbjct: 314 TQTIGDMGGLPERFGDSLRRGLISQDDIDRSLVRLFSARLRNGDLPGIAGAPAAVPV--- 370

Query: 232 GPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA 291
             R + +P H  LALQAA + +VLLKN    LPL       +AVIGP  D T  + GNY+
Sbjct: 371 --RAILSPDHMALALQAAEKSLVLLKNDG-VLPLRP--GARIAVIGPLGDATRVLRGNYS 425

Query: 292 GVACGYTTPL-QGISRYA 308
                   P+ +G+ R A
Sbjct: 426 STQSAPPVPIVEGLRRAA 443



 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 37/96 (38%), Positives = 57/96 (59%), Gaps = 13/96 (13%)

Query: 345 GLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVS 394
           GL   +EAE            D+  L LP  Q  ++ R AKA+  PV++V M G P++++
Sbjct: 635 GLTSDLEAEETGTDIPGFQGGDKTSLDLPAEQMAMLER-AKATGKPVIVVAMNGSPINLA 693

Query: 395 FAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
           +AK++    AI+   YPGQ+GG A+A+VL G+ +PG
Sbjct: 694 WAKDN--AAAIVEAWYPGQSGGLAVANVLTGKTDPG 727


>gi|206901921|ref|YP_002251428.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
 gi|206741024|gb|ACI20082.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
          Length = 756

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 145/428 (33%), Positives = 217/428 (50%), Gaps = 33/428 (7%)

Query: 13  SPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDLD 72
           SP +NI RDPR GR +ET GEDP L  + A +Y++G+Q        V A  KH+ A  + 
Sbjct: 187 SPTINIARDPRCGRTEETYGEDPYLASRMAVAYIKGVQEQG-----VIATPKHFVANFVG 241

Query: 73  NWNGVDRY--HFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
           +  G D Y  HF+ R+    L + Y   F+A + E    S+M +YN ++G P  ++  +L
Sbjct: 242 D-GGRDSYPIHFSERL----LREIYFPAFRASIEEAGALSLMAAYNSLDGIPCSSNKWLL 296

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL---DCGPFLAIH 187
              +  +W   GY+VSD  SV  L        +  EAA  +++AGLD+   D   F  I 
Sbjct: 297 TRILRKEWGFKGYVVSDYFSVLHLMTKHKVAESKAEAAKLSLEAGLDMELPDSDCFEEI- 355

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
             G +R   L ++ ++ A+   + V+  +G+FD  P   P  +   R      H++LAL+
Sbjct: 356 -PGLIRESKLSQDTLDEAVRRVLRVKFWIGLFDN-PFVDP--DYAERINDCSEHRELALR 411

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISR- 306
            A + IVLLKN    LPL+     ++AVIGPN+   V  +G Y+G      TPL+GI   
Sbjct: 412 VARESIVLLKNEG-ILPLNK-DIRSIAVIGPNA--AVPRLGGYSGYGVKVVTPLEGIKNK 467

Query: 307 ---YAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLD-QSIEAEFIDRAGLLL 362
                K     GC G+          A   A+++D  +L MG      E E  DR  L L
Sbjct: 468 LGDKVKVYFAEGC-GLNDTSKSGFDEAIKIAQKSDVAILFMGNSVPETEGEQRDRHNLNL 526

Query: 363 PGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADV 422
           PG Q++L+  +   +  PV++VL+ G    ++      ++ A++   YPG+ GG AIADV
Sbjct: 527 PGVQEDLIKEICN-TNTPVIVVLINGSA--ITMMNWIDKVQAVIEAWYPGEEGGNAIADV 583

Query: 423 LFGRANPG 430
           LFG  NPG
Sbjct: 584 LFGDYNPG 591


>gi|393789623|ref|ZP_10377743.1| hypothetical protein HMPREF1068_04023 [Bacteroides nordii
           CL02T12C05]
 gi|392650339|gb|EIY44008.1| hypothetical protein HMPREF1068_04023 [Bacteroides nordii
           CL02T12C05]
          Length = 863

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 174/314 (55%), Gaps = 12/314 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT+W+PN+NIFRDPRWGRGQET GEDP LT +   + V+GLQG   ++  K+ AC KHY
Sbjct: 131 GLTFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPADAKYDKLHACAKHY 190

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +         R+ FNA  +S +DL +TY   FK  V E  V  VMC+YN+  G P C 
Sbjct: 191 AVHSGPE---AKRHSFNAENISPRDLWETYLPAFKTLVQEADVKEVMCAYNRFEGDPCCG 247

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVG-VLYNTQHYT-RTPEEAAADAIKAGLDLDCGPF 183
              +L   +   W     +VSDC ++    Y+ +H T +   +A+A A+ +G DL+CG  
Sbjct: 248 SNRLLTQILRDDWGYKHVVVSDCGAISDFFYSDRHATHKNAADASAAAVLSGTDLECGIE 307

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A H + AV+ GL+ EE +N +L   +  +  LG  D + +  P+  +    V    H+Q
Sbjct: 308 YA-HLDKAVKEGLISEERINTSLFRLLKARFELGEMDDD-ALVPWSKISIDTVDCQMHKQ 365

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           +AL    + +VLL N+   LPL       +AV+GPN++ +V   GNY G      T L+G
Sbjct: 366 MALDITRKSMVLLHNNG-VLPLGKTGAR-IAVMGPNANDSVMQWGNYEGTPSHTVTVLEG 423

Query: 304 I-SRYAKTIHQAGC 316
           I ++    I++ GC
Sbjct: 424 IRNKIGNVIYEKGC 437



 Score = 48.1 bits (113), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 54/106 (50%), Gaps = 14/106 (13%)

Query: 335 RQADATVLVMGLDQSIEAE---------FI--DRAGLLLPGRQQELVSRVAKASRGPVVL 383
           ++AD  + V G+  ++E E         F+  DR  + LP  Q+ ++  + KA +   V+
Sbjct: 597 KEADVVIFVGGISPNLEGEEKNFVNCPGFVGGDRTSIELPEVQRNILKALKKAGK--KVI 654

Query: 384 VLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            + C G   ++         AIL   YPGQAGG A+AD++FG  NP
Sbjct: 655 FVNCSGSA-MALVPETQSCDAILQAWYPGQAGGTAVADIIFGDYNP 699


>gi|161613146|ref|YP_001587111.1| hypothetical protein SPAB_00854 [Salmonella enterica subsp.
           enterica serovar Paratyphi B str. SPB7]
 gi|416529744|ref|ZP_11744511.1| hypothetical protein SEEM010_09961 [Salmonella enterica subsp.
           enterica serovar Montevideo str. LQC 10]
 gi|417466542|ref|ZP_12165045.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Montevideo str. S5-403]
 gi|161362510|gb|ABX66278.1| hypothetical protein SPAB_00854 [Salmonella enterica subsp.
           enterica serovar Paratyphi B str. SPB7]
 gi|353629977|gb|EHC77663.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Montevideo str. S5-403]
 gi|363552147|gb|EHL36453.1| hypothetical protein SEEM010_09961 [Salmonella enterica subsp.
           enterica serovar Montevideo str. LQC 10]
          Length = 755

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 149/458 (32%), Positives = 228/458 (49%), Gaps = 61/458 (13%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 257 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA---- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   PA    
Sbjct: 315 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKE-SDPADTNA 368

Query: 241 ----HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVA 294
               H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA
Sbjct: 369 ESRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVA 426

Query: 295 CGYTTPLQGISR---------YAKTIHQAGCFGVACNGN--------------QLIGAAE 331
               T L GI           YAK  +     G+    N               +I  A 
Sbjct: 427 NQSVTVLAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAV 486

Query: 332 VAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPV 391
            AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P 
Sbjct: 487 QAAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP- 544

Query: 392 DVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 -LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|224583333|ref|YP_002637131.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Paratyphi C strain RKS4594]
 gi|224467860|gb|ACN45690.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Paratyphi C strain RKS4594]
          Length = 765

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 150/457 (32%), Positives = 230/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 267 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 437

Query: 296 GYTTPLQGISRY----AKTIHQAGC--------------FGVAC-----NGNQLIGAAEV 332
              T L GI       AK +H  G               +  A      +   +I  A  
Sbjct: 438 QSVTVLAGIQNAVGDGAKILHAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 AAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|198425902|ref|XP_002120563.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 996

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 149/450 (33%), Positives = 222/450 (49%), Gaps = 40/450 (8%)

Query: 4   GGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACC 63
           G   GL+ +SP +NI R P WGR QET GEDPV++   A +YV GLQG+        A C
Sbjct: 548 GDHTGLSCFSPVINILRHPLWGRNQETFGEDPVMSSLMARAYVTGLQGDE-IYYPATANC 606

Query: 64  KHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
           KH+ AYD        R  FNA VS +DL  TY   F+ CV  G    ++CSYN +NG+P 
Sbjct: 607 KHFAAYDGPENIPSSRLSFNANVSIEDLGRTYFPAFRECVHSGAFG-IVCSYNAINGEPA 665

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPF 183
           CA    L+  +  ++   GY+ SD  ++        YT++   +A  A  AG+DL+   +
Sbjct: 666 CAS-SYLQTILRDKFNFKGYVSSDESAIEFFDIYFKYTKSNLLSAVAAFDAGVDLELTSY 724

Query: 184 LAIH----TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNL--GPRDVC 237
              +       AV  GL+ E  +  +       +M LG FD     Q F +    P DV 
Sbjct: 725 GKNNRYSLLNQAVEQGLVTEAALRRSAKRLFRTRMALGEFD----PQEFNHWLNVPIDVV 780

Query: 238 -TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY-AGVAC 295
            + AH++ A++ A +  VLLKN    LPL+  R+  VA++GP  + +  + G+Y      
Sbjct: 781 QSLAHRKQAVEVAAKSFVLLKNDG-VLPLNH-RYENVAIVGPFINNSEALTGDYHPNYNL 838

Query: 296 GY-TTPL--------QGISRYAKTIHQAGCFGVACNGNQLIGAA------EVAARQADAT 340
            Y ++PL         G++R+       GC G   N N  I A       +     +D  
Sbjct: 839 KYFSSPLFAANSLSSSGVARFT-----TGCVGTN-NQNLPICATYNSTHVKEVVTGSDIV 892

Query: 341 VLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDP 400
           ++ +G    IEAE  DR  + LPG+Q +++  V K + GPV++VL   GP+DVS+   + 
Sbjct: 893 LVTLGTGTGIEAEGRDRLSMELPGKQLDMIKDVVKYANGPVIVVLFNAGPLDVSWVMGN- 951

Query: 401 RIGAILWVGYPGQAGGAAIADVLFGRANPG 430
              A++   +  Q  G A+ +VL G  NP 
Sbjct: 952 -TAAVIACHFSAQMTGEAMLEVLTGIVNPA 980


>gi|238620766|ref|YP_002915592.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.4]
 gi|238381836|gb|ACR42924.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           M.16.4]
          Length = 755

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 141/455 (30%), Positives = 228/455 (50%), Gaps = 60/455 (13%)

Query: 13  SPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL- 71
           SP +++ RDPRWGR +ET GED  L      +YV+GLQG      ++ A  KH+ A+   
Sbjct: 140 SPVLDVCRDPRWGRCEETYGEDQYLVASIGLAYVKGLQGEN----ELIATVKHFAAHGFP 195

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
           +    +   H    V  ++L + +  PF+  +  GK  SVM +Y++++G P  ++ ++L 
Sbjct: 196 EGGRNIAPVH----VGNRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLT 251

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD-----LDC--GPFL 184
             +  +W  +G +VSD D++  L      +   +EAA  A++AG+D     +DC   P L
Sbjct: 252 KILRQEWGFEGIVVSDYDAIRQLEAIHKVSLNKKEAAILALEAGVDTEFPNIDCFGEPLL 311

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
                 AV+ GL+ E  ++ A+   + ++ +LG+F+     +   N  P  +     ++L
Sbjct: 312 E-----AVKEGLISESIIDRAVERVLRIKEKLGLFNDHYINE---NNVPEKLDNSKSREL 363

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY-------AGVACGY 297
           AL  A + IVLLKN    LPL+     T+AVIGPN++    ++G+Y       A V    
Sbjct: 364 ALDVARKSIVLLKND-NILPLNK-NIGTIAVIGPNANEPRNLLGDYTYTGHLNADVGIEV 421

Query: 298 TTPLQGISRYAK----TIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLD------ 347
            T L+GI R        ++  GC  +A    +    A   A++ D  + VMG        
Sbjct: 422 VTVLEGIMRKVSNNTNVLYAKGC-DIAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLS 480

Query: 348 -------------QSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVS 394
                        Q++  E  DR  L LPG Q+EL+  + K  + P++LVL+ G P+ +S
Sbjct: 481 WTDVPGKDEFEKYQAVTGEGNDRTSLRLPGVQEELLKELHKTGK-PIILVLVNGRPLALS 539

Query: 395 FAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              N+  + AI+   +PG+ GG AIADV+FG  NP
Sbjct: 540 SIFNE--VNAIIDAWFPGEEGGNAIADVIFGDYNP 572


>gi|401676349|ref|ZP_10808334.1| glucosidase [Enterobacter sp. SST3]
 gi|400216388|gb|EJO47289.1| glucosidase [Enterobacter sp. SST3]
          Length = 765

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 150/458 (32%), Positives = 228/458 (49%), Gaps = 61/458 (13%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGRG E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRGSEGFGEDTFLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T + PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA---- 240
           + +  G V+ G +   +++ A  + + V+  +G+F+      P+ +LGP+D   PA    
Sbjct: 325 SKYLPGLVKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKD-SDPADTNA 378

Query: 241 ----HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVA 294
               H++ A   A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA
Sbjct: 379 ESRLHRKEARDVARESLVLLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVA 436

Query: 295 CGYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAE 331
               T L GI           YAK  +      +    NQ              +I  A 
Sbjct: 437 DQSVTVLTGIKSAVGENAKVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRTPKEMIDEAV 496

Query: 332 VAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPV 391
            AA+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P 
Sbjct: 497 NAAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP- 554

Query: 392 DVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 -LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|417286612|ref|ZP_12073901.1| glycosyl hydrolase family 3, C-terminal domain protein [Escherichia
           coli TW07793]
 gi|386250071|gb|EII96240.1| glycosyl hydrolase family 3, C-terminal domain protein [Escherichia
           coli TW07793]
          Length = 765

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 230/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYT-RTPEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAEDPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN  +TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLKTLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|378984729|ref|YP_005247884.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. T000240]
 gi|312913157|dbj|BAJ37131.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. T000240]
          Length = 765

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 150/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 267 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGN--------------QLIGAAEV 332
              T L GI           YAK  +     G+    N               +I  A  
Sbjct: 438 QSVTVLAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 AAKQADVVVAVVGESQEMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|374311316|ref|YP_005057746.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
 gi|358753326|gb|AEU36716.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
          Length = 773

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 144/425 (33%), Positives = 218/425 (51%), Gaps = 26/425 (6%)

Query: 13  SPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAY-D 70
           +P +++ RDPRWGR +E  GEDP LTG+   +YVRG QG +  +   V A  KH+ A+  
Sbjct: 197 APVLDLARDPRWGRIEEDFGEDPYLTGQMGLAYVRGAQGESLNTDHNVVAEPKHFAAHGS 256

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
            +        H   R    +L       F+    +G   + M +Y++++G P  ADP +L
Sbjct: 257 PEGGTNTSPVHIGER----ELRSVMLKSFEPAFRQGHAMATMAAYHEIDGIPVTADPYLL 312

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEG 190
           K  +  +W   G ++SD  ++  LY       +P+ A+  AIK+G+D+    F     + 
Sbjct: 313 KTILRQEWGFQGMVLSDLGAIRRLYQLHQVASSPKAASCLAIKSGVDMQFYDFDHDVFQK 372

Query: 191 A----VRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLAL 246
           A    V  G L + DV+ A +  + ++  LG+FD  P   P   L  +   +  H  ++L
Sbjct: 373 ALIDCVHEGSLPQADVDRAASAVLRLKFTLGLFD-RPYVDP--TLNAKAYRSKPHLDVSL 429

Query: 247 QAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG-YTTPLQGIS 305
           Q+A + +VLLKN    LP S      +AVIGPN+D  V   G+Y   A G + + LQG+ 
Sbjct: 430 QSARESLVLLKNENGLLPFSK-SIQRIAVIGPNAD--VARYGDYEEEANGLHISILQGVK 486

Query: 306 RYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGR 365
             A+  H      V  +  + I AA   A+ AD  +L +G  + I  E  DR  L LPG 
Sbjct: 487 --AEAPHAQ----VEFDSGKDIAAAVAKAKSADVVILGLGEWRGISGEAFDRTSLDLPGE 540

Query: 366 QQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFG 425
           Q++L+  +  A+  PVVLVL  G P+ + +AK    +GAI+   YPG+ GG AIA+ LFG
Sbjct: 541 QEKLLEAIT-ATNKPVVLVLENGRPLTIGWAKA--HVGAIVEAWYPGEFGGQAIAETLFG 597

Query: 426 RANPG 430
             NP 
Sbjct: 598 DNNPA 602


>gi|373956830|ref|ZP_09616790.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373893430|gb|EHQ29327.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 823

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 140/440 (31%), Positives = 212/440 (48%), Gaps = 32/440 (7%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL 71
           ++P +++ RD RWGR +E  GEDP L  +      +G+Q N     ++AA  KH+  Y  
Sbjct: 230 YAPILDVARDQRWGRLEEVYGEDPYLVARLGVEMAKGMQQNN----QIAATAKHFAVYSA 285

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
           +          + +V+ +++E+    PFK  + E  +  VM SYN  +G P       L 
Sbjct: 286 NKGGREGLARTDPQVAPREVENILLYPFKKVIKEAGLMGVMSSYNDYDGIPISGSSYWLI 345

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG----PFLAIH 187
             +  ++   GY+VSD D++  LYN  H     ++A   A  AG+++         + I+
Sbjct: 346 QRLRQEFGFKGYVVSDSDALEYLYNKHHVAADLKDAVYQAFMAGMNVRTTFRTPDSIIIY 405

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
               V+ G L  + +N  +   + V+ +LG+FD  P  Q       + V   A+Q +ALQ
Sbjct: 406 ARQLVKEGKLPIDTINSRVRDVLRVKFKLGLFD-HPYVQD-AEASAKLVNCAANQAVALQ 463

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNS---DVTVTMIGNYAGVACGYTTPLQGI 304
           A+ + IVLLKN    LPLS  +  T+AVIGPN+   D   T  G  A  +      +Q  
Sbjct: 464 ASKESIVLLKNKGAILPLS--KQQTLAVIGPNALNDDYAHTHYGPLASKSINILEGIQAK 521

Query: 305 SRYAKTIHQAGCFGVACNGNQ--------------LIGAAEVAARQADATVLVMGLDQSI 350
               K ++  GC  V  +  +               I +A   AR AD  V+V+G +   
Sbjct: 522 VGAGKVLYALGCNLVDKHWPESEILPQDPDQAEQAKIDSAVTIARHADVAVVVLGGNTQT 581

Query: 351 EAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGY 410
             E   R  L LPG Q  LV  V KA+  PVV+VL+   P+ +++   D  I  I++ GY
Sbjct: 582 AGENKSRTSLDLPGYQLRLVKAV-KATGKPVVVVLIGSQPMTINWI--DQHIDGIIYAGY 638

Query: 411 PGQAGGAAIADVLFGRANPG 430
           PG  GG A+ADVLFG  NPG
Sbjct: 639 PGTQGGTAVADVLFGDYNPG 658


>gi|420391858|ref|ZP_14891111.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli EPEC
           C342-62]
 gi|391312539|gb|EIQ70147.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli EPEC
           C342-62]
          Length = 765

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 146/457 (31%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISRYA----KTIHQAGCFGVACNG-------------------NQLIGAAEV 332
              T L GI  +     K ++  G    +  G                    ++I  A  
Sbjct: 438 QSVTVLTGIKNFVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|205357644|ref|ZP_02572188.2| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar 4,[5],12:i:- str. CVM23701]
 gi|374981198|ref|ZP_09722528.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Typhimurium str. TN061786]
 gi|383496875|ref|YP_005397564.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Typhimurium str. 798]
 gi|205330573|gb|EDZ17337.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar 4,[5],12:i:- str. CVM23701]
 gi|321224818|gb|EFX49881.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Typhimurium str. TN061786]
 gi|380463696|gb|AFD59099.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhimurium str. 798]
          Length = 755

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 150/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 257 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGN--------------QLIGAAEV 332
              T L GI           YAK  +     G+    N               +I  A  
Sbjct: 428 QSVTVLAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 AAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|416539671|ref|ZP_11750078.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Montevideo str. SARB30]
 gi|416551719|ref|ZP_11756625.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Montevideo str. 29N]
 gi|416566632|ref|ZP_11763924.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Montevideo str. 4441 H]
 gi|417327454|ref|ZP_12112866.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Adelaide str. A4-669]
 gi|418868407|ref|ZP_13422850.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Newport str. CVM 4176]
 gi|353570549|gb|EHC34783.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Adelaide str. A4-669]
 gi|363559751|gb|EHL43903.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Montevideo str. SARB30]
 gi|363566134|gb|EHL50153.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Montevideo str. 29N]
 gi|363579468|gb|EHL63250.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Montevideo str. 4441 H]
 gi|392838313|gb|EJA93877.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Newport str. CVM 4176]
          Length = 765

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 149/458 (32%), Positives = 228/458 (49%), Gaps = 61/458 (13%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 267 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA---- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   PA    
Sbjct: 325 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKE-SDPADTNA 378

Query: 241 ----HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVA 294
               H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA
Sbjct: 379 ESRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVA 436

Query: 295 CGYTTPLQGISR---------YAKTIHQAGCFGVACNGN--------------QLIGAAE 331
               T L GI           YAK  +     G+    N               +I  A 
Sbjct: 437 NQSVTVLAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAV 496

Query: 332 VAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPV 391
            AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P 
Sbjct: 497 QAAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP- 554

Query: 392 DVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 -LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|429116223|ref|ZP_19177141.1| Periplasmic beta-glucosidase [Cronobacter sakazakii 701]
 gi|426319352|emb|CCK03254.1| Periplasmic beta-glucosidase [Cronobacter sakazakii 701]
          Length = 625

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 147/457 (32%), Positives = 223/457 (48%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGRG E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 15  WAPMVDVTRDPRWGRGSEGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYG 74

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+K  +  G    VM + N +NG P  +D
Sbjct: 75  AVEGGKEYNTVD-------MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASD 126

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ + G W   G  +SD  ++  L   +H T + PE+A   AIK+G+D+     + 
Sbjct: 127 GWLLKDLLRGDWGFKGITISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYY 184

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD-------VC 237
           + +    ++ G +   +++ A  + + V+  +G+F+      P+ +LGP+D         
Sbjct: 185 SKYLPNLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAE 239

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
           +  H+  A + A + +VLLKN   TLPL   +  T+AV+GP +D    M+G++  AGVA 
Sbjct: 240 SRLHRDDARKVARESLVLLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVAD 297

Query: 296 GYTTPLQGISRY----AKTIHQAGCFGVACNG-------------------NQLIGAAEV 332
              T LQG+       AK ++  G       G                     +I  A  
Sbjct: 298 QSVTLLQGMKNVAGDKAKILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKAMIDEAVN 357

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  L +P  Q++L+S + KA+  P+VLVLM G P  
Sbjct: 358 VAKQSDVVVAVVGEAQGMAHEASSRTDLTIPQSQRDLISAL-KATGKPLVLVLMNGRP-- 414

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 415 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 451


>gi|387789382|ref|YP_006254447.1| beta-glucosidase-like glycosyl hydrolase [Solitalea canadensis DSM
           3403]
 gi|379652215|gb|AFD05271.1| beta-glucosidase-like glycosyl hydrolase [Solitalea canadensis DSM
           3403]
          Length = 771

 Score =  196 bits (499), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 145/460 (31%), Positives = 229/460 (49%), Gaps = 59/460 (12%)

Query: 8   GLTY-WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT--GSRLKVAACCK 64
           GL + +SP V++ RDPRWGR  E  GEDP L    A + V+G QG+    S   + AC K
Sbjct: 153 GLNWVFSPMVDVARDPRWGRIAEGSGEDPYLGSLIAKAMVKGYQGDNTYSSATNLMACVK 212

Query: 65  HYTAYDL----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           H+  Y       ++N VD       +S+Q + + Y  P+KA  VE  V SVM S+N+V G
Sbjct: 213 HFALYGAAEAGRDYNSVD-------MSRQKMYEFYLPPYKAA-VEAGVGSVMSSFNEVEG 264

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD- 179
            P   +  +L + +  QW  +G +VSD  SV  +   +H     +E +A AIKAGLD+D 
Sbjct: 265 VPATGNQWLLTDLLRKQWGFNGMVVSDYTSVNEM--MEHGMGNLQEVSALAIKAGLDMDM 322

Query: 180 CGPFLAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPR----D 235
            G       + +++ G + E D+NLA    +  + +LG+F     + P+  +  +    +
Sbjct: 323 VGEGYLSTLQKSLQEGKVSETDINLACRRILEAKYKLGLF-----SDPYKFINEKRAATE 377

Query: 236 VCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC 295
           + T      + +AA +  VLLKN  + LPL   +  T+A+IGP +D    M+G +A V+ 
Sbjct: 378 ILTTQSLSFSREAATRSFVLLKNEKQVLPLK--KTGTIALIGPLADSKRNMLGTWA-VSG 434

Query: 296 GYTTP-------LQGISRYAKTIHQAGC--------------FGVACN-----GNQLIGA 329
            + T        +  +  +AK ++  G               FGV  +       +L+  
Sbjct: 435 NWKTSVSVKEGLMNAVGTHAKVLYAKGANISDDSAFARRVNTFGVEIDIDKRSSKELLDE 494

Query: 330 AEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGG 389
           A   A+Q+D  ++ +G    +  E   R  + +P  Q+EL+  + +  + PVV+VL  G 
Sbjct: 495 ALSIAQQSDVIIVAVGEAADMSGEAASRTDINIPESQKELLKALVQTGK-PVVMVLFNGR 553

Query: 390 PVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           P+ +S+   +  + AIL V  PG   G AIADVLFG  NP
Sbjct: 554 PLTLSW--ENEHLNAILDVWAPGHQAGNAIADVLFGDYNP 591


>gi|423121272|ref|ZP_17108956.1| periplasmic beta-glucosidase [Klebsiella oxytoca 10-5246]
 gi|376394107|gb|EHT06758.1| periplasmic beta-glucosidase [Klebsiella oxytoca 10-5246]
          Length = 765

 Score =  196 bits (499), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 150/456 (32%), Positives = 221/456 (48%), Gaps = 57/456 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT +   + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTTEMGRAMVESMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG-PFLA 185
             +LK  +   W   G  VSD  ++  L         PE+A   A+K+G+++     + +
Sbjct: 267 AWLLKEVLRDDWGFKGITVSDHGAIKELIK-HGVAADPEDAVRVALKSGINMSMSDEYYS 325

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPR-------DVCT 238
            +  G V+ G +   +++ A  + + V+  +G+F+      P+ +LGP+       +  +
Sbjct: 326 KYLPGLVKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPQGSDPQDTNAES 380

Query: 239 PAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVACG 296
             H+Q A + A Q +VLLKN   TLPL   +  TVAVIGP +D    M+G++  AGVA  
Sbjct: 381 RLHRQEAREVARQSLVLLKNRLETLPLK--KSGTVAVIGPLADSKRDMMGSWSAAGVADQ 438

Query: 297 YTTPLQGISR----YAKTIHQAGCFGVACNG-------------------NQLIGAAEVA 333
             T L GI        K I+  G       G                     +I  A  A
Sbjct: 439 SVTVLTGIKEALGDKGKVIYARGANVTNDKGIVDFLNLYEKAVQVDPRSPQAMIDEAVAA 498

Query: 334 ARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDV 393
           A+Q+D  V V+G  Q +  E   R  + LP  Q++L+S + KA+  P+VLVLM G P  +
Sbjct: 499 AKQSDVVVAVVGEAQGMAHEASSRTDITLPQSQRDLISAL-KATGKPLVLVLMNGRP--L 555

Query: 394 SFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +  K D +  AIL   + G  GG AIADVLFG ANP
Sbjct: 556 ALVKEDQQADAILETWFAGTEGGHAIADVLFGDANP 591


>gi|432968243|ref|ZP_20157158.1| periplasmic beta-glucosidase [Escherichia coli KTE203]
 gi|431471360|gb|ELH51253.1| periplasmic beta-glucosidase [Escherichia coli KTE203]
          Length = 755

 Score =  196 bits (499), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNSVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMINEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|417366389|ref|ZP_12138693.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Hvittingfoss str. A4-620]
 gi|353592073|gb|EHC50188.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Hvittingfoss str. A4-620]
          Length = 755

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 150/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 257 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGN--------------QLIGAAEV 332
              T L GI           YAK  +     G+    N               +I  A  
Sbjct: 428 QSVTVLAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 AAKQADVVVAVVGESQGMAHEASSRTDITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|393782347|ref|ZP_10370532.1| hypothetical protein HMPREF1071_01400 [Bacteroides salyersiae
           CL02T12C01]
 gi|392673618|gb|EIY67077.1| hypothetical protein HMPREF1071_01400 [Bacteroides salyersiae
           CL02T12C01]
          Length = 862

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 120/314 (38%), Positives = 174/314 (55%), Gaps = 12/314 (3%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAACCKHY 66
           GLT+W+PNVNIFRDPRWGRGQET GEDP LT +   + V GLQG   ++  K+ AC KHY
Sbjct: 130 GLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVAVVNGLQGPADAKYDKLHACAKHY 189

Query: 67  TAYDLDNWNGVDRYHFNAR-VSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             +         R++FNA  +S +DL +TY   FK  V E  V  VMC+YN+  G P C 
Sbjct: 190 AVHSGPE---AKRHYFNAENISPRDLWETYLPAFKDLVQEADVKEVMCAYNRFEGDPCCG 246

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVG-VLYNTQHYT-RTPEEAAADAIKAGLDLDCGPF 183
              +L   +  +W     +VSDC ++    Y  +H T +   +A+A A+ +G DL+CG  
Sbjct: 247 SNRLLTQILRDEWGYKHMVVSDCGAISDFFYKDRHATHKDAADASAAAVLSGTDLECGIE 306

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A H E AV+ GL+ EE +N +L   +  +  LG  D + +  P+  +    V    H+Q
Sbjct: 307 YA-HLEEAVKKGLISEERINTSLRRLLKARFELGEMDDD-ALVPWSKISIDTVDCETHKQ 364

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           +AL    + +VLL N+   LPL+      +AV+GPN+  +V   GNY G     +T L+G
Sbjct: 365 MALDVTRKSMVLLHNNG-VLPLAKTGTR-IAVMGPNAVDSVMQWGNYKGTPSHTSTILEG 422

Query: 304 I-SRYAKTIHQAGC 316
           I ++     ++ GC
Sbjct: 423 IRNKIGNVPYEKGC 436



 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 16/116 (13%)

Query: 327 IGAAEVAARQADATVLVM--GLDQSIEAE---FI--------DRAGLLLPGRQQELVSRV 373
           I    VA R +DA V++   G+   +E E   F+        DR  + LP  Q+ ++  +
Sbjct: 586 IAPEAVAGRVSDADVVIFVGGISPDLEGEDKYFVNCPGFSGGDRTTIELPEVQRNILKAL 645

Query: 374 AKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            +A  G  V+ + C G   V+         AIL   YPGQAGG A+ADVLFG  NP
Sbjct: 646 KQA--GKKVIFVNCSGSA-VALVPETKSCDAILQAWYPGQAGGTAVADVLFGDYNP 698


>gi|419932770|ref|ZP_14450057.1| beta-D-glucoside glucohydrolase, periplasmic, partial [Escherichia
           coli 576-1]
 gi|388415794|gb|EIL75707.1| beta-D-glucoside glucohydrolase, periplasmic, partial [Escherichia
           coli 576-1]
          Length = 626

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 16  WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 75

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 76  AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 127

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 128 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 185

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 186 SKYLPGLIKSGKVTMEELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 240

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 241 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 298

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 299 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 358

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 359 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 415

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 416 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 452


>gi|432617195|ref|ZP_19853310.1| periplasmic beta-glucosidase [Escherichia coli KTE75]
 gi|431154136|gb|ELE54958.1| periplasmic beta-glucosidase [Escherichia coli KTE75]
          Length = 765

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATAKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|331647783|ref|ZP_08348875.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
           (Beta-D-glucoside glucohydrolase) [Escherichia coli
           M605]
 gi|331043507|gb|EGI15645.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
           (Beta-D-glucoside glucohydrolase) [Escherichia coli
           M605]
          Length = 765

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 150/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG N   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKNPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|332671963|ref|YP_004454971.1| glycoside hydrolase family 3 domain-containing protein
           [Cellulomonas fimi ATCC 484]
 gi|332341001|gb|AEE47584.1| glycoside hydrolase family 3 domain protein [Cellulomonas fimi ATCC
           484]
          Length = 771

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 161/453 (35%), Positives = 228/453 (50%), Gaps = 50/453 (11%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAACCKHYTAYD 70
           ++P V+I RDPRWGR  E  GEDP L    AA+ VRG QG   G+  +V A  KH+  Y 
Sbjct: 166 FAPMVDIARDPRWGRIVEGAGEDPYLGSAVAAAQVRGFQGGRIGTPERVIAGPKHFAGYG 225

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                 +++ VD       +S Q L + Y  PFKA V  G   ++M +Y  +NG P  A+
Sbjct: 226 AAIGGRDYDEVD-------LSDQQLRNVYLPPFKAAVDAG-AGNIMTAYMPLNGVPATAN 277

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG---PF 183
             +LK+ +  +W  DG++VSD ++V  L  T  Y     +AA  A++ GLDL+     P 
Sbjct: 278 AWLLKDVLRDEWGFDGFLVSDANAVRNLV-THGYAADLPDAAVRAVEVGLDLEMAISDPA 336

Query: 184 LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQ 243
            A H   AV  G +  E V+  +   + V++RLG+F+ EP      +     +  PAH+Q
Sbjct: 337 YA-HLPEAVESGAVSIEAVDACVRRILEVKVRLGLFE-EPYVDE--DRAAAVLADPAHRQ 392

Query: 244 LALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA--GVACGYTTPL 301
           +A +AA +  VLL+N  R LPL      +VAVIGP +D     +G +          T L
Sbjct: 393 VAREAAQRSAVLLRNDGRLLPLDPDALGSVAVIGPLADSKRDTLGPWVFDDDLDETVTIL 452

Query: 302 QGIS---------RYAKTIHQA-----GCFGV-----ACNGNQLIGAAEVA-----ARQA 337
            G+          RYA  +  A       F +       +       AE+A     AR +
Sbjct: 453 DGLRARLGDTTDVRYAPGVRPAQRTFPSMFEMFPGNATPDPEDFDDEAELARAVELARTS 512

Query: 338 DATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAK 397
           D  VLV+G  Q +  E   R+ L LPGRQ EL+  VA A+  PVVL++M G P+D+ +A 
Sbjct: 513 DVAVLVVGEWQGMIGEAASRSSLELPGRQLELLQAVA-ATGTPVVLLVMNGRPLDLRWAA 571

Query: 398 NDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
               + AIL V YPG  GG A+AD+L G   PG
Sbjct: 572 Q--HVPAILDVWYPGTQGGTAVADLLVGDVAPG 602


>gi|16765496|ref|NP_461111.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. LT2]
 gi|378445601|ref|YP_005233233.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Typhimurium str. D23580]
 gi|378450850|ref|YP_005238209.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. 14028S]
 gi|378700080|ref|YP_005182037.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Typhimurium str. SL1344]
 gi|378989553|ref|YP_005252717.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. UK-1]
 gi|379701394|ref|YP_005243122.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. ST4/74]
 gi|422026439|ref|ZP_16372832.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm1]
 gi|422031463|ref|ZP_16377632.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm2]
 gi|427551058|ref|ZP_18928136.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm8]
 gi|427567333|ref|ZP_18932851.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm9]
 gi|427587543|ref|ZP_18937641.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm3]
 gi|427611175|ref|ZP_18942507.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm4]
 gi|427634848|ref|ZP_18947401.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm6]
 gi|427656473|ref|ZP_18952166.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm10]
 gi|427661625|ref|ZP_18957079.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm11]
 gi|427670100|ref|ZP_18961878.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm12]
 gi|20141276|sp|Q56078.2|BGLX_SALTY RecName: Full=Periplasmic beta-glucosidase; AltName:
           Full=Beta-D-glucoside glucohydrolase; AltName:
           Full=Cellobiase; AltName: Full=Gentiobiase; AltName:
           Full=T-cell inhibitor; Flags: Precursor
 gi|16420702|gb|AAL21070.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. LT2]
 gi|261247380|emb|CBG25205.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhimurium str. D23580]
 gi|267994228|gb|ACY89113.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. 14028S]
 gi|301158728|emb|CBW18240.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhimurium str. SL1344]
 gi|323130493|gb|ADX17923.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. ST4/74]
 gi|332989100|gb|AEF08083.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. UK-1]
 gi|414017758|gb|EKT01456.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm1]
 gi|414018535|gb|EKT02182.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm8]
 gi|414020221|gb|EKT03810.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm2]
 gi|414032204|gb|EKT15215.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm9]
 gi|414033625|gb|EKT16574.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm3]
 gi|414036785|gb|EKT19597.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm4]
 gi|414046773|gb|EKT29088.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm10]
 gi|414047776|gb|EKT30042.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm6]
 gi|414052520|gb|EKT34556.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm11]
 gi|414059327|gb|EKT40912.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm12]
          Length = 765

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 150/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 267 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGN--------------QLIGAAEV 332
              T L GI           YAK  +     G+    N               +I  A  
Sbjct: 438 QSVTVLAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 AAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|417419081|ref|ZP_12159904.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Mississippi str. A4-633]
 gi|353619740|gb|EHC70042.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Mississippi str. A4-633]
          Length = 769

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 150/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 257 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGN--------------QLIGAAEV 332
              T L GI           YAK  +     G+    N               +I  A  
Sbjct: 428 QSVTVLAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 AAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|417608761|ref|ZP_12259264.1| periplasmic beta-glucosidase [Escherichia coli STEC_DG131-3]
 gi|419804245|ref|ZP_14329406.1| periplasmic beta-glucosidase [Escherichia coli AI27]
 gi|423706180|ref|ZP_17680563.1| periplasmic beta-glucosidase [Escherichia coli B799]
 gi|425115524|ref|ZP_18517328.1| glycosyl hydrolase family 3 protein [Escherichia coli 8.0566]
 gi|432564386|ref|ZP_19800969.1| periplasmic beta-glucosidase [Escherichia coli KTE51]
 gi|432602694|ref|ZP_19838938.1| periplasmic beta-glucosidase [Escherichia coli KTE66]
 gi|432793342|ref|ZP_20027426.1| periplasmic beta-glucosidase [Escherichia coli KTE78]
 gi|432799300|ref|ZP_20033322.1| periplasmic beta-glucosidase [Escherichia coli KTE79]
 gi|432809786|ref|ZP_20043679.1| periplasmic beta-glucosidase [Escherichia coli KTE101]
 gi|432835123|ref|ZP_20068662.1| periplasmic beta-glucosidase [Escherichia coli KTE136]
 gi|432851424|ref|ZP_20081809.1| periplasmic beta-glucosidase [Escherichia coli KTE144]
 gi|345357970|gb|EGW90158.1| periplasmic beta-glucosidase [Escherichia coli STEC_DG131-3]
 gi|384472794|gb|EIE56844.1| periplasmic beta-glucosidase [Escherichia coli AI27]
 gi|385712064|gb|EIG49019.1| periplasmic beta-glucosidase [Escherichia coli B799]
 gi|408568627|gb|EKK44654.1| glycosyl hydrolase family 3 protein [Escherichia coli 8.0566]
 gi|431093314|gb|ELD98980.1| periplasmic beta-glucosidase [Escherichia coli KTE51]
 gi|431141268|gb|ELE43033.1| periplasmic beta-glucosidase [Escherichia coli KTE66]
 gi|431339005|gb|ELG26067.1| periplasmic beta-glucosidase [Escherichia coli KTE78]
 gi|431343166|gb|ELG30130.1| periplasmic beta-glucosidase [Escherichia coli KTE79]
 gi|431362554|gb|ELG49132.1| periplasmic beta-glucosidase [Escherichia coli KTE101]
 gi|431385483|gb|ELG69470.1| periplasmic beta-glucosidase [Escherichia coli KTE136]
 gi|431399762|gb|ELG83152.1| periplasmic beta-glucosidase [Escherichia coli KTE144]
          Length = 755

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|432948014|ref|ZP_20143170.1| periplasmic beta-glucosidase [Escherichia coli KTE196]
 gi|433043716|ref|ZP_20231212.1| periplasmic beta-glucosidase [Escherichia coli KTE117]
 gi|431457992|gb|ELH38329.1| periplasmic beta-glucosidase [Escherichia coli KTE196]
 gi|431556037|gb|ELI29872.1| periplasmic beta-glucosidase [Escherichia coli KTE117]
          Length = 755

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNSVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|432354073|ref|ZP_19597346.1| periplasmic beta-glucosidase [Escherichia coli KTE2]
 gi|432402425|ref|ZP_19645177.1| periplasmic beta-glucosidase [Escherichia coli KTE26]
 gi|432426689|ref|ZP_19669190.1| periplasmic beta-glucosidase [Escherichia coli KTE181]
 gi|432461155|ref|ZP_19703304.1| periplasmic beta-glucosidase [Escherichia coli KTE204]
 gi|432476362|ref|ZP_19718361.1| periplasmic beta-glucosidase [Escherichia coli KTE208]
 gi|432489810|ref|ZP_19731684.1| periplasmic beta-glucosidase [Escherichia coli KTE213]
 gi|432518204|ref|ZP_19755392.1| periplasmic beta-glucosidase [Escherichia coli KTE228]
 gi|432538325|ref|ZP_19775227.1| periplasmic beta-glucosidase [Escherichia coli KTE235]
 gi|432631920|ref|ZP_19867846.1| periplasmic beta-glucosidase [Escherichia coli KTE80]
 gi|432641634|ref|ZP_19877468.1| periplasmic beta-glucosidase [Escherichia coli KTE83]
 gi|432666529|ref|ZP_19902110.1| periplasmic beta-glucosidase [Escherichia coli KTE116]
 gi|432771096|ref|ZP_20005435.1| periplasmic beta-glucosidase [Escherichia coli KTE50]
 gi|432775218|ref|ZP_20009497.1| periplasmic beta-glucosidase [Escherichia coli KTE54]
 gi|432839818|ref|ZP_20073304.1| periplasmic beta-glucosidase [Escherichia coli KTE140]
 gi|432887115|ref|ZP_20101189.1| periplasmic beta-glucosidase [Escherichia coli KTE158]
 gi|432913313|ref|ZP_20119010.1| periplasmic beta-glucosidase [Escherichia coli KTE190]
 gi|433019202|ref|ZP_20207423.1| periplasmic beta-glucosidase [Escherichia coli KTE105]
 gi|433053741|ref|ZP_20240922.1| periplasmic beta-glucosidase [Escherichia coli KTE122]
 gi|433068447|ref|ZP_20255237.1| periplasmic beta-glucosidase [Escherichia coli KTE128]
 gi|433159175|ref|ZP_20344014.1| periplasmic beta-glucosidase [Escherichia coli KTE177]
 gi|433178989|ref|ZP_20363389.1| periplasmic beta-glucosidase [Escherichia coli KTE82]
 gi|433203751|ref|ZP_20387527.1| periplasmic beta-glucosidase [Escherichia coli KTE95]
 gi|430875246|gb|ELB98788.1| periplasmic beta-glucosidase [Escherichia coli KTE2]
 gi|430924896|gb|ELC45569.1| periplasmic beta-glucosidase [Escherichia coli KTE26]
 gi|430955621|gb|ELC74403.1| periplasmic beta-glucosidase [Escherichia coli KTE181]
 gi|430988845|gb|ELD05314.1| periplasmic beta-glucosidase [Escherichia coli KTE204]
 gi|431005530|gb|ELD20551.1| periplasmic beta-glucosidase [Escherichia coli KTE208]
 gi|431020427|gb|ELD33772.1| periplasmic beta-glucosidase [Escherichia coli KTE213]
 gi|431050826|gb|ELD60502.1| periplasmic beta-glucosidase [Escherichia coli KTE228]
 gi|431069214|gb|ELD77543.1| periplasmic beta-glucosidase [Escherichia coli KTE235]
 gi|431170120|gb|ELE70314.1| periplasmic beta-glucosidase [Escherichia coli KTE80]
 gi|431181517|gb|ELE81379.1| periplasmic beta-glucosidase [Escherichia coli KTE83]
 gi|431200823|gb|ELE99541.1| periplasmic beta-glucosidase [Escherichia coli KTE116]
 gi|431314793|gb|ELG02725.1| periplasmic beta-glucosidase [Escherichia coli KTE50]
 gi|431318257|gb|ELG06023.1| periplasmic beta-glucosidase [Escherichia coli KTE54]
 gi|431388991|gb|ELG72706.1| periplasmic beta-glucosidase [Escherichia coli KTE140]
 gi|431416113|gb|ELG98600.1| periplasmic beta-glucosidase [Escherichia coli KTE158]
 gi|431439613|gb|ELH20946.1| periplasmic beta-glucosidase [Escherichia coli KTE190]
 gi|431531103|gb|ELI07772.1| periplasmic beta-glucosidase [Escherichia coli KTE105]
 gi|431570330|gb|ELI43246.1| periplasmic beta-glucosidase [Escherichia coli KTE122]
 gi|431584074|gb|ELI56061.1| periplasmic beta-glucosidase [Escherichia coli KTE128]
 gi|431678146|gb|ELJ44155.1| periplasmic beta-glucosidase [Escherichia coli KTE177]
 gi|431701354|gb|ELJ66273.1| periplasmic beta-glucosidase [Escherichia coli KTE82]
 gi|431721300|gb|ELJ85295.1| periplasmic beta-glucosidase [Escherichia coli KTE95]
          Length = 755

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|420363812|ref|ZP_14864696.1| beta-D-glucoside glucohydrolase, periplasmic [Shigella sonnei
           4822-66]
 gi|391294056|gb|EIQ52309.1| beta-D-glucoside glucohydrolase, periplasmic [Shigella sonnei
           4822-66]
          Length = 755

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|417540360|ref|ZP_12192410.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Wandsworth str. A4-580]
 gi|353662824|gb|EHD01701.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Wandsworth str. A4-580]
          Length = 612

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 150/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 257 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGN--------------QLIGAAEV 332
              T L GI           YAK  +     G+    N               +I  A  
Sbjct: 428 QSVTVLAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 AAKQADVVVAVVGESQGMAHEASSRTDITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|429084349|ref|ZP_19147356.1| Periplasmic beta-glucosidase [Cronobacter condimenti 1330]
 gi|426546667|emb|CCJ73397.1| Periplasmic beta-glucosidase [Cronobacter condimenti 1330]
          Length = 759

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 146/457 (31%), Positives = 225/457 (49%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGRG E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 149 WAPMVDVTRDPRWGRGSEGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYG 208

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+K  +  G    VM + N +NG P  +D
Sbjct: 209 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASD 260

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ + G+W   G  +SD  ++  L   +H T + PE+A   AIK+G+D+     + 
Sbjct: 261 SWLLKDLLRGEWGFKGITISDHGAIKEL--IKHGTASGPEDAVRVAIKSGVDMSMADEYY 318

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD-------VC 237
           + +    ++ G +   +++ A  + + V+  +G+F+      P+ +LGP+D         
Sbjct: 319 SKYLPDLIKNGSVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPQDTNAE 373

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
           +  H+  A + A + +VLLKN   TLPL   +  T+AV+GP +D    M+G++  AGVA 
Sbjct: 374 SRLHRDDARKVARESLVLLKNRLDTLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVAD 431

Query: 296 GYTTPLQGISRY----AKTIHQAGCFGVACNG-------------------NQLIGAAEV 332
              T LQG+       AK ++  G       G                    ++I  A  
Sbjct: 432 QSITLLQGMKNVAGDKAKILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKEMIDEAVN 491

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  L +P  Q++L+S + KA+  P+VLVLM G P  
Sbjct: 492 VAKQSDVVVAVVGEAQGMAHEASSRTDLTIPQSQRDLISAL-KATGKPLVLVLMNGRP-- 548

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K + +  AIL   + G  GG AIADVLFG  NP
Sbjct: 549 LALVKENQQADAILETWFAGTEGGNAIADVLFGDYNP 585


>gi|417863541|ref|ZP_12508589.1| hypothetical protein C22711_0474 [Escherichia coli O104:H4 str.
           C227-11]
 gi|341916830|gb|EGT66447.1| hypothetical protein C22711_0474 [Escherichia coli O104:H4 str.
           C227-11]
          Length = 675

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNSVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|419930836|ref|ZP_14448429.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           541-1]
 gi|388399450|gb|EIL60247.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           541-1]
          Length = 765

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|1483154|dbj|BAA13102.1| T-cell inhibitor(STI) [Salmonella enterica subsp. enterica serovar
           Typhimurium]
          Length = 765

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 150/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 267 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGN--------------QLIGAAEV 332
              T L GI           YAK  +     G+    N               +I  A  
Sbjct: 438 QSVTVLAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 AAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|420336777|ref|ZP_14838350.1| periplasmic beta-glucosidase [Shigella flexneri K-315]
 gi|391261595|gb|EIQ20641.1| periplasmic beta-glucosidase [Shigella flexneri K-315]
          Length = 755

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 150/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEVMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PEEA   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEEAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|389840350|ref|YP_006342434.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii ES15]
 gi|417792062|ref|ZP_12439471.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii E899]
 gi|429121663|ref|ZP_19182278.1| Periplasmic beta-glucosidase [Cronobacter sakazakii 680]
 gi|333953864|gb|EGL71757.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii E899]
 gi|387850826|gb|AFJ98923.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii ES15]
 gi|426323856|emb|CCK13015.1| Periplasmic beta-glucosidase [Cronobacter sakazakii 680]
          Length = 765

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 147/457 (32%), Positives = 223/457 (48%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGRG E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVTRDPRWGRGSEGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+K  +  G    VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ + G W   G  +SD  ++  L   +H T + PE+A   AIK+G+D+     + 
Sbjct: 267 GWLLKDLLRGDWGFKGITISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD-------VC 237
           + +    ++ G +   +++ A  + + V+  +G+F+      P+ +LGP+D         
Sbjct: 325 SKYLPNLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAE 379

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
           +  H+  A + A + +VLLKN   TLPL   +  T+AV+GP +D    M+G++  AGVA 
Sbjct: 380 SRLHRDDARKVARESLVLLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVAD 437

Query: 296 GYTTPLQGISRY----AKTIHQAGCFGVACNG-------------------NQLIGAAEV 332
              T LQG+       AK ++  G       G                     +I  A  
Sbjct: 438 QSVTLLQGMKNVAGDKAKILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKAMIDEAVN 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  L +P  Q++L+S + KA+  P+VLVLM G P  
Sbjct: 498 VAKQSDVVVAVVGEAQGMAHEASSRTDLTIPQSQRDLISAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|417688707|ref|ZP_12337948.1| periplasmic beta-glucosidase [Shigella boydii 5216-82]
 gi|332093254|gb|EGI98314.1| periplasmic beta-glucosidase [Shigella boydii 5216-82]
          Length = 755

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|433092524|ref|ZP_20278792.1| periplasmic beta-glucosidase [Escherichia coli KTE138]
 gi|431610126|gb|ELI79428.1| periplasmic beta-glucosidase [Escherichia coli KTE138]
          Length = 755

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNSVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|293446483|ref|ZP_06662905.1| beta-glucosidase [Escherichia coli B088]
 gi|307311288|ref|ZP_07590932.1| glycoside hydrolase family 3 domain protein [Escherichia coli W]
 gi|418041981|ref|ZP_12680191.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli W26]
 gi|419255491|ref|ZP_13798010.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC10A]
 gi|419397267|ref|ZP_13938035.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC15B]
 gi|422956501|ref|ZP_16968975.1| periplasmic beta-glucosidase [Escherichia coli H494]
 gi|422988259|ref|ZP_16979032.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           C227-11]
 gi|422995150|ref|ZP_16985914.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           C236-11]
 gi|423000225|ref|ZP_16990979.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           09-7901]
 gi|423003894|ref|ZP_16994640.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           04-8351]
 gi|423010467|ref|ZP_17001201.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-3677]
 gi|423019694|ref|ZP_17010403.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-4404]
 gi|423024861|ref|ZP_17015558.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-4522]
 gi|423030682|ref|ZP_17021370.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-4623]
 gi|423038508|ref|ZP_17029182.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str. 11-4632
           C1]
 gi|423043627|ref|ZP_17034294.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str. 11-4632
           C2]
 gi|423045356|ref|ZP_17036016.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str. 11-4632
           C3]
 gi|423053895|ref|ZP_17042702.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str. 11-4632
           C4]
 gi|423060870|ref|ZP_17049666.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str. 11-4632
           C5]
 gi|425380164|ref|ZP_18764204.1| glycosyl hydrolase family 3 protein [Escherichia coli EC1865]
 gi|429719739|ref|ZP_19254670.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec11-9450]
 gi|429771621|ref|ZP_19303644.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-02030]
 gi|429781553|ref|ZP_19313482.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-02033-1]
 gi|429785293|ref|ZP_19317191.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-02092]
 gi|429791182|ref|ZP_19323039.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-02093]
 gi|429797009|ref|ZP_19328817.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-02281]
 gi|429798607|ref|ZP_19330408.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-02318]
 gi|429807120|ref|ZP_19338847.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-02913]
 gi|429812020|ref|ZP_19343706.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-03439]
 gi|429817540|ref|ZP_19349181.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-04080]
 gi|429822751|ref|ZP_19354349.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-03943]
 gi|429904131|ref|ZP_19370110.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec11-9990]
 gi|429908268|ref|ZP_19374232.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec11-9941]
 gi|429914139|ref|ZP_19380087.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec11-4984]
 gi|429919170|ref|ZP_19385102.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec11-5604]
 gi|429924989|ref|ZP_19390903.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec11-4986]
 gi|429928926|ref|ZP_19394828.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec11-4987]
 gi|429935465|ref|ZP_19401351.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec11-4988]
 gi|429941145|ref|ZP_19407019.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec11-5603]
 gi|429943825|ref|ZP_19409688.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec11-6006]
 gi|429951425|ref|ZP_19417271.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec12-0465]
 gi|429954737|ref|ZP_19420569.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec12-0466]
 gi|432481506|ref|ZP_19723463.1| periplasmic beta-glucosidase [Escherichia coli KTE210]
 gi|432675252|ref|ZP_19910712.1| periplasmic beta-glucosidase [Escherichia coli KTE142]
 gi|432750619|ref|ZP_19985223.1| periplasmic beta-glucosidase [Escherichia coli KTE29]
 gi|432765530|ref|ZP_19999968.1| periplasmic beta-glucosidase [Escherichia coli KTE48]
 gi|432806294|ref|ZP_20040222.1| periplasmic beta-glucosidase [Escherichia coli KTE91]
 gi|432832164|ref|ZP_20065738.1| periplasmic beta-glucosidase [Escherichia coli KTE135]
 gi|432934928|ref|ZP_20134365.1| periplasmic beta-glucosidase [Escherichia coli KTE184]
 gi|433130704|ref|ZP_20316141.1| periplasmic beta-glucosidase [Escherichia coli KTE163]
 gi|433135366|ref|ZP_20320712.1| periplasmic beta-glucosidase [Escherichia coli KTE166]
 gi|433194215|ref|ZP_20378205.1| periplasmic beta-glucosidase [Escherichia coli KTE90]
 gi|291323313|gb|EFE62741.1| beta-glucosidase [Escherichia coli B088]
 gi|306908794|gb|EFN39291.1| glycoside hydrolase family 3 domain protein [Escherichia coli W]
 gi|354861985|gb|EHF22423.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           C236-11]
 gi|354867270|gb|EHF27692.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           C227-11]
 gi|354869341|gb|EHF29751.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           04-8351]
 gi|354873196|gb|EHF33573.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           09-7901]
 gi|354879950|gb|EHF40286.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-3677]
 gi|354889374|gb|EHF49623.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-4404]
 gi|354892969|gb|EHF53173.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-4522]
 gi|354895106|gb|EHF55295.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str. 11-4632
           C1]
 gi|354897380|gb|EHF57538.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-4623]
 gi|354898741|gb|EHF58892.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str. 11-4632
           C2]
 gi|354912792|gb|EHF72790.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str. 11-4632
           C5]
 gi|354915797|gb|EHF75773.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str. 11-4632
           C3]
 gi|354917712|gb|EHF77674.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str. 11-4632
           C4]
 gi|371599711|gb|EHN88492.1| periplasmic beta-glucosidase [Escherichia coli H494]
 gi|378100477|gb|EHW62173.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC10A]
 gi|378243388|gb|EHY03334.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC15B]
 gi|383475090|gb|EID67059.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli W26]
 gi|408296531|gb|EKJ14757.1| glycosyl hydrolase family 3 protein [Escherichia coli EC1865]
 gi|429346130|gb|EKY82911.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-02033-1]
 gi|429348882|gb|EKY85638.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-02092]
 gi|429361104|gb|EKY97761.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-02030]
 gi|429362535|gb|EKY99182.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-02093]
 gi|429362806|gb|EKY99451.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-02281]
 gi|429365924|gb|EKZ02536.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-02318]
 gi|429376779|gb|EKZ13307.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-02913]
 gi|429379253|gb|EKZ15754.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-03439]
 gi|429380821|gb|EKZ17310.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-03943]
 gi|429393042|gb|EKZ29441.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           11-04080]
 gi|429405143|gb|EKZ41409.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec11-4984]
 gi|429406909|gb|EKZ43163.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec11-9990]
 gi|429410657|gb|EKZ46878.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec11-9450]
 gi|429414369|gb|EKZ50544.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec11-4987]
 gi|429420978|gb|EKZ57100.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec11-4986]
 gi|429429312|gb|EKZ65381.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec11-5604]
 gi|429431999|gb|EKZ68039.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec11-4988]
 gi|429436205|gb|EKZ72221.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec11-5603]
 gi|429438410|gb|EKZ74403.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec11-6006]
 gi|429448032|gb|EKZ83949.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec12-0465]
 gi|429451750|gb|EKZ87638.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec11-9941]
 gi|429457694|gb|EKZ93532.1| periplasmic beta-glucosidase [Escherichia coli O104:H4 str.
           Ec12-0466]
 gi|431006878|gb|ELD21847.1| periplasmic beta-glucosidase [Escherichia coli KTE210]
 gi|431214228|gb|ELF12053.1| periplasmic beta-glucosidase [Escherichia coli KTE142]
 gi|431296601|gb|ELF86312.1| periplasmic beta-glucosidase [Escherichia coli KTE29]
 gi|431309705|gb|ELF97898.1| periplasmic beta-glucosidase [Escherichia coli KTE48]
 gi|431354436|gb|ELG41162.1| periplasmic beta-glucosidase [Escherichia coli KTE91]
 gi|431376134|gb|ELG61457.1| periplasmic beta-glucosidase [Escherichia coli KTE135]
 gi|431453096|gb|ELH33506.1| periplasmic beta-glucosidase [Escherichia coli KTE184]
 gi|431646055|gb|ELJ13591.1| periplasmic beta-glucosidase [Escherichia coli KTE163]
 gi|431656516|gb|ELJ23499.1| periplasmic beta-glucosidase [Escherichia coli KTE166]
 gi|431715665|gb|ELJ79810.1| periplasmic beta-glucosidase [Escherichia coli KTE90]
          Length = 755

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNSVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|74312656|ref|YP_311075.1| beta-D-glucoside glucohydrolase, periplasmic [Shigella sonnei
           Ss046]
 gi|73856133|gb|AAZ88840.1| beta-D-glucoside glucohydrolase, periplasmic [Shigella sonnei
           Ss046]
          Length = 765

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|218705661|ref|YP_002413180.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           UMN026]
 gi|293405601|ref|ZP_06649593.1| periplasmic beta-glucosidase [Escherichia coli FVEC1412]
 gi|298381283|ref|ZP_06990882.1| periplasmic beta-glucosidase [Escherichia coli FVEC1302]
 gi|387607818|ref|YP_006096674.1| beta-glucosidase [Escherichia coli 042]
 gi|417587127|ref|ZP_12237898.1| periplasmic beta-glucosidase [Escherichia coli STEC_C165-02]
 gi|218432758|emb|CAR13652.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           UMN026]
 gi|284922118|emb|CBG35199.1| periplasmic beta-glucosidase [Escherichia coli 042]
 gi|291427809|gb|EFF00836.1| periplasmic beta-glucosidase [Escherichia coli FVEC1412]
 gi|298278725|gb|EFI20239.1| periplasmic beta-glucosidase [Escherichia coli FVEC1302]
 gi|345336755|gb|EGW69189.1| periplasmic beta-glucosidase [Escherichia coli STEC_C165-02]
          Length = 765

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|218695744|ref|YP_002403411.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           55989]
 gi|218352476|emb|CAU98253.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           55989]
          Length = 765

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNSVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|421775758|ref|ZP_16212366.1| glycosyl hydrolase family 3 protein [Escherichia coli AD30]
 gi|408459227|gb|EKJ83010.1| glycosyl hydrolase family 3 protein [Escherichia coli AD30]
          Length = 765

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|329923020|ref|ZP_08278536.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
 gi|328941793|gb|EGG38078.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
          Length = 763

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 146/454 (32%), Positives = 226/454 (49%), Gaps = 44/454 (9%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHY 66
           G   +SP +++ RDPRWGR +ET GEDP L  ++A + V+GLQG    S   + A  KH+
Sbjct: 158 GAATYSPVLDVVRDPRWGRTEETFGEDPHLVAEFAVAAVQGLQGERLDSHTSLLATLKHF 217

Query: 67  TAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
             Y         R      +  ++L +   +PF+  V  G + S+M +YN+++G P  + 
Sbjct: 218 AGYGASEGG---RNGAPVHMGLRELHEVDLLPFRKAVESGAL-SIMTAYNEIDGVPCTSS 273

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD-CGPFLA 185
             +L+N +   W  DG++++DC ++ +L    +   +  EAA  ++KAG+D++  G    
Sbjct: 274 RYLLQNVLREAWGFDGFVITDCGAIHMLACGHNTAGSGVEAATQSLKAGVDMEMSGTMFR 333

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
            H + A+  GL+ E+D+N A    + ++ RLG+FD  P   P      + +    H  LA
Sbjct: 334 AHLQQALEQGLITEDDLNRAAGRVLELKFRLGLFD-RPYVDP--AWAEQVIGCKEHIALA 390

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA--CGYTTPLQG 303
            QAA +GIVLLKN    LPL +    T+AVIGPN+      +G+Y          T L G
Sbjct: 391 YQAAAEGIVLLKNEGNLLPLDS-SSGTIAVIGPNAHTPYHQLGDYTSPQPPGQIVTVLDG 449

Query: 304 ISRY---AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIE--------- 351
           I R    ++ ++  GC  +  +  +    A   A QAD  V+V+G   + +         
Sbjct: 450 IRRRLGDSRVLYAPGCR-IQGDSREGFPRALACAEQADVIVMVLGGSSARDFGEGTIDLR 508

Query: 352 ----------------AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSF 395
                            E IDR+ L L G Q EL+  + K  + PV++V + G P+   +
Sbjct: 509 TGASVVTGDAKSDMECGEGIDRSTLTLMGVQLELLQELQKLGK-PVIVVYINGRPITEPW 567

Query: 396 AKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
              D  I AI+   YPGQ GG AIAD+LFG  NP
Sbjct: 568 I--DEFIPAIIEAWYPGQEGGGAIADMLFGDINP 599


>gi|194431481|ref|ZP_03063773.1| beta-glucosidase, periplasmic [Shigella dysenteriae 1012]
 gi|416286656|ref|ZP_11648555.1| Periplasmic beta-glucosidase [Shigella boydii ATCC 9905]
 gi|194420306|gb|EDX36383.1| beta-glucosidase, periplasmic [Shigella dysenteriae 1012]
 gi|320178921|gb|EFW53884.1| Periplasmic beta-glucosidase [Shigella boydii ATCC 9905]
          Length = 787

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 177 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 236

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 237 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 288

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 289 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 346

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 347 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 401

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 402 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 459

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 460 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 519

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 520 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 576

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 577 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 613


>gi|157154880|ref|YP_001463478.1| beta-glucosidase, periplasmic [Escherichia coli E24377A]
 gi|191165415|ref|ZP_03027257.1| beta-glucosidase, periplasmic [Escherichia coli B7A]
 gi|193062306|ref|ZP_03043401.1| beta-glucosidase, periplasmic [Escherichia coli E22]
 gi|193070494|ref|ZP_03051434.1| beta-glucosidase, periplasmic [Escherichia coli E110019]
 gi|194427113|ref|ZP_03059664.1| beta-glucosidase, periplasmic [Escherichia coli B171]
 gi|209919591|ref|YP_002293675.1| beta-D-glucoside glucohydrolase [Escherichia coli SE11]
 gi|218554698|ref|YP_002387611.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           IAI1]
 gi|260844739|ref|YP_003222517.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           O103:H2 str. 12009]
 gi|260856106|ref|YP_003229997.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           O26:H11 str. 11368]
 gi|260868835|ref|YP_003235237.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           O111:H- str. 11128]
 gi|331653559|ref|ZP_08354560.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
           (Beta-D-glucoside glucohydrolase) [Escherichia coli
           M718]
 gi|378712415|ref|YP_005277308.1| glycoside hydrolase family protein [Escherichia coli KO11FL]
 gi|386609526|ref|YP_006125012.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli W]
 gi|386700881|ref|YP_006164718.1| beta-D-glucoside glucohydrolase [Escherichia coli KO11FL]
 gi|386710007|ref|YP_006173728.1| beta-D-glucoside glucohydrolase [Escherichia coli W]
 gi|407470024|ref|YP_006783533.1| beta-D-glucoside glucohydrolase [Escherichia coli O104:H4 str.
           2009EL-2071]
 gi|407481313|ref|YP_006778462.1| beta-D-glucoside glucohydrolase [Escherichia coli O104:H4 str.
           2011C-3493]
 gi|410481860|ref|YP_006769406.1| beta-D-glucoside glucohydrolase [Escherichia coli O104:H4 str.
           2009EL-2050]
 gi|415793428|ref|ZP_11496125.1| periplasmic beta-glucosidase [Escherichia coli EPECa14]
 gi|415798009|ref|ZP_11498320.1| periplasmic beta-glucosidase [Escherichia coli E128010]
 gi|415823018|ref|ZP_11511537.1| periplasmic beta-glucosidase [Escherichia coli OK1180]
 gi|417132111|ref|ZP_11976896.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 5.0588]
 gi|417155748|ref|ZP_11993877.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 96.0497]
 gi|417159558|ref|ZP_11996586.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 99.0741]
 gi|417192056|ref|ZP_12014156.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 4.0522]
 gi|417213584|ref|ZP_12022625.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli JB1-95]
 gi|417237721|ref|ZP_12035452.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 9.0111]
 gi|417254023|ref|ZP_12045779.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 4.0967]
 gi|417298395|ref|ZP_12085634.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 900105 (10e)]
 gi|417581643|ref|ZP_12232445.1| periplasmic beta-glucosidase [Escherichia coli STEC_B2F1]
 gi|417592388|ref|ZP_12243085.1| periplasmic beta-glucosidase [Escherichia coli 2534-86]
 gi|417597446|ref|ZP_12248088.1| periplasmic beta-glucosidase [Escherichia coli 3030-1]
 gi|417624023|ref|ZP_12274322.1| periplasmic beta-glucosidase [Escherichia coli STEC_H.1.8]
 gi|417639865|ref|ZP_12290008.1| periplasmic beta-glucosidase [Escherichia coli TX1999]
 gi|417667551|ref|ZP_12317096.1| periplasmic beta-glucosidase [Escherichia coli STEC_O31]
 gi|417805696|ref|ZP_12452645.1| beta-D-glucoside glucohydrolase [Escherichia coli O104:H4 str.
           LB226692]
 gi|417833438|ref|ZP_12479886.1| beta-D-glucoside glucohydrolase [Escherichia coli O104:H4 str.
           01-09591]
 gi|419170787|ref|ZP_13714673.1| periplasmic beta-glucosidase [Escherichia coli DEC7A]
 gi|419181426|ref|ZP_13725040.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC7C]
 gi|419186871|ref|ZP_13730385.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC7D]
 gi|419192162|ref|ZP_13735616.1| periplasmic beta-glucosidase [Escherichia coli DEC7E]
 gi|419197587|ref|ZP_13740975.1| periplasmic beta-glucosidase [Escherichia coli DEC8A]
 gi|419204153|ref|ZP_13747336.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC8B]
 gi|419210283|ref|ZP_13753363.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC8C]
 gi|419216179|ref|ZP_13759181.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC8D]
 gi|419222116|ref|ZP_13765040.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC8E]
 gi|419227319|ref|ZP_13770177.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC9A]
 gi|419233064|ref|ZP_13775841.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC9B]
 gi|419238383|ref|ZP_13781104.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC9C]
 gi|419243848|ref|ZP_13786487.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC9D]
 gi|419249673|ref|ZP_13792258.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC9E]
 gi|419261692|ref|ZP_13804114.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC10B]
 gi|419267599|ref|ZP_13809954.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC10C]
 gi|419273222|ref|ZP_13815521.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC10D]
 gi|419278588|ref|ZP_13820837.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC10E]
 gi|419284732|ref|ZP_13826908.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC10F]
 gi|419290075|ref|ZP_13832167.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC11A]
 gi|419295403|ref|ZP_13837449.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC11B]
 gi|419300861|ref|ZP_13842860.1| periplasmic beta-glucosidase [Escherichia coli DEC11C]
 gi|419306992|ref|ZP_13848892.1| periplasmic beta-glucosidase [Escherichia coli DEC11D]
 gi|419312001|ref|ZP_13853863.1| periplasmic beta-glucosidase [Escherichia coli DEC11E]
 gi|419317410|ref|ZP_13859213.1| periplasmic beta-glucosidase [Escherichia coli DEC12A]
 gi|419323582|ref|ZP_13865275.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC12B]
 gi|419329545|ref|ZP_13871149.1| periplasmic beta-glucosidase [Escherichia coli DEC12C]
 gi|419335110|ref|ZP_13876643.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC12D]
 gi|419340583|ref|ZP_13882047.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC12E]
 gi|419376035|ref|ZP_13917060.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC14B]
 gi|419381376|ref|ZP_13922327.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC14C]
 gi|419386681|ref|ZP_13927559.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC14D]
 gi|419392227|ref|ZP_13933039.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC15A]
 gi|419407727|ref|ZP_13948416.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC15D]
 gi|419413310|ref|ZP_13953962.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC15E]
 gi|419874731|ref|ZP_14396639.1| beta-D-glucoside glucohydrolase [Escherichia coli O111:H11 str.
           CVM9534]
 gi|419885239|ref|ZP_14406027.1| beta-D-glucoside glucohydrolase [Escherichia coli O111:H11 str.
           CVM9545]
 gi|419892214|ref|ZP_14412244.1| beta-D-glucoside glucohydrolase [Escherichia coli O111:H8 str.
           CVM9570]
 gi|419895725|ref|ZP_14415510.1| beta-D-glucoside glucohydrolase [Escherichia coli O111:H8 str.
           CVM9574]
 gi|419902369|ref|ZP_14421595.1| beta-D-glucoside glucohydrolase [Escherichia coli O26:H11 str.
           CVM9942]
 gi|419906534|ref|ZP_14425435.1| glycoside hydrolase family protein [Escherichia coli O26:H11 str.
           CVM10026]
 gi|419950438|ref|ZP_14466652.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           CUMT8]
 gi|420089793|ref|ZP_14601573.1| beta-D-glucoside glucohydrolase [Escherichia coli O111:H8 str.
           CVM9602]
 gi|420095654|ref|ZP_14607134.1| beta-D-glucoside glucohydrolase [Escherichia coli O111:H8 str.
           CVM9634]
 gi|420104281|ref|ZP_14615006.1| beta-D-glucoside glucohydrolase [Escherichia coli O111:H11 str.
           CVM9455]
 gi|420106417|ref|ZP_14616827.1| beta-D-glucoside glucohydrolase [Escherichia coli O111:H11 str.
           CVM9553]
 gi|420117928|ref|ZP_14627272.1| beta-D-glucoside glucohydrolase [Escherichia coli O26:H11 str.
           CVM10021]
 gi|420124003|ref|ZP_14632876.1| beta-D-glucoside glucohydrolase [Escherichia coli O26:H11 str.
           CVM10030]
 gi|420128121|ref|ZP_14636682.1| beta-D-glucoside glucohydrolase [Escherichia coli O26:H11 str.
           CVM10224]
 gi|420134841|ref|ZP_14642941.1| beta-D-glucoside glucohydrolase [Escherichia coli O26:H11 str.
           CVM9952]
 gi|420386177|ref|ZP_14885528.1| periplasmic beta-glucosidase [Escherichia coli EPECa12]
 gi|422762401|ref|ZP_16816158.1| glycosyl hydrolase 3 domain-containing protein [Escherichia coli
           E1167]
 gi|422777426|ref|ZP_16831078.1| glycosyl hydrolase 3 domain-containing protein [Escherichia coli
           H120]
 gi|424749008|ref|ZP_18177130.1| beta-D-glucoside glucohydrolase [Escherichia coli O26:H11 str.
           CFSAN001629]
 gi|424761502|ref|ZP_18189074.1| beta-D-glucoside glucohydrolase [Escherichia coli O111:H11 str.
           CFSAN001630]
 gi|424772573|ref|ZP_18199667.1| beta-D-glucoside glucohydrolase [Escherichia coli O111:H8 str.
           CFSAN001632]
 gi|425422949|ref|ZP_18804117.1| glycosyl hydrolase family 3 protein [Escherichia coli 0.1288]
 gi|427805246|ref|ZP_18972313.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           chi7122]
 gi|427809805|ref|ZP_18976870.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli]
 gi|443618188|ref|YP_007382044.1| beta-D-glucoside glucohydrolase [Escherichia coli APEC O78]
 gi|450218284|ref|ZP_21895848.1| beta-D-glucoside glucohydrolase [Escherichia coli O08]
 gi|157076910|gb|ABV16618.1| beta-glucosidase, periplasmic [Escherichia coli E24377A]
 gi|190904578|gb|EDV64285.1| beta-glucosidase, periplasmic [Escherichia coli B7A]
 gi|192931972|gb|EDV84571.1| beta-glucosidase, periplasmic [Escherichia coli E22]
 gi|192956188|gb|EDV86651.1| beta-glucosidase, periplasmic [Escherichia coli E110019]
 gi|194414734|gb|EDX31005.1| beta-glucosidase, periplasmic [Escherichia coli B171]
 gi|209912850|dbj|BAG77924.1| beta-D-glucoside glucohydrolase [Escherichia coli SE11]
 gi|218361466|emb|CAQ99055.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           IAI1]
 gi|257754755|dbj|BAI26257.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           O26:H11 str. 11368]
 gi|257759886|dbj|BAI31383.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           O103:H2 str. 12009]
 gi|257765191|dbj|BAI36686.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           O111:H- str. 11128]
 gi|315061443|gb|ADT75770.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli W]
 gi|323152359|gb|EFZ38648.1| periplasmic beta-glucosidase [Escherichia coli EPECa14]
 gi|323161760|gb|EFZ47641.1| periplasmic beta-glucosidase [Escherichia coli E128010]
 gi|323176973|gb|EFZ62563.1| periplasmic beta-glucosidase [Escherichia coli OK1180]
 gi|323377976|gb|ADX50244.1| glycoside hydrolase family 3 domain protein [Escherichia coli
           KO11FL]
 gi|323944949|gb|EGB41014.1| glycosyl hydrolase 3 domain-containing protein [Escherichia coli
           H120]
 gi|324118027|gb|EGC11926.1| glycosyl hydrolase 3 domain-containing protein [Escherichia coli
           E1167]
 gi|331048408|gb|EGI20484.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
           (Beta-D-glucoside glucohydrolase) [Escherichia coli
           M718]
 gi|340734320|gb|EGR63450.1| beta-D-glucoside glucohydrolase [Escherichia coli O104:H4 str.
           01-09591]
 gi|340739608|gb|EGR73840.1| beta-D-glucoside glucohydrolase [Escherichia coli O104:H4 str.
           LB226692]
 gi|345337414|gb|EGW69846.1| periplasmic beta-glucosidase [Escherichia coli STEC_B2F1]
 gi|345338725|gb|EGW71152.1| periplasmic beta-glucosidase [Escherichia coli 2534-86]
 gi|345353731|gb|EGW85960.1| periplasmic beta-glucosidase [Escherichia coli 3030-1]
 gi|345376984|gb|EGX08916.1| periplasmic beta-glucosidase [Escherichia coli STEC_H.1.8]
 gi|345393357|gb|EGX23133.1| periplasmic beta-glucosidase [Escherichia coli TX1999]
 gi|378014831|gb|EHV77728.1| periplasmic beta-glucosidase [Escherichia coli DEC7A]
 gi|378023674|gb|EHV86346.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC7C]
 gi|378028863|gb|EHV91479.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC7D]
 gi|378038691|gb|EHW01200.1| periplasmic beta-glucosidase [Escherichia coli DEC7E]
 gi|378047119|gb|EHW09491.1| periplasmic beta-glucosidase [Escherichia coli DEC8A]
 gi|378048873|gb|EHW11226.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC8B]
 gi|378054073|gb|EHW16359.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC8C]
 gi|378062663|gb|EHW24840.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC8D]
 gi|378065676|gb|EHW27819.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC8E]
 gi|378074832|gb|EHW36861.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC9A]
 gi|378077591|gb|EHW39585.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC9B]
 gi|378084283|gb|EHW46195.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC9C]
 gi|378090926|gb|EHW52761.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC9D]
 gi|378095719|gb|EHW57504.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC9E]
 gi|378106741|gb|EHW68369.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC10B]
 gi|378111427|gb|EHW73011.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC10C]
 gi|378116679|gb|EHW78199.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC10D]
 gi|378128804|gb|EHW90185.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC10E]
 gi|378129876|gb|EHW91246.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC11A]
 gi|378131986|gb|EHW93339.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC10F]
 gi|378142490|gb|EHX03692.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC11B]
 gi|378148979|gb|EHX10112.1| periplasmic beta-glucosidase [Escherichia coli DEC11D]
 gi|378150477|gb|EHX11592.1| periplasmic beta-glucosidase [Escherichia coli DEC11C]
 gi|378157629|gb|EHX18660.1| periplasmic beta-glucosidase [Escherichia coli DEC11E]
 gi|378164658|gb|EHX25599.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC12B]
 gi|378168993|gb|EHX29893.1| periplasmic beta-glucosidase [Escherichia coli DEC12A]
 gi|378169861|gb|EHX30747.1| periplasmic beta-glucosidase [Escherichia coli DEC12C]
 gi|378182189|gb|EHX42842.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC12D]
 gi|378187494|gb|EHX48105.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC12E]
 gi|378220122|gb|EHX80388.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC14B]
 gi|378227743|gb|EHX87911.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC14C]
 gi|378231208|gb|EHX91319.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC14D]
 gi|378237427|gb|EHX97450.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC15A]
 gi|378254106|gb|EHY13970.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC15D]
 gi|378258742|gb|EHY18558.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC15E]
 gi|383392408|gb|AFH17366.1| beta-D-glucoside glucohydrolase [Escherichia coli KO11FL]
 gi|383405699|gb|AFH11942.1| beta-D-glucoside glucohydrolase [Escherichia coli W]
 gi|386149965|gb|EIH01254.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 5.0588]
 gi|386168837|gb|EIH35353.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 96.0497]
 gi|386175009|gb|EIH47001.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 99.0741]
 gi|386190938|gb|EIH79684.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 4.0522]
 gi|386194280|gb|EIH88536.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli JB1-95]
 gi|386213499|gb|EII23924.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 9.0111]
 gi|386215950|gb|EII32442.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 4.0967]
 gi|386258075|gb|EIJ13557.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 900105 (10e)]
 gi|388348234|gb|EIL13852.1| beta-D-glucoside glucohydrolase [Escherichia coli O111:H8 str.
           CVM9570]
 gi|388350421|gb|EIL15802.1| beta-D-glucoside glucohydrolase [Escherichia coli O111:H11 str.
           CVM9534]
 gi|388350774|gb|EIL16100.1| beta-D-glucoside glucohydrolase [Escherichia coli O111:H11 str.
           CVM9545]
 gi|388360046|gb|EIL24293.1| beta-D-glucoside glucohydrolase [Escherichia coli O111:H8 str.
           CVM9574]
 gi|388374097|gb|EIL37299.1| beta-D-glucoside glucohydrolase [Escherichia coli O26:H11 str.
           CVM9942]
 gi|388379020|gb|EIL41712.1| glycoside hydrolase family protein [Escherichia coli O26:H11 str.
           CVM10026]
 gi|388416862|gb|EIL76736.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           CUMT8]
 gi|391305244|gb|EIQ63035.1| periplasmic beta-glucosidase [Escherichia coli EPECa12]
 gi|394386304|gb|EJE63810.1| beta-D-glucoside glucohydrolase [Escherichia coli O26:H11 str.
           CVM10224]
 gi|394387003|gb|EJE64473.1| beta-D-glucoside glucohydrolase [Escherichia coli O111:H8 str.
           CVM9602]
 gi|394391762|gb|EJE68591.1| beta-D-glucoside glucohydrolase [Escherichia coli O111:H8 str.
           CVM9634]
 gi|394401369|gb|EJE77182.1| beta-D-glucoside glucohydrolase [Escherichia coli O26:H11 str.
           CVM10021]
 gi|394404640|gb|EJE79991.1| beta-D-glucoside glucohydrolase [Escherichia coli O111:H11 str.
           CVM9455]
 gi|394415904|gb|EJE89734.1| beta-D-glucoside glucohydrolase [Escherichia coli O26:H11 str.
           CVM10030]
 gi|394416230|gb|EJE90037.1| beta-D-glucoside glucohydrolase [Escherichia coli O111:H11 str.
           CVM9553]
 gi|394420962|gb|EJE94460.1| beta-D-glucoside glucohydrolase [Escherichia coli O26:H11 str.
           CVM9952]
 gi|397784697|gb|EJK95550.1| periplasmic beta-glucosidase [Escherichia coli STEC_O31]
 gi|406777022|gb|AFS56446.1| beta-D-glucoside glucohydrolase [Escherichia coli O104:H4 str.
           2009EL-2050]
 gi|407053610|gb|AFS73661.1| beta-D-glucoside glucohydrolase [Escherichia coli O104:H4 str.
           2011C-3493]
 gi|407066059|gb|AFS87106.1| beta-D-glucoside glucohydrolase [Escherichia coli O104:H4 str.
           2009EL-2071]
 gi|408344018|gb|EKJ58409.1| glycosyl hydrolase family 3 protein [Escherichia coli 0.1288]
 gi|412963428|emb|CCK47353.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           chi7122]
 gi|412969984|emb|CCJ44627.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli]
 gi|421938124|gb|EKT95711.1| beta-D-glucoside glucohydrolase [Escherichia coli O111:H8 str.
           CFSAN001632]
 gi|421943044|gb|EKU00342.1| beta-D-glucoside glucohydrolase [Escherichia coli O26:H11 str.
           CFSAN001629]
 gi|421943522|gb|EKU00805.1| beta-D-glucoside glucohydrolase [Escherichia coli O111:H11 str.
           CFSAN001630]
 gi|443422696|gb|AGC87600.1| beta-D-glucoside glucohydrolase [Escherichia coli APEC O78]
 gi|449317902|gb|EMD07984.1| beta-D-glucoside glucohydrolase [Escherichia coli O08]
          Length = 765

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNSVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|437836465|ref|ZP_20845649.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SARB17]
 gi|435299411|gb|ELO75560.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SARB17]
          Length = 613

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 147/458 (32%), Positives = 228/458 (49%), Gaps = 61/458 (13%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 3   WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 62

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 63  AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 114

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 115 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 172

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA---- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   PA    
Sbjct: 173 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKE-SDPADTNA 226

Query: 241 ----HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVA 294
               H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA
Sbjct: 227 ESRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVA 284

Query: 295 CGYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAE 331
               T L GI           YAK               +++        +   +I  A 
Sbjct: 285 NQSVTVLAGIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAV 344

Query: 332 VAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPV 391
            AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P 
Sbjct: 345 QAAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP- 402

Query: 392 DVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 403 -LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 439


>gi|300901249|ref|ZP_07119350.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 198-1]
 gi|300355323|gb|EFJ71193.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 198-1]
          Length = 789

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 179 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 238

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 239 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 290

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 291 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 348

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 349 SKYLPGLIKSGKVTMEELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 403

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 404 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 461

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 462 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 521

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 522 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 578

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 579 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 615


>gi|432543741|ref|ZP_19780584.1| periplasmic beta-glucosidase [Escherichia coli KTE236]
 gi|432549231|ref|ZP_19785999.1| periplasmic beta-glucosidase [Escherichia coli KTE237]
 gi|432622384|ref|ZP_19858415.1| periplasmic beta-glucosidase [Escherichia coli KTE76]
 gi|432815872|ref|ZP_20049656.1| periplasmic beta-glucosidase [Escherichia coli KTE115]
 gi|431074151|gb|ELD81715.1| periplasmic beta-glucosidase [Escherichia coli KTE236]
 gi|431079509|gb|ELD86463.1| periplasmic beta-glucosidase [Escherichia coli KTE237]
 gi|431159184|gb|ELE59741.1| periplasmic beta-glucosidase [Escherichia coli KTE76]
 gi|431364096|gb|ELG50640.1| periplasmic beta-glucosidase [Escherichia coli KTE115]
          Length = 755

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLEAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|432412347|ref|ZP_19655010.1| periplasmic beta-glucosidase [Escherichia coli KTE39]
 gi|432432422|ref|ZP_19674851.1| periplasmic beta-glucosidase [Escherichia coli KTE187]
 gi|432436860|ref|ZP_19679248.1| periplasmic beta-glucosidase [Escherichia coli KTE188]
 gi|432457240|ref|ZP_19699424.1| periplasmic beta-glucosidase [Escherichia coli KTE201]
 gi|432496241|ref|ZP_19738037.1| periplasmic beta-glucosidase [Escherichia coli KTE214]
 gi|432504962|ref|ZP_19746686.1| periplasmic beta-glucosidase [Escherichia coli KTE220]
 gi|432524332|ref|ZP_19761460.1| periplasmic beta-glucosidase [Escherichia coli KTE230]
 gi|432569213|ref|ZP_19805725.1| periplasmic beta-glucosidase [Escherichia coli KTE53]
 gi|432593399|ref|ZP_19829716.1| periplasmic beta-glucosidase [Escherichia coli KTE60]
 gi|432608021|ref|ZP_19844206.1| periplasmic beta-glucosidase [Escherichia coli KTE67]
 gi|432784081|ref|ZP_20018260.1| periplasmic beta-glucosidase [Escherichia coli KTE63]
 gi|432845104|ref|ZP_20077937.1| periplasmic beta-glucosidase [Escherichia coli KTE141]
 gi|432974281|ref|ZP_20163120.1| periplasmic beta-glucosidase [Escherichia coli KTE209]
 gi|432995869|ref|ZP_20184474.1| periplasmic beta-glucosidase [Escherichia coli KTE218]
 gi|433000438|ref|ZP_20188963.1| periplasmic beta-glucosidase [Escherichia coli KTE223]
 gi|433058652|ref|ZP_20245698.1| periplasmic beta-glucosidase [Escherichia coli KTE124]
 gi|433087813|ref|ZP_20274184.1| periplasmic beta-glucosidase [Escherichia coli KTE137]
 gi|433116067|ref|ZP_20301858.1| periplasmic beta-glucosidase [Escherichia coli KTE153]
 gi|433125717|ref|ZP_20311276.1| periplasmic beta-glucosidase [Escherichia coli KTE160]
 gi|433139786|ref|ZP_20325042.1| periplasmic beta-glucosidase [Escherichia coli KTE167]
 gi|433149703|ref|ZP_20334724.1| periplasmic beta-glucosidase [Escherichia coli KTE174]
 gi|433208309|ref|ZP_20391983.1| periplasmic beta-glucosidase [Escherichia coli KTE97]
 gi|433213045|ref|ZP_20396635.1| periplasmic beta-glucosidase [Escherichia coli KTE99]
 gi|430934754|gb|ELC55101.1| periplasmic beta-glucosidase [Escherichia coli KTE39]
 gi|430952847|gb|ELC71761.1| periplasmic beta-glucosidase [Escherichia coli KTE187]
 gi|430962191|gb|ELC80048.1| periplasmic beta-glucosidase [Escherichia coli KTE188]
 gi|430981974|gb|ELC98693.1| periplasmic beta-glucosidase [Escherichia coli KTE201]
 gi|431023499|gb|ELD36694.1| periplasmic beta-glucosidase [Escherichia coli KTE214]
 gi|431038056|gb|ELD49025.1| periplasmic beta-glucosidase [Escherichia coli KTE220]
 gi|431051448|gb|ELD61111.1| periplasmic beta-glucosidase [Escherichia coli KTE230]
 gi|431099927|gb|ELE04943.1| periplasmic beta-glucosidase [Escherichia coli KTE53]
 gi|431127499|gb|ELE29799.1| periplasmic beta-glucosidase [Escherichia coli KTE60]
 gi|431137907|gb|ELE39747.1| periplasmic beta-glucosidase [Escherichia coli KTE67]
 gi|431328504|gb|ELG15808.1| periplasmic beta-glucosidase [Escherichia coli KTE63]
 gi|431394526|gb|ELG78059.1| periplasmic beta-glucosidase [Escherichia coli KTE141]
 gi|431489142|gb|ELH68770.1| periplasmic beta-glucosidase [Escherichia coli KTE209]
 gi|431505429|gb|ELH84035.1| periplasmic beta-glucosidase [Escherichia coli KTE218]
 gi|431508424|gb|ELH86696.1| periplasmic beta-glucosidase [Escherichia coli KTE223]
 gi|431568907|gb|ELI41869.1| periplasmic beta-glucosidase [Escherichia coli KTE124]
 gi|431604324|gb|ELI73733.1| periplasmic beta-glucosidase [Escherichia coli KTE137]
 gi|431633829|gb|ELJ02091.1| periplasmic beta-glucosidase [Escherichia coli KTE153]
 gi|431644951|gb|ELJ12604.1| periplasmic beta-glucosidase [Escherichia coli KTE160]
 gi|431660099|gb|ELJ26987.1| periplasmic beta-glucosidase [Escherichia coli KTE167]
 gi|431670372|gb|ELJ36725.1| periplasmic beta-glucosidase [Escherichia coli KTE174]
 gi|431729594|gb|ELJ93213.1| periplasmic beta-glucosidase [Escherichia coli KTE97]
 gi|431734070|gb|ELJ97471.1| periplasmic beta-glucosidase [Escherichia coli KTE99]
          Length = 755

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 230/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVQAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|416299903|ref|ZP_11652493.1| Periplasmic beta-glucosidase [Shigella flexneri CDC 796-83]
 gi|420351728|ref|ZP_14852911.1| periplasmic beta-glucosidase [Shigella boydii 4444-74]
 gi|320184843|gb|EFW59633.1| Periplasmic beta-glucosidase [Shigella flexneri CDC 796-83]
 gi|391287063|gb|EIQ45596.1| periplasmic beta-glucosidase [Shigella boydii 4444-74]
          Length = 755

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDMVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|260909489|ref|ZP_05916193.1| periplasmic beta-glucosidase [Prevotella sp. oral taxon 472 str.
           F0295]
 gi|260636414|gb|EEX54400.1| periplasmic beta-glucosidase [Prevotella sp. oral taxon 472 str.
           F0295]
          Length = 761

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/426 (34%), Positives = 217/426 (50%), Gaps = 20/426 (4%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL 71
           ++P V++ RDPRWGR  E  GEDP    K A + V+G Q N G    V AC KH+ AY  
Sbjct: 176 FAPMVDLARDPRWGRIVEGAGEDPFFGCKVAEARVKGFQWNLGKPNSVLACAKHFAAYGA 235

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
                  R +    +S   L + Y  PFKAC+  G V + M ++N +NG+P      +L 
Sbjct: 236 PQ---AGRDYAPVDISPATLAEMYLPPFKACIDAG-VQTFMSAFNDLNGEPATGSHWLLT 291

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC--GPFLAIHTE 189
           + +  QW   G++VSD ++V V    Q       +AA  A+KAG+D+D   G + A H  
Sbjct: 292 DLLRNQWAFKGFVVSDWNAV-VQLKAQGVVENDRDAAIMALKAGVDMDMVDGLYNA-HIA 349

Query: 190 GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAA 249
            AVR GL+ E  VN A+   +  + RLG+FD  P A        + V +     LA +AA
Sbjct: 350 DAVRKGLISEFTVNTAVERILRQKYRLGLFDN-PYAFLDNAREQQTVRSAELMALAREAA 408

Query: 250 HQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA--GVACGYTTPLQGISRY 307
            + +VLLKN    LPLS  +   +AVIGP +D    ++G++   G      T ++G+   
Sbjct: 409 TKSMVLLKNEGGVLPLSK-QVRRIAVIGPLADNQAEVLGSWKARGEDADVVTVVKGLRNK 467

Query: 308 ----AKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLP 363
               A  ++  GC  +    N+    A  AA  AD  + V+G    +  E   RA L LP
Sbjct: 468 LGTNANVVYVRGCDFLDTKNNEFA-KARQAAANADVVIAVVGEKALMSGESRSRAFLSLP 526

Query: 364 GRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVL 423
           G QQ+L+  + K++  P+V VLM G P  ++ A+   +  A+L   +PG   G A+ADVL
Sbjct: 527 GLQQQLIDTL-KSTGKPLVTVLMNGRP--LTLARVAEQSDALLEAWFPGTQCGNAVADVL 583

Query: 424 FGRANP 429
           FG  NP
Sbjct: 584 FGDVNP 589


>gi|200387658|ref|ZP_03214270.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Virchow str. SL491]
 gi|199604756|gb|EDZ03301.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Virchow str. SL491]
          Length = 765

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 150/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 267 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGN--------------QLIGAAEV 332
              T L GI           YAK  +     G+    N               +I  A  
Sbjct: 438 QSVTVLAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 AAKQADVVVAVVGESQGMAHEASSRTDITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|419862653|ref|ZP_14385247.1| beta-D-glucoside glucohydrolase [Escherichia coli O103:H25 str.
           CVM9340]
 gi|388344793|gb|EIL10617.1| beta-D-glucoside glucohydrolase [Escherichia coli O103:H25 str.
           CVM9340]
          Length = 765

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|423281958|ref|ZP_17260843.1| hypothetical protein HMPREF1204_00381 [Bacteroides fragilis HMW
           615]
 gi|404582445|gb|EKA87139.1| hypothetical protein HMPREF1204_00381 [Bacteroides fragilis HMW
           615]
          Length = 805

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 155/451 (34%), Positives = 221/451 (49%), Gaps = 48/451 (10%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLK-VAACCKHYTAYD 70
           + P +++ RDPRW R +ET GEDP L G    + VRG QG T +  K V A  KH+ +Y 
Sbjct: 198 YGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETLNDGKSVIATLKHFASY- 256

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
              W         A + +++LE+    PF+  V  G + SVM SYN+++G P      +L
Sbjct: 257 --GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLL 313

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP-FLAIHTE 189
            + +  +W+  G++VSD  +VG L           EAA  A+ AG+D D G    A    
Sbjct: 314 TDILKDRWQFKGFVVSDLYAVGGL-REHGVAGNDYEAAIKAVNAGVDSDLGTNVYAEQLV 372

Query: 190 GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF--GNLGPRDVCTPAHQQLALQ 247
            AV+ G +    ++ A+   ++++ ++G+FD      PF       + V +  H  LA +
Sbjct: 373 AAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPFVDEKQAAQLVASSEHTGLARE 427

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT--TPLQGI- 304
            A Q IVLLKN  + LPL      T+AVIGPN+D    M+G+Y       T  T L GI 
Sbjct: 428 VARQSIVLLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIR 486

Query: 305 ---SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG----LDQSIEAEFI-- 355
              S+  + ++  GC     +      A E  AR ADA V+VMG     D S E E    
Sbjct: 487 QKVSKETRVLYAKGCAVRDSSRTGFKDAIET-ARNADAVVMVMGGSSARDFSSEYEETGA 545

Query: 356 -----------------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKN 398
                            DRA L L GRQ EL+  +++  + PVVLVL+ G P+ +  A  
Sbjct: 546 AKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQ 604

Query: 399 DPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +    AI+   YPG  GG A+ADVLFG  NP
Sbjct: 605 EAE--AIVDAWYPGMQGGNAVADVLFGDYNP 633


>gi|194449077|ref|YP_002046219.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. SL476]
 gi|205359117|ref|ZP_02667180.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Heidelberg str. SL486]
 gi|386591978|ref|YP_006088378.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Heidelberg str. B182]
 gi|417384271|ref|ZP_12149694.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Johannesburg str. S5-703]
 gi|194407381|gb|ACF67600.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Heidelberg str. SL476]
 gi|205338504|gb|EDZ25268.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Heidelberg str. SL486]
 gi|353609401|gb|EHC62716.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Johannesburg str. S5-703]
 gi|383799022|gb|AFH46104.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Heidelberg str. B182]
          Length = 755

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 147/458 (32%), Positives = 228/458 (49%), Gaps = 61/458 (13%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 257 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA---- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   PA    
Sbjct: 315 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKE-SDPADTNA 368

Query: 241 ----HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVA 294
               H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA
Sbjct: 369 ESRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVA 426

Query: 295 CGYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAE 331
               T L GI           YAK               +++        +   +I  A 
Sbjct: 427 NQSVTVLAGIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAV 486

Query: 332 VAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPV 391
            AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P 
Sbjct: 487 QAAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP- 544

Query: 392 DVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 -LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|156933295|ref|YP_001437211.1| hypothetical protein ESA_01107 [Cronobacter sakazakii ATCC BAA-894]
 gi|156531549|gb|ABU76375.1| hypothetical protein ESA_01107 [Cronobacter sakazakii ATCC BAA-894]
          Length = 757

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 147/457 (32%), Positives = 223/457 (48%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGRG E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 147 WAPMVDVTRDPRWGRGSEGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYG 206

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+K  +  G    VM + N +NG P  +D
Sbjct: 207 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASD 258

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ + G W   G  +SD  ++  L   +H T + PE+A   AIK+G+D+     + 
Sbjct: 259 GWLLKDLLRGDWGFKGITISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYY 316

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD-------VC 237
           + +    ++ G +   +++ A  + + V+  +G+F+      P+ +LGP+D         
Sbjct: 317 SKYLPNLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAE 371

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
           +  H+  A + A + +VLLKN   TLPL   +  T+AV+GP +D    M+G++  AGVA 
Sbjct: 372 SRLHRDDARKVARESLVLLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVAD 429

Query: 296 GYTTPLQGIS----RYAKTIHQAGCFGVACNG-------------------NQLIGAAEV 332
              T LQG+       AK ++  G       G                     +I  A  
Sbjct: 430 QSVTLLQGMKNVAGHKAKILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKAMIDEAVN 489

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  L +P  Q++L+S + KA+  P+VLVLM G P  
Sbjct: 490 VAKQSDVVVAVVGEAQGMAHEASSRTDLTIPQSQRDLISAL-KATGKPLVLVLMNGRP-- 546

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 547 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 583


>gi|161502686|ref|YP_001569798.1| hypothetical protein SARI_00733 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:- str. RSK2980]
 gi|160864033|gb|ABX20656.1| hypothetical protein SARI_00733 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:-]
          Length = 771

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 150/457 (32%), Positives = 228/457 (49%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG N   R  V    KH+ AY 
Sbjct: 161 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKNPADRYSVMTSVKHFAAYG 220

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 221 AVEGGKEYNSVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 272

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK  +  +W   G  VSD  ++  L   +H T   PE+A   A+K+G+D+     + 
Sbjct: 273 SWLLKEVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGVDMSMADEYY 330

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 331 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 385

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 386 SRLHRKEAREVARESLVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAD 443

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGN--------------QLIGAAEV 332
              T L GI           YAK  +     G+    N               +I  A  
Sbjct: 444 QSVTVLAGIQNAVGDGAKILYAKGANITNNKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 503

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 504 AAKQADVVVAVVGESQGMAHEASSRTDITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 560

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 561 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 597


>gi|417350089|ref|ZP_12128568.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Gaminara str. A4-567]
 gi|353570992|gb|EHC35077.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Gaminara str. A4-567]
          Length = 622

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 148/458 (32%), Positives = 228/458 (49%), Gaps = 61/458 (13%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 257 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA---- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   PA    
Sbjct: 315 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKE-SDPADTNA 368

Query: 241 ----HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVA 294
               H++ A + A + +VLLKN   TLP+   +  T+AV+GP +D    ++G++  AGVA
Sbjct: 369 ESRLHRKEAREVARESVVLLKNRLETLPMK--KSGTIAVVGPLADSQRDVMGSWSAAGVA 426

Query: 295 CGYTTPLQGISR---------YAKTIHQAGCFGVACNGN--------------QLIGAAE 331
               T L GI           YAK  +     G+    N               +I  A 
Sbjct: 427 NQSVTVLAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAV 486

Query: 332 VAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPV 391
            AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P 
Sbjct: 487 QAAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP- 544

Query: 392 DVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 -LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|432679306|ref|ZP_19914705.1| periplasmic beta-glucosidase [Escherichia coli KTE143]
 gi|431224366|gb|ELF21593.1| periplasmic beta-glucosidase [Escherichia coli KTE143]
          Length = 755

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 230/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVQAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|238912609|ref|ZP_04656446.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Tennessee str. CDC07-0191]
          Length = 765

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 147/458 (32%), Positives = 228/458 (49%), Gaps = 61/458 (13%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 267 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA---- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   PA    
Sbjct: 325 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKE-SDPADTNA 378

Query: 241 ----HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVA 294
               H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA
Sbjct: 379 ESRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVA 436

Query: 295 CGYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAE 331
               T L GI           YAK               +++        +   +I  A 
Sbjct: 437 NQSVTVLAGIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAV 496

Query: 332 VAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPV 391
            AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P 
Sbjct: 497 QAAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP- 554

Query: 392 DVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 -LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|289805550|ref|ZP_06536179.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhi str. AG3]
          Length = 657

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 47  WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 106

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 107 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 158

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 159 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 216

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 217 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 271

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 272 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 329

Query: 296 GYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAEV 332
              T L GI           YAK               +++        +   +I  A  
Sbjct: 330 QSVTVLAGIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 389

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 390 AAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 446

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 447 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 483


>gi|82543548|ref|YP_407495.1| beta-D-glucoside glucohydrolase, periplasmic [Shigella boydii
           Sb227]
 gi|417680936|ref|ZP_12330315.1| periplasmic beta-glucosidase [Shigella boydii 3594-74]
 gi|421683108|ref|ZP_16122905.1| &beta-D-glucoside glucohydrolase, periplasmic [Shigella flexneri
           1485-80]
 gi|81244959|gb|ABB65667.1| beta-D-glucoside glucohydrolase, periplasmic [Shigella boydii
           Sb227]
 gi|332097401|gb|EGJ02381.1| periplasmic beta-glucosidase [Shigella boydii 3594-74]
 gi|404339162|gb|EJZ65600.1| &beta-D-glucoside glucohydrolase, periplasmic [Shigella flexneri
           1485-80]
          Length = 765

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDMVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|425305848|ref|ZP_18695559.1| glycosyl hydrolase family 3 protein [Escherichia coli N1]
 gi|408228473|gb|EKI52007.1| glycosyl hydrolase family 3 protein [Escherichia coli N1]
          Length = 765

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 TWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMAELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGIS---------RYAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI          RYAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVRYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|378954476|ref|YP_005211963.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Gallinarum/pullorum str. RKS5078]
 gi|357205087|gb|AET53133.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Gallinarum/pullorum str. RKS5078]
          Length = 755

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 257 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 427

Query: 296 GYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAEV 332
              T L GI           YAK               +H+        +   +I  A  
Sbjct: 428 QSVTVLAGIQNAVDDGAKILYAKGANITNDKDIVDFLNLHEEAVKIDPRSPQAMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QA+  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 AAKQANVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|417137893|ref|ZP_11981658.1| glycosyl hydrolase family 3, C-terminal domain protein [Escherichia
           coli 97.0259]
 gi|386158639|gb|EIH14974.1| glycosyl hydrolase family 3, C-terminal domain protein [Escherichia
           coli 97.0259]
          Length = 654

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 44  WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 103

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 104 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 155

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 156 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 213

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 214 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 268

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 269 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 326

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 327 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 386

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 387 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 443

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 444 LAMVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 480


>gi|217968103|ref|YP_002353609.1| glycoside hydrolase family 3 [Dictyoglomus turgidum DSM 6724]
 gi|217337202|gb|ACK42995.1| glycoside hydrolase family 3 domain protein [Dictyoglomus turgidum
           DSM 6724]
          Length = 756

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 152/430 (35%), Positives = 219/430 (50%), Gaps = 37/430 (8%)

Query: 13  SPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDLD 72
           SP +NI RDPR GR +ET GEDP L  + A +Y++G+Q        V A  KH+ A  + 
Sbjct: 187 SPTINIARDPRCGRTEETYGEDPYLASRMAVAYIKGVQEQG-----VIATPKHFAANFVG 241

Query: 73  NWNGVDRY--HFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
           +  G D Y  HF+ R+    L + Y   FKA + E    S+M +YN ++G P  ++  +L
Sbjct: 242 D-GGRDSYPIHFSERL----LREVYFPAFKASIKEAGALSLMAAYNSLDGIPCSSNKWLL 296

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDL-----DCGPFLA 185
            + +  +W   GY+VSD  SV  L        +  EAA  A++AGLD+     DC   + 
Sbjct: 297 TDVLRKEWGFKGYVVSDYFSVLHLMTKHKVAESKAEAARLALEAGLDMELPDSDCFEEMI 356

Query: 186 IHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLA 245
                 V+GG L EE +N A+   + V+   G+FD  P   P  +   R      H++LA
Sbjct: 357 ----NLVKGGKLSEETINEAVRRILGVKFWAGLFDN-PFVDP--DYAERVNDCAEHRELA 409

Query: 246 LQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI- 304
           L+ A + IVLLKN    LPLS     ++AVIGPN+   V  +G Y+G      TPL+GI 
Sbjct: 410 LRVARESIVLLKNEG-ILPLSK-DIGSIAVIGPNA--AVPRLGGYSGYGVKIVTPLEGIK 465

Query: 305 ---SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLD-QSIEAEFIDRAGL 360
                 AK     GC G+          A   A+++D  +L +G      E E  DR  L
Sbjct: 466 NKMENKAKIYFAEGC-GLNDTSKSGFDEAIKIAQKSDVAILFVGNSVPETEGEQRDRHNL 524

Query: 361 LLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIA 420
            LPG Q+EL+  +   +  PV++VL+ G    ++      ++ A++   YPG+ GG AIA
Sbjct: 525 NLPGVQEELIKEICN-TNTPVIVVLINGSA--ITMMNWIDKVQAVIEAWYPGEEGGNAIA 581

Query: 421 DVLFGRANPG 430
           DVLFG  NPG
Sbjct: 582 DVLFGDYNPG 591


>gi|197248682|ref|YP_002147137.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar Agona
           str. SL483]
 gi|440763077|ref|ZP_20942125.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Agona str. SH11G1113]
 gi|440769230|ref|ZP_20948190.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Agona str. SH08SF124]
 gi|440771374|ref|ZP_20950293.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Agona str. SH10GFN094]
 gi|197212385|gb|ACH49782.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Agona str. SL483]
 gi|436415218|gb|ELP13139.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Agona str. SH08SF124]
 gi|436421310|gb|ELP19157.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Agona str. SH11G1113]
 gi|436422409|gb|ELP20247.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Agona str. SH10GFN094]
          Length = 765

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 147/458 (32%), Positives = 228/458 (49%), Gaps = 61/458 (13%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 267 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA---- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   PA    
Sbjct: 325 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKE-SDPADTNA 378

Query: 241 ----HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVA 294
               H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA
Sbjct: 379 ESRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVA 436

Query: 295 CGYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAE 331
               T L GI           YAK               +++        +   +I  A 
Sbjct: 437 NQSVTVLAGIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAV 496

Query: 332 VAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPV 391
            AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P 
Sbjct: 497 QAAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP- 554

Query: 392 DVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 -LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|414576727|ref|ZP_11433909.1| periplasmic beta-glucosidase [Shigella sonnei 3233-85]
 gi|420359193|ref|ZP_14860167.1| periplasmic beta-glucosidase [Shigella sonnei 3226-85]
 gi|391281874|gb|EIQ40511.1| periplasmic beta-glucosidase [Shigella sonnei 3226-85]
 gi|391284480|gb|EIQ43075.1| periplasmic beta-glucosidase [Shigella sonnei 3233-85]
          Length = 765

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|419728314|ref|ZP_14255280.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41579]
 gi|419736497|ref|ZP_14263337.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41563]
 gi|419741060|ref|ZP_14267772.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41573]
 gi|419745173|ref|ZP_14271816.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41566]
 gi|419746816|ref|ZP_14273391.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41565]
 gi|421574444|ref|ZP_16020065.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00325]
 gi|421579948|ref|ZP_16025510.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00326]
 gi|421585722|ref|ZP_16031214.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00328]
 gi|381291806|gb|EIC33035.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41563]
 gi|381293004|gb|EIC34177.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41573]
 gi|381299851|gb|EIC40919.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41579]
 gi|381306597|gb|EIC47470.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41566]
 gi|381320690|gb|EIC61232.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41565]
 gi|402521423|gb|EJW28761.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00326]
 gi|402524644|gb|EJW31941.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00325]
 gi|402529572|gb|EJW36805.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00328]
          Length = 765

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 147/458 (32%), Positives = 228/458 (49%), Gaps = 61/458 (13%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 267 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA---- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   PA    
Sbjct: 325 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKE-SDPADTNA 378

Query: 241 ----HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVA 294
               H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA
Sbjct: 379 ESRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVA 436

Query: 295 CGYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAE 331
               T L GI           YAK               +++        +   +I  A 
Sbjct: 437 NQSVTVLAGIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAV 496

Query: 332 VAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPV 391
            AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P 
Sbjct: 497 QAAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP- 554

Query: 392 DVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 -LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|170019550|ref|YP_001724504.1| glycoside hydrolase family protein [Escherichia coli ATCC 8739]
 gi|194436057|ref|ZP_03068159.1| beta-glucosidase, periplasmic [Escherichia coli 101-1]
 gi|251785501|ref|YP_002999805.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           BL21(DE3)]
 gi|253772940|ref|YP_003035771.1| glycoside hydrolase [Escherichia coli 'BL21-Gold(DE3)pLysS AG']
 gi|254162147|ref|YP_003045255.1| beta-D-glucoside glucohydrolase [Escherichia coli B str. REL606]
 gi|254288909|ref|YP_003054657.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           BL21(DE3)]
 gi|297518468|ref|ZP_06936854.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           OP50]
 gi|301029467|ref|ZP_07192554.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 196-1]
 gi|312973618|ref|ZP_07787790.1| periplasmic beta-glucosidase [Escherichia coli 1827-70]
 gi|331663629|ref|ZP_08364539.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
           (Beta-D-glucoside glucohydrolase) [Escherichia coli
           TA143]
 gi|331668831|ref|ZP_08369679.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
           (Beta-D-glucoside glucohydrolase) [Escherichia coli
           TA271]
 gi|331673657|ref|ZP_08374420.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
           (Beta-D-glucoside glucohydrolase) [Escherichia coli
           TA280]
 gi|383179091|ref|YP_005457096.1| beta-D-glucoside glucohydrolase [Shigella sonnei 53G]
 gi|415813243|ref|ZP_11505052.1| periplasmic beta-glucosidase [Escherichia coli LT-68]
 gi|416343694|ref|ZP_11677594.1| Periplasmic beta-glucosidase [Escherichia coli EC4100B]
 gi|417146021|ref|ZP_11986979.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 1.2264]
 gi|417221746|ref|ZP_12025186.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 96.154]
 gi|417232244|ref|ZP_12033642.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 5.0959]
 gi|417270267|ref|ZP_12057627.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 3.3884]
 gi|417272586|ref|ZP_12059935.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 2.4168]
 gi|417602722|ref|ZP_12253292.1| periplasmic beta-glucosidase [Escherichia coli STEC_94C]
 gi|418266487|ref|ZP_12885993.1| &beta-D-glucoside glucohydrolase, periplasmic [Shigella sonnei str.
           Moseley]
 gi|418943929|ref|ZP_13497060.1| beta-D-glucoside glucohydrolase [Escherichia coli O157:H43 str.
           T22]
 gi|419175856|ref|ZP_13719694.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC7B]
 gi|419345798|ref|ZP_13887173.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC13A]
 gi|419350211|ref|ZP_13891549.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC13B]
 gi|419355623|ref|ZP_13896881.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC13C]
 gi|419360715|ref|ZP_13901933.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC13D]
 gi|419365659|ref|ZP_13906821.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC13E]
 gi|419370636|ref|ZP_13911755.1| periplasmic beta-glucosidase [Escherichia coli DEC14A]
 gi|422334408|ref|ZP_16415415.1| periplasmic beta-glucosidase [Escherichia coli 4_1_47FAA]
 gi|422786751|ref|ZP_16839490.1| glycosyl hydrolase 3 domain-containing protein [Escherichia coli
           H489]
 gi|422790614|ref|ZP_16843318.1| glycosyl hydrolase 3 domain-containing protein [Escherichia coli
           TA007]
 gi|425120242|ref|ZP_18521945.1| periplasmic beta-glucosidase [Escherichia coli 8.0569]
 gi|432962302|ref|ZP_20151955.1| periplasmic beta-glucosidase [Escherichia coli KTE202]
 gi|433063596|ref|ZP_20250519.1| periplasmic beta-glucosidase [Escherichia coli KTE125]
 gi|442592268|ref|ZP_21010248.1| Periplasmic beta-glucosidase [Escherichia coli O10:K5(L):H4 str.
           ATCC 23506]
 gi|442599173|ref|ZP_21016904.1| Periplasmic beta-glucosidase [Escherichia coli O5:K4(L):H4 str.
           ATCC 23502]
 gi|169754478|gb|ACA77177.1| glycoside hydrolase family 3 domain protein [Escherichia coli ATCC
           8739]
 gi|194424785|gb|EDX40770.1| beta-glucosidase, periplasmic [Escherichia coli 101-1]
 gi|242377774|emb|CAQ32537.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           BL21(DE3)]
 gi|253323984|gb|ACT28586.1| glycoside hydrolase family 3 domain protein [Escherichia coli
           'BL21-Gold(DE3)pLysS AG']
 gi|253974048|gb|ACT39719.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli B
           str. REL606]
 gi|253978216|gb|ACT43886.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           BL21(DE3)]
 gi|299877670|gb|EFI85881.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 196-1]
 gi|310332213|gb|EFP99448.1| periplasmic beta-glucosidase [Escherichia coli 1827-70]
 gi|320199726|gb|EFW74315.1| Periplasmic beta-glucosidase [Escherichia coli EC4100B]
 gi|323171784|gb|EFZ57428.1| periplasmic beta-glucosidase [Escherichia coli LT-68]
 gi|323961640|gb|EGB57245.1| glycosyl hydrolase 3 domain-containing protein [Escherichia coli
           H489]
 gi|323972884|gb|EGB68082.1| glycosyl hydrolase 3 domain-containing protein [Escherichia coli
           TA007]
 gi|331059428|gb|EGI31405.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
           (Beta-D-glucoside glucohydrolase) [Escherichia coli
           TA143]
 gi|331064025|gb|EGI35936.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
           (Beta-D-glucoside glucohydrolase) [Escherichia coli
           TA271]
 gi|331068930|gb|EGI40322.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
           (Beta-D-glucoside glucohydrolase) [Escherichia coli
           TA280]
 gi|345350388|gb|EGW82663.1| periplasmic beta-glucosidase [Escherichia coli STEC_94C]
 gi|373244634|gb|EHP64115.1| periplasmic beta-glucosidase [Escherichia coli 4_1_47FAA]
 gi|375320761|gb|EHS66675.1| beta-D-glucoside glucohydrolase [Escherichia coli O157:H43 str.
           T22]
 gi|378033259|gb|EHV95839.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC7B]
 gi|378186911|gb|EHX47532.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC13A]
 gi|378200389|gb|EHX60844.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC13B]
 gi|378200953|gb|EHX61406.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC13C]
 gi|378203156|gb|EHX63580.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC13D]
 gi|378212894|gb|EHX73213.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           DEC13E]
 gi|378217230|gb|EHX77509.1| periplasmic beta-glucosidase [Escherichia coli DEC14A]
 gi|386163473|gb|EIH25268.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 1.2264]
 gi|386201548|gb|EII00539.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 96.154]
 gi|386205243|gb|EII09754.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 5.0959]
 gi|386229072|gb|EII56428.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 3.3884]
 gi|386236286|gb|EII68262.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 2.4168]
 gi|397899717|gb|EJL16089.1| &beta-D-glucoside glucohydrolase, periplasmic [Shigella sonnei str.
           Moseley]
 gi|408569397|gb|EKK45386.1| periplasmic beta-glucosidase [Escherichia coli 8.0569]
 gi|431474093|gb|ELH53915.1| periplasmic beta-glucosidase [Escherichia coli KTE202]
 gi|431581590|gb|ELI54037.1| periplasmic beta-glucosidase [Escherichia coli KTE125]
 gi|441608421|emb|CCP99274.1| Periplasmic beta-glucosidase [Escherichia coli O10:K5(L):H4 str.
           ATCC 23506]
 gi|441652085|emb|CCQ02401.1| Periplasmic beta-glucosidase [Escherichia coli O5:K4(L):H4 str.
           ATCC 23502]
          Length = 765

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|417672543|ref|ZP_12322009.1| periplasmic beta-glucosidase [Shigella dysenteriae 155-74]
 gi|332092627|gb|EGI97698.1| periplasmic beta-glucosidase [Shigella dysenteriae 155-74]
          Length = 765

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|332278714|ref|ZP_08391127.1| conserved hypothetical protein [Shigella sp. D9]
 gi|332101066|gb|EGJ04412.1| conserved hypothetical protein [Shigella sp. D9]
          Length = 789

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 179 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 238

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 239 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 290

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 291 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 348

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 349 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 403

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 404 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 461

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 462 QSVTVLTGIKNSVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 521

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 522 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 578

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 579 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 615


>gi|300821764|ref|ZP_07101909.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 119-7]
 gi|300917029|ref|ZP_07133724.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 115-1]
 gi|300929307|ref|ZP_07144781.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 187-1]
 gi|309793010|ref|ZP_07687438.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 145-7]
 gi|331678081|ref|ZP_08378756.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
           (Beta-D-glucoside glucohydrolase) [Escherichia coli
           H591]
 gi|386705397|ref|YP_006169244.1| Periplasmic beta-glucosidase [Escherichia coli P12b]
 gi|300415704|gb|EFJ99014.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 115-1]
 gi|300462726|gb|EFK26219.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 187-1]
 gi|300525606|gb|EFK46675.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 119-7]
 gi|308123296|gb|EFO60558.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 145-7]
 gi|331074541|gb|EGI45861.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
           (Beta-D-glucoside glucohydrolase) [Escherichia coli
           H591]
 gi|383103565|gb|AFG41074.1| Periplasmic beta-glucosidase [Escherichia coli P12b]
          Length = 789

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 179 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 238

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 239 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 290

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 291 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 348

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 349 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 403

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 404 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 461

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 462 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 521

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 522 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 578

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 579 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 615


>gi|423258860|ref|ZP_17239783.1| hypothetical protein HMPREF1055_02060 [Bacteroides fragilis
           CL07T00C01]
 gi|423264169|ref|ZP_17243172.1| hypothetical protein HMPREF1056_00859 [Bacteroides fragilis
           CL07T12C05]
 gi|387776440|gb|EIK38540.1| hypothetical protein HMPREF1055_02060 [Bacteroides fragilis
           CL07T00C01]
 gi|392706435|gb|EIY99558.1| hypothetical protein HMPREF1056_00859 [Bacteroides fragilis
           CL07T12C05]
          Length = 805

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 155/451 (34%), Positives = 221/451 (49%), Gaps = 48/451 (10%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLK-VAACCKHYTAYD 70
           + P +++ RDPRW R +ET GEDP L G    + VRG QG T +  K V A  KH+ +Y 
Sbjct: 198 YGPVLDLARDPRWSRVEETYGEDPYLNGAMGTALVRGFQGETLNDGKSVIATLKHFASY- 256

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
              W         A + +++LE+    PF+  V  G + SVM SYN+++G P      +L
Sbjct: 257 --GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLL 313

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP-FLAIHTE 189
            + +  +W+  G++VSD  +VG L           EAA  A+ AG+D D G    A    
Sbjct: 314 TDILKDRWQFKGFVVSDLYAVGGL-REHGVAGNDYEAAIKAVNAGVDSDLGTNVYAEQLV 372

Query: 190 GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF--GNLGPRDVCTPAHQQLALQ 247
            AV+ G +    ++ A+   ++++ ++G+FD      PF       + V +  H  LA +
Sbjct: 373 AAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPFVDEKQAVQLVASSEHTGLARE 427

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT--TPLQGI- 304
            A Q IVLLKN  + LPL      T+AVIGPN+D    M+G+Y       T  T L GI 
Sbjct: 428 VARQSIVLLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIR 486

Query: 305 ---SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG----LDQSIEAEFI-- 355
              S+  + ++  GC     +      A E  AR ADA V+VMG     D S E E    
Sbjct: 487 QKVSKETRVLYAKGCAVRDSSRTGFKDAIET-ARNADAVVMVMGGSSARDFSSEYEETGA 545

Query: 356 -----------------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKN 398
                            DRA L L GRQ EL+  +++  + PVVLVL+ G P+ +  A  
Sbjct: 546 AKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQ 604

Query: 399 DPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +    AI+   YPG  GG A+ADVLFG  NP
Sbjct: 605 EAE--AIVDAWYPGMQGGNAVADVLFGDYNP 633


>gi|300818868|ref|ZP_07099073.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 107-1]
 gi|300902218|ref|ZP_07120218.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 84-1]
 gi|300924493|ref|ZP_07140462.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 182-1]
 gi|301305176|ref|ZP_07211275.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 124-1]
 gi|301325242|ref|ZP_07218760.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 78-1]
 gi|415863788|ref|ZP_11536992.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 85-1]
 gi|415878761|ref|ZP_11544425.1| periplasmic beta-glucosidase [Escherichia coli MS 79-10]
 gi|422356099|ref|ZP_16436791.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 117-3]
 gi|300405737|gb|EFJ89275.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 84-1]
 gi|300419302|gb|EFK02613.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 182-1]
 gi|300528487|gb|EFK49549.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 107-1]
 gi|300839580|gb|EFK67340.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 124-1]
 gi|300847902|gb|EFK75662.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 78-1]
 gi|315255355|gb|EFU35323.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 85-1]
 gi|324015918|gb|EGB85137.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 117-3]
 gi|342927160|gb|EGU95882.1| periplasmic beta-glucosidase [Escherichia coli MS 79-10]
          Length = 789

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 179 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 238

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 239 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 290

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 291 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 348

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 349 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 403

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 404 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 461

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 462 QSVTVLTGIKNSVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 521

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 522 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 578

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 579 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 615


>gi|421883572|ref|ZP_16314801.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Senftenberg str. SS209]
 gi|379986798|emb|CCF87074.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Senftenberg str. SS209]
          Length = 765

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 267 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFNA-----PYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 437

Query: 296 GYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAEV 332
              T L GI           YAK               +++        +   +I  A  
Sbjct: 438 QSVTVLAGIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 AAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|265765465|ref|ZP_06093740.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
 gi|263254849|gb|EEZ26283.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
          Length = 814

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 155/451 (34%), Positives = 221/451 (49%), Gaps = 48/451 (10%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLK-VAACCKHYTAYD 70
           + P +++ RDPRW R +ET GEDP L G    + VRG QG T +  K V A  KH+ +Y 
Sbjct: 207 YGPVLDLARDPRWSRVEETYGEDPYLNGAMGTALVRGFQGETLNDGKSVIATLKHFASY- 265

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
              W         A + +++LE+    PF+  V  G + SVM SYN+++G P      +L
Sbjct: 266 --GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLL 322

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP-FLAIHTE 189
            + +  +W+  G++VSD  +VG L           EAA  A+ AG+D D G    A    
Sbjct: 323 TDILKDRWQFKGFVVSDLYAVGGL-REHGVAGNDYEAAIKAVNAGVDSDLGTNVYAEQLV 381

Query: 190 GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF--GNLGPRDVCTPAHQQLALQ 247
            AV+ G +    ++ A+   ++++ ++G+FD      PF       + V +  H  LA +
Sbjct: 382 AAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPFVDEKQAVQLVASSEHTGLARE 436

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT--TPLQGI- 304
            A Q IVLLKN  + LPL      T+AVIGPN+D    M+G+Y       T  T L GI 
Sbjct: 437 VARQSIVLLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIR 495

Query: 305 ---SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG----LDQSIEAEFI-- 355
              S+  + ++  GC     +      A E  AR ADA V+VMG     D S E E    
Sbjct: 496 QKVSKETRVLYAKGCAVRDSSRTGFKDAIET-ARNADAVVMVMGGSSARDFSSEYEETGA 554

Query: 356 -----------------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKN 398
                            DRA L L GRQ EL+  +++  + PVVLVL+ G P+ +  A  
Sbjct: 555 AKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQ 613

Query: 399 DPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +    AI+   YPG  GG A+ADVLFG  NP
Sbjct: 614 EAE--AIVDAWYPGMQGGNAVADVLFGDYNP 642


>gi|227887187|ref|ZP_04004992.1| periplasmic beta-glucosidase precursor [Escherichia coli 83972]
 gi|300978676|ref|ZP_07174366.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 45-1]
 gi|301048865|ref|ZP_07195859.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 185-1]
 gi|386629921|ref|YP_006149641.1| periplasmic beta-glucosidase [Escherichia coli str. 'clone D i2']
 gi|386634841|ref|YP_006154560.1| periplasmic beta-glucosidase [Escherichia coli str. 'clone D i14']
 gi|422362797|ref|ZP_16443349.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 153-1]
 gi|227835537|gb|EEJ46003.1| periplasmic beta-glucosidase precursor [Escherichia coli 83972]
 gi|300299320|gb|EFJ55705.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 185-1]
 gi|300409608|gb|EFJ93146.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 45-1]
 gi|315294503|gb|EFU53851.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 153-1]
 gi|355420820|gb|AER85017.1| periplasmic beta-glucosidase precursor [Escherichia coli str.
           'clone D i2']
 gi|355425740|gb|AER89936.1| periplasmic beta-glucosidase precursor [Escherichia coli str.
           'clone D i14']
          Length = 789

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 230/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 179 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVQAMQGKSPADRYSVMTSVKHFAAYG 238

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 239 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 290

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 291 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 348

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 349 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 403

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 404 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 461

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 462 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 521

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 522 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 578

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 579 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 615


>gi|432397962|ref|ZP_19640743.1| periplasmic beta-glucosidase [Escherichia coli KTE25]
 gi|432422481|ref|ZP_19665026.1| periplasmic beta-glucosidase [Escherichia coli KTE178]
 gi|432446744|ref|ZP_19689043.1| periplasmic beta-glucosidase [Escherichia coli KTE191]
 gi|432500621|ref|ZP_19742378.1| periplasmic beta-glucosidase [Escherichia coli KTE216]
 gi|432559384|ref|ZP_19796053.1| periplasmic beta-glucosidase [Escherichia coli KTE49]
 gi|432694996|ref|ZP_19930195.1| periplasmic beta-glucosidase [Escherichia coli KTE162]
 gi|432711185|ref|ZP_19946245.1| periplasmic beta-glucosidase [Escherichia coli KTE6]
 gi|432723586|ref|ZP_19958506.1| periplasmic beta-glucosidase [Escherichia coli KTE17]
 gi|432728173|ref|ZP_19963052.1| periplasmic beta-glucosidase [Escherichia coli KTE18]
 gi|432741867|ref|ZP_19976586.1| periplasmic beta-glucosidase [Escherichia coli KTE23]
 gi|432919598|ref|ZP_20123712.1| periplasmic beta-glucosidase [Escherichia coli KTE173]
 gi|432927500|ref|ZP_20128929.1| periplasmic beta-glucosidase [Escherichia coli KTE175]
 gi|432981563|ref|ZP_20170338.1| periplasmic beta-glucosidase [Escherichia coli KTE211]
 gi|432991174|ref|ZP_20179838.1| periplasmic beta-glucosidase [Escherichia coli KTE217]
 gi|433097005|ref|ZP_20283189.1| periplasmic beta-glucosidase [Escherichia coli KTE139]
 gi|433106427|ref|ZP_20292402.1| periplasmic beta-glucosidase [Escherichia coli KTE148]
 gi|433111385|ref|ZP_20297250.1| periplasmic beta-glucosidase [Escherichia coli KTE150]
 gi|430916066|gb|ELC37144.1| periplasmic beta-glucosidase [Escherichia coli KTE25]
 gi|430944093|gb|ELC64192.1| periplasmic beta-glucosidase [Escherichia coli KTE178]
 gi|430973017|gb|ELC89985.1| periplasmic beta-glucosidase [Escherichia coli KTE191]
 gi|431028198|gb|ELD41242.1| periplasmic beta-glucosidase [Escherichia coli KTE216]
 gi|431090604|gb|ELD96355.1| periplasmic beta-glucosidase [Escherichia coli KTE49]
 gi|431233578|gb|ELF29165.1| periplasmic beta-glucosidase [Escherichia coli KTE162]
 gi|431248865|gb|ELF43040.1| periplasmic beta-glucosidase [Escherichia coli KTE6]
 gi|431266140|gb|ELF57702.1| periplasmic beta-glucosidase [Escherichia coli KTE17]
 gi|431273862|gb|ELF64936.1| periplasmic beta-glucosidase [Escherichia coli KTE18]
 gi|431283558|gb|ELF74417.1| periplasmic beta-glucosidase [Escherichia coli KTE23]
 gi|431443642|gb|ELH24668.1| periplasmic beta-glucosidase [Escherichia coli KTE173]
 gi|431444023|gb|ELH25047.1| periplasmic beta-glucosidase [Escherichia coli KTE175]
 gi|431490872|gb|ELH70479.1| periplasmic beta-glucosidase [Escherichia coli KTE211]
 gi|431495256|gb|ELH74842.1| periplasmic beta-glucosidase [Escherichia coli KTE217]
 gi|431615353|gb|ELI84482.1| periplasmic beta-glucosidase [Escherichia coli KTE139]
 gi|431627134|gb|ELI95545.1| periplasmic beta-glucosidase [Escherichia coli KTE148]
 gi|431628689|gb|ELI97065.1| periplasmic beta-glucosidase [Escherichia coli KTE150]
          Length = 755

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|375001990|ref|ZP_09726330.1| glycosyl hydrolase family 3 protein [Salmonella enterica subsp.
           enterica serovar Infantis str. SARB27]
 gi|353076678|gb|EHB42438.1| glycosyl hydrolase family 3 protein [Salmonella enterica subsp.
           enterica serovar Infantis str. SARB27]
          Length = 751

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 147/458 (32%), Positives = 228/458 (49%), Gaps = 61/458 (13%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 141 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 200

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 201 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 252

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 253 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 310

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA---- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   PA    
Sbjct: 311 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKE-SDPADTNA 364

Query: 241 ----HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVA 294
               H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA
Sbjct: 365 ESRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVA 422

Query: 295 CGYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAE 331
               T L GI           YAK               +++        +   +I  A 
Sbjct: 423 NQSVTVLAGIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAV 482

Query: 332 VAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPV 391
            AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P 
Sbjct: 483 QAAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP- 540

Query: 392 DVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 541 -LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 577


>gi|205357092|ref|ZP_02344874.2| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Saintpaul str. SARA29]
 gi|213650030|ref|ZP_03380083.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhi str. J185]
 gi|289824263|ref|ZP_06543858.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhi str. E98-3139]
 gi|378958841|ref|YP_005216327.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar Typhi
           str. P-stx-12]
 gi|205324021|gb|EDZ11860.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Saintpaul str. SARA29]
 gi|374352713|gb|AEZ44474.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Typhi str. P-stx-12]
          Length = 755

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 257 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 427

Query: 296 GYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAEV 332
              T L GI           YAK               +++        +   +I  A  
Sbjct: 428 QSVTVLAGIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 AAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|213852437|ref|ZP_03381969.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhi str. M223]
          Length = 721

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 111 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 170

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 171 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 222

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 223 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 280

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 281 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 335

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 336 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 393

Query: 296 GYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAEV 332
              T L GI           YAK               +++        +   +I  A  
Sbjct: 394 QSVTVLAGIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 453

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 454 AAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 510

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 511 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 547


>gi|329922637|ref|ZP_08278189.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
 gi|328941979|gb|EGG38262.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
          Length = 765

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 146/457 (31%), Positives = 218/457 (47%), Gaps = 48/457 (10%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHY 66
           G   +SP +++ RDPRWGR +E  GEDP L  +YA + V GLQG +  S   VAA  KH+
Sbjct: 157 GAVTYSPVLDVVRDPRWGRTEECFGEDPYLISEYAVASVEGLQGESLDSPSSVAATLKHF 216

Query: 67  TAY-DLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             Y   +        H   R    +L +   +PFK  V E   AS+M +YN+++G P   
Sbjct: 217 VGYGSSEGGRNAGPVHMGTR----ELMEVDMLPFKKAV-EAGAASIMPAYNEIDGVPCTV 271

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD-CGPFL 184
           + ++L   +  +W  DG +++DC ++ +L +         +AA  AI+AG+DL+  G   
Sbjct: 272 NTELLDGILRKEWGFDGMVITDCGAIDMLASGHDTAEDGMDAAVQAIRAGIDLEMSGEMF 331

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
             H + AV    L    ++ A+   +T++ +LG+F+  P   P        + +  H  L
Sbjct: 332 GKHLQKAVESNKLEVSVLDEAVRRVLTLKFKLGLFEN-PYVDP--QTAENVIGSGQHIGL 388

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA--CGYTTPLQ 302
           A Q A +GIVLLKN A+ LPLS      +AVIGPN+D     +G+Y         TT L 
Sbjct: 389 ARQLAAEGIVLLKNEAKALPLSK-EGGVIAVIGPNADQGYNQLGDYTSPQPPAAVTTVLG 447

Query: 303 GI-----SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG------------ 345
           GI         + ++  GC  +  +  +    A   A QAD  V+V+G            
Sbjct: 448 GIRAKLGEEAQRVLYAPGCR-IKDDSREGFEFALSCAEQADTVVMVLGGSSARDFGEGTI 506

Query: 346 ---------LDQSIE----AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
                     D ++      E IDR  L L G Q +L   + K  +  +++V + G P+ 
Sbjct: 507 DLRTGASKVTDDALSDMDCGEGIDRMTLQLSGVQLDLAQEIHKLGK-RMIVVYINGRPIA 565

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
             +   D    AIL   YPGQ GG AIAD+LFG  NP
Sbjct: 566 EPWI--DEHADAILEAWYPGQEGGHAIADILFGDVNP 600


>gi|409197254|ref|ZP_11225917.1| glycoside hydrolase 3 [Marinilabilia salmonicolor JCM 21150]
          Length = 734

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 139/436 (31%), Positives = 216/436 (49%), Gaps = 43/436 (9%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           ++P ++I RD RWGR  E+ GEDP LT    A+ V G QG++      +AAC KH+  Y 
Sbjct: 136 FAPMIDISRDARWGRIAESLGEDPYLTSVLGAAMVTGFQGDSLNGETSIAACAKHFAGYG 195

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
                   R +    +  ++L D Y  PFKA V  G V + M  +N+V+G P  A+  +L
Sbjct: 196 AAEGG---RDYNTTSIPPRELRDIYLPPFKAAVDAG-VRTFMSGFNEVDGVPATANKYLL 251

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEG 190
            + +  +W+ DG++VSD  S   + N   +    +EAA  AIK G+D++           
Sbjct: 252 TDVLRNEWQFDGFVVSDWASTWEMIN-HGFAADEKEAAHRAIKVGVDMEMATTTYRDNIA 310

Query: 191 A-VRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAA 249
           A ++ G L  ED+N A+   + V+  LG+FD    A+   N   R    P + + A  AA
Sbjct: 311 ALLKEGALNIEDINQAVRNILRVKFELGLFDNPYIAEEKQNQFAR----PEYLEAANLAA 366

Query: 250 HQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGN--YAGVACGYTTPLQGISR- 306
            Q +VLLKN  +TLP+++     +A+IGP +D     +G   + G      TPLQ  ++ 
Sbjct: 367 TQSMVLLKNEQKTLPINS--SSKIALIGPMADQPYEQLGTWIFDGDTTLTVTPLQAFNKT 424

Query: 307 -------------YAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAE 353
                         ++T HQ G F  A             A+ +D  V   G +  +  E
Sbjct: 425 FGQENVLFAEGMPISRTRHQKG-FRKAIE----------QAKNSDVIVFCGGEESILSGE 473

Query: 354 FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQ 413
              RA + LPG Q EL+  + K  + P+VLV+M G P+ +   +      A+++  +PG 
Sbjct: 474 AHSRANIDLPGVQNELIKELKKTGK-PLVLVVMAGRPLTI--GEISEHADAVVYAWHPGT 530

Query: 414 AGGAAIADVLFGRANP 429
            GGAA+AD++ G+ANP
Sbjct: 531 MGGAALADIVSGKANP 546


>gi|418855797|ref|ZP_13410448.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19593]
 gi|392820838|gb|EJA76679.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19593]
          Length = 765

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 267 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 437

Query: 296 GYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAEV 332
              T L GI           YAK               +++        +   +I  A  
Sbjct: 438 QSVTVLAGIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 AAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|422829956|ref|ZP_16878119.1| periplasmic beta-glucosidase [Escherichia coli B093]
 gi|425278425|ref|ZP_18669671.1| beta-D-glucoside glucohydrolase [Escherichia coli ARS4.2123]
 gi|371607379|gb|EHN95954.1| periplasmic beta-glucosidase [Escherichia coli B093]
 gi|408202047|gb|EKI27181.1| beta-D-glucoside glucohydrolase [Escherichia coli ARS4.2123]
          Length = 755

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 NRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|194446334|ref|YP_002041437.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Newport str. SL254]
 gi|194469038|ref|ZP_03075022.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Kentucky str. CVM29188]
 gi|195874002|ref|ZP_02700028.2| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. SL317]
 gi|205358223|ref|ZP_02654575.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Kentucky str. CDC 191]
 gi|205359498|ref|ZP_02830128.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Weltevreden str. HI_N05-537]
 gi|417374418|ref|ZP_12144175.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Inverness str. R8-3668]
 gi|418765082|ref|ZP_13321175.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35199]
 gi|418772136|ref|ZP_13328140.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21539]
 gi|418825678|ref|ZP_13380951.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 22462]
 gi|194404997|gb|ACF65219.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. SL254]
 gi|194455402|gb|EDX44241.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Kentucky str. CVM29188]
 gi|195631424|gb|EDX49984.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. SL317]
 gi|205335699|gb|EDZ22463.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Kentucky str. CDC 191]
 gi|205344851|gb|EDZ31615.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Weltevreden str. HI_N05-537]
 gi|353599534|gb|EHC55680.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Inverness str. R8-3668]
 gi|392732250|gb|EIZ89461.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21539]
 gi|392742031|gb|EIZ99126.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35199]
 gi|392812566|gb|EJA68550.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 22462]
          Length = 755

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 257 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 427

Query: 296 GYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAEV 332
              T L GI           YAK               +++        +   +I  A  
Sbjct: 428 QSVTVLAGIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 AAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|417334866|ref|ZP_12117911.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Alachua str. R6-377]
 gi|353573495|gb|EHC36830.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Alachua str. R6-377]
          Length = 623

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 257 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 427

Query: 296 GYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAEV 332
              T L GI           YAK               +++        +   +I  A  
Sbjct: 428 QSVTVLAGIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 AAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|224536087|ref|ZP_03676626.1| hypothetical protein BACCELL_00952 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224522306|gb|EEF91411.1| hypothetical protein BACCELL_00952 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 791

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 134/443 (30%), Positives = 207/443 (46%), Gaps = 38/443 (8%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL 71
           +SP ++I +DPRWGR  E  GEDP L G+     ++ LQ +     ++ +  KH+  Y +
Sbjct: 200 YSPILDICQDPRWGRSVECYGEDPYLVGQLGKQMIQSLQKH-----RLVSTVKHFAVYSI 254

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
                  +   +  VS +++   Y  PF+    E     VM SYN  +G+P  +    L 
Sbjct: 255 PVGGRDGKTRTDPHVSPREMRTLYLEPFRRAFCEAGALGVMSSYNDYDGEPITSSHHFLT 314

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG----PFLAIH 187
             +  ++   GY+VSD ++V  +    H      E  A A+ AGL++           + 
Sbjct: 315 EILRQEYGFKGYVVSDSEAVEFITTKHHVVSNEVEGVAQAVNAGLNIRTHFTKPEDFVLP 374

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPAHQQL 244
              A++ G +  E +N  +A  + ++  LG+FD      P+     ++   V    HQQ+
Sbjct: 375 LRQAIKEGKVSPETINSRVADILRIKFWLGLFDN-----PYRGDEKQEEKIVHCKEHQQV 429

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           AL+AA Q +VLLKN  + LPL      +VAVIGPN++    +I  Y        T  QGI
Sbjct: 430 ALEAARQSLVLLKNENQLLPLKKTV-KSVAVIGPNANEQTQLICRYGPANAPIKTVYQGI 488

Query: 305 SRY---AKTIHQAGC--------------FGVACNGNQLIGAAEVAARQADATVLVMGLD 347
                  + +++ GC              F       Q++  A  AAR A+  VLV+G  
Sbjct: 489 KELLPETEVVYRKGCEIIDSHFPESEILPFEKTTEEQQMLDEAVAAARNAEVVVLVLGGS 548

Query: 348 QSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILW 407
           +    E   R  L LPG QQEL+  +  A+  P VLVL+ G    +++A     I AIL 
Sbjct: 549 ELTVREDRSRTSLDLPGHQQELMQAI-HATGKPTVLVLLDGRAATINYANQ--YIPAILH 605

Query: 408 VGYPGQAGGAAIADVLFGRANPG 430
             +PG+  G A+A+ LFG  NPG
Sbjct: 606 AWFPGEFAGTAVAEALFGDYNPG 628


>gi|194736410|ref|YP_002115255.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Schwarzengrund str. CVM19633]
 gi|197300833|ref|ZP_02661390.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Schwarzengrund str. SL480]
 gi|204929066|ref|ZP_03220209.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Javiana str. GA_MM04042433]
 gi|205360240|ref|ZP_02681470.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Hadar str. RI_05P066]
 gi|416491454|ref|ZP_11727088.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. OH_2009072675]
 gi|416496191|ref|ZP_11729048.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. CASC_09SCPH15965]
 gi|416586610|ref|ZP_11775622.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. MD_MDA09249507]
 gi|416674556|ref|ZP_11821234.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. MB102109-0047]
 gi|417359373|ref|ZP_12133763.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar Give
           str. S5-487]
 gi|417391848|ref|ZP_12154881.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Minnesota str. A4-603]
 gi|417476332|ref|ZP_12170882.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Rubislaw str. A4-653]
 gi|417511967|ref|ZP_12176427.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Senftenberg str. A4-543]
 gi|194711912|gb|ACF91133.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Schwarzengrund str. CVM19633]
 gi|197290594|gb|EDY29949.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Schwarzengrund str. SL480]
 gi|204321610|gb|EDZ06809.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Javiana str. GA_MM04042433]
 gi|205350836|gb|EDZ37467.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Hadar str. RI_05P066]
 gi|322649375|gb|EFY45811.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. OH_2009072675]
 gi|322656495|gb|EFY52784.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. CASC_09SCPH15965]
 gi|322667165|gb|EFY63332.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. MD_MDA09249507]
 gi|323222876|gb|EGA07232.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. MB102109-0047]
 gi|353589745|gb|EHC48461.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar Give
           str. S5-487]
 gi|353614040|gb|EHC65987.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Minnesota str. A4-603]
 gi|353641236|gb|EHC86018.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Rubislaw str. A4-653]
 gi|353641325|gb|EHC86084.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Senftenberg str. A4-543]
          Length = 755

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 257 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 427

Query: 296 GYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAEV 332
              T L GI           YAK               +++        +   +I  A  
Sbjct: 428 QSVTVLAGIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 AAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|116621797|ref|YP_823953.1| glycoside hydrolase family protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116224959|gb|ABJ83668.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
           usitatus Ellin6076]
          Length = 765

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 150/451 (33%), Positives = 231/451 (51%), Gaps = 45/451 (9%)

Query: 13  SPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLK--VAACCKHYTAYD 70
           +P V++ R+PRWGR +ET GEDP L  +   + VRG QG+   R K  V A  KH+ A+ 
Sbjct: 163 TPVVDVAREPRWGRVEETYGEDPFLVSRMGIAAVRGFQGDATFRDKTRVIATLKHFAAHG 222

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
               +G +    N  VS + L +T+  PFK  + +G   SVM SYN+++G P+ A   +L
Sbjct: 223 QPE-SGTNCAPVN--VSMRVLRETFLFPFKEALDKGCAISVMASYNEIDGVPSHASRWLL 279

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVL-YNTQ---HYT-RTPEEAAADAIKAGLDLDC-GPFL 184
           ++ +  +W   G++VSD  ++  L Y  +   H+  +   EA A A++AG++++   P  
Sbjct: 280 RDVLRKEWGFKGFVVSDYYAIYELSYRPESHGHFVAKDKREACALAVQAGVNIELPEPDC 339

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
            +H    V  G+L+E  ++  +   +  + ++G+FD +P   P      R     AH++L
Sbjct: 340 YLHLVDLVHKGVLQESQLDELVEPMLRWKFQMGLFD-DPYVDP--AEAERIAGCDAHREL 396

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 304
           A+QAA + I LLKN    +PL      T+AVIGPN++   +++G Y+GV     T L GI
Sbjct: 397 AMQAARETITLLKNDGPVVPLDLSAIKTIAVIGPNAN--RSLLGGYSGVPKHDVTVLDGI 454

Query: 305 SR----YAKTIHQAGC---------------FGVACNGNQLIGAAEVAARQADATVLVMG 345
                  AK ++  GC                  A +  Q+  A +VA R AD  VL +G
Sbjct: 455 RERVGSRAKVVYAEGCKITIGGSWVQDEVTPSDPAEDRRQIAEAVKVAKR-ADVIVLAIG 513

Query: 346 LDQSIEAE------FIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKND 399
            ++    E        DR  L L GRQ+ELV R   A+  PV+  L  G P+ +++    
Sbjct: 514 GNEQTSREAWSPKHLGDRPSLDLVGRQEELV-RAMVATGKPVIAFLFNGRPISINYLAQS 572

Query: 400 PRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
             + AI    Y GQ  G A+A+VLFG  NPG
Sbjct: 573 --VPAIFECWYLGQETGRAVAEVLFGDTNPG 601


>gi|432862722|ref|ZP_20087011.1| periplasmic beta-glucosidase [Escherichia coli KTE146]
 gi|431404761|gb|ELG88007.1| periplasmic beta-glucosidase [Escherichia coli KTE146]
          Length = 755

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|366158362|ref|ZP_09458224.1| beta-D-glucoside glucohydrolase [Escherichia sp. TW09308]
          Length = 765

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 149/458 (32%), Positives = 227/458 (49%), Gaps = 61/458 (13%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA---- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   PA    
Sbjct: 325 SKYLPGLIKSGKVTMEELDDATRHVLNVKYDMGLFN-----DPYSHLGPKE-SDPADTNA 378

Query: 241 ----HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVA 294
               H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGV 
Sbjct: 379 ESRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVV 436

Query: 295 CGYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAE 331
               T L GI           YAK  +     G+    NQ              +I  A 
Sbjct: 437 DQSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAV 496

Query: 332 VAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPV 391
             A+QAD  V V+G  Q +  E   R  + +P  Q++LV+ + KA+  P+VLVLM G P 
Sbjct: 497 QTAKQADVVVAVVGEAQGMAHEASSRTDITIPQSQRDLVAAL-KATGKPLVLVLMNGRP- 554

Query: 392 DVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 -LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|432905393|ref|ZP_20114260.1| periplasmic beta-glucosidase [Escherichia coli KTE194]
 gi|431432628|gb|ELH14305.1| periplasmic beta-glucosidase [Escherichia coli KTE194]
          Length = 755

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFNA-----PYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|423300729|ref|ZP_17278753.1| hypothetical protein HMPREF1057_01894 [Bacteroides finegoldii
           CL09T03C10]
 gi|408472616|gb|EKJ91142.1| hypothetical protein HMPREF1057_01894 [Bacteroides finegoldii
           CL09T03C10]
          Length = 735

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 146/435 (33%), Positives = 219/435 (50%), Gaps = 27/435 (6%)

Query: 6   MAGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAACC 63
           M+G+ + +SP +++ RDPRWGR  E  GEDP   G + A+ VRG QG N  +  +VAAC 
Sbjct: 154 MSGVDWTFSPMIDVARDPRWGRVAEGYGEDPYANGVFGAASVRGYQGDNMSAENRVAACL 213

Query: 64  KHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
           KHY  Y         R +    +SKQ L DTY +P+K  V  G  A++M S+N ++G P 
Sbjct: 214 KHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYKMGVKAG-AATLMSSFNDISGVPG 269

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPF 183
            A+P  +   +  +WR DG+IVSD  ++  L N Q    T +EAA  A  AGL++D    
Sbjct: 270 SANPYTMTEILKNRWRHDGFIVSDWGAIEQLKN-QGLAATKKEAARHAFTAGLEMDMMSH 328

Query: 184 -LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
               H +  V  G +    V+ A+   + ++ RLG+F  E    P      R    P   
Sbjct: 329 AYDRHLQELVEEGKVSMAQVDEAVRRVLLLKFRLGLF--ERPYTPVTTEKER-FLRPQSM 385

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAG------VACG 296
            +A + A + +VLLKN    LPL+  +   +AVIGP +     ++G++ G      V   
Sbjct: 386 DIAARLAAESMVLLKNENNVLPLADKKK--IAVIGPMAKNGWDLLGSWRGHGKDTDVVML 443

Query: 297 YTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEV--AARQADATVLVMGLDQSIEAEF 354
           Y       +  A+  +  GC      G+   G AE   AAR +D  VL +G   +   E 
Sbjct: 444 YDGLAAEFAGKAELRYALGC---NTKGDNREGFAEALGAARWSDVVVLCLGEMMTWSGEN 500

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
             R+ + LP  Q+EL   + K  + PVVL+L+ G P++++  + +P   AIL +  PG  
Sbjct: 501 ASRSSIALPQMQEELAKELKKVGK-PVVLILVNGRPLELN--RLEPVSDAILEIWQPGVN 557

Query: 415 GGAAIADVLFGRANP 429
           G   +A +L GR NP
Sbjct: 558 GALPMAGILSGRINP 572


>gi|422972113|ref|ZP_16975165.1| periplasmic beta-glucosidase [Escherichia coli TA124]
 gi|432381874|ref|ZP_19624817.1| periplasmic beta-glucosidase [Escherichia coli KTE15]
 gi|432387686|ref|ZP_19630575.1| periplasmic beta-glucosidase [Escherichia coli KTE16]
 gi|432466227|ref|ZP_19708316.1| periplasmic beta-glucosidase [Escherichia coli KTE205]
 gi|432471500|ref|ZP_19713546.1| periplasmic beta-glucosidase [Escherichia coli KTE206]
 gi|432514436|ref|ZP_19751660.1| periplasmic beta-glucosidase [Escherichia coli KTE224]
 gi|432554223|ref|ZP_19790946.1| periplasmic beta-glucosidase [Escherichia coli KTE47]
 gi|432584390|ref|ZP_19820784.1| periplasmic beta-glucosidase [Escherichia coli KTE57]
 gi|432611943|ref|ZP_19848105.1| periplasmic beta-glucosidase [Escherichia coli KTE72]
 gi|432646705|ref|ZP_19882495.1| periplasmic beta-glucosidase [Escherichia coli KTE86]
 gi|432656288|ref|ZP_19891992.1| periplasmic beta-glucosidase [Escherichia coli KTE93]
 gi|432699565|ref|ZP_19934719.1| periplasmic beta-glucosidase [Escherichia coli KTE169]
 gi|432713912|ref|ZP_19948952.1| periplasmic beta-glucosidase [Escherichia coli KTE8]
 gi|432746176|ref|ZP_19980842.1| periplasmic beta-glucosidase [Escherichia coli KTE43]
 gi|432802350|ref|ZP_20036329.1| periplasmic beta-glucosidase [Escherichia coli KTE84]
 gi|432938485|ref|ZP_20136763.1| periplasmic beta-glucosidase [Escherichia coli KTE183]
 gi|432985893|ref|ZP_20174616.1| periplasmic beta-glucosidase [Escherichia coli KTE215]
 gi|433073326|ref|ZP_20259981.1| periplasmic beta-glucosidase [Escherichia coli KTE129]
 gi|433144724|ref|ZP_20329868.1| periplasmic beta-glucosidase [Escherichia coli KTE168]
 gi|433183798|ref|ZP_20368049.1| periplasmic beta-glucosidase [Escherichia coli KTE85]
 gi|371598182|gb|EHN86995.1| periplasmic beta-glucosidase [Escherichia coli TA124]
 gi|430906334|gb|ELC27934.1| periplasmic beta-glucosidase [Escherichia coli KTE16]
 gi|430907349|gb|ELC28847.1| periplasmic beta-glucosidase [Escherichia coli KTE15]
 gi|430993811|gb|ELD10155.1| periplasmic beta-glucosidase [Escherichia coli KTE205]
 gi|430997889|gb|ELD14138.1| periplasmic beta-glucosidase [Escherichia coli KTE206]
 gi|431041832|gb|ELD52327.1| periplasmic beta-glucosidase [Escherichia coli KTE224]
 gi|431084028|gb|ELD90199.1| periplasmic beta-glucosidase [Escherichia coli KTE47]
 gi|431115749|gb|ELE19243.1| periplasmic beta-glucosidase [Escherichia coli KTE57]
 gi|431148117|gb|ELE49408.1| periplasmic beta-glucosidase [Escherichia coli KTE72]
 gi|431180742|gb|ELE80629.1| periplasmic beta-glucosidase [Escherichia coli KTE86]
 gi|431190758|gb|ELE90144.1| periplasmic beta-glucosidase [Escherichia coli KTE93]
 gi|431243314|gb|ELF37701.1| periplasmic beta-glucosidase [Escherichia coli KTE169]
 gi|431256689|gb|ELF49624.1| periplasmic beta-glucosidase [Escherichia coli KTE8]
 gi|431291215|gb|ELF81728.1| periplasmic beta-glucosidase [Escherichia coli KTE43]
 gi|431348139|gb|ELG34997.1| periplasmic beta-glucosidase [Escherichia coli KTE84]
 gi|431463220|gb|ELH43413.1| periplasmic beta-glucosidase [Escherichia coli KTE183]
 gi|431500130|gb|ELH79146.1| periplasmic beta-glucosidase [Escherichia coli KTE215]
 gi|431588037|gb|ELI59385.1| periplasmic beta-glucosidase [Escherichia coli KTE129]
 gi|431661419|gb|ELJ28232.1| periplasmic beta-glucosidase [Escherichia coli KTE168]
 gi|431706360|gb|ELJ70931.1| periplasmic beta-glucosidase [Escherichia coli KTE85]
          Length = 755

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|437849166|ref|ZP_20847216.1| beta-D-glucoside glucohydrolase, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis str. 6.0562-1]
 gi|435338617|gb|ELP07830.1| beta-D-glucoside glucohydrolase, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis str. 6.0562-1]
          Length = 792

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 182 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 241

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 242 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 293

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 294 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 351

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 352 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 406

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 407 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 464

Query: 296 GYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAEV 332
              T L GI           YAK               +++        +   +I  A  
Sbjct: 465 QSVTVLAGIQNAVDDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 524

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 525 AAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 581

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 582 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 618


>gi|449307585|ref|YP_007439941.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii SP291]
 gi|449097618|gb|AGE85652.1| beta-D-glucoside glucohydrolase [Cronobacter sakazakii SP291]
          Length = 765

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 147/457 (32%), Positives = 223/457 (48%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGRG E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVTRDPRWGRGSEGFGEDTYLTSMMGKTMVESMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+K  +  G    VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKEALDAGS-GGVMIALNSLNGTPAASD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ + G W   G  +SD  ++  L   +H T + PE+A   AIK+G+D+     + 
Sbjct: 267 GWLLKDLLRGDWGFKGITISDHGAIKEL--IKHGTASDPEDAVRVAIKSGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD-------VC 237
           + +    ++ G +   +++ A  + + V+  +G+F+      P+ +LGP+D         
Sbjct: 325 SKYLPNLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPKDTNAE 379

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
           +  H+  A + A + +VLLKN   TLPL   +  T+AV+GP +D    M+G++  AGVA 
Sbjct: 380 SRLHRDDARKVARESLVLLKNRLETLPLK--KSGTIAVVGPLADSKRDMMGSWSAAGVAD 437

Query: 296 GYTTPLQGISRY----AKTIHQAGCFGVACNG-------------------NQLIGAAEV 332
              T LQG+       AK ++  G       G                     +I  A  
Sbjct: 438 QSVTLLQGMKNVAGDKAKILYAKGANVTDDKGIVDFLNLYEPAVVVDKRTPKAMIDEAVN 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  L +P  Q++L+S + KA+  P+VLVLM G P  
Sbjct: 498 VAKQSDVVVAVVGEAQGMAHEASSRTDLTIPQSQRDLISAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDLQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|421449803|ref|ZP_15899183.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 58-6482]
 gi|396068619|gb|EJI76965.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 58-6482]
          Length = 765

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 267 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 437

Query: 296 GYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAEV 332
              T L GI           YAK               +++        +   +I  A  
Sbjct: 438 QSVTVLAGIQNAVDDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 AAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|420346101|ref|ZP_14847526.1| periplasmic beta-glucosidase [Shigella boydii 965-58]
 gi|391274821|gb|EIQ33621.1| periplasmic beta-glucosidase [Shigella boydii 965-58]
          Length = 765

 Score =  195 bits (496), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P +++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMIDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|380692929|ref|ZP_09857788.1| glycoside hydrolase family protein [Bacteroides faecis MAJ27]
          Length = 777

 Score =  195 bits (496), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 137/444 (30%), Positives = 212/444 (47%), Gaps = 40/444 (9%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL 71
           +SP ++I +DPRWGR  E  GEDP L G+     ++ LQ     + K+ A  KH+  Y +
Sbjct: 185 YSPILDIAQDPRWGRAVECYGEDPYLVGQLGKQMIQSLQ-----KHKLVATPKHFAVYSI 239

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
                      +  V+ +++   Y  PF+    E     VM SYN  +G+P       L 
Sbjct: 240 PVGGRDGGTRTDPHVAPREMRTLYLEPFRVAFQEAGALGVMSSYNDYDGEPITGSYRFLT 299

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD---CGPFLAIH- 187
             +  +W   GY+VSD D+V  + +        EEA   ++ AGL++      P   I  
Sbjct: 300 QILRQEWGFKGYVVSDSDAVEFISSKHKVADNNEEAVVQSVNAGLNVRTNFSSPAGFIKP 359

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF---GNLGPRDVCTPAHQQL 244
              A+  G + +  ++  ++  + V+  LG+FD      P+   G L  + V    HQ +
Sbjct: 360 LRSAIAKGKVSQATIDQRVSEILYVKFWLGLFD-----NPYRGDGKLADKIVHCKEHQAV 414

Query: 245 ALQAAHQGIVLLKNSARTLPL-STLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           AL+AA Q IVLLKN    LPL  TL+  +VAVIGPN+D    +I  Y        T  +G
Sbjct: 415 ALEAARQSIVLLKNQDNLLPLQKTLK--SVAVIGPNADEQKELICRYGPSNAPIKTVYKG 472

Query: 304 ISRY---AKTIHQAGC--------------FGVACNGNQLIGAAEVAARQADATVLVMGL 346
           I      AK +++ GC              F +     Q++  A  AA+ A+  ++V+G 
Sbjct: 473 IKEALPGAKVVYKKGCEIVDPHFPESEVLPFDITPKEQQIMDEAIEAAKSAEVVIMVLGG 532

Query: 347 DQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
            +    E   R  L LPGRQ+EL+  V K  + P +LV++ G    +++AK    + AIL
Sbjct: 533 SEVTVREERSRTSLDLPGRQEELLKAVCKLGK-PTILVMIDGRASSINYAKK--YVPAIL 589

Query: 407 WVGYPGQAGGAAIADVLFGRANPG 430
              +PG+  G A+A+ +FG  NPG
Sbjct: 590 HAWFPGEFCGQAVAETIFGDNNPG 613


>gi|416424053|ref|ZP_11691321.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 315996572]
 gi|416434178|ref|ZP_11697512.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-1]
 gi|416438422|ref|ZP_11699509.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-3]
 gi|416447728|ref|ZP_11705981.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-4]
 gi|416450525|ref|ZP_11707600.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 515920-1]
 gi|416460485|ref|ZP_11714793.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 515920-2]
 gi|416470222|ref|ZP_11718747.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 531954]
 gi|416476726|ref|ZP_11721214.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. NC_MB110209-0054]
 gi|416508101|ref|ZP_11735884.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. SARB31]
 gi|416513698|ref|ZP_11738019.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. ATCC BAA710]
 gi|416540687|ref|ZP_11750492.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 19N]
 gi|416561671|ref|ZP_11761668.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 42N]
 gi|416575337|ref|ZP_11768369.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 81038-01]
 gi|416594582|ref|ZP_11780414.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 414877]
 gi|416602160|ref|ZP_11785217.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 366867]
 gi|416607231|ref|ZP_11788413.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 413180]
 gi|416611818|ref|ZP_11791047.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 446600]
 gi|416623896|ref|ZP_11797678.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 609458-1]
 gi|416630098|ref|ZP_11800505.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 556150-1]
 gi|416636573|ref|ZP_11803136.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 609460]
 gi|416652084|ref|ZP_11811486.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. 507440-20]
 gi|416655624|ref|ZP_11812629.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 556152]
 gi|416669115|ref|ZP_11819140.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB101509-0077]
 gi|416700997|ref|ZP_11829262.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB110209-0055]
 gi|416705382|ref|ZP_11830863.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB111609-0052]
 gi|416713638|ref|ZP_11837193.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 2009083312]
 gi|416720123|ref|ZP_11841889.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 2009085258]
 gi|416721559|ref|ZP_11842724.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 315731156]
 gi|416729877|ref|ZP_11848338.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2009159199]
 gi|416738151|ref|ZP_11853179.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008282]
 gi|416744242|ref|ZP_11856524.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008283]
 gi|416759732|ref|ZP_11864557.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. IA_2010008284]
 gi|416760387|ref|ZP_11864780.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008285]
 gi|416767537|ref|ZP_11869997.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008287]
 gi|418484908|ref|ZP_13053898.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 80959-06]
 gi|418492113|ref|ZP_13058613.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035278]
 gi|418496941|ref|ZP_13063366.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035318]
 gi|418500305|ref|ZP_13066703.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035320]
 gi|418503319|ref|ZP_13069684.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035321]
 gi|418509304|ref|ZP_13075600.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035327]
 gi|418512924|ref|ZP_13079159.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Pomona str. ATCC 10729]
 gi|418527981|ref|ZP_13093934.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008286]
 gi|452119629|ref|YP_007469877.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Javiana str. CFSAN001992]
 gi|322615194|gb|EFY12116.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 315996572]
 gi|322617785|gb|EFY14681.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-1]
 gi|322624621|gb|EFY21452.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-3]
 gi|322626928|gb|EFY23724.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-4]
 gi|322634114|gb|EFY30850.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 515920-1]
 gi|322635695|gb|EFY32405.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 515920-2]
 gi|322640170|gb|EFY36834.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 531954]
 gi|322646407|gb|EFY42919.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. NC_MB110209-0054]
 gi|322661471|gb|EFY57695.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 19N]
 gi|322665641|gb|EFY61825.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 81038-01]
 gi|322671171|gb|EFY67299.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 414877]
 gi|322675188|gb|EFY71265.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 366867]
 gi|322680797|gb|EFY76832.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 413180]
 gi|322686972|gb|EFY82949.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 446600]
 gi|323192736|gb|EFZ77963.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 609458-1]
 gi|323198832|gb|EFZ83931.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 556150-1]
 gi|323205158|gb|EFZ90136.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 609460]
 gi|323213417|gb|EFZ98214.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 556152]
 gi|323215774|gb|EGA00517.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB101509-0077]
 gi|323223860|gb|EGA08163.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB110209-0055]
 gi|323231220|gb|EGA15335.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB111609-0052]
 gi|323233654|gb|EGA17746.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 2009083312]
 gi|323237724|gb|EGA21784.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 2009085258]
 gi|323245707|gb|EGA29701.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 315731156]
 gi|323248929|gb|EGA32853.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2009159199]
 gi|323253000|gb|EGA36833.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008282]
 gi|323258676|gb|EGA42338.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008283]
 gi|323259508|gb|EGA43143.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. IA_2010008284]
 gi|323268291|gb|EGA51766.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008285]
 gi|323271920|gb|EGA55336.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008287]
 gi|363552716|gb|EHL36999.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. SARB31]
 gi|363567278|gb|EHL51278.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. ATCC BAA710]
 gi|363573864|gb|EHL57738.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 42N]
 gi|366055944|gb|EHN20277.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035318]
 gi|366057212|gb|EHN21516.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 80959-06]
 gi|366058491|gb|EHN22779.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035278]
 gi|366069158|gb|EHN33284.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035320]
 gi|366073161|gb|EHN37236.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035321]
 gi|366078494|gb|EHN42495.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035327]
 gi|366082861|gb|EHN46791.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Pomona str. ATCC 10729]
 gi|366827184|gb|EHN54093.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. 507440-20]
 gi|372204105|gb|EHP17636.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008286]
 gi|451908633|gb|AGF80439.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Javiana str. CFSAN001992]
          Length = 765

 Score =  195 bits (496), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 267 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 437

Query: 296 GYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAEV 332
              T L GI           YAK               +++        +   +I  A  
Sbjct: 438 QSVTVLAGIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 AAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|409250776|ref|YP_006886584.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
           subsp. enterica serovar Weltevreden str. 2007-60-3289-1]
 gi|418760521|ref|ZP_13316675.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35185]
 gi|418775352|ref|ZP_13331310.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 33953]
 gi|418781475|ref|ZP_13337358.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35188]
 gi|418783741|ref|ZP_13339586.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21559]
 gi|418789285|ref|ZP_13345072.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19447]
 gi|418794086|ref|ZP_13349808.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19449]
 gi|418796683|ref|ZP_13352374.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19567]
 gi|418803316|ref|ZP_13358937.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35202]
 gi|418809719|ref|ZP_13365271.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21550]
 gi|418813873|ref|ZP_13369394.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 22513]
 gi|418815139|ref|ZP_13370647.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21538]
 gi|418822928|ref|ZP_13378339.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 22425]
 gi|418830026|ref|ZP_13384989.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM N18486]
 gi|418836408|ref|ZP_13391292.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM N1543]
 gi|418840973|ref|ZP_13395796.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21554]
 gi|418844732|ref|ZP_13399518.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19443]
 gi|418850848|ref|ZP_13405564.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 37978]
 gi|418857905|ref|ZP_13412528.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19470]
 gi|418862200|ref|ZP_13416744.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19536]
 gi|419789701|ref|ZP_14315381.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 1]
 gi|419792237|ref|ZP_14317879.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 15]
 gi|320086604|emb|CBY96375.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
           subsp. enterica serovar Weltevreden str. 2007-60-3289-1]
 gi|392615486|gb|EIW97925.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 1]
 gi|392618961|gb|EIX01347.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 15]
 gi|392742601|gb|EIZ99688.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35185]
 gi|392747243|gb|EJA04244.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35188]
 gi|392748425|gb|EJA05411.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 33953]
 gi|392756753|gb|EJA13648.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21559]
 gi|392760526|gb|EJA17361.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19447]
 gi|392762081|gb|EJA18897.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19449]
 gi|392770056|gb|EJA26784.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19567]
 gi|392772845|gb|EJA29542.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35202]
 gi|392773804|gb|EJA30500.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21550]
 gi|392775099|gb|EJA31794.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 22513]
 gi|392787185|gb|EJA43733.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 22425]
 gi|392793518|gb|EJA49962.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21538]
 gi|392800769|gb|EJA56999.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM N1543]
 gi|392802259|gb|EJA58473.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM N18486]
 gi|392809639|gb|EJA65673.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21554]
 gi|392813541|gb|EJA69505.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19443]
 gi|392818698|gb|EJA74582.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 37978]
 gi|392834583|gb|EJA90187.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19470]
 gi|392836652|gb|EJA92232.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19536]
          Length = 765

 Score =  195 bits (496), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 267 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 437

Query: 296 GYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAEV 332
              T L GI           YAK               +++        +   +I  A  
Sbjct: 438 QSVTVLAGIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 AAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|300936554|ref|ZP_07151463.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 21-1]
 gi|300458317|gb|EFK21810.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 21-1]
          Length = 789

 Score =  195 bits (496), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 230/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 179 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVQAMQGKSPADRYSVMTSVKHFAAYG 238

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 239 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 290

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 291 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 348

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 349 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 403

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 404 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 461

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 462 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 521

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 522 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 578

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 579 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 615


>gi|197263061|ref|ZP_03163135.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Saintpaul str. SARA23]
 gi|207857601|ref|YP_002244252.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. P125109]
 gi|421359420|ref|ZP_15809713.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 622731-39]
 gi|421364010|ref|ZP_15814248.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639016-6]
 gi|421367050|ref|ZP_15817252.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 640631]
 gi|421373030|ref|ZP_15823175.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-0424]
 gi|421375395|ref|ZP_15825508.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-6]
 gi|421382177|ref|ZP_15832228.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 485549-17]
 gi|421386891|ref|ZP_15836897.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-22]
 gi|421391230|ref|ZP_15841201.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-70]
 gi|421395600|ref|ZP_15845536.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-26]
 gi|421397961|ref|ZP_15847870.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-37]
 gi|421402484|ref|ZP_15852342.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-46]
 gi|421409068|ref|ZP_15858863.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-50]
 gi|421411752|ref|ZP_15861516.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-1427]
 gi|421418166|ref|ZP_15867872.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-2659]
 gi|421422724|ref|ZP_15872392.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 78-1757]
 gi|421424918|ref|ZP_15874555.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22510-1]
 gi|421433210|ref|ZP_15882778.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 8b-1]
 gi|421434152|ref|ZP_15883702.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648905 5-18]
 gi|421441895|ref|ZP_15891355.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 6-18]
 gi|421443218|ref|ZP_15892660.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 50-3079]
 gi|436600421|ref|ZP_20512957.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22704]
 gi|436686654|ref|ZP_20517957.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE30663]
 gi|436802328|ref|ZP_20525319.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS44]
 gi|436809476|ref|ZP_20528856.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1882]
 gi|436814772|ref|ZP_20532323.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1884]
 gi|436844194|ref|ZP_20537952.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1594]
 gi|436854476|ref|ZP_20544110.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1566]
 gi|436855800|ref|ZP_20544925.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1580]
 gi|436864301|ref|ZP_20550268.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1543]
 gi|436870232|ref|ZP_20554038.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1441]
 gi|436877723|ref|ZP_20558651.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1810]
 gi|436886662|ref|ZP_20563082.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1558]
 gi|436893559|ref|ZP_20567466.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1018]
 gi|436901307|ref|ZP_20572217.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1010]
 gi|436912656|ref|ZP_20578485.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1729]
 gi|436920338|ref|ZP_20582934.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0895]
 gi|436926675|ref|ZP_20586501.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0899]
 gi|436936605|ref|ZP_20592045.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1457]
 gi|436940618|ref|ZP_20594562.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1747]
 gi|436950717|ref|ZP_20599772.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0968]
 gi|436961959|ref|ZP_20605333.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1444]
 gi|436969147|ref|ZP_20608268.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1445]
 gi|436976823|ref|ZP_20612073.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1559]
 gi|436989589|ref|ZP_20616596.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1565]
 gi|437001001|ref|ZP_20620797.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1808]
 gi|437022365|ref|ZP_20628334.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1811]
 gi|437036112|ref|ZP_20633844.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0956]
 gi|437044148|ref|ZP_20637101.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1455]
 gi|437052068|ref|ZP_20641628.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1575]
 gi|437059125|ref|ZP_20645972.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1725]
 gi|437064832|ref|ZP_20648606.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1745]
 gi|437077112|ref|ZP_20655320.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1791]
 gi|437083638|ref|ZP_20659292.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1795]
 gi|437092016|ref|ZP_20663616.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 576709]
 gi|437113855|ref|ZP_20669057.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 635290-58]
 gi|437122048|ref|ZP_20672085.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-16]
 gi|437128670|ref|ZP_20675357.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-19]
 gi|437139171|ref|ZP_20681653.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-2]
 gi|437144157|ref|ZP_20684771.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-9]
 gi|437151251|ref|ZP_20689128.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629163]
 gi|437162727|ref|ZP_20696289.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE15-1]
 gi|437169975|ref|ZP_20700070.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_N202]
 gi|437179039|ref|ZP_20705157.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_56-3991]
 gi|437181201|ref|ZP_20706372.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_76-3618]
 gi|437239027|ref|ZP_20714239.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13183-1]
 gi|437261953|ref|ZP_20718699.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_81-2490]
 gi|437267405|ref|ZP_20721157.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL909]
 gi|437280676|ref|ZP_20728053.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL913]
 gi|437289916|ref|ZP_20731294.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_69-4941]
 gi|437311898|ref|ZP_20736006.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 638970-15]
 gi|437330435|ref|ZP_20741599.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 17927]
 gi|437346849|ref|ZP_20747003.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS4]
 gi|437411031|ref|ZP_20752807.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 22-17]
 gi|437442646|ref|ZP_20757878.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 40-18]
 gi|437461006|ref|ZP_20761959.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 1-1]
 gi|437479748|ref|ZP_20768095.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 4-1]
 gi|437494385|ref|ZP_20772414.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642046 4-7]
 gi|437505366|ref|ZP_20775420.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648898 4-5]
 gi|437535206|ref|ZP_20781440.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648899 3-17]
 gi|437559607|ref|ZP_20785823.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648900 1-16]
 gi|437572273|ref|ZP_20789035.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 1-17]
 gi|437583575|ref|ZP_20792569.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 39-2]
 gi|437599978|ref|ZP_20797137.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648902 6-8]
 gi|437616848|ref|ZP_20802600.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648903 1-6]
 gi|437631373|ref|ZP_20806367.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648904 3-6]
 gi|437662149|ref|ZP_20813366.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 653049 13-19]
 gi|437676329|ref|ZP_20816941.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 8-1]
 gi|437695929|ref|ZP_20822252.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 9-7]
 gi|437711846|ref|ZP_20826864.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 42-20]
 gi|437813102|ref|ZP_20841687.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 33944]
 gi|438082703|ref|ZP_20857889.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 81-2625]
 gi|438099351|ref|ZP_20863367.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 62-1976]
 gi|438109484|ref|ZP_20867447.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 53-407]
 gi|445165173|ref|ZP_21394056.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE8a]
 gi|445182314|ref|ZP_21398520.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 20037]
 gi|445229380|ref|ZP_21405024.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE10]
 gi|445333762|ref|ZP_21414961.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 18569]
 gi|445344304|ref|ZP_21417576.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13-1]
 gi|445361577|ref|ZP_21423869.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. PT23]
 gi|197241316|gb|EDY23936.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Saintpaul str. SARA23]
 gi|206709404|emb|CAR33745.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Enteritidis str. P125109]
 gi|395985043|gb|EJH94216.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 622731-39]
 gi|395985485|gb|EJH94655.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639016-6]
 gi|395989705|gb|EJH98839.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 640631]
 gi|395998658|gb|EJI07685.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-0424]
 gi|395999277|gb|EJI08299.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 485549-17]
 gi|396005381|gb|EJI14360.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-6]
 gi|396011532|gb|EJI20442.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-22]
 gi|396012239|gb|EJI21137.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-26]
 gi|396012640|gb|EJI21536.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-70]
 gi|396025999|gb|EJI34772.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-50]
 gi|396032021|gb|EJI40746.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-46]
 gi|396032157|gb|EJI40881.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-37]
 gi|396039333|gb|EJI47961.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-2659]
 gi|396042039|gb|EJI50662.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 78-1757]
 gi|396045253|gb|EJI53847.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-1427]
 gi|396049426|gb|EJI57969.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 8b-1]
 gi|396058514|gb|EJI66975.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22510-1]
 gi|396060758|gb|EJI69199.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648905 5-18]
 gi|396062572|gb|EJI70983.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 6-18]
 gi|396071002|gb|EJI79329.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 50-3079]
 gi|434957421|gb|ELL51060.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS44]
 gi|434967293|gb|ELL60128.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1882]
 gi|434972888|gb|ELL65276.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1884]
 gi|434975403|gb|ELL67694.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22704]
 gi|434978780|gb|ELL70772.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1594]
 gi|434983279|gb|ELL75087.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1566]
 gi|434992014|gb|ELL83484.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1580]
 gi|434995336|gb|ELL86652.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1543]
 gi|435002560|gb|ELL93625.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1441]
 gi|435003859|gb|ELL94860.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE30663]
 gi|435008607|gb|ELL99430.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1810]
 gi|435011905|gb|ELM02608.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1558]
 gi|435018555|gb|ELM09017.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1018]
 gi|435020741|gb|ELM11130.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1010]
 gi|435026901|gb|ELM17032.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1729]
 gi|435027857|gb|ELM17949.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0895]
 gi|435036516|gb|ELM26335.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0899]
 gi|435039443|gb|ELM29224.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1457]
 gi|435045523|gb|ELM35151.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1747]
 gi|435051098|gb|ELM40602.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1444]
 gi|435051184|gb|ELM40686.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0968]
 gi|435059248|gb|ELM48538.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1445]
 gi|435071156|gb|ELM60106.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1565]
 gi|435071299|gb|ELM60247.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1559]
 gi|435074446|gb|ELM63278.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0956]
 gi|435075555|gb|ELM64369.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1808]
 gi|435076993|gb|ELM65767.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1811]
 gi|435081352|gb|ELM69994.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1455]
 gi|435091470|gb|ELM79861.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1575]
 gi|435094938|gb|ELM83277.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1725]
 gi|435100574|gb|ELM88742.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1745]
 gi|435103950|gb|ELM92024.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1791]
 gi|435107381|gb|ELM95366.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1795]
 gi|435112922|gb|ELN00787.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 576709]
 gi|435116153|gb|ELN03904.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 635290-58]
 gi|435124194|gb|ELN11661.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-16]
 gi|435128186|gb|ELN15537.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-19]
 gi|435132693|gb|ELN19891.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-2]
 gi|435139148|gb|ELN26152.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-9]
 gi|435142647|gb|ELN29534.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE15-1]
 gi|435145134|gb|ELN31963.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629163]
 gi|435147755|gb|ELN34507.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_N202]
 gi|435152456|gb|ELN39086.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_56-3991]
 gi|435162324|gb|ELN48508.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_81-2490]
 gi|435166749|gb|ELN52715.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_76-3618]
 gi|435170164|gb|ELN55920.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL913]
 gi|435175921|gb|ELN61323.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL909]
 gi|435182180|gb|ELN67212.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_69-4941]
 gi|435182531|gb|ELN67539.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13183-1]
 gi|435183030|gb|ELN68005.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 638970-15]
 gi|435189277|gb|ELN73922.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS4]
 gi|435189619|gb|ELN74243.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 17927]
 gi|435204236|gb|ELN87933.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 22-17]
 gi|435211340|gb|ELN94479.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 40-18]
 gi|435219425|gb|ELO01787.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 4-1]
 gi|435220698|gb|ELO02980.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 1-1]
 gi|435227283|gb|ELO08792.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642046 4-7]
 gi|435235423|gb|ELO16226.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648898 4-5]
 gi|435238918|gb|ELO19527.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648900 1-16]
 gi|435240567|gb|ELO20958.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648899 3-17]
 gi|435247913|gb|ELO27842.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 1-17]
 gi|435261576|gb|ELO40730.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648903 1-6]
 gi|435262896|gb|ELO41978.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648902 6-8]
 gi|435263473|gb|ELO42520.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 39-2]
 gi|435267795|gb|ELO46460.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 653049 13-19]
 gi|435276722|gb|ELO54719.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 8-1]
 gi|435277229|gb|ELO55183.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 9-7]
 gi|435283446|gb|ELO61011.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648904 3-6]
 gi|435288066|gb|ELO65157.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 42-20]
 gi|435297184|gb|ELO73479.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 33944]
 gi|435320361|gb|ELO93000.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 81-2625]
 gi|435325905|gb|ELO97749.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 62-1976]
 gi|435332633|gb|ELP03544.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 53-407]
 gi|444865473|gb|ELX90243.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE10]
 gi|444867011|gb|ELX91716.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE8a]
 gi|444871369|gb|ELX95805.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 20037]
 gi|444875449|gb|ELX99648.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 18569]
 gi|444880476|gb|ELY04551.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13-1]
 gi|444884583|gb|ELY08407.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. PT23]
          Length = 765

 Score =  195 bits (496), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 267 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 437

Query: 296 GYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAEV 332
              T L GI           YAK               +++        +   +I  A  
Sbjct: 438 QSVTVLAGIQNAVDDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 AAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|170681058|ref|YP_001742992.1| beta-glucosidase, periplasmic [Escherichia coli SMS-3-5]
 gi|415840762|ref|ZP_11522113.1| periplasmic beta-glucosidase [Escherichia coli RN587/1]
 gi|417282190|ref|ZP_12069490.1| glycosyl hydrolase family 3, C-terminal domain protein [Escherichia
           coli 3003]
 gi|170518776|gb|ACB16954.1| beta-glucosidase, periplasmic [Escherichia coli SMS-3-5]
 gi|323187856|gb|EFZ73152.1| periplasmic beta-glucosidase [Escherichia coli RN587/1]
 gi|386246519|gb|EII88249.1| glycosyl hydrolase family 3, C-terminal domain protein [Escherichia
           coli 3003]
          Length = 765

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 NRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|432372810|ref|ZP_19615852.1| periplasmic beta-glucosidase [Escherichia coli KTE11]
 gi|430896085|gb|ELC18330.1| periplasmic beta-glucosidase [Escherichia coli KTE11]
          Length = 765

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 148/458 (32%), Positives = 227/458 (49%), Gaps = 61/458 (13%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA---- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   PA    
Sbjct: 325 SKYLPGLIKSGKVTMEELDDATRHVLNVKYDMGLFN-----DPYSHLGPKE-SDPADTNA 378

Query: 241 ----HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVA 294
               H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGV 
Sbjct: 379 ESRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVV 436

Query: 295 CGYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAE 331
               T L GI           YAK  +     G+    NQ              +I  A 
Sbjct: 437 DQSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAV 496

Query: 332 VAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPV 391
             A+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P 
Sbjct: 497 QTAKQADVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP- 554

Query: 392 DVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 -LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|26248511|ref|NP_754551.1| periplasmic beta-glucosidase [Escherichia coli CFT073]
 gi|26108916|gb|AAN81119.1|AE016763_78 Periplasmic beta-glucosidase precursor [Escherichia coli CFT073]
          Length = 765

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 230/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVQAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|295132888|ref|YP_003583564.1| beta-glucosidase [Zunongwangia profunda SM-A87]
 gi|294980903|gb|ADF51368.1| beta-glucosidase [Zunongwangia profunda SM-A87]
          Length = 855

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 113/326 (34%), Positives = 174/326 (53%), Gaps = 15/326 (4%)

Query: 3   NGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRL-KVAA 61
           N     L+ W+PNVNIFRDPRWGRGQET GEDP LT +     V+GLQG   ++  K+ A
Sbjct: 115 NKRFLSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGPEDAKYKKLLA 174

Query: 62  CCKHYTAYDLDNWNGVDRYHFNA-RVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNG 120
           C KHY  +    W+   R+  N   VS++DL +TY   FK  V +  V  VMC+Y +++ 
Sbjct: 175 CAKHYAVHSGPEWS---RHELNLNNVSQRDLWETYLPAFKVLVQDANVRQVMCAYQRLDD 231

Query: 121 KPTCADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDC 180
           +P C    +L+  +  +W  +  +VSDC ++   Y + + +     AAA A+ AG D++C
Sbjct: 232 EPCCGSDRLLQQILREKWGFEHLVVSDCGAIQDFYTSHNVSSDAVHAAAKAVLAGTDVEC 291

Query: 181 GPFLAIHTEG----AVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDV 236
                 H       AV  GL++EED++ ++   +  +  LG  D +    P+  +    +
Sbjct: 292 Q--WDKHNYKLLPEAVEKGLVKEEDIDRSVKRVLIGRFELGEMDPDEIV-PYAQIPASVI 348

Query: 237 CTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACG 296
               H+QLAL+ A + + LL+N    LPLS  +   +AVIGPN+D    + GNY G    
Sbjct: 349 NNEEHRQLALKMARESMTLLQNKNNILPLSKGQDR-IAVIGPNADDEPMLWGNYNGTPVR 407

Query: 297 YTTPLQGISRYAKTIHQAGCFGVACN 322
             + L GI+  +K   ++  +  AC+
Sbjct: 408 TISILDGIT--SKIGEKSIVYDKACD 431



 Score = 40.0 bits (92), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 44/102 (43%), Gaps = 13/102 (12%)

Query: 338 DATVLVMGLDQSIEAEFI----------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMC 387
           +  + V GL   +E E +          DR  + LP  Q+  +  +  A +  V+ V   
Sbjct: 593 ETVIFVGGLSTKLEGEEMPVSYPGFKGGDRTDIALPSVQRNCLKTLKDAGK-KVIFVNNS 651

Query: 388 GGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           G  +            AIL   Y G++GG A+ADVLFG  NP
Sbjct: 652 GSAI--GLVPETTSCDAILQAWYGGESGGQAVADVLFGDYNP 691


>gi|198245676|ref|YP_002216250.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Dublin str. CT_02021853]
 gi|375119733|ref|ZP_09764900.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Dublin str. SD3246]
 gi|197940192|gb|ACH77525.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Dublin str. CT_02021853]
 gi|326624000|gb|EGE30345.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Dublin str. SD3246]
          Length = 755

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 257 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 427

Query: 296 GYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAEV 332
              T L GI           YAK               +++        +   +I  A  
Sbjct: 428 QSVTVLAGIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 AAKQADMVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|397164507|ref|ZP_10487962.1| periplasmic beta-glucosidase [Enterobacter radicincitans DSM 16656]
 gi|396093655|gb|EJI91210.1| periplasmic beta-glucosidase [Enterobacter radicincitans DSM 16656]
          Length = 755

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 144/457 (31%), Positives = 227/457 (49%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT +   + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRVSEGFGEDTFLTSEMGRTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G    VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAALDAGS-GGVMVALNSLNGTPASSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  +SD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDLLRDQWHFKGITISDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD-------VC 237
           + +  G V+ G +  E+++ A  + + V+  +G+F+      P+ +LGP++         
Sbjct: 315 SKYLPGLVKSGKVTMEELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPQDTNAE 369

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
           +  H++ A Q A + +VLLKN   TLPL   +  TVAV+GP +D     +G++  AGVA 
Sbjct: 370 SRLHRKEARQVARESMVLLKNRLETLPLK--KTDTVAVVGPLADSKRDSMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAEV 332
              T L GI           YAK               +++        +  ++I  A  
Sbjct: 428 QSVTVLTGIRNALAGKGNVLYAKGANITNDKDIVNFLNLYEPAVVVDPRSPQEMIDEAVA 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+++D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 AAKKSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIAD+LFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADILFGDYNP 581


>gi|437730260|ref|ZP_20831186.1| beta-D-glucoside glucohydrolase, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis str. 648901 16-16]
 gi|435289521|gb|ELO66481.1| beta-D-glucoside glucohydrolase, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis str. 648901 16-16]
          Length = 764

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 267 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 437

Query: 296 GYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAEV 332
              T L GI           YAK               +++        +   +I  A  
Sbjct: 438 QSVTVLAGIQNAVDDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 AAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|297736784|emb|CBI25985.3| unnamed protein product [Vitis vinifera]
          Length = 241

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 105/226 (46%), Positives = 135/226 (59%), Gaps = 13/226 (5%)

Query: 32  GEDPVLTGKYAASYVRGLQGNTGSR---------LKVAACCKHYTAYDLDNWNGVDRYHF 82
           GEDP     YA SYVRGLQ   G+          LKV++  KH+ AYDLDNW  VDR HF
Sbjct: 9   GEDPFTVSVYAVSYVRGLQDVEGTENTTDLNSRPLKVSSSGKHFAAYDLDNWLNVDRNHF 68

Query: 83  NARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILKNTIHGQWRLDG 142
           NARVS+QD+ +T+  PF+ACV EG V+ VMCS+N +NG P CADP + K TI  +W L G
Sbjct: 69  NARVSEQDMAETFLRPFEACVREGDVSGVMCSFNNINGIPPCADPRLFKGTIRDEWNLHG 128

Query: 143 YIVSDCDSVGVLYNTQHYTR-TPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREED 201
           YIVSDC S+  +   Q +   T EEA A  +KAGLDL+CG +       AV  G + + D
Sbjct: 129 YIVSDCWSIETIVEDQKFLDVTGEEAVALNLKAGLDLECGHYYNDSPASAVMAGRVGQHD 188

Query: 202 VNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQ 247
           ++ +L+    V MRLG FDG P+     +LG  D+       L+ Q
Sbjct: 189 LDQSLSNLYVVLMRLGFFDGIPA---LASLGKDDISAYLLNTLSWQ 231


>gi|386639709|ref|YP_006106507.1| periplasmic beta-glucosidase [Escherichia coli ABU 83972]
 gi|442607883|ref|ZP_21022643.1| Periplasmic beta-glucosidase [Escherichia coli Nissle 1917]
 gi|307554201|gb|ADN46976.1| periplasmic beta-glucosidase precursor [Escherichia coli ABU 83972]
 gi|441710488|emb|CCQ08620.1| Periplasmic beta-glucosidase [Escherichia coli Nissle 1917]
          Length = 765

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 230/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVQAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|433039134|ref|ZP_20226733.1| periplasmic beta-glucosidase [Escherichia coli KTE113]
 gi|431551234|gb|ELI25220.1| periplasmic beta-glucosidase [Escherichia coli KTE113]
          Length = 765

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFNA-----PYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|16761109|ref|NP_456726.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Typhi str. CT18]
 gi|29141195|ref|NP_804537.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Typhi str. Ty2]
 gi|213428232|ref|ZP_03360982.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhi str. E02-1180]
 gi|25289432|pir||AD0778 beta-glucosidase (EC 3.2.1.21) - Salmonella enterica subsp.
           enterica serovar Typhi (strain CT18)
 gi|16503407|emb|CAD02546.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhi]
 gi|29136821|gb|AAO68386.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhi str. Ty2]
          Length = 765

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 267 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 437

Query: 296 GYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAEV 332
              T L GI           YAK               +++        +   +I  A  
Sbjct: 438 QSVTVLAGIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 AAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|345300046|ref|YP_004829404.1| glycoside hydrolase family protein [Enterobacter asburiae LF7a]
 gi|345093983|gb|AEN65619.1| glycoside hydrolase family 3 domain protein [Enterobacter asburiae
           LF7a]
          Length = 765

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 150/457 (32%), Positives = 227/457 (49%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T + PE+A   A+K+G+++     + 
Sbjct: 267 AWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G V+ G +   +++ A  + + V+  +G+F+      P+ +LGP+D   V T A 
Sbjct: 325 SKYLPGLVKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A   A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEARDVARESLVLLKNRLETLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +      +    NQ              +I  A  
Sbjct: 438 QSITVLTGIKNAIGDNGKVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRSPKEMIDEAVA 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 AAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +   K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LVLVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|432732868|ref|ZP_19967701.1| periplasmic beta-glucosidase [Escherichia coli KTE45]
 gi|432759954|ref|ZP_19994448.1| periplasmic beta-glucosidase [Escherichia coli KTE46]
 gi|431276055|gb|ELF67082.1| periplasmic beta-glucosidase [Escherichia coli KTE45]
 gi|431307608|gb|ELF95898.1| periplasmic beta-glucosidase [Escherichia coli KTE46]
          Length = 755

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEIIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|410097219|ref|ZP_11292201.1| hypothetical protein HMPREF1076_01379 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409224537|gb|EKN17469.1| hypothetical protein HMPREF1076_01379 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 805

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 140/443 (31%), Positives = 216/443 (48%), Gaps = 37/443 (8%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL 71
           +SP +++ RDPRWGR  E+ GED  L G+     V G+Q N     +V +  KH+  Y +
Sbjct: 213 YSPILDLSRDPRWGRTVESYGEDSYLAGELGRQQVLGIQSN-----RVVSTPKHFAIYGI 267

Query: 72  DNWNGVDRY-HFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
               G D Y   +   S Q++ + +  PF+    E      MCS+N  NG P  A   ++
Sbjct: 268 PG-GGRDCYSRTDPHASPQEVHELHLEPFRIAFQEAGALGTMCSHNDYNGTPVSASHYLM 326

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLD----LDCGPFLAI 186
              +  QW   GY+VSD  ++       H   T EEA A  + AGL+     +       
Sbjct: 327 TELLRNQWGFKGYVVSDSWAIDKNVKFYHIVDTEEEAVASELNAGLNVRTFFEQSEVFIE 386

Query: 187 HTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLAL 246
               A++ GL+ E  ++  +   + V+  LG+FD +P  +    L  + V +  +++++L
Sbjct: 387 ALRRALQKGLVEESTLDQRVREVLYVKFWLGLFD-DPYVKD-TKLADKIVNSDKNREVSL 444

Query: 247 QAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIS 305
           +AA + IVLLKN   TLPLS TL++  +AVIGP +D   ++   Y        T LQG+ 
Sbjct: 445 RAARESIVLLKNENNTLPLSKTLKN--IAVIGPQADEVKSLTSRYGSHNPNVITGLQGLK 502

Query: 306 RY----AKTIHQAGC--------------FGVACNGNQLIGAAEVAARQADATVLVMGLD 347
                    ++  GC              F ++    + I  A   A++A+  ++ +G D
Sbjct: 503 NLLGENVNLMYAKGCNVRDKNFPQSDVMYFELSDKEKEEIDEAVEIAKKAEVAIIYVGDD 562

Query: 348 QSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILW 407
                E   R  L L GRQ+ELV R  +A+  PVVLVL  G PV +++   D  + AI+ 
Sbjct: 563 FRTIGESRSRVNLDLSGRQKELV-RAVQATGTPVVLVLFNGRPVTLNW--EDANLPAIVE 619

Query: 408 VGYPGQAGGAAIADVLFGRANPG 430
             YPG+  G A+A+VLFG  NPG
Sbjct: 620 AWYPGEFSGQAVAEVLFGDYNPG 642


>gi|423269263|ref|ZP_17248235.1| hypothetical protein HMPREF1079_01317 [Bacteroides fragilis
           CL05T00C42]
 gi|423273173|ref|ZP_17252120.1| hypothetical protein HMPREF1080_00773 [Bacteroides fragilis
           CL05T12C13]
 gi|392701685|gb|EIY94842.1| hypothetical protein HMPREF1079_01317 [Bacteroides fragilis
           CL05T00C42]
 gi|392708205|gb|EIZ01313.1| hypothetical protein HMPREF1080_00773 [Bacteroides fragilis
           CL05T12C13]
          Length = 805

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 155/451 (34%), Positives = 221/451 (49%), Gaps = 48/451 (10%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLK-VAACCKHYTAYD 70
           + P +++ RDPRW R +ET GEDP L G    + VRG QG T +  K V A  KH+ +Y 
Sbjct: 198 YGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETLNDGKSVIATLKHFASY- 256

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
              W         A + +++LE+    PF+  V  G + SVM SYN+++G P      +L
Sbjct: 257 --GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLL 313

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP-FLAIHTE 189
            + +  +W+  G++VSD  +VG L           EAA  A+ AG+D D G    A    
Sbjct: 314 TDILKDRWQFKGFVVSDLYAVGGL-REHGVAGNDYEAAIKAVNAGVDSDLGTNVYAEQLV 372

Query: 190 GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF--GNLGPRDVCTPAHQQLALQ 247
            AV+ G +    ++ A+   ++++ ++G+FD      PF       + V +  H  LA +
Sbjct: 373 AAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPFVDEKQAVQLVASSEHTGLARE 427

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT--TPLQGI- 304
            A Q IVLLKN  + LPL      T+AVIGPN+D    M+G+Y       T  T L GI 
Sbjct: 428 VARQSIVLLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIR 486

Query: 305 ---SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG----LDQSIEAEFI-- 355
              S+  + ++  GC     +      A E  AR ADA V+VMG     D S E E    
Sbjct: 487 QKVSKETRVLYAKGCAVRDSSRTGFKDAIET-ARNADAVVMVMGGSSARDFSSEYEETGA 545

Query: 356 -----------------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKN 398
                            DRA L L GRQ EL+  +++  + PVVLVL+ G P+ +  A  
Sbjct: 546 AKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQ 604

Query: 399 DPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +    AI+   YPG  GG A+ADVLFG  NP
Sbjct: 605 EAE--AIVDAWYPGMQGGNAVADVLFGDYNP 633


>gi|419958177|ref|ZP_14474242.1| beta-D-glucoside glucohydrolase [Enterobacter cloacae subsp.
           cloacae GS1]
 gi|388606860|gb|EIM36065.1| beta-D-glucoside glucohydrolase [Enterobacter cloacae subsp.
           cloacae GS1]
          Length = 765

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 149/458 (32%), Positives = 228/458 (49%), Gaps = 61/458 (13%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T + PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA---- 240
           + +  G V+ G +   +++ A  + + V+  +G+F+      P+ +LGP+D   PA    
Sbjct: 325 SKYLPGLVKSGKVTMAELDDAARHVLNVKYDMGLFN-----DPYSHLGPKD-SDPADTNA 378

Query: 241 ----HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVA 294
               H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA
Sbjct: 379 ESRLHRKEAREVARESLVLLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVA 436

Query: 295 CGYTTPLQGISR---------YAKTIHQAGCFGVACNGNQLIGAAEV------------- 332
               T L GI           YAK  +      +    NQ   A +V             
Sbjct: 437 DQSVTVLTGIKSAVGDNAKVVYAKGANVTDDKDIVTFLNQYEEAVKVDARTPKEMLDEAV 496

Query: 333 -AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPV 391
            AA+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P 
Sbjct: 497 NAAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP- 554

Query: 392 DVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 -LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|398386388|ref|ZP_10544390.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
 gi|397718419|gb|EJK79008.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
          Length = 792

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 157/453 (34%), Positives = 219/453 (48%), Gaps = 56/453 (12%)

Query: 13  SPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGS--RLKVAACCKHYTAYD 70
           SP V+I RDPRWGR +ET GEDP L G+   + V GLQG +      KV A  KH T + 
Sbjct: 199 SPVVDIARDPRWGRIEETFGEDPYLCGEMGVAAVLGLQGESKQIGPDKVMATLKHMTGH- 257

Query: 71  LDNWNGVDRYHFN---ARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADP 127
                G  +   N   A +S+++L + +  PF+  V    +A+VM SYN+++G P+  + 
Sbjct: 258 -----GQPQAGENIAPAPISERELRENFFPPFRQVVKRTGIAAVMPSYNEIDGVPSHQNK 312

Query: 128 DILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIH 187
            +L + + G+W  DG +VSD  +V  L    H     E  A  A++AG+D +    LA  
Sbjct: 313 WLLGDILRGEWHFDGAVVSDYGAVPELDTIHHVQPDLEATARAALRAGVDCELPDGLAYR 372

Query: 188 T-EGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPR---DVCT--PAH 241
           T    VR G +  E VNLA    +T++ R G+F+         N  PR   D  T     
Sbjct: 373 TLVEQVRAGKVPLEAVNLACTRMLTLKFRAGLFE---------NPWPRADYDALTGNAEA 423

Query: 242 QQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPL 301
           + LAL+AAH+ IVLLKN   TLPL    H  VAV+GPN+   +  +G Y+ +     + L
Sbjct: 424 RALALKAAHKSIVLLKNDG-TLPLRPGAHRKVAVVGPNA--AIARLGGYSSIPRQAVSLL 480

Query: 302 QGIS----RYAKTIHQAGCF--------------GVACNGNQLIGAAEVAARQADATVLV 343
            G+       A  +H  G F                     QLI  A   A+ AD  +L 
Sbjct: 481 DGVKAKLGNRADIVHAQGVFITQSEDRSVDEVFLADPAKNRQLIAEAVEVAKTADIILLA 540

Query: 344 MGLDQSIEAEFI------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAK 397
           +G  +    E        DR  L L G Q EL + + KA+  PVV+  + G P   S+  
Sbjct: 541 IGDTEQTSREGFAKNHLGDRTSLDLVGEQNELFAAM-KATGKPVVVCAINGRP--PSYPA 597

Query: 398 NDPRIGAILWVGYPGQAGGAAIADVLFGRANPG 430
                 A+L   YPGQ GG A+AD+LFG  NPG
Sbjct: 598 VVDGANALLECWYPGQEGGTAMADILFGDVNPG 630


>gi|336408356|ref|ZP_08588849.1| hypothetical protein HMPREF1018_00864 [Bacteroides sp. 2_1_56FAA]
 gi|335937834|gb|EGM99730.1| hypothetical protein HMPREF1018_00864 [Bacteroides sp. 2_1_56FAA]
          Length = 805

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 155/451 (34%), Positives = 221/451 (49%), Gaps = 48/451 (10%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLK-VAACCKHYTAYD 70
           + P +++ RDPRW R +ET GEDP L G    + VRG QG T +  K V A  KH+ +Y 
Sbjct: 198 YGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETLNDGKSVIATLKHFASY- 256

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
              W         A + +++LE+    PF+  V  G + SVM SYN+++G P      +L
Sbjct: 257 --GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLL 313

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP-FLAIHTE 189
            + +  +W+  G++VSD  +VG L           EAA  A+ AG+D D G    A    
Sbjct: 314 TDILKDRWQFKGFVVSDLYAVGGL-REHGVAGNDYEAAIKAVNAGVDSDLGTNVYAEQLV 372

Query: 190 GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF--GNLGPRDVCTPAHQQLALQ 247
            AV+ G +    ++ A+   ++++ ++G+FD      PF       + V +  H  LA +
Sbjct: 373 AAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPFVDEKQAVQLVASSEHTGLARE 427

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT--TPLQGI- 304
            A Q IVLLKN  + LPL      T+AVIGPN+D    M+G+Y       T  T L GI 
Sbjct: 428 VARQSIVLLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIR 486

Query: 305 ---SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG----LDQSIEAEFI-- 355
              S+  + ++  GC     +      A E  AR ADA V+VMG     D S E E    
Sbjct: 487 QKVSKETRVLYAKGCAVRDSSRTGFKDAIET-ARNADAVVMVMGGSSARDFSSEYEETGA 545

Query: 356 -----------------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKN 398
                            DRA L L GRQ EL+  +++  + PVVLVL+ G P+ +  A  
Sbjct: 546 AKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQ 604

Query: 399 DPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +    AI+   YPG  GG A+ADVLFG  NP
Sbjct: 605 EAE--AIVDAWYPGMQGGNAVADVLFGDYNP 633


>gi|424661946|ref|ZP_18098983.1| hypothetical protein HMPREF1205_02332 [Bacteroides fragilis HMW
           616]
 gi|404578257|gb|EKA82992.1| hypothetical protein HMPREF1205_02332 [Bacteroides fragilis HMW
           616]
          Length = 814

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 159/453 (35%), Positives = 223/453 (49%), Gaps = 52/453 (11%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           + P +++ RDPRW R +ET GEDP L G   A+ VRG QG+T   R  V A  KH+ +Y 
Sbjct: 207 YGPVLDLARDPRWSRVEETYGEDPYLNGVMGAALVRGFQGDTLRGRKSVIATLKHFASY- 265

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
              W         A + +++LE+    PF+  V  G + SVM SYN+++G P      +L
Sbjct: 266 --GWTEGGHNGGTAHLGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLL 322

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP-FLAIHTE 189
            + +  +W   G++VSD  ++G L        +  EAA  A+ AG+D D G    A    
Sbjct: 323 TDILKDRWLFKGFVVSDLYAIGGL-REHGVAGSDYEAAVKAVNAGVDSDLGTNVYAEQLV 381

Query: 190 GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF--GNLGPRDVCTPAHQQLALQ 247
            AVR G +  E V+ A+   ++++  +G+FD      PF       + V +P H  LA +
Sbjct: 382 AAVRKGDVAMETVDKAVRRILSLKFHMGLFDA-----PFVDDKRPAQLVASPEHIGLARE 436

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC--GYTTPLQGI- 304
            A Q IVLLKN  + LPL      T+AVIGPN+D    M+G+Y          T L+GI 
Sbjct: 437 VARQSIVLLKNEDKLLPLKK-DIRTLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIR 495

Query: 305 ---SRYAKTIHQAGCFGVACNGNQLIGAAEV--AARQADATVLVMG----LDQSIE---- 351
              S+  + ++  GC   A   +   G A+   AAR AD  V+V+G     D S E    
Sbjct: 496 QKVSKDTRVLYAKGC---AVRDSSRTGFADAIEAARSADVVVMVVGGSSARDFSSEYEET 552

Query: 352 ---------------AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFA 396
                           E  DRA L L GRQ EL+  V K  + P+VLVL+ G P+ +   
Sbjct: 553 GAAKVSANRVSDMESGEGYDRATLHLMGRQLELLEEVRKLGK-PMVLVLIKGRPLLMEGV 611

Query: 397 KNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
             +    AIL   YPG  GG A+ADVLFG  NP
Sbjct: 612 IQEA--DAILDAWYPGMQGGNAVADVLFGDYNP 642


>gi|386619749|ref|YP_006139329.1| Periplasmic beta-glucosidase [Escherichia coli NA114]
 gi|387830068|ref|YP_003350005.1| beta-D-glucoside glucohydrolase [Escherichia coli SE15]
 gi|425300971|ref|ZP_18690869.1| beta-D-glucoside glucohydrolase [Escherichia coli 07798]
 gi|432441630|ref|ZP_19683970.1| periplasmic beta-glucosidase [Escherichia coli KTE189]
 gi|433014399|ref|ZP_20202747.1| periplasmic beta-glucosidase [Escherichia coli KTE104]
 gi|433024017|ref|ZP_20212005.1| periplasmic beta-glucosidase [Escherichia coli KTE106]
 gi|433323609|ref|ZP_20400940.1| beta-D-glucoside glucohydrolase [Escherichia coli J96]
 gi|281179225|dbj|BAI55555.1| beta-D-glucoside glucohydrolase [Escherichia coli SE15]
 gi|333970250|gb|AEG37055.1| Periplasmic beta-glucosidase [Escherichia coli NA114]
 gi|408213357|gb|EKI37842.1| beta-D-glucoside glucohydrolase [Escherichia coli 07798]
 gi|430966084|gb|ELC83492.1| periplasmic beta-glucosidase [Escherichia coli KTE189]
 gi|431530497|gb|ELI07176.1| periplasmic beta-glucosidase [Escherichia coli KTE104]
 gi|431535710|gb|ELI12049.1| periplasmic beta-glucosidase [Escherichia coli KTE106]
 gi|432347707|gb|ELL42164.1| beta-D-glucoside glucohydrolase [Escherichia coli J96]
          Length = 765

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|417175820|ref|ZP_12005616.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 3.2608]
 gi|417182629|ref|ZP_12009186.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 93.0624]
 gi|419870505|ref|ZP_14392598.1| beta-D-glucoside glucohydrolase [Escherichia coli O103:H2 str.
           CVM9450]
 gi|386178512|gb|EIH55991.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 3.2608]
 gi|386184482|gb|EIH67221.1| glycosyl hydrolase family 3, N-terminal domain protein [Escherichia
           coli 93.0624]
 gi|388339306|gb|EIL05691.1| beta-D-glucoside glucohydrolase [Escherichia coli O103:H2 str.
           CVM9450]
          Length = 765

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 228/457 (49%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G +  G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLINSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNSVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|293410493|ref|ZP_06654069.1| periplasmic beta-glucosidase [Escherichia coli B354]
 gi|417307337|ref|ZP_12094209.1| Periplasmic beta-glucosidase [Escherichia coli PCN033]
 gi|291470961|gb|EFF13445.1| periplasmic beta-glucosidase [Escherichia coli B354]
 gi|338771208|gb|EGP25956.1| Periplasmic beta-glucosidase [Escherichia coli PCN033]
          Length = 765

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LAMVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|417662746|ref|ZP_12312327.1| periplasmic beta-glucosidase [Escherichia coli AA86]
 gi|330911964|gb|EGH40474.1| periplasmic beta-glucosidase [Escherichia coli AA86]
          Length = 765

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG N   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKNPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              + L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVSVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|255689965|ref|ZP_05413640.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
 gi|260624572|gb|EEX47443.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 688

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 145/437 (33%), Positives = 222/437 (50%), Gaps = 31/437 (7%)

Query: 6   MAGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAACC 63
           M+G+ + +SP +++ RDPRWGR  E  GEDP   G + A+ VRG QG N  +  +VAAC 
Sbjct: 107 MSGVDWTFSPMIDVARDPRWGRVAEGYGEDPYANGVFGAASVRGYQGDNMSAENRVAACL 166

Query: 64  KHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
           KHY  Y         R +    +S+Q L DTY +P++  V  G  A++M S+N ++G P 
Sbjct: 167 KHYVGYGASE---AGRDYVYTEISQQTLWDTYLLPYEMGVKAG-AATLMSSFNDISGVPG 222

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPF 183
            A+P  +   +  +WR DG+IVSD  ++  L N Q    T +EAA  A  AGL++D    
Sbjct: 223 SANPYTMTEILKNRWRHDGFIVSDWGAIEQLKN-QGLAATKKEAARYAFTAGLEMDMMSH 281

Query: 184 -LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF--GNLGPRDVCTPA 240
               H +  V  G +    V+ A+   + ++ RLG+F+     +P+            P 
Sbjct: 282 AYDRHLQELVEEGKVSMAQVDEAVRRVLLLKFRLGLFE-----RPYTPATTEKERFFRPK 336

Query: 241 HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAG------VA 294
              +A + A + +VLLKN    LPL+  +   +AVIGP +     ++G++ G      VA
Sbjct: 337 SMDIAARLAAESMVLLKNENNVLPLTDKKK--IAVIGPMAKNGWDLLGSWRGHGKDTDVA 394

Query: 295 CGYTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEV--AARQADATVLVMGLDQSIEA 352
             Y       +  A+  +  GC      G+   G AE   AAR +D  VL +G   +   
Sbjct: 395 MLYDGLAAEFAGKAELRYALGC---NTQGDNREGFAEALEAARWSDVVVLCLGEMMTWSG 451

Query: 353 EFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPG 412
           E   R+ + LP  Q+EL   + KA + PVVLVL+ G P++++  + +P   AIL +  PG
Sbjct: 452 ENASRSSIALPQMQEELAKELKKAGK-PVVLVLVNGRPLELN--RLEPVSDAILEIWQPG 508

Query: 413 QAGGAAIADVLFGRANP 429
             G   +A +L GR NP
Sbjct: 509 VNGALPMAGILSGRINP 525


>gi|62180752|ref|YP_217169.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
           subsp. enterica serovar Choleraesuis str. SC-B67]
 gi|375115081|ref|ZP_09760251.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
           subsp. enterica serovar Choleraesuis str. SCSA50]
 gi|62128385|gb|AAX66088.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
           subsp. enterica serovar Choleraesuis str. SC-B67]
 gi|322715227|gb|EFZ06798.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
           subsp. enterica serovar Choleraesuis str. SCSA50]
          Length = 765

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 267 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +   +AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGAIAVVGPLADSQRDVMGSWSAAGVAN 437

Query: 296 GYTTPLQGISRY----AKTIHQAGC--------------FGVAC-----NGNQLIGAAEV 332
              T L GI       AK +H  G               +  A      +   +I  A  
Sbjct: 438 QSVTVLAGIQNAVGDGAKILHAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 AAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|416898057|ref|ZP_11927705.1| periplasmic beta-glucosidase [Escherichia coli STEC_7v]
 gi|417115705|ref|ZP_11966841.1| glycosyl hydrolase family 3, C-terminal domain protein [Escherichia
           coli 1.2741]
 gi|422799407|ref|ZP_16847906.1| glycosyl hydrolase 3 domain-containing protein [Escherichia coli
           M863]
 gi|323968051|gb|EGB63461.1| glycosyl hydrolase 3 domain-containing protein [Escherichia coli
           M863]
 gi|327253259|gb|EGE64913.1| periplasmic beta-glucosidase [Escherichia coli STEC_7v]
 gi|386141124|gb|EIG82276.1| glycosyl hydrolase family 3, C-terminal domain protein [Escherichia
           coli 1.2741]
          Length = 765

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KTATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|110642341|ref|YP_670071.1| periplasmic beta-glucosidase [Escherichia coli 536]
 gi|300981493|ref|ZP_07175574.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 200-1]
 gi|422374750|ref|ZP_16455025.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 60-1]
 gi|110343933|gb|ABG70170.1| periplasmic beta-glucosidase precursor [Escherichia coli 536]
 gi|300307566|gb|EFJ62086.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 200-1]
 gi|324013916|gb|EGB83135.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 60-1]
          Length = 789

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 179 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 238

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 239 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 290

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 291 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 348

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 349 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 403

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 404 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 461

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 462 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 521

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 522 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 578

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 579 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 615


>gi|445146604|ref|ZP_21387818.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Dublin str. SL1438]
 gi|445154484|ref|ZP_21391811.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Dublin str. HWS51]
 gi|444845501|gb|ELX70711.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Dublin str. SL1438]
 gi|444850729|gb|ELX75826.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Dublin str. HWS51]
          Length = 765

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+KAG+D+     + 
Sbjct: 267 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKAGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESVVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAN 437

Query: 296 GYTTPLQGISR---------YAK--------------TIHQAGCFGVACNGNQLIGAAEV 332
              T L GI           YAK               +++        +   +I  A  
Sbjct: 438 QSVTVLAGIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 AAKQADMVVAVVGESQGMAHEASSRTNITIPQSQRDLITAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|218690280|ref|YP_002398492.1| beta-D-glucoside glucohydrolase [Escherichia coli ED1a]
 gi|222156889|ref|YP_002557028.1| Periplasmic beta-glucosidase [Escherichia coli LF82]
 gi|306814752|ref|ZP_07448914.1| periplasmic beta-glucosidase precursor [Escherichia coli NC101]
 gi|387617490|ref|YP_006120512.1| periplasmic beta-glucosidase [Escherichia coli O83:H1 str. NRG
           857C]
 gi|416335319|ref|ZP_11672012.1| Periplasmic beta-glucosidase [Escherichia coli WV_060327]
 gi|419700969|ref|ZP_14228571.1| beta-D-glucoside glucohydrolase [Escherichia coli SCI-07]
 gi|419914389|ref|ZP_14432787.1| periplasmic beta-glucosidase precursor [Escherichia coli KD1]
 gi|432972337|ref|ZP_20161204.1| periplasmic beta-glucosidase [Escherichia coli KTE207]
 gi|433078310|ref|ZP_20264848.1| periplasmic beta-glucosidase [Escherichia coli KTE131]
 gi|433083092|ref|ZP_20269549.1| periplasmic beta-glucosidase [Escherichia coli KTE133]
 gi|433101680|ref|ZP_20287766.1| periplasmic beta-glucosidase [Escherichia coli KTE145]
 gi|433188917|ref|ZP_20373015.1| periplasmic beta-glucosidase [Escherichia coli KTE88]
 gi|218427844|emb|CAR08757.2| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli
           ED1a]
 gi|222033894|emb|CAP76635.1| Periplasmic beta-glucosidase [Escherichia coli LF82]
 gi|305852146|gb|EFM52598.1| periplasmic beta-glucosidase precursor [Escherichia coli NC101]
 gi|312946751|gb|ADR27578.1| periplasmic beta-glucosidase precursor [Escherichia coli O83:H1
           str. NRG 857C]
 gi|320196002|gb|EFW70626.1| Periplasmic beta-glucosidase [Escherichia coli WV_060327]
 gi|380347715|gb|EIA36001.1| beta-D-glucoside glucohydrolase [Escherichia coli SCI-07]
 gi|388386660|gb|EIL48300.1| periplasmic beta-glucosidase precursor [Escherichia coli KD1]
 gi|431481839|gb|ELH61546.1| periplasmic beta-glucosidase [Escherichia coli KTE207]
 gi|431596540|gb|ELI66492.1| periplasmic beta-glucosidase [Escherichia coli KTE131]
 gi|431601984|gb|ELI71493.1| periplasmic beta-glucosidase [Escherichia coli KTE133]
 gi|431619274|gb|ELI88198.1| periplasmic beta-glucosidase [Escherichia coli KTE145]
 gi|431705623|gb|ELJ70213.1| periplasmic beta-glucosidase [Escherichia coli KTE88]
          Length = 765

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|415829171|ref|ZP_11515554.1| periplasmic beta-glucosidase [Escherichia coli OK1357]
 gi|323184244|gb|EFZ69621.1| periplasmic beta-glucosidase [Escherichia coli OK1357]
          Length = 765

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 146/457 (31%), Positives = 227/457 (49%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD-------VC 237
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++         
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDANAE 379

Query: 238 TPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
           +  H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNSVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|432895152|ref|ZP_20106872.1| periplasmic beta-glucosidase [Escherichia coli KTE165]
 gi|431421519|gb|ELH03731.1| periplasmic beta-glucosidase [Escherichia coli KTE165]
          Length = 755

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|422837811|ref|ZP_16885784.1| periplasmic beta-glucosidase [Escherichia coli H397]
 gi|432358478|ref|ZP_19601704.1| periplasmic beta-glucosidase [Escherichia coli KTE4]
 gi|432363212|ref|ZP_19606379.1| periplasmic beta-glucosidase [Escherichia coli KTE5]
 gi|432574214|ref|ZP_19810694.1| periplasmic beta-glucosidase [Escherichia coli KTE55]
 gi|432588462|ref|ZP_19824818.1| periplasmic beta-glucosidase [Escherichia coli KTE58]
 gi|432598121|ref|ZP_19834397.1| periplasmic beta-glucosidase [Escherichia coli KTE62]
 gi|432754949|ref|ZP_19989499.1| periplasmic beta-glucosidase [Escherichia coli KTE22]
 gi|432779074|ref|ZP_20013317.1| periplasmic beta-glucosidase [Escherichia coli KTE59]
 gi|432788021|ref|ZP_20022153.1| periplasmic beta-glucosidase [Escherichia coli KTE65]
 gi|432821470|ref|ZP_20055162.1| periplasmic beta-glucosidase [Escherichia coli KTE118]
 gi|432827602|ref|ZP_20061254.1| periplasmic beta-glucosidase [Escherichia coli KTE123]
 gi|433005592|ref|ZP_20194021.1| periplasmic beta-glucosidase [Escherichia coli KTE227]
 gi|433008175|ref|ZP_20196592.1| periplasmic beta-glucosidase [Escherichia coli KTE229]
 gi|433154254|ref|ZP_20339197.1| periplasmic beta-glucosidase [Escherichia coli KTE176]
 gi|433164009|ref|ZP_20348747.1| periplasmic beta-glucosidase [Escherichia coli KTE179]
 gi|371610720|gb|EHN99248.1| periplasmic beta-glucosidase [Escherichia coli H397]
 gi|430877133|gb|ELC00589.1| periplasmic beta-glucosidase [Escherichia coli KTE4]
 gi|430886431|gb|ELC09286.1| periplasmic beta-glucosidase [Escherichia coli KTE5]
 gi|431107812|gb|ELE11976.1| periplasmic beta-glucosidase [Escherichia coli KTE55]
 gi|431120795|gb|ELE23793.1| periplasmic beta-glucosidase [Escherichia coli KTE58]
 gi|431130988|gb|ELE33071.1| periplasmic beta-glucosidase [Escherichia coli KTE62]
 gi|431302158|gb|ELF91346.1| periplasmic beta-glucosidase [Escherichia coli KTE22]
 gi|431327227|gb|ELG14572.1| periplasmic beta-glucosidase [Escherichia coli KTE59]
 gi|431337738|gb|ELG24826.1| periplasmic beta-glucosidase [Escherichia coli KTE65]
 gi|431368317|gb|ELG54785.1| periplasmic beta-glucosidase [Escherichia coli KTE118]
 gi|431372851|gb|ELG58513.1| periplasmic beta-glucosidase [Escherichia coli KTE123]
 gi|431514579|gb|ELH92420.1| periplasmic beta-glucosidase [Escherichia coli KTE227]
 gi|431523505|gb|ELI00642.1| periplasmic beta-glucosidase [Escherichia coli KTE229]
 gi|431674144|gb|ELJ40327.1| periplasmic beta-glucosidase [Escherichia coli KTE176]
 gi|431687314|gb|ELJ52865.1| periplasmic beta-glucosidase [Escherichia coli KTE179]
          Length = 755

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQLIGAAEV-------------- 332
              T L GI           YAK  +     G+    NQ   A +V              
Sbjct: 428 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMLDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|295098160|emb|CBK87250.1| beta-glucosidase [Enterobacter cloacae subsp. cloacae NCTC 9394]
          Length = 765

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 148/458 (32%), Positives = 228/458 (49%), Gaps = 61/458 (13%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T + PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA---- 240
           + +  G V+ G +   +++ A  + + V+  +G+F+      P+ +LGP+D   PA    
Sbjct: 325 SKYLPGLVKSGKVTMAELDDAARHVLNVKYDMGLFN-----DPYSHLGPKD-SDPADTNA 378

Query: 241 ----HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVA 294
               H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA
Sbjct: 379 ESRLHRKEAREVARESLVLLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVA 436

Query: 295 CGYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAE 331
               T L GI           YAK  +      +    NQ              +I  A 
Sbjct: 437 DQSVTVLTGIKSAVGDNAKVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRTPKEMIDEAV 496

Query: 332 VAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPV 391
            AA+Q+D  + V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P 
Sbjct: 497 NAAKQSDVVIAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP- 554

Query: 392 DVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 -LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|423140736|ref|ZP_17128374.1| glycosyl hydrolase family 3 protein [Salmonella enterica subsp.
           houtenae str. ATCC BAA-1581]
 gi|379053290|gb|EHY71181.1| glycosyl hydrolase family 3 protein [Salmonella enterica subsp.
           houtenae str. ATCC BAA-1581]
          Length = 765

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 150/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTWLTSIMGETMVKAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSSQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  +W   G  VSD  ++  L   +H T   PE+A   A+K+G+D+     + 
Sbjct: 267 SWLLKDVLRDEWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGVDMSMADEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSGTIAVVGPLADSQRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGN--------------QLIGAAEV 332
              T L GI           YAK  +     G+    N               +I  A  
Sbjct: 438 QSVTVLAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
           AA+QAD  V V+G  Q +  E   R  + LP  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 AAKQADVVVAVVGESQGMAHEASSRTDITLPQSQRDLITAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|432899153|ref|ZP_20109845.1| periplasmic beta-glucosidase [Escherichia coli KTE192]
 gi|433029109|ref|ZP_20216969.1| periplasmic beta-glucosidase [Escherichia coli KTE109]
 gi|431426805|gb|ELH08849.1| periplasmic beta-glucosidase [Escherichia coli KTE192]
 gi|431543325|gb|ELI18314.1| periplasmic beta-glucosidase [Escherichia coli KTE109]
          Length = 755

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTYITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|60680320|ref|YP_210464.1| beta-glucosidase [Bacteroides fragilis NCTC 9343]
 gi|60491754|emb|CAH06512.1| putative beta-glucosidase [Bacteroides fragilis NCTC 9343]
          Length = 814

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 155/451 (34%), Positives = 221/451 (49%), Gaps = 48/451 (10%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLK-VAACCKHYTAYD 70
           + P +++ RDPRW R +ET GEDP L G    + VRG QG T +  K V A  KH+ +Y 
Sbjct: 207 YGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETLNDGKSVIATLKHFASY- 265

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
              W         A + +++LE+    PF+  V  G + SVM SYN+++G P      +L
Sbjct: 266 --GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLL 322

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP-FLAIHTE 189
            + +  +W+  G++VSD  +VG L           EAA  A+ AG+D D G    A    
Sbjct: 323 TDILKDRWQFKGFVVSDLYAVGGL-REHGVAGNDYEAAIKAVNAGVDSDLGTNVYAEQLV 381

Query: 190 GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF--GNLGPRDVCTPAHQQLALQ 247
            AV+ G +    ++ A+   ++++ ++G+FD      PF       + V +  H  LA +
Sbjct: 382 AAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPFVDEKQAVQLVASSEHTGLARE 436

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT--TPLQGI- 304
            A Q IVLLKN  + LPL      T+AVIGPN+D    M+G+Y       T  T L GI 
Sbjct: 437 VARQSIVLLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIR 495

Query: 305 ---SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG----LDQSIEAEFI-- 355
              S+  + ++  GC     +      A E  AR ADA V+VMG     D S E E    
Sbjct: 496 QKVSKETRVLYAKGCTVRDSSRTGFKDAIE-TARNADAVVMVMGGSSARDFSSEYEETGA 554

Query: 356 -----------------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKN 398
                            DRA L L GRQ EL+  +++  + PVVLVL+ G P+ +  A  
Sbjct: 555 AKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQ 613

Query: 399 DPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +    AI+   YPG  GG A+ADVLFG  NP
Sbjct: 614 EAE--AIVDAWYPGMQGGNAVADVLFGDYNP 642


>gi|422382293|ref|ZP_16462454.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 57-2]
 gi|324006493|gb|EGB75712.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 57-2]
          Length = 789

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 179 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 238

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 239 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 290

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 291 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 348

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 349 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 403

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 404 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 461

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 462 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEIIDEAVQ 521

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 522 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 578

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 579 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 615


>gi|375357172|ref|YP_005109944.1| putative beta-glucosidase [Bacteroides fragilis 638R]
 gi|301161853|emb|CBW21397.1| putative beta-glucosidase [Bacteroides fragilis 638R]
          Length = 814

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 155/451 (34%), Positives = 221/451 (49%), Gaps = 48/451 (10%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLK-VAACCKHYTAYD 70
           + P +++ RDPRW R +ET GEDP L G    + VRG QG T +  K V A  KH+ +Y 
Sbjct: 207 YGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETLNDGKSVIATLKHFASY- 265

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
              W         A + +++LE+    PF+  V  G + SVM SYN+++G P      +L
Sbjct: 266 --GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLL 322

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP-FLAIHTE 189
            + +  +W+  G++VSD  +VG L           EAA  A+ AG+D D G    A    
Sbjct: 323 TDILKDRWQFKGFVVSDLYAVGGL-REHGVAGNDYEAAIKAVNAGVDSDLGTNVYAEQLV 381

Query: 190 GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF--GNLGPRDVCTPAHQQLALQ 247
            AV+ G +    ++ A+   ++++ ++G+FD      PF       + V +  H  LA +
Sbjct: 382 AAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPFVDEKQAVQLVASSEHTGLARE 436

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT--TPLQGI- 304
            A Q IVLLKN  + LPL      T+AVIGPN+D    M+G+Y       T  T L GI 
Sbjct: 437 VARQSIVLLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIR 495

Query: 305 ---SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG----LDQSIEAEFI-- 355
              S+  + ++  GC     +      A E  AR ADA V+VMG     D S E E    
Sbjct: 496 QKVSKETRVLYAKGCAVRDSSRTGFKDAIET-ARNADAVVMVMGGSSARDFSSEYEETGA 554

Query: 356 -----------------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKN 398
                            DRA L L GRQ EL+  +++  + PVVLVL+ G P+ +  A  
Sbjct: 555 AKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQ 613

Query: 399 DPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +    AI+   YPG  GG A+ADVLFG  NP
Sbjct: 614 EAE--AIVDAWYPGMQGGNAVADVLFGDYNP 642


>gi|423248809|ref|ZP_17229825.1| hypothetical protein HMPREF1066_00835 [Bacteroides fragilis
           CL03T00C08]
 gi|423253758|ref|ZP_17234689.1| hypothetical protein HMPREF1067_01333 [Bacteroides fragilis
           CL03T12C07]
 gi|392655387|gb|EIY49030.1| hypothetical protein HMPREF1067_01333 [Bacteroides fragilis
           CL03T12C07]
 gi|392657750|gb|EIY51381.1| hypothetical protein HMPREF1066_00835 [Bacteroides fragilis
           CL03T00C08]
          Length = 805

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 155/451 (34%), Positives = 221/451 (49%), Gaps = 48/451 (10%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLK-VAACCKHYTAYD 70
           + P +++ RDPRW R +ET GEDP L G    + VRG QG T +  K V A  KH+ +Y 
Sbjct: 198 YGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETLNDGKSVIATLKHFASY- 256

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
              W         A + +++LE+    PF+  V  G + SVM SYN+++G P      +L
Sbjct: 257 --GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLL 313

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP-FLAIHTE 189
            + +  +W+  G++VSD  +VG L           EAA  A+ AG+D D G    A    
Sbjct: 314 TDILKDRWQFKGFVVSDLYAVGGL-REHGVAGNDYEAAIKAVNAGVDSDLGTNVYAEQLV 372

Query: 190 GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF--GNLGPRDVCTPAHQQLALQ 247
            AV+ G +    ++ A+   ++++ ++G+FD      PF       + V +  H  LA +
Sbjct: 373 AAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPFVDEKQAVQLVASSEHTGLARE 427

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYT--TPLQGI- 304
            A Q IVLLKN  + LPL      T+AVIGPN+D    M+G+Y       T  T L GI 
Sbjct: 428 VARQSIVLLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIR 486

Query: 305 ---SRYAKTIHQAGCFGVACNGNQLIGAAEVAARQADATVLVMG----LDQSIEAEFI-- 355
              S+  + ++  GC     +      A E  AR ADA V+VMG     D S E E    
Sbjct: 487 QKVSKETRVLYAKGCAVRDSSRTGFKDAIET-ARNADAVVMVMGGSSARDFSSEYEETGA 545

Query: 356 -----------------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKN 398
                            DRA L L GRQ EL+  +++  + PVVLVL+ G P+ +  A  
Sbjct: 546 AKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQ 604

Query: 399 DPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +    AI+   YPG  GG A+ADVLFG  NP
Sbjct: 605 EAE--AIVDAWYPGMQGGNAVADVLFGDYNP 633


>gi|379996382|gb|AFD23677.1| glucosidase [uncultured organism]
          Length = 765

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 149/458 (32%), Positives = 227/458 (49%), Gaps = 61/458 (13%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGRG E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRGSEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T + PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA---- 240
           + +  G V+ G +   +++ A  + + V+  +G+F+      P+ +LGP+D   PA    
Sbjct: 325 SKYLPGLVKSGKVTMAELDDAARHVLNVKYDMGLFN-----DPYSHLGPKD-SDPADTNA 378

Query: 241 ----HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVA 294
               H++ A   A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA
Sbjct: 379 ESRLHRKEARDVARESLVLLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVA 436

Query: 295 CGYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAE 331
               T L GI           YAK  +      +    NQ              +I  A 
Sbjct: 437 DQSVTVLTGIKSAVGDNAKVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRTPKEMIDEAV 496

Query: 332 VAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPV 391
             A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P 
Sbjct: 497 NTAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP- 554

Query: 392 DVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 -LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|324497317|gb|ADY39467.1| putative periplasmic beta-glucosidase precursor [bacterium
           enrichment culture clone P69-9E]
          Length = 765

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 149/458 (32%), Positives = 227/458 (49%), Gaps = 61/458 (13%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGRG E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRGSEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T + PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA---- 240
           + +  G V+ G +   +++ A  + + V+  +G+F+      P+ +LGP+D   PA    
Sbjct: 325 SKYLPGLVKSGKVTMAELDDAARHVLNVKYDMGLFN-----DPYSHLGPKD-SDPADTNA 378

Query: 241 ----HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVA 294
               H++ A   A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA
Sbjct: 379 ESRLHRKEARDVARESLVLLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVA 436

Query: 295 CGYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAE 331
               T L GI           YAK  +      +    NQ              +I  A 
Sbjct: 437 DQSVTVLTGIKSAVGDNAKVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRTPKEMIDEAV 496

Query: 332 VAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPV 391
             A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P 
Sbjct: 497 NTAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP- 554

Query: 392 DVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 -LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|313145353|ref|ZP_07807546.1| periplasmic beta-glucosidase [Bacteroides fragilis 3_1_12]
 gi|313134120|gb|EFR51480.1| periplasmic beta-glucosidase [Bacteroides fragilis 3_1_12]
          Length = 802

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 159/453 (35%), Positives = 223/453 (49%), Gaps = 52/453 (11%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           + P +++ RDPRW R +ET GEDP L G   A+ VRG QG+T   R  V A  KH+ +Y 
Sbjct: 195 YGPVLDLARDPRWSRVEETYGEDPYLNGVMGAALVRGFQGDTLRGRKSVIATLKHFASY- 253

Query: 71  LDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDIL 130
              W         A + +++LE+    PF+  V  G + SVM SYN+++G P      +L
Sbjct: 254 --GWTEGGHNGGTAHLGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLL 310

Query: 131 KNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGP-FLAIHTE 189
            + +  +W+  G++VSD  ++G L        +  EAA  A+ AG+D D G    A    
Sbjct: 311 TDILKDRWQFKGFVVSDLYAIGGL-REHGVAGSDYEAAVKAVNAGVDSDLGTNVYAEQLV 369

Query: 190 GAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF--GNLGPRDVCTPAHQQLALQ 247
            AVR G +  E V+ A+   + ++  +G+FD      PF       + V +P H  LA +
Sbjct: 370 AAVRKGDVAMETVDKAVRRILFLKFHMGLFDA-----PFVDDKRPAQLVASPEHIGLARE 424

Query: 248 AAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVAC--GYTTPLQGI- 304
            A Q IVLLKN  + LPL      T+AVIGPN+D    M+G+Y          T L+GI 
Sbjct: 425 VARQSIVLLKNEDKLLPLKK-DIRTLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIR 483

Query: 305 ---SRYAKTIHQAGCFGVACNGNQLIGAAEV--AARQADATVLVMG----LDQSIEAEFI 355
              S+  + ++  GC   A   +   G A+   AAR AD  V+V+G     D S E E  
Sbjct: 484 QKVSKDTRVLYAKGC---AVRDSSRTGFADAIEAARSADVVVMVVGGSSARDFSSEYEET 540

Query: 356 -------------------DRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFA 396
                              DRA L L GRQ EL+  V K  + P+VLVL+ G P+ +   
Sbjct: 541 GAAKVSANRVSDMESGEGYDRATLHLMGRQLELLEEVRKLGK-PMVLVLIKGRPLLMEGV 599

Query: 397 KNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
             +    AIL   YPG  GG A+ADVLFG  NP
Sbjct: 600 IQEA--DAILDAWYPGMQGGNAVADVLFGDYNP 630


>gi|433120732|ref|ZP_20306404.1| periplasmic beta-glucosidase [Escherichia coli KTE157]
 gi|431642333|gb|ELJ10056.1| periplasmic beta-glucosidase [Escherichia coli KTE157]
          Length = 755

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSVTIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|255532174|ref|YP_003092546.1| glycoside hydrolase family protein [Pedobacter heparinus DSM 2366]
 gi|255345158|gb|ACU04484.1| glycoside hydrolase family 3 domain protein [Pedobacter heparinus
           DSM 2366]
          Length = 799

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 141/444 (31%), Positives = 212/444 (47%), Gaps = 39/444 (8%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLKVAACCKHYTAYDL 71
           ++P +++ RD RWGR +E  GEDP L  +       G+Q N     ++A+  KH+  Y  
Sbjct: 205 YAPILDVARDQRWGRLEEVYGEDPYLVARLGVEMTLGMQENN----QIASTAKHFAVYSA 260

Query: 72  DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILK 131
           +          + +VS +++ED    PFK  + E  +  VM SYN  NG P       L 
Sbjct: 261 NKGAREGLARTDPQVSPREVEDIMLYPFKKVIQEAGIMGVMSSYNDYNGIPITGSEYWLT 320

Query: 132 NTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCG----PFLAIH 187
             +   +   GY+VSD D++  LYN  H     +EA   A  AGL++         + I+
Sbjct: 321 QRLRKDFGFGGYVVSDSDALEYLYNKHHVAANLKEAVFQAFMAGLNVRTTFRPPDSIIIY 380

Query: 188 TEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPF---GNLGPRDVCTPAHQQL 244
               V  G +  E +N  +   + V+ +LG+FD     QP+        + V + AHQ +
Sbjct: 381 ARQLVNEGRIPIETINSRVKDVLRVKFKLGLFD-----QPYVKDAAASEKLVNSIAHQAV 435

Query: 245 ALQAAHQGIVLLKNSARTLPLS-TLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQG 303
           ALQA+ + IVLLKN+ + LPLS +L+   +AVIGPN+        +Y  +    T  L+G
Sbjct: 436 ALQASKESIVLLKNNNQILPLSRSLKK--IAVIGPNAADNDYAHTHYGPLQSKSTNILEG 493

Query: 304 ISRYA---KTIHQAGCFGVACNGNQ--------------LIGAAEVAARQADATVLVMGL 346
           I       K  +  GC  V  N  +              LI  A   A +AD  ++V+G 
Sbjct: 494 IRNKIGADKVWYAKGCELVDKNWPESEIFPEDPDATAIALIEDAVNTAMKADVAIVVLGG 553

Query: 347 DQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAIL 406
           +     E   R  L LPG Q  L+  + K  + PVV V++   P+ +++   D  I  I+
Sbjct: 554 NTKTAGENKSRTTLELPGFQLNLIKAIQKTGK-PVVAVMIGTQPMGINWI--DKYIDGIV 610

Query: 407 WVGYPGQAGGAAIADVLFGRANPG 430
           + GYPG  GG A+ADVLFG  NPG
Sbjct: 611 YAGYPGVKGGIAVADVLFGDYNPG 634


>gi|261340566|ref|ZP_05968424.1| periplasmic beta-glucosidase [Enterobacter cancerogenus ATCC 35316]
 gi|288316974|gb|EFC55912.1| periplasmic beta-glucosidase [Enterobacter cancerogenus ATCC 35316]
          Length = 765

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 149/458 (32%), Positives = 227/458 (49%), Gaps = 61/458 (13%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTATMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 AWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPA---- 240
           + +  G V+ G +   +++ A  + + V+  +G+F+      P+ +LGP+D   PA    
Sbjct: 325 SKYLPGLVKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKD-SDPADTNA 378

Query: 241 ----HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVA 294
               H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA
Sbjct: 379 ESRLHRKEAREVARESLVLLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVA 436

Query: 295 CGYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAE 331
               T L GI           YAK  +      +    NQ              +I  A 
Sbjct: 437 DQSVTVLTGIKSAVGDNAKVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRTPKEMIDEAV 496

Query: 332 VAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPV 391
            AA+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P 
Sbjct: 497 NAAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP- 554

Query: 392 DVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 -LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|432651688|ref|ZP_19887442.1| periplasmic beta-glucosidase [Escherichia coli KTE87]
 gi|431190135|gb|ELE89535.1| periplasmic beta-glucosidase [Escherichia coli KTE87]
          Length = 755

 Score =  194 bits (493), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 230/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V+ +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVQVMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 428 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|433169114|ref|ZP_20353742.1| periplasmic beta-glucosidase [Escherichia coli KTE180]
 gi|431687966|gb|ELJ53507.1| periplasmic beta-glucosidase [Escherichia coli KTE180]
          Length = 755

 Score =  194 bits (493), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 145 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 204

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 205 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 256

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 257 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 314

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 315 SKYLLGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 369

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 370 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 427

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQLIGAAEV-------------- 332
              T L GI           YAK  +     G+    NQ   A +V              
Sbjct: 428 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMLDEAVQ 487

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 488 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 544

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 545 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 581


>gi|365971332|ref|YP_004952893.1| beta-glucosidase [Enterobacter cloacae EcWSU1]
 gi|365750245|gb|AEW74472.1| Periplasmic beta-glucosidase [Enterobacter cloacae EcWSU1]
          Length = 765

 Score =  194 bits (493), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 150/457 (32%), Positives = 228/457 (49%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGRG E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRGSEGFGEDTYLTATMGKTMVEAMQGKSPADRYAVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T + PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTASDPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G V+ G +   +++ A  + + V+  +G+F+      P+ +LGP+D   V T A 
Sbjct: 325 SKYLPGLVKSGKVTMAELDDATRHVLNVKYDMGLFN-----DPYSHLGPKDSDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A   A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEARDVARESLVLLKNRLDTLPLK--KSGTIAVVGPLADSKRDVMGSWSAAGVAE 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +      +    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENAKVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRSPKEMIDEAVN 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|117624335|ref|YP_853248.1| beta-D-glucoside glucohydrolase-like protein [Escherichia coli APEC
           O1]
 gi|237704595|ref|ZP_04535076.1| periplasmic beta-glucosidase [Escherichia sp. 3_2_53FAA]
 gi|422360434|ref|ZP_16441068.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 110-3]
 gi|115513459|gb|ABJ01534.1| beta-D-glucoside glucohydrolase-like protein [Escherichia coli APEC
           O1]
 gi|226900961|gb|EEH87220.1| periplasmic beta-glucosidase [Escherichia sp. 3_2_53FAA]
 gi|315285796|gb|EFU45236.1| glycosyl hydrolase family 3 protein [Escherichia coli MS 110-3]
          Length = 789

 Score =  194 bits (493), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 179 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 238

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 239 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 290

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 291 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 348

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 349 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 403

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 404 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 461

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQLIGAAEV-------------- 332
              T L GI           YAK  +     G+    NQ   A +V              
Sbjct: 462 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMLDEAVQ 521

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 522 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 578

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 579 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 615


>gi|354582345|ref|ZP_09001247.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
 gi|353199744|gb|EHB65206.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
          Length = 765

 Score =  194 bits (493), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 148/459 (32%), Positives = 220/459 (47%), Gaps = 52/459 (11%)

Query: 8   GLTYWSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHY 66
           G   +SP +++ RDPRWGR +E  GEDP L G++AA+ V GLQG +      VAA  KH+
Sbjct: 157 GAVTYSPVLDVVRDPRWGRTEECFGEDPYLIGEFAAASVEGLQGESLDGEASVAATLKHF 216

Query: 67  TAY-DLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 125
             Y   +        H   R    +L +    PFK  V E   AS+M +YN+++G P   
Sbjct: 217 VGYGSSEGGRNAGPVHMGTR----ELMEVDMYPFKKAV-EAGAASIMPAYNEIDGVPCTV 271

Query: 126 DPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLD-CGPFL 184
           + ++L   +  +W  DG +++DC ++ +L           +AA  AI AG+D++  G   
Sbjct: 272 NEELLDGVLRKEWGFDGMVITDCGAINMLAAGHDTAEDGMDAAVSAISAGIDMEMSGEMF 331

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 244
            ++ E AV+   L    ++ A+   +T++ +LG+F+  P A P      + +    H+++
Sbjct: 332 GMYLERAVQEKRLDVSVLDEAVRRVLTLKFKLGLFEN-PYADP--ARAEQVIGCSRHREM 388

Query: 245 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVA--CGYTTPLQ 302
           A Q A +GIVLLKN   TLPLS      +AVIGPN+D     +G+Y          T L+
Sbjct: 389 ARQLAAEGIVLLKNEGSTLPLSK-EDGVIAVIGPNADQGYNQLGDYTSPQPPSRVVTVLE 447

Query: 303 GIS-----RYAKTIHQAGCFGVACNGNQLIG--AAEVAARQADATVLVMGLDQSIE---- 351
           GI         + ++  GC     NG+   G   A   A QAD  VLV+G   + +    
Sbjct: 448 GIRAKLGGDKGRVLYAPGC---RINGDSREGFELALSCAGQADTVVLVLGGSSARDFGEG 504

Query: 352 ---------------------AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGP 390
                                 E IDR  L L G Q EL   + K  +  +V+V + G P
Sbjct: 505 TIDLRTGASKVTGNDWSDMDCGEGIDRMTLQLSGVQLELAREIHKLGK-RLVVVYINGRP 563

Query: 391 VDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           +   +   D    AIL   YPGQ GG A+AD+LFG  NP
Sbjct: 564 IAEPWI--DRHADAILEAWYPGQEGGHAVADILFGDVNP 600


>gi|91211419|ref|YP_541405.1| beta-D-glucoside glucohydrolase [Escherichia coli UTI89]
 gi|218559051|ref|YP_002391964.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli S88]
 gi|386600011|ref|YP_006101517.1| beta-glucosidase, periplasmic [Escherichia coli IHE3034]
 gi|386603842|ref|YP_006110142.1| beta-D-glucoside glucohydrolase-like protein [Escherichia coli
           UM146]
 gi|417085346|ref|ZP_11952822.1| beta-D-glucoside glucohydrolase-like protein [Escherichia coli
           cloneA_i1]
 gi|419947065|ref|ZP_14463427.1| beta-D-glucoside glucohydrolase-like protein [Escherichia coli
           HM605]
 gi|422749413|ref|ZP_16803325.1| glycosyl hydrolase 3 domain-containing protein [Escherichia coli
           H252]
 gi|422755552|ref|ZP_16809376.1| glycosyl hydrolase 3 domain-containing protein [Escherichia coli
           H263]
 gi|91072993|gb|ABE07874.1| beta-D-glucoside glucohydrolase [Escherichia coli UTI89]
 gi|218365820|emb|CAR03560.1| beta-D-glucoside glucohydrolase, periplasmic [Escherichia coli S88]
 gi|294494167|gb|ADE92923.1| beta-glucosidase, periplasmic [Escherichia coli IHE3034]
 gi|307626326|gb|ADN70630.1| beta-D-glucoside glucohydrolase-like protein [Escherichia coli
           UM146]
 gi|323951928|gb|EGB47802.1| glycosyl hydrolase 3 domain-containing protein [Escherichia coli
           H252]
 gi|323956167|gb|EGB51919.1| glycosyl hydrolase 3 domain-containing protein [Escherichia coli
           H263]
 gi|355351376|gb|EHG00567.1| beta-D-glucoside glucohydrolase-like protein [Escherichia coli
           cloneA_i1]
 gi|388411502|gb|EIL71671.1| beta-D-glucoside glucohydrolase-like protein [Escherichia coli
           HM605]
          Length = 765

 Score =  194 bits (493), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQLIGAAEV-------------- 332
              T L GI           YAK  +     G+    NQ   A +V              
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMLDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNP 591


>gi|387612710|ref|YP_006115826.1| beta-glucosidase [Escherichia coli ETEC H10407]
 gi|309702446|emb|CBJ01772.1| periplasmic beta-glucosidase [Escherichia coli ETEC H10407]
          Length = 765

 Score =  194 bits (493), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +   +++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMAELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL + + G  GG AIADVLFG  NP
Sbjct: 555 LALVKEDQQADAILEIWFAGTEGGNAIADVLFGDYNP 591


>gi|433198769|ref|ZP_20382671.1| periplasmic beta-glucosidase [Escherichia coli KTE94]
 gi|431721473|gb|ELJ85467.1| periplasmic beta-glucosidase [Escherichia coli KTE94]
          Length = 765

 Score =  194 bits (493), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 148/457 (32%), Positives = 229/457 (50%), Gaps = 59/457 (12%)

Query: 12  WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNT-GSRLKVAACCKHYTAYD 70
           W+P V++ RDPRWGR  E  GED  LT     + V  +QG +   R  V    KH+ AY 
Sbjct: 155 WAPMVDVSRDPRWGRASEGFGEDTYLTSIMGKTMVEAMQGKSPADRYSVMTSVKHFAAYG 214

Query: 71  L----DNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
                  +N VD       +S Q L + Y  P+KA +  G   +VM + N +NG P  +D
Sbjct: 215 AVEGGKEYNTVD-------MSPQRLFNDYMPPYKAGLDAGS-GAVMVALNSLNGTPATSD 266

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRT-PEEAAADAIKAGLDLDCG-PFL 184
             +LK+ +  QW   G  VSD  ++  L   +H T   PE+A   A+K+G+++     + 
Sbjct: 267 SWLLKDVLRDQWGFKGITVSDHGAIKEL--IKHGTAADPEDAVRVALKSGINMSMSDEYY 324

Query: 185 AIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRD---VCTPA- 240
           + +  G ++ G +  E+++ A  + + V+  +G+F+      P+ +LGP++   V T A 
Sbjct: 325 SKYLPGLIKSGKVTMEELDDAARHVLNVKYDMGLFN-----DPYSHLGPKESDPVDTNAE 379

Query: 241 ---HQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNY--AGVAC 295
              H++ A + A + +VLLKN   TLPL   +  T+AV+GP +D    ++G++  AGVA 
Sbjct: 380 SRLHRKEAREVARESLVLLKNRLETLPLK--KSATIAVVGPLADSKRDVMGSWSAAGVAD 437

Query: 296 GYTTPLQGISR---------YAKTIHQAGCFGVACNGNQ--------------LIGAAEV 332
              T L GI           YAK  +     G+    NQ              +I  A  
Sbjct: 438 QSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQ 497

Query: 333 AARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVD 392
            A+Q+D  V V+G  Q +  E   R  + +P  Q++L++ + KA+  P+VLVLM G P  
Sbjct: 498 TAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAAL-KATGKPLVLVLMNGRP-- 554

Query: 393 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
           ++  K D +  AIL   + G  GG AIAD+LFG  NP
Sbjct: 555 LALVKEDQQADAILETWFAGTEGGNAIADILFGDYNP 591


>gi|160901716|ref|YP_001567297.1| glycoside hydrolase family 3 protein [Petrotoga mobilis SJ95]
 gi|160359360|gb|ABX30974.1| glycoside hydrolase family 3 domain protein [Petrotoga mobilis
           SJ95]
          Length = 777

 Score =  194 bits (493), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 139/463 (30%), Positives = 230/463 (49%), Gaps = 67/463 (14%)

Query: 13  SPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQGNTGSRLK--VAACCKHYTAYD 70
           SP V++ RDPRWGR +ET GEDP L  K   +YV+GLQ +    LK  + A  KH+  Y 
Sbjct: 150 SPVVDVTRDPRWGRTEETFGEDPYLIAKMGVAYVKGLQSDD---LKNGIVATLKHFVGYG 206

Query: 71  LD----NWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCAD 126
           +     NW         A + +++L++T+  PF+A + EGKV SVM +Y++++G P  A 
Sbjct: 207 VSEGGMNWAP-------AHIPERELKETFLFPFEAAIKEGKVKSVMNAYHEIDGIPCGAS 259

Query: 127 PDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAI 186
             +L+  +  +W  DG +VSD  ++  L          EEAA  A+KAG+D++   F   
Sbjct: 260 ETLLRRILREEWGFDGIVVSDYFAINSLMEYHKIALNKEEAAIKALKAGIDVELPSFDCY 319

Query: 187 HT--EGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLG--PRDVCTPAHQ 242
               + A+  G   E  ++ ++   + ++  +G+F+      P+ +L   P ++ TP  +
Sbjct: 320 KEPLKNAIENGEFSEAFIDKSVRNILRLKFEMGLFE-----NPYVDLEKVPDNLDTPEDR 374

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGV--------- 293
           +LA + A + IVLLKN          +   VAVIGPN++    + G+Y  +         
Sbjct: 375 KLAYEIAKKSIVLLKNDGIVPLKKNSKIKKVAVIGPNANSARNLTGDYTYLTHLETLKQG 434

Query: 294 ACGYTTPLQGISRYA-----KTIHQA----------------GCFGVACNGNQLIGAAEV 332
           A G T+ ++GI+        KTI+++                GC  +  +  ++I  A  
Sbjct: 435 AFG-TSAMEGITFSESELPIKTIYESLKEKLEKLNVETSYAKGC-EINDDNKEMIKEAVE 492

Query: 333 AARQADATVLVMGLDQS------IEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLM 386
            A  +D  +LV+G D+S         E  D + L+LPG Q +L+  V      PV++VL+
Sbjct: 493 LAENSDVALLVLG-DKSGLTLDCTTGESRDSSTLILPGVQLDLLKSVINTGT-PVIVVLV 550

Query: 387 CGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANP 429
            G P  + +   +  + AI     PG+ GG A+AD++ G  +P
Sbjct: 551 NGRPYSLDWVSKN--VSAIFEAWLPGEEGGNALADIILGDESP 591


>gi|383115340|ref|ZP_09936096.1| hypothetical protein BSGG_2785 [Bacteroides sp. D2]
 gi|313695250|gb|EFS32085.1| hypothetical protein BSGG_2785 [Bacteroides sp. D2]
          Length = 735

 Score =  194 bits (493), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 146/435 (33%), Positives = 226/435 (51%), Gaps = 27/435 (6%)

Query: 6   MAGLTY-WSPNVNIFRDPRWGRGQETPGEDPVLTGKYAASYVRGLQG-NTGSRLKVAACC 63
           M+G+ + +SP +++ RDPRWGR  E  GEDP   G + A+ V+G QG +  +  ++AAC 
Sbjct: 154 MSGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFGAASVKGYQGDDLSAENRMAACL 213

Query: 64  KHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPT 123
           KHY  Y         R +    +SKQ L DTY +P++  V  G  A++M S+N ++G P 
Sbjct: 214 KHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYEMGVKAG-AATLMSSFNDISGVPG 269

Query: 124 CADPDILKNTIHGQWRLDGYIVSDCDSVGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPF 183
            A+  I+   +  +W  DG+IVSD  ++  L N Q    T +EAA  A  AGL++D    
Sbjct: 270 SANSYIMTEILKKRWGHDGFIVSDWGAIEQLKN-QGLAATKKEAAWHAFTAGLEMDMMSH 328

Query: 184 -LAIHTEGAVRGGLLREEDVNLALAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQ 242
               H +  V  G +    V+ A+   + ++ RLG+F  E    P  +   R    P   
Sbjct: 329 AYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLF--ERPYTPATSEKER-FFRPQSM 385

Query: 243 QLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAG------VACG 296
            +A + A + +VLLKN  +TLPL+  +   +AVIGP +     ++G++ G      VA  
Sbjct: 386 DIAARLAAESMVLLKNENKTLPLTDKKK--IAVIGPMAKNGWDLLGSWCGHGKDTDVAML 443

Query: 297 YTTPLQGISRYAKTIHQAGCFGVACNGNQLIGAAEV--AARQADATVLVMGLDQSIEAEF 354
           Y       +  A+  + AGC   A  G+   G AE   AAR +D  VL +G   +   E 
Sbjct: 444 YNGLATEFAGKAELRYAAGC---ATKGDNKEGFAEALEAARWSDVVVLCLGEMMTWSGEN 500

Query: 355 IDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQA 414
             R+ + LP  Q+EL + + KA + P+VLVL+ G P++++  + +P   AIL +  PG  
Sbjct: 501 ASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPLELN--RLEPISDAILEIWQPGVN 557

Query: 415 GGAAIADVLFGRANP 429
           G   +A +L GR NP
Sbjct: 558 GALPMAGILSGRINP 572


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.137    0.423 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,288,233,195
Number of Sequences: 23463169
Number of extensions: 314209188
Number of successful extensions: 655769
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 5717
Number of HSP's successfully gapped in prelim test: 1732
Number of HSP's that attempted gapping in prelim test: 623508
Number of HSP's gapped (non-prelim): 11299
length of query: 431
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 286
effective length of database: 8,957,035,862
effective search space: 2561712256532
effective search space used: 2561712256532
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 78 (34.7 bits)