BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 040836
         (758 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255557375|ref|XP_002519718.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223541135|gb|EEF42691.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 802

 Score = 1056 bits (2730), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 510/759 (67%), Positives = 610/759 (80%), Gaps = 17/759 (2%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
           R++++ + ++ F +CD+ L Y  RAKDLV +MTL EKVQQ+GDLAYGVPRLG+P YEWWS
Sbjct: 55  RYDNLGLDMTTFGFCDSSLSYEVRAKDLVNQMTLKEKVQQLGDLAYGVPRLGIPKYEWWS 114

Query: 61  EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
           EALHGVS +G      PGT FD  VPGATSFPT ILTTASFNESLWK IGQ  S +ARAM
Sbjct: 115 EALHGVSDVG------PGTFFDDLVPGATSFPTTILTTASFNESLWKNIGQA-SAKARAM 167

Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
           YNLG AGLT+WSPN+NVVRDPRWGR +ETPGEDPYVVGRYA+NYVRGLQDVEG E + D 
Sbjct: 168 YNLGRAGLTYWSPNVNVVRDPRWGRTVETPGEDPYVVGRYAVNYVRGLQDVEGTENYTDL 227

Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
           ++RPLK+S+CCKHYAAYD++ W+G +R  FD+RVTEQDM ETF+ PFEMCV EGDVSSVM
Sbjct: 228 NTRPLKVSSCCKHYAAYDVEKWQGVERLTFDARVTEQDMVETFLRPFEMCVKEGDVSSVM 287

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
           CS+NRVNGIPTCADPKLLNQTIRGDW+ HGYIVSDCDSI+ +V++HKFL DT EDAVA+V
Sbjct: 288 CSFNRVNGIPTCADPKLLNQTIRGDWDLHGYIVSDCDSIEVMVDNHKFLGDTNEDAVAQV 347

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           LKAGLDLDCG YYTNFT  +V+QGK  E  ID SL++LY+VLMRLG+FDG+PQY+ LGK 
Sbjct: 348 LKAGLDLDCGGYYTNFTETSVKQGKAREEYIDRSLKYLYVVLMRLGFFDGTPQYQKLGKK 407

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
           +IC  +++ELA +AAR+GIVLLKN N  LPL+   +K LA+VGPHANAT+ MIGNY G P
Sbjct: 408 DICTKENVELAKQAAREGIVLLKN-NDTLPLSMDKVKNLAVVGPHANATRVMIGNYAGVP 466

Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
           CRY SP+DGF  YS V  Y  GC D+ C+N S++  A+ AAKNADAT+IVAGLDL++EAE
Sbjct: 467 CRYVSPIDGFSIYSNV-TYEIGC-DVPCKNESLVFPAVHAAKNADATIIVAGLDLTIEAE 524

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
           G DR DLLLPG+QT+LIN+VA AA GPV LVIM+AG VDI+FA++N KIK+ILWVGYPG+
Sbjct: 525 GLDRNDLLLPGYQTQLINQVAGAANGPVILVIMAAGGVDISFARDNEKIKAILWVGYPGQ 584

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPV 597
           EGG AIADV+FGKYNPGGRLPITWYEA++V ++P T M LRP     +PG+TYKF+DG  
Sbjct: 585 EGGHAIADVVFGKYNPGGRLPITWYEADFVEQVPMTYMQLRPDEELGYPGKTYKFYDGST 644

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           VYPFGYGLSYT F Y + S+ +S  I L+K Q CRD+ Y   T KP C AVL D + C D
Sbjct: 645 VYPFGYGLSYTTFSYNITSAKRSKHIALNKFQHCRDLRYGNETFKPSCPAVLTDHLPCND 704

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
             F  ++EVEN G  DGSEVVMVYSK P GI G++IKQVIG++RVF+ AG   KV F  N
Sbjct: 705 -DFELEVEVENTGSRDGSEVVMVYSKTPEGIVGSYIKQVIGFKRVFVQAGSVEKVNFRFN 763

Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLN 755
            CKS +I+D  A S+L SG HTI+VG+ +  VS PL +N
Sbjct: 764 VCKSFRIIDYNAYSILPSGGHTIMVGDDI--VSIPLYIN 800


>gi|449433577|ref|XP_004134574.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
 gi|449530107|ref|XP_004172038.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
          Length = 812

 Score = 1031 bits (2666), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 491/761 (64%), Positives = 597/761 (78%), Gaps = 14/761 (1%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
           R++ + +  S F +CD+ L +PERAKDL++RMTL EK  Q+G +A GV RLGLP Y WWS
Sbjct: 62  RYDKLGLDFSSFGFCDSSLSFPERAKDLIDRMTLSEKAAQLGHVASGVDRLGLPPYNWWS 121

Query: 61  EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
           EALHGVS +G      PGT FD  VPGATSFP VI T +SFNE LWK IGQ VSTEARAM
Sbjct: 122 EALHGVSNVG------PGTQFDKVVPGATSFPNVITTASSFNEDLWKTIGQAVSTEARAM 175

Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
           YNLG AGLT+WSP INV+RDPRWGR +ETPGEDP+VVG+YA NYVRGLQDVEG E   D 
Sbjct: 176 YNLGRAGLTYWSPTINVIRDPRWGRTVETPGEDPFVVGKYAKNYVRGLQDVEGSENVTDL 235

Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
           +SRPLK+S+CCKHYAAYD+DNW G +R+ FD+RVTEQDM ETF  PFEMCV EGDVSSVM
Sbjct: 236 NSRPLKVSSCCKHYAAYDVDNWLGVERYSFDARVTEQDMLETFNKPFEMCVKEGDVSSVM 295

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
           CSYNRVNGIPTCADP LL  TIRG+W  HGYIVSDCDS++ +VE   +L DT EDAVA+ 
Sbjct: 296 CSYNRVNGIPTCADPVLLKDTIRGNWGLHGYIVSDCDSVKVMVEDAHYLQDTNEDAVAQT 355

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           LKAGLDLDCG  Y N+T   V+QGK+   +ID +L  LY+VLMRLGYFDG+  +++LGK 
Sbjct: 356 LKAGLDLDCGQIYPNYTESTVRQGKVGMRNIDNALNNLYVVLMRLGYFDGNTGFESLGKP 415

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
           +IC+ +HIELA EAARQG VLLKNDN  LP +  N KTLA+VGPHANAT AM+GNY G P
Sbjct: 416 DICSDEHIELATEAARQGTVLLKNDNDTLPFDPSNYKTLAVVGPHANATSAMLGNYAGVP 475

Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
           CR  SPMDG   Y+KV  Y  GC  + C+N++ I  A++AA+ +DATVI  G+DLS+EAE
Sbjct: 476 CRMNSPMDGLSEYAKV-KYQMGCDSVACKNDTFIFGAMEAARTSDATVIFVGIDLSIEAE 534

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
             DRVDLLLPG+QT+L+ +VA  +KGPV LVI+SAG +D++FAKNN  IK+I+W GYPGE
Sbjct: 535 SLDRVDLLLPGYQTQLVQQVATVSKGPVVLVILSAGGIDVSFAKNNSNIKAIIWAGYPGE 594

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPV 597
           EGGRAIADVIFGK+NPGGRLP+TWYE +YV ++P TSMPLRPV +  +PGRTYKF+DGPV
Sbjct: 595 EGGRAIADVIFGKFNPGGRLPLTWYENDYVYQLPMTSMPLRPVKSLGYPGRTYKFYDGPV 654

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           VYPFG+GLSYT F + + S+ +S+ I L    QCRDI YT GT KP C AVL+DD+ C +
Sbjct: 655 VYPFGHGLSYTFFLHNLTSAKRSIAIDLSNRTQCRDIAYTNGTFKPECPAVLVDDLTCTE 714

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
            +  FQ+EVEN G+ DGS+V++VYS PP GI+ THIKQV+G++RVF+ AG S  V F +N
Sbjct: 715 -EIEFQMEVENTGERDGSQVLLVYSVPPGGISSTHIKQVVGFQRVFLKAGDSETVTFKLN 773

Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLN 757
           ACKSL +VD    +LL +G HTI+VG+  G VSFP++L+ N
Sbjct: 774 ACKSLGLVDFTGYNLLPAGGHTIVVGD--GEVSFPVELSFN 812


>gi|225432136|ref|XP_002274651.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
          Length = 809

 Score = 1013 bits (2618), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 484/760 (63%), Positives = 591/760 (77%), Gaps = 15/760 (1%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
           RF ++ + + DF YCD+  PY  RAKDLV+RMTL EKV Q GD A GV R+GLP Y WWS
Sbjct: 57  RFAALGLDMKDFHYCDSSSPYEVRAKDLVDRMTLSEKVMQTGDQASGVERIGLPKYNWWS 116

Query: 61  EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
           EALHGVS  GR         FD  VPGATSFPTVIL+ ASFN+SLWK +GQ VSTEARAM
Sbjct: 117 EALHGVSNFGR------CVFFDEVVPGATSFPTVILSAASFNQSLWKTLGQAVSTEARAM 170

Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
           YN GNAGLTFWSPNINVVRDPRWGR+LETPGEDP++VG YA+NYVRGLQDV G E   D 
Sbjct: 171 YNSGNAGLTFWSPNINVVRDPRWGRILETPGEDPHLVGLYAVNYVRGLQDVVGAENTTDL 230

Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
           +SRPLK+S+CCKHYAAYDLDNW+G DR HFD+RV+ QDM ETF+LPFEMCV EGDVSSVM
Sbjct: 231 NSRPLKVSSCCKHYAAYDLDNWKGADRVHFDARVSVQDMAETFVLPFEMCVKEGDVSSVM 290

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
           CSYN++NGIP+CAD +LL QTIRG+W+ HGYIVSDCDS++ +    K+L+ +  D+ A+ 
Sbjct: 291 CSYNKINGIPSCADSRLLKQTIRGEWDLHGYIVSDCDSVEVMAVDQKWLDSSFSDSAAQA 350

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           L AG++LDCG +       AV QGK  +AD+D SLR+LY++LMR+G+FDG P + +LGK+
Sbjct: 351 LNAGMNLDCGTFNNRSLTEAVNQGKANQADLDHSLRYLYVLLMRVGFFDGIPAFASLGKD 410

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
           +IC+ +HIELA EAARQGIVLLKNDN  LPL +  +K +ALVGPHANAT AMIGNY G P
Sbjct: 411 DICSAEHIELAREAARQGIVLLKNDNATLPLKS--VKNIALVGPHANATDAMIGNYAGIP 468

Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
           C Y SP+D F +  +V  Y  GCAD+ C N + I  A++AAK ADAT+I AG DLS+EAE
Sbjct: 469 CYYVSPLDAFSSMGEV-RYEKGCADVQCLNETYIFNAMEAAKRADATIIFAGTDLSIEAE 527

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
             DRVDLLLPG+QT+LIN+VAD + GPV LVIMS G VDI+FA++NPKI +ILW GYPGE
Sbjct: 528 ALDRVDLLLPGYQTQLINQVADLSTGPVVLVIMSGGGVDISFARDNPKIAAILWAGYPGE 587

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNN--FPGRTYKFFDGPV 597
           +GG AIADVI GKYNPGGRLPITWYEA+YV  +P TSM LRPV++  +PGRTYKFF+G  
Sbjct: 588 QGGNAIADVILGKYNPGGRLPITWYEADYVDMLPMTSMALRPVDSLGYPGRTYKFFNGST 647

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           VYPFGYG+SYT F Y +++S +  +I L K Q+CR + Y   T  P C AVL+DD+ CK+
Sbjct: 648 VYPFGYGMSYTNFSYSLSTSQRWTNINLRKLQRCRSMVYINDTFVPDCPAVLVDDLSCKE 707

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
               F++ V+N+G+MDGSEVV+VYS PP GIAGTHIK+V+G+ERVF+  G + KV F+MN
Sbjct: 708 -SIEFEVAVKNVGRMDGSEVVVVYSSPPLGIAGTHIKKVVGFERVFVKVGGTEKVKFSMN 766

Query: 717 ACKSLKIVDNAANSLLASGAHTILV-GEGVGGVSFPLQLN 755
            CKSL IVD+   +LL SG+HTI V G+    V+FP  +N
Sbjct: 767 VCKSLGIVDSTGYALLPSGSHTIKVGGDNTTSVAFPFHVN 806


>gi|225432134|ref|XP_002274619.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
          Length = 805

 Score =  980 bits (2533), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 471/762 (61%), Positives = 579/762 (75%), Gaps = 14/762 (1%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
           RF ++ + + DF YCD+ LPY  R KDLV+R+TL EK + + D+A GVPR+GLP Y+WWS
Sbjct: 54  RFAALGLDMKDFVYCDSSLPYDVRVKDLVDRITLEEKARNVIDVASGVPRIGLPPYKWWS 113

Query: 61  EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
           EALHGV+ +G        T FD  VPGATSFP VIL+ ASFN+SLWK +GQ VSTEARAM
Sbjct: 114 EALHGVANVGS------ATFFDEVVPGATSFPNVILSAASFNQSLWKTLGQVVSTEARAM 167

Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
           YNLG+AGLTFWSPNINV RDPRWGR+LETPGEDP  VG Y +NYVRGLQD+EG E   D 
Sbjct: 168 YNLGHAGLTFWSPNINVARDPRWGRILETPGEDPLTVGVYGVNYVRGLQDIEGTENTTDL 227

Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
           +SRPLKI++ CKH+AAYDLD W   DR HFD++V+EQDM ETF+ PFEMCV EGD SSVM
Sbjct: 228 NSRPLKIASSCKHFAAYDLDQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSSVM 287

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
           CS+N +NGIP CADP+ L   IR  WN HGYIVSDC +I TIV+  KFL+ T E+ VA  
Sbjct: 288 CSFNNINGIPPCADPRFLKGVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVALS 347

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           +KAGLDL+CG YY +    AV++G+++E D+D SL +LY+VLMR+G+FDG P   +LGK 
Sbjct: 348 MKAGLDLECGHYYNDSLATAVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIPSLASLGKK 407

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
           +ICN +HIELA EAARQGIVLLKNDN  LPL    +K LALVGPHANAT AMIGNY G P
Sbjct: 408 DICNDEHIELAREAARQGIVLLKNDNATLPLKP--VKKLALVGPHANATVAMIGNYAGIP 465

Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
           C Y SP+D F     V  Y  GCAD+ C N++ +  A +AAKNADAT+I+ G DLS+EAE
Sbjct: 466 CHYVSPLDAFSELGDV-TYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDLSIEAE 524

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
            +DR DLLLPG+QTE++N+V D + GPV LV+M  G +DI+FAKNNPKI +ILW G+PGE
Sbjct: 525 ERDREDLLLPGYQTEMVNQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWAGFPGE 584

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPV 597
           +GG AIAD++FGKYNPGGR PITWYE  YV  +P TSM LRP+ +  +PGRTYKFF+G  
Sbjct: 585 QGGNAIADIVFGKYNPGGRSPITWYENGYVGMLPMTSMALRPIESLGYPGRTYKFFNGST 644

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           VYPFGYGLSYT F Y + +  +SV I L + QQCR + Y+  + +P C+AVL+DD+ C D
Sbjct: 645 VYPFGYGLSYTNFSYSLTAPTRSVHISLTRLQQCRSMAYSSDSFQPECSAVLVDDLSC-D 703

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
             F FQ+ V+N+G MDGSEVVMVYS PP GI GTHIKQVIG+ERVF+  G + KV F+MN
Sbjct: 704 ESFEFQVAVKNVGSMDGSEVVMVYSSPPSGIVGTHIKQVIGFERVFVKVGNTEKVKFSMN 763

Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLNH 758
            CKSL +VD++   LL SG+HTI+ G+    VSFP Q+N ++
Sbjct: 764 VCKSLGLVDSSGYILLPSGSHTIMAGDNSTSVSFPFQVNYHN 805


>gi|224093292|ref|XP_002309869.1| predicted protein [Populus trichocarpa]
 gi|222852772|gb|EEE90319.1| predicted protein [Populus trichocarpa]
          Length = 694

 Score =  973 bits (2514), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 474/734 (64%), Positives = 576/734 (78%), Gaps = 46/734 (6%)

Query: 27  DLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVP 86
           DLV +MTL EKV Q+G+ AYGVPRLGL  Y+WWSEALHGVS +G      PGT FD  +P
Sbjct: 2   DLVNQMTLNEKVLQLGNKAYGVPRLGLAEYQWWSEALHGVSNVG------PGTFFDDLIP 55

Query: 87  GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
           G+TSFPTVI T A+FNESLWK IGQ VSTEARAMYNLG AGLT+WSPNINVVRDPRWGR 
Sbjct: 56  GSTSFPTVITTAAAFNESLWKVIGQAVSTEARAMYNLGRAGLTYWSPNINVVRDPRWGRA 115

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
           +ETPGEDPY+VGRYA+NYVRGLQDVEG E + D +SRPLK+S+CCKHYAAYD+DNW+G +
Sbjct: 116 IETPGEDPYLVGRYAVNYVRGLQDVEGSENYTDPNSRPLKVSSCCKHYAAYDVDNWKGVE 175

Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
           R+ FD+RV+EQDM ETF+ PFEMCV +GDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW
Sbjct: 176 RYTFDARVSEQDMVETFLRPFEMCVKDGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 235

Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKI 326
           + HGYIVSDCDS+Q +VE+HK+L              GLDLDCG YYT     AV+QGK+
Sbjct: 236 DLHGYIVSDCDSLQVMVENHKWL--------------GLDLDCGAYYTENVEAAVRQGKV 281

Query: 327 AEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDN 386
            EADID SL FLY+VLMRLG+FDG PQY + GKN++C+ ++IELA EAAR+G VLLKN+N
Sbjct: 282 READIDKSLNFLYVVLMRLGFFDGIPQYNSFGKNDVCSKENIELATEAAREGAVLLKNEN 341

Query: 387 GALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADI 446
            +LPL+   +KTLA++GPH+NAT AMIGNY G PC+  +P++G   Y+KV +Y  GC+DI
Sbjct: 342 DSLPLSIEKVKTLAVIGPHSNATSAMIGNYAGIPCQIITPIEGLSKYAKV-DYQMGCSDI 400

Query: 447 VCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG 506
            C++ S I  A+++AK ADAT+I+AG+DLS+EAE  DR DLLLPG+QT+LIN+VA  + G
Sbjct: 401 ACKDESFIFPAMESAKKADATIILAGIDLSIEAESLDRDDLLLPGYQTQLINQVASVSNG 460

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV LV+MSAG VDI+FAK+N  IKSILWVGYPGEEGG AIADVIFGKYNPGGRLP+TW+E
Sbjct: 461 PVVLVLMSAGGVDISFAKSNGDIKSILWVGYPGEEGGNAIADVIFGKYNPGGRLPLTWHE 520

Query: 567 ANYVK-IPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
           A+YV  +P TSMPLRP+++  +PGRTYKFF+G  VYPFG+GLSYTQF YK+ S+ +S+DI
Sbjct: 521 ADYVDMLPMTSMPLRPIDSLGYPGRTYKFFNGSTVYPFGHGLSYTQFTYKLTSTIRSLDI 580

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
           KLDK Q C D+ Y   + KP                     EV N G  DGSEVV+VY+K
Sbjct: 581 KLDKYQYCHDLGYKNDSFKP-------------------SFEVLNAGAKDGSEVVIVYAK 621

Query: 684 PP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           PP GI  T+IKQVIG++RVF+ AG S KV F  NA KSL++VD  A S+L SG HTI++G
Sbjct: 622 PPEGIDATYIKQVIGFKRVFVPAGGSEKVKFEFNASKSLQVVDFNAYSVLPSGGHTIMLG 681

Query: 743 EGVGGVSFPLQLNL 756
           + +  +SF +Q+  
Sbjct: 682 DDI--ISFSVQIRF 693


>gi|225432132|ref|XP_002274591.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Vitis vinifera]
          Length = 805

 Score =  971 bits (2509), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 459/762 (60%), Positives = 578/762 (75%), Gaps = 14/762 (1%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
           R+  + + +  F +CD  L Y ERAKDLV RMTL EKV Q    A GV RLGLP Y WWS
Sbjct: 53  RYALLGLDMKSFAFCDKSLSYKERAKDLVSRMTLQEKVMQSVHTASGVRRLGLPEYSWWS 112

Query: 61  EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
           EALHG+S +G      PG  FD  +PGATS PTVIL+TA+FN++LWK +G+ VSTE RAM
Sbjct: 113 EALHGISNLG------PGVFFDETIPGATSLPTVILSTAAFNQTLWKTLGRVVSTEGRAM 166

Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
           YNLG+AGLTFWSPNINVVRD RWGR  ET GEDP++VG +A+NYVRGLQDVEG E   D 
Sbjct: 167 YNLGHAGLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGTENVTDL 226

Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
           +SRPLK+S+CCKHYAAYD+D+W   DR  FD+RV+EQDM+ETF+ PFE CV EGDVSSVM
Sbjct: 227 NSRPLKVSSCCKHYAAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREGDVSSVM 286

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
           CS+N++NGIP C+DP+LL   IR +W+ HGYIVSDC  ++ IV++  +LND+K DAVA+ 
Sbjct: 287 CSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAKT 346

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           L+AGLDL+CG YYT+    +V  GK+++ ++D +L+ +Y++LMR+GYFDG P Y++LG  
Sbjct: 347 LQAGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPAYESLGLK 406

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
           +IC   HIELA EAARQGIVLLKND   LPL  G  K +ALVGPHANAT+ MIGNY G P
Sbjct: 407 DICAADHIELAREAARQGIVLLKNDYEVLPLKPG--KKIALVGPHANATEVMIGNYAGLP 464

Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
           C+Y SP++ F A   V  YA GC D  C N++    A +AAK+A+ T+I  G DLS+EAE
Sbjct: 465 CKYVSPLEAFSAIGNV-TYATGCLDASCSNDTYFSEAKEAAKSAEVTIIFVGTDLSIEAE 523

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
             DRVD LLPG QTELI +VA+ + GPV LV++S   +DI FAKNNP+I +ILWVG+PGE
Sbjct: 524 FVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGE 583

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNN--FPGRTYKFFDGPV 597
           +GG AIADV+FGKYNPGGRLP+TWYEA+YV  +P +SM LRPV+   +PGRTYKFFDG  
Sbjct: 584 QGGHAIADVVFGKYNPGGRLPVTWYEADYVDMLPMSSMSLRPVDELGYPGRTYKFFDGST 643

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           VYPFGYG+SYT+F Y +A+S  S+DI L+K Q+CR + YT     P C AVL+DD+ C D
Sbjct: 644 VYPFGYGMSYTKFSYSLATSKISIDIDLNKFQKCRTVAYTEDQKVPSCPAVLLDDMSCDD 703

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
               F++ V N+G +DGSEV+MVYS PP GI GTHIKQVIG+++VF+AAG + +V F+MN
Sbjct: 704 -TIEFEVAVTNVGMVDGSEVLMVYSIPPSGIVGTHIKQVIGFQKVFVAAGDTERVKFSMN 762

Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLNH 758
           ACKSL+IVD+   SLL SG+HTI VG+     S+ LQ+N ++
Sbjct: 763 ACKSLRIVDSTGYSLLPSGSHTIRVGDYSNSASYSLQVNYHY 804


>gi|297736787|emb|CBI25988.3| unnamed protein product [Vitis vinifera]
          Length = 774

 Score =  933 bits (2411), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 457/762 (59%), Positives = 557/762 (73%), Gaps = 45/762 (5%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
           RF ++ + + DF YCD+ LPY  R KDLV+R+TL EK + + D+A GVPR+GLP Y+WWS
Sbjct: 54  RFAALGLDMKDFVYCDSSLPYDVRVKDLVDRITLEEKARNVIDVASGVPRIGLPPYKWWS 113

Query: 61  EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
           EALHGV+ +G        T FD  VPGATSFP VIL+ ASFN+SLWK +GQ VSTEARAM
Sbjct: 114 EALHGVANVGS------ATFFDEVVPGATSFPNVILSAASFNQSLWKTLGQVVSTEARAM 167

Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
           YNLG+AGLTFWSPNINV RDPRWGR+LETPGEDP  VG Y +NYVRGLQD+EG E   D 
Sbjct: 168 YNLGHAGLTFWSPNINVARDPRWGRILETPGEDPLTVGVYGVNYVRGLQDIEGTENTTDL 227

Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
           +SRPLKI++ CKH+AAYDLD W   DR HFD++V+EQDM ETF+ PFEMCV EGD SSVM
Sbjct: 228 NSRPLKIASSCKHFAAYDLDQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSSVM 287

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
           CS+N +NGIP CADP+ L   IR  WN HGYIVSDC +I TIV+  KFL+ T E+ VA  
Sbjct: 288 CSFNNINGIPPCADPRFLKGVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVALS 347

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           +KAGLDL+CG YY +    AV++G+++E D+D SL +LY+VLMR+G+FDG P   +LGK 
Sbjct: 348 MKAGLDLECGHYYNDSLATAVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIPSLASLGKK 407

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
           +ICN +HIELA EAARQGIVLLKNDN  LPL    +K LALVGPHANAT AMIGNY G P
Sbjct: 408 DICNDEHIELAREAARQGIVLLKNDNATLPLKP--VKKLALVGPHANATVAMIGNYAGIP 465

Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
           C Y SP+D F     V  Y  GCAD+ C N++ +  A +AAKNADAT+I+ G DLS+EAE
Sbjct: 466 CHYVSPLDAFSELGDV-TYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDLSIEAE 524

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
            +DR DLLLPG+QTE++N+V D + GPV LV+M  G +DI+FAKNNPKI +ILW G+PGE
Sbjct: 525 ERDREDLLLPGYQTEMVNQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWAGFPGE 584

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPV 597
           +GG AIAD++FGKYNPGGR PITWYE  YV  +P TSM LRP+ +  +PGRTYKFF+G  
Sbjct: 585 QGGNAIADIVFGKYNPGGRSPITWYENGYVGMLPMTSMALRPIESLGYPGRTYKFFNGST 644

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           VYPFGYGLSYT F Y + +  +SV I L                                
Sbjct: 645 VYPFGYGLSYTNFSYSLTAPTRSVHISLT------------------------------- 673

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
             F FQ+ V+N+G MDGSEVVMVYS PP GI GTHIKQVIG+ERVF+  G + KV F+MN
Sbjct: 674 -SFEFQVAVKNVGSMDGSEVVMVYSSPPSGIVGTHIKQVIGFERVFVKVGNTEKVKFSMN 732

Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLNH 758
            CKSL +VD++   LL SG+HTI+ G+    VSFP Q+N ++
Sbjct: 733 VCKSLGLVDSSGYILLPSGSHTIMAGDNSTSVSFPFQVNYHN 774


>gi|297736788|emb|CBI25989.3| unnamed protein product [Vitis vinifera]
          Length = 746

 Score =  915 bits (2365), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 450/760 (59%), Positives = 549/760 (72%), Gaps = 78/760 (10%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
           RF ++ + + DF YCD+  PY  RAKDLV+RMTL EKV Q GD A GV R+GLP Y WWS
Sbjct: 57  RFAALGLDMKDFHYCDSSSPYEVRAKDLVDRMTLSEKVMQTGDQASGVERIGLPKYNWWS 116

Query: 61  EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
           EALHGVS  GR         FD  VPGATSFPTVIL+ ASFN+SLWK +GQ VSTEARAM
Sbjct: 117 EALHGVSNFGR------CVFFDEVVPGATSFPTVILSAASFNQSLWKTLGQAVSTEARAM 170

Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
           YN GNAGLTFWSPNINVVRDPRWGR+LETPGEDP++VG YA+NY                
Sbjct: 171 YNSGNAGLTFWSPNINVVRDPRWGRILETPGEDPHLVGLYAVNY---------------- 214

Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
                       HYAAYDLDNW+G DR HFD+RV+ QDM ETF+LPFEMCV EGDVSSVM
Sbjct: 215 ------------HYAAYDLDNWKGADRVHFDARVSVQDMAETFVLPFEMCVKEGDVSSVM 262

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
           CSYN++NGIP+CAD +LL QTIRG+W+ HGYIVSDCDS++ +    K+L+ +  D+ A+ 
Sbjct: 263 CSYNKINGIPSCADSRLLKQTIRGEWDLHGYIVSDCDSVEVMAVDQKWLDSSFSDSAAQA 322

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           L AG++LDCG +       AV QGK  +AD+D SLR+LY++LMR+G+FDG P + +LGK+
Sbjct: 323 LNAGMNLDCGTFNNRSLTEAVNQGKANQADLDHSLRYLYVLLMRVGFFDGIPAFASLGKD 382

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
           +IC+ +HIELA EAARQGIVLLKNDN  LPL +  +K +ALVGPHANAT AMIGNY G P
Sbjct: 383 DICSAEHIELAREAARQGIVLLKNDNATLPLKS--VKNIALVGPHANATDAMIGNYAGIP 440

Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
           C Y SP+D F +  +V  Y  GCAD+ C N + I  A++AAK ADAT+I AG DLS+EAE
Sbjct: 441 CYYVSPLDAFSSMGEV-RYEKGCADVQCLNETYIFNAMEAAKRADATIIFAGTDLSIEAE 499

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
             DRVDLLLPG+QT+LIN+VAD + GPV LVIMS G VDI+FA++NPKI +ILW GYPGE
Sbjct: 500 ALDRVDLLLPGYQTQLINQVADLSTGPVVLVIMSGGGVDISFARDNPKIAAILWAGYPGE 559

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNN--FPGRTYKFFDGPV 597
           +GG AIADVI GKYNPGGRLPITWYEA+YV  +P TSM LRPV++  +PGRTYKFF+G  
Sbjct: 560 QGGNAIADVILGKYNPGGRLPITWYEADYVDMLPMTSMALRPVDSLGYPGRTYKFFNGST 619

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           VYPFGYG+SYT F Y +++S           Q C++                        
Sbjct: 620 VYPFGYGMSYTNFSYSLSTS-----------QSCKE------------------------ 644

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
               F++ V+N+G+MDGSEVV+VYS PP GIAGTHIK+V+G+ERVF+  G + KV F+MN
Sbjct: 645 -SIEFEVAVKNVGRMDGSEVVVVYSSPPLGIAGTHIKKVVGFERVFVKVGGTEKVKFSMN 703

Query: 717 ACKSLKIVDNAANSLLASGAHTILV-GEGVGGVSFPLQLN 755
            CKSL IVD+   +LL SG+HTI V G+    V+FP  +N
Sbjct: 704 VCKSLGIVDSTGYALLPSGSHTIKVGGDNTTSVAFPFHVN 743


>gi|297736786|emb|CBI25987.3| unnamed protein product [Vitis vinifera]
          Length = 745

 Score =  888 bits (2295), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 434/762 (56%), Positives = 544/762 (71%), Gaps = 74/762 (9%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
           R+  + + +  F +CD  L Y ERAKDLV RMTL EKV Q    A GV RLGLP Y WWS
Sbjct: 53  RYALLGLDMKSFAFCDKSLSYKERAKDLVSRMTLQEKVMQSVHTASGVRRLGLPEYSWWS 112

Query: 61  EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
           EALHG+S +G      PG  FD  +PGATS PTVIL+TA+FN++LWK +G+ VSTE RAM
Sbjct: 113 EALHGISNLG------PGVFFDETIPGATSLPTVILSTAAFNQTLWKTLGRVVSTEGRAM 166

Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
           YNLG+AGLTFWSPNINVVRD RWGR  ET GEDP++VG +A+NYVRGLQDVEG E     
Sbjct: 167 YNLGHAGLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGTE----- 221

Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
                 +S+CCKHYAAYD+D+W   DR  FD+RV+EQDM+ETF+ PFE CV EGDVSSVM
Sbjct: 222 -----NVSSCCKHYAAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREGDVSSVM 276

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
           CS+N++NGIP C+DP+LL   IR +W+ HGYIVSDC  ++ IV++  +LND+K DAVA+ 
Sbjct: 277 CSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAKT 336

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           L+AGLDL+CG YYT+    +V  GK+++ ++D +L+ +Y++LMR+GYFDG P Y++LG  
Sbjct: 337 LQAGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPAYESLGLK 396

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
           +IC   HIELA EAARQGIVLLKND   LPL  G  K +ALVGPHANAT+ MIGNY G P
Sbjct: 397 DICAADHIELAREAARQGIVLLKNDYEVLPLKPG--KKIALVGPHANATEVMIGNYAGLP 454

Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
           C+Y SP++ F A   V  YA G                        T+I  G DLS+EAE
Sbjct: 455 CKYVSPLEAFSAIGNV-TYATGF-----------------------TIIFVGTDLSIEAE 490

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
             DRVD LLPG QTELI +VA+ + GPV LV++S   +DI FAKNNP+I +ILWVG+PGE
Sbjct: 491 FVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGE 550

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNN--FPGRTYKFFDGPV 597
           +GG AIADV+FGKYNPGGRLP+TWYEA+YV  +P +SM LRPV+   +PGRTYKFFDG  
Sbjct: 551 QGGHAIADVVFGKYNPGGRLPVTWYEADYVDMLPMSSMSLRPVDELGYPGRTYKFFDGST 610

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           VYPFGYG+SYT+F Y +A+S  S+DI L+K Q+CR                         
Sbjct: 611 VYPFGYGMSYTKFSYSLATSKISIDIDLNKFQKCR------------------------- 645

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
              TF++ V N+G +DGSEV+MVYS PP GI GTHIKQVIG+++VF+AAG + +V F+MN
Sbjct: 646 ---TFEVAVTNVGMVDGSEVLMVYSIPPSGIVGTHIKQVIGFQKVFVAAGDTERVKFSMN 702

Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLNH 758
           ACKSL+IVD+   SLL SG+HTI VG+     S+ LQ+N ++
Sbjct: 703 ACKSLRIVDSTGYSLLPSGSHTIRVGDYSNSASYSLQVNYHY 744


>gi|359477633|ref|XP_003632006.1| PREDICTED: LOW QUALITY PROTEIN: beta-D-xylosidase 3-like [Vitis
           vinifera]
          Length = 781

 Score =  879 bits (2272), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 446/763 (58%), Positives = 551/763 (72%), Gaps = 18/763 (2%)

Query: 1   RFESIKVKLSDFPYCDAKLP-YPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWW 59
           RF ++   + DF YC++ LP Y  R KDLV+RMTL EK   +   A GV R+GLP Y+WW
Sbjct: 21  RFAALGFDMKDFVYCNSSLPIYDVRVKDLVDRMTLEEKATNVIYKAAGVERIGLPPYQWW 80

Query: 60  SEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA 119
           SEALHGVS +    N P  T FD  VPGATSFP VIL+ ASFN+SLWK I Q VS EARA
Sbjct: 81  SEALHGVSSVS--INGP--TFFDETVPGATSFPNVILSAASFNQSLWKTIRQVVSKEARA 136

Query: 120 MYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD 179
            YNLG+AGLTFW PN+NV RDPRWGR  ET GEDP+ V  YA++YVRGLQDVEG E   D
Sbjct: 137 TYNLGHAGLTFWCPNVNVARDPRWGRTQETXGEDPFTVSVYAVSYVRGLQDVEGTENTTD 196

Query: 180 SDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
            +SRPLK+S+  KH+AAYDLDNW   DR HF++RV+EQDM ETF+ PFE CV EGDVS V
Sbjct: 197 LNSRPLKVSSSGKHFAAYDLDNWLNVDRNHFNARVSEQDMAETFLRPFEACVREGDVSGV 256

Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
           MCS+N +NGIP CADP+L   TIR +WN HGYIVSDC SI+TIVE  KFL+ T E+AVA 
Sbjct: 257 MCSFNNINGIPPCADPRLFKGTIRDEWNLHGYIVSDCWSIETIVEDQKFLDVTGEEAVAL 316

Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK 359
            LKAGLDL+CG YY +    AV  G++ + D+D SL  LY+VLMRLG+FDG P   +LGK
Sbjct: 317 NLKAGLDLECGHYYNDSPASAVMAGRVGQHDLDQSLSNLYVVLMRLGFFDGIPALASLGK 376

Query: 360 NNIC-NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
           ++IC + +HIELA EAARQGIVLLKNDN  LPL +  +K LALVGP+A+A  AM+GNY G
Sbjct: 377 DDICLSAEHIELAREAARQGIVLLKNDNATLPLKS--VKNLALVGPNADAYGAMMGNYAG 434

Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL-DLSV 477
            PCR  SP D F A   V  Y  GC D++C N++ +  A++AAK+AD T+IV G+ D+S+
Sbjct: 435 PPCRSVSPRDAFSAIGNV-TYEMGCGDVLCHNDTYVYKAVEAAKHADTTIIVVGITDVSI 493

Query: 478 EAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMS--AGAVDINFAKNNPKIKSILWV 535
             E KDRVDLLLPG+QT L+N++A A   P+ LV+     G +DI+FA++NP I+ ILW 
Sbjct: 494 GTEDKDRVDLLLPGYQTHLVNQIAKATTAPIILVVCGHCGGPIDISFARDNPGIEPILWA 553

Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKF 592
           G+PGEEGG AIADV++GKYNPGGRLP+TWYE  YV  +P TSM LR V +  +PGR YKF
Sbjct: 554 GFPGEEGGNAIADVVYGKYNPGGRLPVTWYENGYVGMLPMTSMALRSVESLGYPGRKYKF 613

Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
           F G  VYPFG GLSYT F Y + +  +S+   L K Q CR + Y++ +  P C AVL+DD
Sbjct: 614 FSGSTVYPFGCGLSYTNFSYSLTAPTRSIHTHLKKLQPCRSMAYSICSVIPQCPAVLVDD 673

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKV 711
           + C +  F F++ V+ +G MDGSEVV+VYS PP GI GTHIKQVIG+ERVF+  G   KV
Sbjct: 674 LSCNE-TFEFEVAVKTVGSMDGSEVVIVYSSPPSGIVGTHIKQVIGFERVFVKVGXVEKV 732

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILV-GEGVGGVSFPLQ 753
            F+MN CKSL IV ++ ++LL SG+  I   G+    VSFP Q
Sbjct: 733 KFSMNVCKSLGIVHSSGHTLLPSGSDIIKAGGDNTISVSFPFQ 775


>gi|326523729|dbj|BAJ93035.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 810

 Score =  831 bits (2146), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 408/773 (52%), Positives = 548/773 (70%), Gaps = 31/773 (4%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
           RF ++ ++++ F YCDA LPY +R +DLV R+TL EKV+ +GD A G  R+GLP Y WW 
Sbjct: 50  RFAALGLEMAGFRYCDASLPYADRVRDLVGRLTLEEKVRNLGDRAEGAARVGLPPYLWWG 109

Query: 61  EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
           EALHGVS  G     P GT F   VPGATSFP VI + A+FNE+LW  IG  VSTE RAM
Sbjct: 110 EALHGVSDTG-----PGGTRFGDVVPGATSFPLVINSAAAFNETLWGAIGGAVSTEIRAM 164

Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
           YNLG+A LT+WSPNINVVRDPRWGR  ETPGEDP+VVGRYA+++VR +QD++G      +
Sbjct: 165 YNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVSFVRAMQDIDGAGPGAGA 224

Query: 181 D--SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSS 238
           D  +RP+K+S+CCKHYAAYD+D W   DR  FD++V E+DM ETF  PFEMCV +GD S 
Sbjct: 225 DPFARPIKVSSCCKHYAAYDVDAWLTADRLTFDAQVEERDMIETFERPFEMCVRDGDASC 284

Query: 239 VMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVA 298
           VMCSYNR+NG+P CA+ +LL++T+RG+W  HGYIVSDCDS++ +V   K+L     +A A
Sbjct: 285 VMCSYNRINGVPACANARLLSETVRGEWQLHGYIVSDCDSVRVMVRDAKWLGYNGVEATA 344

Query: 299 RVLKAGLDLDCG-------DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS 351
             +KAGLDLDCG       D++T F + AV+QGK+ E+++D +LR LY+ LMRLG+FDG 
Sbjct: 345 AAMKAGLDLDCGMFWEGAQDFFTAFGLDAVRQGKLRESEVDNALRNLYLTLMRLGFFDGI 404

Query: 352 PQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG--PHANAT 409
           P+ ++LG N++C  +H ELAA+AARQG+VL+KND+G LPL+T  + +L+LVG   H NAT
Sbjct: 405 PELESLGANDVCTEEHKELAADAARQGMVLIKNDHGRLPLDTSKVNSLSLVGLLQHINAT 464

Query: 410 KAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVI 469
             M+G+Y G PCR  +P D   A  KV++     +  VC + +   AA    K  DAT++
Sbjct: 465 DVMLGDYRGKPCRVVTPYD---AIRKVVS---ATSMQVCDHGACSTAA--NGKTVDATIV 516

Query: 470 VAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
           +AGL++SVE EG DR DLLLP  QT  IN VA+A+  P+ LVI+SAG VD++FA+NNPKI
Sbjct: 517 IAGLNMSVEKEGNDREDLLLPWNQTNWINAVAEASPYPIILVIISAGGVDVSFAQNNPKI 576

Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FP 586
            +I+W GYPGEEGG AIADV+FGKYNPGGRLP+TWY++ Y+ KIP TSM LRPV +  +P
Sbjct: 577 GAIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKSEYISKIPMTSMALRPVADKGYP 636

Query: 587 GRTYKFFDGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKP-P 644
           GRTYKF+ GP V+YPFG+GLSY+ F Y   ++  SV +++   + C+ +    GT  P  
Sbjct: 637 GRTYKFYGGPEVLYPFGHGLSYSNFSYASDTTGASVTVRVGAWESCKQLTRKPGTTAPLA 696

Query: 645 CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG-IAGTHIKQVIGYERVFI 703
           C AV +    CK+ + +F + V N G  DG+ VVMVY+ PP  +    +KQ++ + RVF+
Sbjct: 697 CPAVNVAGHGCKE-EVSFSLTVANRGSRDGAHVVMVYTVPPAEVDDAPLKQLVAFRRVFV 755

Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
            AG + +V FT+N CK+  IV+  A +++ SG  T+LVG+     SF +++ L
Sbjct: 756 PAGAAVQVPFTLNVCKAFAIVEETAYTVVPSGVSTVLVGDDALSFSFSVKIEL 808


>gi|242052713|ref|XP_002455502.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
 gi|241927477|gb|EES00622.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
          Length = 825

 Score =  820 bits (2117), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 410/778 (52%), Positives = 536/778 (68%), Gaps = 31/778 (3%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
           RF ++ + +S F YCDA LPY ER +DLV R++L EKV+ +GD A G PR+GLP Y+WW 
Sbjct: 55  RFAALGLDMSRFRYCDASLPYAERVRDLVGRLSLEEKVRNLGDQAEGAPRVGLPPYKWWG 114

Query: 61  EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
           EALHGVS +G     P GT F   VPGATSFP VI + A+FNESLW+ IG  VSTE RAM
Sbjct: 115 EALHGVSDVG-----PGGTWFGDVVPGATSFPLVINSAAAFNESLWRAIGGVVSTEIRAM 169

Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDV---EGVEYH 177
           YNLG+A LT+WSPNINVVRDPRWGR  ETPGEDP+VVGRYA+N+VRG+QDV    G    
Sbjct: 170 YNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDVVIAAGAAAT 229

Query: 178 RDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVS 237
            D  SRP+K+S+CCKH+AAYD+D W   DR  FD++V E+DM ETF  PFEMC+ +GD S
Sbjct: 230 ADPFSRPIKVSSCCKHFAAYDVDAWFKADRLTFDAQVEERDMVETFERPFEMCIRDGDAS 289

Query: 238 SVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV 297
            VMCSYNR+NGIP CAD +LL++T+R  W  HGYIVSDCDS++ +V   K+LN T  +A 
Sbjct: 290 CVMCSYNRINGIPACADARLLSETVRSQWQLHGYIVSDCDSVRVMVRDAKWLNYTGVEAT 349

Query: 298 ARVLKAGLDLDCG-------DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
           A  +KAGLDLDCG       D++T + + AV+QGKI EAD+D +L  +Y  LMRLG+FDG
Sbjct: 350 AAAMKAGLDLDCGMFWEGARDFFTTYGVDAVRQGKIKEADVDNALGNVYTTLMRLGFFDG 409

Query: 351 SPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG--PHANA 408
            P++++LG +++C   H ELAA+AARQG+VLLKND   LPL+   I +++LVG   H NA
Sbjct: 410 MPEFESLGADDVCTRDHKELAADAARQGMVLLKNDARRLPLDPSKINSVSLVGLLEHINA 469

Query: 409 TKAMIGNYEGTPCRYTSPMDGFYAYSKVIN--YAPGCADIVCQNNSMIPAAIDAAKNADA 466
           T  M+G+Y G PCR  +P D   A  +V+N  Y   C    C     +  A   AK ADA
Sbjct: 470 TDVMLGDYRGKPCRIVTPYD---AIRQVVNATYVHACDSGACSTAEGMGRASRTAKIADA 526

Query: 467 TVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           T+++AGL++SVE E  DR DLLLP  Q+  IN VA+A+  P+ LVIMSAG VD++FA+NN
Sbjct: 527 TIVIAGLNMSVERESNDREDLLLPWNQSSWINAVAEASTTPIVLVIMSAGGVDVSFAQNN 586

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VN 583
            KI +I+W GYPGEEGG AIADV+FGKYNPGGRLP+TW++  YV +IP TSM LRP   +
Sbjct: 587 TKIGAIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWFKNEYVNQIPMTSMALRPDAAH 646

Query: 584 NFPGRTYKFFDGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG--- 639
            +PGRTYKF+ GP V+YPFG+GLSYT F Y   ++  +V I +   + C+ + Y  G   
Sbjct: 647 GYPGRTYKFYGGPAVLYPFGHGLSYTSFTYASGTTGATVTIPIGAWEHCKMLTYKSGKAP 706

Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG-THIKQVIGY 698
           +  P C A+ +   +C D   +F + V N G + G  VV VY+ PP   G    KQ++ +
Sbjct: 707 SPSPACPALNVASHRC-DEVVSFSLRVANTGGVGGDHVVPVYTAPPPEVGDAPRKQLVEF 765

Query: 699 ERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
            RVF+ AG +  V F +N CK+  IV+  A +++ SG  T++VG+    +SF + +NL
Sbjct: 766 RRVFVPAGAAVDVPFALNVCKTFAIVEETAYTVVPSGVSTVIVGDDALALSFAVTINL 823


>gi|226506870|ref|NP_001146482.1| uncharacterized protein LOC100280070 precursor [Zea mays]
 gi|219887469|gb|ACL54109.1| unknown [Zea mays]
 gi|413947917|gb|AFW80566.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 835

 Score =  818 bits (2113), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 407/776 (52%), Positives = 534/776 (68%), Gaps = 29/776 (3%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
           RF ++ + +S F YCDA LPY +R +DLV R+ L EKV+ +GD A G PR+GLP Y+WW 
Sbjct: 67  RFVALGLDMSRFRYCDASLPYADRVRDLVGRLALEEKVRNLGDQAEGAPRVGLPPYKWWG 126

Query: 61  EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
           EALHGVS +G     P GT F   VPGATSFP VI + A+FNESLW+ IG  VSTE RAM
Sbjct: 127 EALHGVSDVG-----PGGTWFGDVVPGATSFPLVINSAAAFNESLWRAIGGVVSTEIRAM 181

Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
           YNLG+A LT+WSPNINVVRDPRWGR  ETPGEDP+VVGRYA+N+VRG+QDV+   Y   +
Sbjct: 182 YNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDVDDRPYAAAA 241

Query: 181 D--SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSS 238
           D  SRP+K+S+CCKH+AAYD+D W   DR  FD++V E+DM ETF  PFEMC+ +GD S 
Sbjct: 242 DPFSRPIKVSSCCKHFAAYDVDAWFKADRLTFDAQVEERDMVETFERPFEMCIRDGDASC 301

Query: 239 VMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVA 298
           VMCSYNR+NGIP CAD +LL++T+R  W  HGYIVSDCDS++ +V   K+LN T  +A A
Sbjct: 302 VMCSYNRINGIPACADARLLSETVRSQWQLHGYIVSDCDSVRVMVRDAKWLNYTGVEATA 361

Query: 299 RVLKAGLDLDCG-------DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS 351
             +KAGLDLDCG       D++T + + AV+QGKI E D+D +L  +Y  LMRLG+FDG 
Sbjct: 362 AAMKAGLDLDCGMFWEGARDFFTTYGVDAVRQGKIKEGDVDNALSNVYTTLMRLGFFDGM 421

Query: 352 PQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG--PHANAT 409
           P++++LG +N+C   H ELAA+AARQG+VLLKND   LPL+   I +++LVG   H NAT
Sbjct: 422 PEFESLGASNVCTDGHKELAADAARQGMVLLKNDARRLPLDPNKINSVSLVGLLEHINAT 481

Query: 410 KAMIGNYEGTPCRYTSPMDGFYAYSKVIN--YAPGCADIVCQNNSMIPAAIDAAKNADAT 467
             M+G+Y G PCR  +P   + A   ++N  Y   C    C     +  A   AK ADAT
Sbjct: 482 DVMLGDYRGKPCRIVTP---YNAIRNMVNATYVHACDSGACNTAEGMGRASSTAKIADAT 538

Query: 468 VIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
           +++AGL++SVE E  DR DLLLP  Q+  IN VA A+  P+ LVIMSAG VD++FA NN 
Sbjct: 539 IVIAGLNMSVERESNDREDLLLPWNQSSWINAVAMASPTPIVLVIMSAGGVDVSFAHNNT 598

Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNN 584
           KI +I+W GYPGEEGG AIADV+FGKYNPGGRLP+TW++  YV +IP TSM LRP     
Sbjct: 599 KIGAIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWFKNEYVNQIPMTSMALRPDAALG 658

Query: 585 FPGRTYKFFDGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG--TN 641
           +PGRTYKF+ GP V+YPFG+GLSYT F Y   ++  +V I +   + C+ + Y +G  + 
Sbjct: 659 YPGRTYKFYGGPAVLYPFGHGLSYTNFSYASGTTGATVTIHIGAWEHCKMLTYKMGAPSP 718

Query: 642 KPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG-THIKQVIGYER 700
            P C A+ +    C +   +F + V N G + G  VV VY+ PP   G   +KQ++ + R
Sbjct: 719 SPACPALNVASHMCSEV-VSFSLRVANTGGVGGDHVVPVYTAPPPEVGDAPLKQLVAFRR 777

Query: 701 VFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
           VF+ AG +  V F +N CK+  IV+  A +++ SG  T++VG+    +SFP+ +NL
Sbjct: 778 VFVPAGAAVDVPFALNVCKTFAIVEETAYTVVPSGVSTVVVGDDALVLSFPVTINL 833


>gi|14164501|dbj|BAB55751.1| putative alpha-L-arabinofuranosidase/beta-D- xylosidase isoenzyme
           ARA-I [Oryza sativa Japonica Group]
          Length = 818

 Score =  810 bits (2091), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 412/778 (52%), Positives = 538/778 (69%), Gaps = 34/778 (4%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
           RF +  + ++ FPYCDA LPY +R +DLV RMTL EKV  +GD A G PR+GLP Y WW 
Sbjct: 51  RFAAAGLDMAGFPYCDASLPYADRVRDLVGRMTLEEKVANLGDRAGGAPRVGLPRYLWWG 110

Query: 61  EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
           EALHGVS +G     P GT F   VPGATSFP VI + ASFNE+LW+ IG  VSTE RAM
Sbjct: 111 EALHGVSDVG-----PGGTWFGDAVPGATSFPLVINSAASFNETLWRAIGGVVSTEIRAM 165

Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
           YNLG+A LT+WSPNINVVRDPRWGR  ETPGEDP+VVGRYA+N+VRG+QD++G      +
Sbjct: 166 YNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDIDGATTAASA 225

Query: 181 D------SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
                  SRP+K+S+CCKHYAAYD+D W G DR  FD+RV E+DM ETF  PFEMC+ +G
Sbjct: 226 AAATDAFSRPIKVSSCCKHYAAYDVDAWNGTDRLTFDARVQERDMVETFERPFEMCIRDG 285

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
           D S VMCSYNR+NG+P CAD +LL +T+R DW  HGYIVSDCDS++ +V   K+L  T  
Sbjct: 286 DASCVMCSYNRINGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTGV 345

Query: 295 DAVARVLKAGLDLDCG-------DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
           +A A  +KAGLDLDCG       D++T + + AV+QGK+ E+ +D +L  LY+ LMRLG+
Sbjct: 346 EATAAAMKAGLDLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLGF 405

Query: 348 FDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG--PH 405
           FDG P+ ++LG  ++C  +H ELAA+AARQG+VLLKND   LPL+   + ++AL G   H
Sbjct: 406 FDGIPELESLGAADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFGQLQH 465

Query: 406 ANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNAD 465
            NAT  M+G+Y G PCR  +P DG     KV++     A   C   S    A  AAK  D
Sbjct: 466 INATDVMLGDYRGKPCRVVTPYDGV---RKVVSSTSVHA---CDKGS-CDTAAAAAKTVD 518

Query: 466 ATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
           AT++VAGL++SVE E  DR DLLLP  Q   IN VA+A+  P+ LVIMSAG VD++FA++
Sbjct: 519 ATIVVAGLNMSVERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSFAQD 578

Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--V 582
           NPKI +++W GYPGEEGG AIADV+FGKYNPGGRLP+TWY+  YV KIP TSM LRP   
Sbjct: 579 NPKIGAVVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKNEYVSKIPMTSMALRPDAE 638

Query: 583 NNFPGRTYKFFDGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTN 641
           + +PGRTYKF+ G  V+YPFG+GLSYT F Y  A++   V +K+   + C+ + Y  G +
Sbjct: 639 HGYPGRTYKFYGGADVLYPFGHGLSYTNFTYASATAAAPVTVKVGAWEYCKQLTYKAGVS 698

Query: 642 KPP-CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG-IAGTHIKQVIGYE 699
            PP C AV +    C++ + +F + V N G  DG+ VV +Y+ PP  + G   KQ++ + 
Sbjct: 699 SPPACPAVNVASHACQE-EVSFAVTVANTGGRDGTHVVPMYTAPPAEVDGAPRKQLVAFR 757

Query: 700 RVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLN 757
           RV +AAG + +V F +N CK+  IV+  A +++ SG   +LVG+    +SFP+Q++L 
Sbjct: 758 RVRVAAGAAVEVAFALNVCKAFAIVEETAYTVVPSGVSRVLVGDDALSLSFPVQIDLQ 815


>gi|357128056|ref|XP_003565692.1| PREDICTED: beta-D-xylosidase 3-like [Brachypodium distachyon]
          Length = 821

 Score =  807 bits (2085), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 400/781 (51%), Positives = 541/781 (69%), Gaps = 36/781 (4%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVP-RLGLPLYEWW 59
           RF S+ + ++ F YCDA LPY ER +DLV R+TL EKV  +GD A G   R+GLP Y WW
Sbjct: 50  RFASLGLDMAGFRYCDASLPYAERVRDLVGRLTLEEKVANLGDQAKGAEQRVGLPRYMWW 109

Query: 60  SEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA 119
            EALHGVS       +P GT F   VPGATSFP V+ + A+FNE+LW+ IG   STE RA
Sbjct: 110 GEALHGVS-----DTNPGGTRFGDVVPGATSFPLVLNSAAAFNETLWRAIGGATSTEIRA 164

Query: 120 MYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD 179
           MYNLG+A LT+WSPNINVVRDPRWGR  ETPGEDP++VGR+A+++VR +QD++       
Sbjct: 165 MYNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFLVGRFAVSFVRAMQDIDDGANAGA 224

Query: 180 SDSRP----LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
             + P    LK+S+CCKHYAAYD+D W G DR  FD+ V E+DM ETF  PFEMCV +GD
Sbjct: 225 GAADPFARRLKVSSCCKHYAAYDVDKWFGADRLSFDANVQERDMVETFERPFEMCVRDGD 284

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKED 295
            S VMCSYNR+NG+P CA+ +LL  T+R DW  HGYIVSDCDS++ +V   K+L      
Sbjct: 285 ASCVMCSYNRINGVPACANGRLLTGTVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYDGVQ 344

Query: 296 AVARVLKAGLDLDCG-------DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
           A A  +KAGLDLDCG       D++T + + AV+QGK+ EA++D +L  LY+ LMRLG+F
Sbjct: 345 ATAAAMKAGLDLDCGMFWEGAKDFFTAYGLQAVRQGKLKEAEVDEALGHLYLTLMRLGFF 404

Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG--PHA 406
           DGSP++++LG +++C  +H E+AAEAARQG+VLLKND+  LPL+   + +LALVG   H 
Sbjct: 405 DGSPEFQSLGASDVCTEEHKEMAAEAARQGMVLLKNDHDRLPLDANKVNSLALVGLLQHI 464

Query: 407 NATKAMIGNYEGTPCRYTSPMDGFYAYSKVIN--YAPGCADIVCQNNSMIPAAIDAAKNA 464
           NAT  M+G+Y G PCR  +P   + A  KV++      C    C   ++   A  AAK  
Sbjct: 465 NATDVMLGDYRGKPCRVVTP---YEAIRKVVSGTSMQACDKGACGTTAL--GAAIAAKTV 519

Query: 465 DATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAK 524
           DAT+++ GL++SVE EG DR DLLLP  QT+ IN VA+A++ P+TLVI+SAG VDI+FA+
Sbjct: 520 DATIVITGLNMSVEREGNDREDLLLPWDQTQWINAVAEASRDPITLVIISAGGVDISFAQ 579

Query: 525 NNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVN 583
           NNPKI +ILW GYPGEEGG  IADV+FGKYNPGGRLP+TWY+  Y+ K+P TSM LRPV 
Sbjct: 580 NNPKIGAILWAGYPGEEGGTGIADVLFGKYNPGGRLPLTWYKNEYIGKLPMTSMALRPVA 639

Query: 584 N--FPGRTYKFFDGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDK--DQQCRDINYTV 638
           +  +PGRTYKF+ GP V+YPFG+GLSYT F Y   ++  SV +K+    +  C+++ Y  
Sbjct: 640 DKGYPGRTYKFYSGPDVLYPFGHGLSYTNFTYDSYTTGASVTVKIGTAWEDSCKNLTYKP 699

Query: 639 GT--NKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG-IAGTHIKQV 695
           GT  +  PC A+ +    C++ + +F ++V N G + GS VV VY+ PP  +    +KQ+
Sbjct: 700 GTTASTAPCPAINVAGHGCQE-EVSFTLKVSNTGGIGGSHVVPVYTAPPAEVDDAPLKQL 758

Query: 696 IGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLN 755
           + + R+F+ AG + +V FT++ CK+  IV+  A +++ +G   +LVG+     SFP++++
Sbjct: 759 VAFRRMFVPAGDAVEVPFTLSVCKAFAIVEGTAYTVVPAGVSRVLVGDESLSFSFPVKID 818

Query: 756 L 756
           L
Sbjct: 819 L 819


>gi|9294427|dbj|BAB02547.1| beta-1,4-xylosidase [Arabidopsis thaliana]
          Length = 876

 Score =  798 bits (2062), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 401/747 (53%), Positives = 508/747 (68%), Gaps = 37/747 (4%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           + + +C+  L Y  RAKDLV R++L EKVQQ+ + A GVPRLG+P YEWWSEALHGVS +
Sbjct: 37  AKYGFCNVSLSYEARAKDLVSRLSLKEKVQQLVNKATGVPRLGVPPYEWWSEALHGVSDV 96

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
           G      PG HF+  VPGATSFP  ILT ASFN SLW K+G+ VSTEARAM+N+G AGLT
Sbjct: 97  G------PGVHFNGTVPGATSFPATILTAASFNTSLWLKMGEVVSTEARAMHNVGLAGLT 150

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
           +WSPN+NV RDPRWGR  ETPGEDP VV +YA+NYV+GLQDV     H    SR LK+S+
Sbjct: 151 YWSPNVNVFRDPRWGRGQETPGEDPLVVSKYAVNYVKGLQDV-----HDAGKSRRLKVSS 205

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKHY AYDLDNW+G DRFHFD++VT+QD+++T+  PF+ CV EGDVSSVMCSYNRVNGI
Sbjct: 206 CCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQTPFKSCVEEGDVSSVMCSYNRVNGI 265

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           PTCADP LL   IRG W   GYIVSDCDSIQ       +   T+EDAVA  LKAGL+++C
Sbjct: 266 PTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFNDIHY-TKTREDAVALALKAGLNMNC 324

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
           GD+   +T  AV+  K+  +D+D +L + YIVLMRLG+FDG P+   + NLG +++C+  
Sbjct: 325 GDFLGKYTENAVKLKKLNGSDVDEALIYNYIVLMRLGFFDGDPKSLPFGNLGPSDVCSKD 384

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
           H  LA EAA+QGIVLL+N  G LPL    +K LA++GP+ANATK MI NY G PC+YTSP
Sbjct: 385 HQMLALEAAKQGIVLLEN-RGDLPLPKTTVKKLAVIGPNANATKVMISNYAGVPCKYTSP 443

Query: 427 MDGFYAY-SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           + G   Y  + I Y PGC D+ C + ++I AA+ A   AD TV+V GLD +VEAEG DRV
Sbjct: 444 IQGLQKYVPEKIVYEPGCKDVKCGDQTLISAAVKAVSEADVTVLVVGLDQTVEAEGLDRV 503

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           +L LPG+Q +L+  VA+AAK  V LVIMSAG +DI+FAKN   I+++LWVGYPGE GG A
Sbjct: 504 NLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKNLSTIRAVLWVGYPGEAGGDA 563

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
           IA VIFG YNP GRLP TWY   +  K+  T M +RP   + FPGR+Y+F+ G  +Y FG
Sbjct: 564 IAQVIFGDYNPSGRLPETWYPQEFADKVAMTDMNMRPNSTSGFPGRSYRFYTGKPIYKFG 623

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
           YGLSY+ F   V S+P  + IK          N  +  NK    +V I  V C D K   
Sbjct: 624 YGLSYSSFSTFVLSAPSIIHIK---------TNPIMNLNK--TTSVDISTVNCHDLKIRI 672

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIA------GTHIKQVIGYERVFIAAGQSAKVGFTMN 716
            I V+N G   GS VV+V+ KPP  +      G  + Q++G+ERV +    + K     +
Sbjct: 673 VIGVKNHGLRSGSHVVLVFWKPPKCSKSLVGGGVPLTQLVGFERVEVGRSMTEKFTVDFD 732

Query: 717 ACKSLKIVDNAANSLLASGAHTILVGE 743
            CK+L +VD      L +G H +++G 
Sbjct: 733 VCKALSLVDTHGKRKLVTGHHKLVIGS 759


>gi|15230897|ref|NP_188596.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
 gi|259585724|sp|Q9LJN4.2|BXL5_ARATH RecName: Full=Probable beta-D-xylosidase 5; Short=AtBXL5; Flags:
           Precursor
 gi|332642747|gb|AEE76268.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
          Length = 781

 Score =  798 bits (2060), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 401/747 (53%), Positives = 508/747 (68%), Gaps = 37/747 (4%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           + + +C+  L Y  RAKDLV R++L EKVQQ+ + A GVPRLG+P YEWWSEALHGVS +
Sbjct: 37  AKYGFCNVSLSYEARAKDLVSRLSLKEKVQQLVNKATGVPRLGVPPYEWWSEALHGVSDV 96

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
           G      PG HF+  VPGATSFP  ILT ASFN SLW K+G+ VSTEARAM+N+G AGLT
Sbjct: 97  G------PGVHFNGTVPGATSFPATILTAASFNTSLWLKMGEVVSTEARAMHNVGLAGLT 150

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
           +WSPN+NV RDPRWGR  ETPGEDP VV +YA+NYV+GLQDV     H    SR LK+S+
Sbjct: 151 YWSPNVNVFRDPRWGRGQETPGEDPLVVSKYAVNYVKGLQDV-----HDAGKSRRLKVSS 205

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKHY AYDLDNW+G DRFHFD++VT+QD+++T+  PF+ CV EGDVSSVMCSYNRVNGI
Sbjct: 206 CCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQTPFKSCVEEGDVSSVMCSYNRVNGI 265

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           PTCADP LL   IRG W   GYIVSDCDSIQ       +   T+EDAVA  LKAGL+++C
Sbjct: 266 PTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFNDIHY-TKTREDAVALALKAGLNMNC 324

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
           GD+   +T  AV+  K+  +D+D +L + YIVLMRLG+FDG P+   + NLG +++C+  
Sbjct: 325 GDFLGKYTENAVKLKKLNGSDVDEALIYNYIVLMRLGFFDGDPKSLPFGNLGPSDVCSKD 384

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
           H  LA EAA+QGIVLL+N  G LPL    +K LA++GP+ANATK MI NY G PC+YTSP
Sbjct: 385 HQMLALEAAKQGIVLLEN-RGDLPLPKTTVKKLAVIGPNANATKVMISNYAGVPCKYTSP 443

Query: 427 MDGFYAY-SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           + G   Y  + I Y PGC D+ C + ++I AA+ A   AD TV+V GLD +VEAEG DRV
Sbjct: 444 IQGLQKYVPEKIVYEPGCKDVKCGDQTLISAAVKAVSEADVTVLVVGLDQTVEAEGLDRV 503

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           +L LPG+Q +L+  VA+AAK  V LVIMSAG +DI+FAKN   I+++LWVGYPGE GG A
Sbjct: 504 NLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKNLSTIRAVLWVGYPGEAGGDA 563

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
           IA VIFG YNP GRLP TWY   +  K+  T M +RP   + FPGR+Y+F+ G  +Y FG
Sbjct: 564 IAQVIFGDYNPSGRLPETWYPQEFADKVAMTDMNMRPNSTSGFPGRSYRFYTGKPIYKFG 623

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
           YGLSY+ F   V S+P  + IK          N  +  NK    +V I  V C D K   
Sbjct: 624 YGLSYSSFSTFVLSAPSIIHIK---------TNPIMNLNK--TTSVDISTVNCHDLKIRI 672

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIA------GTHIKQVIGYERVFIAAGQSAKVGFTMN 716
            I V+N G   GS VV+V+ KPP  +      G  + Q++G+ERV +    + K     +
Sbjct: 673 VIGVKNHGLRSGSHVVLVFWKPPKCSKSLVGGGVPLTQLVGFERVEVGRSMTEKFTVDFD 732

Query: 717 ACKSLKIVDNAANSLLASGAHTILVGE 743
            CK+L +VD      L +G H +++G 
Sbjct: 733 VCKALSLVDTHGKRKLVTGHHKLVIGS 759


>gi|357153280|ref|XP_003576399.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
           distachyon]
          Length = 807

 Score =  795 bits (2054), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 400/789 (50%), Positives = 531/789 (67%), Gaps = 63/789 (7%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
           RF +  + +S + YCDAKLPY +R +DL+  MT+ EKV  +GD A G PR+GLP Y+WWS
Sbjct: 49  RFAAAGLDMSRYRYCDAKLPYGDRVRDLIGWMTVEEKVSNLGDWAAGAPRVGLPPYKWWS 108

Query: 61  EALHGVSFIGRRTNSPPGTHFD-----------SEVPGATSFPTVILTTASFNESLWKKI 109
           EALHG+S  G      P T FD           + V   T F  VI + ASFNESLW+ I
Sbjct: 109 EALHGLSSTG------PTTKFDDLKKPRLHSGRAAVFNGTVFANVINSAASFNESLWRSI 162

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           GQ +STEARAMYNLG  GLT+WSPNINVVRDPRWGR LETPGEDP+VVGRYA+N+VRG+Q
Sbjct: 163 GQAISTEARAMYNLGKGGLTYWSPNINVVRDPRWGRALETPGEDPFVVGRYAVNFVRGMQ 222

Query: 170 DVE--GVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPF 227
           DV+     ++ D  SRPLK SACCKHYAAYD+D+W G+ RF FD+RVTE+DM ETF  PF
Sbjct: 223 DVDDAAAGFNGDPLSRPLKTSACCKHYAAYDVDDWYGHTRFKFDARVTERDMVETFQRPF 282

Query: 228 EMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHK 287
           EMCV +GD S+VMCSYNRVNGIP CAD +LL  T+R DW  HGYIVSDCD+++ + ++  
Sbjct: 283 EMCVRDGDASAVMCSYNRVNGIPACADARLLAGTLRRDWGLHGYIVSDCDAVRVMTDNAT 342

Query: 288 FLNDTKEDAVARVLKAGLDLDCG------------DYYTNFTMGAVQQGKIAEADIDTSL 335
           +L  T  +A A  LKAGLDLDCG            D+ + + M AV+QGK+ E+DID +L
Sbjct: 343 WLGYTPAEASAASLKAGLDLDCGESWIVQKGKPVMDFLSTYGMAAVRQGKMRESDIDNAL 402

Query: 336 RFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGN 395
             LY  LMRLGYFDG P+Y++L + +IC+  H  LA + ARQ +VLLKN +G LPL+   
Sbjct: 403 VNLYTTLMRLGYFDGMPRYESLDEKDICSEAHRSLALDGARQSMVLLKNLDGLLPLDASK 462

Query: 396 IKTLALVGPHANA-TKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMI 454
           + ++A+ GPHA A  K M G+Y G PCRY +P +G    SK +N                
Sbjct: 463 LASVAVRGPHAEAPEKVMDGDYTGPPCRYITPREGI---SKDVNI--------------- 504

Query: 455 PAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMS 514
                + +  D T+ + G+++ +E EG DR DLLLP  QTE I +VA A+  P+ LVI+S
Sbjct: 505 -----SQQGGDVTIYMGGINMHIEREGNDREDLLLPKNQTEEILRVAAASPSPIVLVILS 559

Query: 515 AGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIP 573
            G +D++FA+++PKI +ILW GYPG EGG AIADVIFG+YNPGGRLP+TW++  Y+ ++P
Sbjct: 560 GGGIDVSFAQSHPKIGAILWAGYPGGEGGHAIADVIFGRYNPGGRLPLTWFKNKYIHQLP 619

Query: 574 YTSMPLRPV--NNFPGRTYKFFDGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQ 630
            TSM LRP   + +PGRTYKF+DGP V+YPFGYGLSYT+F+Y++ +   +V +   + + 
Sbjct: 620 MTSMALRPRPEHGYPGRTYKFYDGPDVLYPFGYGLSYTKFRYELLNKETAVTLAPGR-RH 678

Query: 631 CRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG-IAG 689
           CR ++Y  G+  P C AV +    C +   +F + V N GK DG+  V+VY+ PP  +AG
Sbjct: 679 CRQLSYKTGSVGPDCPAVDVASHACAE-TVSFNVSVVNAGKADGANAVLVYTAPPAELAG 737

Query: 690 THIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG-VGGV 748
             IKQV  + RV + AG +  V FT+N CK+  IV+  A +++ SG  T++V  G    V
Sbjct: 738 APIKQVAAFRRVAVKAGAAETVVFTLNVCKAFGIVEKTAYTVVPSGVSTVIVENGDSSAV 797

Query: 749 SFPLQLNLN 757
           SFP+Q++ +
Sbjct: 798 SFPVQISFS 806


>gi|413954831|gb|AFW87480.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 814

 Score =  788 bits (2034), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 399/785 (50%), Positives = 525/785 (66%), Gaps = 56/785 (7%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
           RF  + + +S FPYCDA LPY +R +DL+  MT+ EKV  +GD+++G PR+GLP Y+WWS
Sbjct: 57  RFAEMGLNMSAFPYCDASLPYADRVRDLIGWMTVEEKVGNLGDISHGAPRVGLPPYKWWS 116

Query: 61  EALHGVSFIGRRT-----NSPPGTHFD-SEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
           EALHGVS  G        +S PG H   + V  AT F  VI + ASFNE+LW  IGQ VS
Sbjct: 117 EALHGVSSTGPTMLFDDLHSKPGNHSGRATVNNATVFANVINSAASFNETLWNSIGQAVS 176

Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
           TEARAMYNLG  GLT+WSPNINVVRDPRWGR LETPGEDPYV GRYA+N+VRG+QD+ G 
Sbjct: 177 TEARAMYNLGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVAGRYAVNFVRGMQDIPG- 235

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
            Y  D  +RP+K SACCKH+AAYD+DNW    RF +D+RV+E+DM ETF+ PFEMCV EG
Sbjct: 236 HYSGDPSARPIKTSACCKHHAAYDVDNWHNQTRFTYDARVSERDMAETFLRPFEMCVREG 295

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
           DVSSVMCSYNRVNG+P CAD +LL+ T+RG+W+ +GYIVSDCD+++ + ++  +LN T  
Sbjct: 296 DVSSVMCSYNRVNGVPACADARLLSGTVRGEWHLNGYIVSDCDAVRVMTDNATWLNFTAA 355

Query: 295 DAVARVLKAGLDLDCG------------DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVL 342
           ++ A  L+AG+DLDC             DY + + M AV QGK+ E+DID +L  LY+ L
Sbjct: 356 ESSAVSLRAGMDLDCAESWIEEEGRPLRDYLSEYGMAAVAQGKMRESDIDNALTNLYMTL 415

Query: 343 MRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALV 402
           MRLGYFD  P+Y +L + ++C  +H  LA + ARQGIVLLKND+G LPL+      +A+ 
Sbjct: 416 MRLGYFDNIPRYASLNETDVCTDEHKSLALDGARQGIVLLKNDHGLLPLDPKKTLAVAVH 475

Query: 403 GPHANA-TKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAA 461
           GPHA A  K M G+Y G PCRY +P  G                        I   +  +
Sbjct: 476 GPHARAPEKIMDGDYTGPPCRYVTPRQG------------------------ISRDVKIS 511

Query: 462 KNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDIN 521
             A  T+ + G++L +E EG DR DLLLP  QTE I   A A+  P+ LVI+S G +DI+
Sbjct: 512 HKAKMTIYLGGINLYIEREGNDREDLLLPKNQTEEILHFAQASPTPIILVILSGGGIDIS 571

Query: 522 FAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR 580
           FA+ +PKI +ILW GYPG EGG AIADVIFG+YNPGGRLP+TW++  Y+ +IP TSM  R
Sbjct: 572 FAQKHPKIGAILWAGYPGGEGGNAIADVIFGRYNPGGRLPLTWFKNKYIEQIPMTSMEFR 631

Query: 581 PV--NNFPGRTYKFFDGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINY- 636
           PV    +PGRTYKF+DGP V+YPFGYGLSYT+F+Y+ ++   SV +       C+ ++Y 
Sbjct: 632 PVPEKGYPGRTYKFYDGPEVLYPFGYGLSYTKFQYETSTDGVSVSLPA-PGGHCKGLSYK 690

Query: 637 -TVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQ 694
            +V T  P C AV + D  C +   +F + V N G   G+ VV+VY+  PP +A   IKQ
Sbjct: 691 PSVAT-VPACQAVNVADHACTE-TVSFNVSVTNAGGRGGAHVVLVYTAPPPEVAEAPIKQ 748

Query: 695 VIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV--GEGVGGVSFPL 752
           V  + RVF+AA  +A V F +N CK+  IV+  A +++ SG   +LV  G+    VSFP+
Sbjct: 749 VAAFRRVFVAARSTATVPFALNVCKAFGIVERTAYTVVPSGVSKVLVENGDSSSSVSFPV 808

Query: 753 QLNLN 757
           +++L+
Sbjct: 809 KIDLS 813


>gi|356574315|ref|XP_003555294.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 5-like
           [Glycine max]
          Length = 901

 Score =  783 bits (2022), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 403/749 (53%), Positives = 519/749 (69%), Gaps = 23/749 (3%)

Query: 8   KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
           K S+FP+CD  L Y +RAKDLV R+TL EK QQ+ + + G+ RLG+P YEWWSEALHGVS
Sbjct: 30  KTSNFPFCDTSLSYEDRAKDLVSRLTLQEKTQQLVNPSAGISRLGVPAYEWWSEALHGVS 89

Query: 68  FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
            +G      PGT FD +VPGATSFP VIL+ ASFN SLW+K+GQ VSTEARAMYN+  AG
Sbjct: 90  NLG------PGTRFDKKVPGATSFPAVILSAASFNASLWQKMGQVVSTEARAMYNVDLAG 143

Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
           LTFWSPN+NV RDPRWGR  ETPGEDP VV RYA+ Y+RGLQ+VE       + +  LK+
Sbjct: 144 LTFWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVMYLRGLQEVED---EASAKADRLKV 200

Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
           S+CCKHY AYDLDNW+G DRFHFD++VT+QD+++++  PF+ CV EG VSSVMCSYNRVN
Sbjct: 201 SSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDSYQPPFKSCVVEGHVSSVMCSYNRVN 260

Query: 248 GIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL 307
           GIPTCADP LL   IRG W   GYIVSDCDS++    +  +   T EDAVA  LKAGL++
Sbjct: 261 GIPTCADPDLLKGIIRGQWGLDGYIVSDCDSVEVYYNAIHY-TATPEDAVALALKAGLNM 319

Query: 308 DCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG--SPQYKNLGKNNICNP 365
           +CGD+   +T  AV   K+  A +D +L + YIVLMRLG+FD   S  + NLG +++C  
Sbjct: 320 NCGDFLKKYTANAVNLKKVDVATVDQALVYNYIVLMRLGFFDDPKSLPFANLGPSDVCTK 379

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            + +LA +AA+QGIVLL+N+NGALPL+  NIK LA++GP+ANAT  MI NY G PCRYTS
Sbjct: 380 DNQQLALDAAKQGIVLLENNNGALPLSQTNIKKLAVIGPNANATTVMISNYAGIPCRYTS 439

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+ G   Y   +NYAPGC+++ C N S+I AA+ AA +ADA V+V GLD S+EAEG DR 
Sbjct: 440 PLQGLQKYISSVNYAPGCSNVKCDNQSLIAAAVKAAASADAVVLVVGLDQSIEAEGLDRE 499

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           +L LPGFQ + +  VA A KG V LVIM+AG +DI+  K+   I  ILWVGYPG+ GG A
Sbjct: 500 NLTLPGFQEKFVKDVAGATKGKVILVIMAAGPIDISSTKSVSNIGGILWVGYPGQAGGDA 559

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
           IA VIFG YNPGGR P TWY  +YV ++P T M +R     NFPGRTY+F++G  +Y FG
Sbjct: 560 IAQVIFGDYNPGGRSPFTWYPQSYVDQVPMTDMNMRANKSRNFPGRTYRFYNGNSLYEFG 619

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDI--NYTVGTNKPPCA---AVLIDDVKCKD 657
           +GLSY+ F   VAS+P S+ I+     +  ++  +   GT     +   A+ I  + C+D
Sbjct: 620 HGLSYSTFSMYVASAPSSIMIENTSISEPHNMLSSNNSGTQVESLSDGQAIDISTINCQD 679

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPPG---IAGTHIKQVIGYERVFIAAGQSAKVGFT 714
             F   I V+N G ++GS VV+V+ +P     + G  IKQ+IG+ERV +  G +  V   
Sbjct: 680 LTFLLVIGVKNNGPLNGSHVVLVFWEPATSEFVIGAPIKQLIGFERVQVVVGVTEFVTVK 739

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVGE 743
           ++ C+ +  VD+     L  G HTILVG 
Sbjct: 740 IDICQLISNVDSDGKRKLVIGQHTILVGS 768


>gi|225437531|ref|XP_002270249.1| PREDICTED: probable beta-D-xylosidase 2 [Vitis vinifera]
 gi|297743965|emb|CBI36935.3| unnamed protein product [Vitis vinifera]
          Length = 768

 Score =  782 bits (2020), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/738 (51%), Positives = 501/738 (67%), Gaps = 33/738 (4%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           FP+C   +   ER KDL+ R+TL EKV+ + + A GVPRLG+  YEWWSEALHGVS +G 
Sbjct: 41  FPFCRKSIGIGERVKDLIGRLTLEEKVRLLVNNAAGVPRLGIKGYEWWSEALHGVSNVG- 99

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
                PGT F  + PGATSFP VI T ASFN SLW+ IGQ VS EARAMYN G AGLTFW
Sbjct: 100 -----PGTKFSGDFPGATSFPQVITTAASFNSSLWEAIGQVVSDEARAMYNGGAAGLTFW 154

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
           SPN+N+ RDPRWGR  ETPGEDP + G+YA  YVRGLQ   G           LK++ACC
Sbjct: 155 SPNVNIFRDPRWGRGQETPGEDPVLAGKYAARYVRGLQGNAGDR---------LKVAACC 205

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KH+ AYDLDNW G DRFHFD+RV++Q+M++TF +PF  CV EG V+SVMCSYN+VNG+PT
Sbjct: 206 KHFTAYDLDNWNGVDRFHFDARVSKQEMEDTFDVPFRSCVVEGKVASVMCSYNQVNGVPT 265

Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
           CADP LL  T+R  W+ +GY+VSDCDS+    ++  + N T E+A A  +KAGLDLDCG 
Sbjct: 266 CADPNLLRNTVRKQWHLNGYVVSDCDSVGVFYDNQHYTN-TPEEAAADAIKAGLDLDCGP 324

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHI 368
           +    T  A+++G ++EAD+D++L     V MRLG FDG P    + +LG  ++C+P H 
Sbjct: 325 FLAVHTQDAIKKGLVSEADVDSALVNTVTVQMRLGMFDGEPSAQPFGDLGPKDVCSPAHQ 384

Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
           ELA EAARQGIVLLKN   +LPL+T + +++A++GP+++A   MIGNY G PC YT+P+ 
Sbjct: 385 ELAIEAARQGIVLLKNHGHSLPLSTRSHRSIAVIGPNSDANVTMIGNYAGIPCEYTTPLQ 444

Query: 429 GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
           G   YS+ I +  GCAD+ C  + +   AIDAA  ADATV+V GLD S+EAE KDR DLL
Sbjct: 445 GIGRYSRTI-HQKGCADVACSEDQLFAGAIDAASQADATVLVMGLDQSIEAEAKDRADLL 503

Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
           LPG Q EL++KVA A++GP  LV+MS G VD++FAK +P+I +I+W GYPG+ GG AIAD
Sbjct: 504 LPGRQQELVSKVAMASRGPTVLVLMSGGPVDVSFAKKDPRIAAIVWAGYPGQAGGAAIAD 563

Query: 549 VIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGL 605
           ++FG  NPGG+LP+TWY   Y+ K+P T+M +R  P   +PGRTY+F+ GPVVY FG+GL
Sbjct: 564 ILFGVANPGGKLPMTWYPQEYLSKVPMTTMAMRAIPSKAYPGRTYRFYKGPVVYRFGHGL 623

Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIE 665
           SYT F + +A +P +V I L         N TV        A+ +   KC        ++
Sbjct: 624 SYTNFVHTIAQAPTAVAIPLHGHH-----NTTVSGK-----AIRVTHAKCNRLSIALHLD 673

Query: 666 VENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVD 725
           V+N+G  DGS  ++V+SKPP       KQ++ +E+V +AA    +V   ++ CK L +VD
Sbjct: 674 VKNVGNKDGSHTLLVFSKPPAGHWAPHKQLVAFEKVHVAARTQQRVQINIHVCKYLSVVD 733

Query: 726 NAANSLLASGAHTILVGE 743
            +    +  G H + +G+
Sbjct: 734 RSGIRRIPMGQHGLHIGD 751


>gi|255548487|ref|XP_002515300.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223545780|gb|EEF47284.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 768

 Score =  781 bits (2018), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 377/738 (51%), Positives = 500/738 (67%), Gaps = 32/738 (4%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
           + P+C  KLP  +R KDL+ R+TL EKV  + + A  V RLG+  YEWWSEALHGVS +G
Sbjct: 39  NLPFCQVKLPIQDRVKDLIGRLTLAEKVGLLVNNAGAVSRLGIKGYEWWSEALHGVSNVG 98

Query: 71  RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
                 PGT F    PGATSFP VI T ASFN +LW+ IG+ VS EARAMYN G AGLT+
Sbjct: 99  ------PGTKFGGSFPGATSFPQVITTAASFNSTLWEAIGRVVSDEARAMYNGGAAGLTY 152

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           WSPN+N++RDPRWGR  ETPGEDP +VG+YA +YV+GLQ          +D   LK++AC
Sbjct: 153 WSPNVNILRDPRWGRGQETPGEDPLLVGKYAASYVKGLQG---------NDGERLKVAAC 203

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH+ AYDLDNW G DRFHF+++V++QDM++TF +PF MCV EG V+SVMCSYN+VNGIP
Sbjct: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDMKDTFDVPFRMCVKEGKVASVMCSYNQVNGIP 263

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
           TCADP LL +T+R  W  +GYIVSDCDS+    +   +   T E+A A  +KAGLDLDCG
Sbjct: 264 TCADPNLLRKTVRTQWGLNGYIVSDCDSVGVFYDKQHY-TSTPEEAAADAIKAGLDLDCG 322

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
            +    T  AV++G I+EAD++ +L     V MRLG FDG P    Y NLG  ++C P H
Sbjct: 323 PFLAVHTQDAVKRGLISEADVNGALFNTLTVQMRLGMFDGEPSAQPYGNLGPKDVCTPAH 382

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
            ELA EA RQGIVLLKN   +LPL+    +T+A++GP++N T  MIGNY G  C+YT+P+
Sbjct: 383 QELALEAGRQGIVLLKNHGPSLPLSPRRHRTVAIIGPNSNVTVTMIGNYAGVACQYTTPL 442

Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
            G  +Y+K I +  GCAD+ C  + +   AIDAA+ ADATV+V GLD S+EAE +DR  L
Sbjct: 443 QGIGSYAKTI-HQQGCADVGCVTDQLFSGAIDAARQADATVLVMGLDQSIEAEFRDRTGL 501

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
           LLPG Q EL++KVA A+KGP  LV+MS G +D++FAK +PKI +ILW GYPG+ GG AIA
Sbjct: 502 LLPGRQQELVSKVAMASKGPTILVLMSGGPIDVSFAKKDPKIAAILWAGYPGQAGGAAIA 561

Query: 548 DVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGYG 604
           DV+FG  NPGG+LP+TWY   Y+  +P T M +R   +  +PGRTY+F+ G VVYPFG+G
Sbjct: 562 DVLFGTINPGGKLPMTWYPQEYITNLPMTEMAMRSSQSKGYPGRTYRFYQGKVVYPFGHG 621

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
           +SYT F + +AS+P  V + LD  +         G       A+ +   KC       Q+
Sbjct: 622 MSYTHFVHNIASAPTMVSVPLDGHR---------GNTSISGKAIRVTHTKCNKLSLGIQV 672

Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIV 724
           +V+N+G  DG+  ++VYS PP    +  KQ++ +ERV ++AG   +VG +++ CK L +V
Sbjct: 673 DVKNVGSKDGTHTLLVYSAPPAGRWSPHKQLVAFERVHVSAGTQERVGISIHVCKLLSVV 732

Query: 725 DNAANSLLASGAHTILVG 742
           D +    +  G H+I +G
Sbjct: 733 DRSGIRRIPIGEHSIHIG 750


>gi|357442285|ref|XP_003591420.1| Beta xylosidase [Medicago truncatula]
 gi|355480468|gb|AES61671.1| Beta xylosidase [Medicago truncatula]
          Length = 765

 Score =  781 bits (2016), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/742 (50%), Positives = 508/742 (68%), Gaps = 34/742 (4%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           ++FP+C A LP P R  DL+ R+TL EKV  + + A  VPR+G+  YEWWSEALHGVS +
Sbjct: 33  NNFPFCKASLPIPTRVNDLIGRLTLQEKVSMLVNNAAAVPRVGIKGYEWWSEALHGVSNV 92

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
           G      PGT F  + P ATSFP VI T ASFN SLW+ IG+  S EARAMYN G AGLT
Sbjct: 93  G------PGTKFAGQFPAATSFPQVITTVASFNASLWEAIGRVASDEARAMYNGGTAGLT 146

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
           +WSPN+N+ RDPRWGR  ETPGEDP + G+YA +YVRGLQ          +DS  LK++A
Sbjct: 147 YWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQG---------TDSSRLKVAA 197

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
            CKH+ AYDLDNW G DRFHF+++V++QDM++TF +PF MCV EG+V+SVMCSYN+VNG+
Sbjct: 198 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDMEDTFNVPFRMCVKEGNVASVMCSYNQVNGV 257

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           PTCADP LL +TIRG W+  GYIVSDCDS+  +  +++    T E+A A  +KAGLDLDC
Sbjct: 258 PTCADPNLLKRTIRGQWHLDGYIVSDCDSVG-VFYTNQHYTSTPEEAAADAIKAGLDLDC 316

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
           G +    T  AV++G + E D++ +L     V MRLG FDG P    Y NLG  ++C P 
Sbjct: 317 GPFLAQHTQNAVKKGLLTETDVNGALANTLTVQMRLGMFDGEPSAQPYGNLGPTDVCTPT 376

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
           H ELA +AARQGIVLLKN   +LPL+T N +T+A++GP++NAT  MIGNY G  C YTSP
Sbjct: 377 HQELALDAARQGIVLLKNTGPSLPLSTKNHQTVAVIGPNSNATVTMIGNYAGIACGYTSP 436

Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
           + G   Y++ I + PGCA++ C ++    +A++AA+ ADATV+V GLD S+EAE  DR  
Sbjct: 437 LQGIGKYARTI-HEPGCANVACNDDKQFGSALNAARQADATVLVMGLDQSIEAEMVDRTG 495

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           LLLPG Q +L++KVA A++GP  LV+MS G +DI FAKN+P+I  ILW GYPG+ GG AI
Sbjct: 496 LLLPGHQQDLVSKVAAASRGPTILVLMSGGPIDITFAKNDPRIMGILWAGYPGQAGGAAI 555

Query: 547 ADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGY 603
           AD++FG  NPG +LP+TWY   Y+K +  T+M +RP ++  +PGRTY+F++GPVVYPFGY
Sbjct: 556 ADILFGTTNPGAKLPMTWYPQGYLKNLAMTNMAMRPSSSTGYPGRTYRFYNGPVVYPFGY 615

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           GLSYT F + +AS+PK V + +D  ++    N          AA+ +   +C        
Sbjct: 616 GLSYTNFVHTLASAPKVVSVPVDGHRRGNSSNK---------AAIRVTHARCGKLSIRLD 666

Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHI--KQVIGYERVFIAAGQSAKVGFTMNACKSL 721
           I+V+N+G  DG+  ++V+S PP   G     KQ++ +E+V++ A    +V   ++ CK L
Sbjct: 667 IDVKNVGSKDGTNTLLVFSVPPTGNGHWAPQKQLVAFEKVYVPAKAQQRVRINIHVCKLL 726

Query: 722 KIVDNAANSLLASGAHTILVGE 743
            +VD +    +  GAH+I +G+
Sbjct: 727 SVVDKSGTRRIPMGAHSIHIGD 748


>gi|242093144|ref|XP_002437062.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
 gi|241915285|gb|EER88429.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
          Length = 809

 Score =  780 bits (2015), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 397/786 (50%), Positives = 523/786 (66%), Gaps = 56/786 (7%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
           RF  + + +S FPYCDA LPY +R +DL+  MT+ EKV  +GD+++G PR+GLP Y+WWS
Sbjct: 50  RFAEMGLNMSAFPYCDASLPYADRVRDLIGWMTVEEKVGNLGDVSHGAPRVGLPPYKWWS 109

Query: 61  EALHGVSFIGRRT-----NSPPGTHFD-SEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
           EALHGVS  G        +S PG H   + V  AT F  VI + ASFNE+LWK IGQ VS
Sbjct: 110 EALHGVSSTGPTMLFDDLHSKPGNHSGRATVNNATVFANVINSAASFNETLWKSIGQAVS 169

Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
           TEARAMYNLG  GLT+WSPNINVVRDPRWGR LETPGEDP+V GRYA+N+VRG+QD+ G 
Sbjct: 170 TEARAMYNLGKGGLTYWSPNINVVRDPRWGRALETPGEDPFVAGRYAVNFVRGMQDIPGH 229

Query: 175 EYHRDSDS-RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNE 233
           +   D  S RP+K SACCKHYAAYD+D+W  + RF FD+RV+E+DM ETF+ PFEMCV +
Sbjct: 230 DGGGDDPSTRPIKTSACCKHYAAYDVDDWHNHTRFTFDARVSERDMAETFLRPFEMCVRD 289

Query: 234 GDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTK 293
           GD S VMCSYNRVNGIP CAD +LL+ TIRGDW  HGYIVSDCD+++ + ++  +L+ T 
Sbjct: 290 GDASGVMCSYNRVNGIPACADARLLSGTIRGDWQLHGYIVSDCDAVRVMTDNATWLHFTG 349

Query: 294 EDAVARVLKAGLDLDCG------------DYYTNFTMGAVQQGKIAEADIDTSLRFLYIV 341
            ++ A  ++AGLDLDC             D+ + +   AV QGK+ E+DID++LR  Y+ 
Sbjct: 350 AESSAASIRAGLDLDCAESWIEEKGRPLRDFLSEYGKAAVAQGKMRESDIDSALRNQYMT 409

Query: 342 LMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLAL 401
           LMRLGYFD  P+Y +L + +IC  +H  LA + ARQG+VLLKND+G LPL+   I  +A+
Sbjct: 410 LMRLGYFDNIPRYASLNETDICTDEHKSLAHDGARQGMVLLKNDDGLLPLDPEKILAVAV 469

Query: 402 VGPHANA-TKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDA 460
            GPHA A  K M G+Y G PCRY +P  G                        I   +  
Sbjct: 470 HGPHARAPEKIMDGDYTGPPCRYVTPRQG------------------------ISKDVKI 505

Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
           +  A+ T+ + G++L +E EG DR DLLLP  QTE I   A A+  P+ LVI+S G +DI
Sbjct: 506 SHRANTTIYLGGINLHIEREGNDREDLLLPKNQTEEILHFAKASPNPIILVILSGGGIDI 565

Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPL 579
           +FA  +PKI +ILW GYPG EGG AIADVIFG+YNPGGRLP+TW++  Y+ +IP TSM  
Sbjct: 566 SFAHKHPKIGAILWAGYPGGEGGNAIADVIFGRYNPGGRLPLTWFKNKYIQQIPMTSMEF 625

Query: 580 RPV--NNFPGRTYKFFDGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINY 636
           RPV    +PGRTYKF+DGP V+YPFGYGLSYT+F Y+ +++  +V +       C+ ++Y
Sbjct: 626 RPVPEKGYPGRTYKFYDGPEVLYPFGYGLSYTKFLYETSTNGTAVTLPA-TGGHCKGLSY 684

Query: 637 --TVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIK 693
             +V T  P C AV +    C +   +F I V N G   G+ VV+VY+  PP +A   IK
Sbjct: 685 KPSVATT-PACQAVDVAGHACTE-TVSFNISVTNAGGRGGAHVVLVYTAPPPEVAQAPIK 742

Query: 694 QVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV--GEGVGGVSFP 751
           QV  + RVF+ A  +A V FT+N CK+  IV+  A +++ SG   +LV  G+    VSFP
Sbjct: 743 QVAAFRRVFVPARSTATVPFTLNVCKAFGIVERTAYTVVPSGVSKVLVQNGDSSSSVSFP 802

Query: 752 LQLNLN 757
           ++++ +
Sbjct: 803 VKIDFS 808


>gi|357444469|ref|XP_003592512.1| Xylosidase [Medicago truncatula]
 gi|355481560|gb|AES62763.1| Xylosidase [Medicago truncatula]
          Length = 781

 Score =  777 bits (2006), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 401/750 (53%), Positives = 513/750 (68%), Gaps = 37/750 (4%)

Query: 8   KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
           K S+FP+C+  L Y  RAKDLV R+TL EK QQ+ + + G+ RLG+P YEWWSEALHGVS
Sbjct: 32  KTSNFPFCNTSLSYETRAKDLVSRLTLQEKAQQLVNPSTGISRLGVPAYEWWSEALHGVS 91

Query: 68  FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
            +G      PGT FDS VPGATSFP VIL+ ASFNE+LW  +GQ VS EARAMYN+  AG
Sbjct: 92  NVG------PGTRFDSRVPGATSFPAVILSAASFNETLWYTMGQVVSNEARAMYNVDLAG 145

Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
           LTFWSPN+NV RDPRWGR  ETPGEDP VV RYA+NYVRGLQ+V G E     D   LK+
Sbjct: 146 LTFWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEV-GDEASAKGDR--LKV 202

Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
           S+CCKHY AYD+DNW+G DRFHFD++VT+QD+++T+  PF+ CV EG VSSVMCSYNRVN
Sbjct: 203 SSCCKHYTAYDVDNWKGVDRFHFDAKVTKQDLEDTYQPPFKSCVLEGHVSSVMCSYNRVN 262

Query: 248 GIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL 307
           GIPTCADP LL   IRG W   GYIVSDCDS++    S  +   T EDAVA  LKAGL++
Sbjct: 263 GIPTCADPDLLQGVIRGQWGLDGYIVSDCDSVEVYYNSIHY-TKTPEDAVALALKAGLNM 321

Query: 308 DCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG--SPQYKNLGKNNICNP 365
           +CGD+   +T  AV   K+  + +D +L + YIVLMRLG+F+   S  + NLG +++C  
Sbjct: 322 NCGDFLKKYTANAVNLKKVDVSIVDQALVYNYIVLMRLGFFENPKSLPFANLGPSDVCTK 381

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           ++ +LA EAA+QGIVLL+N+ GALPL+   IK LA++GP+ANAT  MI NY G PCRY+S
Sbjct: 382 ENQQLALEAAKQGIVLLENNKGALPLSKTKIKNLAVIGPNANATTVMISNYAGIPCRYSS 441

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+ G   Y   + YA GC+D+ C N ++  AA+ AA +ADA V+V GLD S+EAEG DRV
Sbjct: 442 PLQGLQKYISSVTYARGCSDVKCSNQNLFAAAVKAAASADAVVLVVGLDQSIEAEGLDRV 501

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           +L LPGFQ +L+  VA A KG + LVIM+AG +DI+F K+   I  ILWVGYPG++GG A
Sbjct: 502 NLTLPGFQEKLVKDVAAATKGTLILVIMAAGPIDISFTKSVSNIGGILWVGYPGQDGGNA 561

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
           IA VIFG YNPGGR P TWY  +YV ++P T M +R     NFPGRTY+F++G  +Y FG
Sbjct: 562 IAQVIFGDYNPGGRSPFTWYPQSYVDQVPMTDMNMRANSSRNFPGRTYRFYNGKSLYEFG 621

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD-------VKC 655
           YGLSY+ F   +AS+P +  I L K+               P   + +DD       + C
Sbjct: 622 YGLSYSTFSTHIASAPST--IMLQKNTSISK----------PLNNIFLDDQVIDISTISC 669

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPP---GIAGTHIKQVIGYERVFIAAGQSAKVG 712
            +  F+  I V+N G  DGS VV+V+ +PP    ++G  +KQ+IG+ER  +  G++  V 
Sbjct: 670 FNLTFSLVIGVKNNGPFDGSHVVLVFLEPPSSEAVSGVPLKQLIGFERAQVKVGKTEFVT 729

Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVG 742
             ++ CK L  VD+     L  G H ILVG
Sbjct: 730 VKIDICKMLSNVDSDGKRKLVIGQHNILVG 759


>gi|357445735|ref|XP_003593145.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
 gi|355482193|gb|AES63396.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
          Length = 775

 Score =  775 bits (2001), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 382/740 (51%), Positives = 507/740 (68%), Gaps = 30/740 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           +S + +CD  L   +R  DLV+R+TL EK+  +G+ A  V RLG+P YEWWSEALHGVS 
Sbjct: 50  VSSYGFCDKSLSVEDRVSDLVKRLTLQEKIGNLGNSAVEVSRLGIPKYEWWSEALHGVSN 109

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           IG      PGTHF S VPGATSFP  ILT ASFN SL++ IG  VS EARAMYN+G AGL
Sbjct: 110 IG------PGTHFSSLVPGATSFPMPILTAASFNTSLFQAIGSVVSNEARAMYNVGLAGL 163

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T+WSPNIN+ RDPRWGR  ETPGEDP +  +YA  YV+GLQ  +      D DS  LK++
Sbjct: 164 TYWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAGYVKGLQQTD------DGDSDKLKVA 217

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYD+DNW+G  R+ FD+ V++QD+ +TF  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 218 ACCKHYTAYDVDNWKGVQRYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNG 277

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCADP LL   IRG W  +GYIVSDCDS++ + +   +   T E+A A+ + +GLDLD
Sbjct: 278 KPTCADPDLLKGVIRGKWKLNGYIVSDCDSVEVLFKDQHY-TKTPEEAAAKTILSGLDLD 336

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG Y   +T GAV+QG + EA I+ ++   +  LMRLG+FDG P    Y NLG  ++C P
Sbjct: 337 CGSYLGQYTGGAVKQGLVDEASINNAVSNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTP 396

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           ++ ELA EAARQGIVLLKN  G+LPL++  IK+LA++GP+ANAT+ MIGNYEG PC+YTS
Sbjct: 397 ENQELAREAARQGIVLLKNSPGSLPLSSKAIKSLAVIGPNANATRVMIGNYEGIPCKYTS 456

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+ G  A+    +YAPGC D+ C N + I  A   A +ADAT+IV G +L++EAE  DRV
Sbjct: 457 PLQGLTAFVPT-SYAPGCPDVQCAN-AQIDDAAKIAASADATIIVVGANLAIEAESLDRV 514

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           ++LLPG Q +L+N+VA+ +KGPV LVIMS G +D++FAK N KI SILWVGYPGE GG A
Sbjct: 515 NILLPGQQQQLVNEVANVSKGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAA 574

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
           IADVIFG YNP GRLP+TWY  +YV KIP T+M +R  P   +PGRTY+F+ G  V+ FG
Sbjct: 575 IADVIFGSYNPSGRLPMTWYPQSYVEKIPMTNMNMRSDPATGYPGRTYRFYKGETVFSFG 634

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
            G+S+   ++K+  +P+ V + L +D +CR +          C ++ + D  C++  F  
Sbjct: 635 DGMSFGTVEHKIVKAPQLVSVPLAEDHECRSLE---------CKSLDVADEHCQNLAFDI 685

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
            + V+NMGKM  S  V+++  PP +     K ++G+E+V +A      V F ++ C  L 
Sbjct: 686 HLSVKNMGKMSSSHSVLLFFTPPNVHNAPQKHLLGFEKVQLAGKSEGMVRFKVDVCNDLS 745

Query: 723 IVDNAANSLLASGAHTILVG 742
           +VD   N  +  G H + VG
Sbjct: 746 VVDELGNRKVPLGDHMLHVG 765


>gi|356525896|ref|XP_003531557.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Glycine max]
          Length = 776

 Score =  773 bits (1997), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 380/740 (51%), Positives = 506/740 (68%), Gaps = 30/740 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+ + +CD  L   +R  DLV+R+TL EK+  + + A  V RLG+P YEWWSEALHGVS 
Sbjct: 51  LAGYGFCDKSLSLEDRVADLVKRLTLQEKIGSLVNSATSVSRLGIPKYEWWSEALHGVSN 110

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G      PGTHF S VPGATSFP  ILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 111 VG------PGTHFSSLVPGATSFPMPILTAASFNASLFEAIGRVVSTEARAMYNVGLAGL 164

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T+WSPNIN+ RDPRWGR  ETPGEDP +  +YA  YV+GLQ  +      D DS  LK++
Sbjct: 165 TYWSPNINIFRDPRWGRGQETPGEDPLLSSKYATGYVKGLQQTD------DGDSNKLKVA 218

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYDLDNW+G  R+ F++ VT+QDM +TF  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 219 ACCKHYTAYDLDNWKGIQRYTFNAVVTQQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNG 278

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCADP LL   IRG+W  +GYIVSDCDS++ + +   +   T E+A A  + AGLDL+
Sbjct: 279 KPTCADPDLLKGVIRGEWKLNGYIVSDCDSVEVLFKDQHY-TKTPEEAAAETILAGLDLN 337

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG+Y   +T GAV+QG + EA I+ ++   +  LMRLG+FDG P    Y NLG N++C  
Sbjct: 338 CGNYLGQYTEGAVKQGLLDEASINNAVSNNFATLMRLGFFDGDPSKQTYGNLGPNDVCTS 397

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           ++ ELA EAARQGIVLLKN  G+LPLN   IK+LA++GP+ANAT+ MIGNYEG PC Y S
Sbjct: 398 ENRELAREAARQGIVLLKNSLGSLPLNAKAIKSLAVIGPNANATRVMIGNYEGIPCNYIS 457

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+    A     +YA GC ++ C  N+ +  A   A +ADATVIV G  L++EAE  DR+
Sbjct: 458 PLQALTALVPT-SYAAGCPNVQCA-NAELDDATQIAASADATVIVVGASLAIEAESLDRI 515

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           ++LLPG Q  L+++VA+A+KGPV LVIMS G +D++FAK+N KI SILWVGYPGE GG A
Sbjct: 516 NILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAKSNDKITSILWVGYPGEAGGAA 575

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
           IADVIFG YNP GRLP+TWY  +YV K+P T+M +R  P   +PGRTY+F+ G  V+ FG
Sbjct: 576 IADVIFGFYNPSGRLPMTWYPQSYVNKVPMTNMNMRADPATGYPGRTYRFYKGETVFSFG 635

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
            G+S++  ++K+  +P+ V + L +D +CR            C ++ + D  C++  F  
Sbjct: 636 DGISFSNIEHKIVKAPQLVSVPLAEDHECR---------SSECMSLDVADEHCQNLAFDI 686

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
            + V+NMGKM  S VV+++  PP +     K ++G+E+V +     A+V F ++ CK L 
Sbjct: 687 HLGVKNMGKMSSSHVVLLFFTPPDVHNAPQKHLLGFEKVHLPGKSEAQVRFKVDICKDLS 746

Query: 723 IVDNAANSLLASGAHTILVG 742
           +VD   N  +  G H + VG
Sbjct: 747 VVDELGNRKVPLGQHLLHVG 766


>gi|356503923|ref|XP_003520749.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 775

 Score =  773 bits (1997), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 379/748 (50%), Positives = 508/748 (67%), Gaps = 32/748 (4%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
           + P+C A L  PER KDLV R+TL EKV+ + + A  VPRLG+  YEWWSEALHGVS +G
Sbjct: 42  NMPFCKASLAIPERVKDLVGRLTLQEKVRLLVNNAAAVPRLGMKGYEWWSEALHGVSNVG 101

Query: 71  RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
                 PG  F+++ PGATSFP VI T ASFN SLW+ IGQ VS EARAMYN G AGLT+
Sbjct: 102 ------PGVKFNAQFPGATSFPQVITTAASFNASLWEAIGQVVSDEARAMYNGGTAGLTY 155

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           WSPN+N+ RDPRWGR  ETPGEDP + G YA +YVRGLQ          +D   LK++AC
Sbjct: 156 WSPNVNIFRDPRWGRGQETPGEDPVLAGTYAASYVRGLQG---------TDGNRLKVAAC 206

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH+ AYDLDNW G DRFHF+++V++QD++ETF +PF MCV+EG V+SVMCSYN+VNG+P
Sbjct: 207 CKHFTAYDLDNWNGMDRFHFNAQVSKQDIEETFDVPFRMCVSEGKVASVMCSYNQVNGVP 266

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
           TCADP LL +T+RG W   GYIVSDCDS+    ++  +   T E+A A  +KAGLDLDCG
Sbjct: 267 TCADPNLLKKTVRGLWQLDGYIVSDCDSVGVFYDNQHY-TPTPEEAAADAIKAGLDLDCG 325

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
            +    T  AV++G ++EAD++ +L     V MRLG FDG P    Y  LG  ++C P H
Sbjct: 326 PFLAVHTQNAVEKGLLSEADVNGALVNTLTVQMRLGMFDGEPSAHAYGKLGPKDVCKPAH 385

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
            ELA EAARQGIVLLKN    LPL+     T+A++GP++ AT  MIGNY G  C YT+P+
Sbjct: 386 QELALEAARQGIVLLKNTGPVLPLSPQRHHTVAVIGPNSKATVTMIGNYAGVACGYTNPL 445

Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
            G   Y+K I +  GC ++ C+N+ +  +AI+AA+ ADATV+V GLD S+EAE  DR  L
Sbjct: 446 QGIGRYAKTI-HQLGCENVACKNDKLFGSAINAARQADATVLVMGLDQSIEAETVDRTGL 504

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
           LLPG Q +L++KVA A+KGP  LVIMS G+VDI FAKNNP+I  ILW GYPG+ GG AIA
Sbjct: 505 LLPGRQQDLVSKVAAASKGPTILVIMSGGSVDITFAKNNPRIVGILWAGYPGQAGGAAIA 564

Query: 548 DVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGYG 604
           D++FG  NPGG+LP+TWY   Y+ K+P T+M +R   +  +PGRTY+F++GPVVYPFG+G
Sbjct: 565 DILFGTTNPGGKLPVTWYPQEYLTKLPMTNMAMRGSKSAGYPGRTYRFYNGPVVYPFGHG 624

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
           L+YT F + +AS+P  V + L+     R  N T  +N+    A+ +   +C     + ++
Sbjct: 625 LTYTHFVHTLASAPTVVSVPLNGH---RRANVTNISNR----AIRVTHARCDKLSISLEV 677

Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHI--KQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
           +++N+G  DG+  ++V+S PP   G     KQ++ +E++ + A    +VG  ++ CK L 
Sbjct: 678 DIKNVGSRDGTHTLLVFSAPPAGFGHWALEKQLVAFEKIHVPAKGLQRVGVNIHVCKLLS 737

Query: 723 IVDNAANSLLASGAHTILVGEGVGGVSF 750
           +VD +    +  G H+  +G+    VS 
Sbjct: 738 VVDKSGIRRIPLGEHSFNIGDVKHSVSL 765


>gi|9972374|gb|AAG10624.1|AC022521_2 Similar to xylosidase [Arabidopsis thaliana]
          Length = 763

 Score =  772 bits (1993), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 374/743 (50%), Positives = 507/743 (68%), Gaps = 34/743 (4%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +C   +P PER +DL+ R+TL EKV  +G+ A  +PRLG+  YEWWSEALHGVS +G   
Sbjct: 39  FCQLSVPIPERVRDLIGRLTLAEKVSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVG--- 95

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
              PGT F    P ATSFP VI T ASFN SLW+ IG+ VS EARAMYN G  GLT+WSP
Sbjct: 96  ---PGTKFGGVYPAATSFPQVITTVASFNASLWESIGRVVSNEARAMYNGGVGGLTYWSP 152

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           N+N++RDPRWGR  ETPGEDP V G+YA +YVRGLQ          +D   LK++ACCKH
Sbjct: 153 NVNILRDPRWGRGQETPGEDPVVAGKYAASYVRGLQG---------NDRSRLKVAACCKH 203

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           + AYDLDNW G DRFHF+++V++QD+++TF +PF MCV EG+V+S+MCSYN+VNG+PTCA
Sbjct: 204 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVKEGNVASIMCSYNQVNGVPTCA 263

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
           DP LL +TIR  W  +GYIVSDCDS+  + ++  +   T E+A A  +KAGLDLDCG + 
Sbjct: 264 DPNLLKKTIRNQWGLNGYIVSDCDSVGVLYDTQHY-TGTPEEAAADSIKAGLDLDCGPFL 322

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG---SPQYKNLGKNNICNPQHIEL 370
              T+ AV++  + E+D+D +L     V MRLG FDG   +  Y +LG  ++C P H  L
Sbjct: 323 GAHTIDAVKKNLLRESDVDNALINTLTVQMRLGMFDGDIAAQPYGHLGPAHVCTPVHKGL 382

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A EAA+QGIVLLKN   +LPL++   +T+A++GP+++AT  MIGNY G  C YTSP+ G 
Sbjct: 383 ALEAAQQGIVLLKNHGSSLPLSSQRHRTVAVIGPNSDATVTMIGNYAGVACGYTSPVQGI 442

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
             Y++ I +  GC D+ C ++ +  AA++AA+ ADATV+V GLD S+EAE KDR  LLLP
Sbjct: 443 TGYARTI-HQKGCVDVHCMDDRLFDAAVEAARGADATVLVMGLDQSIEAEFKDRNSLLLP 501

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
           G Q EL+++VA AAKGPV LV+MS G +DI+FA+ + KI +I+W GYPG+EGG AIAD++
Sbjct: 502 GKQQELVSRVAKAAKGPVILVLMSGGPIDISFAEKDRKIPAIVWAGYPGQEGGTAIADIL 561

Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGYGLSY 607
           FG  NPGG+LP+TWY  +Y+  +P T M +RPV++   PGRTY+F+DGPVVYPFG+GLSY
Sbjct: 562 FGSANPGGKLPMTWYPQDYLTNLPMTEMSMRPVHSKRIPGRTYRFYDGPVVYPFGHGLSY 621

Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
           T+F + +A +PK + I +      R  N TV        ++ +   +C        +EV 
Sbjct: 622 TRFTHNIADAPKVIPIAV------RGRNGTVSGK-----SIRVTHARCDRLSLGVHVEVT 670

Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           N+G  DG+  ++V+S PPG      KQ++ +ERV +A G+  +V   ++ CK L +VD A
Sbjct: 671 NVGSRDGTHTMLVFSAPPGGEWAPKKQLVAFERVHVAVGEKKRVQVNIHVCKYLSVVDRA 730

Query: 728 ANSLLASGAHTILVGEGVGGVSF 750
            N  +  G H I +G+    VS 
Sbjct: 731 GNRRIPIGDHGIHIGDESHTVSL 753


>gi|18378991|ref|NP_563659.1| beta-glucosidase [Arabidopsis thaliana]
 gi|75250279|sp|Q94KD8.1|BXL2_ARATH RecName: Full=Probable beta-D-xylosidase 2; Short=AtBXL2; Flags:
           Precursor
 gi|14194121|gb|AAK56255.1|AF367266_1 At1g02640/T14P4_11 [Arabidopsis thaliana]
 gi|23506063|gb|AAN28891.1| At1g02640/T14P4_11 [Arabidopsis thaliana]
 gi|332189332|gb|AEE27453.1| beta-glucosidase [Arabidopsis thaliana]
          Length = 768

 Score =  771 bits (1992), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 374/743 (50%), Positives = 507/743 (68%), Gaps = 34/743 (4%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +C   +P PER +DL+ R+TL EKV  +G+ A  +PRLG+  YEWWSEALHGVS +G   
Sbjct: 44  FCQLSVPIPERVRDLIGRLTLAEKVSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVG--- 100

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
              PGT F    P ATSFP VI T ASFN SLW+ IG+ VS EARAMYN G  GLT+WSP
Sbjct: 101 ---PGTKFGGVYPAATSFPQVITTVASFNASLWESIGRVVSNEARAMYNGGVGGLTYWSP 157

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           N+N++RDPRWGR  ETPGEDP V G+YA +YVRGLQ          +D   LK++ACCKH
Sbjct: 158 NVNILRDPRWGRGQETPGEDPVVAGKYAASYVRGLQG---------NDRSRLKVAACCKH 208

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           + AYDLDNW G DRFHF+++V++QD+++TF +PF MCV EG+V+S+MCSYN+VNG+PTCA
Sbjct: 209 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVKEGNVASIMCSYNQVNGVPTCA 268

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
           DP LL +TIR  W  +GYIVSDCDS+  + ++  +   T E+A A  +KAGLDLDCG + 
Sbjct: 269 DPNLLKKTIRNQWGLNGYIVSDCDSVGVLYDTQHY-TGTPEEAAADSIKAGLDLDCGPFL 327

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG---SPQYKNLGKNNICNPQHIEL 370
              T+ AV++  + E+D+D +L     V MRLG FDG   +  Y +LG  ++C P H  L
Sbjct: 328 GAHTIDAVKKNLLRESDVDNALINTLTVQMRLGMFDGDIAAQPYGHLGPAHVCTPVHKGL 387

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A EAA+QGIVLLKN   +LPL++   +T+A++GP+++AT  MIGNY G  C YTSP+ G 
Sbjct: 388 ALEAAQQGIVLLKNHGSSLPLSSQRHRTVAVIGPNSDATVTMIGNYAGVACGYTSPVQGI 447

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
             Y++ I +  GC D+ C ++ +  AA++AA+ ADATV+V GLD S+EAE KDR  LLLP
Sbjct: 448 TGYARTI-HQKGCVDVHCMDDRLFDAAVEAARGADATVLVMGLDQSIEAEFKDRNSLLLP 506

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
           G Q EL+++VA AAKGPV LV+MS G +DI+FA+ + KI +I+W GYPG+EGG AIAD++
Sbjct: 507 GKQQELVSRVAKAAKGPVILVLMSGGPIDISFAEKDRKIPAIVWAGYPGQEGGTAIADIL 566

Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGYGLSY 607
           FG  NPGG+LP+TWY  +Y+  +P T M +RPV++   PGRTY+F+DGPVVYPFG+GLSY
Sbjct: 567 FGSANPGGKLPMTWYPQDYLTNLPMTEMSMRPVHSKRIPGRTYRFYDGPVVYPFGHGLSY 626

Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
           T+F + +A +PK + I +      R  N TV        ++ +   +C        +EV 
Sbjct: 627 TRFTHNIADAPKVIPIAV------RGRNGTVSGK-----SIRVTHARCDRLSLGVHVEVT 675

Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           N+G  DG+  ++V+S PPG      KQ++ +ERV +A G+  +V   ++ CK L +VD A
Sbjct: 676 NVGSRDGTHTMLVFSAPPGGEWAPKKQLVAFERVHVAVGEKKRVQVNIHVCKYLSVVDRA 735

Query: 728 ANSLLASGAHTILVGEGVGGVSF 750
            N  +  G H I +G+    VS 
Sbjct: 736 GNRRIPIGDHGIHIGDESHTVSL 758


>gi|115486735|ref|NP_001068511.1| Os11g0696400 [Oryza sativa Japonica Group]
 gi|77552754|gb|ABA95551.1| Glycosyl hydrolase family 3 C terminal domain containing protein
           [Oryza sativa Japonica Group]
 gi|113645733|dbj|BAF28874.1| Os11g0696400 [Oryza sativa Japonica Group]
          Length = 816

 Score =  771 bits (1991), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 398/790 (50%), Positives = 518/790 (65%), Gaps = 66/790 (8%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
           RF  + + +++F YCDA LPY +R +DL+ RMT+ EKV  +GD   G  R+GLP Y WWS
Sbjct: 57  RFAGLGLNMTEFRYCDASLPYADRVRDLIGRMTVEEKVGALGDWTDGAARIGLPAYRWWS 116

Query: 61  EALHGVSFIGRRTNSPPGTHFD-----------SEVPGATSFPTVILTTASFNESLWKKI 109
           EALHG+S  G      P T FD           S V  AT F  VI + ASFNE+LWK I
Sbjct: 117 EALHGLSSTG------PTTKFDDLATPHLHSGVSAVYNATVFANVINSAASFNETLWKSI 170

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           GQ VSTEARAMYN+G  GLT+WSPNINVVRDPRWGR LETPGEDPYVVGRYA+N+VRG+Q
Sbjct: 171 GQAVSTEARAMYNMGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQ 230

Query: 170 DV---EGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILP 226
           D+   E V    D ++RPLK SACCKHYAAYDLD+W  + RF FD+RV E+DM ETF  P
Sbjct: 231 DIPGHEAVAAGGDPNTRPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRP 290

Query: 227 FEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESH 286
           FEMCV +GDVSSVMCSYNRVNGIP CAD +LL+QTIR DW  HGYIVSDCD+++ + ++ 
Sbjct: 291 FEMCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNA 350

Query: 287 KFLNDTKEDAVARVLKAGLDLDCG-------------DYYTNFTMGAVQQGKIAEADIDT 333
            +L  T  +A A  LKAGLDLDCG             D+ T + M AV +GK+ E+DID 
Sbjct: 351 TWLGYTGAEASAAALKAGLDLDCGESWKNDTDGHPLMDFLTTYGMEAVNKGKMRESDIDN 410

Query: 334 SLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNT 393
           +L   Y+ LMRLGYFD   QY +LG+ +IC  QH  LA + ARQGIVLLKNDN  LPL+ 
Sbjct: 411 ALTNQYMTLMRLGYFDDIAQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDA 470

Query: 394 GNIKTLALVGPHANA-TKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNS 452
             +  + + GPH  A  K M G+Y G PCRY +P  G   Y +                 
Sbjct: 471 NKVGFVNVRGPHVQAPEKIMDGDYTGPPCRYVTPRQGVSKYVRF---------------- 514

Query: 453 MIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVI 512
                   +  A+ T+   GL+L++E EG DR D+LLP  QTE I +VA A+  P+ LVI
Sbjct: 515 --------SHRANTTIYFGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILVI 566

Query: 513 MSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-K 571
           +S G +D++FA+NNPKI +ILW GYPG EGG AIADVIFGK+NP GRLP+TW++  Y+ +
Sbjct: 567 LSGGGIDVSFAQNNPKIGAILWAGYPGGEGGNAIADVIFGKHNPSGRLPLTWFKNKYIYQ 626

Query: 572 IPYTSMPLRPV--NNFPGRTYKFFDGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKD 628
           +P TSM LRPV  + +PGRTYKF+DGP V+YPFGYGLSYT+F Y++ ++  ++ + +   
Sbjct: 627 LPMTSMDLRPVAKHGYPGRTYKFYDGPDVLYPFGYGLSYTKFLYEMGTNGTALIVPV-AG 685

Query: 629 QQCRDINYTVG-TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG- 686
             C+ ++Y  G +  P C A+ ++   C +   +F + V N G   GS  V+V+SKPP  
Sbjct: 686 GHCKKLSYKSGVSTAPACPAINVNGHVCTE-TVSFNVSVTNGGDTGGSHPVIVFSKPPAE 744

Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVG 746
           +    +KQV+ ++ VF+ A  +  V F +N CK+  IV+  A +++ SG  TILV     
Sbjct: 745 VDDAPMKQVVAFKSVFVPAWSTVSVSFELNVCKAFGIVEKTAYTVVPSGVSTILVENVDS 804

Query: 747 GVSFPLQLNL 756
            VSFP++++ 
Sbjct: 805 SVSFPVKIDF 814


>gi|125535311|gb|EAY81859.1| hypothetical protein OsI_37025 [Oryza sativa Indica Group]
          Length = 816

 Score =  769 bits (1985), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 397/792 (50%), Positives = 518/792 (65%), Gaps = 67/792 (8%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
           RF  + + +++F YCDA LPY +R +DL+ RMT+ EKV  +GD   G  R+GLP Y WWS
Sbjct: 56  RFAGLGLNMTEFRYCDASLPYADRVRDLIGRMTVEEKVGALGDWTDGAARIGLPAYRWWS 115

Query: 61  EALHGVSFIGRRTNSPPGTHFD-----------SEVPGATSFPTVILTTASFNESLWKKI 109
           EALHG+S  G      P T FD           S V  AT F  VI + ASFNE+LWK I
Sbjct: 116 EALHGLSSTG------PTTKFDDLATPHLHSGVSAVYNATVFANVINSAASFNETLWKSI 169

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           GQ VSTEARAMYN+G  GLT+WSPNINVVRDPRWGR LETPGEDPYVVGRYA+N+VRG+Q
Sbjct: 170 GQAVSTEARAMYNMGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQ 229

Query: 170 DV---EGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILP 226
           D+   E V    D ++RPLK SACCKHYAAYDLD+W  + RF FD+RV E+DM ETF  P
Sbjct: 230 DIPGHEAVAAGGDPNTRPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRP 289

Query: 227 FEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESH 286
           FEMCV +GDVSSVMCSYNRVNGIP CAD +LL+QTIR DW  HGYIVSDCD+++ + ++ 
Sbjct: 290 FEMCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNA 349

Query: 287 KFLNDTKEDAVARVLKAGLDLDCG-------------DYYTNFTMGAVQQGKIAEADIDT 333
            +L  T  +A A  LKAGLDLDCG             D+ T + M AV +GK+ E+DID 
Sbjct: 350 TWLGYTGAEASAAALKAGLDLDCGESWKNDTEGHPLMDFLTTYGMEAVNKGKMRESDIDN 409

Query: 334 SLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNT 393
           +L   Y+ LMRLGYFD   QY +LG+ +IC  QH  LA + ARQGIVLLKNDN  LPL+ 
Sbjct: 410 ALTNQYMTLMRLGYFDDITQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDA 469

Query: 394 GNIKTLALVGPHANA-TKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNS 452
             +  + + GPH  A  K M G+Y G PCRY +P  G   Y +                 
Sbjct: 470 NKVGFVNVRGPHVQAPEKIMDGDYTGPPCRYVTPRQGVSKYVRF---------------- 513

Query: 453 MIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVI 512
                   +  A+ T+   GL+L++E EG DR D+LLP  QTE I +VA A+  P+ LVI
Sbjct: 514 --------SHRANTTIYFGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILVI 565

Query: 513 MSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-K 571
           +S G +D++FA+NNPKI +ILW GYPG EGG AIADVIFGK+NP GRLP+TW++  Y+ +
Sbjct: 566 LSGGGIDVSFAQNNPKIGAILWAGYPGGEGGNAIADVIFGKHNPSGRLPLTWFKNKYIYQ 625

Query: 572 IPYTSMPLRPV--NNFPGRTYKFFDGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKD 628
           +P TSM LRPV  + +PGRTYKF++GP V+YPFGYGLSYT+F Y++ ++  ++ + +   
Sbjct: 626 LPMTSMDLRPVAKHGYPGRTYKFYNGPDVLYPFGYGLSYTKFLYEMGTNGTALTVPV-AG 684

Query: 629 QQCRDINYTVGTNK--PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
             C+ ++Y  G +   P C A+ ++   C +   +F + V N G   GS  V+V+SKPP 
Sbjct: 685 GHCKKLSYKSGVSSAAPACPAINVNGHACTE-TVSFNVSVTNGGDTGGSHPVIVFSKPPA 743

Query: 687 -IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
            +    IKQV+ +  VF+ A  +  V F +N CK+  IV+  A +++ SG  T+LV    
Sbjct: 744 EVDDAPIKQVVAFRSVFVPAWSTVSVSFELNVCKAFGIVEKTAYTVVPSGVSTVLVENVD 803

Query: 746 GGVSFPLQLNLN 757
             VSFP++++ +
Sbjct: 804 SSVSFPVKISFS 815


>gi|297834874|ref|XP_002885319.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
 gi|297331159|gb|EFH61578.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
          Length = 865

 Score =  768 bits (1984), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 387/747 (51%), Positives = 499/747 (66%), Gaps = 50/747 (6%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           + + +C+  L Y  RAKDLV R++L EKVQQ+ + A GV RLG+P YEWWSEALHGVS +
Sbjct: 37  AKYGFCNVSLSYEARAKDLVSRLSLKEKVQQLVNKATGVSRLGVPPYEWWSEALHGVSDV 96

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
           G      PG  F+  VPGATSFP  ILT ASFN SLW K+G+ VSTEARAM+N+G AGLT
Sbjct: 97  G------PGVRFNGTVPGATSFPATILTAASFNTSLWLKMGEVVSTEARAMHNVGLAGLT 150

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
           +WSPN+N+ RDPRWGR  ETPGEDP VV +YA+NYV+GLQDV+         SR LK+S+
Sbjct: 151 YWSPNVNIFRDPRWGRGQETPGEDPLVVSKYAVNYVKGLQDVQDA-----GKSRRLKVSS 205

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKHY AYDLDNW+G DRFHFD++VT+QD+++T+  PF+ CV EGDVSSVMCSYNRVNGI
Sbjct: 206 CCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQPPFKSCVEEGDVSSVMCSYNRVNGI 265

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           PTCADP LL   IRG W   GYIVSDCDSIQ   +   +             K  L+++C
Sbjct: 266 PTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFDDIHY------------TKTRLNMNC 313

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
           GD+   +T  AV+  K+  +++D +L + YIVLMRLG+FDG P+   +  LG +++C+  
Sbjct: 314 GDFLGKYTENAVKLKKLNGSEVDEALIYNYIVLMRLGFFDGDPKSLPFGQLGPSDVCSKD 373

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
           H  LA EAA+QGIVLL+N  G LPL+   +K +A++GP+ANATK MI NY G PC+YTSP
Sbjct: 374 HQMLALEAAKQGIVLLEN-RGDLPLSKTAVKKIAVIGPNANATKVMISNYAGVPCKYTSP 432

Query: 427 MDGFYAY--SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           + G   Y   KV+ Y PGC D+ C   ++I AA+ A   AD TV+V GLD +VEAEG DR
Sbjct: 433 LQGLQKYVPEKVV-YEPGCKDVNCGEQTLISAAVKAVSEADVTVLVVGLDQTVEAEGLDR 491

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
           V+L LPG+Q +L+  VA+AAK  V LVIMSAG +DI+FAKN   I ++LWVGYPGE GG 
Sbjct: 492 VNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKNLSTISAVLWVGYPGEAGGD 551

Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPF 601
           AIA VIFG YNP GRLP TWY   +  K+  T M +RP   + FPGR+Y+F+ G  +Y F
Sbjct: 552 AIAQVIFGDYNPSGRLPETWYSQEFADKVAMTDMNMRPNSTSGFPGRSYRFYTGKPIYKF 611

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           GYGLSY+ F   V S+P  + IK          N  +  NK    ++ I  V C D K  
Sbjct: 612 GYGLSYSAFSTFVLSAPSIIHIK---------TNPILNLNK--TTSIDISTVNCHDLKIR 660

Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTHI------KQVIGYERVFIAAGQSAKVGFTM 715
             I V+N G+  GS VV+V+ KPP  + T +       Q++G+ERV +    + KV    
Sbjct: 661 IVIGVKNRGQRSGSHVVLVFWKPPKCSKTLVGAGVPQTQLVGFERVEVGRSMTEKVTVEF 720

Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
           + CK+L +VD      L +G HT+++G
Sbjct: 721 DVCKALSLVDTHGKRKLVTGHHTLVIG 747


>gi|297843058|ref|XP_002889410.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335252|gb|EFH65669.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 763

 Score =  768 bits (1984), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/743 (50%), Positives = 507/743 (68%), Gaps = 34/743 (4%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +C   +P  ER KDL+ R+TL EKV  +G+ A  +PRLG+  YEWWSEALHGVS +G   
Sbjct: 39  FCQLSVPITERVKDLIGRLTLVEKVSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVG--- 95

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
              PGT F    P ATSFP VI T ASFN SLW+ IG+ VS EARAMYN G  GLT+WSP
Sbjct: 96  ---PGTKFGGVYPAATSFPQVITTVASFNASLWESIGRVVSNEARAMYNGGVGGLTYWSP 152

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           N+N++RDPRWGR  ETPGEDP V G+YA +YVRGLQ          +D   LK++ACCKH
Sbjct: 153 NVNILRDPRWGRGQETPGEDPVVAGKYAASYVRGLQG---------NDRSRLKVAACCKH 203

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           + AYDLDNW G DRFHF+++V++QD+++TF +PF MCV EG+V+S+MCSYN VNG+PTCA
Sbjct: 204 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVKEGNVASIMCSYNEVNGVPTCA 263

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
           DP LL +TIR +W  +GYIVSDCDS+  + ++  +   T E+A A  +KAGLDLDCG + 
Sbjct: 264 DPNLLKKTIRNEWGLNGYIVSDCDSVGVLYDTQHY-TGTPEEAAADSIKAGLDLDCGPFL 322

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG---SPQYKNLGKNNICNPQHIEL 370
              T+ AV++  + E+D+D +L     V MRLG FDG   +  Y +LG  ++C P H  L
Sbjct: 323 GAHTIDAVKKNLLRESDVDNALINTLTVQMRLGMFDGDIAAQPYGHLGPAHVCTPVHKGL 382

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A EAA+QGIVLLKN   +LPL++   +T+A++GP+++AT AMIGNY G  C YTSP+ G 
Sbjct: 383 ALEAAQQGIVLLKNHGSSLPLSSQRHRTVAVIGPNSDATVAMIGNYAGIACGYTSPVQGI 442

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
             Y++ + +  GC D+ C ++ +  AA++AA+ ADATV+V GLD S+EAE KDR  LLLP
Sbjct: 443 TGYARTV-HQKGCVDVHCMDDRLFDAAVEAARGADATVLVMGLDQSIEAEFKDRNSLLLP 501

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
           G Q ELI++VA AAKGPV LV+MS G +DI+FA+ + KI +I+W GYPG+EGG AIAD++
Sbjct: 502 GKQQELISRVAKAAKGPVILVLMSGGPIDISFAEKDRKIPAIVWAGYPGQEGGTAIADIL 561

Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGYGLSY 607
           FG  NPGG+LP+TWY  +Y+  +P T M +RP+++   PGRTY+F+DGPVVYPFG+GLSY
Sbjct: 562 FGSANPGGKLPMTWYPQDYLTNLPMTEMSMRPIHSKRIPGRTYRFYDGPVVYPFGHGLSY 621

Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
           T+F + +A +PK + I +      R  N TV        ++ +   +C        ++V 
Sbjct: 622 TRFTHSIADAPKVIPIAV------RGRNGTVSGK-----SIRVTHARCNRLSLGVHVDVT 670

Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           N+G  DG+  ++V+S PPG      KQ++ +ERV +A G+  +V   ++ CK L +VD A
Sbjct: 671 NVGSRDGTHTMLVFSAPPGGEWAPKKQLVAFERVHVAVGEKKRVQVNIHVCKYLSVVDRA 730

Query: 728 ANSLLASGAHTILVGEGVGGVSF 750
            N  +  G H I +G+    VS 
Sbjct: 731 GNRRIPIGDHGIHIGDESHTVSL 753


>gi|292630922|sp|A5JTQ2.1|XYL1_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 1;
           AltName: Full=Xylan
           1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 1;
           Short=MsXyl1; Includes: RecName: Full=Beta-xylosidase;
           AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
           Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
           Full=Alpha-N-arabinofuranosidase; AltName:
           Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
           Flags: Precursor
 gi|146762261|gb|ABQ45227.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
           varia]
          Length = 774

 Score =  768 bits (1984), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 379/740 (51%), Positives = 505/740 (68%), Gaps = 30/740 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           +S + +CD  L   +R  DLV+R+TL EK+  +G+ A  V RLG+P YEWWSEALHGVS 
Sbjct: 49  VSSYGFCDNSLSVEDRVSDLVKRLTLQEKIGNLGNSAVEVSRLGIPKYEWWSEALHGVSN 108

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           IG      PGTHF S VPGAT+FP  ILT ASFN SL++ IG  VS EARAMYN+G AGL
Sbjct: 109 IG------PGTHFSSLVPGATNFPMPILTAASFNTSLFQAIGSVVSNEARAMYNVGLAGL 162

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T+WSPNIN+ RDPRWGR  ETPGEDP +  +YA  YV+GLQ  +      D DS  LK++
Sbjct: 163 TYWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAGYVKGLQQTD------DGDSDKLKVA 216

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYD+DNW+G  R+ FD+ V++QD+ +TF  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 217 ACCKHYTAYDVDNWKGVQRYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNG 276

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCADP LL   IRG W  +GYIVSDCDS++ + +   +   T E+A A+ + +GLDLD
Sbjct: 277 KPTCADPDLLKGVIRGKWKLNGYIVSDCDSVEVLYKDQHY-TKTPEEAAAKTILSGLDLD 335

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG Y   +T GAV+QG + EA I  ++   +  LMRLG+FDG P    Y NLG  ++C P
Sbjct: 336 CGSYLGQYTGGAVKQGLVDEASITNAVSNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTP 395

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           ++ ELA EAARQGIVLLKN   +LPL++  IK+LA++GP+ANAT+ MIGNYEG PC+YTS
Sbjct: 396 ENQELAREAARQGIVLLKNSPRSLPLSSKAIKSLAVIGPNANATRVMIGNYEGIPCKYTS 455

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+ G  A+    +YAPGC D+ C N + I  A   A +ADAT+IV G +L++EAE  DRV
Sbjct: 456 PLQGLTAFVPT-SYAPGCPDVQCAN-AQIDDAAKIAASADATIIVVGANLAIEAESLDRV 513

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           ++LLPG Q +L+N+VA+ +KGPV LVIMS G +D++FAK N KI SILWVGYPGE GG A
Sbjct: 514 NILLPGQQQQLVNEVANVSKGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAA 573

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
           IADVIFG YNP GRLP+TWY  +YV K+P T+M +R  P   +PGRTY+F+ G  V+ FG
Sbjct: 574 IADVIFGSYNPSGRLPMTWYPQSYVEKVPMTNMNMRADPATGYPGRTYRFYKGETVFSFG 633

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
            G+S+   ++K+  +P+ V + L +D +CR +          C ++ + D  C++  F  
Sbjct: 634 DGMSFGTVEHKIVKAPQLVSVPLAEDHECRSLE---------CKSLDVADKHCQNLAFDI 684

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
            + V+NMGKM  S  V+++  PP +     K ++G+E+V +A      V F ++ C  L 
Sbjct: 685 HLSVKNMGKMSSSHSVLLFFTPPNVHNAPQKHLLGFEKVQLAGKSEGMVRFKVDVCNDLS 744

Query: 723 IVDNAANSLLASGAHTILVG 742
           +VD   N  +  G H + VG
Sbjct: 745 VVDELGNRKVPLGDHMLHVG 764


>gi|356558612|ref|XP_003547598.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Glycine max]
          Length = 776

 Score =  768 bits (1982), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 381/754 (50%), Positives = 509/754 (67%), Gaps = 34/754 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+ + +CD  L   +R  DLV+R+TL EK+  + + A  V RLG+P YEWWSEALHGVS 
Sbjct: 51  LAGYGFCDKSLSVEDRVADLVKRLTLQEKIGSLVNSATSVSRLGIPKYEWWSEALHGVSN 110

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G      PGTHF S VPGATSFP  ILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 111 VG------PGTHFSSLVPGATSFPMPILTAASFNASLFEAIGRVVSTEARAMYNVGLAGL 164

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T+WSPNIN+ RDPRWGR  ETPGEDP +  +YA  YV+GLQ  +      D DS  LK++
Sbjct: 165 TYWSPNINIFRDPRWGRGQETPGEDPLLSSKYATGYVKGLQQTD------DGDSNKLKVA 218

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYDLDNW+G  R+ F++ VT+QDM +TF  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 219 ACCKHYTAYDLDNWKGIQRYTFNAVVTQQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNG 278

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCADP LL   IRG+W  +GYIVSDCDS++ + +   +   T E+A A+ + AGLDL+
Sbjct: 279 KPTCADPDLLKGIIRGEWKLNGYIVSDCDSVEVLFKDQHY-TKTPEEAAAQTILAGLDLN 337

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG+Y   +T GAV+QG + EA I+ ++   +  LMRLG+FDG P    Y NLG  ++C  
Sbjct: 338 CGNYLGQYTEGAVKQGLLDEASINNAVSNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTS 397

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           ++ ELA EAARQGIVLLKN  G+LPLN   IK+LA++GP+ANAT+ MIGNYEG PC Y S
Sbjct: 398 ENRELAREAARQGIVLLKNSPGSLPLNAKTIKSLAVIGPNANATRVMIGNYEGIPCNYIS 457

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+    A     +YA GC ++ C  N+ +  A   A +ADATVI+ G  L++EAE  DR+
Sbjct: 458 PLQTLTALVPT-SYAAGCPNVQCA-NAELDDATQIAASADATVIIVGASLAIEAESLDRI 515

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           ++LLPG Q  L+++VA+A+KGPV LVIMS G +D++FAK+N KI SILWVGYPGE GG A
Sbjct: 516 NILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAKSNDKITSILWVGYPGEAGGAA 575

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
           IADVIFG YNP GRLP+TWY   YV K+P T+M +R  P   +PGRTY+F+ G  V+ FG
Sbjct: 576 IADVIFGFYNPSGRLPMTWYPQAYVNKVPMTNMNMRADPATGYPGRTYRFYKGETVFSFG 635

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
            G+S++  ++K+  +P+ V + L +D +CR            C ++ I D  C++  F  
Sbjct: 636 DGISFSSIEHKIVKAPQLVSVPLAEDHECR---------SSECMSLDIADEHCQNLAFDI 686

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
            + V+N GKM  S VV+++  PP +     K ++G+E+V +     A+V F ++ CK L 
Sbjct: 687 HLGVKNTGKMSTSHVVLLFFTPPDVHNAPQKHLLGFEKVHLPGKSEAQVRFKVDVCKDLS 746

Query: 723 IVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
           +VD   N  +  G H +     VG +  PL L +
Sbjct: 747 VVDELGNRKVPLGQHLL----HVGNLKHPLSLRV 776


>gi|147844622|emb|CAN82161.1| hypothetical protein VITISV_035506 [Vitis vinifera]
          Length = 925

 Score =  768 bits (1982), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 388/746 (52%), Positives = 502/746 (67%), Gaps = 24/746 (3%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S FP+C+  LPY +RA DLV R+TL EK +Q+ + A G+ RLG+P YEWWSEALHGVS  
Sbjct: 37  SQFPFCNTSLPYQDRASDLVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGVS-- 94

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
               NS  G HF   +P  T FP VIL+ ASFNESLW  +GQ VSTE RAMYN+G AGLT
Sbjct: 95  ----NSGIGVHFHDPIPAVTIFPAVILSAASFNESLWYTMGQVVSTEGRAMYNVGQAGLT 150

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
           +WSPN+N+ RDPRWGR  ETPGEDP VV RYA+NYVRGLQ+V G E +  +D   LK+S+
Sbjct: 151 YWSPNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEV-GKEGNFAADR--LKVSS 207

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKHY AYD+D W+G DRFHFD++VT QD+++T+  PF+ CV EG VSSVMCSYNRVNG+
Sbjct: 208 CCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKXCVEEGHVSSVMCSYNRVNGV 267

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           PTCA+P+LL   IR  W   GYIVSDCDSI    E   +  +T EDAVA  LKAGL+L+C
Sbjct: 268 PTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNY-TETPEDAVALALKAGLNLNC 326

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
           G Y  ++T  AV  GK+ E+ +B +L + YIVLMRLG+FDG P    +  +G +++C   
Sbjct: 327 GSYLGDYTKNAVNLGKVKESIVBQALIYNYIVLMRLGFFDGDPTMLPFGKMGPSDVCTVD 386

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
           H  LA +AA+QGIVLL N NGALPL+    KTLA++GP+A+AT  M+ NY G PCRYTSP
Sbjct: 387 HQLLALDAAKQGIVLLHN-NGALPLSPNTTKTLAVIGPNADATNTMLSNYAGVPCRYTSP 445

Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
           + G   Y   ++Y  GCA++ C   ++I  A   A  ADATV+V GLDL +EAE  DRV+
Sbjct: 446 LQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVVGLDLFIEAEDLDRVN 505

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           L LPGFQ +L+ + A AA G V LV+MSAG VDI+F KN  KI  ILWVGYPG+ GG AI
Sbjct: 506 LTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGGILWVGYPGQAGGDAI 565

Query: 547 ADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGY 603
           + VIFG YNPGGR P TWY   YV ++P T M +RP    NFPGRTY+F+ G  +Y FG+
Sbjct: 566 SQVIFGDYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATXNFPGRTYRFYTGKSLYQFGH 625

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDI---NY-TVGTNKPPCAAVLIDDVKCKDYK 659
           GLSY+ F   + S+P +V + L       +I   NY T+        A+ I  + C++  
Sbjct: 626 GLSYSTFYKFIKSAPXTVLVHLLPQMDMPNIFSSNYPTMPNPNTNGQAIDISAIDCRNLS 685

Query: 660 -FTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
                I V+N G++DG+ VV+ + KPP  G+ G    +++G+ERV +  G++  VG  ++
Sbjct: 686 NIDIVIGVKNAGEIDGTHVVLAFWKPPRSGVRGAPGVELVGFERVEVKRGKTEMVGMRLD 745

Query: 717 ACKSLKIVDNAANSLLASGAHTILVG 742
            C  +  VD      L  G HT++VG
Sbjct: 746 VCGKISNVDEEGKRKLVMGMHTLVVG 771


>gi|225428983|ref|XP_002264114.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
          Length = 818

 Score =  768 bits (1982), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 388/747 (51%), Positives = 503/747 (67%), Gaps = 24/747 (3%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S FP+C+  LPY +RA DLV R+TL EK +Q+ + A G+ RLG+P YEWWSEALHGVS  
Sbjct: 61  SQFPFCNTSLPYQDRASDLVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGVS-- 118

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
               NS  G HF   +P  T FP VIL+ ASFNESLW  +GQ VSTE RAMYN+G AGLT
Sbjct: 119 ----NSGIGVHFHDPIPAVTIFPAVILSAASFNESLWYTMGQVVSTEGRAMYNVGQAGLT 174

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
           +WSPN+N+ RDPRWGR  ETPGEDP VV RYA+NYVRGLQ+V G E +  +D   LK+S+
Sbjct: 175 YWSPNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEV-GKEGNFAADR--LKVSS 231

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKHY AYD+D W+G DRFHFD++VT QD+++T+  PF+ CV EG VSSVMCSYNRVNG+
Sbjct: 232 CCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKSCVEEGHVSSVMCSYNRVNGV 291

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           PTCA+P+LL   IR  W   GYIVSDCDSI    E   +  +T EDAVA  LKAGL+L+C
Sbjct: 292 PTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNY-TETPEDAVALALKAGLNLNC 350

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
           G Y  ++T  AV  GK+ E+ ++ +L + YIVLMRLG+FDG P    +  +G +++C   
Sbjct: 351 GSYLGDYTKNAVNLGKVKESIVNQALIYNYIVLMRLGFFDGDPTMLPFGKMGPSDVCTVD 410

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
           H  LA +AA+QGIVLL N NGALPL+    KTLA++GP+A+AT  M+ NY G PCRYTSP
Sbjct: 411 HQLLALDAAKQGIVLLHN-NGALPLSPNTTKTLAVIGPNADATNTMLSNYAGVPCRYTSP 469

Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
           + G   Y   ++Y  GCA++ C   ++I  A   A  ADATV+V GLDL +EAE  DRV+
Sbjct: 470 LQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVVGLDLFIEAEDLDRVN 529

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           L LPGFQ +L+ + A AA G V LV+MSAG VDI+F KN  KI  ILWVGYPG+ GG AI
Sbjct: 530 LTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGGILWVGYPGQAGGDAI 589

Query: 547 ADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGY 603
           + VIFG YNPGGR P TWY   YV ++P T M +RP   +NFPGRTY+F+ G  +Y FG+
Sbjct: 590 SQVIFGDYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATSNFPGRTYRFYTGKSLYQFGH 649

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDI---NY-TVGTNKPPCAAVLIDDVKCKDYK 659
           GLSY+ F   + S+P +V + L       +I   NY T+        A+ I  + C++  
Sbjct: 650 GLSYSTFYKFIKSAPTTVLVHLLPQMDMPNIFSSNYPTMPNPNTNGQAIDISAIDCRNLS 709

Query: 660 -FTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
                I V+N G++DG+ VV+ + KPP  G+ G    +++G+ERV +  G++  VG  ++
Sbjct: 710 NIDIVIGVKNAGEIDGTHVVLAFWKPPRSGVRGAPGVELVGFERVEVKRGKTEMVGMRLD 769

Query: 717 ACKSLKIVDNAANSLLASGAHTILVGE 743
            C  +  VD      L  G HT++VG 
Sbjct: 770 VCGKISNVDEEGKRKLVMGMHTLVVGS 796


>gi|356501877|ref|XP_003519750.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 772

 Score =  766 bits (1977), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 377/741 (50%), Positives = 495/741 (66%), Gaps = 32/741 (4%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
           + P+C A L    R KDL+ R+TL EKV  + + A  VPRLG+  YEWWSEALHGVS +G
Sbjct: 39  NLPFCKASLATGARVKDLIGRLTLQEKVNLLVNNAAAVPRLGIKGYEWWSEALHGVSNVG 98

Query: 71  RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
                 PGT F  + P ATSFP VI T ASFN SLW+ IG+  S EARAMYN G AGLT+
Sbjct: 99  ------PGTKFGGQFPAATSFPQVITTAASFNASLWEAIGRVASDEARAMYNGGTAGLTY 152

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           WSPN+N+ RDPRWGR  ETPGEDP + G+YA +YVRGLQ  +G           LK++A 
Sbjct: 153 WSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQGTDGNR---------LKVAAS 203

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH+ AYDLDNW G DRFHF+++V++QD+++TF +PF MCV EG V+SVMCSYN+VNG+P
Sbjct: 204 CKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPFRMCVKEGKVASVMCSYNQVNGVP 263

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
           TCADP LL +T+RG W  +GYIVSDCDS+     S  +   T E+A A  +KAGLDLDCG
Sbjct: 264 TCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQHY-TSTPEEAAADAIKAGLDLDCG 322

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
            +    T  AV++G I+EAD++ +L     V MRLG +DG P    Y NLG  ++C   H
Sbjct: 323 PFLGQHTQNAVKKGLISEADVNGALLNTLTVQMRLGMYDGEPSSHPYNNLGPRDVCTQSH 382

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
            ELA EAARQGIVLLKN   +LPL+T   +T+A++GP++N T  MIGNY G  C YTSP+
Sbjct: 383 QELALEAARQGIVLLKNKGPSLPLSTRRGRTVAVIGPNSNVTFTMIGNYAGIACGYTSPL 442

Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
            G   Y+K I Y  GCA++ C ++     AI+AA+ ADATV+V GLD S+EAE  DR  L
Sbjct: 443 QGIGTYTKTI-YEHGCANVACTDDKQFGRAINAAQQADATVLVMGLDQSIEAETVDRASL 501

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
           LLPG Q +L++KVA A+KGP  LVIMS G VDI FAKN+P+I+ ILW GYPG+ GG AIA
Sbjct: 502 LLPGHQQDLVSKVAAASKGPTILVIMSGGPVDITFAKNDPRIQGILWAGYPGQAGGAAIA 561

Query: 548 DVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGYG 604
           D++FG  NPGG+LP+TWY   Y+K +P T+M +R   +  +PGRTY+F++GPVVYPFGYG
Sbjct: 562 DILFGTSNPGGKLPMTWYPQGYIKNLPMTNMAMRASRSKGYPGRTYRFYNGPVVYPFGYG 621

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
           LSYT F + + S+PK V I +D  +     N     NK    A+ +   +C        +
Sbjct: 622 LSYTHFVHTLTSAPKLVSIPVDGHRHGNSSNI---ANK----AIKVTHARCGKLSINLHV 674

Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHI--KQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
           +V+N+G  DG   ++V+S PP   G     KQ++ +E+V I A    +V   ++ CK L 
Sbjct: 675 DVKNVGSKDGIHTLLVFSAPPAGNGHWAPHKQLVAFEKVHIPAKAQQRVRVKIHVCKLLS 734

Query: 723 IVDNAANSLLASGAHTILVGE 743
           +VD +    +  G H++ +G+
Sbjct: 735 VVDRSGTRRIPMGLHSLHIGD 755


>gi|356534827|ref|XP_003535953.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 771

 Score =  765 bits (1976), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/741 (50%), Positives = 500/741 (67%), Gaps = 32/741 (4%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
           + P+C A L    R KDL+ R+TL EKV  + + A  VPRLG+  YEWWSEALHGVS +G
Sbjct: 38  NLPFCKAWLATGARVKDLIGRLTLQEKVNLLVNNAAAVPRLGIKGYEWWSEALHGVSNVG 97

Query: 71  RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
                 PGT F  + P ATSFP VI T ASFN SLW+ IG+  S EARAMYN G AGLT+
Sbjct: 98  ------PGTKFGGQFPAATSFPQVITTAASFNASLWEAIGRVASDEARAMYNGGTAGLTY 151

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           WSPN+N+ RDPRWGR  ETPGEDP + G+YA +YVRGLQ+ +G           LK++A 
Sbjct: 152 WSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQETDGNR---------LKVAAS 202

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH+ AYDLDNW G DRFHF+++V++QD+++TF +PF MCV EG V+SVMCSYN+VNG+P
Sbjct: 203 CKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPFRMCVKEGKVASVMCSYNQVNGVP 262

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
           TCADP LL +T+RG W  +GYIVSDCDS+     S  +   T E+A A  +KAGLDLDCG
Sbjct: 263 TCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQHY-TSTPEEAAADAIKAGLDLDCG 321

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
            +    T  AV++G I+E D++ +L     V MRLG +DG P    Y  LG  ++C P H
Sbjct: 322 PFLGQHTQNAVKKGLISETDVNGALLNTLTVQMRLGMYDGEPSSHPYGKLGPRDVCTPSH 381

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
            ELA EAARQGIVLLKN   +LPL+T    T+A++GP++N T  MIGNY G  C YTSP+
Sbjct: 382 QELALEAARQGIVLLKNKGPSLPLSTRRHPTVAVIGPNSNVTVTMIGNYAGIACGYTSPL 441

Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
           +G   Y+K I +  GCA++ C N+     AI+ A+ ADATV+V GLD S+EAE  DR  L
Sbjct: 442 EGIGRYTKTI-HELGCANVACTNDKQFGRAINVAQQADATVLVMGLDQSIEAETVDRAGL 500

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
           LLPG Q +L++KVA A+KGP  LVIMS G VDI FAKNNP+I++ILW GYPG+ GG AIA
Sbjct: 501 LLPGRQQDLVSKVAAASKGPTILVIMSGGPVDITFAKNNPRIQAILWAGYPGQAGGAAIA 560

Query: 548 DVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGYG 604
           D++FG  NPGG+LP+TWY   Y+K +P T+M +R   +  +PGRTY+F++GPVVYPFGYG
Sbjct: 561 DILFGTSNPGGKLPMTWYPQGYIKNLPMTNMAMRASRSKGYPGRTYRFYNGPVVYPFGYG 620

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
           LSYT F + +AS+PK V I +D     R  N +   NK    A+ +   +C     + Q+
Sbjct: 621 LSYTHFVHTLASAPKLVSIPVDGH---RHGNSSSIANK----AIKVTHARCGKLSISLQV 673

Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHI--KQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
           +V+N+G  DG+  ++V+S PP   G     KQ++ ++++ I +    +V   ++ CK L 
Sbjct: 674 DVKNVGSKDGTHTLLVFSAPPAGNGHWAPHKQLVAFQKLHIPSKAQQRVNVNIHVCKLLS 733

Query: 723 IVDNAANSLLASGAHTILVGE 743
           +VD +    +  G H++ +G+
Sbjct: 734 VVDRSGTRRVPMGLHSLHIGD 754


>gi|371917282|dbj|BAL44717.1| SlArf/Xyl2 [Solanum lycopersicum]
          Length = 774

 Score =  765 bits (1976), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/748 (50%), Positives = 506/748 (67%), Gaps = 29/748 (3%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
             +FP+C   LP  +R +DL+ R+TL EKV+ +G+ A  VPRLG+  YEWWSEALHGVS 
Sbjct: 40  FRNFPFCQTNLPIGDRVRDLIGRLTLQEKVKLLGNNAAAVPRLGIKGYEWWSEALHGVSN 99

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G      PGT F  E PGATSFP VI T ASFN SLW++IG+ VS EARAMYN    GL
Sbjct: 100 VG------PGTKFGGEFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAMYNGEMGGL 153

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T+WSPN+N+ RDPRWGR  ETPGEDP V   YA  YVRGLQ  E      D DS  LK++
Sbjct: 154 TYWSPNVNIFRDPRWGRGQETPGEDPVVAALYAERYVRGLQGNE------DGDS--LKVA 205

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYDLDNW G DRFHF+++VT+QD+++TF +PF  CV +G V+S+MCSYN+VNG
Sbjct: 206 ACCKHYTAYDLDNWGGVDRFHFNAKVTKQDIEDTFDVPFRSCVKQGKVASIMCSYNQVNG 265

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           IPTCADP+LL +TIRG W  +GYIVSDCDS+    ++  +   T E+A A  +KAGLDLD
Sbjct: 266 IPTCADPQLLRKTIRGGWGLNGYIVSDCDSVGVFYDTQHY-TSTPEEAAAAAIKAGLDLD 324

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNP 365
           CG + +  T  AV  G + EA IDT+L     V MRLG FDG P   QY +LG  ++C+P
Sbjct: 325 CGPFLSQHTENAVHIGILKEAAIDTNLANTVAVQMRLGMFDGEPSAQQYGHLGPRDVCSP 384

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            H ELA EAARQGIVLLKN   ALPL+    +T+A++GP+++ T  MIGNY G  C YTS
Sbjct: 385 AHQELAVEAARQGIVLLKNHGPALPLSPRRHRTVAVIGPNSDVTVTMIGNYAGVACGYTS 444

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+ G   Y+K I +  GC D+ C ++ +   A++AA+ ADATV+V GLD S+EAE +DR 
Sbjct: 445 PLQGISKYAKTI-HEKGCGDVACSDDKLFAGAVNAARQADATVLVMGLDQSIEAEFRDRT 503

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            LLLPGFQ ELI++V+ A++GPV LV+MS G VD+ FA N+P+I +I+W GYPG+ GG A
Sbjct: 504 GLLLPGFQQELISEVSKASRGPVVLVLMSGGPVDVTFANNDPRIGAIVWAGYPGQGGGAA 563

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
           IADV+FG +NPGG+LP+TWY   Y+  +P T+M +R      +PGRTY+F+ GP+VYPFG
Sbjct: 564 IADVLFGAHNPGGKLPMTWYPQEYLNNLPMTTMDMRSNLAKGYPGRTYRFYKGPLVYPFG 623

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
           +GLSYT+F   +  +PK++ I +D        N +  +NK    ++ +   KC       
Sbjct: 624 HGLSYTKFITTIFEAPKTLAIPIDGRHT---YNSSTISNK----SIRVTHAKCSKISVQI 676

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
            ++V+N+G  DGS  ++V+SKPP       KQ++ +++V++ A    +V   ++ CK L 
Sbjct: 677 HVDVKNVGPKDGSHTLLVFSKPPVDIWVPHKQLVAFQKVYVPARSKQRVAINIHVCKYLS 736

Query: 723 IVDNAANSLLASGAHTILVGEGVGGVSF 750
           +VD A    +  G H+I +G+    +S 
Sbjct: 737 VVDRAGVRRIPIGEHSIHIGDAKHSLSL 764


>gi|359485890|ref|XP_002264183.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Vitis vinifera]
          Length = 774

 Score =  765 bits (1976), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 384/740 (51%), Positives = 498/740 (67%), Gaps = 30/740 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L  F +C+  L    R  DLV+R+TL EK+  + + A  V RLG+P YEWWSEALHGVS+
Sbjct: 49  LGQFGFCNTSLETAARVADLVKRLTLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSY 108

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G      PGTHF+S VPGATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 109 VG------PGTHFNSVVPGATSFPQVILTAASFNASLFEAIGKAVSTEARAMYNVGLAGL 162

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFWSPN+N+ RDPRWGR  ETPGEDP +  +YA  YVRGLQ  +      D     LK++
Sbjct: 163 TFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVRGLQQSD------DGSPDRLKVA 216

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYDLDNW+G DRFHF++ VT+QDM +TF  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 217 ACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNG 276

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            P CADP LL+  +RG+W  +GYIVSDCDS+     S  +   T E+A A+ + AGLDL+
Sbjct: 277 KPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYNSQHY-TKTPEEAAAKAILAGLDLN 335

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG +    T  AV+ G + E+ +D ++   +  LMRLG+FDG+P    Y  LG  ++C  
Sbjct: 336 CGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGPKDVCTS 395

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H ELA EAARQGIVLLKN  G+LPL+   IKTLA++GP+AN TK MIGNYEGTPC+YT+
Sbjct: 396 EHQELAREAARQGIVLLKNSKGSLPLSPTAIKTLAVIGPNANVTKTMIGNYEGTPCKYTT 455

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+ G  A      Y PGC+++ C   + I  A   A  ADATV++ G+D S+EAEG+DRV
Sbjct: 456 PLQGLTALVAT-TYLPGCSNVAC-GTAQIDEAKKIAAAADATVLIVGIDQSIEAEGRDRV 513

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           ++ LPG Q  LI +VA A+KG V LV+MS G  DI+FAKN+ KI SILWVGYPGE GG A
Sbjct: 514 NIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISFAKNDDKITSILWVGYPGEAGGAA 573

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
           IADVIFG YNP GRLP+TWY  +YV K+P T+M +R  P + +PGRTY+F+ G  +Y FG
Sbjct: 574 IADVIFGFYNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPASGYPGRTYRFYTGETIYTFG 633

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
            GLSYTQF + +  +PKSV I +++   C         +   C +V      C++  F  
Sbjct: 634 DGLSYTQFNHHLVQAPKSVSIPIEEGHSC---------HSSKCKSVDAVQESCQNLVFDI 684

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
            + V N G + GS  V ++S PP +  +  K ++G+E+VF+ A   A V F ++ CK L 
Sbjct: 685 HLRVNNAGNISGSHTVFLFSSPPSVHNSPQKHLLGFEKVFVTAKAKALVRFKVDVCKDLS 744

Query: 723 IVDNAANSLLASGAHTILVG 742
           IVD      +A G H + VG
Sbjct: 745 IVDELGTRKVALGLHVLHVG 764


>gi|292630923|sp|A5JTQ3.1|XYL2_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 2;
           AltName: Full=Xylan
           1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 2;
           Short=MsXyl2; Includes: RecName: Full=Beta-xylosidase;
           AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
           Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
           Full=Alpha-N-arabinofuranosidase; AltName:
           Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
           Flags: Precursor
 gi|146762263|gb|ABQ45228.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
           varia]
          Length = 774

 Score =  764 bits (1974), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 377/745 (50%), Positives = 505/745 (67%), Gaps = 38/745 (5%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+++ +C+ KL    R KDLV R+TL EKV  + + A  V RLG+P YEWWSEALHGVS 
Sbjct: 49  LANYGFCNKKLSVDARVKDLVRRLTLQEKVGNLVNSAVDVSRLGIPKYEWWSEALHGVSN 108

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           IG      PGTHF + +PGATSFP  IL  ASFN SL++ IG+ VSTEARAM+N+G AGL
Sbjct: 109 IG------PGTHFSNVIPGATSFPMPILIAASFNASLFQTIGKVVSTEARAMHNVGLAGL 162

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T+WSPNIN+ RDPRWGR  ETPGEDP +  +YA  YV+GLQ  +      D DS  LK++
Sbjct: 163 TYWSPNINIFRDPRWGRGQETPGEDPLLASKYAAGYVKGLQQTD------DGDSNKLKVA 216

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYD+D+W+G  R+ F++ VT+QD+ +T+  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 217 ACCKHYTAYDVDDWKGVQRYTFNAVVTQQDLDDTYQPPFKSCVIDGNVASVMCSYNQVNG 276

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCADP LL   IRG W  +GYIVSDCDS+  + ++  +   T E+A A+ + AGLDL+
Sbjct: 277 KPTCADPDLLKGVIRGKWKLNGYIVSDCDSVDVLFKNQHY-TKTPEEAAAKSILAGLDLN 335

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG +   +T GAV+QG I EA I+ ++   +  LMRLG+FDG P    Y NLG  ++C  
Sbjct: 336 CGSFLGRYTEGAVKQGLIGEASINNAVYNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTS 395

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            + ELA EAARQGIVLLKN  G+LPLN   IK+LA++GP+ANAT+AMIGNYEG PC+YTS
Sbjct: 396 ANQELAREAARQGIVLLKNCAGSLPLNAKAIKSLAVIGPNANATRAMIGNYEGIPCKYTS 455

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAK----NADATVIVAGLDLSVEAEG 481
           P+ G  A     ++A GC D+ C N     AA+D AK    +ADATVIV G +L++EAE 
Sbjct: 456 PLQGLTALVPT-SFAAGCPDVQCTN-----AALDDAKKIAASADATVIVVGANLAIEAES 509

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
            DR+++LLPG Q +L+ +VA+ AKGPV L IMS G +D++FAK N KI SILWVGYPGE 
Sbjct: 510 HDRINILLPGQQQQLVTEVANVAKGPVILAIMSGGGMDVSFAKTNKKITSILWVGYPGEA 569

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVV 598
           GG AIADVIFG +NP GRLP+TWY  +YV K+P T+M +R  P   +PGRTY+F+ G  V
Sbjct: 570 GGAAIADVIFGYHNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPATGYPGRTYRFYKGETV 629

Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
           + FG G+SY+ F++K+  +P+ V + L +D  CR            C ++ +    C++ 
Sbjct: 630 FSFGDGISYSTFEHKLVKAPQLVSVPLAEDHVCRS---------SKCKSLDVVGEHCQNL 680

Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
            F   + ++N GKM  S+ V ++S PP +     K ++ +E+V +     A V F ++ C
Sbjct: 681 AFDIHLRIKNKGKMSSSQTVFLFSTPPAVHNAPQKHLLAFEKVLLTGKSEALVSFKVDVC 740

Query: 719 KSLKIVDNAANSLLASGAHTILVGE 743
           K L +VD   N  +A G H + VG+
Sbjct: 741 KDLGLVDELGNRKVALGKHMLHVGD 765


>gi|359481045|ref|XP_002268626.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Vitis vinifera]
 gi|296089342|emb|CBI39114.3| unnamed protein product [Vitis vinifera]
          Length = 774

 Score =  763 bits (1970), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 382/741 (51%), Positives = 497/741 (67%), Gaps = 30/741 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L  F +C+  L    R  DLV+R+TL EK+  + + A  V RLG+P YEWWSEALHGVS+
Sbjct: 49  LGQFGFCNTSLETAARVADLVKRLTLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSY 108

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G      PGTHF+S VPGATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 109 VG------PGTHFNSIVPGATSFPQVILTAASFNASLFEAIGKVVSTEARAMYNVGLAGL 162

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFWSPN+N+ RDPRWGR  ETPGEDP +  +YA  YVRGLQ  +      D     LK++
Sbjct: 163 TFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASAYVRGLQQGD------DGSPDRLKVA 216

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYDLDNW+G DR HF++ VT+QDM +TF  PF+ CV +G+V+SVMCS+N+VNG
Sbjct: 217 ACCKHYTAYDLDNWKGVDRLHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSFNQVNG 276

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCADP LL+  +RG+W  +GYIVSDCDS+     S  +   T E+A A+ + AGLDL+
Sbjct: 277 KPTCADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYNSQHY-TKTPEEAAAKAILAGLDLN 335

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG +    T  AV+ G + E+ +D ++   +  LMRLG+FDG+P    Y  LG  ++C  
Sbjct: 336 CGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGPKDVCTS 395

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H E+A EAARQGIVLLKN  G+LPL+   IKTLA++GP+AN TK MIGNYEGTPC+YT+
Sbjct: 396 EHQEMAREAARQGIVLLKNSKGSLPLSPTAIKTLAIIGPNANVTKTMIGNYEGTPCKYTT 455

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+ G  A      Y PGC+++ C   + I  A   A  ADATV++ G+D S+EAEG+DRV
Sbjct: 456 PLQGLTALVAT-TYLPGCSNVAC-GTAQIDEAKKIAAAADATVLIVGIDQSIEAEGRDRV 513

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            + LPG Q  LI +VA A+KG V LV+MS G  DI+FAKN+ KI SILWVGYPGE GG A
Sbjct: 514 SIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISFAKNDDKIASILWVGYPGEAGGAA 573

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
           IADVIFG YNP GRLP+TWY  +YV K+P T+M +R  P + +PGRTY+F+ G  +Y FG
Sbjct: 574 IADVIFGFYNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPASGYPGRTYRFYTGETIYTFG 633

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
            GLSYTQF + +  +PKSV I +++   C         +   C +V      C++  F  
Sbjct: 634 DGLSYTQFNHHLVQAPKSVSIPIEEGHSC---------HSSKCKSVDAVQESCQNLAFDI 684

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
            + V N G + GS  V ++S PP +  +  K ++G+E+VF+ A   A V F ++ CK L 
Sbjct: 685 HLRVNNAGNISGSHTVFLFSSPPSVHNSPQKHLLGFEKVFVTAKAEALVRFKVDVCKDLS 744

Query: 723 IVDNAANSLLASGAHTILVGE 743
           IVD      +A G H + VG 
Sbjct: 745 IVDELGTQKVALGLHVLHVGS 765


>gi|224054312|ref|XP_002298197.1| predicted protein [Populus trichocarpa]
 gi|222845455|gb|EEE83002.1| predicted protein [Populus trichocarpa]
          Length = 741

 Score =  760 bits (1963), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 380/744 (51%), Positives = 500/744 (67%), Gaps = 32/744 (4%)

Query: 8   KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
            L+ F +C+  L   +R  DLV+R+TL EK+  + + A  V RLG+P YEWWSEALHGVS
Sbjct: 13  SLASFGFCNTSLGVSDRVVDLVKRLTLQEKILFLVNSAGSVSRLGIPKYEWWSEALHGVS 72

Query: 68  FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
           ++G      PGTHF S VPGATSFP VILT ASFN SL+  IG+ VSTEARAMYN+G AG
Sbjct: 73  YVG------PGTHFSSVVPGATSFPQVILTAASFNTSLFVAIGKVVSTEARAMYNVGLAG 126

Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
           LTFWSPNIN+ RDPRWGR  ETPGEDP +  +Y   YV+GLQ  +      D +   LK+
Sbjct: 127 LTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSGYVKGLQQRD------DGNPDGLKV 180

Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
           +ACCKHY AYDLDNW+G DR+HF++ VT+QDM +TF  PF+ CV +G+V+SVMCSYN+VN
Sbjct: 181 AACCKHYTAYDLDNWKGVDRYHFNAVVTKQDMDDTFQPPFKSCVVDGNVASVMCSYNKVN 240

Query: 248 GIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG--L 305
           GIPTCADP LL+  IRG+W  +GYIV+DCDSI     S  +   T E+A A+ + AG  L
Sbjct: 241 GIPTCADPDLLSGVIRGEWKLNGYIVTDCDSIDVFYNSQHY-TKTPEEAAAKAILAGIRL 299

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNI 362
           DL+CG +    T  AV  G + E+ ID ++   +  LMRLG+FDG P    Y  LG  ++
Sbjct: 300 DLNCGSFLGKHTEAAVTAGLVNESAIDRAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDV 359

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
           C  ++ ELA EAARQGIVLLKN  G+LPL+   IK LA++GP+AN TK MIGNYEGTPC+
Sbjct: 360 CTAENQELAREAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 419

Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
           YT+P+ G  A      Y PGC+++ C + + +  A   A  ADATV+V G DLS+EAE +
Sbjct: 420 YTTPLQGLAALVAT-TYLPGCSNVAC-STAQVDDAKKIAAAADATVLVMGADLSIEAESR 477

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
           DRVD+LLPG Q  LI  VA+A+ GPV LVIMS G +D++FAK N KI SILWVGYPGE G
Sbjct: 478 DRVDILLPGQQQLLITAVANASTGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAG 537

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVY 599
           G AIAD+IFG YNP GRLP+TWY  +YV K+P T+M +R  P N +PGRTY+F+ G  VY
Sbjct: 538 GAAIADIIFGSYNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPSNGYPGRTYRFYTGETVY 597

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
            FG GLSY++F +++  +P  V + L+++  C             C +V   +  C++  
Sbjct: 598 SFGDGLSYSEFSHELTQAPGLVSVPLEENHVCY---------SSECKSVAAAEQTCQNLT 648

Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
           F   + ++N G   GS  V ++S PP +  +  K ++G+E+VF+ A   + VGF ++ CK
Sbjct: 649 FDVHLRIKNTGTTSGSHTVFLFSTPPSVHNSPQKHLVGFEKVFLHAQTDSHVGFKVDVCK 708

Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
            L +VD   +  +A G H + +G 
Sbjct: 709 DLSVVDELGSKKVALGEHVLHIGS 732


>gi|356524862|ref|XP_003531047.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Glycine max]
          Length = 765

 Score =  760 bits (1962), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/741 (50%), Positives = 504/741 (68%), Gaps = 30/741 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           ++ + +CD  L    R KDLV R+TL EK+  + + A  V RLG+P YEWWSEALHGVS 
Sbjct: 40  VAGYGFCDKSLGVEARVKDLVGRLTLQEKIGNLVNSAVDVSRLGIPKYEWWSEALHGVSN 99

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G      PGT F + +PGATSFP  ILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 100 VG------PGTRFSNVIPGATSFPMPILTAASFNTSLFEVIGRVVSTEARAMYNVGLAGL 153

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T+WSPNIN+ RDPRWGR LETPGEDP +  +YA  YV+GLQ  +G       D   LK++
Sbjct: 154 TYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQQTDG------GDPNKLKVA 207

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYD+DNW+G  R+ F++ VT+QDM++TF  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 208 ACCKHYTAYDVDNWKGIQRYTFNAVVTKQDMEDTFQPPFKSCVIDGNVASVMCSYNKVNG 267

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCADP LL   +RG+W  +GYIVSDCDS++ + +   +   T E+A A  + AGLDL+
Sbjct: 268 KPTCADPDLLKGVVRGEWKLNGYIVSDCDSVEVLYKDQHY-TKTPEEAAAISILAGLDLN 326

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG +   +T GAV+QG I EA I+ ++   +  LMRLG+FDG P+   Y NLG  ++C  
Sbjct: 327 CGRFLGQYTEGAVKQGLIDEASINNAVTNNFATLMRLGFFDGDPRKQPYGNLGPKDVCTQ 386

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           ++ ELA EAARQGIVLLKN   +LPLN   IK+LA++GP+ANAT+ MIGNYEG PC+Y S
Sbjct: 387 ENQELAREAARQGIVLLKNSPASLPLNAKAIKSLAVIGPNANATRVMIGNYEGIPCKYIS 446

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+ G  A++   +YA GC D+ C N  ++  A   A +ADATVIV G  L++EAE  DRV
Sbjct: 447 PLQGLTAFAPT-SYAAGCLDVRCPN-PVLDDAKKIAASADATVIVVGASLAIEAESLDRV 504

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           ++LLPG Q  L+++VA+A+KGPV LVIMS G +D++FAKNN KI SILWVGYPGE GG A
Sbjct: 505 NILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAKNNNKITSILWVGYPGEAGGAA 564

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
           IADVIFG +NP GRLP+TWY  +YV K+P T+M +R  P   +PGRTY+F+ G  V+ FG
Sbjct: 565 IADVIFGFHNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPATGYPGRTYRFYKGETVFAFG 624

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
            GLSY+   +K+  +P+ V ++L +D  CR            C ++ +    C++  F  
Sbjct: 625 DGLSYSSIVHKLVKAPQLVSVQLAEDHVCRS---------SECKSIDVVGEHCQNLVFDI 675

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
            + ++N GKM  +  V ++S PP +     K ++G+E+V +     A V F ++ CK L 
Sbjct: 676 HLRIKNKGKMSSAHTVFLFSTPPAVHNAPQKHLLGFEKVHLIGKSEALVSFKVDVCKDLS 735

Query: 723 IVDNAANSLLASGAHTILVGE 743
           IVD   N  +A G H + VG+
Sbjct: 736 IVDELGNRKVALGQHLLHVGD 756


>gi|449438167|ref|XP_004136861.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Cucumis sativus]
          Length = 782

 Score =  759 bits (1959), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 379/740 (51%), Positives = 501/740 (67%), Gaps = 30/740 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           +S F +CD+ L +  R +DLV+R+TL EK+  + + A  V RLG+P YEWWSEALHGVS+
Sbjct: 57  VSGFAFCDSSLGFEARVEDLVKRLTLQEKIGFLINNARNVTRLGIPKYEWWSEALHGVSY 116

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G      PGT F + VPGATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 117 VG------PGTKFSNVVPGATSFPQVILTAASFNASLFEAIGKVVSTEARAMYNVGLAGL 170

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T+WSPN+N+ RDPRWGR  ETPGEDP +  +YA  YVRGLQ  +      D D   LK++
Sbjct: 171 TYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQQRD------DGDPDRLKVA 224

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYDLDNW+G DR+HF++ V+ QD+++TF  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 225 ACCKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVIDGNVASVMCSYNQVNG 284

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCADP LL   IRG W  +GYIVSDCDS+  +  S  +   + E+A A+ + AGLDLD
Sbjct: 285 KPTCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHY-TKSPEEAAAKTILAGLDLD 343

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CGD+    T  AV  G + EA I  ++    + LMRLG+FDG+P    Y  LG  ++C P
Sbjct: 344 CGDFLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPSKQLYGKLGPKDVCTP 403

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H ELA EAARQGIVLLKN   +LPL++  IK+LA++GP+AN TK MIGNYEGTPC+YT+
Sbjct: 404 EHQELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTKTMIGNYEGTPCKYTT 463

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+ G  A     ++ PGCA++ C  ++ +  A   A +ADATV+V G D S+EAE +DRV
Sbjct: 464 PLQGLSAVVST-SFQPGCANVAC-TSAQLDEAKKIAASADATVLVVGSDQSIEAESRDRV 521

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           DL LPG Q  LI +VA A+KGPV LVIM+ G +DI FAK + KI SILWVG+PGE GG A
Sbjct: 522 DLNLPGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITSILWVGFPGEAGGAA 581

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
           IADVIFG +NP GRLP+TWY  +YV K+P T M +RP   N FPGRTY+F+ G  +Y FG
Sbjct: 582 IADVIFGSFNPSGRLPMTWYPQSYVEKVPMTDMRMRPSASNGFPGRTYRFYTGETIYSFG 641

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
            GLSY+ FK+ +  +PK V I L++   C         +   C ++ +    C++  F  
Sbjct: 642 DGLSYSDFKHHLVKAPKLVSIPLEEGHIC---------HSSKCHSLEVVQESCQNLGFDV 692

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
            + V+N+G+  GS  V +YS PP +  +  K ++G+E+V +  G    V F ++ CK L 
Sbjct: 693 HLRVKNVGQRSGSHTVFLYSTPPSVHNSPQKHLLGFEKVSLGRGGETVVRFKVDVCKDLS 752

Query: 723 IVDNAANSLLASGAHTILVG 742
           + D   +  +A G H + VG
Sbjct: 753 VADEVGSRKVALGLHILHVG 772


>gi|255545293|ref|XP_002513707.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223547158|gb|EEF48654.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 777

 Score =  759 bits (1959), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 379/740 (51%), Positives = 497/740 (67%), Gaps = 29/740 (3%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+ F +C+  L   +R  DLV R+TL EK+  + + A  V RLG+P YEWWSEALHGVS+
Sbjct: 51  LASFGFCNVSLGISDRVTDLVNRLTLQEKIGFLVNSAGSVSRLGIPKYEWWSEALHGVSY 110

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G      PGTHF + VPGATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 111 VG------PGTHFSNIVPGATSFPQVILTAASFNASLFEAIGKVVSTEARAMYNVGLAGL 164

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFWSPNIN+ RDPRWGR  ETPGEDP +  +Y   YVRGLQ  +      + DS  LK++
Sbjct: 165 TFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSCYVRGLQQTD------NGDSERLKVA 218

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYDLDNW+G DR+HF++ VT+QD+ +TF  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 219 ACCKHYTAYDLDNWKGTDRYHFNAVVTKQDLDDTFQPPFKSCVIDGNVASVMCSYNQVNG 278

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCADP LL   IRG+W  +GYIVSDCDS+  I  S  +   T E+A A  + AGLDL+
Sbjct: 279 KPTCADPDLLAGIIRGEWKLNGYIVSDCDSVDVIYNSQHY-TKTPEEAAAITILAGLDLN 337

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG +    T  AV  G +  + +D ++   +  LMRLG+FDG P    Y  LG  ++C  
Sbjct: 338 CGSFLGKHTEAAVNAGLLNVSAVDKAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTA 397

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            + ELA EAARQGIVLLKN  G+LPL+   IKTLA++GP+AN TK MIGNYEGTPC+YT+
Sbjct: 398 VNQELAREAARQGIVLLKNSPGSLPLSPTAIKTLAVIGPNANVTKTMIGNYEGTPCKYTT 457

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+ G  A S    Y  GC+++ C   + +  A   A +ADATV+V G D S+EAE +DRV
Sbjct: 458 PLQGLTA-SVATTYLAGCSNVACA-AAQVDDAKKLAASADATVLVMGADQSIEAESRDRV 515

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           D+LLPG Q  LI +VA+ +KGPV LVIMS G +D++FAK N KI SILWVGYPGE GG A
Sbjct: 516 DVLLPGQQQLLITQVANVSKGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAA 575

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
           IADVIFG YNP GRLP+TWY   YV K+P T+M +R  P + +PGRTY+F+ G  VY FG
Sbjct: 576 IADVIFGYYNPSGRLPMTWYPQAYVDKVPMTNMNMRPDPSSGYPGRTYRFYTGETVYSFG 635

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
            GLSY+++K+++  +P+ V I L+ D  CR        +   C +V   +  C+   F  
Sbjct: 636 DGLSYSEYKHQLVQAPQLVSIPLEDDHVCR--------SSSKCISVDAGEQNCQGLAFNI 687

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
            ++V N+GK+ G+  V ++  PP +  +  K ++ +E+V + A     V F ++ CK L 
Sbjct: 688 DLKVRNIGKVRGTHTVFLFFTPPSVHNSPQKHLVDFEKVSLDAKTYGMVSFKVDVCKHLS 747

Query: 723 IVDNAANSLLASGAHTILVG 742
           +VD   +  +A G H + VG
Sbjct: 748 VVDEFGSRKVALGGHVLHVG 767


>gi|356572781|ref|XP_003554544.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 771

 Score =  758 bits (1957), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/745 (50%), Positives = 504/745 (67%), Gaps = 32/745 (4%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +C   L   ER KDL+ R+TL EKV+ + + A  VPRLG+  YEWWSEALHGVS +G   
Sbjct: 41  FCKVSLAIAERVKDLIGRLTLEEKVRLLVNNAAAVPRLGMKGYEWWSEALHGVSNLG--- 97

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
              P   F+++ P ATSFP VI T ASFN SLW+ IGQ VS EARAMYN G AGLT+WSP
Sbjct: 98  ---PAVKFNAQFPAATSFPQVITTAASFNASLWEAIGQVVSDEARAMYNGGTAGLTYWSP 154

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           N+N+ RDPRWGR  ETPGEDP + G YA  YVRGLQ   G   +R      LK++ACCKH
Sbjct: 155 NVNIFRDPRWGRGQETPGEDPVLAGTYAATYVRGLQ---GTHANR------LKVAACCKH 205

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           + AYDLDNW G DRFHF+++V++QD+++TF +PF+MCV+EG V+SVMCSYN+VNG+PTCA
Sbjct: 206 FTAYDLDNWNGMDRFHFNAQVSKQDIEDTFDVPFKMCVSEGKVASVMCSYNQVNGVPTCA 265

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
           DP LL +T+RG W   GYIVSDCDS+    ++  +   T E+A A  +KAGLDLDCG + 
Sbjct: 266 DPNLLKKTVRGLWQLDGYIVSDCDSVGVFYDNQHY-TPTPEEAAADAIKAGLDLDCGPFL 324

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIEL 370
              T  AV++G ++EAD++ +L     V MRLG FDG P    Y +LG  ++C P H EL
Sbjct: 325 AVHTQNAVKKGLLSEADVNGALVNTLTVQMRLGMFDGEPTAHPYGHLGPKDVCKPAHQEL 384

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A EAARQGIVLLKN    LPL++   +T+A++GP++ AT  MIGNY G  C YT+P+ G 
Sbjct: 385 ALEAARQGIVLLKNTGPVLPLSSQLHRTVAVIGPNSKATITMIGNYAGVACGYTNPLQGI 444

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
             Y++ ++   GC ++ C+N+ +   AI+AA+ ADATV+V GLD S+EAE  DR  LLLP
Sbjct: 445 GRYARTVHQL-GCQNVACKNDKLFGPAINAARQADATVLVMGLDQSIEAETVDRTGLLLP 503

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
           G Q +L++KVA A+KGP  LV+MS G VDI FAKNNP+I  ILW GYPG+ GG AIAD++
Sbjct: 504 GRQPDLVSKVAAASKGPTILVLMSGGPVDITFAKNNPRIVGILWAGYPGQAGGAAIADIL 563

Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGYGLSY 607
           FG  NPGG+LP+TWY   Y+ K+P T+M +R   +  +PGRTY+F++GPVVYPFG+GL+Y
Sbjct: 564 FGTANPGGKLPVTWYPEEYLTKLPMTNMAMRATKSAGYPGRTYRFYNGPVVYPFGHGLTY 623

Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
           T F + +AS+P  V + L+     R  N T  +N+    A+ +   +C     T Q++++
Sbjct: 624 THFVHTLASAPTVVSVPLNGH---RRANVTNISNR----AIRVTHARCDKLSITLQVDIK 676

Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHI--KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVD 725
           N+G  DG+  ++V+S PP   G     KQ++ +E+V + A    +VG  ++ CK L +VD
Sbjct: 677 NVGSRDGTHTLLVFSAPPAGFGHWALEKQLVAFEKVHVPAKGQHRVGVNIHVCKLLSVVD 736

Query: 726 NAANSLLASGAHTILVGEGVGGVSF 750
            +    +  G H+  +G+    VS 
Sbjct: 737 RSGIRRIPLGEHSFNIGDVKHSVSL 761


>gi|449479116|ref|XP_004155509.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Cucumis sativus]
          Length = 809

 Score =  758 bits (1956), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 379/740 (51%), Positives = 501/740 (67%), Gaps = 30/740 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           +S F +CD+ L +  R +DLV+R+TL EK+  + + A  V RLG+P YEWWSEALHGVS+
Sbjct: 84  VSGFAFCDSSLGFEARVEDLVKRLTLQEKIGFLINNARNVTRLGIPKYEWWSEALHGVSY 143

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G      PGT F + VPGATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 144 VG------PGTKFSNVVPGATSFPQVILTAASFNASLFEAIGKVVSTEARAMYNVGLAGL 197

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T+WSPN+N+ RDPRWGR  ETPGEDP +  +YA  YVRGLQ  +      D D   LK++
Sbjct: 198 TYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQQRD------DGDPDRLKVA 251

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYDLDNW+G DR+HF++ V+ QD+++TF  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 252 ACCKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVIDGNVASVMCSYNQVNG 311

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCADP LL   IRG W  +GYIVSDCDS+  +  S  +   + E+A A+ + AGLDLD
Sbjct: 312 KPTCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHY-TKSPEEAAAKTILAGLDLD 370

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CGD+    T  AV  G + EA I  ++    + LMRLG+FDG+P    Y  LG  ++C P
Sbjct: 371 CGDFLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPSKQLYGKLGPKDVCTP 430

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H ELA EAARQGIVLLKN   +LPL++  IK+LA++GP+AN TK MIGNYEGTPC+YT+
Sbjct: 431 EHQELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTKTMIGNYEGTPCKYTT 490

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+ G  A     ++ PGCA++ C  ++ +  A   A +ADATV+V G D S+EAE +DRV
Sbjct: 491 PLQGLSAVVST-SFQPGCANVAC-TSAQLDEAKKIAASADATVLVVGSDQSIEAESRDRV 548

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           DL LPG Q  LI +VA A+KGPV LVIM+ G +DI FAK + KI SILWVG+PGE GG A
Sbjct: 549 DLNLPGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITSILWVGFPGEAGGAA 608

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
           IADVIFG +NP GRLP+TWY  +YV K+P T M +RP   N FPGRTY+F+ G  +Y FG
Sbjct: 609 IADVIFGSFNPSGRLPMTWYPQSYVEKVPMTDMRMRPSASNGFPGRTYRFYTGETIYSFG 668

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
            GLSY+ FK+ +  +PK V I L++   C         +   C ++ +    C++  F  
Sbjct: 669 DGLSYSDFKHHLVKAPKLVSIPLEEGHIC---------HSSKCHSLEVVQESCQNLGFDV 719

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
            + V+N+G+  GS  V +YS PP +  +  K ++G+E+V +  G    V F ++ CK L 
Sbjct: 720 HLRVKNVGQRSGSHTVFLYSTPPSVHNSPQKHLLGFEKVSLGRGGETVVRFKVDVCKDLS 779

Query: 723 IVDNAANSLLASGAHTILVG 742
           + D   +  +A G H + VG
Sbjct: 780 VADEVGSRKVALGLHILHVG 799


>gi|224111912|ref|XP_002316021.1| predicted protein [Populus trichocarpa]
 gi|222865061|gb|EEF02192.1| predicted protein [Populus trichocarpa]
          Length = 768

 Score =  757 bits (1954), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/749 (49%), Positives = 504/749 (67%), Gaps = 33/749 (4%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +C   LP   R +DL+ R+TL EK++ + + A  VPRLG+  YEWWSEALHGVS +G   
Sbjct: 42  FCRVNLPIHVRVRDLIGRLTLQEKIRLLVNNAAAVPRLGIQGYEWWSEALHGVSNVG--- 98

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
              PGT F    PGAT+FP VI T ASFNESLW++IG+ VS EARAMYN G AGLT+WSP
Sbjct: 99  ---PGTKFGGAFPGATAFPQVITTAASFNESLWEEIGRVVSDEARAMYNGGMAGLTYWSP 155

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           N+NV RDPRWGR  ETPGEDP V G+YA +YVRGLQ   G+          LK++ACCKH
Sbjct: 156 NVNVFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNNGLR---------LKVAACCKH 206

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           Y AYDLDNW G DR+HF++RV++QD+++T+ +PF+ CV  G V+SVMCSYN+VNG PTCA
Sbjct: 207 YTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKSCVVAGKVASVMCSYNQVNGKPTCA 266

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
           DP LL  TIRG+W  +GYIVSDCDS+  + ++  +   T E+A A  ++AGLDLDCG + 
Sbjct: 267 DPYLLKNTIRGEWGLNGYIVSDCDSVGVLFDTQHY-TATPEEAAASTIRAGLDLDCGPFL 325

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIEL 370
              T  AV+ G + E D++ +L     V MRLG FDG P    + NLG  ++C P H +L
Sbjct: 326 AIHTENAVKGGLLKEEDVNMALANTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 385

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A +AARQGIVLL+N    LPL+   ++T+A++GP+++ T  MIGNY G  C YT+P+ G 
Sbjct: 386 ALQAARQGIVLLQNRGRTLPLSR-TLQTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 444

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
             Y+K +++ PGC D+ C  N    AA  AA++ADAT++V GLD S+EAE +DR  LLLP
Sbjct: 445 RRYAKTVHH-PGCNDVFCNGNQQFNAAEVAARHADATILVMGLDQSIEAEFRDRKGLLLP 503

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
           G+Q EL++ VA A++GP  LV+MS G +D++FAKN+P+I +ILWVGYPG+ GG AIADV+
Sbjct: 504 GYQQELVSIVARASRGPTILVLMSGGPIDVSFAKNDPRIGAILWVGYPGQAGGAAIADVL 563

Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSY 607
           FG  NPGG+LP+TWY  NY+ K+P T+M +R  P   +PGRTY+F+ GPVV+PFG+G+SY
Sbjct: 564 FGTANPGGKLPMTWYPHNYLAKVPMTNMGMRADPSRGYPGRTYRFYKGPVVFPFGHGMSY 623

Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
           T F + +  +P+ V + L      R  N T  +N     A+ +    C+       I+V+
Sbjct: 624 TTFAHSLVQAPREVSVPLASLHVSR--NTTGASN-----AIRVSHANCEALALGVHIDVK 676

Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           N G MDG+  ++V+S PPG   +  KQ+IG+E+V +  G   +V   ++ CK L +VD  
Sbjct: 677 NTGDMDGTHTLLVFSSPPGGKWSTQKQLIGFEKVHLVTGSQKRVKIDIHVCKHLSVVDRF 736

Query: 728 ANSLLASGAHTILVGEGVGGVSFPLQLNL 756
               +  G H + +G+    +S  LQ NL
Sbjct: 737 GIRRIPIGEHDLYIGDLKHSIS--LQANL 763


>gi|449484229|ref|XP_004156823.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
           [Cucumis sativus]
          Length = 769

 Score =  755 bits (1949), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 370/740 (50%), Positives = 493/740 (66%), Gaps = 27/740 (3%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           +D+P+C   L   ER KDL+ R+TL EKV+ +   A GVPRLG+  Y+WWSEALHGVS +
Sbjct: 37  TDYPFCRRSLVVEERVKDLIGRLTLEEKVKLLVSNAGGVPRLGIKAYQWWSEALHGVSNV 96

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
           G      PGT F  E P ATSFP VI T ASFN SLW+ IG+ VS EARAMYN G  GLT
Sbjct: 97  G------PGTRFGGEFPAATSFPQVISTAASFNASLWEAIGRVVSDEARAMYNGGVGGLT 150

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
           +WSPN+N+ RDPRWGR  ETPGEDP + G YA+NYVRGLQ  EG           LK++A
Sbjct: 151 YWSPNVNIFRDPRWGRGQETPGEDPILAGTYAVNYVRGLQGTEGNR---------LKVAA 201

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKH+ AYDLDNW G DRFHF+++V++QD+++TF +PF MCV  G VSSVMCSYN+VNG+
Sbjct: 202 CCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFEVPFRMCVKGGKVSSVMCSYNQVNGV 261

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           PTCADP LL  T+R  W+  GYIVSDCDS+     S  +   T E+A A  +KAGLDLDC
Sbjct: 262 PTCADPNLLTNTLRSQWHLDGYIVSDCDSVGVFYNSQHY-TSTPEEAAAMAIKAGLDLDC 320

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG---SPQYKNLGKNNICNPQ 366
           G +    T  AV++G + E+ I+ +L     V MRLG FDG   +  Y +LG  ++C+  
Sbjct: 321 GSFLETHTENAVKRGLLNESHINGALSNTLSVQMRLGMFDGDLKTQPYAHLGAKHVCSDH 380

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
           + +LA +AARQGIVLL+N  G+LPL+T   + +A+VGP++NAT  MIGNY G  C Y +P
Sbjct: 381 NRQLAVDAARQGIVLLENRRGSLPLSTNRHRIVAVVGPNSNATLTMIGNYAGIACEYITP 440

Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
           + G   Y++ I +  GC  + C++N     AI+AA+ ADA V+V GLD S+EAE +DR  
Sbjct: 441 LQGISKYTRTI-HQEGCRGVACRSNKFFGGAIEAARVADAVVLVMGLDQSIEAEFRDRAG 499

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           LLLPG Q +L+ KVA  AKGPV LV+MS G +D++FAK++PKI  I+W GYPG+ GG AI
Sbjct: 500 LLLPGLQPDLVLKVASVAKGPVILVLMSGGPIDVSFAKDHPKISGIIWGGYPGQAGGLAI 559

Query: 547 ADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGL 605
           ADV+FG+ NPGG+LP+TWY  +YV K+P T+M LRP  ++PGRTY+F+ GPVVYPFG+GL
Sbjct: 560 ADVLFGQTNPGGKLPMTWYPQDYVSKLPMTTMSLRPGTSYPGRTYRFYKGPVVYPFGHGL 619

Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIE 665
           SYT F +K+ S+P ++ + +   +   + +   G       AV +   KC       ++ 
Sbjct: 620 SYTAFTHKILSAPTTLTVPVTGHRHPHNGSEFWGK------AVRVTHAKCDRLSLVIKVA 673

Query: 666 VENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVD 725
           V N+G  DG+  ++VYS PP       KQ++ +E+V I A    +V   ++ CK L +VD
Sbjct: 674 VRNIGARDGAHTLLVYSIPPMGVWVPQKQLVAFEKVHIDAQALKEVQINIHVCKLLSVVD 733

Query: 726 NAANSLLASGAHTILVGEGV 745
                 +  G H I +G+ V
Sbjct: 734 KYGIRRVPMGEHGIDIGDNV 753


>gi|357511337|ref|XP_003625957.1| Beta-xylosidase [Medicago truncatula]
 gi|355500972|gb|AES82175.1| Beta-xylosidase [Medicago truncatula]
          Length = 771

 Score =  754 bits (1948), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/748 (49%), Positives = 499/748 (66%), Gaps = 33/748 (4%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
           + P+C+ KL  PER KDL+ R+T+ EKV  + + A  VPR+G+  YEWWSEALHGVS +G
Sbjct: 39  NLPFCNVKLAIPERVKDLIGRLTMQEKVNLLVNNAPAVPRVGMKSYEWWSEALHGVSNVG 98

Query: 71  RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
                 PGT F    P ATSFP VI T ASFN SLW+ IG+ VS EARAMYN G AGLT+
Sbjct: 99  ------PGTRFGGVFPAATSFPQVITTAASFNASLWEAIGRVVSDEARAMYNGGAAGLTY 152

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           WSPN+N+ RDPRWGR  ETPGEDP + GRYA +YV+GLQ  +G           LK++AC
Sbjct: 153 WSPNVNIFRDPRWGRGQETPGEDPVLAGRYAASYVKGLQGTDG---------NKLKVAAC 203

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH+ AYD+DNW G DRFHF++ V++QD+++TF +PF MCV EG V+SVMCSYN+VNG+P
Sbjct: 204 CKHFTAYDVDNWNGVDRFHFNALVSKQDIEDTFDVPFRMCVKEGKVASVMCSYNQVNGVP 263

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
           TCADP LL +T+RG W   GYIVSDCDS+  +  S  +   T E+A A  +KAGLDLDCG
Sbjct: 264 TCADPNLLKKTVRGVWGLDGYIVSDCDSVGVLYNSQHY-TSTPEEAAADAIKAGLDLDCG 322

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
            +    T  AV++G + EAD++ +L     V MRLG FDG P    Y  LG  ++C P H
Sbjct: 323 PFLGVHTQDAVKKGLLTEADVNNALVNTLKVQMRLGMFDGEPSAQAYGRLGPKDVCKPAH 382

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
            ELA EAARQGIVLLKN    LPL+    +T+A++GP+++ T  MIGNY G  C YTSP+
Sbjct: 383 QELALEAARQGIVLLKNTGPTLPLSPQRHRTVAVIGPNSDVTVTMIGNYAGIACGYTSPL 442

Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
            G   Y+K I +  GC+++ C+++     A+DAA++ADAT++V GLD S+EAE  DR  L
Sbjct: 443 QGIGRYAKTI-HQQGCSNVACRDDKQFGPALDAARHADATILVIGLDQSIEAETVDRTSL 501

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
           LLPG Q +L++KVA A+KGP  LV+MS G VDI FAKN+PK+  ILW GYPG+ GG AIA
Sbjct: 502 LLPGHQQDLVSKVAAASKGPTILVLMSGGPVDITFAKNDPKVAGILWAGYPGQAGGAAIA 561

Query: 548 DVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVN-NFPGRTYKFFDGPVVYPFGYGL 605
           D++FG  +PGG+LP+TWY   Y+K +  T+M +RP    +PGRTY+F+ GPVVYPFG+GL
Sbjct: 562 DILFGTASPGGKLPVTWYPQEYLKNLAMTNMAMRPSKIGYPGRTYRFYKGPVVYPFGHGL 621

Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIE 665
           +YT F ++++S+P  V + +   +   + N    +NK    A+ +   +C        ++
Sbjct: 622 TYTHFVHELSSAPTVVSVPVHGHRHGNNTNI---SNK----AIRVTHARCGKLSIALHVD 674

Query: 666 VENMGKMDGSEVVMVYSKPPGIAGTHI---KQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
           V+N+G  DG+  ++V+S PP   G H    K ++ +E+V + A    +V   ++ CK L 
Sbjct: 675 VKNVGSRDGTHTLLVFSAPPN-GGNHWVPQKSLVAFEKVHVPAKTKQRVRVNIHVCKLLS 733

Query: 723 IVDNAANSLLASGAHTILVGEGVGGVSF 750
           +VD +    +  G H++ +G+    VS 
Sbjct: 734 VVDKSGIRRIPMGEHSLHIGDVKHSVSL 761


>gi|449469042|ref|XP_004152230.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
          Length = 769

 Score =  754 bits (1947), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 370/740 (50%), Positives = 493/740 (66%), Gaps = 27/740 (3%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           +D+P+C   L   ER KDL+ R+TL EKV+ +   A GVPRLG+  Y+WWSEALHGVS +
Sbjct: 37  TDYPFCRRSLVVGERVKDLIGRLTLEEKVKLLVSNAGGVPRLGIKAYQWWSEALHGVSNV 96

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
           G      PGT F  E P ATSFP VI T ASFN SLW+ IG+ VS EARAMYN G  GLT
Sbjct: 97  G------PGTRFGGEFPAATSFPQVISTAASFNASLWEAIGRVVSDEARAMYNGGVGGLT 150

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
           +WSPN+N+ RDPRWGR  ETPGEDP + G YA+NYVRGLQ  EG           LK++A
Sbjct: 151 YWSPNVNIFRDPRWGRGQETPGEDPILAGTYAVNYVRGLQGTEGNR---------LKVAA 201

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKH+ AYDLDNW G DRFHF+++V++QD+++TF +PF MCV  G VSSVMCSYN+VNG+
Sbjct: 202 CCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFEVPFRMCVKGGKVSSVMCSYNQVNGV 261

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           PTCADP LL  T+R  W+  GYIVSDCDS+     S  +   T E+A A  +KAGLDLDC
Sbjct: 262 PTCADPNLLTNTLRSQWHLDGYIVSDCDSVGVFYNSQHY-TSTPEEAAAMAIKAGLDLDC 320

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG---SPQYKNLGKNNICNPQ 366
           G +    T  AV++G + E+ I+ +L     V MRLG FDG   +  Y +LG  ++C+  
Sbjct: 321 GSFLETHTENAVKRGLLNESHINGALSNTLSVQMRLGMFDGDLKTQPYAHLGAKHVCSDH 380

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
           + +LA +AARQGIVLL+N  G+LPL+T   + +A+VGP++NAT  MIGNY G  C Y +P
Sbjct: 381 NRQLAVDAARQGIVLLENRRGSLPLSTNRHRIVAVVGPNSNATLTMIGNYAGIACEYITP 440

Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
           + G   Y++ I +  GC  + C++N     AI+AA+ ADA V+V GLD S+EAE +DR  
Sbjct: 441 LQGISKYTRTI-HQEGCRGVACRSNKFFGGAIEAARVADAVVLVMGLDQSIEAEFRDRAG 499

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           LLLPG Q +L+ KVA  AKGPV LV+MS G +D++FAK++PKI  I+W GYPG+ GG AI
Sbjct: 500 LLLPGLQPDLVLKVASVAKGPVILVLMSGGPIDVSFAKDHPKISGIIWGGYPGQAGGLAI 559

Query: 547 ADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGL 605
           ADV+FG+ NPGG+LP+TWY  +YV K+P T+M LRP  ++PGRTY+F+ GPVVYPFG+GL
Sbjct: 560 ADVLFGQTNPGGKLPMTWYPQDYVSKLPMTTMSLRPGTSYPGRTYRFYKGPVVYPFGHGL 619

Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIE 665
           SYT F +K+ S+P ++ + +   +   + +   G       AV +   KC       ++ 
Sbjct: 620 SYTAFTHKILSAPTTLTVPVTGHRHPHNGSEFWGK------AVRVTHAKCDRLSLVIKVA 673

Query: 666 VENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVD 725
           V N+G  DG+  ++VYS PP       KQ++ +E+V I A    +V   ++ CK L +VD
Sbjct: 674 VRNIGARDGAHTLLVYSIPPMGVWVPQKQLVAFEKVHIDAQALKEVQINIHVCKLLSVVD 733

Query: 726 NAANSLLASGAHTILVGEGV 745
                 +  G H I +G+ V
Sbjct: 734 KYGIRRVPMGEHGIDIGDNV 753


>gi|356556038|ref|XP_003546334.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
          Length = 775

 Score =  753 bits (1945), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/739 (50%), Positives = 499/739 (67%), Gaps = 30/739 (4%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           F +C+  +P   R +DL+ R+TLPEK++ + + A  VPRLG+  YEWWSEALHGVS +G 
Sbjct: 49  FKFCNTHVPIHVRVQDLIARLTLPEKIRLVVNNAIAVPRLGIQGYEWWSEALHGVSNVG- 107

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
                PGT F    PGAT FP VI T ASFN+SLW++IG+ VS EARAMYN G AGLT+W
Sbjct: 108 -----PGTKFGGAFPGATMFPQVISTAASFNQSLWQEIGRVVSDEARAMYNGGQAGLTYW 162

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
           SPN+N+ RDPRWGR  ETPGEDP +  +YA +YV+GLQ         DS    LK++ACC
Sbjct: 163 SPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKGLQG--------DSAGNHLKVAACC 214

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KHY AYDLDNW G DRFHF+++V++QD+++T+ +PF+ CV EG V+SVMCSYN+VNG PT
Sbjct: 215 KHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPFKACVLEGQVASVMCSYNQVNGKPT 274

Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
           CADP LL  TIRG W  +GYIVSDCDS+    ++  +   T E+A A  +KAGLDLDCG 
Sbjct: 275 CADPDLLRNTIRGQWRLNGYIVSDCDSVGVFFDNQHY-TKTPEEAAAEAIKAGLDLDCGP 333

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHI 368
           +    T  A+++G I+E D++ +L  L  V MRLG FDG P    Y NLG  ++C   H 
Sbjct: 334 FLAIHTDSAIRKGLISENDLNLALANLISVQMRLGMFDGEPSTQPYGNLGPRDVCTSAHQ 393

Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
           +LA EAAR+ IVLL+N   +LPL+   ++T+ +VGP+A+AT  MIGNY G  C YT+P+ 
Sbjct: 394 QLALEAARESIVLLQNKGNSLPLSPSRLRTIGVVGPNADATVTMIGNYAGVACGYTTPLQ 453

Query: 429 GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
           G   Y K  +   GC  + C+ N +  AA   A+ ADA V+V GLD +VEAE +DRV LL
Sbjct: 454 GIARYVKTAHQV-GCRGVACRGNELFGAAETIARQADAIVLVMGLDQTVEAETRDRVGLL 512

Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
           LPG Q EL+ +VA AAKGPV L+IMS G VDI+FAKN+PKI +ILWVGYPG+ GG AIAD
Sbjct: 513 LPGLQQELVTRVARAAKGPVILLIMSGGPVDISFAKNDPKISAILWVGYPGQAGGTAIAD 572

Query: 549 VIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGL 605
           VIFG  NPGGRLP+TWY   Y+ K+P T+M +R  P   +PGRTY+F+ GPVV+PFG+GL
Sbjct: 573 VIFGTTNPGGRLPMTWYPQGYLAKVPMTNMDMRPNPTTGYPGRTYRFYKGPVVFPFGHGL 632

Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD-YKFTFQI 664
           SY++F + +A +PK V + +   Q     N T+ +      AV +    C D  +  F +
Sbjct: 633 SYSRFSHSLALAPKQVSVPIMSLQAL--TNSTLSSK-----AVKVSHANCDDSLEMEFHV 685

Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIV 724
           +V+N G MDG+  ++++S+PP    + IKQ++G+ +  + AG   +V   ++ CK L +V
Sbjct: 686 DVKNEGSMDGTHTLLIFSQPPHGKWSQIKQLVGFHKTHVLAGSKQRVKVGVHVCKHLSVV 745

Query: 725 DNAANSLLASGAHTILVGE 743
           D      + +G H + +G+
Sbjct: 746 DQFGVRRIPTGEHELHIGD 764


>gi|350534908|ref|NP_001233910.1| beta-D-xylosidase 1 precursor [Solanum lycopersicum]
 gi|37359706|dbj|BAC98298.1| LEXYL1 [Solanum lycopersicum]
          Length = 770

 Score =  753 bits (1945), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 369/741 (49%), Positives = 488/741 (65%), Gaps = 30/741 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L +  +CDA L    R  DLV R+TL EK+  +   A GV RLG+P YEWWSEALHGV++
Sbjct: 45  LGNLTFCDASLAVENRVNDLVNRLTLGEKIGFLVSGAGGVSRLGIPKYEWWSEALHGVAY 104

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
            G      PG HF S VPGATSFP VILT ASFN +L++ IG+ VSTEARAMYN+G AGL
Sbjct: 105 TG------PGVHFTSLVPGATSFPQVILTAASFNVTLFQTIGKVVSTEARAMYNVGLAGL 158

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T+WSPN+N+ RDPRWGR  ETPGEDP +  +Y + YV GLQ  +      D  +  LK++
Sbjct: 159 TYWSPNVNIFRDPRWGRGQETPGEDPTLTSKYGVAYVEGLQQTD------DGSTNKLKVA 212

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYD+DNW+G +R+ F++ V +QD+ +TF  PF  CV EG V+SVMCSYN+VNG
Sbjct: 213 ACCKHYTAYDVDNWKGIERYSFNAVVRQQDLDDTFQPPFRSCVLEGAVASVMCSYNQVNG 272

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTC DP LL   +RG+W  +GYIV+DCDS+Q I +S  +   T E+A A  L +G+DL+
Sbjct: 273 KPTCGDPNLLAGIVRGEWKLNGYIVTDCDSLQVIFKSQNY-TKTPEEAAALGLNSGVDLN 331

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG + + +T GAV Q  + E+ ID ++   +  LMRLG+FDG+P+   Y NLG  ++C P
Sbjct: 332 CGSWLSTYTQGAVNQKLVNESVIDRAISNNFATLMRLGFFDGNPKSRIYGNLGPKDVCTP 391

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           ++ ELA EAARQGIVLLKN  G+LPL    IK+LA++GP+AN TK MIGNYEG PC+YT+
Sbjct: 392 ENQELAREAARQGIVLLKNTAGSLPLTPTAIKSLAVIGPNANVTKTMIGNYEGIPCKYTT 451

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+ G  A    I Y PGCAD+ C N + I  A   A  ADA V+V G D S+E E  DR 
Sbjct: 452 PLQGLTASVATI-YKPGCADVSC-NTAQIDDAKQIATTADAVVLVMGSDQSIEKESLDRT 509

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            + LPG Q+ L+ +VA  AKGPV LVIMS G +D+ FA +NPKI SILWVG+PGE GG A
Sbjct: 510 SITLPGQQSILVAEVAKVAKGPVILVIMSGGGMDVQFAVDNPKITSILWVGFPGEAGGAA 569

Query: 546 IADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
           +ADVIFG YNP GRLP+TWY  +Y   +P T M +R  P  N+PGRTY+F+ GP V+ FG
Sbjct: 570 LADVIFGYYNPSGRLPMTWYPQSYADVVPMTDMNMRPNPATNYPGRTYRFYTGPTVFTFG 629

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
           +GLSY+QFK+ +  +P+ V + L +   CR            C  V      C +  F  
Sbjct: 630 HGLSYSQFKHHLDKAPQFVSLPLGEKHTCR---------LSKCKTVDAVGQSCSNMGFDI 680

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
            + V+N+GK+ GS ++ +++ PP +     K ++G+E+V +       V F +N CK L 
Sbjct: 681 HLRVKNVGKISGSHIIFLFTSPPSVHNAPKKHLLGFEKVHLTPQGEGVVKFNVNVCKHLS 740

Query: 723 IVDNAANSLLASGAHTILVGE 743
           + D   N  +A G H + +G+
Sbjct: 741 VHDELGNRKVALGPHVLHIGD 761


>gi|297797477|ref|XP_002866623.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
 gi|297312458|gb|EFH42882.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
          Length = 784

 Score =  751 bits (1940), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/742 (50%), Positives = 498/742 (67%), Gaps = 27/742 (3%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+ + +C+  L    R  DLV R+TL EK+  +   A GV RLG+P YEWWSEALHGVS+
Sbjct: 54  LAAYGFCNTVLKIEYRVADLVARLTLQEKIGFLVSKANGVTRLGIPTYEWWSEALHGVSY 113

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           IG      PGTHF S+VPGATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 114 IG------PGTHFSSQVPGATSFPQVILTAASFNVSLFQAIGKVVSTEARAMYNVGLAGL 167

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T+WSPN+N+ RDPRWGR  ETPGEDP +  +YA  YV+GLQ+ +G       DS  LK++
Sbjct: 168 TYWSPNVNIFRDPRWGRGQETPGEDPLLASKYASGYVKGLQETDG------GDSNRLKVA 221

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYD+DNW+G +R+ F++ VT+QDM +T+  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 222 ACCKHYTAYDVDNWKGVERYSFNAVVTQQDMDDTYQPPFKSCVVDGNVASVMCSYNQVNG 281

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCADP LL+  IRG+W  +GYIVSDCDS+  + ++  +     E A   +L AGLDL+
Sbjct: 282 KPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQHYTKTPAEAAAISIL-AGLDLN 340

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG +    T  AV+ G + EA ID ++   ++ LMRLG+FDG+P+   Y  LG  ++C  
Sbjct: 341 CGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLGFFDGNPKNQIYGGLGPTDVCTS 400

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            + ELAA+AARQGIVLLKN  G LPL+  +IKTLA++GP+AN TK MIGNYEGTPC+YT+
Sbjct: 401 ANQELAADAARQGIVLLKN-TGFLPLSPKSIKTLAVIGPNANVTKTMIGNYEGTPCKYTT 459

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+ G  A +    Y PGC+++ C     +  A   A  AD TV++ G D S+EAE +DRV
Sbjct: 460 PLQGL-AGAVSTTYLPGCSNVACAVAD-VAGATKLAATADVTVLLIGADQSIEAESRDRV 517

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           DL LPG Q EL+ +VA AAKGPV LVIMS G  DI FAKN+PKI  ILWVGYPGE GG A
Sbjct: 518 DLNLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAKNDPKIAGILWVGYPGEAGGIA 577

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
           IAD+IFG+YNP GRLP+TWY  +YV K+P T M +RP     +PGRTY+F+ G  VY FG
Sbjct: 578 IADIIFGRYNPSGRLPMTWYPQSYVEKVPMTIMNMRPDKSKGYPGRTYRFYTGETVYAFG 637

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDIN-YTVGTNKPPCAAVLIDDVKCKDYKFT 661
            GLSYT+F + +  +P  V + L+++  CR     ++    P C     + V      F 
Sbjct: 638 DGLSYTKFSHSLVKAPSLVSLSLEENHVCRSSECQSLDAIGPHCE----NAVSGGGSAFE 693

Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
            QI+V N G  +G   V +++ PP I G+  K ++G+E++ +   + A V F +  CK L
Sbjct: 694 VQIKVRNGGDREGIHTVFLFTTPPAIHGSPRKHLLGFEKIRLGKMEEAVVRFKVEVCKDL 753

Query: 722 KIVDNAANSLLASGAHTILVGE 743
            +VD      +  G H + VG+
Sbjct: 754 SVVDEIGKRKIGLGKHLLHVGD 775


>gi|15237736|ref|NP_201262.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
 gi|75262663|sp|Q9FLG1.1|BXL4_ARATH RecName: Full=Beta-D-xylosidase 4; Short=AtBXL4; Flags: Precursor
 gi|10178060|dbj|BAB11424.1| beta-xylosidase [Arabidopsis thaliana]
 gi|332010539|gb|AED97922.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
          Length = 784

 Score =  749 bits (1934), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/742 (50%), Positives = 498/742 (67%), Gaps = 27/742 (3%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+ + +C+  L    R  DLV R+TL EK+  +   A GV RLG+P YEWWSEALHGVS+
Sbjct: 54  LAAYGFCNTVLKIEYRVADLVARLTLQEKIGFLVSKANGVTRLGIPTYEWWSEALHGVSY 113

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           IG      PGTHF S+VPGATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 114 IG------PGTHFSSQVPGATSFPQVILTAASFNVSLFQAIGKVVSTEARAMYNVGLAGL 167

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T+WSPN+N+ RDPRWGR  ETPGEDP +  +YA  YV+GLQ+ +G       DS  LK++
Sbjct: 168 TYWSPNVNIFRDPRWGRGQETPGEDPLLASKYASGYVKGLQETDG------GDSNRLKVA 221

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYD+DNW+G +R+ F++ VT+QDM +T+  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 222 ACCKHYTAYDVDNWKGVERYSFNAVVTQQDMDDTYQPPFKSCVVDGNVASVMCSYNQVNG 281

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCADP LL+  IRG+W  +GYIVSDCDS+  + ++  +     E A   +L AGLDL+
Sbjct: 282 KPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQHYTKTPAEAAAISIL-AGLDLN 340

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG +    T  AV+ G + EA ID ++   ++ LMRLG+FDG+P+   Y  LG  ++C  
Sbjct: 341 CGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLGFFDGNPKNQIYGGLGPTDVCTS 400

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            + ELAA+AARQGIVLLKN  G LPL+  +IKTLA++GP+AN TK MIGNYEGTPC+YT+
Sbjct: 401 ANQELAADAARQGIVLLKN-TGCLPLSPKSIKTLAVIGPNANVTKTMIGNYEGTPCKYTT 459

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+ G  A +    Y PGC+++ C     +  A   A  AD +V+V G D S+EAE +DRV
Sbjct: 460 PLQGL-AGTVSTTYLPGCSNVACAVAD-VAGATKLAATADVSVLVIGADQSIEAESRDRV 517

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           DL LPG Q EL+ +VA AAKGPV LVIMS G  DI FAKN+PKI  ILWVGYPGE GG A
Sbjct: 518 DLHLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAKNDPKIAGILWVGYPGEAGGIA 577

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
           IAD+IFG+YNP G+LP+TWY  +YV K+P T M +RP   + +PGRTY+F+ G  VY FG
Sbjct: 578 IADIIFGRYNPSGKLPMTWYPQSYVEKVPMTIMNMRPDKASGYPGRTYRFYTGETVYAFG 637

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDIN-YTVGTNKPPCAAVLIDDVKCKDYKFT 661
            GLSYT+F + +  +P  V + L+++  CR     ++    P C     + V      F 
Sbjct: 638 DGLSYTKFSHTLVKAPSLVSLGLEENHVCRSSECQSLDAIGPHCE----NAVSGGGSAFE 693

Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
             I+V N G  +G   V +++ PP I G+  K ++G+E++ +   + A V F +  CK L
Sbjct: 694 VHIKVRNGGDREGIHTVFLFTTPPAIHGSPRKHLVGFEKIRLGKREEAVVRFKVEICKDL 753

Query: 722 KIVDNAANSLLASGAHTILVGE 743
            +VD      +  G H + VG+
Sbjct: 754 SVVDEIGKRKIGLGKHLLHVGD 775


>gi|115460876|ref|NP_001054038.1| Os04g0640700 [Oryza sativa Japonica Group]
 gi|38344900|emb|CAE02971.2| OSJNBb0079B02.3 [Oryza sativa Japonica Group]
 gi|113565609|dbj|BAF15952.1| Os04g0640700 [Oryza sativa Japonica Group]
 gi|116310882|emb|CAH67823.1| OSIGBa0138H21-OSIGBa0138E01.14 [Oryza sativa Indica Group]
 gi|218195682|gb|EEC78109.1| hypothetical protein OsI_17615 [Oryza sativa Indica Group]
          Length = 765

 Score =  749 bits (1933), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 372/742 (50%), Positives = 504/742 (67%), Gaps = 32/742 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           +S + +CD       RA DL+ R+TL EKV  + +    +PRLG+P YEWWSEALHGVS+
Sbjct: 40  VSGYGFCDRTKSSAARAADLLGRLTLAEKVGFLVNKQAALPRLGIPAYEWWSEALHGVSY 99

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G      PGT F + VPGATSFP  ILT ASFN SL++ IG+ VSTEARAM+N+G AGL
Sbjct: 100 VG------PGTRFSTLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNVGLAGL 153

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFWSPNIN+ RDPRWGR  ETPGEDP +  +YA+ YV GLQD  G        S  LK++
Sbjct: 154 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGG-------GSDALKVA 206

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYD+DNW+G +R+ FD+ V++QD+ +TF  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 207 ACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNG 266

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCAD  LL+  IRGDW  +GYIVSDCDS+  +  +  +  +  EDA A  +K+GLDL+
Sbjct: 267 KPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKN-PEDAAAITIKSGLDLN 325

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG++    T+ AVQ GK++E+D+D ++   +IVLMRLG+FDG P+   + +LG  ++C  
Sbjct: 326 CGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPRKLPFGSLGPKDVCTS 385

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            + ELA EAARQGIVLLKN  GALPL+  +IK++A++GP+ANA+  MIGNYEGTPC+YT+
Sbjct: 386 SNQELAREAARQGIVLLKN-TGALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTT 444

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           P+ G  A    + Y PGC ++ C  NS+ + AA  AA +AD TV+V G D SVE E  DR
Sbjct: 445 PLQGLGANVATV-YQPGCTNVGCSGNSLQLSAATQAAASADVTVLVVGADQSVERESLDR 503

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
             LLLPG Q +L++ VA+A++GPV LV+MS G  DI+FAK++ KI +ILWVGYPGE GG 
Sbjct: 504 TSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSDKISAILWVGYPGEAGGA 563

Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPF 601
           A+AD++FG +NPGGRLP+TWY A++  K+  T M +RP     +PGRTY+F+ G  VY F
Sbjct: 564 ALADILFGYHNPGGRLPVTWYPASFADKVSMTDMRMRPDSSTGYPGRTYRFYTGDTVYAF 623

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           G GLSYT+F + + S+P+ V ++L +   C   +         C +V      C    F 
Sbjct: 624 GDGLSYTKFAHSLVSAPEQVAVQLAEGHACHTEH---------CFSVEAAGEHCGSLSFD 674

Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
             + V N G M G   V ++S PP +     K ++G+E+V +  GQ+  V F ++ CK L
Sbjct: 675 VHLRVRNAGGMAGGHTVFLFSSPPSVHSAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDL 734

Query: 722 KIVDNAANSLLASGAHTILVGE 743
            +VD   N  +A G+HT+ VG+
Sbjct: 735 SVVDELGNRKVALGSHTLHVGD 756


>gi|296083056|emb|CBI22460.3| unnamed protein product [Vitis vinifera]
          Length = 896

 Score =  748 bits (1932), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 383/741 (51%), Positives = 489/741 (65%), Gaps = 62/741 (8%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S FP+C+  LPY +RA DLV R+TL EK +Q+ + A G+ RLG+P YEWWSEALHGVS  
Sbjct: 61  SQFPFCNTSLPYQDRASDLVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGVS-- 118

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
               NS  G HF   +P  T FP VIL+ ASFNESLW  +GQ VSTE RAMYN+G AGLT
Sbjct: 119 ----NSGIGVHFHDPIPAVTIFPAVILSAASFNESLWYTMGQVVSTEGRAMYNVGQAGLT 174

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
           +WSPN+N+ RDPRWGR  ETPGEDP VV RYA+NYVRGLQ+V G E +  +D   LK+S+
Sbjct: 175 YWSPNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEV-GKEGNFAADR--LKVSS 231

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKHY AYD+D W+G DRFHFD++VT QD+++T+  PF+ CV EG VSSVMCSYNRVNG+
Sbjct: 232 CCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKSCVEEGHVSSVMCSYNRVNGV 291

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           PTCA+P+LL   IR  W   GYIVSDCDSI    E   +  +T EDAVA  LKAGL+L+C
Sbjct: 292 PTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNY-TETPEDAVALALKAGLNLNC 350

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
           G Y  ++T  AV  GK+ E+ ++ +L + YIVLMRLG+FDG P    +  +G +++C   
Sbjct: 351 GSYLGDYTKNAVNLGKVKESIVNQALIYNYIVLMRLGFFDGDPTMLPFGKMGPSDVCTVD 410

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
           H  LA +AA+QGIVLL N NGALPL+    KTLA++GP+A+AT  M+ NY G PCRYTSP
Sbjct: 411 HQLLALDAAKQGIVLLHN-NGALPLSPNTTKTLAVIGPNADATNTMLSNYAGVPCRYTSP 469

Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
           + G   Y   ++Y  GCA++ C   ++I  A   A  ADATV+V GLDL +EAE  DRV+
Sbjct: 470 LQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVVGLDLFIEAEDLDRVN 529

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           L LPGFQ +L+ + A AA G V LV+MSAG VDI+F KN  KI  ILWVGYPG+ GG AI
Sbjct: 530 LTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGGILWVGYPGQAGGDAI 589

Query: 547 ADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGY 603
           + VIFG YNPGGR P TWY   YV ++P T M +RP   +NFPGRTY+F+ G  +Y FG+
Sbjct: 590 SQVIFGDYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATSNFPGRTYRFYTGKSLYQFGH 649

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           GLSY+ F YK  S+                                ID V          
Sbjct: 650 GLSYSTF-YKNLSN--------------------------------IDIV---------- 666

Query: 664 IEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
           I V+N G++DG+ VV+ + KPP  G+ G    +++G+ERV +  G++  VG  ++ C  +
Sbjct: 667 IGVKNAGEIDGTHVVLAFWKPPRSGVRGAPGVELVGFERVEVKRGKTEMVGMRLDVCGKI 726

Query: 722 KIVDNAANSLLASGAHTILVG 742
             VD      L  G HT++VG
Sbjct: 727 SNVDEEGKRKLVMGMHTLVVG 747


>gi|255573163|ref|XP_002527511.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223533151|gb|EEF34909.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 810

 Score =  748 bits (1932), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 377/749 (50%), Positives = 506/749 (67%), Gaps = 24/749 (3%)

Query: 8   KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
           + +D+ +C+  L Y +RAKDL+ R+TL EKVQQ+ + A G+PRLG+P YEWWSEALHGVS
Sbjct: 33  QTNDYSFCNTSLSYQDRAKDLISRLTLQEKVQQVVNHAAGIPRLGIPAYEWWSEALHGVS 92

Query: 68  FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
            +G       G  F+  VPGATSFP +IL+ ASFNE+LW K+GQ VSTEAR M+++G AG
Sbjct: 93  NVGF------GVRFNGTVPGATSFPAMILSAASFNETLWLKMGQVVSTEARTMHSVGLAG 146

Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
           LT+WSPN+NV RDPRWGR  ETPGEDP VV RYA+NYVRGLQ+V G E +  +D   LK+
Sbjct: 147 LTYWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEV-GDEGNSTADK--LKV 203

Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
           S+CCKHY AYDLD W+G DRFHFD++VT+QD+++T+  PF  CV E  VSSVMCSYNRVN
Sbjct: 204 SSCCKHYTAYDLDKWKGVDRFHFDAKVTKQDLEDTYQPPFRSCVEEAHVSSVMCSYNRVN 263

Query: 248 GIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL 307
           GIPTCADP LL   IRG+WN  GYIVSDCDSI+   +S  +   T EDAVA  LKAGL++
Sbjct: 264 GIPTCADPDLLKGIIRGEWNLDGYIVSDCDSIEVYYDSINY-TATPEDAVALALKAGLNM 322

Query: 308 DCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICN 364
           +CG++   +T+ AV+  K+ E+ +D +L + +IVLMRLG+FDG P+   + NLG +++C+
Sbjct: 323 NCGEFLGKYTVDAVKLNKVEESVVDQALIYNFIVLMRLGFFDGDPKSLLFGNLGPSDVCS 382

Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
             H +LA +AARQGIVLL N  GALPL+  N + LA++GP+AN T  MI NY G PC+YT
Sbjct: 383 DGHQKLALDAARQGIVLLYN-KGALPLSKNNTRNLAVIGPNANVTTTMISNYAGIPCKYT 441

Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           +P+ G   Y   + YA GC  + C ++++I AA  AA  ADA V++ GLD S+E EG DR
Sbjct: 442 TPLQGLQKYVSTVTYAAGCKSVSCSDDTLIDAATQAAAAADAVVLLVGLDQSIEREGLDR 501

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
            +L LPGFQ +L+  V +A  G V LV+MS+  +D++FA N  KIK ILWVGYPG+ GG 
Sbjct: 502 ENLTLPGFQEKLVVDVVNATNGTVVLVVMSSSPIDVSFAVNKSKIKGILWVGYPGQAGGD 561

Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPF 601
           A+A V+FG YNP GR P TWY   Y  ++P T M +R     NFPGRTY+F+ G  +Y F
Sbjct: 562 AVAQVMFGDYNPAGRSPFTWYPQEYAHQVPMTDMNMRANSTANFPGRTYRFYAGNTLYKF 621

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV-GTNKPP---CAAVLIDDVKCKD 657
           G+GLSY+ F   + S P ++ +K + D +   I  T   T + P     A+ I  + C +
Sbjct: 622 GHGLSYSTFSNFIISGPSTLLLKTNSDLKPDIILSTHNSTEEHPFINSQAMDITTLNCTN 681

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPPG---IAGTHIKQVIGYERVFIAAGQSAKVGFT 714
              +  + V N G + G  VV+V+ KPP    + G    Q++G+ RV +  G++  V   
Sbjct: 682 SLLSLILGVRNNGPVSGDHVVLVFWKPPNSSEVTGAANVQLVGFSRVEVNRGKTQNVTLE 741

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVGE 743
           ++ CK L +VD+     L +G H   +G 
Sbjct: 742 IDVCKRLSLVDSEGKRKLVTGQHIFTIGS 770


>gi|242077366|ref|XP_002448619.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
 gi|241939802|gb|EES12947.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
          Length = 767

 Score =  748 bits (1932), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/741 (50%), Positives = 500/741 (67%), Gaps = 31/741 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+ + +C+       RA DLV R+TL EKV  + D    +PRLG+PLYEWWSEALHGVS+
Sbjct: 43  LASYGFCNRSASASARAADLVSRLTLAEKVGFLVDKQAALPRLGIPLYEWWSEALHGVSY 102

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G      PGT F S VP ATSFP  ILT ASFN +L++ IG+ VS EARAM+N+G AGL
Sbjct: 103 VG------PGTRFSSLVPAATSFPQPILTAASFNATLFRAIGEVVSNEARAMHNVGLAGL 156

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFWSPNIN+ RDPRWGR  ETPGEDP +  +YA+ YV GLQD         S S  LK++
Sbjct: 157 TFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQDA-------GSGSGSLKVA 209

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYD+DNW+G +R+ F++ V++QD+ +TF  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 210 ACCKHYTAYDVDNWKGVERYTFNAVVSQQDLDDTFQPPFKSCVVDGNVASVMCSYNQVNG 269

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCAD  LL+  IRGDW  +GYI SDCDS+  +  +  +   T EDA A  +KAGLDL+
Sbjct: 270 KPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHY-TKTPEDAAAISIKAGLDLN 328

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG++    T+ AVQ GK++E+D+D ++   +I LMRLG+FDG P+   + NLG +++C  
Sbjct: 329 CGNFLAQHTVAAVQAGKLSESDVDRAITNNFITLMRLGFFDGDPRKLPFGNLGPSDVCTS 388

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            + ELA EAARQGIVLLKN +GALPL+  +IK+LA++GP+ANA+  MIGNYEGTPC+YT+
Sbjct: 389 SNQELAREAARQGIVLLKN-SGALPLSASSIKSLAVIGPNANASFTMIGNYEGTPCKYTT 447

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           P+ G  A    + Y PGC ++ C  NS+ + AA  AA +AD TV+V G D S+E E  DR
Sbjct: 448 PLQGLGANVATV-YQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSIERESLDR 506

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
             LLLPG Q +L++ VA+A++GP  LVIMS G  DI+FAK++ KI +ILWVGYPGE GG 
Sbjct: 507 TSLLLPGQQPQLVSAVANASRGPCILVIMSGGPFDISFAKSSDKIAAILWVGYPGEAGGA 566

Query: 545 AIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
           AIADV+FG +NP GRLP+TWY  ++ K+P   M +RP     +PGRTY+F+ G  VY FG
Sbjct: 567 AIADVLFGHHNPSGRLPVTWYPESFTKVPMIDMRMRPDASTGYPGRTYRFYTGDTVYAFG 626

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
            GLSYT F + + S+PK V ++L +   C             C +V  +   C+   F  
Sbjct: 627 DGLSYTSFAHHLVSAPKQVALQLAEGHTCL---------TEQCPSVEAEGAHCEGLAFDV 677

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
            + V N G M G+  V ++S PP +     K ++G+E+V +  GQ+  V F ++ CK L 
Sbjct: 678 HLRVRNAGDMSGAHTVFLFSSPPAVHNAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDLS 737

Query: 723 IVDNAANSLLASGAHTILVGE 743
           +VD   N  +A G HT+ VG+
Sbjct: 738 VVDELGNRKVALGNHTLHVGD 758


>gi|357130854|ref|XP_003567059.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
           distachyon]
          Length = 779

 Score =  748 bits (1931), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 374/747 (50%), Positives = 486/747 (65%), Gaps = 31/747 (4%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           +  P+C   LP   RA+DLV R+T  EKV+ + + A GVPRLG+  YEWWSEALHGVS  
Sbjct: 36  TRLPFCRQALPPRARARDLVARLTRAEKVRLLVNNAAGVPRLGVEGYEWWSEALHGVSDT 95

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
           G      PG  F    PGAT+FP VI T ASFN SLW+ IG+ VS E RA+YN   AGLT
Sbjct: 96  G------PGVRFGGAFPGATAFPQVIGTAASFNASLWELIGRAVSDEGRAIYNGRQAGLT 149

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
           FWSPN+N+ RDPRWGR  ETPGEDP V GRYA  YVRGLQ            +  LK +A
Sbjct: 150 FWSPNVNIFRDPRWGRGQETPGEDPAVSGRYAAAYVRGLQQ---------QHAGRLKTAA 200

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKH+ AYDLD W G DRFHF++ VT QD+++TF  PF  CV EG  ++VMCSYN+VNG+
Sbjct: 201 CCKHFTAYDLDRWSGADRFHFNAIVTPQDLEDTFNAPFRACVVEGRAAAVMCSYNQVNGV 260

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           PTCAD   L  TIRG W   GYIVSDCDS+        +   T+EDAVA  L+AGLDLDC
Sbjct: 261 PTCADQGFLRGTIRGKWKLDGYIVSDCDSVDVFYREQHYTR-TREDAVAATLRAGLDLDC 319

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG---SPQYKNLGKNNICNPQ 366
           G +   +T  AV QGK+ EADID ++     V MRLG FDG   +  + +LG  ++C P 
Sbjct: 320 GPFLAQYTEAAVAQGKVKEADIDAAVVNTVTVQMRLGMFDGDVAAQPFGHLGPQHVCTPA 379

Query: 367 HIELAAEAARQGIVLLKNDNG---ALPLNTGNIK-TLALVGPHANATKAMIGNYEGTPCR 422
           H ELA EAA Q IVLLKN  G    LPL++ + + T+A+VGPH+ AT AMIGNY G PC 
Sbjct: 380 HRELALEAACQSIVLLKNGGGNNMRLPLSSHHRRGTVAVVGPHSEATVAMIGNYAGKPCA 439

Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDLSVEAEG 481
           YT+P+ G   Y++   +  GC D+ CQ +   I AA+DAA++ADATV+V GLD SVEAEG
Sbjct: 440 YTTPLQGVGRYARATVHQAGCTDVACQGSGQPIDAAVDAARHADATVVVVGLDQSVEAEG 499

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
            DR  LLLPG Q EL++ VA A+KGPV LV+MS G VDI FA+N+  + +ILW GYPG+ 
Sbjct: 500 LDRTTLLLPGRQAELVSAVARASKGPVILVLMSGGPVDIAFAQNDRNVAAILWAGYPGQA 559

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVV 598
           GG+AIADVIFG +NPGG+LP+TWY  +Y+ K P T+M +R  P   +PGRTY+F+ GP +
Sbjct: 560 GGQAIADVIFGHHNPGGKLPVTWYPEDYLRKAPMTNMAMRADPARGYPGRTYRFYAGPTI 619

Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
           +PFG+GLSYT+F + +A +P  + ++       R       T       V +   +C+  
Sbjct: 620 HPFGHGLSYTKFAHTLAHAPAHLTVRRAAGH--RTTAAINTTTASHLNDVRVAHAQCEGL 677

Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
             +  ++V+N+G  DG+  V VY+ PP   I G  ++Q++ +E+V +AAG  A+V   ++
Sbjct: 678 SVSVHVDVKNVGSRDGAHTVFVYASPPIAAIHGAPVRQLVAFEKVHVAAGAVARVKMGVD 737

Query: 717 ACKSLKIVDNAANSLLASGAHTILVGE 743
            C SL I D      +  G H +++GE
Sbjct: 738 VCGSLSIADQEGVRRIPIGEHRLMIGE 764


>gi|74355968|dbj|BAE44362.1| alpha-L-arabinofuranosidase [Raphanus sativus]
          Length = 780

 Score =  748 bits (1930), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/742 (50%), Positives = 501/742 (67%), Gaps = 26/742 (3%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+ + +C+  +    R  DLV R+TL EK+  +    +GV RLG+P YEWWSEALHGVS+
Sbjct: 49  LAAYGFCNTAIKIEYRVADLVARLTLQEKIGVLTSKLHGVARLGIPTYEWWSEALHGVSY 108

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G      PGT F  +VPGATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 109 VG------PGTRFSGQVPGATSFPQVILTAASFNVSLFQAIGKVVSTEARAMYNVGLAGL 162

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T+WSPN+N+ RDPRWGR  ETPGEDP +  +YA  YV+GLQ+ +       SD+  LK++
Sbjct: 163 TYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVKGLQETD------SSDANRLKVA 216

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYD+DNW+G +R+ F++ V +QD+ +T+  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 217 ACCKHYTAYDVDNWKGVERYSFNAVVNQQDLDDTYQPPFKSCVVDGNVASVMCSYNKVNG 276

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCADP LL+  IRG+W  +GYIVSDCDS+  + ++  +   T E+A A  + AGLDL+
Sbjct: 277 KPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQHY-TKTPEEAAAISINAGLDLN 335

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG +  + T  AV+ G + EA ID ++   ++ LMRLG+FDG P+   Y  LG  ++C P
Sbjct: 336 CGYFLGDHTEAAVKAGLVKEAAIDKAITNNFLTLMRLGFFDGDPKKQIYGGLGPKDVCTP 395

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            + ELAAEAARQGIVLLKN  GALPL+   IKTLA++GP+AN TK MIGNYEGTPC+YT+
Sbjct: 396 ANQELAAEAARQGIVLLKN-TGALPLSPKTIKTLAVIGPNANVTKTMIGNYEGTPCKYTT 454

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+ G  A +    Y PGC+++ C   + +  +   A  +DATV+V G D S+EAE +DRV
Sbjct: 455 PLQGL-AGTVHTTYLPGCSNVACAV-ADVAGSTKLAAASDATVLVIGADQSIEAESRDRV 512

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           DL LPG Q EL+ +VA AAKGPV LVIMS G  DI FAKN+ KI  ILWVGYPGE GG A
Sbjct: 513 DLNLPGQQQELVTQVAKAAKGPVFLVIMSGGGFDITFAKNDAKIAGILWVGYPGEAGGIA 572

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
            ADVIFG+YNP GRLP+TWY  +YV K+P T+M +RP   N +PGRTY+F+ G  VY FG
Sbjct: 573 TADVIFGRYNPSGRLPMTWYPQSYVEKVPMTNMNMRPDKSNGYPGRTYRFYTGETVYAFG 632

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDIN-YTVGTNKPPCAAVLIDDVKCKDYKFT 661
            GLSYT+F + +  +P+ V + L+++  CR     ++    P C   +          F 
Sbjct: 633 DGLSYTKFSHSLVKAPRLVSLSLEENHVCRSSECQSLNAIGPHCDNAV---SGTGGKAFE 689

Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
             I+V+N G  +G   V +++ PP + G+  K ++G+E++ +   + A V F ++ CK L
Sbjct: 690 VHIKVQNGGDREGIHTVFLFTTPPAVHGSPRKHLLGFEKIRLGKMEEAVVKFKVDVCKDL 749

Query: 722 KIVDNAANSLLASGAHTILVGE 743
            +VD      +  G H + VG+
Sbjct: 750 SVVDEVGKRKIGLGQHLLHVGD 771


>gi|356529243|ref|XP_003533205.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
          Length = 774

 Score =  747 bits (1928), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/739 (49%), Positives = 497/739 (67%), Gaps = 30/739 (4%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           F +C+  +P   R +DL+ R+TLPEK++ + + A  VPRLG+  YEWWSEALHGVS +G 
Sbjct: 48  FKFCNTHVPIHVRVQDLIARLTLPEKIRLVVNNAIAVPRLGIQGYEWWSEALHGVSNVG- 106

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
                PGT F    PGAT FP VI T ASFN+SLW++IG+ VS EARAMYN G AGLT+W
Sbjct: 107 -----PGTKFGGAFPGATMFPQVISTAASFNQSLWQEIGRVVSDEARAMYNGGQAGLTYW 161

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
           SPN+N+ RDPRWGR  ETPGEDP +  +YA +YV+GLQ         D     LK++ACC
Sbjct: 162 SPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKGLQG--------DGAGNRLKVAACC 213

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KHY AYDLDNW G DRFHF+++V++QD+++T+ +PF+ CV EG V+SVMCSYN+VNG PT
Sbjct: 214 KHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPFKACVLEGQVASVMCSYNQVNGKPT 273

Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
           CADP LL  TIRG W  +GYIVSDCDS+    ++  +   T E+A A  +KAGLDLDCG 
Sbjct: 274 CADPDLLRNTIRGQWGLNGYIVSDCDSVGVFFDNQHY-TRTPEEAAAEAIKAGLDLDCGP 332

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHI 368
           +    T  A+++G I+E D++ +L  L  V MRLG FDG P    + NLG  ++C P H 
Sbjct: 333 FLAIHTDSAIRKGLISENDLNLALANLITVQMRLGMFDGEPSTQPFGNLGPRDVCTPAHQ 392

Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
           +LA EAAR+ IVLL+N   +LPL+   ++ + ++GP+ +AT  MIGNY G  C YT+P+ 
Sbjct: 393 QLALEAARESIVLLQNKGNSLPLSPSRLRIVGVIGPNTDATVTMIGNYAGVACGYTTPLQ 452

Query: 429 GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
           G   Y K  +   GC  + C+ N +  AA   A+  DATV+V GLD ++EAE +DRV LL
Sbjct: 453 GIARYVKTAHQV-GCRGVACRGNELFGAAEIIARQVDATVLVMGLDQTIEAETRDRVGLL 511

Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
           LPG Q EL+ +VA AAKGPV LVIMS G VD++FAKNNPKI +ILWVGYPG+ GG AIAD
Sbjct: 512 LPGLQQELVTRVARAAKGPVILVIMSGGPVDVSFAKNNPKISAILWVGYPGQAGGTAIAD 571

Query: 549 VIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGL 605
           VIFG  NPGGRLP+TWY   Y+ K+P T+M +R  P   +PGRTY+F+ GPVV+PFG+GL
Sbjct: 572 VIFGATNPGGRLPMTWYPQGYLAKVPMTNMDMRPNPATGYPGRTYRFYKGPVVFPFGHGL 631

Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT-FQI 664
           SY++F   +A +PK V +++   Q     N T+ +      AV +    C D   T F +
Sbjct: 632 SYSRFSQSLALAPKQVSVQILSLQAL--TNSTLSSK-----AVKVSHANCDDSLETEFHV 684

Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIV 724
           +V+N G MDG+  ++++SKPP    + IKQ++ + +  + AG   ++   +++CK L +V
Sbjct: 685 DVKNEGSMDGTHTLLIFSKPPPGKWSQIKQLVTFHKTHVPAGSKQRLKVNVHSCKHLSVV 744

Query: 725 DNAANSLLASGAHTILVGE 743
           D      + +G H + +G+
Sbjct: 745 DQFGVRRIPTGEHELHIGD 763


>gi|224070626|ref|XP_002303181.1| predicted protein [Populus trichocarpa]
 gi|222840613|gb|EEE78160.1| predicted protein [Populus trichocarpa]
          Length = 773

 Score =  744 bits (1922), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/742 (50%), Positives = 501/742 (67%), Gaps = 30/742 (4%)

Query: 8   KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
            L+   +C+  +   +R  DLV+R+TL EK+  + + A  V RLG+P YEWWSEALHGVS
Sbjct: 47  SLASLGFCNTSIGINDRVVDLVKRLTLQEKIVFLVNSAGNVSRLGIPKYEWWSEALHGVS 106

Query: 68  FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
           ++G      PGTHF  +V GATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AG
Sbjct: 107 YVG------PGTHFSDDVAGATSFPQVILTAASFNTSLFEAIGKVVSTEARAMYNVGLAG 160

Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
           LTFWSPNIN+ RDPRWGR  ETPGEDP +  +Y   YV+GLQ  +      D D   LK+
Sbjct: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSCYVKGLQQRD------DGDPDKLKV 214

Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
           +ACCKHY AYDLDNW+G+DR+HF++ VT+QDM +TF  PF+ CV +G+V+SVMCSYN+VN
Sbjct: 215 AACCKHYTAYDLDNWKGSDRYHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQVN 274

Query: 248 GIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL 307
           G PTCADP LL+  IRG+WN +GYIV+DCDS+    +S  +    +E A A +L AG+DL
Sbjct: 275 GKPTCADPDLLSGVIRGEWNLNGYIVTDCDSLDVFYKSQNYTKTPEEAAAAAIL-AGVDL 333

Query: 308 DCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICN 364
           +CG +    T  AV+ G + E  ID ++   +  LMRLG+FDG P    Y  LG  ++C 
Sbjct: 334 NCGSFLGQHTEAAVKGGLVNEHAIDIAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCT 393

Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
            ++ ELA EAARQGIVLLKN  G+LPL+   IK LA++GP+AN TK MIGNYEGTPC+YT
Sbjct: 394 AENQELAREAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 453

Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           +P+ G  A S    Y PGC+++ C + + +  A   A  ADATV+V G DLS+EAE +DR
Sbjct: 454 TPLQGL-AASVATTYLPGCSNVAC-STAQVDDAKKLAAAADATVLVMGADLSIEAESRDR 511

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
           VD+LLPG Q  LI  VA+ + GPV LVIMS G +D++FA+ N KI SILWVGYPGE GG 
Sbjct: 512 VDVLLPGQQQLLITAVANVSCGPVILVIMSGGGMDVSFARTNDKITSILWVGYPGEAGGA 571

Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPF 601
           AIAD+IFG YNP GRLP+TWY  +YV K+P T+M +R  P N +PGRTY+F+ G  VY F
Sbjct: 572 AIADIIFGYYNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPSNGYPGRTYRFYTGETVYSF 631

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           G GLSY+QF +++  +P+ V + L++   C         +   C +V+  +  C++  F 
Sbjct: 632 GDGLSYSQFTHELIQAPQLVYVPLEESHVC---------HSSECQSVVASEQTCQNSTFD 682

Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
             + V+N G + GS  V ++S PP +  +  K ++G+E+VF+ A     V F ++ CK L
Sbjct: 683 MLLRVKNEGTISGSHTVFLFSSPPAVHNSPQKHLVGFEKVFLNAQTGRHVRFKVDICKDL 742

Query: 722 KIVDNAANSLLASGAHTILVGE 743
            +VD   +  +A G H + VG 
Sbjct: 743 SVVDELGSKKVALGEHVLHVGS 764


>gi|357449039|ref|XP_003594795.1| Beta xylosidase [Medicago truncatula]
 gi|355483843|gb|AES65046.1| Beta xylosidase [Medicago truncatula]
          Length = 762

 Score =  743 bits (1919), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 365/736 (49%), Positives = 495/736 (67%), Gaps = 31/736 (4%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           + +C+ ++P   R +DL+ R+ LPEK++ + + A  VPRLG+  YEWWSEALHGVS +G 
Sbjct: 39  YKFCNTRVPIHARVQDLIGRLALPEKIRLVVNNAIAVPRLGIQGYEWWSEALHGVSNVG- 97

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
                PGT F      ATSFP VI T ASFN+SLW +IG+ VS EARAMYN G AGLTFW
Sbjct: 98  -----PGTKFGGAFSAATSFPQVITTAASFNQSLWLEIGRIVSDEARAMYNGGAAGLTFW 152

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
           SPN+N+ RDPRWGR  ETPGEDP V G+YA +YV+GLQ         +     LK++ACC
Sbjct: 153 SPNVNIFRDPRWGRGQETPGEDPTVAGKYAASYVQGLQG--------NGAGNRLKVAACC 204

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KHY AYDLDNW G DRFHF+++V++QD+ +T+ +PF+ CV +G V+SVMCSYN+VNG PT
Sbjct: 205 KHYTAYDLDNWNGVDRFHFNAKVSKQDLADTYDVPFKACVRDGKVASVMCSYNQVNGKPT 264

Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
           CADP+LL  TIRG+W  +GYIVSDCDS+  + ++  +   T E A A  +KAGLDLDCG 
Sbjct: 265 CADPELLRNTIRGEWGLNGYIVSDCDSVGVLYDNQHY-TRTPEQAAAAAIKAGLDLDCGP 323

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICNPQHIEL 370
           +    T GA++QG I+E D++ +L  L  V MRLG FDG  Q Y NLG  ++C P H ++
Sbjct: 324 FLALHTDGAIKQGLISENDLNLALANLITVQMRLGMFDGDAQPYGNLGTRDVCLPSHNDV 383

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A EAARQGIVLL+N   ALPL+    +T+ ++GP+++ T  MIGNY G  C YT+P+ G 
Sbjct: 384 ALEAARQGIVLLQNKGNALPLSPTRYRTVGVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 443

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
             Y K I+ A GC D+ C  N +   +   A+ ADATV+V GLD S+EAE +DR  LLLP
Sbjct: 444 ARYVKTIHQA-GCKDVGCGGNQLFGLSEQVARQADATVLVMGLDQSIEAEFRDRTGLLLP 502

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
           G Q EL+++VA AA+GPV LV+MS G +D+ FAKN+PKI +ILWVGYPG+ GG AIADVI
Sbjct: 503 GHQQELVSRVARAARGPVILVLMSGGPIDVTFAKNDPKISAILWVGYPGQSGGTAIADVI 562

Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSY 607
           FG+ NP GRLP TWY  +YV K+P T+M +R  P   +PGRTY+F+ GPVV+PFG+GLSY
Sbjct: 563 FGRTNPSGRLPNTWYPQDYVRKVPMTNMDMRANPATGYPGRTYRFYKGPVVFPFGHGLSY 622

Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
           ++F + +A +PK V ++           +T  +NK    A+ +    C + +  F ++V+
Sbjct: 623 SRFTHSLALAPKQVSVQFTTPLTQA---FTNSSNK----AMKVSHANCDELEVGFHVDVK 675

Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           N G MDG+  ++VYSK P      +KQ++ + + ++ AG   +V   ++ C  L  VD  
Sbjct: 676 NEGSMDGAHTLLVYSKAP----NGVKQLVNFHKTYVPAGSKTRVKVGVHVCNHLSAVDEF 731

Query: 728 ANSLLASGAHTILVGE 743
               +  G H + +G+
Sbjct: 732 GVRRIPMGEHELQIGD 747


>gi|297745522|emb|CBI40687.3| unnamed protein product [Vitis vinifera]
          Length = 751

 Score =  743 bits (1917), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/740 (51%), Positives = 489/740 (66%), Gaps = 53/740 (7%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L  F +C+  L    R  DLV+R+TL EK+  + + A  V RLG+P YEWWSEALHGVS+
Sbjct: 49  LGQFGFCNTSLETAARVADLVKRLTLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSY 108

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G      PGTHF+S VPGATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 109 VG------PGTHFNSVVPGATSFPQVILTAASFNASLFEAIGKAVSTEARAMYNVGLAGL 162

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFWSPN+N+ RDPRWGR  ETPGEDP +  +YA  YVRGLQ  +      D     LK++
Sbjct: 163 TFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVRGLQQSD------DGSPDRLKVA 216

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYDLDNW+G DRFHF++ VT+QDM +TF  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 217 ACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNG 276

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            P CADP LL+  +RG+W  +GYIVSDCDS+     S  +   T E+A A+ + AGLDL+
Sbjct: 277 KPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYNSQHY-TKTPEEAAAKAILAGLDLN 335

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG +    T  AV+ G + E+ +D ++   +  LMRLG+FDG+P    Y  LG  ++C  
Sbjct: 336 CGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGPKDVCTS 395

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H ELA EAARQGIVLLKN  G+LPL+   IKTLA++GP+AN TK MIGNYEGTPC+YT+
Sbjct: 396 EHQELAREAARQGIVLLKNSKGSLPLSPTAIKTLAVIGPNANVTKTMIGNYEGTPCKYTT 455

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+ G  A      Y PGC+++ C   + I  A   A  ADATV++ G+D S+EAEG+DRV
Sbjct: 456 PLQGLTALVAT-TYLPGCSNVAC-GTAQIDEAKKIAAAADATVLIVGIDQSIEAEGRDRV 513

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           ++ LPG Q  LI +VA A+KG V LV+MS G  DI+FAKN+ KI SILWVGYPGE GG A
Sbjct: 514 NIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISFAKNDDKITSILWVGYPGEAGGAA 573

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
           IADVIFG YNP GRLP+TWY  +YV K+P T+M +R  P + +PGRTY+F+ G  +Y FG
Sbjct: 574 IADVIFGFYNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPASGYPGRTYRFYTGETIYTFG 633

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
            GLSYTQF + ++         +D  Q+                        C++  F  
Sbjct: 634 DGLSYTQFNHHLS---------VDAVQE-----------------------SCQNLVFDI 661

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
            + V N G + GS  V ++S PP +  +  K ++G+E+VF+ A   A V F ++ CK L 
Sbjct: 662 HLRVNNAGNISGSHTVFLFSSPPSVHNSPQKHLLGFEKVFVTAKAKALVRFKVDVCKDLS 721

Query: 723 IVDNAANSLLASGAHTILVG 742
           IVD      +A G H + VG
Sbjct: 722 IVDELGTRKVALGLHVLHVG 741


>gi|226531269|ref|NP_001145980.1| uncharacterized protein LOC100279508 precursor [Zea mays]
 gi|219885199|gb|ACL52974.1| unknown [Zea mays]
 gi|413920228|gb|AFW60160.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 794

 Score =  741 bits (1914), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 375/766 (48%), Positives = 487/766 (63%), Gaps = 33/766 (4%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           +  P+C   LP   RA+DLV R+T  EKV+ + + A GVPRLG+  YEWWSEALHGVS  
Sbjct: 36  ASLPFCRQSLPLRARARDLVSRLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDT 95

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
           G      PG  F    PGAT+FP VI T AS N +LW+ +G+ VS EARAMYN G AGLT
Sbjct: 96  G------PGVRFGGAFPGATAFPQVIGTAASLNATLWELVGRAVSDEARAMYNGGRAGLT 149

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEY--HRDSDSRPLKI 187
           FWSPN+N+ RDPRWGR  ETPGEDP V  RYA  YVRGLQ         HR+     LK+
Sbjct: 150 FWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQQPYAAPNGGHRNR----LKL 205

Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
           +ACCKH+ AYDLD W G DRFHF++ V  QD+++TF +PF  CV +G  +SVMCSYN+VN
Sbjct: 206 AACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLEDTFNVPFRACVEDGRAASVMCSYNQVN 265

Query: 248 GIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL 307
           G+PTCAD   L  TIRG W   GYIVSDCDS+        +   T EDA A  L+AGLDL
Sbjct: 266 GVPTCADAAFLRGTIRGRWGLDGYIVSDCDSVDVFFRDQHYTR-TPEDAAAATLRAGLDL 324

Query: 308 DCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICN 364
           DCG +   +   AV  GK+A+AD+D +L     V MRLG FDG P    +  LG  ++C 
Sbjct: 325 DCGPFLALYAGSAVAAGKVADADVDAALLNTVTVQMRLGMFDGDPAAGPFGRLGPADVCT 384

Query: 365 PQHIELAAEAARQGIVLLKNDNGA------LPLNTGNIKTLALVGPHANATKAMIGNYEG 418
            +H +LA +AARQG+VLLKN  GA      LPL     + +A+VGPHA+AT AMIGNY G
Sbjct: 385 REHQDLALDAARQGVVLLKNRRGARHNRDVLPLRPAAHRVVAVVGPHADATVAMIGNYAG 444

Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
            PCRYT+P+ G  AY+  + +  GC D+ C+ N  I AA++AA+ ADATV+VAGLD  VE
Sbjct: 445 KPCRYTTPLQGVAAYAARVAHQAGCTDVACRGNQPIAAAVEAARQADATVVVAGLDQRVE 504

Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
           AEG DR  LLLPG Q ELI+ VA A+KGPV LV+MS G +DI FA+N+P+I  ILWVGYP
Sbjct: 505 AEGLDRTTLLLPGRQAELISAVAKASKGPVILVLMSGGPIDIAFAQNDPRIDGILWVGYP 564

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDG 595
           G+ GG+AIADVIFG +NPG +LP+TWY  +Y+ K+P T+M +R  P   +PGRTY+F+ G
Sbjct: 565 GQAGGQAIADVIFGHHNPGAKLPVTWYHQDYLQKVPMTNMAMRANPARGYPGRTYRFYTG 624

Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKL--DKDQQCRDINYTVGTNKPPCAAVLIDDV 653
           P +YPFG+GLSYTQF + +A +P  + ++L           +    T   P  AV +   
Sbjct: 625 PTIYPFGHGLSYTQFTHTLAHAPTQLTVRLSGSGHSAASAASLLNATLARPVRAVRVAHA 684

Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI------AGTHIKQVIGYERVFIAAGQ 707
           +C+       ++V N+G  DG+  V+VY   P        A    +Q++ +E+V + AG 
Sbjct: 685 RCEGLTVPVHVDVSNVGDRDGAHAVLVYHAAPSPSHAAPGADAPARQLVAFEKVHVPAGG 744

Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQ 753
            A+V   +  C  L + D      +  G H +++GE    VS  ++
Sbjct: 745 VARVEMRIGVCDRLSVADRNGVRRVPVGEHRLMIGELTHSVSLGVE 790


>gi|225431898|ref|XP_002276351.1| PREDICTED: beta-D-xylosidase 1-like [Vitis vinifera]
          Length = 770

 Score =  741 bits (1913), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 374/739 (50%), Positives = 510/739 (69%), Gaps = 29/739 (3%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
           + P+C   LP  ERA+DLV R+TL EK++ + + A  VPRLG+  YEWWSEALHGVS +G
Sbjct: 38  NLPFCRVSLPIQERARDLVGRLTLQEKIRLLVNNAIDVPRLGIKGYEWWSEALHGVSNVG 97

Query: 71  RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
                 PGT F    PGATSFP VI T ASFN SLW++IG+ VS EARAMYN G AGLT+
Sbjct: 98  ------PGTKFGGSFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAMYNGGMAGLTY 151

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           WSPN+N+ RDPRWGR  ETPGEDP V  +YA  YVRGLQ        RD     LK++AC
Sbjct: 152 WSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQGNA-----RDR----LKVAAC 202

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKHY AYDLD+W G DRFHF++RV++QD+++T+ +PF+ CV EG+V+SVMCSYN+VNG P
Sbjct: 203 CKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPFKACVVEGNVASVMCSYNQVNGKP 262

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
           TCADP LL  TIRG+W  +GYIVSDCDS+    +   +   T E+A A  +KAGLDLDCG
Sbjct: 263 TCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQHY-TATPEEAAAVAIKAGLDLDCG 321

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
            +    T  A++ GK+ EAD++ +L     V MRLG FDG P    Y NLG  ++C P H
Sbjct: 322 PFLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMFDGEPSAQPYGNLGPRDVCTPAH 381

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
            +LA EAARQGIVL++N   ALPL+T   +T+A++GP+++ T+ MIGNY G  C YT+P+
Sbjct: 382 QQLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPNSDVTETMIGNYAGVACGYTTPL 441

Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
            G   Y++ I+ A GC+ + C+++    AA+ AA+ ADATV+V GLD S+EAE +DRVD+
Sbjct: 442 QGIGRYARTIHQA-GCSGVACRDDQQFGAAVAAARQADATVLVMGLDQSIEAEFRDRVDI 500

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
           LLPG Q EL++KVA A++GP  LV+MS G +D++FAKN+P+I +I+WVGYPG+ GG AIA
Sbjct: 501 LLPGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKNDPRIAAIIWVGYPGQAGGTAIA 560

Query: 548 DVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
           DV+FG+ NPGG+LP+TWY  +Y+ K P T+M +R  P   +PGRTY+F++GPVV+PFG+G
Sbjct: 561 DVLFGRTNPGGKLPVTWYPQSYLRKAPMTNMAMRAIPSRGYPGRTYRFYNGPVVFPFGHG 620

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
           LSY+ F + +A +P +V + L   Q  +  N T+ ++     A+ I    C      F I
Sbjct: 621 LSYSTFAHSLAQAPTTVSVSLASLQTIK--NSTIVSS----GAIRISHANCNTQPLGFHI 674

Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIV 724
           +V+N G MDGS  ++++S PP    +  K+++ +E+V + AG   +V F ++ CK L +V
Sbjct: 675 DVKNTGTMDGSHTLLLFSTPPPGTWSPNKRLLAFEKVHVGAGSQERVRFDVHVCKHLSVV 734

Query: 725 DNAANSLLASGAHTILVGE 743
           D+     +  G H   +G+
Sbjct: 735 DHFGIHRIPMGEHHFHIGD 753


>gi|255556320|ref|XP_002519194.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
 gi|223541509|gb|EEF43058.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
          Length = 782

 Score =  741 bits (1912), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/753 (49%), Positives = 504/753 (66%), Gaps = 35/753 (4%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
           +  +C A LP   R +DL+ R+TL EK++ + + A  VPRLG+  YEWWSEALHGVS +G
Sbjct: 53  NLKFCRANLPIHVRVRDLISRLTLQEKIRLLVNNAAAVPRLGIQGYEWWSEALHGVSNVG 112

Query: 71  RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
                 PG  F    PGATSFP VI T ASFN+SLW++IG+ VS EARAMYN G AGLT+
Sbjct: 113 ------PGVKFGGAFPGATSFPQVITTAASFNQSLWEQIGRVVSDEARAMYNGGLAGLTY 166

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           WSPN+NV RDPRWGR  ETPGEDP + G+YA +YVRGLQ   G++         LK++AC
Sbjct: 167 WSPNVNVFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQSSTGLK---------LKVAAC 217

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKHY AYDLDNW G DR+HF++RV++QD+++T+ +PF+ CV EG V+SVMCSYN+VNG P
Sbjct: 218 CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYDVPFKACVVEGKVASVMCSYNQVNGKP 277

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
           TCADP LL  TIRG W  +GYIVSDCDS+  + ++  +   T E+A A  +KAGLDLDCG
Sbjct: 278 TCADPILLKNTIRGQWGLNGYIVSDCDSVGVLYDNQHY-TSTPEEAAAATIKAGLDLDCG 336

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
            +    T  AV++G + E D++ +L     V MRLG FDG P    Y NLG  ++C P H
Sbjct: 337 PFLAIHTENAVKKGLLVEEDVNLALANTITVQMRLGMFDGEPSAHPYGNLGPRDVCTPAH 396

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
            ELA EAARQGIVLL+N   ALPL++    T+A++GP+++ T  MIGNY G  C+YTSP+
Sbjct: 397 QELALEAARQGIVLLENRGQALPLSSSRHHTIAVIGPNSDVTVTMIGNYAGIACKYTSPL 456

Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
            G   Y+K + +  GC D+ C +N    AA  AA+ ADATV+V GLD S+EAE +DRV L
Sbjct: 457 QGISRYAKTL-HQNGCGDVACHSNQQFGAAEAAARQADATVLVMGLDQSIEAEFRDRVGL 515

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
           LLPG Q EL+++VA A++GP  LV+MS G +D++FAKN+P++ +ILW GYPG+ GG AIA
Sbjct: 516 LLPGHQQELVSRVARASRGPTILVLMSGGPIDVSFAKNDPRVGAILWAGYPGQAGGAAIA 575

Query: 548 DVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
           DV+FG  NPGG+LP+TWY   Y+ K+P T+M +R  P   +PGRTY+F+ G VV+PFG+G
Sbjct: 576 DVLFGTTNPGGKLPMTWYPQGYLAKVPMTNMGMRPDPATGYPGRTYRFYKGNVVFPFGHG 635

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
           +SYT F + +  +PK V + +        +N T+ +      A+ +  + C+       I
Sbjct: 636 MSYTSFSHSLTQAPKEVSLPI---TNLYALNTTISSK-----AIRVSHINCQT-SLGIDI 686

Query: 665 EVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKI 723
            V+N G MDG+  ++V+S PP G   +  KQ+IG+E+V + AG   +V   ++ CK L  
Sbjct: 687 NVKNTGTMDGTHTLLVFSSPPSGEKESSNKQLIGFEKVDLVAGSQIQVKIDIHVCKHLSA 746

Query: 724 VDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
           VD      +  G H I +G+    +S  LQ N+
Sbjct: 747 VDRFGIRRIPIGDHHIYIGDLKHSIS--LQANM 777


>gi|357166259|ref|XP_003580652.1| PREDICTED: beta-D-xylosidase 4-like [Brachypodium distachyon]
          Length = 774

 Score =  739 bits (1908), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 374/743 (50%), Positives = 505/743 (67%), Gaps = 32/743 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           ++ + +CD       RA DLV R+TL +KV  + +    + RLG+P YEWWSEALHGVS+
Sbjct: 47  VAGYAFCDRAKSASARAADLVSRLTLADKVGFLVNKQPALARLGIPAYEWWSEALHGVSY 106

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G      PGT F   VPGATSFP  ILT ASFN SL++ IG+ VS EARAM+N+G AGL
Sbjct: 107 VG------PGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSNEARAMHNVGLAGL 160

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFWSPNIN+ RDPRWGR  ETPGEDP +  RYA+ YV GLQD        D+D  PLK++
Sbjct: 161 TFWSPNINIFRDPRWGRGQETPGEDPLLASRYAVGYVSGLQDAGA-----DADG-PLKVA 214

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYD+DNW+G +R+ FD++V++QD+ +TF  PF+ CV +G V+SVMCSYN+VNG
Sbjct: 215 ACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVIDGKVASVMCSYNKVNG 274

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCAD  LL+  IRGDW  +GYIVSDCDS+  ++ S +    T E+A A  +K+GLDL+
Sbjct: 275 KPTCADKDLLSGVIRGDWKLNGYIVSDCDSVD-VLYSQQHYTKTPEEAAAITIKSGLDLN 333

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CGD+    T+ AVQ G ++E+D+D ++   +I+LMRLG+FDG P+   Y +LG  ++C  
Sbjct: 334 CGDFLAKHTVAAVQAGNLSESDVDRAITNNFIMLMRLGFFDGDPRKLAYGSLGPKDVCTS 393

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            + ELA E ARQGIVLLKND GALPL+  +IK++A++GP+ANA+  MIGNYEGTPC+YT+
Sbjct: 394 SNQELARETARQGIVLLKND-GALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTT 452

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           P+ G       + Y PGC+++ C  NS+ + AA  AA +AD TV+V G D S+E E  DR
Sbjct: 453 PLHGLGNNVATV-YQPGCSNVGCSGNSLQLSAATAAAASADVTVLVVGADQSIEREALDR 511

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
             LLLPG Q +LI+ VA+A+KG V LV+MS G  DI+FAK + KI +ILWVGYPGE GG 
Sbjct: 512 TSLLLPGQQPDLISAVANASKGHVILVVMSGGPFDISFAKASDKISAILWVGYPGEAGGA 571

Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPF 601
           AIAD+IFGKYNP GRLP+TWY A++  K+P T M +RP N+  +PGRTY+F+ G  V+ F
Sbjct: 572 AIADIIFGKYNPSGRLPVTWYPASFADKVPMTDMRMRPDNSTGYPGRTYRFYTGETVFAF 631

Query: 602 GYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
           G GLSYT   +  VA+ P  V ++L +   C         +   CA+V      C+   F
Sbjct: 632 GDGLSYTTMSHNLVAAPPSEVSMQLAEGHAC---------HTKECASVEAAGDHCEGMAF 682

Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
             ++ V N G+M G+  V+++S PP +     K ++G+E++ +  GQ+    F ++ CK 
Sbjct: 683 EVRLRVHNTGEMAGAHTVLLFSSPPAVHNAPAKHLLGFEKLNLEPGQAGVAAFKVDVCKD 742

Query: 721 LKIVDNAANSLLASGAHTILVGE 743
           L +VD   N  +A G HT+ VG+
Sbjct: 743 LSVVDELGNRKVALGGHTLHVGD 765


>gi|413919688|gb|AFW59620.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 773

 Score =  738 bits (1906), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 368/741 (49%), Positives = 496/741 (66%), Gaps = 31/741 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+ + +C+       RA DLV R+TL EKV  + D    +PRLG+PLYEWWSEALHGVS+
Sbjct: 49  LASYGFCNRSAAAAARAADLVSRLTLAEKVGFLVDKQAALPRLGVPLYEWWSEALHGVSY 108

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G      PGT F   VPGATSFP  ILT ASFN +L++ IG+ VS EARAM+N+G AGL
Sbjct: 109 VG------PGTRFSPLVPGATSFPQPILTAASFNATLFRAIGEVVSNEARAMHNVGLAGL 162

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFWSPNIN+ RDPRWGR  ETPGEDP +  +YA+ YV GLQ          S +  LK++
Sbjct: 163 TFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQGAV-------SGAGALKVA 215

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYD+DNW+G +R+ FD+ V++QD+ +TF  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 216 ACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVVDGNVASVMCSYNQVNG 275

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCAD  LL+  IRGDW  +GYI SDCDS+  +  +  +   T EDA A  +KAGLDL+
Sbjct: 276 KPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHY-TKTPEDAAAISIKAGLDLN 334

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG +    T+ AVQ GK++E+D+D ++    + LMRLG+FDG P+   + NLG +++C P
Sbjct: 335 CGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNLGPSDVCTP 394

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            + ELA EAARQGIVLLKN  G LPL+  +IK++A++GP+ANA+  MIGNYEGTPC+YT+
Sbjct: 395 SNQELAREAARQGIVLLKN-TGKLPLSATSIKSMAVIGPNANASFTMIGNYEGTPCKYTT 453

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           P+ G  A    + Y PGC ++ C  NS+ + AA  AA +AD TV+V G D S+E E  DR
Sbjct: 454 PLQGLGANVATV-YQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSIERESLDR 512

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
             LLLPG Q +L++ VA+A+ GP  LV+MS G  DI+FAK++ KI +ILWVGYPGE GG 
Sbjct: 513 TSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVGYPGEAGGA 572

Query: 545 AIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
           AIADV+FG +NP GRLP+TWY  ++ K+P T M +R  P   +PGRTY+F+ G  VY FG
Sbjct: 573 AIADVLFGYHNPSGRLPVTWYPESFTKVPMTDMRMRPDPSTGYPGRTYRFYTGDTVYAFG 632

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
            GLSYT F + + S+PK + ++L +   C             C +V  +   C+   F  
Sbjct: 633 DGLSYTSFAHHLVSAPKQLALQLAEGHACL---------TEQCPSVEAEGAHCEGLAFDV 683

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
            + V N G+  G   V ++S PP +     K ++G+E+V +  GQ+  V F ++ CK L 
Sbjct: 684 HLRVRNAGERSGGHTVFLFSSPPAVHNAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDLS 743

Query: 723 IVDNAANSLLASGAHTILVGE 743
           +VD   N  +A G+HT+ VG+
Sbjct: 744 VVDELGNRKVALGSHTLHVGD 764


>gi|86553064|gb|AAS17751.2| beta xylosidase [Fragaria x ananassa]
          Length = 772

 Score =  736 bits (1901), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/737 (49%), Positives = 490/737 (66%), Gaps = 29/737 (3%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           F +C  ++P   R +DL+ R+TL EK++ + + A  VPRLG+  YEWWSEALHGVS +G 
Sbjct: 44  FKFCRTRVPVHVRVQDLIGRLTLQEKIRLLVNNAIAVPRLGIQGYEWWSEALHGVSNVG- 102

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
                PGT F    PGATSFP VI T ASFN+SLW++IGQ VS EARAMYN G AGLT+W
Sbjct: 103 -----PGTKFGGAFPGATSFPQVITTAASFNQSLWQEIGQVVSDEARAMYNGGQAGLTYW 157

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
           SPN+N+ RDPRWGR  ETPGEDP +  +YA +YV+GLQ         D     LK++ACC
Sbjct: 158 SPNVNIFRDPRWGRGQETPGEDPVLSAKYAASYVKGLQG--------DGAGNRLKVAACC 209

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KHY AYDLDNW G DRFHF++RV++QD+ +T+ +PF  CV EG V+SVMCSYN+VNG PT
Sbjct: 210 KHYTAYDLDNWNGVDRFHFNARVSKQDLADTYDVPFRGCVLEGKVASVMCSYNQVNGKPT 269

Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
           CADP LL  TIRG+W  +GYIVSDCDS+    +   +   T E+A A  +KAGLDLDCG 
Sbjct: 270 CADPDLLKNTIRGEWKLNGYIVSDCDSVGVFYDQQHY-TRTPEEAAAEAIKAGLDLDCGP 328

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQHI 368
           +    T GA++ G + E D+D +L     V MRLG FDG P   QY NLG  ++C P H 
Sbjct: 329 FLAIHTEGAIKAGLLPEIDVDYALANTLTVQMRLGMFDGEPSAQQYGNLGPRDVCTPAHQ 388

Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
           ELA EA+RQGIVLL+N+   LPL+T   +T+A+VGP+++ T+ MIGNY G  C YT+P+ 
Sbjct: 389 ELALEASRQGIVLLQNNGHTLPLSTVRHRTVAVVGPNSDVTETMIGNYAGVACGYTTPLQ 448

Query: 429 GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
           G   Y+K I +  GC ++ C  N +  AA  AA+ ADATV+V GLD S+EAE +DR DL+
Sbjct: 449 GIGRYTKTI-HQQGCTNVACTTNQLFGAAEAAARQADATVLVMGLDQSIEAEFRDRTDLV 507

Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
           +PG Q EL+++VA A++GP  LV+MS G +D++FAKN+PKI +I+WVGYPG+ GG A+AD
Sbjct: 508 MPGHQQELVSRVARASRGPTVLVLMSGGPIDVSFAKNDPKIGAIIWVGYPGQAGGTAMAD 567

Query: 549 VIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSY 607
           V+FG  NP G+LP+TWY  +YV K+P T+M +R    +PGRTY+F+ GPVV+PFG GLSY
Sbjct: 568 VLFGTTNPSGKLPMTWYPQDYVSKVPMTNMAMRAGRGYPGRTYRFYKGPVVFPFGLGLSY 627

Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPP-CAAVLIDDVKCKDYKFTFQIEV 666
           T F + +A  P SV + L         + +  TN     +AV +    C        + V
Sbjct: 628 TTFAHSLAQVPTSVSVPL--------TSLSATTNSTMLSSAVRVSHTNCNPLSLALHVVV 679

Query: 667 ENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDN 726
           +N G  DG+  ++V+S PP       KQ++G+ +V I AG   +V   ++ CK L +VD 
Sbjct: 680 KNTGARDGTHTLLVFSSPPSGKWAANKQLVGFHKVHIVAGSHKRVKVDVHVCKHLSVVDQ 739

Query: 727 AANSLLASGAHTILVGE 743
                +  G H + +G+
Sbjct: 740 FGIRRIPIGEHKLQIGD 756


>gi|298364130|gb|ADI79208.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Malus x domestica]
          Length = 774

 Score =  735 bits (1898), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 365/736 (49%), Positives = 488/736 (66%), Gaps = 29/736 (3%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +C  ++P   R +DL+ R+TL EK+  + + A  VPRLG+  YEWWSEALHGVS +G   
Sbjct: 46  FCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLGIQGYEWWSEALHGVSNVG--- 102

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
              PGT F + + GATSFP VI T ASFNESLW++IG+ VS EARAMYN G AGLTFWSP
Sbjct: 103 ---PGTKFGTFL-GATSFPQVITTAASFNESLWEEIGRVVSDEARAMYNGGAAGLTFWSP 158

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           N+N+ RDPRWGR  ETPGEDP +  +Y   YV+GLQ         D     LK++ACCKH
Sbjct: 159 NVNIFRDPRWGRGQETPGEDPILAAKYGARYVKGLQG--------DGAGNRLKVAACCKH 210

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           Y AYDLDNW G DRFHF++RV++QD+++T+ +PF  CV +G+V+SVMCSYN+VNG PTCA
Sbjct: 211 YTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFRACVVDGNVASVMCSYNQVNGKPTCA 270

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
           DP+LL  TIRG W  +GYIVSDCDS+    ++  +   T E+A A  +KAGLDLDCG + 
Sbjct: 271 DPELLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHY-TKTPEEAAAYAIKAGLDLDCGPFL 329

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQHIEL 370
              T  AV+ G++ E DI+ +L     V MRLG FDG P   +Y NLG  ++C P   EL
Sbjct: 330 GIHTEAAVRFGQVNEIDINYALANTITVQMRLGMFDGEPSAQRYGNLGLADVCKPSSNEL 389

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A EAARQGIVLL+N   +LPL+T   +T+A++GP+++ T+ MIGNY G  C YT+P+ G 
Sbjct: 390 ALEAARQGIVLLENRGNSLPLSTMRHRTVAVIGPNSDVTETMIGNYAGIACGYTTPLQGI 449

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
             Y++ I+ A GC D+ C  N +I AA  AA+ ADATV+V GLD S+EAE +DR DLLLP
Sbjct: 450 ARYTRTIHQA-GCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEAEFRDRTDLLLP 508

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
           G Q EL+++VA A++GP  LVIMS G +D+ FAKN+P+I +I+WVGYPG+ GG AIADV+
Sbjct: 509 GHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPRIGAIIWVGYPGQAGGTAIADVL 568

Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSY 607
           FG  NP G+LP+TWY  NYV  +P T M +R  P   +PGRTY+F+ GPVV+PFG GLSY
Sbjct: 569 FGTTNPSGKLPMTWYPQNYVANLPMTDMAMRADPARGYPGRTYRFYKGPVVFPFGLGLSY 628

Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
           T+F + +A  P  V +        +  N T+  N      + +    C        I+++
Sbjct: 629 TRFSHSLAQGPTLVSVPFTSLVASK--NTTMLGNHD----IRVSHTNCDSLSLDVHIDIK 682

Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           N G MDG+  ++V++ PP       KQ++G+ +V I AG   +V   +  CK L +VD  
Sbjct: 683 NSGTMDGTHTLLVFATPPTGKWAPNKQLVGFHKVHIVAGSERRVRVGVQVCKHLSVVDEL 742

Query: 728 ANSLLASGAHTILVGE 743
               +  G H + +G+
Sbjct: 743 GIRRIPLGQHKLEIGD 758


>gi|15239867|ref|NP_199747.1| beta-xylosidase 1 [Arabidopsis thaliana]
 gi|75262458|sp|Q9FGY1.1|BXL1_ARATH RecName: Full=Beta-D-xylosidase 1; Short=AtBXL1; AltName:
           Full=Alpha-L-arabinofuranosidase; Flags: Precursor
 gi|9759419|dbj|BAB09906.1| xylosidase [Arabidopsis thaliana]
 gi|21539545|gb|AAM53325.1| xylosidase [Arabidopsis thaliana]
 gi|332008419|gb|AED95802.1| beta-xylosidase 1 [Arabidopsis thaliana]
          Length = 774

 Score =  734 bits (1895), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 366/737 (49%), Positives = 493/737 (66%), Gaps = 29/737 (3%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +C A +P   R +DL+ R+TL EK++ + + A  VPRLG+  YEWWSEALHG+S +G   
Sbjct: 49  FCRANVPIHVRVQDLLGRLTLQEKIRNLVNNAAAVPRLGIGGYEWWSEALHGISDVG--- 105

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
              PG  F    PGATSFP VI T ASFN+SLW++IG+ VS EARAMYN G AGLT+WSP
Sbjct: 106 ---PGAKFGGAFPGATSFPQVITTAASFNQSLWEEIGRVVSDEARAMYNGGVAGLTYWSP 162

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           N+N++RDPRWGR  ETPGEDP V  +YA +YVRGLQ          +    LK++ACCKH
Sbjct: 163 NVNILRDPRWGRGQETPGEDPIVAAKYAASYVRGLQGT--------AAGNRLKVAACCKH 214

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           Y AYDLDNW G DRFHF+++VT+QD+++T+ +PF+ CV EG V+SVMCSYN+VNG PTCA
Sbjct: 215 YTAYDLDNWNGVDRFHFNAKVTQQDLEDTYNVPFKSCVYEGKVASVMCSYNQVNGKPTCA 274

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
           D  LL  TIRG W  +GYIVSDCDS+        +   T E+A AR +KAGLDLDCG + 
Sbjct: 275 DENLLKNTIRGQWRLNGYIVSDCDSVDVFFNQQHY-TSTPEEAAARSIKAGLDLDCGPFL 333

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAA 372
             FT GAV++G + E DI+ +L     V MRLG FDG+   Y NLG  ++C P H  LA 
Sbjct: 334 AIFTEGAVKKGLLTENDINLALANTLTVQMRLGMFDGNLGPYANLGPRDVCTPAHKHLAL 393

Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA 432
           EAA QGIVLLKN   +LPL+    +T+A++GP+++ T+ MIGNY G  C YTSP+ G   
Sbjct: 394 EAAHQGIVLLKNSARSLPLSPRRHRTVAVIGPNSDVTETMIGNYAGKACAYTSPLQGISR 453

Query: 433 YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGF 492
           Y++ ++ A GCA + C+ N    AA  AA+ ADATV+V GLD S+EAE +DR  LLLPG+
Sbjct: 454 YARTLHQA-GCAGVACKGNQGFGAAEAAAREADATVLVMGLDQSIEAETRDRTGLLLPGY 512

Query: 493 QTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFG 552
           Q +L+ +VA A++GPV LV+MS G +D+ FAKN+P++ +I+W GYPG+ GG AIA++IFG
Sbjct: 513 QQDLVTRVAQASRGPVILVLMSGGPIDVTFAKNDPRVAAIIWAGYPGQAGGAAIANIIFG 572

Query: 553 KYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFK 611
             NPGG+LP+TWY  +YV K+P T M +R   N+PGRTY+F+ GPVV+PFG+GLSYT F 
Sbjct: 573 AANPGGKLPMTWYPQDYVAKVPMTVMAMRASGNYPGRTYRFYKGPVVFPFGFGLSYTTFT 632

Query: 612 YKVASSP-KSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY-KFTFQIEVENM 669
           + +A SP   + + L       ++N           ++ +    C  + K    +EV N 
Sbjct: 633 HSLAKSPLAQLSVSLS------NLNSANTILNSSSHSIKVSHTNCNSFPKMPLHVEVSNT 686

Query: 670 GKMDGSEVVMVYSKPP--GIAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDN 726
           G+ DG+  V V+++PP  GI G  + KQ+I +E+V + AG    V   ++ACK L +VD 
Sbjct: 687 GEFDGTHTVFVFAEPPINGIKGLGVNKQLIAFEKVHVMAGAKQTVQVDVDACKHLGVVDE 746

Query: 727 AANSLLASGAHTILVGE 743
                +  G H + +G+
Sbjct: 747 YGKRRIPMGEHKLHIGD 763


>gi|183579871|dbj|BAG28345.1| arabinofuranosidase [Citrus unshiu]
          Length = 769

 Score =  732 bits (1890), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 358/717 (49%), Positives = 484/717 (67%), Gaps = 28/717 (3%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +C   +P   R +DL+ R+TL EK++ + + A  VPRLG+  YEWWSEALHGVS +G   
Sbjct: 42  FCRTSVPIHVRVQDLIGRLTLQEKIRLLVNNAAAVPRLGIQGYEWWSEALHGVSNVG--- 98

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
              PGT F    PGATSFP VI T A+FNESLW++IG+ VS EARAMYN G AGLT+WSP
Sbjct: 99  ---PGTKFGGAFPGATSFPQVITTAAAFNESLWEEIGRVVSDEARAMYNGGMAGLTYWSP 155

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           N+N+ RDPRWGR  ETPGEDP + G+YA +YVR LQ         ++ SR LK++ACCKH
Sbjct: 156 NVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRRLQG--------NTGSR-LKVAACCKH 206

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           Y AYDLDNW G DR+HF++RV++QD+++T+ +PF+ CV EG V+SVMCSYN+VNG PTCA
Sbjct: 207 YTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 266

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
           DP +L  TIRG W   GYIVSDCDS+  +  +  +   T E+A A  +KAGLDLDCG + 
Sbjct: 267 DPDILKNTIRGQWRLDGYIVSDCDSVGVLYNTQHY-TRTPEEAAADAIKAGLDLDCGPFL 325

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIEL 370
              T GAV+ G + E D++ +  +   V MRLG FDG P    + NLG  ++C P H +L
Sbjct: 326 AIHTEGAVRGGLLREEDVNLASAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 385

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A +AA QGIVLLKN    LPL+T    T+A++GP+++ T  MIGNY G  C YT+P+ G 
Sbjct: 386 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 445

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
             Y+K I+ A GC  + C  N +I AA  AA+ ADATV+V GLD S+EAE  DR  LLLP
Sbjct: 446 SRYAKTIHQA-GCLGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLP 504

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
           G Q EL+++VA A++GPV LV+M  G VD++FAKN+P+I +ILWVGYPG+ GG AIADV+
Sbjct: 505 GRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVL 564

Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
           FG+ NPGG+LP+TWY  +YV ++P T M +R    +PGRTY+F+ GPVV+PFG+G+SYT 
Sbjct: 565 FGRANPGGKLPMTWYPQDYVARLPMTDMRMRAGRGYPGRTYRFYKGPVVFPFGHGMSYTT 624

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD-YKFTFQIEVEN 668
           F + ++ +P    + +         N T+ +N     A+ +    C D       ++V+N
Sbjct: 625 FAHTLSKAPNQFSVPIATSLYAFK-NTTISSN-----AIRVAHTNCNDAMSLGLHVDVKN 678

Query: 669 MGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVD 725
            G M G+  ++V++KPP    +  KQ+IG+++V + AG    V   ++ CK L +VD
Sbjct: 679 TGDMAGTHTLLVFAKPPAGNWSPNKQLIGFKKVHVTAGALQSVRLDIHVCKHLSVVD 735


>gi|449436749|ref|XP_004136155.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
          Length = 772

 Score =  731 bits (1886), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 363/740 (49%), Positives = 487/740 (65%), Gaps = 29/740 (3%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS +P+C   LP PER KDL+ R+TL EKV+ + + A  VPRLG+  YEWWSEALHGVS 
Sbjct: 38  LSRYPFCRVALPIPERVKDLIGRLTLQEKVRLLVNNAAAVPRLGIKGYEWWSEALHGVSN 97

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G      PGT F  + PGATSFP VI T ASFN SLW+ IG+ VS EARAMYN G AGL
Sbjct: 98  VG------PGTEFGGDFPGATSFPQVITTVASFNVSLWEAIGRVVSDEARAMYNGGAAGL 151

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T+WSPN+N+ RDPRWGR  ETPGEDP V G YA  Y++GLQ          +D   LK++
Sbjct: 152 TYWSPNVNIFRDPRWGRGQETPGEDPVVAGEYAARYIKGLQG---------NDGDRLKVA 202

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKH+ AYDLDNW G DRFHF+++VT QDM +TF +PF  CV EG V+SVMCSYN+VNG
Sbjct: 203 ACCKHFTAYDLDNWNGTDRFHFNAKVTRQDMVDTFEVPFRKCVKEGKVASVMCSYNQVNG 262

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           +PTCADP LL  TIR  W  +GYIVSDCDS+    ++  +   T E+A A  +KAGLDLD
Sbjct: 263 VPTCADPNLLKGTIRNQWGLNGYIVSDCDSVGVFYDNQHY-TSTAEEAAADAIKAGLDLD 321

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG +    T  AV++G + +  I+ +L     V MRLG FDG+P    Y  LG  N+C+P
Sbjct: 322 CGPFLAVHTEDAVKKGLLTQTHINNALANTITVQMRLGMFDGAPSSHAYGKLGPKNVCSP 381

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            H +LA +AARQGIVLLKN    LPL+  + +T+A++GP+++    MIGNY G  C Y +
Sbjct: 382 SHQQLALDAARQGIVLLKNRLPGLPLSADHHRTVAVIGPNSDVNVTMIGNYAGVACGYVT 441

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P++G   Y+ V+ +  GC ++ C  +     A+ AA  ADATV+V GLD SVEAE KDR 
Sbjct: 442 PLEGIKRYTTVV-HRKGCDNVACATDYSFTDALAAASTADATVLVMGLDQSVEAETKDRD 500

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            LLLPG Q EL+ KVA A++GP  +++MS G +D++FA N+P+I +ILWVGYPG+ GG A
Sbjct: 501 GLLLPGRQQELVLKVAAASRGPTVVILMSGGPIDVSFADNDPRISAILWVGYPGQAGGAA 560

Query: 546 IADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYG 604
           IADV+FG  NPGG+LP+TWY  +Y+  +P T+M +R  +++PGRTY+F+ GPVVY FG+G
Sbjct: 561 IADVLFGTTNPGGKLPMTWYPQSYLSNLPMTNMAMRSTSSYPGRTYRFYAGPVVYEFGHG 620

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
           LSYT F + +  +P  V I L   +Q      T   +     A+ +   KC+       +
Sbjct: 621 LSYTNFIHTIVKAPTIVSISLSGHRQ------THSASTLSSKAIRVTHAKCQKLSLVIHV 674

Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHI--KQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
           +VEN G  DG   ++V+S PP    T +  KQ++ +E++ +A+ +  ++   ++ CK L 
Sbjct: 675 DVENKGDRDGFHTMLVFSTPPANGATWVPRKQLVAFEKLHLASREKRRLQVHVHVCKYLS 734

Query: 723 IVDNAANSLLASGAHTILVG 742
           +VD      +  G H I +G
Sbjct: 735 VVDKLGVRRIPLGDHYIHIG 754


>gi|408354266|gb|AFU54452.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
          Length = 775

 Score =  729 bits (1882), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 366/739 (49%), Positives = 490/739 (66%), Gaps = 34/739 (4%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +C   +P   R +DL+ R+TL EK++ + + A  VPRLG+  YEWWSEALHGVS +G   
Sbjct: 46  FCRVTVPIHVRVQDLIGRLTLQEKIRLLVNNAIAVPRLGIQGYEWWSEALHGVSNVG--- 102

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
              PGT F    PGATSFP VI T ASFNESLW++IG+ V  EARAMYN G AGLT+WSP
Sbjct: 103 ---PGTKFGGAFPGATSFPQVITTAASFNESLWQEIGRVVPDEARAMYNGGMAGLTYWSP 159

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           N+N+ RDPRWGR  ETPGEDP +  +YA  YV+GLQ         D     LK++ACCKH
Sbjct: 160 NVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQG--------DGAGNRLKVAACCKH 211

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           Y AYDLDNW G +RFHF++RV++QD+ +T+ +PF+ CV EG V+SVMCSYN+VNG PTCA
Sbjct: 212 YTAYDLDNWNGVNRFHFNARVSKQDLADTYNVPFKACVVEGHVASVMCSYNQVNGKPTCA 271

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
           DP LL  TIRG W  +GYIVSDCDS+  + E   +   T E+A A  +KAGLDLDCG + 
Sbjct: 272 DPDLLKGTIRGQWRLNGYIVSDCDSVGVLYEEQHY-TRTPEEAAADAIKAGLDLDCGPFL 330

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQHIEL 370
              T  AV++G +++ +I+ +L     V MRLG FDG P   QY NLG  ++C P H +L
Sbjct: 331 AIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQYGNLGPRDVCTPAHQQL 390

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A EAARQGIVLL+N   +LPL+    +T+A++GP+++ T  MIGNY G  C YT+P+ G 
Sbjct: 391 ALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 450

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
             Y++ I+ A GC D+ C  N +  AA  AA+ ADATV+V GLD S+EAE  DRV LLLP
Sbjct: 451 GRYTRTIHQA-GCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQSIEAEFVDRVGLLLP 509

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
           G Q EL+++VA A++GP  LV+MS G +D+ FAKN+P+I +I+WVGYPG+ GG AIADV+
Sbjct: 510 GHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWVGYPGQAGGTAIADVL 569

Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSY 607
           FG  NPGG+LP+TWY  NYV  +P T M +R  P   +PGRTY+F+ GPVV+PFG GLSY
Sbjct: 570 FGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYRFYRGPVVFPFGLGLSY 629

Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRD---INYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
           T F + +A  P SV + L   +   +   ++  V  +   C A+   DV          +
Sbjct: 630 TTFAHNLAHGPTSVSVPLTSLKATANSTMLSKAVRVSHADCNALSPLDV---------HV 680

Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIV 724
           +V+N G MDG+  ++V++ PP       KQ++G+ ++ IAAG   +V   ++ CK L +V
Sbjct: 681 DVKNTGSMDGTHTLLVFTSPPDGKWAASKQLVGFHKIHIAAGSETRVRIAVHVCKHLSVV 740

Query: 725 DNAANSLLASGAHTILVGE 743
           D      +  G H + +G+
Sbjct: 741 DRFGIRRIPLGEHKLQIGD 759


>gi|302786124|ref|XP_002974833.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
 gi|300157728|gb|EFJ24353.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
          Length = 784

 Score =  729 bits (1882), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 357/751 (47%), Positives = 487/751 (64%), Gaps = 24/751 (3%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           S    L  FP+CD KL    R +DLV R+TL EKV +M + A G+PRLG+P Y+WW EAL
Sbjct: 41  SSNASLGSFPFCDTKLGIDVRVQDLVSRLTLDEKVDEMVNAAQGIPRLGVPSYQWWQEAL 100

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HGV+       S PG  F    P ATSFP  I T ASFN +L+  IG+ VS+EARA++NL
Sbjct: 101 HGVA-------SSPGVQFGGLAPAATSFPMPIATAASFNSTLFYSIGEAVSSEARALHNL 153

Query: 124 GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
           G AGLTFWSPN+N+ RDPRWGR  ETPGEDP +  ++A  YVRGLQ   G  Y   +   
Sbjct: 154 GRAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLASKFASLYVRGLQ---GGAYEGSASDG 210

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LK+SACCKH  AYD+DNW+G DR+HF++ V+EQD+ +T+  PF+ C+ +G VSSVMCSY
Sbjct: 211 FLKVSACCKHLTAYDVDNWKGMDRYHFNAEVSEQDLVDTYNPPFQSCIEDGRVSSVMCSY 270

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NRVNG+PTCAD  LL +T+R  W F+GYIVSDCD++Q + E   +   + EDAVA  + A
Sbjct: 271 NRVNGVPTCADRNLLTETVRNSWGFNGYIVSDCDALQVLFEDTTYA-PSAEDAVADSILA 329

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKN 360
           GLDL+CG +       A+Q GKI EAD+D ++  L    MRLG FDG P    Y +LG  
Sbjct: 330 GLDLNCGTFLGKHAKSALQAGKITEADLDHAVSNLMRTRMRLGLFDGDPNSQPYSSLGAT 389

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
           +IC+  H +LA +AA QG+VLLKND G+LPL+T  +KT+AL+GP+ANAT  M+GNYEG P
Sbjct: 390 DICSNDHQQLALDAALQGVVLLKND-GSLPLSTA-LKTVALIGPNANATYTMLGNYEGIP 447

Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
           C+Y SP+ G   YS  I Y+PGC ++ C    ++ +A++ A  ADA V+V GLD S E E
Sbjct: 448 CKYISPLQGMQIYSSNILYSPGCRNVACNEGDLVASAVEVATKADAVVLVVGLDQSQERE 507

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
             DR  LLLPG Q++L++ +A+A   P+ LVIMSAG VDI+  K+N +I S++W+GYPG+
Sbjct: 508 TFDRTSLLLPGMQSQLVSNIANAVTSPIVLVIMSAGPVDISTFKDNSRISSVIWLGYPGQ 567

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVV 598
            GG A+A V+FG YNPGGRLP TWY   +  +    M +R  P++ +PGR+Y+F+ G  +
Sbjct: 568 SGGAALAHVVFGAYNPGGRLPNTWYHEEFTNVSMLDMQMRPNPLSGYPGRSYRFYTGTPL 627

Query: 599 YPFGYGLSYTQFKYKVASSPKSVDI---KLDKDQQCRDINYTVGTNKPPCAAVLIDDVK- 654
           Y FG GLSY+ + YK   +P  +          + C  +N +    K  C  +  DD++ 
Sbjct: 628 YNFGDGLSYSTYFYKFLLAPTKLSFFKSNTGNSRGCPAVNRSKA--KSGCFHLPADDLET 685

Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
           C    F   +EV N+G   GS  V+++S PP + G  +KQ+I +++V + +  + ++ F 
Sbjct: 686 CNSILFQVSVEVSNLGPRSGSHSVLIFSAPPPVEGAPLKQLIAFQKVHLESDTTQRLIFG 745

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVGEGV 745
           ++ CK L  V       L SG H +L+G  V
Sbjct: 746 IDPCKHLSSVRRNGKRFLHSGRHKLLIGNAV 776


>gi|157041199|dbj|BAF79669.1| beta-D-xylosidase [Pyrus pyrifolia]
          Length = 774

 Score =  729 bits (1881), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 366/736 (49%), Positives = 489/736 (66%), Gaps = 29/736 (3%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +C  ++P   R +DL+ R+TL EK+  + + A  VPRLG+  YEWWSEALHGVS +G   
Sbjct: 46  FCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLGIQGYEWWSEALHGVSNVG--- 102

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
              PGT F + + GATSFP VI T ASFNESLW++IG+ VS EARAMYN G AGLTFWSP
Sbjct: 103 ---PGTKFGTFL-GATSFPQVITTAASFNESLWEEIGRVVSDEARAMYNGGAAGLTFWSP 158

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           N+N+ RDPRWGR  ETPGEDP +  +Y   YV+GLQ         D     LK++ACCKH
Sbjct: 159 NVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQG--------DGAGNRLKVAACCKH 210

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           Y AYDLDNW G DRFHF++RV++QD+++T+ +PF+ CV +G+V+SVMCSYN+VNG PTCA
Sbjct: 211 YTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFKACVVDGNVASVMCSYNQVNGKPTCA 270

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
           DP LL  TIRG W  +GYIVSDCDS+    ++  +   T E A A  +KAGLDLDCG + 
Sbjct: 271 DPDLLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHY-TKTPEAAAAYAIKAGLDLDCGPFL 329

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQHIEL 370
              T  A++ G++ E DI+ +L     V MRLG FDG P   +Y NLG  ++C P   EL
Sbjct: 330 GIHTEAAIRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLGLADVCKPSSNEL 389

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A EAARQGIVLL+N   +LPL+T   +T+A++GP+++ T+ MIGNY G  C YT+P+ G 
Sbjct: 390 ALEAARQGIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAGIACGYTTPLQGI 449

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
             Y++ I+ A GC D+ C  N +I AA  AA+ ADATV+V GLD S+EAE +DR  LLLP
Sbjct: 450 ARYTRTIHQA-GCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEAEFRDRTGLLLP 508

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
           G Q EL+++VA A++GP  LVIMS G +D+ FAKN+P+I +I+WVGYPG+ GG AIADV+
Sbjct: 509 GHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPRIGAIIWVGYPGQAGGTAIADVL 568

Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSY 607
           FG  NP G+LP+TWY  NYV  +P T M +R  P   +PGRTY+F+ GPVV+PFG GLSY
Sbjct: 569 FGTTNPSGKLPMTWYPQNYVANLPMTDMAMRADPARGYPGRTYRFYKGPVVFPFGMGLSY 628

Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
           T+F + +A  P  V + L      +  N T+ +N      V +    C      F I+++
Sbjct: 629 TRFSHSLAQGPTLVSVPLTSLVAAK--NTTMLSNH----GVRVSHTNCDSLSLDFHIDIK 682

Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           N G MDG+  ++V++  P       KQ++G+ +V I AG   +V   ++ CK L IVD  
Sbjct: 683 NTGTMDGTHTLLVFATQPAGKWAPNKQLVGFHKVHIVAGSERRVRVGVHVCKHLSIVDKL 742

Query: 728 ANSLLASGAHTILVGE 743
               +  G H + +G+
Sbjct: 743 GIRRIPLGQHKLEIGD 758


>gi|408354264|gb|AFU54451.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
          Length = 775

 Score =  729 bits (1881), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 366/739 (49%), Positives = 490/739 (66%), Gaps = 34/739 (4%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +C   +P   R +DL+ R+TL EK++ + + A  VPRLG+  YEWWSEALHGVS +G   
Sbjct: 46  FCRVTVPIHVRVQDLIGRLTLQEKIRLLVNNAIAVPRLGIQGYEWWSEALHGVSNVG--- 102

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
              PGT F    PGATSFP VI T ASFNESLW++IG+ V  EARAMYN G AGLT+WSP
Sbjct: 103 ---PGTKFGGAFPGATSFPQVITTAASFNESLWQEIGRGVPDEARAMYNGGMAGLTYWSP 159

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           N+N+ RDPRWGR  ETPGEDP +  +YA  YV+GLQ         D     LK++ACCKH
Sbjct: 160 NVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQG--------DGAGNRLKVAACCKH 211

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           Y AYDLDNW G +RFHF++RV++QD+ +T+ +PF+ CV EG V+SVMCSYN+VNG PTCA
Sbjct: 212 YTAYDLDNWNGVNRFHFNARVSKQDLADTYNVPFKACVVEGHVASVMCSYNQVNGKPTCA 271

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
           DP LL  TIRG W  +GYIVSDCDS+  + E   +   T E+A A  +KAGLDLDCG + 
Sbjct: 272 DPDLLKGTIRGQWRLNGYIVSDCDSVGVLYEEQHY-TRTPEEAAADAIKAGLDLDCGPFL 330

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQHIEL 370
              T  AV++G +++ +I+ +L     V MRLG FDG P   QY NLG  ++C P H +L
Sbjct: 331 AIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQYGNLGPRDVCTPAHQQL 390

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A EAARQGIVLL+N   +LPL+    +T+A++GP+++ T  MIGNY G  C YT+P+ G 
Sbjct: 391 ALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 450

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
             Y++ I+ A GC D+ C  N +  AA  AA+ ADATV+V GLD S+EAE  DRV LLLP
Sbjct: 451 GRYTRTIHQA-GCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQSIEAEFVDRVGLLLP 509

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
           G Q EL+++VA A++GP  LV+MS G +D+ FAKN+P+I +I+WVGYPG+ GG AIADV+
Sbjct: 510 GHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWVGYPGQAGGTAIADVL 569

Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSY 607
           FG  NPGG+LP+TWY  NYV  +P T M +R  P   +PGRTY+F+ GPVV+PFG GLSY
Sbjct: 570 FGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYRFYRGPVVFPFGLGLSY 629

Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRD---INYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
           T F + +A  P SV + L   +   +   ++  V  +   C A+   DV          +
Sbjct: 630 TTFAHNLAHGPTSVSVPLTSLKATANSTMLSKAVRVSHADCNALSPLDV---------HV 680

Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIV 724
           +V+N G MDG+  ++V++ PP       KQ++G+ ++ IAAG   +V   ++ CK L +V
Sbjct: 681 DVKNTGSMDGTHTLLVFTSPPDGKWAASKQLVGFHKIHIAAGSETRVRIAVHVCKHLSVV 740

Query: 725 DNAANSLLASGAHTILVGE 743
           D      +  G H + +G+
Sbjct: 741 DRFGIRRIPLGEHKLQIGD 759


>gi|326494302|dbj|BAJ90420.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326521150|dbj|BAJ96778.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326527851|dbj|BAK08165.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 775

 Score =  728 bits (1878), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/743 (49%), Positives = 506/743 (68%), Gaps = 30/743 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+ + +C+ K     RA+DLV R+TL EKV  + +    + RLG+P YEWWSEALHGVS+
Sbjct: 46  LAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYEWWSEALHGVSY 105

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G      PGT F   VPGATSFP  ILT ASFN SL++ IG+ VSTEARAM+N+G AGL
Sbjct: 106 VG------PGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNVGLAGL 159

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFWSPNIN+ RDPRWGR  ETPGEDP +  +YA+ YV GLQD  G     D     LK++
Sbjct: 160 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDA-GAGGVTDG---ALKVA 215

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYD+DNW+G +R+ FD++V++QD+ +TF  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 216 ACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNG 275

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCAD  LL   IRGDW  +GYIVSDCDS+  ++ + +    T E+A A  +K+GLDL+
Sbjct: 276 KPTCADKDLLEGVIRGDWKLNGYIVSDCDSVD-VLYTQQHYTKTPEEAAAITIKSGLDLN 334

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG++    T+ AVQ G+++E D+D ++   +I+LMRLG+FDG P+   + +LG  ++C  
Sbjct: 335 CGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTS 394

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            + ELA E ARQGIVLLKN +GALPL+  +IK++A++GP+ANA+  MIGNYEGTPC+YT+
Sbjct: 395 SNRELARETARQGIVLLKN-SGALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTT 453

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           P+ G  A    + Y PGC ++ C  NS+ +  A+ AA +AD TV+V G D S+E E  DR
Sbjct: 454 PLQGLGAKVNTV-YQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDR 512

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
             LLLPG QT+L++ VA+A+ GPV LV+MS G  DI+FAK + KI +ILWVGYPGE GG 
Sbjct: 513 TSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAILWVGYPGEAGGA 572

Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPF 601
           A+AD++FG +NP GRLP+TWY A+Y   +  T M +RP     +PGRTY+F+ G  V+ F
Sbjct: 573 ALADILFGSHNPSGRLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRTYRFYTGDTVFAF 632

Query: 602 GYGLSYTQFKYKVASSPKS-VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
           G GLSYT+  + + S+P S V ++L +D  CR            CA+V      C D  F
Sbjct: 633 GDGLSYTKMSHSLVSAPPSYVSMRLAEDHPCR---------AEECASVEAAGDHCDDLAF 683

Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
             +++V N G++ G+  V+++S PP       K ++G+E+V +A G++  V F ++ C+ 
Sbjct: 684 DVKLQVRNAGEVAGAHSVLLFSSPPPAHNAPAKHLLGFEKVSLAPGEAGTVAFRVDVCRD 743

Query: 721 LKIVDNAANSLLASGAHTILVGE 743
           L +VD      +A G HT+ VG+
Sbjct: 744 LSVVDELGGRKVALGGHTLHVGD 766


>gi|449505346|ref|XP_004162442.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
           [Cucumis sativus]
          Length = 772

 Score =  728 bits (1878), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 362/740 (48%), Positives = 486/740 (65%), Gaps = 29/740 (3%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS +P+C   LP PER KDL+ R+TL EKV+ + + A  VPRLG+  YEWWSEALHGVS 
Sbjct: 38  LSRYPFCRVALPIPERVKDLIGRLTLQEKVRLLVNNAAAVPRLGIKGYEWWSEALHGVSN 97

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G      PGT F  + PGATSFP VI T ASFN SLW+ IG+ VS EARAMYN G AGL
Sbjct: 98  VG------PGTEFGGDFPGATSFPQVITTVASFNVSLWEAIGRVVSDEARAMYNGGAAGL 151

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T+WSPN+N+ RDPRWGR  ETPGEDP V G YA  Y++GLQ          +D   LK++
Sbjct: 152 TYWSPNVNIFRDPRWGRGQETPGEDPVVAGEYAARYIKGLQG---------NDGDRLKVA 202

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKH+ AYDLDNW G DRFHF+++VT QDM +TF +PF  CV EG V+SVMCSYN+VNG
Sbjct: 203 ACCKHFTAYDLDNWNGTDRFHFNAKVTRQDMVDTFEVPFRKCVKEGKVASVMCSYNQVNG 262

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           +PTCADP LL  TIR  W  +GYIVSDCDS+    ++  +   T E+A A  +KAGLDLD
Sbjct: 263 VPTCADPNLLKGTIRNQWGLNGYIVSDCDSVGVFYDNQHY-TSTAEEAAADAIKAGLDLD 321

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG +    T  AV++  + +  I+ +L     V MRLG FDG+P    Y  LG  N+C+P
Sbjct: 322 CGPFLAVHTEDAVKKXLLTQTHINNALANTITVQMRLGMFDGAPSSHAYGKLGPKNVCSP 381

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            H +LA +AARQGIVLLKN    LPL+  + +T+A++GP+++    MIGNY G  C Y +
Sbjct: 382 SHQQLALDAARQGIVLLKNRLPGLPLSAXHHRTVAVIGPNSDVNVTMIGNYAGVACGYVT 441

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P++G   Y+ V+ +  GC ++ C  +     A+ AA  ADATV+V GLD SVEAE KDR 
Sbjct: 442 PLEGIKRYTTVV-HRKGCDNVACATDYSFTDALAAASTADATVLVMGLDQSVEAETKDRD 500

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            LLLPG Q EL+ KVA A++GP  +++MS G +D++FA N+P+I +ILWVGYPG+ GG A
Sbjct: 501 GLLLPGRQQELVLKVAAASRGPTVVILMSGGPIDVSFADNDPRISAILWVGYPGQAGGAA 560

Query: 546 IADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYG 604
           IADV+FG  NPGG+LP+TWY  +Y+  +P T+M +R  +++PGRTY+F+ GPVVY FG+G
Sbjct: 561 IADVLFGTTNPGGKLPMTWYPQSYLSNLPMTNMAMRSTSSYPGRTYRFYAGPVVYEFGHG 620

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
           LSYT F + +  +P  V I L   +Q      T   +     A+ +   KC+       +
Sbjct: 621 LSYTNFIHTIVKAPTIVSISLSGHRQ------THSASTLSSKAIRVTHAKCQKLSLVIHV 674

Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHI--KQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
           +VEN G  DG   ++V+S PP    T +  KQ++ +E++ +A+ +  ++   ++ CK L 
Sbjct: 675 DVENKGDRDGFHTMLVFSTPPANGATWVPRKQLVAFEKLHLASREKRRLQVHVHVCKYLS 734

Query: 723 IVDNAANSLLASGAHTILVG 742
           +VD      +  G H I +G
Sbjct: 735 VVDKLGVRRIPLGDHYIHIG 754


>gi|32481073|gb|AAP83934.1| auxin-induced beta-glucosidase [Chenopodium rubrum]
          Length = 767

 Score =  727 bits (1877), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 370/736 (50%), Positives = 495/736 (67%), Gaps = 28/736 (3%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +C   LP   R +DL+ R+ L EKV+ + + A  VPRLG+  YEWWSEALHGVS +G   
Sbjct: 40  FCRVNLPIRARVQDLIGRLNLQEKVKLLVNNAAPVPRLGISGYEWWSEALHGVSNVG--- 96

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
              PGT F    P ATSFP VI T ASFN SLW+ IGQ VS EARAMYN G AGLT+WSP
Sbjct: 97  ---PGTKFRGAFPAATSFPQVITTAASFNASLWEAIGQVVSDEARAMYNGGTAGLTYWSP 153

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           N+N+ RDPRWGR  ETPGEDP +  +YA +YVRGLQ +    Y+++     LK++ACCKH
Sbjct: 154 NVNIFRDPRWGRGQETPGEDPTLASQYAASYVRGLQGI----YNKNR----LKVAACCKH 205

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           Y AYDLDNW   DRFHF+++V++QD+++T+ +PF+ CV EG V+SVMCSYN+VNG PTCA
Sbjct: 206 YTAYDLDNWNAVDRFHFNAKVSKQDLEDTYNVPFKGCVQEGRVASVMCSYNQVNGKPTCA 265

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
           DP LL  TIRG W  +GYIVSDCDS+  + +   +   T E+A A  +KAGLDLDCG + 
Sbjct: 266 DPDLLRNTIRGQWRLNGYIVSDCDSVGVLYDDQHY-TRTPEEAAADTIKAGLDLDCGPFL 324

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG---SPQYKNLGKNNICNPQHIEL 370
              T  AV++G + EAD++ +L   + V MRLG FDG   +  + +LG  ++C+P H +L
Sbjct: 325 AVHTEAAVKRGLLTEADVNQALTNTFTVQMRLGMFDGEAAAQPFGHLGPKDVCSPAHQDL 384

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A +AARQGIVLL+N   +LPL+T   + +A++GP+A+AT  MIGNY G  C YTSP+ G 
Sbjct: 385 ALQAARQGIVLLQNRGRSLPLSTARHRNIAVIGPNADATVTMIGNYAGVACGYTSPLQGI 444

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
             Y+K ++ A GC  + C +N    AA  AA +ADATV+V GLD S+EAE +DR  +LLP
Sbjct: 445 ARYAKTVHQA-GCIGVACTSNQQFGAATAAAAHADATVLVMGLDQSIEAEFRDRASVLLP 503

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
           G Q EL++KVA A++GP  LV+M  G VD+ FAKN+PKI +ILWVGYPG+ GG AIADV+
Sbjct: 504 GHQQELVSKVALASRGPTILVLMCGGPVDVTFAKNDPKISAILWVGYPGQAGGTAIADVL 563

Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSY 607
           FG  NPGG+LP TWY  +YV K+P T + +R  P N +PGRTY+F+ GPVV+PFG+GLSY
Sbjct: 564 FGTTNPGGKLPNTWYPQSYVAKVPMTDLAMRANPSNGYPGRTYRFYKGPVVFPFGFGLSY 623

Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
           T+F   +A +P  V + L    Q  + N T   NK    A+ +    C +   +  I+V+
Sbjct: 624 TRFTQSLAHAPTKVMVPL--ANQFTNSNIT-SFNKD---ALKVLHTNCDNIPLSLHIDVK 677

Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           N GK+DGS  ++V+S PP    +  KQ+IG++RV + AG   +V   ++ C  L   D  
Sbjct: 678 NKGKVDGSHTILVFSTPPKGTKSSEKQLIGFKRVHVFAGSKQRVRMNIHVCNHLSRADEF 737

Query: 728 ANSLLASGAHTILVGE 743
               +  G HT+ +G+
Sbjct: 738 GVRRIPIGEHTLHIGD 753


>gi|297795695|ref|XP_002865732.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
 gi|297311567|gb|EFH41991.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
          Length = 774

 Score =  726 bits (1875), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 363/737 (49%), Positives = 491/737 (66%), Gaps = 29/737 (3%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +C   +P   R +DL+ R+TL EK++ + + A  VPRLG+  YEWWSEALHGVS +G   
Sbjct: 49  FCRVNVPIHVRVQDLIGRLTLQEKIRNLVNNAAAVPRLGIGGYEWWSEALHGVSDVG--- 105

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
              PG+ F    PGATSFP VI T ASFN+SLW++IG+ VS EARAMYN G AGLT+WSP
Sbjct: 106 ---PGSKFGGAFPGATSFPQVITTAASFNQSLWEEIGRVVSDEARAMYNGGVAGLTYWSP 162

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           N+N++RDPRWGR  ETPGEDP V  +YA +YVRGLQ          +    LK++ACCKH
Sbjct: 163 NVNILRDPRWGRGQETPGEDPIVAAKYAASYVRGLQGT--------AAGNRLKVAACCKH 214

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           Y AYDLDNW G DRFHF+++VT+QD+++T+ +PF+ CV EG V+SVMCSYN+VNG PTCA
Sbjct: 215 YTAYDLDNWNGVDRFHFNAKVTQQDLEDTYNVPFKSCVYEGKVASVMCSYNQVNGKPTCA 274

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
           D  LL  TIRG W  +GYIVSDCDS+        +   T E+A A  +KAGLDLDCG + 
Sbjct: 275 DENLLKNTIRGKWRLNGYIVSDCDSVDVFFNQQHY-TSTPEEAAAASIKAGLDLDCGPFL 333

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAA 372
             FT GAV++G + E DI+ +L     V MRLG FDG+   Y NLG  ++C+  H  LA 
Sbjct: 334 AIFTEGAVKKGLLTENDINLALANTLTVQMRLGMFDGNLGPYANLGPRDVCSLAHKHLAL 393

Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA 432
           EAA QGIVLLKN   +LPL+    +T+A++GP+++ T+ MIGNY G  C YT+P+ G   
Sbjct: 394 EAAHQGIVLLKNSGRSLPLSPRRHRTVAVIGPNSDVTETMIGNYAGKACAYTTPLQGISR 453

Query: 433 YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGF 492
           Y++ ++ A GCA + C+ N    AA  AA+ ADATV+V GLD S+EAE +DR  LLLPG+
Sbjct: 454 YARTLHQA-GCAGVACKGNQGFGAAEAAAREADATVLVMGLDQSIEAETRDRTGLLLPGY 512

Query: 493 QTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFG 552
           Q +L+ +VA A++GPV LV+MS G +D+ FAKN+P++ +I+W GYPG+ GG AIA++IFG
Sbjct: 513 QQDLVTRVAQASRGPVILVLMSGGPIDVTFAKNDPRVAAIIWAGYPGQAGGAAIANIIFG 572

Query: 553 KYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFK 611
             NPGG+LP+TWY  +YV K+P T M +R   N+PGRTY+F+ GPVV+PFG+GLSYT F 
Sbjct: 573 AANPGGKLPMTWYPQDYVAKVPMTVMAMRASGNYPGRTYRFYKGPVVFPFGFGLSYTTFT 632

Query: 612 YKVASSP-KSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY-KFTFQIEVENM 669
             +A SP   + + L       ++N           ++ +    C  + K    +EV N 
Sbjct: 633 NSLAKSPLAQLSVSLS------NLNSANAILNSTSHSIKVSHTNCNSFPKMPLHVEVSNT 686

Query: 670 GKMDGSEVVMVYSKPP--GIAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDN 726
           G+ DG+  V V+++PP  GI G  + KQ+I +E+V + AG    V   ++ACK L +VD 
Sbjct: 687 GEFDGTHTVFVFAEPPKNGIKGLGVNKQLIAFEKVHVMAGAKQTVRVDVDACKHLGVVDE 746

Query: 727 AANSLLASGAHTILVGE 743
                +  G H + +G+
Sbjct: 747 YGKRRIPMGKHKLHIGD 763


>gi|65736613|dbj|BAD98523.1| alpha-L-arabinofuranosidase / beta-D-xylosidase [Pyrus pyrifolia]
          Length = 774

 Score =  726 bits (1875), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 366/736 (49%), Positives = 488/736 (66%), Gaps = 29/736 (3%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +C  ++P   R +DL+ R+TL EK+  + + A  VPRLG+  YEWWSEALHGVS +G   
Sbjct: 46  FCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLGIQGYEWWSEALHGVSNVG--- 102

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
              PGT F + + GATSFP VI T ASFNESLW++IG+ VS EARAMYN G AGLTFWSP
Sbjct: 103 ---PGTKFGTFL-GATSFPQVITTAASFNESLWEEIGRVVSDEARAMYNGGAAGLTFWSP 158

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           N+N+ RDPRWGR  ETPGEDP +  +Y   YV+GLQ         D     LK++ACCKH
Sbjct: 159 NVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQG--------DGAGNRLKVAACCKH 210

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           Y AYDLDNW G DRFHF++RV++QD+++T+ +PF+ CV +G+V+SVMCSYN+VNG PTCA
Sbjct: 211 YTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFKACVVDGNVASVMCSYNQVNGKPTCA 270

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
           DP LL  TIRG W  +GYIVSDCDS+    ++  +   T E A A  +KAGLDLDCG + 
Sbjct: 271 DPDLLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHY-TKTPEAAAAYAIKAGLDLDCGPFL 329

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQHIEL 370
              T  A++ G++ E DI+ +L     V MRLG FDG P   +Y NLG  ++C P   EL
Sbjct: 330 GIHTEAAIRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLGLADVCKPSSNEL 389

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A EAARQGIVLL+N   +LPL+T   +T+A++GP+++ T+ MIGNY G  C YT+P+ G 
Sbjct: 390 ALEAARQGIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAGIACGYTTPLQGI 449

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
             Y++ I+ A GC D+ C  N +I AA  AA+ ADATV+V GLD S+EAE +DR  LLLP
Sbjct: 450 ARYTRTIHQA-GCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEAEFRDRTGLLLP 508

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
           G Q EL+++VA A++GP  LVIMS G +D+ FAKN+P I +I+WVGYPG+ GG AIADV+
Sbjct: 509 GHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPCIGAIIWVGYPGQAGGTAIADVL 568

Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSY 607
           FG  NP G+LP+TWY  NYV  +P T M +R  P   +PGRTY+F+ GPVV+PFG GLSY
Sbjct: 569 FGTTNPSGKLPMTWYPQNYVANLPMTDMAMRADPARGYPGRTYRFYKGPVVFPFGMGLSY 628

Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
           T+F + +A  P  V + L      +  N T+ +N      V +    C      F I+++
Sbjct: 629 TRFSHSLAQGPTLVSVPLTSLVAAK--NTTMLSNH----GVRVSHTNCDSLSLDFHIDIK 682

Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           N G MDG+  ++V++  P       KQ++G+ +V I AG   +V   ++ CK L IVD  
Sbjct: 683 NTGTMDGTHTLLVFATQPAGKWAPNKQLVGFHKVHIVAGSERRVRVGVHVCKHLSIVDKL 742

Query: 728 ANSLLASGAHTILVGE 743
               +  G H + +G+
Sbjct: 743 GIRRIPLGQHKLEIGD 758


>gi|326492918|dbj|BAJ90315.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 775

 Score =  726 bits (1875), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 366/743 (49%), Positives = 506/743 (68%), Gaps = 30/743 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+ + +C+ K     RA+DLV R+TL EKV  + +    + RLG+P YEWWSEALHGVS+
Sbjct: 46  LAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYEWWSEALHGVSY 105

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G      PGT F   VPGATSFP  ILT ASFN SL++ IG+ VSTEARAM+N+G AGL
Sbjct: 106 VG------PGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNVGLAGL 159

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFWSPNIN+ RDPRWGR  ETPGEDP +  +YA+ YV GLQD  G     D     LK++
Sbjct: 160 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDA-GAGGVTDG---ALKVA 215

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYD+DNW+G +R+ FD++V++QD+ +TF  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 216 ACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNG 275

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCAD  LL   IRGDW  +GYIVSDCDS+  ++ + +    T E+A A  +K+GLDL+
Sbjct: 276 KPTCADKDLLEGVIRGDWKLNGYIVSDCDSVD-VLYTQQHYTKTPEEAAAITIKSGLDLN 334

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG++    T+ AVQ G+++E D+D ++   +I+LMRLG+FDG P+   + +LG  ++C  
Sbjct: 335 CGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTS 394

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            + ELA E ARQGIVLLKN +GALPL+  +IK++A++GP+ANA+  MIGNYEGTPC+YT+
Sbjct: 395 SNRELARETARQGIVLLKN-SGALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTT 453

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           P+ G  A    + Y PGC ++ C  NS+ +  A+ AA +AD TV+V G D S+E E  DR
Sbjct: 454 PLQGLGAKVNTV-YQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDR 512

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
             LLLPG QT+L++ VA+A+ GPV LV+MS G  DI+FAK + KI +ILWVGYPGE GG 
Sbjct: 513 TSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAILWVGYPGEAGGA 572

Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPF 601
           A+AD++FG +NP G+LP+TWY A+Y   +  T M +RP     +PGRTY+F+ G  V+ F
Sbjct: 573 ALADILFGSHNPSGKLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRTYRFYTGDTVFAF 632

Query: 602 GYGLSYTQFKYKVASSPKS-VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
           G GLSYT+  + + S+P S V ++L +D  CR            CA+V      C D  F
Sbjct: 633 GDGLSYTKMSHSLVSAPPSYVSMRLAEDHPCR---------AEECASVEAAGDHCDDLAF 683

Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
             +++V N G++ G+  V+++S PP       K ++G+E+V +A G++  V F ++ C+ 
Sbjct: 684 DVKLQVRNAGEVAGAHSVLLFSSPPPAHNAPAKHLLGFEKVSLAPGEAGTVAFRVDVCRD 743

Query: 721 LKIVDNAANSLLASGAHTILVGE 743
           L +VD      +A G HT+ VG+
Sbjct: 744 LSVVDELGGRKVALGGHTLHVGD 766


>gi|15242492|ref|NP_196535.1| beta-xylosidase 3 [Arabidopsis thaliana]
 gi|75264323|sp|Q9LXD6.1|BXL3_ARATH RecName: Full=Beta-D-xylosidase 3; Short=AtBXL3; AltName:
           Full=Alpha-L-arabinofuranosidase; Flags: Precursor
 gi|7671416|emb|CAB89357.1| beta-xylosidase-like protein [Arabidopsis thaliana]
 gi|9759004|dbj|BAB09531.1| beta-xylosidase [Arabidopsis thaliana]
 gi|15450735|gb|AAK96639.1| AT5g09730/F17I14_80 [Arabidopsis thaliana]
 gi|332004056|gb|AED91439.1| beta-xylosidase 3 [Arabidopsis thaliana]
          Length = 773

 Score =  726 bits (1874), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/742 (49%), Positives = 491/742 (66%), Gaps = 28/742 (3%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+   +C+A L    R  DLV R+TL EK+  +   A GV RLG+P Y+WWSEALHGVS 
Sbjct: 44  LAGLRFCNAGLSIKARVTDLVGRLTLEEKIGFLTSKAIGVSRLGIPSYKWWSEALHGVSN 103

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G       G+ F  +VPGATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G+AGL
Sbjct: 104 VG------GGSRFTGQVPGATSFPQVILTAASFNVSLFQAIGKVVSTEARAMYNVGSAGL 157

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFWSPN+N+ RDPRWGR  ETPGEDP +  +YA+ YV+GLQ+ +G + +R      LK++
Sbjct: 158 TFWSPNVNIFRDPRWGRGQETPGEDPTLSSKYAVAYVKGLQETDGGDPNR------LKVA 211

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYD+DNW   +R  F++ V +QD+ +TF  PF+ CV +G V+SVMCSYN+VNG
Sbjct: 212 ACCKHYTAYDIDNWRNVNRLTFNAVVNQQDLADTFQPPFKSCVVDGHVASVMCSYNQVNG 271

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCADP LL+  IRG W  +GYIVSDCDS+  +     +   T E+AVA+ L AGLDL+
Sbjct: 272 KPTCADPDLLSGVIRGQWQLNGYIVSDCDSVDVLFRKQHYAK-TPEEAVAKSLLAGLDLN 330

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           C  +     MGAV+ G + E  ID ++   +  LMRLG+FDG P+   Y  LG  ++C  
Sbjct: 331 CDHFNGQHAMGAVKAGLVNETAIDKAISNNFATLMRLGFFDGDPKKQLYGGLGPKDVCTA 390

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            + ELA + ARQGIVLLKN  G+LPL+   IKTLA++GP+ANAT+ MIGNY G PC+YT+
Sbjct: 391 DNQELARDGARQGIVLLKNSAGSLPLSPSAIKTLAVIGPNANATETMIGNYHGVPCKYTT 450

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+ G  A +    Y  GC ++ C + + I +A+D A +ADA V+V G D S+E EG DRV
Sbjct: 451 PLQGL-AETVSSTYQLGC-NVACVD-ADIGSAVDLAASADAVVLVVGADQSIEREGHDRV 507

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           DL LPG Q EL+ +VA AA+GPV LVIMS G  DI FAKN+ KI SI+WVGYPGE GG A
Sbjct: 508 DLYLPGKQQELVTRVAMAARGPVVLVIMSGGGFDITFAKNDKKITSIMWVGYPGEAGGLA 567

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
           IADVIFG++NP G LP+TWY  +YV K+P ++M +RP     +PGR+Y+F+ G  VY F 
Sbjct: 568 IADVIFGRHNPSGNLPMTWYPQSYVEKVPMSNMNMRPDKSKGYPGRSYRFYTGETVYAFA 627

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDIN-YTVGTNKPPCAAVLIDDVKCKDYKFT 661
             L+YT+F +++  +P+ V + LD++  CR     ++    P C     ++       F 
Sbjct: 628 DALTYTKFDHQLIKAPRLVSLSLDENHPCRSSECQSLDAIGPHC-----ENAVEGGSDFE 682

Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
             + V+N G   GS  V +++  P + G+ IKQ++G+E++ +   + A V F +N CK L
Sbjct: 683 VHLNVKNTGDRAGSHTVFLFTTSPQVHGSPIKQLLGFEKIRLGKSEEAVVRFNVNVCKDL 742

Query: 722 KIVDNAANSLLASGAHTILVGE 743
            +VD      +A G H + VG 
Sbjct: 743 SVVDETGKRKIALGHHLLHVGS 764


>gi|302760655|ref|XP_002963750.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
 gi|300169018|gb|EFJ35621.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
          Length = 785

 Score =  723 bits (1866), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 355/750 (47%), Positives = 486/750 (64%), Gaps = 22/750 (2%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           S    L  FP+CD KL    R +DLV R+TL EKV +M + A G+PRLG+P Y+WW EAL
Sbjct: 42  SSNASLGSFPFCDTKLGVDVRVQDLVSRLTLDEKVDEMVNAAQGIPRLGVPSYQWWQEAL 101

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HGV+       S PG  F    P ATSFP  I   ASFN +L+  IG+ VS+EARA++NL
Sbjct: 102 HGVA-------SSPGVQFGGLAPAATSFPMPIAMAASFNSTLFYSIGEAVSSEARALHNL 154

Query: 124 GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
           G AGLTFWSPN+N+ RDPRWGR  ETPGEDP +  ++A  YVRGLQ   G  Y   +   
Sbjct: 155 GRAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLASKFASLYVRGLQ---GGAYGGSASDG 211

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LK+SACCKH  AYD+DNW+G DR+HF++ V+EQD+ +T+  PF+ C+ +G VSSVMCSY
Sbjct: 212 FLKVSACCKHLTAYDMDNWKGMDRYHFNAEVSEQDLVDTYNPPFQSCIEDGRVSSVMCSY 271

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NRVNG+PTCAD  LL +T+R  W F+GYIVSDCD++Q + E   +   + EDAVA  + A
Sbjct: 272 NRVNGVPTCADRSLLTETVRNSWGFNGYIVSDCDALQVLFEDTTYA-PSAEDAVADSILA 330

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG---SPQYKNLGKN 360
           GLDL+CG +       A+Q GK+ EAD+D ++  L    MRLG FDG   +  Y +LG  
Sbjct: 331 GLDLNCGTFLGKHAKSALQAGKVTEADLDHAISNLMRTRMRLGLFDGDLNTRPYSSLGAT 390

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
           +IC+  H +LA +AA QG+VLLKND G+LPL+T  +KT+AL+GP+ANAT  M+GNYEG P
Sbjct: 391 DICSNDHQQLALDAALQGVVLLKND-GSLPLSTA-LKTVALIGPNANATYTMLGNYEGIP 448

Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
           C+Y SP+ G   Y+  I Y+PGC D+ C    ++ +A++ A  ADA V+V GLD S E E
Sbjct: 449 CKYVSPLQGMQIYNNNILYSPGCRDVACSEGDLVASAVEVATKADAVVLVVGLDQSQERE 508

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
             DR  LLLPG Q++L++ +A+A   P+ LVIMSAG VDI+  K+N +I S++W+GYPG+
Sbjct: 509 TFDRTSLLLPGMQSQLVSNIANAVTCPIVLVIMSAGPVDISTFKDNSRISSVIWIGYPGQ 568

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVV 598
            GG A+A V+FG YNPGGRLP TWY   +  +    M +R  P + +PGR+Y+F+ G  +
Sbjct: 569 SGGAALAHVVFGAYNPGGRLPNTWYHEEFTNVSMLDMRMRPNPPSGYPGRSYRFYTGTPL 628

Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPP--CAAVLIDDVK-C 655
           Y FG GLSY+ + YK   +P  +       +  RD   TV  ++    C  +  DD++ C
Sbjct: 629 YNFGDGLSYSTYLYKFLLAPTRLSFFKSNTRNSRDCP-TVNRSEAEFGCFHLPADDLETC 687

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
               F   +EV N+G   GS  V+++S PP + G  +KQ+I +++V + +  + ++ F +
Sbjct: 688 NSILFQVSVEVSNLGPRSGSHSVLIFSAPPPVEGAPLKQLIAFQKVHLESDTTQRLIFGI 747

Query: 716 NACKSLKIVDNAANSLLASGAHTILVGEGV 745
           + CK L  V       L SG H +L+G  V
Sbjct: 748 DPCKHLSSVRRNGKRFLHSGRHKLLIGNAV 777


>gi|371917280|dbj|BAL44716.1| SlArf/Xyl1 [Solanum lycopersicum]
          Length = 771

 Score =  722 bits (1864), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/739 (49%), Positives = 486/739 (65%), Gaps = 22/739 (2%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           + +  +C   LP   R +DL+ R+TL EK++ + + A  V RLG+  YEWWSEALHGVS 
Sbjct: 34  IRNLRFCKTSLPIHVRVQDLIARLTLQEKIRLLVNNAAPVQRLGISGYEWWSEALHGVS- 92

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
                N+  G  F    PGATSFP VI T ASFN SLW++IG+ VS E RAMYN G AGL
Sbjct: 93  -----NTGYGVKFGGAFPGATSFPQVITTAASFNASLWEEIGRVVSEEGRAMYNGGAAGL 147

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFWSPN+N+ RDPRWGR  ETPGEDP++V +Y ++YV+GLQ   G    R      LK++
Sbjct: 148 TFWSPNVNIFRDPRWGRGQETPGEDPHLVAQYGVSYVKGLQGGGGRGNTR------LKVA 201

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYDLD+W G DR+HF+++V+ QD+++T+  PF+ CV EG+V+SVMCSYN++NG
Sbjct: 202 ACCKHYTAYDLDDWNGYDRYHFNAKVSMQDLEDTYNAPFKACVVEGNVASVMCSYNQING 261

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            P+CADP LL  TIR  W+ +GYIVSDCDS+  + E   +     EDA A  +KAGLDLD
Sbjct: 262 KPSCADPTLLRDTIRNQWHLNGYIVSDCDSVGVLFEKQHYTR-YPEDAAAITIKAGLDLD 320

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNICNPQH 367
           CG +    T  AV  GK+++ +I+ +L     V MRLG FDG +  Y NLG  ++C+P H
Sbjct: 321 CGPFLAIHTDKAVHTGKVSQVEINNALANTITVQMRLGMFDGPNGPYANLGPKDVCSPAH 380

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
            +LA +AAR+GIVLLKN   ALPL+T   +T+A++GP+++AT AMIGNY G PC Y SP+
Sbjct: 381 QQLALQAAREGIVLLKNIGQALPLSTKRHRTVAVIGPNSDATLAMIGNYAGVPCGYISPL 440

Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
            G   Y++ I +  GC  + C  N     A  AA++ADATV+V GLD S+EAE KDRV L
Sbjct: 441 QGISRYARTI-HQQGCMGVACPGNQNFGLAEVAARHADATVLVMGLDQSIEAEAKDRVTL 499

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
           LLPG Q +LI++VA A+KGPV LV+MS G +D+ FAKN+P++ SI+WVGYPG+ GG AIA
Sbjct: 500 LLPGHQQDLISRVAMASKGPVVLVLMSGGPIDVTFAKNDPRVSSIVWVGYPGQAGGAAIA 559

Query: 548 DVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
           DV+FG  NPGG+LP+TWY  +YV K+   +M +R  P   +PGRTY+F+ GP V+PFG G
Sbjct: 560 DVLFGATNPGGKLPMTWYPQDYVAKVSMANMDMRANPSKGYPGRTYRFYKGPTVFPFGAG 619

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
           +SYT F   + S+P +V +           N T  T     A V      C+       I
Sbjct: 620 ISYTTFSQHLVSAPITVSVPTLHSHDLVSNNTT--TLMKAKATVRTIHTNCESLDIDMHI 677

Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIV 724
           +V+N G MDG+  V+++S PP    T  KQ++ +E+V + AG   +V   MNACK L + 
Sbjct: 678 DVKNTGDMDGTHAVLIFSTPPD--PTETKQLVAFEKVHVVAGAKQRVKINMNACKHLSVA 735

Query: 725 DNAANSLLASGAHTILVGE 743
           D      +  G H I VG+
Sbjct: 736 DEYGVRRIYMGEHKIHVGD 754


>gi|224099193|ref|XP_002311398.1| predicted protein [Populus trichocarpa]
 gi|222851218|gb|EEE88765.1| predicted protein [Populus trichocarpa]
          Length = 755

 Score =  718 bits (1853), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 369/749 (49%), Positives = 494/749 (65%), Gaps = 33/749 (4%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +C   +P   R +DL+ R+TL EK++ + + A  VPRLG+  YEWWSEALHGVS +G   
Sbjct: 34  FCRVNMPLHVRVRDLIGRLTLQEKIRLLVNNAAAVPRLGIQGYEWWSEALHGVSNVG--- 90

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
              PGT F    PGATSFP VI T ASFN+SLW++IG+ VS EARAM+N G AGLT+WSP
Sbjct: 91  ---PGTKFGGAFPGATSFPQVITTAASFNKSLWEEIGRVVSDEARAMFNGGMAGLTYWSP 147

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           N+NV RDPRWGR  ETPGEDP V G+YA +YVRGLQ   G           LK++ACCKH
Sbjct: 148 NVNVFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNSGFR---------LKVAACCKH 198

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           Y AYDLDNW G DR+HF++RV++QD+++T+ +PF+ CV EG V+SVMCSYN+VNG PTCA
Sbjct: 199 YTAYDLDNWNGVDRYHFNARVSKQDLEDTYDVPFKSCVVEGKVASVMCSYNQVNGKPTCA 258

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
           DP LL  TIRG+W  +GYIVSDCDS+  + E+  +    +E A A + KAGLDLDCG + 
Sbjct: 259 DPNLLKNTIRGEWRLNGYIVSDCDSVGVLYENQHYTATPEEAAAATI-KAGLDLDCGPFL 317

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIEL 370
              T  AV+ G + E D++ +L     V MRLG FDG P    +  LG  ++C P H +L
Sbjct: 318 AIHTENAVKGGLLNEEDVNMALANTITVQMRLGLFDGEPSAQPFGKLGPRDVCTPAHQQL 377

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A  AA+QGIVLL+N    LPL+  N+ T+A++GP A+ T  MIGNY G  C YT+P+ G 
Sbjct: 378 ALHAAQQGIVLLQNSGRTLPLSRPNL-TVAVIGPIADVTVTMIGNYAGVACGYTTPLQGI 436

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
             Y+K I+ + GC D+ C  N     A  AA  ADATV+V GLD S+EAE +DR DLLLP
Sbjct: 437 SRYAKTIHQS-GCIDVACNGNQQFGMAEAAASQADATVLVMGLDQSIEAEFRDRKDLLLP 495

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
           G+Q ELI++VA A++GP  LV+MS G +D++FAKN+P+I +ILW GYPG+ GG AIADV+
Sbjct: 496 GYQQELISRVARASRGPTILVLMSGGPIDVSFAKNDPRIGAILWAGYPGQAGGAAIADVL 555

Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSY 607
           FG  NPGG+LP+TWY  +Y+ K+P T+M +R  P   +PGRTY+F+ GPVV+PFG+G+SY
Sbjct: 556 FGTTNPGGKLPMTWYPQDYLAKVPMTNMGMRADPSRGYPGRTYRFYKGPVVFPFGHGMSY 615

Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
           T F + +  +P+ V +        +  N T   N     ++ +    C+       I+V+
Sbjct: 616 TTFAHSLVQAPQEVAVPFTSLYALQ--NTTAARN-----SIRVSHANCEPLVLGVHIDVK 668

Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           N G MDG + ++V+S PP    +  K++IG+E+V I AG   +V   +  CK L +VD  
Sbjct: 669 NTGDMDGIQTLLVFSSPPEGKWSANKKLIGFEKVHIVAGSKKRVKIDIPVCKHLSVVDRF 728

Query: 728 ANSLLASGAHTILVGEGVGGVSFPLQLNL 756
               L  G H + +G+    +S  LQ NL
Sbjct: 729 GIRRLPIGKHDLHIGDLKHSIS--LQANL 755


>gi|297811069|ref|XP_002873418.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
 gi|297319255|gb|EFH49677.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
          Length = 780

 Score =  717 bits (1851), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 368/742 (49%), Positives = 491/742 (66%), Gaps = 29/742 (3%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+   +C+  L    R  DLV R+TL EK+  +G  A GV RLG+P Y+WWSEALHGVS 
Sbjct: 49  LAGLRFCNTGLNIKSRVTDLVGRLTLEEKIGFLGSNAIGVSRLGIPAYKWWSEALHGVSN 108

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G       G+ F  +VPGATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G+AGL
Sbjct: 109 VGG------GSSFSGQVPGATSFPQVILTAASFNVSLFQAIGKVVSTEARAMYNVGSAGL 162

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFWSPN+N+ RDPRWGR  ETPGEDP +  +YA+ YVRGLQ+ +G + +R      LK++
Sbjct: 163 TFWSPNVNIFRDPRWGRGQETPGEDPELSSKYAVAYVRGLQETDGGDPNR------LKVA 216

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYD+DNW+   RF F++ V +QDM +TF  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 217 ACCKHYTAYDVDNWKDVHRFTFNAVVNQQDMADTFQPPFKSCVVDGNVASVMCSYNQVNG 276

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCADP LL+  IRG W  +GYIVSDCDS+  +     +   T E+AVA+ + AGLDL+
Sbjct: 277 KPTCADPDLLSGVIRGQWKLNGYIVSDCDSVDVLYTKQHY-TKTPEEAVAKSILAGLDLN 335

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICN 364
           C  +   + M AV+ G + E  ID ++   +  LMRLG+FDG P+    Y  LG N++C 
Sbjct: 336 CDHFTGQYAMKAVKVGLVNETAIDKAISNNFATLMRLGFFDGDPKKQQLYGGLGPNDVCT 395

Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
             + ELA +AARQGIVLLKN  G+LPL+   IKTLA++GP+ANAT+ MIGNY G PC+YT
Sbjct: 396 ANNQELARDAARQGIVLLKNSAGSLPLSPSAIKTLAVIGPNANATETMIGNYNGIPCKYT 455

Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           +P+ G  A +    Y  GC ++ C     + +A   A +ADA V+V G D S+E E  DR
Sbjct: 456 TPLQGL-AETVSSTYQLGC-NVACAE-PDLGSAAALAASADAVVLVMGADQSIEQENLDR 512

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
           +DL LPG Q EL+ +VA  AKGPV LVIMS GA DI FAKN  KI  I+WVGYPGE GG 
Sbjct: 513 LDLYLPGKQQELVTQVAKVAKGPVVLVIMSGGAFDITFAKNEEKITGIMWVGYPGEAGGL 572

Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPF 601
           AIADVIFG++NP G LP+TWY  +YV K+P T+M +RP   N +PGRTY+F+ G  VY F
Sbjct: 573 AIADVIFGRHNPSGNLPMTWYPQSYVEKVPMTNMNMRPDKSNGYPGRTYRFYTGETVYAF 632

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDIN-YTVGTNKPPCAAVLIDDVKCKDYKF 660
           G GLSYT F +++  +PK V + LD++  CR     +V    P C     D+       F
Sbjct: 633 GDGLSYTNFNHQILKAPKLVSLDLDENHACRSSECQSVDAIGPHC-----DNAVGGGLNF 687

Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
             Q++V N+G  +GS  V +++ PP + G+  K ++G+E++ +   +   + F ++ CK 
Sbjct: 688 EVQLKVRNVGDREGSHTVFLFTTPPEVHGSPRKHLLGFEKIRLGEKEETVIRFNVDVCKD 747

Query: 721 LKIVDNAANSLLASGAHTILVG 742
           L +VD      +A G + + VG
Sbjct: 748 LSVVDEIGKRKIALGHYLLHVG 769


>gi|296083274|emb|CBI22910.3| unnamed protein product [Vitis vinifera]
          Length = 738

 Score =  717 bits (1851), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 366/739 (49%), Positives = 496/739 (67%), Gaps = 61/739 (8%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
           + P+C   LP  ERA+DLV R+TL EK++ + + A  VPRLG+  YEWWSEALHGVS +G
Sbjct: 38  NLPFCRVSLPIQERARDLVGRLTLQEKIRLLVNNAIDVPRLGIKGYEWWSEALHGVSNVG 97

Query: 71  RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
                 PGT F    PGATSFP VI T ASFN SLW++IG+ VS EARAMYN G AGLT+
Sbjct: 98  ------PGTKFGGSFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAMYNGGMAGLTY 151

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           WSPN+N+ RDPRWGR  ETPGEDP V  +YA  YVRGLQ        RD     LK++AC
Sbjct: 152 WSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQGNA-----RDR----LKVAAC 202

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKHY AYDLD+W G DRFHF++RV++QD+++T+ +PF+ CV EG+V+SVMCSYN+VNG P
Sbjct: 203 CKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPFKACVVEGNVASVMCSYNQVNGKP 262

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
           TCADP LL  TIRG+W  +GYIVSDCDS+    +   +   T E+A A  +KAGLDLDCG
Sbjct: 263 TCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQHY-TATPEEAAAVAIKAGLDLDCG 321

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
            +    T  A++ GK+ EAD++ +L     V MRLG FDG P    Y NLG  ++C P H
Sbjct: 322 PFLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMFDGEPSAQPYGNLGPRDVCTPAH 381

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
            +LA EAARQGIVL++N   ALPL+T   +T+A++GP+++ T+ MIGNY G  C YT+P+
Sbjct: 382 QQLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPNSDVTETMIGNYAGVACGYTTPL 441

Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
            G   Y++ I+ A GC+ + C+++    AA+ AA+ ADATV+V GLD S+EAE +DRVD+
Sbjct: 442 QGIGRYARTIHQA-GCSGVACRDDQQFGAAVAAARQADATVLVMGLDQSIEAEFRDRVDI 500

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
           LLPG Q EL++KVA A++GP  LV+MS G +D++FAKN+P+I +I+WVGYPG+ GG AIA
Sbjct: 501 LLPGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKNDPRIAAIIWVGYPGQAGGTAIA 560

Query: 548 DVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
           DV+FG+ NPGG+LP+TWY  +Y+ K P T+M +R  P   +PGRTY+F++GPVV+PFG+G
Sbjct: 561 DVLFGRTNPGGKLPVTWYPQSYLRKAPMTNMAMRAIPSRGYPGRTYRFYNGPVVFPFGHG 620

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
           LSY+ F + +A +P +                                         F I
Sbjct: 621 LSYSTFAHSLAQAPTT--------------------------------------PLGFHI 642

Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIV 724
           +V+N G MDGS  ++++S PP    +  K+++ +E+V + AG   +V F ++ CK L +V
Sbjct: 643 DVKNTGTMDGSHTLLLFSTPPPGTWSPNKRLLAFEKVHVGAGSQERVRFDVHVCKHLSVV 702

Query: 725 DNAANSLLASGAHTILVGE 743
           D+     +  G H   +G+
Sbjct: 703 DHFGIHRIPMGEHHFHIGD 721


>gi|18025340|gb|AAK38481.1| alpha-L-arabinofuranosidase/beta-D-xylosidase isoenzyme ARA-I
           [Hordeum vulgare]
          Length = 777

 Score =  716 bits (1849), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 362/743 (48%), Positives = 501/743 (67%), Gaps = 30/743 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+ + +C+ K     RA+DLV R+TL EKV  + +    + RLG+P YEWWSEALHGVS+
Sbjct: 48  LAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYEWWSEALHGVSY 107

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G      PGT F   VPGATSFP  ILT ASFN SL++ IG+ VSTEARAM+N+G AGL
Sbjct: 108 VG------PGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNVGLAGL 161

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFWSPNIN+ RDPRWGR  ETPGEDP +  +YA+ YV GLQD  G     D     LK++
Sbjct: 162 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDA-GAGGVTDG---ALKVA 217

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYD+DNW+G +R+ FD++V++QD+ +TF  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 218 ACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNG 277

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCAD  LL   IRGDW  +GYIVSDCDS+  ++ + +    T E+A A  +K+G+DL+
Sbjct: 278 KPTCADKDLLEGVIRGDWKLNGYIVSDCDSVD-VLYTQQHYTKTPEEAAAITIKSGVDLN 336

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG++    T+ AVQ G+++E D+D ++   +I+LMRLG+FDG P+   + +LG  ++C  
Sbjct: 337 CGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTS 396

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            + ELA E ARQGIVLLKN +GALPL+  +IK++A++GP+ANA+  MIGNYEGTPC+YT+
Sbjct: 397 SNRELARETARQGIVLLKN-SGALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTT 455

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           P+ G  A    + Y PGC ++ C  NS+ +  A+ AA +AD TV+V G D S+E E  DR
Sbjct: 456 PLQGLGAKVNTV-YQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDR 514

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
             LLLPG QT+L++ VA+A+ GPV LV+MS G  DI+FAK + KI + LWVGYPGE GG 
Sbjct: 515 TSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAATLWVGYPGEAGGA 574

Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPF 601
           A+ D +FG +NP GRLP+TWY A+Y   +  T M +RP     +PGRTY+F+ G  V+ F
Sbjct: 575 ALDDTLFGSHNPSGRLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRTYRFYTGDTVFAF 634

Query: 602 GYGLSYTQFKYKVASSPKS-VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
           G GLSYT+  + + S+P S V ++L +D  CR            CA+V      C D   
Sbjct: 635 GDGLSYTKMSHSLVSAPPSYVSMRLAEDHLCR---------AEECASVEAAGDHCDDLAL 685

Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
             +++V N G++ G+  V+++S PP       K ++G+E+V +A G++  V F ++ C+ 
Sbjct: 686 DVKLQVRNAGEVAGAHSVLLFSSPPPAHNAPAKHLVGFEKVSLAPGEAGTVAFRVDVCRD 745

Query: 721 LKIVDNAANSLLASGAHTILVGE 743
           L +VD      +A G HT+  G+
Sbjct: 746 LSVVDELGGRKVALGGHTLHDGD 768


>gi|302811514|ref|XP_002987446.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
 gi|300144852|gb|EFJ11533.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
          Length = 772

 Score =  716 bits (1847), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/752 (48%), Positives = 486/752 (64%), Gaps = 48/752 (6%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+ FP+C+  LP  +R +D V R+TL EK+ Q+ + A G+PRLG+P Y+WW EALHGV+ 
Sbjct: 39  LAAFPFCNTSLPITDRVEDYVARLTLEEKISQLINTATGIPRLGVPKYQWWQEALHGVA- 97

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
                 S PG  F   VP ATSFP  I T ASFN SL+  IGQ VSTEARAM+NLG +GL
Sbjct: 98  ------SSPGVQFGGSVPAATSFPMPITTAASFNTSLFYGIGQAVSTEARAMHNLGQSGL 151

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFWSPNIN+ RDPRWGR  ETPGEDP +   +A  YVRGLQ+ +       + S  LK+S
Sbjct: 152 TFWSPNINIYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQESQ-------AGSDKLKVS 204

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKH  AYD+DNW G DR+HF++ VTEQD+++T+  PF+ CV +G VSSVMCSYNR+NG
Sbjct: 205 ACCKHMTAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSSVMCSYNRLNG 264

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           +PTCAD +LL  T+R  W  +GYIVSDCDS+Q   ++  +    ++ A   +  AGL+L+
Sbjct: 265 VPTCADHELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAEDAAADAL-LAGLNLN 323

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG +    T+ A+QQ K+ EA I+ +L +L  V MRLG +DG P+   Y +LG +++C  
Sbjct: 324 CGTFLAKHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGSLGASDVCTS 383

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H  LA EAARQG+VLLKN  GALPL+T  IK+LA+VGPHANAT+AMIGNY G PC+YTS
Sbjct: 384 EHQTLALEAARQGMVLLKN-LGALPLSTSKIKSLAVVGPHANATRAMIGNYAGIPCKYTS 442

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+  F  Y++V +YAPGCA++ C ++S+I  A+ AA  ADA V+  GLDL++EAE  DR 
Sbjct: 443 PLQAFQKYAQV-SYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLTIEAESLDRT 501

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            LLLPG Q EL+++V  AAKGPV +VI+SAGA+DI FA ++ +I  ILW GYPG+ GG A
Sbjct: 502 SLLLPGKQQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAGYPGQAGGAA 561

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGY 603
           IA+VIFG +NP G+LP TWY  N+  I    M +RP     +PGRTY+F+ GP ++ FG 
Sbjct: 562 IAEVIFGDHNPSGKLPATWYPQNFTSISMLDMNMRPNASTGYPGRTYRFYTGPTIFKFGD 621

Query: 604 GLSYTQFKYKVASSPKSVDIK----------LDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
           GLSYT    K   +P  + I           L K   C  ++ T             D+ 
Sbjct: 622 GLSYTSLSAKFIKAPSFLSIPSTAPMQPCTGLKKSSSCFHLDAT-------------DEK 668

Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVFIAAGQ-SAK 710
            C+  K    I V N G M  S  +M++S PP  G  G   +Q++G+ ++ IA    S  
Sbjct: 669 SCESLKSQVAISVRNKGAMAISHTLMLFSTPPSAGSDGVPQRQLVGFNKIQIAGDSISNP 728

Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           V F ++ C+     D     LL SG H +  G
Sbjct: 729 VIFDLDPCRHFVHADRDGKKLLRSGTHVLTAG 760


>gi|302786474|ref|XP_002975008.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
 gi|300157167|gb|EFJ23793.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
          Length = 772

 Score =  712 bits (1838), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 355/748 (47%), Positives = 475/748 (63%), Gaps = 27/748 (3%)

Query: 8   KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
           + S FP+CD  LP P+R  DLV RM L EK+ Q+   A G+PRLG+P Y+WW EALHGV+
Sbjct: 29  RSSSFPFCDVSLPVPDRVADLVGRMNLSEKIAQIVSNASGIPRLGIPGYQWWEEALHGVA 88

Query: 68  FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
                    PG  F + VP ATSFP VILT ASFN SLW KI Q +S EA AMYN G +G
Sbjct: 89  -------ESPGVKFAAPVPSATSFPQVILTVASFNSSLWNKIAQAISIEAIAMYNAGRSG 141

Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS--DSRP- 184
           LTFWSPNIN+ RDPRWGR  ETPGEDP +  +YA  +VRGLQ+ +  E    S    RP 
Sbjct: 142 LTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAYFVRGLQEGDYDEGTAISTMQRRPT 201

Query: 185 -LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LK+S+CCKH+ AYD++  EG D FHF+++VT QD+Q+TF  PF  C+ +G  S +MCSY
Sbjct: 202 RLKVSSCCKHFTAYDMEKSEGTDCFHFNAQVTVQDLQDTFDPPFRSCIVDGQASGLMCSY 261

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NRVNG+P+CAD   L +T+R  W F GYIVSDCD++  + E   +   T EDAVA VL A
Sbjct: 262 NRVNGVPSCADYTFLTETVRNSWGFEGYIVSDCDAVALLYEYINY-TTTAEDAVADVLSA 320

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNN 361
           G+DL+CG +    T  A++QGK+ EA +D +L  +  V MRLG FDG+    Y ++G + 
Sbjct: 321 GMDLNCGTFLLRHTAAAIEQGKVTEAAVDRALSNVMTVRMRLGLFDGNSGETYNSIGPDA 380

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           +C  +H +L+ EAA QGIVLLKN    LP    ++ T+A++GP  NAT+ M+GNY G PC
Sbjct: 381 VCTREHRQLSLEAAEQGIVLLKNSGNVLPFPRNDLMTIAVIGPSGNATETMLGNYAGVPC 440

Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
           +Y +P  G   Y+K + + PGC DI+C + ++  AA+ AA+N+DA VIV GLD   E EG
Sbjct: 441 QYITPFQGLQEYTKGVVFEPGCKDIMCNDTTLFLAAVRAAENSDAVVIVVGLDKDQEREG 500

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
            DR  LLLPG+Q +L+ +V+  AKGPV LV+MS G +D+ FAK N KI S+LWVGYPGE 
Sbjct: 501 LDRTSLLLPGYQQDLVLEVSKVAKGPVILVVMSGGPIDVTFAKGNCKISSVLWVGYPGEA 560

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRP--VNNFPGRTYKFFDGPVV 598
           GG+AIA VIFG +NP GRLP+TWY   + + +   +M LRP     FPGRTY+F+ G  V
Sbjct: 561 GGKAIARVIFGDHNPAGRLPMTWYPQAFAEHVSILNMHLRPNTSTGFPGRTYRFYTGENV 620

Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
           Y FG+GLSYT F Y   S+P ++  +       R      G    P     ID   C+  
Sbjct: 621 YEFGHGLSYTNFTYTNFSAPSNITAR--NTVAIRTPLREDGARHFP-----IDYTGCEAL 673

Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI---KQVIGYERVFIAAGQSAKVGFTM 715
            F     + N G  D   + ++Y+ PP  + +     KQ+I ++R  + AG+ AKV F +
Sbjct: 674 AFKVVAYISNTGTRDSDHISLLYAIPPAASSSLSPPRKQLISFKRQHLIAGRCAKVEFDV 733

Query: 716 NACKSLKIVDNAANSLLASGAHTILVGE 743
           + CK L + + A   +L  G + + +G+
Sbjct: 734 DTCKDLGLTNEAGTKVLVHGDYKLSLGD 761


>gi|302791321|ref|XP_002977427.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
 gi|300154797|gb|EFJ21431.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
          Length = 772

 Score =  711 bits (1836), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 352/749 (46%), Positives = 475/749 (63%), Gaps = 29/749 (3%)

Query: 8   KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
           + S FP+CD  LP P+R  DLV RM L EK+ Q+   A G+PRLG+P Y+WW EALHGV+
Sbjct: 29  RSSSFPFCDVSLPVPDRVADLVGRMNLSEKIAQIVSNASGIPRLGIPGYQWWEEALHGVA 88

Query: 68  FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
                    PG  F + VP ATSFP VILT ASFN SLW KI Q +S EA AMYN G +G
Sbjct: 89  -------ESPGVKFAAPVPSATSFPQVILTVASFNSSLWNKIAQAISIEAIAMYNAGRSG 141

Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVE-----GVEYHRDSDS 182
           LTFWSPNIN+ RDPRWGR  ETPGEDP +  +YA  +VRGLQ+ +      +   + S +
Sbjct: 142 LTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAYFVRGLQEGDYDEGTAISTMQGSPT 201

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
           R LK+S+CCKH+ AYD++  EG D FHF+++VT QD+Q+TF  PF  C+ +G  S +MCS
Sbjct: 202 R-LKVSSCCKHFTAYDMEKSEGTDCFHFNAQVTVQDLQDTFDPPFRSCIVDGQASGLMCS 260

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           YNRVNG+P+CAD   L +T+R  W F GYIVSDCD++  + E   +   T EDAVA VL 
Sbjct: 261 YNRVNGVPSCADYTFLTETVRNSWGFEGYIVSDCDAVALLYEYINY-TTTAEDAVADVLS 319

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKN 360
           AG+DL+CG +    T  A++QGK+ EA +D +L  +  V MRLG FDG+    Y ++G +
Sbjct: 320 AGMDLNCGTFLLRHTAAAIEQGKVTEAAVDRALSNVMTVRMRLGLFDGNSGETYNSIGPD 379

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            +C P+H +L+ EAA QGIVLLKN    LP    ++ T+A++GP  NAT+ M+GNY G P
Sbjct: 380 AVCTPEHRQLSLEAAEQGIVLLKNSGNVLPFPRNDLMTIAVIGPSGNATETMLGNYAGVP 439

Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
           C+Y +P  G   Y+K + + PGC DI+C + ++  AA+ AA+N+DA VIV GLD   E E
Sbjct: 440 CQYITPFQGLQEYTKCVVFEPGCKDIMCNDTTLFLAAVRAAENSDAVVIVVGLDKDQERE 499

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
           G DR  LLLPG Q  L+ +V+  AKGPV LV+MS G +D+ FAK N KI ++LWVGYPGE
Sbjct: 500 GLDRTSLLLPGNQQGLVLEVSKVAKGPVILVVMSGGPIDVTFAKENCKISNVLWVGYPGE 559

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRP--VNNFPGRTYKFFDGPV 597
            GG+AIA VIFG +NP GRLP+TWY   + + +   +M LRP     FPGRTY+F+ G  
Sbjct: 560 AGGKAIARVIFGDHNPAGRLPMTWYPQAFAEHVSILNMHLRPNTSTGFPGRTYRFYTGEN 619

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           VY FG+GLSYT F Y    +P ++  +       R      G  + P     ID   C+ 
Sbjct: 620 VYEFGHGLSYTNFTYTNFCAPSNITAR--NTVAIRTPLREDGARQFP-----IDYTGCEA 672

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI---KQVIGYERVFIAAGQSAKVGFT 714
             F     + N G  D   + ++Y+ PP  + +     KQ+I ++R  + AG+ AKV F 
Sbjct: 673 LAFKVVAYISNTGTRDSDHISLLYAIPPAASSSLSPPRKQLISFKRQHLIAGRCAKVEFD 732

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVGE 743
           ++ CK L + + A   +L  G + + +G+
Sbjct: 733 VDTCKDLGLTNEAGTKVLVHGDYKLSLGD 761


>gi|449466797|ref|XP_004151112.1| PREDICTED: beta-D-xylosidase 1-like [Cucumis sativus]
          Length = 770

 Score =  711 bits (1835), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 370/755 (49%), Positives = 488/755 (64%), Gaps = 33/755 (4%)

Query: 7   VKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGV 66
           V   +  +C   L   ER KDL+ R+TL EK++ + + A  VPRLG+  YEWWSEALHGV
Sbjct: 39  VGTRNMGFCQESLGIEERVKDLIGRLTLGEKIRLLVNNAIAVPRLGIRGYEWWSEALHGV 98

Query: 67  SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
           S +G      PGT F    PGATSFP VI T ASFN+SLW  IG+ VS EARAMYN G A
Sbjct: 99  SNVG------PGTKFGGTFPGATSFPQVITTAASFNQSLWLLIGRVVSDEARAMYNGGTA 152

Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK 186
           GLT+WSPN+N+ RDPRWGR  ETPGEDP +  +YA NYV+GLQ  +G         + LK
Sbjct: 153 GLTYWSPNVNIFRDPRWGRGQETPGEDPILAAKYAANYVQGLQGNDG--------KKRLK 204

Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
           ++ACCKHY AYDLDNW G DR+HF+++V++QD+++T+ +PF+ CV EG V+SVMCSYN+V
Sbjct: 205 VAACCKHYTAYDLDNWNGVDRYHFNAKVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 264

Query: 247 NGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD 306
           NG PTCADP LL  TIRG W   GYIVSDCDS+  + +S  F   T E+A A  +KAGLD
Sbjct: 265 NGKPTCADPDLLKNTIRGAWGLDGYIVSDCDSVGVLYDSQHF-TPTPEEAAASTIKAGLD 323

Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNIC 363
           LDCG +    T  AV +G + E D++ +L  L  V MRLG FDG P    Y NLG  ++C
Sbjct: 324 LDCGPFLAVHTATAVGRGLLKEVDLNNALANLLSVQMRLGMFDGEPAAQPYGNLGPKDVC 383

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
            P H  LA EAARQGIVLL+N  GALPL+    +T+A++GP+++AT  MIGNY G  C Y
Sbjct: 384 TPAHKHLALEAARQGIVLLQNRAGALPLSPTRHRTVAVIGPNSDATVTMIGNYAGVACEY 443

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
           T+P+ G   Y K I +A GCA++ C  + +I  A  AA+ ADA V+V GLD S+EAE +D
Sbjct: 444 TTPVQGISKYVKTI-HAKGCANVACVGDQLIGEAEAAARVADAAVVVVGLDQSIEAESRD 502

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R  +LLPG Q EL+ ++  A KGP  +V+MS G +D++FAKN+ KI  ILWVGYPG+ GG
Sbjct: 503 RNGVLLPGKQEELVRRIGLACKGPTVVVLMSGGPIDVSFAKNDGKISGILWVGYPGQAGG 562

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYP 600
            AIADV+FG  NPGG+LP+TWY  +Y+ K+P T+M LR  P   +PGRTY+F+ GPVV+P
Sbjct: 563 AAIADVLFGATNPGGKLPMTWYPQSYLAKVPMTNMGLRPDPSTGYPGRTYRFYKGPVVFP 622

Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
           FG+GLSY++F    A +P    I L       + + TV  +   CA+V            
Sbjct: 623 FGFGLSYSKFSQSFAEAP--TKISLPLSSLSPNSSATVKVSHTDCASV---------SDL 671

Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
              I+V+N G +DGS  ++V+S  P    +  K +IG+E+V + AG   +V   ++ C  
Sbjct: 672 PIMIDVKNTGTVDGSHTILVFSTVPNQTWSPEKHLIGFEKVHLIAGSQKRVRIGIHVCDH 731

Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLN 755
           L  VD      +  G H + +G+    +S    L 
Sbjct: 732 LSRVDEFGTRRIPMGEHKLHIGDLTHSISLQADLQ 766


>gi|302796583|ref|XP_002980053.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
 gi|300152280|gb|EFJ18923.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
          Length = 772

 Score =  711 bits (1835), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 366/752 (48%), Positives = 485/752 (64%), Gaps = 48/752 (6%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+ FP+C+  L   +R +D V R+TL EK+ Q+ + A G+PRLG+P Y+WW EALHGV+ 
Sbjct: 39  LAAFPFCNTSLAITDRVEDYVARLTLEEKISQLINTATGIPRLGVPKYQWWQEALHGVA- 97

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
                 S PG  F   VP ATSFP  I T ASFN SL+  IGQ VSTEARAM+NLG +GL
Sbjct: 98  ------SSPGVQFGGSVPAATSFPMPITTAASFNTSLFYGIGQAVSTEARAMHNLGQSGL 151

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFWSPNIN+ RDPRWGR  ETPGEDP +   +A  YVRGLQ+ +       + S  LK+S
Sbjct: 152 TFWSPNINIYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQESQ-------AGSDKLKVS 204

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKH  AYD+DNW G DR+HF++ VTEQD+++T+  PF+ CV +G VSSVMCSYNR+NG
Sbjct: 205 ACCKHMTAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSSVMCSYNRLNG 264

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           +PTCAD +LL  T+R  W  +GYIVSDCDS+Q   ++  +    ++ A   +  AGL+L+
Sbjct: 265 VPTCADHELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAEDAAADAL-LAGLNLN 323

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG +    T+ A+QQ K+ EA I+ +L +L  V MRLG +DG P+   Y +LG +++C  
Sbjct: 324 CGTFLAKHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGSLGASDVCTS 383

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H  LA EAARQG+VLLKN  GALPL+T  IK+LA+VGPHANAT+AMIGNY G PC+YTS
Sbjct: 384 EHQTLALEAARQGMVLLKN-LGALPLSTSKIKSLAVVGPHANATRAMIGNYAGIPCKYTS 442

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+  F  Y++V +YAPGCA++ C ++S+I  A+ AA  ADA V+  GLDL++EAE  DR 
Sbjct: 443 PLQAFQKYAQV-SYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLTIEAESLDRT 501

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            LLLPG Q EL+++V  AAKGPV +VI+SAGA+DI FA ++ +I  ILW GYPG+ GG A
Sbjct: 502 SLLLPGKQQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAGYPGQAGGAA 561

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGY 603
           IA+VIFG +NP G+LP TWY  N+  I    M +RP     +PGRTY+F+ GP ++ FG 
Sbjct: 562 IAEVIFGDHNPSGKLPATWYPQNFTSISMLDMNMRPNASTGYPGRTYRFYTGPTIFKFGD 621

Query: 604 GLSYTQFKYKVASSPKSVDIK----------LDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
           GLSYT    K   +P  + I           L K   C  ++ T             D+ 
Sbjct: 622 GLSYTSLSAKFIKAPSFLSIPSTAPMQPCTGLKKSSSCFHLDAT-------------DEK 668

Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVFIAAGQ-SAK 710
            C+  K    I V N G M  S  +M++S PP  G  G   +Q++G+ ++ IA    S  
Sbjct: 669 SCESLKSQVAISVRNKGAMAISHTLMLFSTPPNAGSDGVPQRQLVGFNKIQIAGDSISNP 728

Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           V F ++ C+     D     LL SG H +  G
Sbjct: 729 VIFDLDPCRHFVHADPDGKKLLRSGTHVLTAG 760


>gi|115486595|ref|NP_001068441.1| Os11g0673200 [Oryza sativa Japonica Group]
 gi|113645663|dbj|BAF28804.1| Os11g0673200 [Oryza sativa Japonica Group]
          Length = 822

 Score =  706 bits (1822), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/794 (47%), Positives = 497/794 (62%), Gaps = 64/794 (8%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           +  P+C   LP   RA+DLV R+T  EKV+ + + A GVPRLG+  YEWWSEALHGVS  
Sbjct: 39  ATLPFCRRSLPARARARDLVARLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDT 98

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQ------------------ 111
           G      PG  F    PGAT+FP VI T ASFN +LW+ IGQ                  
Sbjct: 99  G------PGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQVMPILKGGHARCNQRPSC 152

Query: 112 --------------TVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVV 157
                          VS E RAMYN G AGLTFWSPN+N+ RDPRWGR  ETPGEDP V 
Sbjct: 153 IRISVFMYVYVCAQAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVA 212

Query: 158 GRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQ 217
            RYA  YVRGLQ        +   S  LK++ACCKH+ AYDLDNW G DRFHF++ VT Q
Sbjct: 213 ARYAAAYVRGLQ-------QQQPSSGRLKLAACCKHFTAYDLDNWSGTDRFHFNAVVTRQ 265

Query: 218 DMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCD 277
           D+++TF +PF  CV +G  +SVMCSYN+VNG+PTCAD   L  TIR  W   GYIVSDCD
Sbjct: 266 DLEDTFNVPFRSCVVDGRAASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCD 325

Query: 278 SIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRF 337
           S+  +  S +    T+EDAVA  L+AGLDLDCG +   +T GAV QGK+ + DID ++  
Sbjct: 326 SVD-VFYSDQHYTRTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTN 384

Query: 338 LYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTG 394
              V MRLG FDG P    + +LG  ++C   H ELA EAARQGIVLLKND  ALPL+  
Sbjct: 385 TVTVQMRLGMFDGDPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPA 444

Query: 395 NIK-TLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSM 453
             +  +A+VGPHA AT AMIGNY G PCRYT+P+ G   Y+    + PGC D+ C  +  
Sbjct: 445 TARRAVAVVGPHAEATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQ 504

Query: 454 -IPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVI 512
            I AA+DAA+ ADAT++VAGLD  +EAEG DR  LLLPG Q ELI+ VA A+KGPV LV+
Sbjct: 505 PIAAAVDAARRADATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVL 564

Query: 513 MSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-K 571
           MS G +DI FA+N+PKI  ILW GYPG+ GG+AIADVIFG +NPGG+LP+TWY  +Y+ K
Sbjct: 565 MSGGPIDIGFAQNDPKIAGILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDYLQK 624

Query: 572 IPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQ 629
           +P T+M +R  P   +PGRTY+F+ GP ++PFG+GLSYT F + +A +P  + ++L    
Sbjct: 625 VPMTNMAMRANPAKGYPGRTYRFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLTVRLSAHH 684

Query: 630 QCRDINYTVGTNK--PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY------ 681
                + ++         AAV +   +C++ +    ++V N+G+ DG+  V+VY      
Sbjct: 685 AAASASASLNATARLSRAAAVRVAHARCEELRMPVHVDVRNVGERDGAHTVLVYAAAPAS 744

Query: 682 --SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
             ++     G  ++Q++ +E+V + AG +A+V   ++ C  L + D      +  G H +
Sbjct: 745 SAAEAAAGHGAPVRQLVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGVRRIPVGEHRL 804

Query: 740 LVGEGVGGVSFPLQ 753
           ++GE    V+  L+
Sbjct: 805 IIGELTHTVTIALE 818


>gi|326489197|dbj|BAK01582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 709

 Score =  704 bits (1818), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 357/716 (49%), Positives = 489/716 (68%), Gaps = 33/716 (4%)

Query: 39  QQMGDLAYGVP---RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVI 95
           Q++G L    P   RLG+P YEWWSEALHGVS++G      PGT F   VPGATSFP  I
Sbjct: 7   QKVGFLVNKQPALGRLGIPAYEWWSEALHGVSYVG------PGTRFSPLVPGATSFPQPI 60

Query: 96  LTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPY 155
           LT ASFN SL++ IG+ VSTEARAM+N+G AGLTFWSPNIN+ RDPRWGR  ETPGEDP 
Sbjct: 61  LTAASFNASLFRAIGEVVSTEARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPL 120

Query: 156 VVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVT 215
           +  +YA+ YV GLQD  G     D     LK++ACCKHY AYD+DNW+G +R+ FD++V+
Sbjct: 121 LASKYAVGYVTGLQDA-GAGGVTDG---ALKVAACCKHYTAYDVDNWKGVERYTFDAKVS 176

Query: 216 EQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSD 275
           +QD+ +TF  PF+ CV +G+V+SVMCSYN+VNG PTCAD  LL   IRGDW  +GYIVSD
Sbjct: 177 QQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSD 236

Query: 276 CDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSL 335
           CDS+  ++ + +    T E+A A  +K+GLDL+CG++    T+ AVQ G+++E D+D ++
Sbjct: 237 CDSVD-VLYTQQHYTKTPEEAAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAI 295

Query: 336 RFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLN 392
              +I+LMRLG+FDG P+   + +LG  ++C   + ELA E ARQGIVLLKN +GALPL+
Sbjct: 296 TNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVLLKN-SGALPLS 354

Query: 393 TGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNS 452
             +IK++A++GP+ANA+  MIGNYEGTPC+YT+P+ G  A    + Y PGC ++ C  NS
Sbjct: 355 AKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPLQGLGAKVNTV-YQPGCTNVGCSGNS 413

Query: 453 M-IPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLV 511
           + +  A+ AA +AD TV+V G D S+E E  DR  LLLPG QT+L++ VA+A+ GPV LV
Sbjct: 414 LQLSTAVAAAASADVTVLVVGADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILV 473

Query: 512 IMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV- 570
           +MS G  DI+FAK + KI +ILWVGYPGE GG A+AD++FG +NP GRLP+TWY A+Y  
Sbjct: 474 VMSGGPFDISFAKASDKIAAILWVGYPGEAGGAALADILFGSHNPSGRLPVTWYPASYAD 533

Query: 571 KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS-VDIKLDK 627
            +  T M +RP     +PGRTY+F+ G  V+ FG GLSYT+  + + S+P S V ++L +
Sbjct: 534 TVTMTDMRMRPDTSTGYPGRTYRFYTGDTVFAFGDGLSYTKMSHSLVSAPPSYVSMRLAE 593

Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI 687
           D  CR            CA+V      C D  F  +++V N G++ G+  V+++S PP  
Sbjct: 594 DHPCR---------AEECASVEAAGDHCDDLAFDVKLQVRNAGEVAGAHSVLLFSSPPPA 644

Query: 688 AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
                K ++G+E+V +A G++  V F ++ C+ L +VD      +A G HT+ VG+
Sbjct: 645 HNAPAKHLLGFEKVSLAPGEAGTVAFRVDVCRDLSVVDELGGRKVALGGHTLHVGD 700


>gi|302811516|ref|XP_002987447.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
 gi|300144853|gb|EFJ11534.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
          Length = 779

 Score =  703 bits (1814), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 359/765 (46%), Positives = 476/765 (62%), Gaps = 69/765 (9%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L  F +C+ +LP   R +DL+ RMTL EK+ Q+ + A G+PRLGLP YEWW EALHGV+ 
Sbjct: 41  LLQFGFCNTRLPTSTRVEDLISRMTLQEKIIQLVNNAAGIPRLGLPRYEWWQEALHGVAV 100

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
                   PG  F  + PGATSFP  ILT ASF+          VSTEARAM+N   AGL
Sbjct: 101 -------SPGVKFGGKFPGATSFPMPILTAASFD---------AVSTEARAMHNYQRAGL 144

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T+WSPN+N+ RDPRWGR  ETPGEDP +  +YA  YVRGLQD        +     LK+S
Sbjct: 145 TYWSPNVNIYRDPRWGRGQETPGEDPLLSSKYATFYVRGLQDT-------NLGGDKLKVS 197

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKH  AYD+DNW+G  RF F++ VT+QD+ +T+  PF+ CV +  VSSVMCSYNRVNG
Sbjct: 198 ACCKHMTAYDVDNWKGTTRFKFNAIVTQQDLSDTYNPPFQSCVEDAKVSSVMCSYNRVNG 257

Query: 249 IPTCADPKLLNQTIRGDWNFHG----------------YIVSDCDSIQTIVESHKFLNDT 292
           +PTCAD  LL+ T+R  WN +G                YIVSDCDS+QT  ++  +   T
Sbjct: 258 VPTCADYNLLSATVRSSWNLNGSILLTCEVLLLYLPCSYIVSDCDSLQTFFDNTNYAK-T 316

Query: 293 KEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP 352
            ED VA  L AGL+LDCG +    T  A+  GKI EA+++ +LR+LY V MRLG +DG+P
Sbjct: 317 AEDVVADALLAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMRLGLYDGNP 376

Query: 353 Q---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT 409
           +   Y NLG  ++C  ++ +LA +AA++GIVLLKN+   LP +  NI+T+A +GPHA AT
Sbjct: 377 RSQPYGNLGPQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAAIGPHAKAT 436

Query: 410 KAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVI 469
           +AMIGNY+G PC+YT+P DG  AY++V+ Y+ GC+D+ C ++S+I +A+  A  ADA V+
Sbjct: 437 RAMIGNYQGIPCKYTTPHDGLSAYARVV-YSAGCSDVACYSDSLIGSAVSTASQADAVVL 495

Query: 470 VAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
             GLDL+ EAEGKDR  LLLPG Q EL+ +V  AAKGP  LVI S G+VD++FAK N K+
Sbjct: 496 FVGLDLNQEAEGKDRTSLLLPGKQQELVTEVTKAAKGPAVLVIFSGGSVDVSFAKYNNKV 555

Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP--VNNFPG 587
           + ILW GYPGE GG AIA V+FG +NPGGRLP+TWY  ++  I    M +RP     +PG
Sbjct: 556 QGILWAGYPGEAGGAAIAQVLFGDHNPGGRLPVTWYPESFTGITMLDMNMRPDASRGYPG 615

Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS--------VDIKLDKDQQCRDINYTVG 639
           RTY+F+ G  VY FGYG +Y++  +K   +P S        V    D +  C  +N    
Sbjct: 616 RTYRFYTGQSVYNFGYGKTYSKLSHKFKEAPLSLGFPEAAAVKRSCDGNLTCFHLNAH-- 673

Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIG 697
                      D++ C       +I V N G    +  V++YS PP  G  G  I+Q+ G
Sbjct: 674 -----------DEITCSTLTSKVRILVHNKGDRPSNRAVLLYSSPPNAGRDGAPIRQLAG 722

Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           + +V +A G    V   ++ CK L         +L  G HT+ VG
Sbjct: 723 FGKVSVAPGAVENVEIEIDPCKHLSHAGANGVRILHGGIHTLAVG 767


>gi|302796585|ref|XP_002980054.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
 gi|300152281|gb|EFJ18924.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
          Length = 779

 Score =  702 bits (1811), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 359/765 (46%), Positives = 476/765 (62%), Gaps = 69/765 (9%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L  F +C+ +LP   R +DL+ RMTL EK+ Q+ + A G+PRLGLP YEWW EALHGV+ 
Sbjct: 41  LLQFGFCNTRLPTSTRVEDLISRMTLQEKIIQLVNNAAGIPRLGLPRYEWWQEALHGVAV 100

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
                   PG  F  + PGATSFP  ILT ASF+          VSTEARAM+N   AGL
Sbjct: 101 -------SPGVKFGGKFPGATSFPMPILTAASFD---------AVSTEARAMHNYQRAGL 144

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T+WSPN+N+ RDPRWGR  ETPGEDP +  +YA  YVRGLQD        +     LK+S
Sbjct: 145 TYWSPNVNIYRDPRWGRGQETPGEDPLLSSKYATFYVRGLQDT-------NLGGDKLKVS 197

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKH  AYD+DNW+G  RF F++ VT+QD+ +T+  PF+ CV +  VSSVMCSYNRVNG
Sbjct: 198 ACCKHMTAYDVDNWKGTTRFKFNAIVTQQDLSDTYNPPFQSCVEDAKVSSVMCSYNRVNG 257

Query: 249 IPTCADPKLLNQTIRGDWNFHG----------------YIVSDCDSIQTIVESHKFLNDT 292
           +PTCAD  LL+ T+R  WN +G                YIVSDCDS+QT  ++  +   T
Sbjct: 258 VPTCADYNLLSATVRSSWNLNGSILLTCEVLLLYLPCSYIVSDCDSLQTFFDNTNYAK-T 316

Query: 293 KEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP 352
            ED VA  L AGL+LDCG +    T  A+  GKI EA+++ +LR+LY V MRLG +DG+P
Sbjct: 317 AEDVVADALLAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMRLGLYDGNP 376

Query: 353 Q---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT 409
           +   Y NLG  ++C  ++ +LA +AA++GIVLLKN+   LP +  NI+T+A +GPHA AT
Sbjct: 377 RSQPYGNLGPQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAAIGPHAKAT 436

Query: 410 KAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVI 469
           +AMIGNY+G PC+YT+P DG  AY++V+ Y+ GC+D+ C +NS+I +A   A  ADA V+
Sbjct: 437 RAMIGNYQGIPCKYTTPHDGLSAYARVV-YSAGCSDVACYSNSLIGSAASTASQADAVVL 495

Query: 470 VAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
             GLDL+ EAEGKDR  LLLPG Q EL+ +V  AAKGPV LVI S G+VD++FAK + K+
Sbjct: 496 FVGLDLNQEAEGKDRTSLLLPGKQQELVTEVTKAAKGPVVLVIFSGGSVDVSFAKYDKKV 555

Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP--VNNFPG 587
           + +LW GYPGE GG AIA V+FG +NPGGRLP+TWY  ++  I    M +RP     +PG
Sbjct: 556 QGMLWAGYPGEAGGAAIAQVLFGDHNPGGRLPVTWYPESFTGITMLDMNMRPDASRGYPG 615

Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS--------VDIKLDKDQQCRDINYTVG 639
           RTY+F+ G  VY FGYG +Y++  +K   +P S        V    D +  C  +N    
Sbjct: 616 RTYRFYTGQSVYNFGYGKTYSKLSHKFKEAPLSLGFPEAAAVKRSCDGNLTCFHLNAH-- 673

Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIG 697
                      D++ C       +I V N G    +  V++YS PP  G  G  I+Q+ G
Sbjct: 674 -----------DEITCSTLTSKVRILVHNEGDRPSNRAVLLYSSPPNAGRDGAPIRQLAG 722

Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           + +V +A G    V   ++ CK L         +L  G HT+ VG
Sbjct: 723 FGKVSVAPGAVENVEIEIDPCKHLSHAGANGVRILHGGIHTLAVG 767


>gi|296084630|emb|CBI25718.3| unnamed protein product [Vitis vinifera]
          Length = 768

 Score =  701 bits (1809), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 349/756 (46%), Positives = 474/756 (62%), Gaps = 44/756 (5%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           SD+P+C+  LP   RA+ LV  +TL EK+QQ+ D A  +PRL +P YEWWSE+LHG++  
Sbjct: 38  SDYPFCNTSLPISTRAQSLVSLLTLSEKIQQLSDEAAAIPRLYIPAYEWWSESLHGIATN 97

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
           G      PG  F+  V  ATSFP V+LT ASFN SLW  IG  ++ EARAMYN+G AGLT
Sbjct: 98  G------PGVSFNGTVSAATSFPQVLLTAASFNRSLWFSIGSAIAVEARAMYNVGQAGLT 151

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
           FW+PNIN+ RDPRWGR  ETPGEDP V   YA+ +VRG Q         DSD   L +SA
Sbjct: 152 FWAPNINIFRDPRWGRGQETPGEDPMVASAYAVEFVRGFQG--------DSDGDGLMLSA 203

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKH  AYDL+ W    R+ FD+ V+ QD+++T+  PF  CV +G  S +MCSYNRVNG+
Sbjct: 204 CCKHLTAYDLEKWGNFSRYSFDAVVSNQDLEDTYQPPFRSCVQQGKASCLMCSYNRVNGV 263

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           P CA   L  Q  + +W F GYI SDCD++ T+ E   + N + EDAVA VLKAG D++C
Sbjct: 264 PACARQDLF-QKAKTEWGFKGYITSDCDAVATVYEYQHYAN-SPEDAVADVLKAGTDINC 321

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
           G Y    T  A+ QGK+ E DID +L  L+ V MRLG FDG P    Y NLG  ++C  +
Sbjct: 322 GSYMLRHTQSAIDQGKVKEEDIDRALFNLFSVQMRLGLFDGDPANGLYGNLGPKDVCTKE 381

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
           H  LA EAARQGIVLLKND   LPL+   I +LA++GP A+    + G Y G PC+  S 
Sbjct: 382 HRTLALEAARQGIVLLKNDKKFLPLDKSRISSLAIIGPQAD-QPFLGGGYTGIPCKPESL 440

Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
           ++G   Y +  ++A GC D+ C +++    A+  A+ AD  V+VAGLDLS E E  DRV 
Sbjct: 441 VEGLKTYVEKTSFAAGCVDVPCLSDTGFDEAVSIARKADIVVVVAGLDLSQETEDHDRVS 500

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           LLLPG Q  LI+ VA A + P+ LV+   G +D++FA+ +P+I SILW+GYPGE G +A+
Sbjct: 501 LLLPGKQMALISSVASAIQKPLVLVLTGGGPLDVSFAEQDPRIASILWIGYPGEAGAKAL 560

Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
           A++IFG +NPGGRLP+TWY  ++ ++P   M +R  P   +PGRTY+F+ G  VY FG G
Sbjct: 561 AEIIFGDFNPGGRLPMTWYPESFTRVPMNDMNMRADPYRGYPGRTYRFYIGHRVYGFGQG 620

Query: 605 LSYTQFKYKVASSPKSVDIKLDKD---------QQCRDINYTVGTNKPPCAAVLIDDV-K 654
           LSYT+F Y+  S+P  +++    D         Q+  ++NY             I+++  
Sbjct: 621 LSYTKFAYQFVSAPNKLNLLRSSDTVSSKNLPRQRREEVNY-----------FHIEELDT 669

Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGF 713
           C   +F  +I V N+G MDGS VVM++S+ P I  GT  KQ+IG+ RV   + +S +   
Sbjct: 670 CDSLRFHVEISVTNVGDMDGSHVVMLFSRVPKIVKGTPEKQLIGFSRVHTVSRRSTETSI 729

Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVS 749
            ++ C+   I +     ++  G HTI++G+ V  VS
Sbjct: 730 MVDPCEHFSIANEQGKRIMPLGDHTIMLGDVVHSVS 765


>gi|255545664|ref|XP_002513892.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
 gi|223546978|gb|EEF48475.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
          Length = 774

 Score =  699 bits (1804), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 344/743 (46%), Positives = 483/743 (65%), Gaps = 27/743 (3%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S F +C   LP  +R +DLV R+TL EK+ Q+   A  +PRLG+P YEWWSEALHGV+ +
Sbjct: 39  SSFLFCKTSLPISQRVRDLVSRLTLDEKISQLVSSAPSIPRLGIPAYEWWSEALHGVANV 98

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GL 128
           GR      G HF+  +  ATSFP VILT ASF+   W +IGQ +  EARA+YN G A G+
Sbjct: 99  GR------GIHFEGAIKAATSFPQVILTAASFDAYQWYRIGQVIGREARAVYNAGQATGM 152

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFW+PNIN+ RDPRWGR  ETPGEDP V G+YA++YVRG+Q   G  +        L+ S
Sbjct: 153 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ---GDSFQGGKLKGHLQAS 209

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKH+ AYDLDNW+G +RF FD+RVT QD+ +T+  PF+ CV +G  S +MC+YNRVNG
Sbjct: 210 ACCKHFTAYDLDNWKGVNRFVFDARVTMQDLADTYQPPFQSCVQQGKASGIMCAYNRVNG 269

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           IP+CAD  LL++T RG W+FHGYI SDCD++  I ++  +   + EDAV  VLKAG+D++
Sbjct: 270 IPSCADFNLLSRTARGQWDFHGYIASDCDAVSIIYDNQGYAK-SPEDAVVDVLKAGMDVN 328

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG Y    T  AV+Q K+ EA ID +L  L+ V MRLG F+G+P    + N+G + +C+ 
Sbjct: 329 CGSYLQKHTKAAVEQKKLPEASIDRALHNLFSVRMRLGLFNGNPTEQPFSNIGPDQVCSQ 388

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H  LA EAAR GIVLLKN    LPL      +LA++GP+AN+ + ++GNY G PC+  +
Sbjct: 389 EHQILALEAARNGIVLLKNSARLLPLQKSKTVSLAVIGPNANSVQTLLGNYAGPPCKTVT 448

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+     Y K   Y  GC  + C + S I  A+D AK  D  V++ GLD + E E  DR+
Sbjct: 449 PLQALQYYVKNTIYYSGCDTVKCSSAS-IDKAVDIAKGVDRVVMIMGLDQTQEREELDRL 507

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           DL+LPG Q ELI  VA +AK P+ LV++S G VDI+FAK +  I SILW GYPGE GG A
Sbjct: 508 DLVLPGKQQELITNVAKSAKNPIVLVLLSGGPVDISFAKYDENIGSILWAGYPGEAGGIA 567

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
           +A++IFG +NPGG+LP+TWY   +VK+P T M +R  P + +PGRTY+F+ G  V+ FGY
Sbjct: 568 LAEIIFGDHNPGGKLPMTWYPQEFVKVPMTDMRMRPDPSSGYPGRTYRFYKGRNVFEFGY 627

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK---CKDYKF 660
           GLSY+++ Y++    ++  + L++    R I+     N  P  A L+  +    CK+ KF
Sbjct: 628 GLSYSKYSYELKYVSQT-KLYLNQSSTMRIID-----NSDPVRATLVAQLGAEFCKESKF 681

Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
           + ++ VEN G+M G   V+++++      G   +Q+IG++ V + AG+ A++ F ++ C+
Sbjct: 682 SVKVGVENQGEMAGKHPVLLFARHARHGNGRPRRQLIGFKSVILNAGEKAEIEFELSPCE 741

Query: 720 SLKIVDNAANSLLASGAHTILVG 742
                +     ++  G H ++VG
Sbjct: 742 HFSRANEDGLRVMEEGTHFLMVG 764


>gi|18025342|gb|AAK38482.1| beta-D-xylosidase [Hordeum vulgare]
          Length = 777

 Score =  699 bits (1804), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/754 (47%), Positives = 477/754 (63%), Gaps = 31/754 (4%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S   +CD +LP  +RA DLV ++TL EK+ Q+GD +  V RLG+P Y+WWSEALHGV+  
Sbjct: 40  SSAAFCDRRLPIEQRAADLVSKLTLEEKISQLGDESPAVDRLGVPAYKWWSEALHGVANA 99

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GL 128
           GR      G H D  +  ATSFP VILT ASFN  LW +IGQ + TEAR +YN G A GL
Sbjct: 100 GR------GVHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARGVYNNGQAEGL 153

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFW+PNINV RDPRWGR  ETPGEDP + G+YA  +VRG+Q   G       +S  L+ S
Sbjct: 154 TFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQ---GYGMSGAINSSDLEAS 210

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKH+ AYDL+NW+G  RF FD++VTEQD+ +T+  PF+ CV +G  S +MCSYNRVNG
Sbjct: 211 ACCKHFTAYDLENWKGVTRFAFDAKVTEQDLADTYNPPFKSCVEDGGASGIMCSYNRVNG 270

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           +PTCAD  LL++T RGDW+F+GYI SDCD++  I +   +     EDAVA VLKAG+D++
Sbjct: 271 VPTCADHNLLSKTARGDWSFNGYITSDCDAVAIIHDVQGYAK-APEDAVADVLKAGMDVN 329

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYK---NLGKNNICNP 365
           CG Y     + A QQGKI   DID +LR L+ + MRLG FDG+P+Y    N+G + +C+ 
Sbjct: 330 CGGYIQTHGVSAYQQGKITGEDIDRALRNLFAIRMRLGLFDGNPKYNRYGNIGADQVCSK 389

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H +LA +AAR GIVLLKND  ALPL+   + +LA++GP+ N    ++GNY G PC   +
Sbjct: 390 EHQDLALQAARDGIVLLKNDGAALPLSKSKVSSLAVIGPNGNNASLLLGNYFGPPCISVT 449

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+     Y K   +  GC   VC N S I  A+ AA +AD  V+  GLD + E E  DR+
Sbjct: 450 PLQALQGYVKDARFVQGCNAAVC-NVSNIGEAVHAAGSADYVVLFMGLDQNQEREEVDRL 508

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           +L LPG Q  L+N VADAAK PV LV++  G VD+ FAKNNPKI +I+W GYPG+ GG A
Sbjct: 509 ELGLPGMQESLVNSVADAAKKPVILVLLCGGPVDVTFAKNNPKIGAIVWAGYPGQAGGIA 568

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
           IA V+FG +NPGGRLP+TWY   +  +P T M +R  P   +PGRTY+F+ G  VY FGY
Sbjct: 569 IAQVLFGDHNPGGRLPVTWYPKEFTAVPMTDMRMRADPSTGYPGRTYRFYKGKTVYNFGY 628

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK------CKD 657
           GLSY+++ ++ AS       K  K      I     T +   A  +  DV+      C  
Sbjct: 629 GLSYSKYSHRFAS-------KGTKPPSMSGIEGLKATARASAAGTVSYDVEEMGAEACDR 681

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
            +F   + V+N G MDG  +V+++ + P    G    Q+IG++ V + A ++A V F ++
Sbjct: 682 LRFPAVVRVQNHGPMDGGHLVLLFLRWPNATDGRPASQLIGFQSVHLRADEAAHVEFEVS 741

Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
            CK L         ++  G+H + VG+    +SF
Sbjct: 742 PCKHLSRAAEDGRKVIDQGSHFVRVGDDEFELSF 775


>gi|225469218|ref|XP_002264031.1| PREDICTED: probable beta-D-xylosidase 6-like [Vitis vinifera]
          Length = 789

 Score =  697 bits (1798), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 350/769 (45%), Positives = 478/769 (62%), Gaps = 49/769 (6%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           SD+P+C+  LP   RA+ LV  +TL EK+QQ+ D A  +PRL +P YEWWSE+LHG++  
Sbjct: 38  SDYPFCNTSLPISTRAQSLVSLLTLSEKIQQLSDEAAAIPRLYIPAYEWWSESLHGIATN 97

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
           G      PG  F+  V  ATSFP V+LT ASFN SLW  IG  ++ EARAMYN+G AGLT
Sbjct: 98  G------PGVSFNGTVSAATSFPQVLLTAASFNRSLWFSIGSAIAVEARAMYNVGQAGLT 151

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ--------DVEGVEYHR--- 178
           FW+PNIN+ RDPRWGR  ETPGEDP V   YA+ +VRG Q        ++ G    +   
Sbjct: 152 FWAPNINIFRDPRWGRGQETPGEDPMVASAYAVEFVRGFQGGNWKGGDEIRGAVGKKRVL 211

Query: 179 --DSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDV 236
             DSD   L +SACCKH  AYDL+ W    R+ FD+ V+ QD+++T+  PF  CV +G  
Sbjct: 212 RGDSDGDGLMLSACCKHLTAYDLEKWGNFSRYSFDAVVSNQDLEDTYQPPFRSCVQQGKA 271

Query: 237 SSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDA 296
           S +MCSYNRVNG+P CA   L  Q  + +W F GYI SDCD++ T+ E   + N + EDA
Sbjct: 272 SCLMCSYNRVNGVPACARQDLF-QKAKTEWGFKGYITSDCDAVATVYEYQHYAN-SPEDA 329

Query: 297 VARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--- 353
           VA VLKAG D++CG Y    T  A+ QGK+ E DID +L  L+ V MRLG FDG P    
Sbjct: 330 VADVLKAGTDINCGSYMLRHTQSAIDQGKVKEEDIDRALFNLFSVQMRLGLFDGDPANGL 389

Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
           Y NLG  ++C  +H  LA EAARQGIVLLKND   LPL+   I +LA++GP A+    + 
Sbjct: 390 YGNLGPKDVCTKEHRTLALEAARQGIVLLKNDKKFLPLDKSRISSLAIIGPQAD-QPFLG 448

Query: 414 GNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL 473
           G Y G PC+  S ++G   Y +  ++A GC D+ C +++    A+  A+ AD  V+VAGL
Sbjct: 449 GGYTGIPCKPESLVEGLKTYVEKTSFAAGCVDVPCLSDTGFDEAVSIARKADIVVVVAGL 508

Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
           DLS E E  DRV LLLPG Q  LI+ VA A + P+ LV+   G +D++FA+ +P+I SIL
Sbjct: 509 DLSQETEDHDRVSLLLPGKQMALISSVASAIQKPLVLVLTGGGPLDVSFAEQDPRIASIL 568

Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYK 591
           W+GYPGE G +A+A++IFG +NPGGRLP+TWY  ++ ++P   M +R  P   +PGRTY+
Sbjct: 569 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTRVPMNDMNMRADPYRGYPGRTYR 628

Query: 592 FFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKD---------QQCRDINYTVGTNK 642
           F+ G  VY FG GLSYT+F Y+  S+P  +++    D         Q+  ++NY      
Sbjct: 629 FYIGHRVYGFGQGLSYTKFAYQFVSAPNKLNLLRSSDTVSSKNLPRQRREEVNY------ 682

Query: 643 PPCAAVLIDDV-KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYER 700
                  I+++  C   +F  +I V N+G MDGS VVM++S+ P I  GT  KQ+IG+ R
Sbjct: 683 -----FHIEELDTCDSLRFHVEISVTNVGDMDGSHVVMLFSRVPKIVKGTPEKQLIGFSR 737

Query: 701 VFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVS 749
           V   + +S +    ++ C+   I +     ++  G HTI++G+ V  VS
Sbjct: 738 VHTVSRRSTETSIMVDPCEHFSIANEQGKRIMPLGDHTIMLGDVVHSVS 786


>gi|242062502|ref|XP_002452540.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
 gi|241932371|gb|EES05516.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
          Length = 784

 Score =  696 bits (1795), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 350/755 (46%), Positives = 481/755 (63%), Gaps = 36/755 (4%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
           + P+CD  LP   R  DLV R+T+ EK+ Q+GD +  +PRLG+P Y+WWSEALHGV+  G
Sbjct: 49  NIPFCDTALPIDRRVDDLVSRLTVAEKISQLGDESPAIPRLGVPAYKWWSEALHGVANAG 108

Query: 71  RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLT 129
           R      G H D  +  ATSFP VILT ASFN  LW +IGQ +  EARA+YN G A GLT
Sbjct: 109 R------GIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGVEARAVYNNGQAEGLT 162

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
           FW+PNINV RDPRWGR  ETPGEDP + G+YA  +VRG+Q   G       +S  L+ SA
Sbjct: 163 FWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQ---GYGVAGPVNSTDLEASA 219

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKH+ AYDL+NW+G  R+ +D++VT QD+++T+  PF+ CV +G  S +MCSYNRVNG+
Sbjct: 220 CCKHFTAYDLENWKGITRYVYDAKVTAQDLEDTYNPPFKSCVEDGHASGIMCSYNRVNGV 279

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           PTCAD  LL++T R  W F+GYI SDCD++  I ++  +   T EDAVA VLKAG+D++C
Sbjct: 280 PTCADYNLLSKTARQSWGFYGYITSDCDAVSIIHDAQGYAK-TSEDAVADVLKAGMDVNC 338

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
           G Y   +   A+QQGKI E DI+ +L  L+ V MRLG F+G P+   Y N+G + +C  +
Sbjct: 339 GGYVQKYGASALQQGKITEQDINRALHNLFTVRMRLGLFNGDPRRNRYGNIGPDQVCTQE 398

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
           H +LA EAA+ GIVLLKND GALPL+   + +LA++G +AN   +++GNY G PC   +P
Sbjct: 399 HQDLALEAAQDGIVLLKNDGGALPLSKSGVASLAVIGFNANNATSLLGNYFGPPCVTVTP 458

Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
           +     Y K  ++  GC    C N + IP A+ AA +AD+ V+  GLD + E E  DR+D
Sbjct: 459 LQVLQGYVKDTSFVAGCNSAAC-NVTTIPEAVQAASSADSVVLFMGLDQNQEREEVDRLD 517

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           L LPG Q  LI  VA+AAK PV LV++  G VD++FAK NPKI +ILW GYPGE GG AI
Sbjct: 518 LTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPGEAGGIAI 577

Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
           A V+FG++NPGGRLP+TWY  ++ K+P T M +R  P   +PGRTY+F+ GP V+ FGYG
Sbjct: 578 AQVLFGEHNPGGRLPVTWYPQDFTKVPMTDMRMRADPATGYPGRTYRFYRGPTVFNFGYG 637

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK------CKDY 658
           LSY+++ ++  + P      +      + +  T G        V   DV+      C   
Sbjct: 638 LSYSKYSHRFVTKPPP---SMSNVAGLKALATTAG-------GVATYDVEAIGSETCDRL 687

Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPGI---AGTHIKQVIGYERVFIAAGQSAKVGFTM 715
           KF   + V+N G MDG   V+V+ + P     +G   +Q+IG++ + + A Q+A V F +
Sbjct: 688 KFPAVVRVQNHGPMDGKHPVLVFLRWPNATDGSGRPARQLIGFQSLHLRATQTAHVEFEV 747

Query: 716 NACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
           + CK           ++  G+H ++VG+    +SF
Sbjct: 748 SPCKHFSRATEDGRKVIDQGSHFVMVGDDEFEMSF 782


>gi|242071935|ref|XP_002451244.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
 gi|241937087|gb|EES10232.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
          Length = 790

 Score =  693 bits (1788), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/759 (46%), Positives = 470/759 (61%), Gaps = 50/759 (6%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           +  P+C   LP   RA+DLV R+T  EKV+ + + A GV RLG+  YEWWSEALHGVS  
Sbjct: 43  TTLPFCRQSLPLHARARDLVSRLTRAEKVRLLVNNAAGVARLGVGGYEWWSEALHGVSDT 102

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
           G      PG  F    PGAT+FP VI   A+ N +LW+ IG+ VS EARAMYN G AGLT
Sbjct: 103 G------PGVKFGGAFPGATAFPQVIGAAAALNATLWELIGRAVSDEARAMYNGGRAGLT 156

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
           FWSPN+N+ RDPRWGR  ETPGEDP +  RYA  YVRGLQ        +  D   LK++A
Sbjct: 157 FWSPNVNIFRDPRWGRGQETPGEDPAISSRYAAAYVRGLQ--------QPYDHNRLKLAA 208

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKH+ AYDLD+W G DRFHF++ V+ QD+++TF +PF  CV  G  +SVMCSYN+VNG+
Sbjct: 209 CCKHFTAYDLDSWGGTDRFHFNAVVSPQDLEDTFNVPFRACVAGGRAASVMCSYNQVNGV 268

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           PTCAD   L  TIR  W   GYIVSDCDS+        +   T EDAVA  L+AGLDLDC
Sbjct: 269 PTCADQGFLRGTIRKAWGLDGYIVSDCDSVDVFFRDQHYTR-TAEDAVAATLRAGLDLDC 327

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
           G +   +T  AV + K+++AD+D +L     V MRLG FDG P    + +LG  ++C   
Sbjct: 328 GPFLALYTENAVARKKVSDADVDAALLNTVTVQMRLGMFDGDPASGPFGHLGAADVCTKA 387

Query: 367 HIELAAEAARQGIVLLKNDNG-------ALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
           H +LA +AARQ +VLLKN  G        LPL     + +A+VGPHA+AT AMIGNY G 
Sbjct: 388 HQDLALDAARQSVVLLKNQRGRKHRDRDVLPLRPAAHRVVAVVGPHADATVAMIGNYAGK 447

Query: 420 PCRYTSPMDGFYAYSKVINYAPGCADIVCQ-NNSMIPAAIDAAKNADATVIVAGLDLSVE 478
           PCRYT+P+ G  AY+  + +  GCAD+ CQ  N  I AA+DAA+         GL  S  
Sbjct: 448 PCRYTTPLQGVAAYAARVVHQAGCADVACQGKNQPIAAAVDAARRLTPPSSSPGLTRS-- 505

Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
                   LLLPG Q ELI+ VA AAKGPV LV+MS G +DI FA+N+P+I  ILWVGYP
Sbjct: 506 --------LLLPGRQAELISAVAKAAKGPVILVLMSGGPIDIAFAQNDPRIDGILWVGYP 557

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDG 595
           G+ GG+AIADVIFG++NPGG+LP+TWY  +Y+ K+P T+M +R  P   +PGRTY+F+ G
Sbjct: 558 GQAGGQAIADVIFGQHNPGGKLPVTWYPQDYLEKVPMTNMAMRANPARGYPGRTYRFYTG 617

Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG---TNKPPCAAVLIDD 652
           P ++ FG+GLSYTQF + +A +P  + ++L         + +         P  AV +  
Sbjct: 618 PTIHAFGHGLSYTQFTHTLAHAPAQLTVRLSTSSASASASASAASLLNATRPSRAVRVAH 677

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVY------SKPPGIAGTH--IKQVIGYERVFIA 704
            +C+       ++V N+G  DG+  V+VY      S     AGT    +Q++ +E+V + 
Sbjct: 678 ARCEGLTVPVHVDVRNVGDRDGAHAVLVYHVAPSSSSSSAPAGTDAPARQLVAFEKVHVP 737

Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
           AG  A+V   ++ C  L + D      +  G H +++GE
Sbjct: 738 AGGVARVEMGIDVCDRLSVADRDGVRRIPVGEHRLMIGE 776


>gi|212275712|ref|NP_001130324.1| uncharacterized protein LOC100191418 precursor [Zea mays]
 gi|194688848|gb|ACF78508.1| unknown [Zea mays]
 gi|413938927|gb|AFW73478.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 780

 Score =  693 bits (1788), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 353/751 (47%), Positives = 475/751 (63%), Gaps = 27/751 (3%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
           + P+CDA LP   R  DLV RMT+ EK+ Q+GD +  +PRLG+P Y+WWSEALHG+S  G
Sbjct: 44  NIPFCDAGLPIDRRVDDLVSRMTVAEKISQLGDQSPAIPRLGVPAYKWWSEALHGISNQG 103

Query: 71  RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLT 129
           R      G H D  +  ATSFP VILT ASFN  LW +IGQ +  EARA+YN G A GLT
Sbjct: 104 R------GIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGVEARAVYNNGQAEGLT 157

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
           FW+PNINV RDPRWGR  ETPGEDP + G+YA  +VRG+Q   G       +S  L+ SA
Sbjct: 158 FWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQ---GYGLAGPVNSTGLEASA 214

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKH+ AYDL+NW+G  R+ FD++VT QD+ +T+  PF+ CV +G  S +MCSYNRVNG+
Sbjct: 215 CCKHFTAYDLENWKGVTRYVFDAKVTAQDLADTYNPPFKSCVEDGHASGIMCSYNRVNGV 274

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           PTCAD  LL+ T R DW F+GYI SDCD++  I ++  +   T EDAVA VLKAG+D++C
Sbjct: 275 PTCADYNLLSTTARQDWGFYGYITSDCDAVAIIHDAQGYAK-TAEDAVADVLKAGMDVNC 333

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
           G Y  +    A+QQGKI E DI+ +L  L+ V MRLG F+G P+   Y ++G + +C  +
Sbjct: 334 GSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGDIGPDQVCTQE 393

Query: 367 HIELAAEAARQGIVLLKNDNGA--LPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
           H +LA EAA+ GIVLLKND GA  LPL+  N+ +LA++G +AN    + GNY G PC   
Sbjct: 394 HQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRGNYFGPPCVTV 453

Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           +P+     Y K  ++  GC    C N + IP A+ AA +AD+ V+  GLD   E E  DR
Sbjct: 454 TPLQVLQGYVKDTSFVAGCNSAAC-NVTTIPEAVQAASSADSVVLFMGLDQDQEREEVDR 512

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
           +DL LPG Q  LI  VA+AAK PV LV++  G VD++FAK NPKI +ILW GYPGE GG 
Sbjct: 513 LDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPGEAGGI 572

Query: 545 AIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
           AIA V+FG++NPGGRLP+TWY  ++ ++P T M +R  P   +PGRTY+F+ GP V+ FG
Sbjct: 573 AIAQVLFGEHNPGGRLPVTWYPQDFTRVPMTDMRMRADPATGYPGRTYRFYRGPTVFNFG 632

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
           YGLSY+++ ++ A+ P             + +  T G          I    C   KF  
Sbjct: 633 YGLSYSKYSHRFATKPPPT----SNVAGLKAVEATAG-GMASYDVEAIGSETCDRLKFPA 687

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGI---AGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
            + V+N G MDG   V+V+ + P     +G    Q+IG++ + + A Q+A V F ++ CK
Sbjct: 688 VVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSLHLRATQTAHVEFEVSPCK 747

Query: 720 SLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
                      ++  G+H ++VGE    +SF
Sbjct: 748 HFSRATEDGRKVIDQGSHFVMVGEDEFEMSF 778


>gi|449451581|ref|XP_004143540.1| PREDICTED: probable beta-D-xylosidase 6-like [Cucumis sativus]
          Length = 777

 Score =  692 bits (1786), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 337/749 (44%), Positives = 482/749 (64%), Gaps = 28/749 (3%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +P+C+  L +  RA+ LV  +TL EK+QQ+ + A  +PRLG+P Y+WWSE LHG++  G 
Sbjct: 30  YPFCNRSLSFTARAQSLVSLLTLDEKIQQLSNNASSIPRLGIPSYQWWSEGLHGIATNG- 88

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
                PG  F+  +  ATSFP V++T ASFN +LW  IG  ++ EARAM+N+G  GLT W
Sbjct: 89  -----PGVSFNGSITSATSFPQVLVTAASFNRTLWFLIGSAIAVEARAMFNVGQCGLTIW 143

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHR--------DSDSR 183
           +PNIN+ RDPRWGR  ETPGEDP V   Y+I +VRGLQ    ++ H         D+   
Sbjct: 144 APNINIFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQSGNWMKEHEIRNEVLEEDNGMG 203

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            L +SACCKH+ AYDL+ W    R+ FDS VTEQD+ +T+  PF  C+ +G  S +MCSY
Sbjct: 204 SLMVSACCKHFTAYDLEKWNNFTRYTFDSVVTEQDLGDTYQPPFRSCIQQGKASCLMCSY 263

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N VNG+P CA+P LL +  R DW   GYI SDCD++ T+ E  K+  DT EDA+A VLKA
Sbjct: 264 NAVNGVPACANPDLLKKA-RNDWGLKGYITSDCDAVATVYEYQKY-TDTPEDAIADVLKA 321

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKN 360
           G+D++CG +    T  A+ QGK+ E ++D++L  L+ V  RLG+FDG+P   ++  LG  
Sbjct: 322 GMDINCGTFMLRGTKSAIDQGKVREEELDSALINLFSVQARLGFFDGNPREGKFGELGAQ 381

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
           ++C  QH  LA EAARQGIVLLKN+N  LPL+   I +L ++G  AN +  ++G Y G P
Sbjct: 382 DVCTAQHKTLALEAARQGIVLLKNENKFLPLDKNAISSLTVIGSLANDSSKLLGGYAGVP 441

Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
           C   S ++GF  Y++ I +A GC D+ C +++    AI  AK AD  + VAGLD S E E
Sbjct: 442 CSPMSLVEGFQEYAETIFFASGCLDVPCASDNRFEDAILIAKKADFVIAVAGLDASQETE 501

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
             DRV LLLPG Q +L++ VA  +K P+ LV++  G +DI+FAK + ++ SILW+G PGE
Sbjct: 502 DLDRVSLLLPGKQMDLVSSVASVSKKPIILVLIGGGPLDISFAKKDSRVASILWIGNPGE 561

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVV 598
            GG+A+A+VIFG YNPGGRLP+TWY  ++  +P   M +R  P   +PGRTY+F+ G  +
Sbjct: 562 AGGKALAEVIFGDYNPGGRLPVTWYPQSFTNVPMNDMHMRPNPSRGYPGRTYRFYTGDRI 621

Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV--GTNKPPCAAVLIDDVK-C 655
           Y FG GLSYT FKY++ S+PK V++    +   R I   V  G N    + + +++V+ C
Sbjct: 622 YGFGEGLSYTSFKYRLLSAPKKVNLLGKAETSRRRIIPQVRDGVNM---SYMEVEEVESC 678

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
              +F  ++ V N+G+ DGS VVM++S+ P  + GT  +Q+IG++R+++   QSA+    
Sbjct: 679 DLLRFEVKLSVSNIGEFDGSHVVMMFSEFPKVLTGTPQRQLIGFDRLYVKRNQSAESSIM 738

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVGE 743
           ++ C  + + D     ++  G HTI +G+
Sbjct: 739 VDPCNHVSLADEYGKRVIPLGDHTISLGD 767


>gi|15238197|ref|NP_196618.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
 gi|75264319|sp|Q9LXA8.1|BXL6_ARATH RecName: Full=Probable beta-D-xylosidase 6; Short=AtBXL6; Flags:
           Precursor
 gi|7671447|emb|CAB89387.1| beta-xylosidase-like protein [Arabidopsis thaliana]
 gi|15982753|gb|AAL09717.1| AT5g10560/F12B17_90 [Arabidopsis thaliana]
 gi|332004180|gb|AED91563.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
          Length = 792

 Score =  692 bits (1786), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 340/757 (44%), Positives = 481/757 (63%), Gaps = 32/757 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
            S +P+C+  L   +RA  LV  + LPEK+ Q+ + A  VPRLG+P YEWWSE+LHG++ 
Sbjct: 37  FSSYPFCNVSLSIKQRAISLVSLLMLPEKIGQLSNTAASVPRLGIPPYEWWSESLHGLA- 95

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
                ++ PG  F+  +  ATSFP VI++ ASFN +LW +IG  V+ E RAMYN G AGL
Sbjct: 96  -----DNGPGVSFNGSISAATSFPQVIVSAASFNRTLWYEIGSAVAVEGRAMYNGGQAGL 150

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR----- 183
           TFW+PNINV RDPRWGR  ETPGEDP VV  Y + +VRG Q+ +  +  +   S      
Sbjct: 151 TFWAPNINVFRDPRWGRGQETPGEDPKVVSEYGVEFVRGFQEKKKRKVLKRRFSDDVDDD 210

Query: 184 --------PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
                    L +SACCKH+ AYDL+ W    R+ F++ VTEQDM++T+  PFE C+ +G 
Sbjct: 211 RHDDDADGKLMLSACCKHFTAYDLEKWGNFTRYDFNAVVTEQDMEDTYQPPFETCIRDGK 270

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKED 295
            S +MCSYN VNG+P CA   LL Q  R +W F GYI SDCD++ TI  +++    + E+
Sbjct: 271 ASCLMCSYNAVNGVPACAQGDLL-QKARVEWGFEGYITSDCDAVATIF-AYQGYTKSPEE 328

Query: 296 AVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--- 352
           AVA  +KAG+D++CG Y    T  A++QGK++E  +D +L  L+ V +RLG FDG P   
Sbjct: 329 AVADAIKAGVDINCGTYMLRHTQSAIEQGKVSEELVDRALLNLFAVQLRLGLFDGDPRRG 388

Query: 353 QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAM 412
           QY  LG N+IC+  H +LA EA RQGIVLLKND+  LPLN  ++ +LA+VGP AN    M
Sbjct: 389 QYGKLGSNDICSSDHRKLALEATRQGIVLLKNDHKLLPLNKNHVSSLAIVGPMANNISNM 448

Query: 413 IGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG 472
            G Y G PC+  +       Y K  +YA GC+D+ C +++    A+  AK AD  ++VAG
Sbjct: 449 GGTYTGKPCQRKTLFTELLEYVKKTSYASGCSDVSCDSDTGFGEAVAIAKGADFVIVVAG 508

Query: 473 LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
           LDLS E E KDRV L LPG Q +L++ VA  +K PV LV+   G VD+ FAKN+P+I SI
Sbjct: 509 LDLSQETEDKDRVSLSLPGKQKDLVSHVAAVSKKPVILVLTGGGPVDVTFAKNDPRIGSI 568

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNN--FPGRTY 590
           +W+GYPGE GG+A+A++IFG +NPGGRLP TWY  ++  +  + M +R  ++  +PGRTY
Sbjct: 569 IWIGYPGETGGQALAEIIFGDFNPGGRLPTTWYPESFTDVAMSDMHMRANSSRGYPGRTY 628

Query: 591 KFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
           +F+ GP VY FG GLSYT+F+YK+ S+P  + +     QQ          +      + +
Sbjct: 629 RFYTGPQVYSFGTGLSYTKFEYKILSAPIRLSLSELLPQQSSHKKQL--QHGEELRYLQL 686

Query: 651 DDV---KCKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAG 706
           DDV    C+  +F  ++ V N G++DGS VVM++SK PP ++G   KQ+IGY+RV + + 
Sbjct: 687 DDVIVNSCESLRFNVRVHVSNTGEIDGSHVVMLFSKMPPVLSGVPEKQLIGYDRVHVRSN 746

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
           +  +  F ++ CK L + ++    ++  G+H + +G+
Sbjct: 747 EMMETVFVIDPCKQLSVANDVGKRVIPLGSHVLFLGD 783


>gi|449496501|ref|XP_004160150.1| PREDICTED: probable beta-D-xylosidase 6-like, partial [Cucumis
           sativus]
          Length = 767

 Score =  692 bits (1785), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 337/749 (44%), Positives = 482/749 (64%), Gaps = 28/749 (3%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +P+C+  L +  RA+ LV  +TL EK+QQ+ + A  +PRLG+P Y+WWSE LHG++  G 
Sbjct: 20  YPFCNRSLSFTARAQSLVSLLTLDEKIQQLSNNASSIPRLGIPSYQWWSEGLHGIATNG- 78

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
                PG  F+  +  ATSFP V++T ASFN +LW  IG  ++ EARAM+N+G  GLT W
Sbjct: 79  -----PGVSFNGSITSATSFPQVLVTAASFNRTLWFLIGSAIAVEARAMFNVGQCGLTIW 133

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHR--------DSDSR 183
           +PNIN+ RDPRWGR  ETPGEDP V   Y+I +VRGLQ    ++ H         D+   
Sbjct: 134 APNINIFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQSGNWMKEHEIRNEVLEEDNGMG 193

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            L +SACCKH+ AYDL+ W    R+ FDS VTEQD+ +T+  PF  C+ +G  S +MCSY
Sbjct: 194 SLMVSACCKHFTAYDLEKWNNFTRYTFDSVVTEQDLGDTYQPPFRSCIQQGKASCLMCSY 253

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N VNG+P CA+P LL +  R DW   GYI SDCD++ T+ E  K+  DT EDA+A VLKA
Sbjct: 254 NAVNGVPACANPDLLKKA-RNDWGLKGYITSDCDAVATVYEYQKY-TDTPEDAIADVLKA 311

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKN 360
           G+D++CG +    T  A+ QGK+ E ++D++L  L+ V  RLG+FDG+P   ++  LG  
Sbjct: 312 GMDINCGTFMLRGTKSAIDQGKVREEELDSALINLFSVQARLGFFDGNPREGKFGELGAQ 371

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
           ++C  QH  LA EAARQGIVLLKN+N  LPL+   I +L ++G  AN +  ++G Y G P
Sbjct: 372 DVCTAQHKTLALEAARQGIVLLKNENKFLPLDKNAISSLTVIGSLANDSSKLLGGYAGVP 431

Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
           C   S ++GF  Y++ I +A GC D+ C +++    AI  AK AD  + VAGLD S E E
Sbjct: 432 CSPMSLVEGFQEYAETIFFASGCLDVPCASDNRFEDAILIAKKADFVIAVAGLDASQETE 491

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
             DRV LLLPG Q +L++ VA  +K P+ LV++  G +DI+FAK + ++ SILW+G PGE
Sbjct: 492 DLDRVSLLLPGKQMDLVSSVASVSKKPIILVLIGGGPLDISFAKKDSRVASILWIGNPGE 551

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVV 598
            GG+A+A+VIFG YNPGGRLP+TWY  ++  +P   M +R  P   +PGRTY+F+ G  +
Sbjct: 552 AGGKALAEVIFGDYNPGGRLPVTWYPQSFTNVPMNDMHMRPNPSRGYPGRTYRFYTGDRI 611

Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV--GTNKPPCAAVLIDDVK-C 655
           Y FG GLSYT FKY++ S+PK V++    +   R I   V  G N    + + +++V+ C
Sbjct: 612 YGFGEGLSYTSFKYRLLSAPKKVNLLGKAETSRRRIIPQVRDGVNM---SYMEVEEVESC 668

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
              +F  ++ V N+G+ DGS VVM++S+ P  + GT  +Q+IG++R+++   QSA+    
Sbjct: 669 DLLRFEVKLSVSNIGEFDGSHVVMMFSEFPKVLTGTPQRQLIGFDRLYVKRNQSAESSIM 728

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVGE 743
           ++ C  + + D     ++  G HTI +G+
Sbjct: 729 VDPCNHVSLADEYGKRVIPLGDHTISLGD 757


>gi|384872601|gb|AFI25186.1| putative beta-D-xylosidase [Nicotiana tabacum]
          Length = 791

 Score =  691 bits (1783), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 336/752 (44%), Positives = 469/752 (62%), Gaps = 34/752 (4%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           + +C+  LP   R + L+  +T+ EK+  + D    +PRLGLP YEWWSE+LHG++  G 
Sbjct: 41  YTFCNKNLPISTRVQSLISLLTIDEKILHLSDNTTSIPRLGLPAYEWWSESLHGIATNG- 99

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
                P  +F+ ++ G TSFP VILT A+FN +LW  I   ++ EARAMYNLG AGLTFW
Sbjct: 100 -----PAVNFNGQIKGVTSFPQVILTAAAFNRTLWHSIATAIAVEARAMYNLGQAGLTFW 154

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDV---------------EGVEY 176
           +PNIN++RDPRWGR  ETPGEDP VV  YAI YV G Q +                 V  
Sbjct: 155 APNINILRDPRWGRGQETPGEDPMVVSAYAIEYVTGFQGLNPKAKKGNRNGYGKKRRVLK 214

Query: 177 HRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDV 236
             D+D   L +SACCKH+ AYDL+ W    R+ F++ VT+QDM++TF  PF  C+ +G  
Sbjct: 215 EDDNDGERLMLSACCKHFTAYDLEKWGDATRYDFNAVVTKQDMEDTFQAPFRSCIQQGKA 274

Query: 237 SSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDA 296
           S +MCSYN VNG+P CAD +LL++ +R DW F GYI SDCD++ TI E+ K+   T EDA
Sbjct: 275 SCLMCSYNSVNGVPACADKELLDK-VRTDWGFDGYITSDCDAVATIYENQKY-TKTPEDA 332

Query: 297 VARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---Q 353
           VA  LKAG +++CG Y       A QQG + E D+D +L++L+ V  RLG FDG+P   Q
Sbjct: 333 VAVALKAGTNINCGTYMLRHMKSAFQQGSVLEEDLDRALQYLFSVQFRLGLFDGNPADGQ 392

Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
           + N G  ++C   H+ LA +AARQGIVLLKND   LPL+  ++ TLA+VGP AN +    
Sbjct: 393 FANFGAQDVCTSNHLNLALDAARQGIVLLKNDQKFLPLDKTSVSTLAIVGPMANVSSPG- 451

Query: 414 GNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL 473
           G Y G PC+  S  +GF+ +     YA GC D+ C + +    AI   K AD  ++VAG 
Sbjct: 452 GTYSGVPCKLKSIREGFHRHINRTLYAAGCLDVGCNSTAGFQDAISIVKEADYVIVVAGS 511

Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
           DLS E E  DR  LLLPG QT L+  +A A+K P+ LV+   G VD++FA+ +P+I SIL
Sbjct: 512 DLSEETEDHDRYSLLLPGQQTNLVTTLAAASKKPIILVLTGGGPVDVSFAEKDPRIASIL 571

Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYK 591
           WV YPGE GG+A++++IFG  NPGG+LP+TWY  ++ K+P T M +R  P N +PGRTY+
Sbjct: 572 WVAYPGETGGKALSEIIFGYQNPGGKLPMTWYLESFTKVPMTDMNMRADPSNGYPGRTYR 631

Query: 592 FFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID 651
           F+ G V+Y FG+GLSYT F  ++ S+P  + + L K  + R I   +   +     + +D
Sbjct: 632 FYTGDVLYGFGHGLSYTSFSSQLLSAPSRLSLSLAKSNRKRSI---LAKGRSRLGYIHVD 688

Query: 652 DVK-CKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSA 709
           +V+ C   KF   I V N G MDGS V+M++S+      G   KQ++G++RV + A +  
Sbjct: 689 EVESCHSSKFFVHISVTNDGDMDGSHVLMLFSRVLQNFQGAPQKQLVGFDRVHVPARKYV 748

Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
           +    ++ C+     ++  N +LA G HT ++
Sbjct: 749 ETSLLVDPCELFSFANDQGNRILALGEHTFIL 780


>gi|371917286|dbj|BAL44719.1| SlArf/Xyl4 [Solanum lycopersicum]
          Length = 775

 Score =  690 bits (1780), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 343/742 (46%), Positives = 478/742 (64%), Gaps = 30/742 (4%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +C   LP   R  DLV R+TL EK+ Q+ + A  +PRLG+P YEWWSE+LHGV   G+  
Sbjct: 43  FCQTGLPISVRVLDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSESLHGVGSAGK-- 100

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTFWS 132
               G  F+  + GATSFP VILT A+F+E+LW +IGQ +  EAR +YN G A G+TFW+
Sbjct: 101 ----GIFFNGSIAGATSFPQVILTAATFDENLWYRIGQVIGVEARGVYNAGQAIGMTFWA 156

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ--DVEGVEYHRDSDSRPLKISAC 190
           PNIN+ RDPRWGR  ETPGEDP + G+YAI YVRG+Q     G +  +      L+ SAC
Sbjct: 157 PNINIFRDPRWGRGQETPGEDPIMTGKYAIRYVRGVQGDSFNGGQLKKGH----LQASAC 212

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH+ AYDLD W+  DRF F++ VT QDM +TF  PF+ C+ +   S +MCSYN VNGIP
Sbjct: 213 CKHFTAYDLDQWKNLDRFSFNAIVTPQDMADTFQPPFQDCIQKAQASGIMCSYNSVNGIP 272

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
           +CA+  LL +T R  W FHGYI SDCD++Q + ++H++ N T ED+ A  LKAG+D+DCG
Sbjct: 273 SCANYNLLTKTARQQWGFHGYITSDCDAVQVMHDNHRYGN-TPEDSTAFALKAGMDIDCG 331

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
           DY   +T  AV + K+++  ID +L  L+ + MRLG F+G P+   Y N+  + +C PQH
Sbjct: 332 DYLKKYTKSAVMKKKVSQVHIDRALHNLFSIRMRLGLFNGDPRKQLYGNISPSQVCAPQH 391

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
            +LA EAAR GIVLLKN    LPL+     +LA++G +AN    + GNY+G PC+Y   +
Sbjct: 392 QQLALEAARNGIVLLKNTGKLLPLSKAKTNSLAVIGHNANNAYILRGNYDGPPCKYIEIL 451

Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
                Y+K + Y  GC    C  ++ I  A++ A+NAD  V++ GLD + E E  DR DL
Sbjct: 452 KALVGYAKSVQYQQGCNAANC-TSANIDQAVNIARNADYVVLIMGLDQTQEREQFDRDDL 510

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
           +LPG Q  LIN VA AAK PV LVI+S G VDI+FAK NPKI SILW GYPGE GG A+A
Sbjct: 511 VLPGQQENLINSVAKAAKKPVILVILSGGPVDISFAKYNPKIGSILWAGYPGEAGGIALA 570

Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGL 605
           ++IFG++NPGG+LP+TWY   +VKIP T M +R  P   +PGRTY+F+ GP VY FGYGL
Sbjct: 571 EIIFGEHNPGGKLPVTWYPQAFVKIPMTDMRMRPDPKTGYPGRTYRFYKGPKVYEFGYGL 630

Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYKFTF 662
           SYT + Y   S+  +  I+L++    + +      N        +D++    C+  KF+ 
Sbjct: 631 SYTTYSYGFHSATPNT-IQLNQLLSVKTVE-----NSDSIRYTFVDEIGSDNCEKAKFSA 684

Query: 663 QIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
            + VEN G+MDG   V+++ K      G+ IKQ++G++ V + AG+++++ F ++ C+ L
Sbjct: 685 HVSVENSGEMDGKHPVLLFVKQDKARNGSPIKQLVGFQSVSLKAGENSQLVFEISPCEHL 744

Query: 722 KIVDNAANSLLASGAHTILVGE 743
              +     ++  G+  ++VG+
Sbjct: 745 SSANEDGLMMIEEGSRYLVVGD 766


>gi|297811163|ref|XP_002873465.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319302|gb|EFH49724.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 796

 Score =  689 bits (1778), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 340/759 (44%), Positives = 478/759 (62%), Gaps = 32/759 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
            S +P+C+  L   +RA  LV  +TLPEK+ Q+   A  VPRLG+P YEWWSE+LHG++ 
Sbjct: 37  FSSYPFCNVSLSIKQRAISLVSLLTLPEKIGQLSTTAASVPRLGIPPYEWWSESLHGLA- 95

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
                ++ PG  F+  +  ATSFP VI++ ASFN +LW +IG  V+ EARAMYN G AGL
Sbjct: 96  -----DNGPGVSFNGSISAATSFPQVIVSAASFNRTLWYEIGSAVAVEARAMYNGGQAGL 150

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQD-----VEGVEYHRDS--- 180
           TFW+PNIN+ RDPRWGR  ETPGEDP VV  Y + +VRG Q+     V    +  D+   
Sbjct: 151 TFWAPNINLFRDPRWGRGQETPGEDPKVVSEYGVEFVRGFQEKKKRKVLKTRFGSDNVDD 210

Query: 181 -------DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNE 233
                      L +SACCKH+ AYDL+ W    R+ F++ VTEQDM++T+  PFE C+ +
Sbjct: 211 DARYDDDADGKLMLSACCKHFTAYDLEKWGNFTRYDFNAVVTEQDMEDTYQPPFETCIKD 270

Query: 234 GDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTK 293
           G  S +MCSYN VNG+P CA   LL Q  R +W F GYI SDCD++ TI E   +   + 
Sbjct: 271 GKASCLMCSYNAVNGVPACAQGDLL-QKARVEWGFDGYITSDCDAVATIFEYQGY-TKSP 328

Query: 294 EDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
           E+AVA  +KAG+D++CG Y    T  A++QGK++E  +D +L  L+ V +RLG FDG P+
Sbjct: 329 EEAVADAIKAGVDINCGTYMLRNTQSAIEQGKVSEELVDRALLNLFAVQLRLGLFDGDPR 388

Query: 354 ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATK 410
              Y  LG N+IC+  H +LA EAARQGIVLLKND   LPLN  ++ +LA+VGP AN   
Sbjct: 389 GGHYGKLGSNDICSSDHRKLALEAARQGIVLLKNDYKLLPLNKNHVSSLAIVGPMANNIS 448

Query: 411 AMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV 470
            M G Y G PC+  +       Y K  +YA GC+D+ C +++    A+  AK AD  ++V
Sbjct: 449 NMGGTYTGKPCQRKTLFTELLEYVKKTSYASGCSDVSCVSDTGFGEAVAIAKGADFVIVV 508

Query: 471 AGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK 530
           AGLDLS E E KDR  L LPG Q +L++ VA  +K PV LV+   G VD+ FAK +P+I 
Sbjct: 509 AGLDLSQETEDKDRFSLSLPGKQKDLVSSVAAVSKKPVILVLTGGGPVDVTFAKTDPRIG 568

Query: 531 SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNN--FPGR 588
           SI+W+GYPGE GG+A+A++IFG +NPGGRLPITWY  ++  +P + M +R  ++  +PGR
Sbjct: 569 SIIWIGYPGETGGQALAEIIFGDFNPGGRLPITWYPESFADVPMSDMHMRADSSRGYPGR 628

Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
           TY+F+ GP VY FG GLSYT+F YK+ S+P  + +     QQ       +   +     +
Sbjct: 629 TYRFYTGPQVYSFGTGLSYTKFDYKIISAPIRLSLSELLPQQSSHKKQLLQHGEEQLQYI 688

Query: 649 LIDDV---KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI-AGTHIKQVIGYERVFIA 704
            +DDV    C+  +F  ++ V N G++DGS V+M++SK   + +G   KQ+IG++RV I 
Sbjct: 689 QLDDVMVNSCESLRFNVRVNVRNTGEIDGSHVLMLFSKMARVLSGVPEKQLIGFDRVHIR 748

Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
           + +  +  F ++ CK L + ++    ++  G H + +G+
Sbjct: 749 SNEMMETVFVIDPCKYLSVANDVGKRVIPLGIHALFLGD 787


>gi|224066931|ref|XP_002302285.1| predicted protein [Populus trichocarpa]
 gi|222844011|gb|EEE81558.1| predicted protein [Populus trichocarpa]
          Length = 773

 Score =  687 bits (1774), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 342/740 (46%), Positives = 483/740 (65%), Gaps = 27/740 (3%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           FP+C+  LP  +RA+DLV R+TL EK+ Q+ + A  +PRLG+P YEWWSEALHGVS    
Sbjct: 40  FPFCETTLPISQRARDLVSRLTLDEKISQLVNSAPPIPRLGIPGYEWWSEALHGVS---- 95

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
             N+ PG HF+  + GATSFP VILT ASF+   W +IGQ +  EARA+YN G A G+TF
Sbjct: 96  --NAGPGIHFNDNIKGATSFPQVILTAASFDAYQWYRIGQAIGKEARALYNAGQATGMTF 153

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           W+PNIN+ RDPRWGR  ETPGEDP V G YA +YV+G+Q   G  +        L+ SAC
Sbjct: 154 WAPNINIFRDPRWGRGQETPGEDPLVTGLYAASYVKGVQ---GDSFEGGKIKGHLQASAC 210

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH+ AYDLDNW+G +RF FD+RVT QD+ +T+  PF+ CV +G  S +MC+YN+VNG+P
Sbjct: 211 CKHFTAYDLDNWKGMNRFVFDARVTMQDLADTYQPPFKSCVEQGRASGIMCAYNKVNGVP 270

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
           +CAD  LL++T R  W F GYI SDCD++ +I+   +    + EDAV  VLKAG+D++CG
Sbjct: 271 SCADSNLLSKTARAQWGFRGYITSDCDAV-SIIHDDQGYAKSPEDAVVDVLKAGMDVNCG 329

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
            Y       AV+Q K++E+DID +L  L+ V MRLG F+G P+   + N+G + +C+ +H
Sbjct: 330 SYLLKHAKVAVEQKKLSESDIDKALHNLFSVRMRLGLFNGRPEGQLFGNIGPDQVCSQEH 389

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
             LA EAAR GIVLLKN    LPL+    K+LA++GP+AN+ + ++GNY G PCR+ +P+
Sbjct: 390 QILALEAARNGIVLLKNSARLLPLSKSKTKSLAVIGPNANSGQMLLGNYAGPPCRFVTPL 449

Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
               +Y K   Y P C  + C + S +  A+D AK AD  V++ GLD + E E  DR DL
Sbjct: 450 QALQSYIKQTVYHPACDTVQCSSAS-VDRAVDVAKGADNVVLMMGLDQTQEREELDRTDL 508

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
           LLPG Q ELI  VA AAK PV LV+ S G VDI+FAKN+  I SILW GYPGE G  A+A
Sbjct: 509 LLPGKQQELIIAVAKAAKNPVVLVLFSGGPVDISFAKNDKNIGSILWAGYPGEGGAIALA 568

Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGYGL 605
           +++FG +NPGGRLP+TWY   +VK+P T M +RP   + +PGRTY+F+ G  V+ FGYG+
Sbjct: 569 EIVFGDHNPGGRLPMTWYPQEFVKVPMTDMGMRPEASSGYPGRTYRFYRGRSVFEFGYGI 628

Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK---CKDYKFTF 662
           SY+++ Y++ +  ++  + L++      IN     +     + LI ++    C+  K   
Sbjct: 629 SYSKYSYELTAVSQNT-LYLNQSSTMHIIN-----DFDSVRSTLISELGTEFCEQNKCRA 682

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
           +I V+N G+M G   V+++++      G   KQ+IG++ V + AG+ A++ F ++ C+ L
Sbjct: 683 RIGVKNHGEMAGKHPVLLFARQEKHGNGRPRKQLIGFQSVVLGAGERAEIEFEVSPCEHL 742

Query: 722 KIVDNAANSLLASGAHTILV 741
              +     ++  G H ++V
Sbjct: 743 SRANEDGLMVMEEGRHFLVV 762


>gi|168065036|ref|XP_001784462.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162663987|gb|EDQ50724.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 726

 Score =  687 bits (1774), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 359/749 (47%), Positives = 482/749 (64%), Gaps = 49/749 (6%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +CD  L    R  DLV R+TL EKV Q+ + A  +PRL +P YEWW E LHGV+ +    
Sbjct: 3   FCDTSLSDEIRVFDLVSRLTLEEKVTQLVNTASAIPRLSIPAYEWWQEGLHGVAHVS--- 59

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
                  F   +P ATSFP  ILTTASFN+ LW +IGQ  STEARA YN G AGLT+WSP
Sbjct: 60  -------FGGSLPRATSFPLPILTTASFNKDLWNQIGQAFSTEARAFYNDGIAGLTYWSP 112

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
            IN+ RDPRWGR+ ET GEDPY    YA ++V+G+Q  EG     D++S+ LK+SACCKH
Sbjct: 113 VINIARDPRWGRIQETSGEDPYTTSAYATHFVQGMQ--EG-----DANSKRLKLSACCKH 165

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           + AYD+DNWEG DR+HFD++    ++ +T+  PF+ CV EG  +S+MCSYN+VNG+PTCA
Sbjct: 166 FTAYDVDNWEGIDRYHFDAKA---NLADTYNPPFQSCVQEGRSASLMCSYNKVNGVPTCA 222

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
           +   L  T+R  W  +GYIVSDCDS+  + ES  +   T EDA A  L AGLDL+CGDY 
Sbjct: 223 NYDFLENTVRRAWGLNGYIVSDCDSVLVMHESTNYA-PTTEDAAADALNAGLDLNCGDYL 281

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQHIEL 370
            ++T GAV  GK+  + +D ++  +++V MRLG FDG+P   ++ N+G  ++C P H EL
Sbjct: 282 ASYTEGAVAMGKVNASRVDNAVYNVFLVRMRLGMFDGNPANQEFGNIGVADVCTPAHQEL 341

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A EAARQGIVLLKND   LPL + NI T A++GP+ANAT  M+GNYEG PC+Y +P+ G 
Sbjct: 342 AVEAARQGIVLLKNDGNILPL-SKNINT-AVIGPNANATHTMLGNYEGIPCQYITPLQGL 399

Query: 431 YA-----YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
                  Y KV  ++ GC +  CQ +  I +A+  A  ADA V+V GL    E+E  DR 
Sbjct: 400 VKFGSGDYHKVW-FSEGCVNTACQQDDQISSAVSTAAVADAVVLVVGLSQVQESEALDRT 458

Query: 486 DLLLPGFQTELINKVADAAKG-PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
            LLLPG+Q  LI++VA AA G PV LV+M AG VDINFAKN+ +I+SILWVGYPG+ GG+
Sbjct: 459 SLLLPGYQQTLIDEVAGAAAGRPVVLVLMCAGPVDINFAKNDKRIQSILWVGYPGQSGGQ 518

Query: 545 AIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
           AIA+VIFG +NPGG+LP++WY  +Y KI  T+M +RP   +N+PGRTY+F+ G  +Y FG
Sbjct: 519 AIAEVIFGAHNPGGKLPMSWYPEDYTKISMTNMNMRPDSRSNYPGRTYRFYTGEKIYDFG 578

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
           YGLSYT++K+  A +P +V       Q C     + G+              C    F  
Sbjct: 579 YGLSYTEYKHSFALAPTTVMTPSIHSQLCDPHQTSAGSK------------TCSSSNFDV 626

Query: 663 QIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
            I VEN+G M G+  ++++   P  G  GT +KQ+  ++ V+I +G   KV  T+N C+ 
Sbjct: 627 HINVENIGAMAGNHTLLLFFTAPSAGKNGTPLKQLAAFDSVYIRSGSQEKVVLTLNPCQH 686

Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVS 749
           L  V      +L +G H + VG+    +S
Sbjct: 687 LGTVAEDGTRMLEAGNHILSVGDAKHSLS 715


>gi|224082152|ref|XP_002306583.1| predicted protein [Populus trichocarpa]
 gi|222856032|gb|EEE93579.1| predicted protein [Populus trichocarpa]
          Length = 745

 Score =  683 bits (1762), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 333/740 (45%), Positives = 474/740 (64%), Gaps = 51/740 (6%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           FP+C   LP  +RA DLV R+TL EK+ Q+ + A  +PRLG+P Y+WWSEALHGV++ G 
Sbjct: 40  FPFCKTTLPISQRANDLVSRLTLEEKISQLVNSAQPIPRLGIPGYQWWSEALHGVAYAG- 98

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
                PG  F+  +  ATSFP VIL+ ASF+ + W +I Q +  EARA+YN G A G+TF
Sbjct: 99  -----PGIRFNGTIKRATSFPQVILSAASFDANQWYRISQAIGKEARALYNAGQATGMTF 153

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           W+PNIN+ RDPRWGR  ETPGEDP + G+YA++YVRGLQ   G  +       PL+ SAC
Sbjct: 154 WAPNINIFRDPRWGRGQETPGEDPLMTGKYAVSYVRGLQ---GDSFKGGEIKGPLQASAC 210

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH+ AYDL+NW G  R+ FD+ VT QD+ +T+  PF+ CV EG  S +MC+YNRVNGIP
Sbjct: 211 CKHFTAYDLENWNGTSRYVFDAYVTAQDLADTYQPPFKSCVEEGRASGIMCAYNRVNGIP 270

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
            CAD   L++T R  W F GYI SDCD++  I ++  +   T EDAV  VLKAG+D++CG
Sbjct: 271 NCADSNFLSRTARAQWGFDGYIASDCDAVSIIHDAQGYAK-TPEDAVVAVLKAGMDVNCG 329

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQH 367
            Y    T  AV Q K+  ++ID +L  L+ V MRLG F+G+P   Q+ N+G + +C+ ++
Sbjct: 330 SYLQQHTKAAVDQKKLTISEIDRALHNLFSVRMRLGLFNGNPTGQQFGNIGPDQVCSQEN 389

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
             LA +AAR GIVLLKN  G LPL+     +LA++GP+AN+ + ++GNY G PC+  +P+
Sbjct: 390 QILALDAARNGIVLLKNSAGLLPLSKSKTMSLAVIGPNANSVQTLLGNYAGPPCKLVTPL 449

Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
               +Y K     PGC  + C + S++  A++ AK AD  V++ GLD + E EG DR DL
Sbjct: 450 QALQSYIKHTIPYPGCDSVQCSSASIV-GAVNVAKGADHVVLIMGLDDTQEKEGLDRRDL 508

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
           +LPG Q ELI  VA AAK PV LV++S G VDI+FAKN+  I SILW GYPGE G  A+A
Sbjct: 509 VLPGKQQELIISVAKAAKNPVVLVLLSGGPVDISFAKNDKNIGSILWAGYPGEAGAIALA 568

Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGYGL 605
           ++IFG +NPGG+LP+TWY   +VK+P T M +RP   + +PGRTY+F+ GP V+ FGYGL
Sbjct: 569 EIIFGDHNPGGKLPMTWYPQEFVKVPMTDMRMRPETSSGYPGRTYRFYKGPTVFEFGYGL 628

Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIE 665
           SY+++ Y++                                A+ I + +C++ KF   + 
Sbjct: 629 SYSKYTYEL-------------------------------RAIYIGEEQCENIKFKVTVS 657

Query: 666 VENMGKMDGSEVVMVYSK--PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKI 723
           V+N G+M G   V+++++   PG  G  IK+++G++ V + AG+  ++ + ++ C+ L  
Sbjct: 658 VKNEGQMAGKHPVLLFARHAKPG-KGRPIKKLVGFQTVKLGAGEKTEIEYELSPCEHLSS 716

Query: 724 VDNAANSLLASGAHTILVGE 743
            +     ++  G+  +LVG+
Sbjct: 717 ANEDGVMVMEEGSQILLVGD 736


>gi|85813772|emb|CAJ65922.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
          Length = 757

 Score =  682 bits (1760), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 362/757 (47%), Positives = 476/757 (62%), Gaps = 79/757 (10%)

Query: 8   KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
            L+ F +C+  L   +R  DLV+R+TL EK+  + + A  V RLG+P YEWWSEALHGVS
Sbjct: 50  SLASFGFCNTSLGVSDRVVDLVKRLTLQEKILFLVNSAGSVSRLGIPKYEWWSEALHGVS 109

Query: 68  FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIG----QTVSTEARAMYNL 123
           ++G      PGTHF S VPGATSFP VILT ASFN SL+  IG    Q VSTEARAMYN+
Sbjct: 110 YVG------PGTHFSSVVPGATSFPQVILTAASFNTSLFVAIGKVISQVVSTEARAMYNV 163

Query: 124 GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
           G AGLTFWSPNIN+ RDPRWGR  ETPGEDP +  +Y   YV+GLQ  +      D +  
Sbjct: 164 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSGYVKGLQQRD------DGNPD 217

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDS-RVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
            LK++ACCKHY AYDLDNW+G DR+HF++  VT+QDM +TF  PF+ CV +G+V+SVMCS
Sbjct: 218 GLKVAACCKHYTAYDLDNWKGVDRYHFNAVVVTKQDMDDTFQPPFKSCVVDGNVASVMCS 277

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHG--YIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
           YN+VNGIPTCADP LL+  IRG+W  +G  YIV+DCDSI     S  +   T E+A A+ 
Sbjct: 278 YNKVNGIPTCADPDLLSGVIRGEWKLNGYVYIVTDCDSIDVFYNSQHY-TKTPEEAAAKA 336

Query: 301 LKA--GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YK 355
           + A  GLDL+CG +    T  AV  G + E+ ID ++   +  LMRLG+FDG P    Y 
Sbjct: 337 ILAGIGLDLNCGSFLGKHTEAAVTAGLVNESAIDRAVSNNFATLMRLGFFDGDPSKQLYG 396

Query: 356 NLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGN 415
            LG  ++C  ++ ELA EAARQGIVLLKN                               
Sbjct: 397 KLGPKDVCTAENQELAREAARQGIVLLKN------------------------------- 425

Query: 416 YEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL 475
             GTPC+YT+P+ G  A      Y PGC+++ C + + +  A   A  ADATV+V G DL
Sbjct: 426 -TGTPCKYTTPLQGLAALVAT-TYLPGCSNVAC-STAQVDDAKKIAAAADATVLVMGADL 482

Query: 476 SVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
           S+EAE +DRVD+LLPG Q  LI  VA+A+ GPV LVIMS G +D++FAK N KI SILWV
Sbjct: 483 SIEAESRDRVDILLPGQQQLLITAVANASTGPVILVIMSGGGMDVSFAKTNDKITSILWV 542

Query: 536 GYPGEEGGRAIADVIFGKYN------PGGRLPITWYEANYV-KIPYTSMPLR--PVNNFP 586
           GYPGE GG AIAD+IFG YN      PGGRLP+TWY  +YV K+P T+M +R  P N +P
Sbjct: 543 GYPGEAGGAAIADIIFGSYNPSTHQPPGGRLPMTWYPQSYVDKVPMTNMNMRPDPSNGYP 602

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY+F+ G  VY FG GLSY++F +++  +P  V + L+++  C             C 
Sbjct: 603 GRTYRFYTGETVYSFGDGLSYSEFSHELTQAPGLVSVPLEENHVCY---------SSECK 653

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
           +V   +  C++  F   + ++N G   GS  V ++S PP +  +  K ++G+E+VF+ A 
Sbjct: 654 SVAAAEQTCQN--FDVHLRIKNTGTTSGSHTVFLFSTPPSVHNSPQKHLVGFEKVFLHAQ 711

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
             + VGF ++ CK L +VD   +  +A G H + +G 
Sbjct: 712 TDSHVGFKVDVCKDLSVVDELGSKKVALGEHVLHIGS 748


>gi|115459584|ref|NP_001053392.1| Os04g0530700 [Oryza sativa Japonica Group]
 gi|38346629|emb|CAD41212.2| OSJNBa0074L08.23 [Oryza sativa Japonica Group]
 gi|38346760|emb|CAE03865.2| OSJNBa0081C01.11 [Oryza sativa Japonica Group]
 gi|113564963|dbj|BAF15306.1| Os04g0530700 [Oryza sativa Japonica Group]
 gi|218195263|gb|EEC77690.1| hypothetical protein OsI_16749 [Oryza sativa Indica Group]
          Length = 770

 Score =  682 bits (1759), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 343/744 (46%), Positives = 476/744 (63%), Gaps = 28/744 (3%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S +P+C+A LP+P RA+ LV  +TL EK+ Q+ + A G PRLG+P +EWWSE+LHGV   
Sbjct: 36  SAYPFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSESLHGV--- 92

Query: 70  GRRTNSPPGTHFDS-EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
               ++ PG +F S  V  AT FP VIL+ A+FN SLW+   + ++ EARAM+N G AGL
Sbjct: 93  ---CDNGPGVNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAGL 149

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFW+PNINV RDPRWGR  ETPGEDP VV  Y++ YV+G Q   G E         + +S
Sbjct: 150 TFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGEEGR-------MMLS 202

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYDL+ W G  R+ F+++V  QDM++T+  PF+ C+ EG  S +MCSYN+VNG
Sbjct: 203 ACCKHYIAYDLEKWRGFTRYTFNAKVNAQDMEDTYQPPFKSCIQEGRASCLMCSYNQVNG 262

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           +P CA   +L Q  R +W F GYI SDCD++  I E+  +   + ED++A VLKAG+D++
Sbjct: 263 VPACARKDIL-QRARDEWGFQGYITSDCDAVAIIHENQTY-TASDEDSIAVVLKAGMDIN 320

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG +    T  A+++GK+ E DI+ +L  L+ V +RLG+FD + +   +  LG NN+C  
Sbjct: 321 CGSFLIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCTT 380

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H ELAAEA RQG VLLKNDNG LPL    +  +AL+GP AN    + G+Y G PC  T+
Sbjct: 381 EHRELAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHSTT 440

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
            + G  AY     +A GC D+ C +      AI+AAK AD  V++AGL+L+ E E  DRV
Sbjct: 441 FVKGMQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDRV 500

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            LLLPG Q +LI+ VA   K PV LV+M  G VD++FAK++P+I SILW+GYPGE GG  
Sbjct: 501 SLLLPGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGNV 560

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
           + +++FGKYNPGG+LPITWY  ++  +P   M +R      +PGRTY+F+ G VVY FGY
Sbjct: 561 LPEILFGKYNPGGKLPITWYPESFTAVPMDDMNMRADASRGYPGRTYRFYTGDVVYGFGY 620

Query: 604 GLSYTQFKYKVASSPKSVDIKLDK--DQQCRDINYTVGTNKPPCAAVLIDDV-KCKDYKF 660
           GLSY+++ Y +  +PK + +      D   R   Y   T +     V ++D+  C+  +F
Sbjct: 621 GLSYSKYSYSILQAPKKISLSRSSVPDLISRKPAY---TRRDGVDYVQVEDIASCEALQF 677

Query: 661 TFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
              I V N G MDGS  V+++ S  P   G+ IKQ++G+ERV  AAG+S  V  T++ CK
Sbjct: 678 PVHISVSNDGAMDGSHAVLLFASSKPSFPGSPIKQLVGFERVHTAAGRSTDVEITVDPCK 737

Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
            +   +     +L  G H ++VG+
Sbjct: 738 LMSFANTEGTRVLFLGTHVLMVGD 761


>gi|358349509|ref|XP_003638778.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355504713|gb|AES85916.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 776

 Score =  681 bits (1758), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 345/743 (46%), Positives = 474/743 (63%), Gaps = 28/743 (3%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +P+C+ KLP  +R KDLV R+TL EK+ Q+ + A  +PRLG+P YEWWSEALHG+  +GR
Sbjct: 42  YPFCNPKLPITQRTKDLVSRLTLDEKLAQLVNSAPPIPRLGIPAYEWWSEALHGIGNVGR 101

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
                 G  F+  +  ATSFP VILT ASF+  LW +IGQ +  EARA+YN G A G+TF
Sbjct: 102 ------GIFFNGSITSATSFPQVILTAASFDSHLWYRIGQAIGVEARAIYNGGQAMGMTF 155

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           W+PNIN+ RDPRWGR  ET GEDP +   YA++YVRGLQ   G  +        L+ SAC
Sbjct: 156 WAPNINIFRDPRWGRGQETAGEDPMMTSNYAVSYVRGLQ---GDSFQGGKLRGHLQASAC 212

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH+ AYDLDNW+G +RFHFD+RV+ QD+ +T+  PF  C+ +G  S +MC+YNRVNGIP
Sbjct: 213 CKHFTAYDLDNWKGVNRFHFDARVSLQDLADTYQPPFRSCIEQGRASGIMCAYNRVNGIP 272

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
           +CAD  LL  T+R  W FHGYIVSDC ++  I +   +   + EDAVA VL AG+DL+CG
Sbjct: 273 SCADFNLLTNTVRKQWEFHGYIVSDCGAVGIIHDEQGYAK-SAEDAVADVLHAGMDLECG 331

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
            Y T+    AVQQ K+    ID +L  L+ + +RLG FDG+P    +  +G N++C+  H
Sbjct: 332 SYLTDHAKSAVQQKKLPIVRIDRALHNLFSIRIRLGQFDGNPAKLPFGMIGPNHVCSENH 391

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATK-AMIGNYEGTPCRYTSP 426
           + LA EAAR GIVLLKN    LPL   +I +LA++GP+ANA+   ++GNY G PC+  + 
Sbjct: 392 LYLALEAARNGIVLLKNTASLLPLPKTSI-SLAVIGPNANASPLTLLGNYAGPPCKSITI 450

Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
           + GF  Y K   + PGC       ++ I  A+  AKNAD  V+V GLD SVE E +DRV 
Sbjct: 451 LQGFQHYVKNAVFHPGCDGGPKCASAPIDKAVKVAKNADYVVLVMGLDQSVEREERDRVH 510

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           L LPG Q ELIN VA A+K PV LV++  G +DI+ AKNN KI  I+W GYPGE GG A+
Sbjct: 511 LDLPGKQLELINSVAKASKRPVILVLLCGGPIDISSAKNNDKIGGIIWAGYPGELGGIAL 570

Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
           A +IFG +NPGGRLPITWY  +Y+K+P T M +R  P   +PGRTY+F+ GP VY FG+G
Sbjct: 571 AQIIFGDHNPGGRLPITWYPKDYIKVPMTDMRMRADPTTGYPGRTYRFYKGPTVYEFGHG 630

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI---DDVKCKDYKFT 661
           LSYT++ Y+       V +  DK    +   + +  N       L+   D+  CK    +
Sbjct: 631 LSYTKYSYEF------VSVTHDKLHFNQSSTHLMTENSETIRYKLVSELDEETCKSMSVS 684

Query: 662 FQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
             + V+N G + G   ++++ +P      + +KQ++G+  + + AG+ + VGF ++ C+ 
Sbjct: 685 VTVGVKNHGNIVGRHPILLFMRPQKHRTRSPMKQLVGFHSLLLDAGEMSHVGFELSPCEH 744

Query: 721 LKIVDNAANSLLASGAHTILVGE 743
           L   + A   ++  G+H + VGE
Sbjct: 745 LSRANEAGLKIIEEGSHLLHVGE 767


>gi|224058158|ref|XP_002299457.1| predicted protein [Populus trichocarpa]
 gi|222846715|gb|EEE84262.1| predicted protein [Populus trichocarpa]
          Length = 780

 Score =  681 bits (1757), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 339/729 (46%), Positives = 462/729 (63%), Gaps = 19/729 (2%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           + +C+  LP   RA+ L+  +TL EK+QQ+ D A G+PRLG+P YEWWSE+LHG+S  G 
Sbjct: 40  YSFCNKSLPITRRAQSLISHLTLQEKIQQLSDNASGIPRLGIPHYEWWSESLHGISING- 98

Query: 72  RTNSPPGTHFDSEVP--GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
                PG  F +  P   AT FP VI++ ASFN +LW  IG  ++ EARAMYN+G AGLT
Sbjct: 99  -----PGVSFKNGGPVTSATGFPQVIVSAASFNRTLWFLIGSAIAIEARAMYNVGQAGLT 153

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
           FW+PNIN+ RDPRWGR  ETPGEDP V   YAI +V+G Q         + +   L +SA
Sbjct: 154 FWAPNINIFRDPRWGRGQETPGEDPMVASAYAIEFVKGFQGGHWKNEDGEINDDKLMLSA 213

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKH  AYDL+ W    R+ F++ VTEQDM++T+  PF  C+ +G  S +MCSYN VNG+
Sbjct: 214 CCKHSTAYDLEKWGNFSRYSFNAVVTEQDMEDTYQPPFRSCIQKGKASCLMCSYNEVNGV 273

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           P CA   LL Q  R +W F GYI SDCD++ TI E   + + + EDAVA  LKAG+D++C
Sbjct: 274 PACAREDLL-QKPRTEWGFKGYITSDCDAVATIFEYQNY-SKSPEDAVAIALKAGMDINC 331

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQ 366
           G Y       AV++GK+ E DID +L  L+ V +RLG FDG P   Q+  LG  N+C  +
Sbjct: 332 GTYVLRNAQSAVEKGKLQEEDIDRALHNLFSVQLRLGLFDGDPRKGQFGKLGPKNVCTKE 391

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
           H  LA EAARQGIVLLKND   LPLN   + +LA++GP AN   ++ G+Y G PC   S 
Sbjct: 392 HKTLALEAARQGIVLLKNDKKLLPLNKKAVSSLAIIGPLANMANSLGGDYTGYPCDPQSL 451

Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
            +G  AY K  +YA GC D+ C +++    AI  AK AD  +IVAGLDLS E E  DRV 
Sbjct: 452 FEGLKAYVKKTSYAIGCLDVACVSDTQFHKAIIVAKRADFVIIVAGLDLSQETEEHDRVS 511

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           LLLPG Q  L++ VA A+K PV LV+   G +D++FAK +P+I SILW+GYPGE G +A+
Sbjct: 512 LLLPGKQMSLVSSVAAASKKPVILVLTGGGPLDVSFAKGDPRIASILWIGYPGEAGAKAL 571

Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
           A++IFG+YNPGGRLP+TWY  ++ ++  T M +R  P   +PGRTY+F+ G  VY FG G
Sbjct: 572 AEIIFGEYNPGGRLPMTWYPESFTEVSMTDMNMRPNPSRGYPGRTYRFYTGNRVYGFGGG 631

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV-KCKDYKFTFQ 663
           LSYT F YK+ S+P  + +        R      G  +   + + I+++  C   +F  Q
Sbjct: 632 LSYTNFTYKILSAPSKLSLSGSLSSNSRKRILQQGGER--LSYININEITSCDSLRFYMQ 689

Query: 664 IEVENMGKMDGSEVVMVYSKPPGI-AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
           I VEN+G MDG  VVM++S+ P +  G   KQ++G++RV   + +S ++   ++ C+ L 
Sbjct: 690 ILVENVGNMDGGHVVMLFSRVPTVFRGAPEKQLVGFDRVHTISHRSTEMSILVDPCEHLS 749

Query: 723 IVDNAANSL 731
           + +     +
Sbjct: 750 VANEQGKKI 758


>gi|357485313|ref|XP_003612944.1| Beta-D-xylosidase [Medicago truncatula]
 gi|355514279|gb|AES95902.1| Beta-D-xylosidase [Medicago truncatula]
          Length = 783

 Score =  678 bits (1750), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 335/745 (44%), Positives = 472/745 (63%), Gaps = 20/745 (2%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S +P+C+  LP   R   L+  +TL +K+ Q+ + A  +  LG+P Y+WWSEALHG++  
Sbjct: 38  SHYPFCNISLPISTRTTSLISLLTLSDKINQLSNTASSISHLGIPSYQWWSEALHGIATN 97

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
           G      PG +F+  V  AT+FP VI++ A+FN SLW  IG  V  E RAM+N+G AGL+
Sbjct: 98  G------PGVNFNGSVKSATNFPQVIVSAAAFNRSLWFLIGYAVGVEGRAMFNVGQAGLS 151

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEY---HRDSDSRPLK 186
           FW+PN+NV RDPRWGR  ETPGEDP V   YA+ +VRG+Q V+G++      DSD   L 
Sbjct: 152 FWAPNVNVFRDPRWGRGQETPGEDPMVGSAYAVEFVRGIQGVDGIKKVLNDHDSDDDGLM 211

Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
           +SACCKH+ AYDL+ W    R++F++ VT+QD+++T+  PF  CV +G  S +MCSYN V
Sbjct: 212 VSACCKHFTAYDLEKWGEFSRYNFNAVVTQQDLEDTYQPPFRGCVQQGKASCLMCSYNEV 271

Query: 247 NGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD 306
           NG+P CA   LL   +R  W F GYI SDCD++ T+ E  K+   + EDAVA VLKAG+D
Sbjct: 272 NGVPACASKDLLG-LVRNKWGFEGYIASDCDAVATVFEYQKYAK-SAEDAVADVLKAGMD 329

Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNIC 363
           ++CG +    T  A++QG + E D+D +L  L+ V MRLG F+G P+   +  LG  ++C
Sbjct: 330 INCGTFMLRHTESAIEQGLVKEEDLDRALFNLFSVQMRLGLFNGDPEKGKFGKLGPQDVC 389

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
            P+H +LA EAARQGIVLLKNDN  LPL+  +  +LA++GP A  T  + G Y G PC  
Sbjct: 390 TPEHKKLALEAARQGIVLLKNDNKFLPLDKKDRVSLAIIGPMAT-TSELGGGYSGIPCSP 448

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
            S  DG   Y K I+YA GC+D+ C ++     AID AK AD  VIVAGLD ++E E  D
Sbjct: 449 RSLYDGLKEYVKTISYAFGCSDVKCDSDDGFAVAIDIAKQADFVVIVAGLDTTLETEDLD 508

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           RV LLLPG Q +L+++VA A+K PV LV+   G +D++FA++N  I SILW+GYPGE GG
Sbjct: 509 RVSLLLPGKQMDLVSRVAAASKRPVILVLTGGGPLDVSFAESNQLITSILWIGYPGEAGG 568

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPF 601
           +A+A++IFG++NP GRLP+TWY  ++  +P   M +R  P   +PGRTY+F+ G  +Y F
Sbjct: 569 KALAEIIFGEFNPAGRLPMTWYPESFTNVPMNDMGMRADPSRGYPGRTYRFYTGSRIYGF 628

Query: 602 GYGLSYTQFKYKVASSPKSVDI-KLDKDQQCRDINYTVGTNKPPCAAVLIDDVK-CKDYK 659
           G+GLSY+ F Y+V S+P  + + K       R +   V  +      V +D+++ C    
Sbjct: 629 GHGLSYSDFSYRVLSAPSKLSLSKTTNGGLRRSLLNKVEKDVFEVDHVHVDELQNCNSLS 688

Query: 660 FTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
           F+  I V N+G MDGS VVM++SK P  I G+   Q++G  R+   + +S +     + C
Sbjct: 689 FSVHISVMNVGDMDGSHVVMLFSKWPKNIQGSPESQLVGPSRLHTVSNKSIETSILADPC 748

Query: 719 KSLKIVDNAANSLLASGAHTILVGE 743
           +     D     +L  G H + VG+
Sbjct: 749 EHFSFADEQGKRILPLGNHILNVGD 773


>gi|253761874|ref|XP_002489311.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
 gi|241946959|gb|EES20104.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
          Length = 791

 Score =  677 bits (1747), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 337/754 (44%), Positives = 461/754 (61%), Gaps = 39/754 (5%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            P+C+ KLP  +RA DLV RMT  EK  Q+GD+A GVPRLG+P Y+WW+EALHGV+  G+
Sbjct: 60  LPFCNMKLPASQRAADLVSRMTPAEKASQLGDIANGVPRLGVPSYKWWNEALHGVAISGK 119

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
                 G H +  V  ATSFP V+ T ASFN++LW +IGQ    EARA YN+G A GLT 
Sbjct: 120 ------GIHMNQGVRSATSFPQVLHTAASFNDNLWFRIGQATGKEARAFYNIGQAEGLTM 173

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           WSPN+N+ RDPRWGR  ETPGEDP V  RY   +VRGLQ   G   +  S    L+ SAC
Sbjct: 174 WSPNVNIFRDPRWGRGQETPGEDPAVASRYGAAFVRGLQ---GSSSNTKSVPPVLQTSAC 230

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH  AYDL++W+G  R+ F + VT QD+ +TF  PF  CV +G  S VMC+Y  VNG+P
Sbjct: 231 CKHATAYDLEDWKGVSRYSFKATVTIQDLADTFNPPFRSCVVDGKASCVMCAYTIVNGVP 290

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
           +CA+  LL +T RG W   GY+ +DCD++  I+ + +F   T ED VA  LKAGLD+DCG
Sbjct: 291 SCANGDLLTKTFRGSWGLDGYVAADCDAV-AIMRNSQFYRPTAEDTVAATLKAGLDIDCG 349

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
            Y   + M A+Q+GK+ + D+D +++ L    MRLG+FDG P+   Y NLG  +IC  +H
Sbjct: 350 PYIQQYAMAAIQKGKLTQQDVDKAVKNLLTTRMRLGHFDGDPKTNVYGNLGAGHICTAEH 409

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
             LA EAA  GIVLLKN  G LPL  G + + A++G +AN   A++GNY G PC  T+P+
Sbjct: 410 KNLALEAALDGIVLLKNSAGVLPLKRGTVNSAAVIGHNANDVLALLGNYWGPPCAPTTPL 469

Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
            G   Y K + +  GC    C N +  P A   A ++DA ++  GL    E+EGKDR  L
Sbjct: 470 QGIQGYVKNVKFLAGCNKAAC-NVAATPQATALASSSDAVILFMGLSQEQESEGKDRTTL 528

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
           LLPG Q  LIN VA+AAK PV LV+++ G VDI FA+ NPKI +ILW GYPG+ GG AIA
Sbjct: 529 LLPGNQQSLINAVANAAKRPVILVLLTGGPVDITFAQANPKIGAILWAGYPGQAGGLAIA 588

Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSY 607
            V+FG+ NP G+LP TWY   + +IP T M +R   ++PGRTY+F++G  +Y FGYGLSY
Sbjct: 589 KVLFGEKNPSGKLPNTWYPEEFTRIPMTDMRMRAAGSYPGRTYRFYNGKTIYKFGYGLSY 648

Query: 608 TQFKYKVASSPKSVD----------IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           ++F ++V +  K+              + +D     + +             I DV C  
Sbjct: 649 SKFSHRVVTGRKNPAHNTSLLAAGLAAMTEDNLSYHVEH-------------IGDVVCDQ 695

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
            KF   ++V+N G +DG    +++ + P    G   +Q+IG++   I AG+ A + F ++
Sbjct: 696 LKFLAVVKVQNHGPIDGKHTALMFLRWPSATDGRPTRQLIGFQSQHIKAGEKANLRFEVS 755

Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
            C+    V      ++  G+H + VG+    +SF
Sbjct: 756 PCEHFSRVRQDGRKVIDKGSHFLKVGKHELEISF 789


>gi|115448721|ref|NP_001048140.1| Os02g0752200 [Oryza sativa Japonica Group]
 gi|46390122|dbj|BAD15557.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|46390225|dbj|BAD15656.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|113537671|dbj|BAF10054.1| Os02g0752200 [Oryza sativa Japonica Group]
 gi|125583710|gb|EAZ24641.1| hypothetical protein OsJ_08409 [Oryza sativa Japonica Group]
          Length = 780

 Score =  677 bits (1746), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 350/757 (46%), Positives = 468/757 (61%), Gaps = 36/757 (4%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S   +C+ +LP  +RA DLV R+TL EK+ Q+GD +  V RLG+P Y+WWSEALHGVS  
Sbjct: 42  SSAAFCNPRLPIEQRADDLVSRLTLEEKISQLGDQSPAVDRLGVPAYKWWSEALHGVSNA 101

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GL 128
           GR      G H D  +  ATSFP VILT ASFN  LW +IGQ + TEARA+YN G A GL
Sbjct: 102 GR------GIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARAVYNNGQAEGL 155

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFW+PNINV RDPRWGR  ETPGEDP V G+YA  +VRG+Q   G       +S  L+ S
Sbjct: 156 TFWAPNINVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQ---GYALAGAINSTDLEAS 212

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKH+ AYDL+NW+G  R+ FD++VT QD+ +T+  PF  CV +G  S +MCSYNRVNG
Sbjct: 213 ACCKHFTAYDLENWKGVTRYAFDAKVTAQDLADTYNPPFRSCVEDGGASGIMCSYNRVNG 272

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           +PTCAD  LL++T RGDW F+GYI SDCD++  I +   +   T EDAVA VLKAG+D++
Sbjct: 273 VPTCADYNLLSKTARGDWRFYGYITSDCDAVSIIHDVQGYAK-TAEDAVADVLKAGMDVN 331

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYK---NLGKNNICNP 365
           CG Y     + A+QQGKI E DI+ +L  L+ V MRLG F+G+P+Y    N+G + +C  
Sbjct: 332 CGSYVQEHGLSAIQQGKITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGPDQVCTQ 391

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H  LA EAA+ G+VLLKND  ALPL+   + ++A++G +AN    ++GNY G PC   +
Sbjct: 392 EHQNLALEAAQHGVVLLKNDANALPLSKSQVSSIAVIGHNANDATRLLGNYFGPPCISVT 451

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+     Y K   +  GC    C N S I  A   A + D  V+  GLD   E E  DR+
Sbjct: 452 PLQVLQGYVKDTRFLAGCNSAAC-NVSSIGEAAQLASSVDYVVLFMGLDQDQEREEVDRL 510

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           +L LPG Q  LIN VA+AAK PV LV++  G VD+ FAK NPKI +ILW GYPGE GG A
Sbjct: 511 ELSLPGMQENLINTVANAAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPGEAGGIA 570

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
           IA V+FG++NPGGRLP+TWY   +  +P T M +R  P   +PGRTY+F+ G  VY FGY
Sbjct: 571 IAQVLFGEHNPGGRLPVTWYPKEFTSVPMTDMRMRADPSTGYPGRTYRFYRGNTVYKFGY 630

Query: 604 GLSYTQFKYKVAS------SPKSVD-IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
           GLSY+++ +   +      S  S+D +K         ++Y V    P           C 
Sbjct: 631 GLSYSKYSHHFVANGTKLPSLSSIDGLKAMATAAAGTVSYDVEEIGPE---------TCD 681

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA---GTHIKQVIGYERVFIAAGQSAKVGF 713
             KF   + V+N G MDG   V+++ + P  A   G    Q+IG++ + + + Q+  V F
Sbjct: 682 KLKFPALVRVQNHGPMDGRHPVLLFLRWPNGAADGGRPASQLIGFQSLHLKSMQTVHVEF 741

Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
            ++ CK           ++  G+H ++VG+    +SF
Sbjct: 742 EVSPCKHFSRATEDGKKVIDHGSHFMMVGDDEFEMSF 778


>gi|218191593|gb|EEC74020.1| hypothetical protein OsI_08964 [Oryza sativa Indica Group]
          Length = 774

 Score =  676 bits (1745), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 350/757 (46%), Positives = 468/757 (61%), Gaps = 36/757 (4%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S   +C+ +LP  +RA DLV R+TL EK+ Q+GD +  V RLG+P Y+WWSEALHGVS  
Sbjct: 36  SSAAFCNPRLPIEQRADDLVSRLTLEEKISQLGDQSPAVDRLGVPAYKWWSEALHGVSNA 95

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GL 128
           GR      G H D  +  ATSFP VILT ASFN  LW +IGQ + TEARA+YN G A GL
Sbjct: 96  GR------GIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARAVYNNGQAEGL 149

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFW+PNINV RDPRWGR  ETPGEDP V G+YA  +VRG+Q   G       +S  L+ S
Sbjct: 150 TFWAPNINVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQ---GYALAGAINSTDLEAS 206

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKH+ AYDL+NW+G  R+ FD++VT QD+ +T+  PF  CV +G  S +MCSYNRVNG
Sbjct: 207 ACCKHFTAYDLENWKGVTRYAFDAKVTAQDLADTYNPPFRSCVEDGGASGIMCSYNRVNG 266

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           +PTCAD  LL++T RGDW F+GYI SDCD++  I +   +   T EDAVA VLKAG+D++
Sbjct: 267 VPTCADYNLLSKTARGDWRFYGYITSDCDAVSIIHDVQGYAK-TAEDAVADVLKAGMDVN 325

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYK---NLGKNNICNP 365
           CG Y     + A+QQGKI E DI+ +L  L+ V MRLG F+G+P+Y    N+G + +C  
Sbjct: 326 CGSYVQEHGLSAIQQGKITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGPDQVCTQ 385

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H  LA EAA+ G+VLLKND  ALPL+   + ++A++G +AN    ++GNY G PC   +
Sbjct: 386 EHQNLALEAAQHGVVLLKNDANALPLSKSQVSSIAVIGHNANDATRLLGNYFGPPCISVT 445

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+     Y K   +  GC    C N S I  A   A + D  V+  GLD   E E  DR+
Sbjct: 446 PLQVLQGYVKDTRFLAGCNSAAC-NVSSIGEAAQLASSVDYVVLFMGLDQDQEREEVDRL 504

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           +L LPG Q  LIN VA+AAK PV LV++  G VD+ FAK NPKI +ILW GYPGE GG A
Sbjct: 505 ELSLPGMQENLINTVANAAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPGEAGGIA 564

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
           IA V+FG++NPGGRLP+TWY   +  +P T M +R  P   +PGRTY+F+ G  VY FGY
Sbjct: 565 IAQVLFGEHNPGGRLPVTWYPKEFTSVPMTDMRMRADPSTGYPGRTYRFYRGNTVYKFGY 624

Query: 604 GLSYTQFKYKVAS------SPKSVD-IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
           GLSY+++ +   +      S  S+D +K         ++Y V           I    C 
Sbjct: 625 GLSYSKYSHHFVANGTKLPSLSSIDGLKAMATAAAGTVSYDVEE---------IGTETCD 675

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA---GTHIKQVIGYERVFIAAGQSAKVGF 713
             KF   + V+N G MDG   V+++ + P  A   G    Q+IG++ + + + Q+  V F
Sbjct: 676 KLKFPALVRVQNHGPMDGRHPVLLFLRWPNGAADGGRPASQLIGFQSLHLKSMQTVHVEF 735

Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
            ++ CK           ++  G+H ++VG+    +SF
Sbjct: 736 EVSPCKHFSRATEDGKKVIDHGSHFMMVGDDEFEMSF 772


>gi|356515806|ref|XP_003526589.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
          Length = 772

 Score =  676 bits (1744), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 346/744 (46%), Positives = 476/744 (63%), Gaps = 34/744 (4%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +P+C+ KLP P+R KDL+ R+TL EK+ Q+ + A  +PRLG+P Y+WWSEALHGVS +G 
Sbjct: 38  YPFCNPKLPIPQRTKDLLSRLTLDEKLSQLVNTAPPIPRLGIPAYQWWSEALHGVSGVG- 96

Query: 72  RTNSPPGTHFD--SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GL 128
                PG  FD  S +  ATSFP VILT ASF+  LW +IG  +  EARA++N G A GL
Sbjct: 97  -----PGILFDNNSTISSATSFPQVILTAASFDSRLWYRIGHAIGIEARAIFNAGQANGL 151

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFW+PNIN+ RDPRWGR  ET GEDP +  RYA+++VRGLQ       H       L  S
Sbjct: 152 TFWAPNINIFRDPRWGRGQETAGEDPLLTSRYAVSFVRGLQGDSFKGAH-------LLAS 204

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKH+ AYDLDNW+G DRF FD+RV+ QD+ +T+  PF+ CV +G  S +MC+YNRVNG
Sbjct: 205 ACCKHFTAYDLDNWKGVDRFVFDARVSLQDLADTYQPPFQSCVQQGRASGIMCAYNRVNG 264

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           +P CAD  LL QT R  W+F+GYI SDC ++  I +  ++   + ED VA VL+AG+DL+
Sbjct: 265 VPNCADYGLLTQTARNQWDFNGYITSDCGAVGFIHDRQRYAK-SPEDVVADVLRAGMDLE 323

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNP 365
           CG Y T     AV Q K+  ++ID +L+ L+ + MRLG FDG+P    +  +G N++C+ 
Sbjct: 324 CGSYLTYHAKSAVLQKKLGMSEIDRALQNLFSIRMRLGLFDGNPTRLSFGLIGSNHVCSK 383

Query: 366 QHIELAAEAARQGIVLLKNDNGALPL-NTGNIKTLALVGPHANATK-AMIGNYEGTPCRY 423
           +H  LA EAAR GIVLLKN    LPL  T    +LA++GP+AN++   ++GNY G PC+Y
Sbjct: 384 EHQYLALEAARNGIVLLKNSPTLLPLPKTSPSISLAVIGPNANSSPLTLLGNYAGPPCKY 443

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
            + + GF  Y K   Y PGC      +++ I  A++ AK  D  V+V GLD S E E +D
Sbjct: 444 VTILQGFRHYVKNAFYHPGCDGGPKCSSAQIDQAVEVAKKVDYVVLVMGLDQSEEREERD 503

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           RV L LPG Q ELIN VA+A+K PV LV++S G +DI  AK N KI  ILW GYPGE GG
Sbjct: 504 RVHLDLPGKQLELINGVAEASKKPVILVLLSGGPLDITSAKYNHKIGGILWAGYPGELGG 563

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPF 601
            A+A +IFG +NPGGRLP TWY  +Y+K+P T M +R  P   +PGRTY+F+ GP VY F
Sbjct: 564 IALAQIIFGDHNPGGRLPTTWYPKDYIKVPMTDMRMRADPSTGYPGRTYRFYKGPKVYEF 623

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI---DDVKCKDY 658
           GYGLSY+++ Y+       V +  DK    +   + +  N    +  L+   D+  C+  
Sbjct: 624 GYGLSYSKYSYEF------VSVTHDKLHFNQSSTHLMVENSETISYKLVSELDEQTCQSM 677

Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNA 717
             +  + V+N G M G   V+++ +P    +G+ +KQ++G+E V + AG+ A V F ++ 
Sbjct: 678 SLSVTVRVQNHGSMVGKHPVLLFIRPKRQKSGSPVKQLVGFESVMLDAGEMAHVEFEVSP 737

Query: 718 CKSLKIVDNAANSLLASGAHTILV 741
           C+ L   + A   ++  G+H +LV
Sbjct: 738 CEHLSRANEAGAMIIEEGSHMLLV 761


>gi|26449574|dbj|BAC41913.1| putative beta-xylosidase [Arabidopsis thaliana]
          Length = 732

 Score =  674 bits (1738), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 332/732 (45%), Positives = 468/732 (63%), Gaps = 32/732 (4%)

Query: 34  LPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPT 93
           LPEK+ Q+ + A  VPRLG+P YEWWSE+LHG++      ++ PG  F+  +  ATSFP 
Sbjct: 2   LPEKIGQLSNTAASVPRLGIPPYEWWSESLHGLA------DNGPGVSFNGSISAATSFPQ 55

Query: 94  VILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGED 153
           VI++ ASFN +LW +IG  V+ E RAMYN G AGLTFW+PNINV RDPRWGR  ETPGED
Sbjct: 56  VIVSAASFNRTLWYEIGSAVAVEGRAMYNGGQAGLTFWAPNINVFRDPRWGRGQETPGED 115

Query: 154 PYVVGRYAINYVRGLQDVEGVEYHRDSDSR-------------PLKISACCKHYAAYDLD 200
           P VV  Y + +VRG Q+ +  +  +   S               L +SACCKH+ AYDL+
Sbjct: 116 PKVVSEYGVEFVRGFQEKKKRKVLKRRFSDDVDDDRHDDDADGKLMLSACCKHFTAYDLE 175

Query: 201 NWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQ 260
            W    R+ F++ VTEQDM++T+  PFE C+ +G  S +MCSYN VNG+P CA   LL Q
Sbjct: 176 KWGNFTRYDFNAVVTEQDMEDTYQPPFETCIRDGKASCLMCSYNAVNGVPACAQGDLL-Q 234

Query: 261 TIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGA 320
             R +W F GYI SDCD++ TI  +++    + E+AVA  +KAG+D++CG Y    T  A
Sbjct: 235 KARVEWGFEGYITSDCDAVATIF-AYQGYTKSPEEAVADAIKAGVDINCGTYMLRHTQSA 293

Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQHIELAAEAARQ 377
           ++QGK++E  +D +L  L+ V +RLG FDG P   QY  LG N+IC+  H +LA EA RQ
Sbjct: 294 IEQGKVSEELVDRALLNLFAVQLRLGLFDGDPRRGQYGKLGSNDICSSDHRKLALEATRQ 353

Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVI 437
           GIVLLKND+  LPLN  ++ +LA+VGP AN    M G Y G PC+  +       Y K  
Sbjct: 354 GIVLLKNDHKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKT 413

Query: 438 NYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELI 497
           +YA GC+D+ C +++    A+  AK AD  ++VAGLDLS E E KDRV L LPG Q +L+
Sbjct: 414 SYASGCSDVSCDSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRVSLSLPGKQKDLV 473

Query: 498 NKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG 557
           + VA  +K PV LV+   G VD+ FAKN+P+I SI+W+GYPGE GG+A+A++IFG +NPG
Sbjct: 474 SHVAAVSKKPVILVLTGGGPVDVTFAKNDPRIGSIIWIGYPGETGGQALAEIIFGDFNPG 533

Query: 558 GRLPITWYEANYVKIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGYGLSYTQFKYKVA 615
           GRLP TWY  ++  +  + M +R  ++  +PGRTY+F+ GP VY FG GLSYT+F+YK+ 
Sbjct: 534 GRLPTTWYPESFTDVAMSDMHMRANSSRGYPGRTYRFYTGPQVYSFGTGLSYTKFEYKIL 593

Query: 616 SSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYKFTFQIEVENMGKM 672
           S+P  + +     QQ          +      + +DDV    C+  +F  ++ V N G++
Sbjct: 594 SAPIRLSLSELLPQQSSHKKQL--QHGEELRYLQLDDVIVNSCESLRFNVRVHVSNTGEI 651

Query: 673 DGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL 731
           DGS VVM++SK PP ++G   KQ+IGY+RV + + +  +  F ++ CK L + ++    +
Sbjct: 652 DGSHVVMLFSKMPPVLSGVPEKQLIGYDRVHVRSNEMMETVFVIDPCKQLSVANDVGKRV 711

Query: 732 LASGAHTILVGE 743
           +  G+H + +G+
Sbjct: 712 IPLGSHVLFLGD 723


>gi|189380221|gb|ACD93208.1| beta xylosidase [Camellia sinensis]
          Length = 767

 Score =  673 bits (1737), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 346/755 (45%), Positives = 480/755 (63%), Gaps = 40/755 (5%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
           + P+C   LP  +R +DL+ R+TL EK++ + + A  VPRLG+  YEWWSEALHGVS   
Sbjct: 39  NLPFCRVSLPIQDRVRDLIGRLTLQEKIRLLVNNAAAVPRLGIKGYEWWSEALHGVS--- 95

Query: 71  RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
              N+ PG  F    PGATSFP VI T ASFN SLW+ IG+ VS EARAMYN G AGLT+
Sbjct: 96  ---NADPGVKFGGAFPGATSFPQVISTAASFNASLWEHIGRVVSDEARAMYNGGMAGLTY 152

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           WSPN+N+ RDPRWGR  ETPGEDP + G+YA +YVRGLQ   G +         LK++AC
Sbjct: 153 WSPNVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQGNSGNQ---------LKVAAC 203

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKHY AYDLDNW   DR+ F++RV++QD+ +T+ +PF+ CV EG    V C++     I 
Sbjct: 204 CKHYTAYDLDNWNSVDRYRFNARVSKQDLADTYDVPFKACVVEGKYQ-VYCAHT----IK 258

Query: 251 TCADPKLLN--QTIRGDWNFHGYIVSDCDSIQTI--VESHKFLNDTKEDAVARVLKAGLD 306
             A+P +L         W++H ++   C  +        H  L+ T EDA A  +KAGLD
Sbjct: 259 LMANPLVLTLISPQHHPWSWHSWL--HCFRLYRCWGFICHSTLHSTPEDAAAATIKAGLD 316

Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNIC 363
           L+CG +    T  AV+QGK+ EAD++ +L     V MRLG FDG P    Y NLG  ++C
Sbjct: 317 LECGPFLAIHTEQAVRQGKLGEADVNGALINTLSVQMRLGMFDGEPSSQPYGNLGPRDVC 376

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
            P H +LA EAARQGIVLL+N   +LPL+T   +T+A++GP+++ T  M+GNY G  C +
Sbjct: 377 TPAHQQLALEAARQGIVLLQNRGRSLPLSTQLHRTVAVIGPNSDVTVTMLGNYAGVACGF 436

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
           T+P+ G   Y + I+ + GC  + C NN +   A  AA+ ADATV+V GLD S+E E KD
Sbjct: 437 TTPLQGIERYVRTIHQS-GCDSVACSNNQLFGVAETAARQADATVLVMGLDQSIETEFKD 495

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           RV LLLPG Q EL+++VA A++GPV LV+MS G +D++FAKN+P+I +ILWVGYPG+ GG
Sbjct: 496 RVGLLLPGPQQELVSRVAMASRGPVVLVLMSGGPIDVSFAKNDPRIGAILWVGYPGQAGG 555

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYP 600
            AIADV+FG+ NPGGRLP+TWY  +Y+ K P T+M +R  P + +PGRTY+F+ GPVV+P
Sbjct: 556 TAIADVLFGRTNPGGRLPMTWYPQDYLAKAPMTNMAMRANPSSGYPGRTYRFYKGPVVFP 615

Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
           FG+G+SYT F +++A +P +V + L      +  N T   N      + +    C     
Sbjct: 616 FGHGMSYTTFAHELAHAPTTVSVPLTSLYGLQ--NSTTFNN-----GIRVTHTNCDTLIL 668

Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
              I+V+N G MDG+  V+V+S PP       KQ+IG+++V + A    +V   ++ C  
Sbjct: 669 GIHIDVKNTGDMDGTHTVLVFSTPPVGKWGANKQLIGFKKVHVVARGRQRVKIHVHVCNQ 728

Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLN 755
           L +VD      +  G H++ +G+    +S  + L+
Sbjct: 729 LSVVDQFGIRRIPIGEHSLHIGDIKHSISLQVTLD 763


>gi|115485165|ref|NP_001067726.1| Os11g0297800 [Oryza sativa Japonica Group]
 gi|62734696|gb|AAX96805.1| beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|77549999|gb|ABA92796.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
           expressed [Oryza sativa Japonica Group]
 gi|113644948|dbj|BAF28089.1| Os11g0297800 [Oryza sativa Japonica Group]
 gi|125534139|gb|EAY80687.1| hypothetical protein OsI_35869 [Oryza sativa Indica Group]
 gi|215766717|dbj|BAG98945.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 782

 Score =  673 bits (1737), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 332/739 (44%), Positives = 459/739 (62%), Gaps = 30/739 (4%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +CDA LP  +RA DLV R+T  EKV Q+GD A GVPRLG+P Y+WWSEALHG++  GR  
Sbjct: 52  FCDATLPAEQRAADLVARLTAAEKVAQLGDQAAGVPRLGVPAYKWWSEALHGLATSGR-- 109

Query: 74  NSPPGTHFD---SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLT 129
               G HFD   S    ATSFP V+LT A+F++ LW +IGQ + TEARA+YN+G A GLT
Sbjct: 110 ----GLHFDAPGSAARAATSFPQVLLTAAAFDDDLWFRIGQAIGTEARALYNIGQAEGLT 165

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
            WSPN+N+ RDPRWGR  ETPGEDP +  +YA+ +V+G+Q          + S  L+ SA
Sbjct: 166 MWSPNVNIFRDPRWGRGQETPGEDPTMASKYAVAFVKGMQG---------NSSAILQTSA 216

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKH  AYDL++W G  R++F+++VT QD+++T+  PF  CV +   + +MC+Y  +NG+
Sbjct: 217 CCKHVTAYDLEDWNGVQRYNFNAKVTAQDLEDTYNPPFRSCVVDAKATCIMCAYTGINGV 276

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           P CA+  LL +T+RGDW   GYI SDCD++  + ++ ++   T EDAVA  LKAGLD++C
Sbjct: 277 PACANADLLTKTVRGDWGLDGYIASDCDAVAIMRDAQRY-TQTPEDAVAVALKAGLDMNC 335

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNP 365
           G Y       A+QQGK+ E DID +L+ L+ + MRLG+FDG P+    Y  LG  +IC P
Sbjct: 336 GTYMQQHATAAIQQGKLTEEDIDKALKNLFAIRMRLGHFDGDPRSNSVYGGLGAADICTP 395

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H  LA EAA  GIVLLKND G LPL+   + + A++GP+AN   A+IGNY G PC  T+
Sbjct: 396 EHRSLALEAAMDGIVLLKNDAGILPLDRTAVASAAVIGPNANDGLALIGNYFGPPCESTT 455

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P++G   Y K + +  GC    C   +    A   A ++D   +  GL    E+EG+DR 
Sbjct: 456 PLNGILGYIKNVRFLAGCNSAACDVAATD-QAAAVASSSDYVFLFMGLSQKQESEGRDRT 514

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            LLLPG Q  LI  VADAAK PV LV+++ G VD+ FA+ NPKI +ILW GYPG+ GG A
Sbjct: 515 SLLLPGEQQSLITAVADAAKRPVILVLLTGGPVDVTFAQTNPKIGAILWAGYPGQAGGLA 574

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
           IA V+FG +NPGGRLP+TWY   + K+P T M +R  P   +PGR+Y+F+ G  VY FGY
Sbjct: 575 IARVLFGDHNPGGRLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRFYQGKTVYKFGY 634

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           GLSY+ +  ++ S  K  +   +     R    + G        +  D   C+  KF   
Sbjct: 635 GLSYSSYSRQLVSGGKPAESYTNLLASLRTTTTSEGDESYHIEEIGTDG--CEQLKFPAV 692

Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
           +EV+N G MDG   V++Y + P   G     Q+IG+    +  G+ A + F ++ C+   
Sbjct: 693 VEVQNHGPMDGKHSVLMYLRWPNAKGGRPTTQLIGFRSQHLKVGEKANIRFDISPCEHFS 752

Query: 723 IVDNAANSLLASGAHTILV 741
            V      ++  G+H ++V
Sbjct: 753 RVRKDGKKVIDRGSHYLMV 771


>gi|357164885|ref|XP_003580200.1| PREDICTED: probable beta-D-xylosidase 6-like [Brachypodium
           distachyon]
          Length = 771

 Score =  673 bits (1736), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 339/741 (45%), Positives = 478/741 (64%), Gaps = 24/741 (3%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +P+CDA LP+P RA+ LV  +TL EK+ Q+ + A GVPRLG+P YEWWSE+LHG++    
Sbjct: 37  YPFCDASLPFPVRARALVSLLTLDEKIAQLSNTAAGVPRLGIPPYEWWSESLHGLA---- 92

Query: 72  RTNSPPGTHFDS-EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
             ++ PG +F S  V  AT FP VIL+ ASFN SLW+ + + V+ EARAM+N G AGLT+
Sbjct: 93  --DNGPGVNFSSGPVGAATIFPQVILSAASFNRSLWRAVAEAVAVEARAMHNAGQAGLTY 150

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           W+PNINV RDPRWGR  ETPGEDP V+  Y++ YV+G Q     EY    + R + +SAC
Sbjct: 151 WAPNINVFRDPRWGRGQETPGEDPAVIAAYSVEYVKGFQG----EYGDGKEGR-MMLSAC 205

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKHY AYDL+ W    R+ F+++V EQD ++T+  PF+ C+ EG  S +MCSYN+VNG+P
Sbjct: 206 CKHYVAYDLEKWGNFTRYTFNAKVNEQDFEDTYEPPFKSCIQEGRASCLMCSYNQVNGVP 265

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
            CA   LL Q +R +W F GY+VSDCD++  I     + N + ED++A VLKAG+D++CG
Sbjct: 266 ACARKDLL-QKVRDEWGFQGYVVSDCDAVGIIYGYQNYTN-SDEDSIAIVLKAGMDINCG 323

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD---GSPQYKNLGKNNICNPQH 367
            +    T  A+Q+GKI E DI+ +L  L+ V +RLG FD   G+  +  LG +NIC  +H
Sbjct: 324 SFLIRHTKSAIQKGKITEEDINHALFNLFSVQLRLGLFDKTSGNQWFTQLGPSNICTKEH 383

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
            ELAAEAARQG VLLKNDN  LPL    +  +A++GP AN    M G+Y G PC  T+ +
Sbjct: 384 RELAAEAARQGTVLLKNDNSFLPLKRSEVSHIAIIGPVANDAYIMGGDYTGVPCNPTTFL 443

Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
            G  A       A GC DI C +      AI+ AK AD  V++AGL+L+ E E  DRV L
Sbjct: 444 KGMQAVVPQTTIAAGCKDISCNSTDGFGEAIEVAKRADIVVLIAGLNLTQETEDLDRVSL 503

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
           LLPG Q +LIN +A   K P+ LVI   G VD++FAK + +I S+LW+GYPGE GG+ + 
Sbjct: 504 LLPGKQMDLINSIASVTKKPLVLVITGGGPVDVSFAKQDKRIASVLWIGYPGEVGGQVLP 563

Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGL 605
           +++FG+YNPGG+LPITWY  ++  +P   M +R  P  ++PGRTY+F+ G VVY FGYGL
Sbjct: 564 EILFGEYNPGGKLPITWYPESFTAVPMNDMNMRADPSRSYPGRTYRFYTGDVVYGFGYGL 623

Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYT-VGTNKPPCAAVLIDDV-KCKDYKFTFQ 663
           SY+++ Y +  +P    I L +      I+     T +     V ++D+  C+  KF+  
Sbjct: 624 SYSKYSYNIIQAP--TKISLSRSSAVDFISTKRAHTRRDGLDYVQVEDIASCESIKFSVH 681

Query: 664 IEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
           I V N G MDGS  V+++++    + G  +KQ++G+ER++ AAG++  V  T++ CK + 
Sbjct: 682 ISVANDGAMDGSHAVLLFTRSKSSVPGFPLKQLVGFERLYAAAGKATNVEITVDPCKLMS 741

Query: 723 IVDNAANSLLASGAHTILVGE 743
             +     +L  G+H ++VG+
Sbjct: 742 SANTEGRRVLLLGSHLLMVGD 762


>gi|242076578|ref|XP_002448225.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
 gi|241939408|gb|EES12553.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
          Length = 766

 Score =  670 bits (1728), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 334/744 (44%), Positives = 480/744 (64%), Gaps = 28/744 (3%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S +P+CDA L  P RA+ LV  +TL EK+ Q+ + A GVPRLG+P Y+WWSE+LHG++  
Sbjct: 32  SAYPFCDASLSIPARARALVSLLTLDEKIAQLSNTAGGVPRLGIPPYQWWSESLHGLA-- 89

Query: 70  GRRTNSPPGTHFDS-EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
               ++ PG +F S  V  AT+FP VIL+TA+FN SLW+ + + V+TEA  M+N G AGL
Sbjct: 90  ----DNGPGVNFSSGPVRAATTFPQVILSTAAFNRSLWRAVAEAVATEALGMHNAGQAGL 145

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T+W+PNIN+ RDPRWGR  ET GEDP V   Y++ YV+G Q  +G E         +++S
Sbjct: 146 TYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEQGEEGR-------IRLS 198

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYD++ WEG  R+ F+++V  QD+++T+  PF+ C+ E   S +MC+YN+VNG
Sbjct: 199 ACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQVNG 258

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           +P CA+  LL +T R +W F GYI SDCD++  I E+  +   + ED++A VLKAG+D++
Sbjct: 259 VPMCANKDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTY-TKSDEDSIAIVLKAGMDIN 316

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYK-NLGKNNICNP 365
           CG +    T  AV++GK+ E DID +L  L+ V +RLG FD   + Q+   LG NN+C  
Sbjct: 317 CGSFLVRHTKSAVEKGKVQEQDIDRALFNLFSVQLRLGIFDKPNNNQWSTQLGPNNVCTK 376

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H ELAAEA RQG VLLKND+  LPL    ++ +A++GP AN   AM G+Y G  C  T+
Sbjct: 377 EHRELAAEAVRQGAVLLKNDHSFLPLKRSEVRHVAIIGPSANDVYAMGGDYTGVACNPTT 436

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
            + G  AY+    +A GC D+ C +  +   AI AAK AD  V+VAGL+L+ E E  DRV
Sbjct: 437 FLKGIQAYATQTTFAAGCKDVSCNSTELFGEAIAAAKRADIVVVVAGLNLTEEREDFDRV 496

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            LLLPG Q  LI+ VA  AK P+ LV++  G VD++FAK +P+I SILW+GYPGE GG+ 
Sbjct: 497 SLLLPGKQMSLIHAVASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVGGQV 556

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
           + +++FG+YNPGG+L +TWY  ++  IP T M +R  P   +PGRTY+F+ G VVY FGY
Sbjct: 557 LPEILFGEYNPGGKLAMTWYPESFTAIPMTDMNMRADPSRGYPGRTYRFYTGDVVYGFGY 616

Query: 604 GLSYTQFKYKVASSPKSVDIKLDK--DQQCRDINYTVGTNKPPCAAVLIDDV-KCKDYKF 660
           GLSY+++ Y + S+PK + +      D   R  +Y     +     V  +D+  C+   F
Sbjct: 617 GLSYSKYSYSILSAPKKITMSRSSVLDIISRKPSY---IRRDGLDFVKTEDIASCEALAF 673

Query: 661 TFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
           +  + V N G MDGS  V+++++    + G  IKQ++G+ERV  AAG ++ V  +++ CK
Sbjct: 674 SVHVAVSNHGSMDGSHAVLLFARSKSSVPGFPIKQLVGFERVHTAAGSASNVEISVDPCK 733

Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
            +   +     +L  G H + VG+
Sbjct: 734 HMSAANPEGKRVLLLGDHVLTVGD 757


>gi|413925164|gb|AFW65096.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 829

 Score =  670 bits (1728), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 338/752 (44%), Positives = 465/752 (61%), Gaps = 33/752 (4%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            P+C+ KLP  +RA DLV RMT  EK  Q+GD+A GVPRLG+P Y+WW+EALHGV+  G+
Sbjct: 96  LPFCNTKLPAAQRAADLVSRMTPAEKASQLGDVANGVPRLGVPSYKWWNEALHGVAISGK 155

Query: 72  RTNSPPGTHFD-SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLT 129
                 G H D   V  ATSFP V+LT ASFN++LW +IGQ    EARA YN+G A GLT
Sbjct: 156 ------GIHMDRGAVRSATSFPQVLLTAASFNDNLWFRIGQATGKEARAFYNIGQAEGLT 209

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
            WSPN+N+ RDPRWGR  ETPGEDP V  RYA  +VRGLQ   G   +  S    L  SA
Sbjct: 210 MWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQ---GSSSNTKSVPPVLLTSA 266

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKH  AYDL++W+G  R+ F + VT QD+ +TF  PF  CV +G  S VMC+Y  VNG+
Sbjct: 267 CCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMCAYTSVNGV 326

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           P+CA+  LL +T RG W   GY+ +DCD++ +I+ + +F   T ED VA  LKAGLD+DC
Sbjct: 327 PSCANADLLTKTFRGSWGLDGYVAADCDAV-SIMRNSQFYRPTAEDTVATTLKAGLDIDC 385

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
           G Y     M A+Q+GK+ + D+D +++ L+   MRLG+FDG P+   Y NLG  +IC  +
Sbjct: 386 GPYVQQHAMAAIQKGKLTQQDVDKAVKNLFTTRMRLGHFDGDPKAHVYGNLGAAHICTQE 445

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
           H  LA EAA  GIVLLKN  G LPL  G++ + A++G +AN   A++GNY G PC  T+P
Sbjct: 446 HKNLALEAALDGIVLLKNSAGVLPLKRGSVASAAVIGHNANDVLALLGNYWGPPCAPTTP 505

Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
           + G   Y K + +  GC    C N +  P A   A  +D+ ++  GL    E+EGKDR  
Sbjct: 506 LQGIQGYVKNVRFLAGCHKAAC-NVAATPQAAALASTSDSVILFMGLSQEQESEGKDRTT 564

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           LLLPG Q  LI  VA+AAK PV LV+++ G VDI FA+ NPKI +ILW GYPG+ GG AI
Sbjct: 565 LLLPGNQQSLITAVANAAKRPVILVLLTGGPVDITFAQANPKIGAILWAGYPGQAGGLAI 624

Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLS 606
           A V+FG+ NP GRLP+TWY   + K+P T M +R   ++PGR+Y+F+ G  +Y FGYGLS
Sbjct: 625 AKVLFGEKNPSGRLPVTWYPEEFTKVPMTDMRMRSAGSYPGRSYRFYKGKTIYKFGYGLS 684

Query: 607 YTQFKYKVASS----PKSVDIKLDKDQQCR---DINYTVGTNKPPCAAVLIDDVKCKDYK 659
           Y++F ++V ++      +  + L          +++Y V           I D  C+  K
Sbjct: 685 YSKFSHRVVTARNNPAHNTTLLLAAGHAATTEDNLSYHVDH---------IGDELCRQLK 735

Query: 660 FTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
           F   ++V+N G MDG    +++ + P    G   +Q++G++   I AG+ A + F ++ C
Sbjct: 736 FLAVVKVQNHGPMDGKHTALMFLRWPNATDGRPARQLVGFQSQHIKAGEKAHLRFEVSPC 795

Query: 719 KSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
           +    V +    ++  G+H + VG+    +SF
Sbjct: 796 EDFSRVRDDGRKVIDKGSHFLKVGKHELEISF 827


>gi|356531391|ref|XP_003534261.1| PREDICTED: probable beta-D-xylosidase 6-like [Glycine max]
          Length = 780

 Score =  669 bits (1727), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 334/740 (45%), Positives = 472/740 (63%), Gaps = 18/740 (2%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           P+CD  LP   RA+ LV  +TLPEK+  + + A  +PRLG+P Y+WWSE+LHG++  G  
Sbjct: 40  PFCDTSLPTLTRARSLVSLLTLPEKILLLSNNASSIPRLGIPAYQWWSESLHGLALNG-- 97

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
               PG  F   VP ATSFP VIL+ ASFN SLW +    ++ EARAM+N+G AGLTFW+
Sbjct: 98  ----PGVSFAGAVPSATSFPQVILSAASFNRSLWLRTAAAIAREARAMFNVGQAGLTFWA 153

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR-PLKISACC 191
           PNIN+ RDPRWGR  ETPGEDP +   YA+ YVRGLQ + G++     D    L +SACC
Sbjct: 154 PNINLFRDPRWGRGQETPGEDPMLASAYAVEYVRGLQGLSGIQDAVVVDDDDTLMVSACC 213

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KH+ AYDLD W    R++F++ V++QD+++T+  PF  C+ +G  S +MCSYN VNG+P 
Sbjct: 214 KHFTAYDLDMWGQFSRYNFNAVVSQQDLEDTYQPPFRSCIQQGKASCLMCSYNEVNGVPA 273

Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
           CA  +LL    R  W F GYI SDCD++ T+ E  K+   ++EDAVA VLKAG+D++CG 
Sbjct: 274 CASEELLGLA-RDKWGFKGYITSDCDAVATVYEYQKYAK-SQEDAVADVLKAGMDINCGT 331

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQHI 368
           +    T  A++QGK+ E D+D +L  L+ V +RLG FDG P   ++  LG  ++C  +H 
Sbjct: 332 FMLRHTESAIEQGKVKEEDLDRALLNLFSVQLRLGLFDGDPIRGRFGKLGPKDVCTQEHK 391

Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
            LA +AARQGIVLLKND   LPL+     +LA++GP A  TK + G Y G PC  +S  +
Sbjct: 392 TLALDAARQGIVLLKNDKKFLPLDRDIGASLAVIGPLATTTK-LGGGYSGIPCSSSSLYE 450

Query: 429 GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
           G   +++ I+YA GC D+ C ++     AID AK AD  VIVAGLD + E E  DRV LL
Sbjct: 451 GLGEFAERISYAFGCYDVPCDSDDGFAEAIDTAKQADFVVIVAGLDATQETEDHDRVSLL 510

Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
           LPG Q  L++ VADA+K PV LV++  G +D++FA+ NP+I SI+W+GYPGE GG+A+A+
Sbjct: 511 LPGKQMNLVSSVADASKNPVILVLIGGGPLDVSFAEKNPQIASIIWLGYPGEAGGKALAE 570

Query: 549 VIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLS 606
           +IFG++NP GRLP+TWY   +  +P   M +R  P   +PGRTY+F+ G  VY FG+GLS
Sbjct: 571 IIFGEFNPAGRLPMTWYPEAFTNVPMNEMSMRADPSRGYPGRTYRFYTGGRVYGFGHGLS 630

Query: 607 YTQFKYKVASSPKSVDI-KLDKDQQCRDINYTVGTNKPPCAAVLIDDVK-CKDYKFTFQI 664
           ++ F Y   S+P  + + +  KD   + + Y V         V ++ ++ C    F+  I
Sbjct: 631 FSDFSYNFLSAPSKISLSRTIKDGSRKRLLYQVENEVYGVDYVPVNQLQNCNKLSFSVHI 690

Query: 665 EVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKI 723
            V N+G +DGS VVM++SK P +  G+   Q++G+ R+   + +  +    ++ C+ L  
Sbjct: 691 SVMNLGGLDGSHVVMLFSKGPKVVDGSPETQLVGFSRLHTISSKPTETSILVHPCEHLSF 750

Query: 724 VDNAANSLLASGAHTILVGE 743
            D     +L  G HT+ VG+
Sbjct: 751 ADKQGKRILPLGPHTLSVGD 770


>gi|15218202|ref|NP_177929.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
 gi|259585708|sp|Q9SGZ5.2|BXL7_ARATH RecName: Full=Probable beta-D-xylosidase 7; Short=AtBXL7; Flags:
           Precursor
 gi|18086336|gb|AAL57631.1| At1g78060/F28K19_32 [Arabidopsis thaliana]
 gi|332197942|gb|AEE36063.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
          Length = 767

 Score =  669 bits (1727), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 344/756 (45%), Positives = 481/756 (63%), Gaps = 35/756 (4%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           + +C   LP  +RA+DLV R+T+ EK+ Q+ + A G+PRLG+P YEWWSEALHGV++ G 
Sbjct: 36  YQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGIPRLGVPAYEWWSEALHGVAYAG- 94

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
                PG  F+  V  ATSFP VILT ASF+   W +I Q +  EAR +YN G A G+TF
Sbjct: 95  -----PGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEARGVYNAGQANGMTF 149

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ--DVEGVEYHRDSDSRPLKIS 188
           W+PNIN+ RDPRWGR  ETPGEDP + G YA+ YVRGLQ    +G    R + S  L+ S
Sbjct: 150 WAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDSFDG----RKTLSNHLQAS 205

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKH+ AYDLD W+G  R+ F+++V+  D+ ET+  PF+ C+ EG  S +MC+YNRVNG
Sbjct: 206 ACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQPPFKKCIEEGRASGIMCAYNRVNG 265

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           IP+CADP LL +T RG W F GYI SDCD++  I ++  +   + EDAVA VLKAG+D++
Sbjct: 266 IPSCADPNLLTRTARGQWAFRGYITSDCDAVSIIYDAQGYAK-SPEDAVADVLKAGMDVN 324

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG Y    T  A+QQ K++E DID +L  L+ V +RLG F+G P    Y N+  N +C+P
Sbjct: 325 CGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGDPTKLPYGNISPNEVCSP 384

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            H  LA +AAR GIVLLKN+   LP +  ++ +LA++GP+A+  K ++GNY G PC+  +
Sbjct: 385 AHQALALDAARNGIVLLKNNLKLLPFSKRSVSSLAVIGPNAHVVKTLLGNYAGPPCKTVT 444

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+D   +Y K   Y  GC  + C +N+ I  A+  AKNAD  V++ GLD + E E  DRV
Sbjct: 445 PLDALRSYVKNAVYHQGCDSVAC-SNAAIDQAVAIAKNADHVVLIMGLDQTQEKEDFDRV 503

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           DL LPG Q ELI  VA+AAK PV LV++  G VDI+FA NN KI SI+W GYPGE GG A
Sbjct: 504 DLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFAANNNKIGSIIWAGYPGEAGGIA 563

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGL 605
           I+++IFG +NPGGRLP+TWY  ++V I  T M +R    +PGRTYKF+ GP VY FG+GL
Sbjct: 564 ISEIIFGDHNPGGRLPVTWYPQSFVNIQMTDMRMRSATGYPGRTYKFYKGPKVYEFGHGL 623

Query: 606 SYTQFKYKVAS-SPKSVDIKLDKDQQCRD-INYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           SY+ + Y+  + +  ++ +   K Q   D + YT+ +         +    C   K    
Sbjct: 624 SYSAYSYRFKTLAETNLYLNQSKAQTNSDSVRYTLVSE--------MGKEGCDVAKTKVT 675

Query: 664 IEVENMGKMDGSEVVMVYSKPP--GIAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKS 720
           +EVEN G+M G   V+++++    G  G    KQ++G++ + ++ G+ A++ F +  C+ 
Sbjct: 676 VEVENQGEMAGKHPVLMFARHERGGEDGKRAEKQLVGFKSIVLSNGEKAEMEFEIGLCEH 735

Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
           L   +     +L  G + + VG+       PL +N+
Sbjct: 736 LSRANEFGVMVLEEGKYFLTVGDS----ELPLIVNV 767


>gi|297842585|ref|XP_002889174.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335015|gb|EFH65433.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 766

 Score =  669 bits (1725), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 341/756 (45%), Positives = 481/756 (63%), Gaps = 35/756 (4%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           + +C   LP  +RA+DLV R+ + EK+ Q+G+ A G+PRLG+P YEWWSEALHGV++ G 
Sbjct: 35  YQFCRTDLPISQRARDLVSRLNIDEKISQLGNTAPGIPRLGVPAYEWWSEALHGVAYAG- 93

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
                PG  F+  V  ATSFP VILT ASF+   W +I Q +  EAR +YN G A G+TF
Sbjct: 94  -----PGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEARGVYNAGQAQGMTF 148

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ--DVEGVEYHRDSDSRPLKIS 188
           W+PNIN+ RDPRWGR  ETPGEDP + G YA+ YVRGLQ    +G    R + S  L+ S
Sbjct: 149 WAPNINIFRDPRWGRGQETPGEDPIMTGTYAVAYVRGLQGDSFDG----RKTLSIHLQAS 204

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKH+ AYDLD W+G  R+ F+++V+  D+ ET+  PF+ C+ EG  S +MC+YNRVNG
Sbjct: 205 ACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQPPFKKCIEEGRASGIMCAYNRVNG 264

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           IP+CADP LL +T RG W F GYI SDCD++  I ++  +   T EDAVA VLKAG+D++
Sbjct: 265 IPSCADPNLLTRTARGLWRFRGYITSDCDAVSIIHDAQGYAK-TPEDAVADVLKAGMDVN 323

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG Y    T  A+QQ K++E DID +L  L+ V +RLG F+G P    Y N+  N++C+P
Sbjct: 324 CGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGDPTKLPYGNISPNDVCSP 383

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            H  LA EAAR GIVLLKN+   LP +  ++ +LA++GP+A+  K ++GNY G PC+  +
Sbjct: 384 AHQALALEAARNGIVLLKNNLKLLPFSKRSVSSLAVIGPNAHVAKTLLGNYAGPPCKTVT 443

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+D   +Y K   Y  GC  + C +N+ I  A+  A+NAD  V++ GLD + E E  DRV
Sbjct: 444 PLDALRSYVKNAVYHNGCDSVAC-SNAAIDQAVAIARNADHVVLIMGLDQTQEKEDMDRV 502

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           DL LPG Q ELI  VA+AAK PV LV++  G VDI+FA NN KI SI+W GYPGE GG A
Sbjct: 503 DLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFATNNDKIGSIMWAGYPGEAGGIA 562

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGL 605
           +A++IFG +NPGGRLP+TWY  ++V +  T M +R    +PGRTYKF+ GP V+ FG+GL
Sbjct: 563 LAEIIFGDHNPGGRLPVTWYPQSFVNVQMTDMRMRSATGYPGRTYKFYKGPKVFEFGHGL 622

Query: 606 SYTQFKYKVAS-SPKSVDIKLDKDQQCRD-INYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           SY+ + Y+  +    ++ +   K Q   D + YT+ +         + +  C   K    
Sbjct: 623 SYSTYSYRFKTLGATNLYLNQSKAQLNSDSVRYTLVSE--------MGEEGCNIAKTKVI 674

Query: 664 IEVENMGKMDGSEVVMVYSKPP--GIAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKS 720
           + VEN G+M G   V+++++    G  G    KQ++G++ + ++ G+ A++ F +  C+ 
Sbjct: 675 VTVENQGEMAGKHPVLMFARHERGGENGKRAEKQLVGFKSIVLSNGEKAEMEFEIGLCEH 734

Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
           L   +     ++  G + + VG+       PL +N+
Sbjct: 735 LSRANEVGVMVVEEGKYFLTVGDS----ELPLTINV 766


>gi|413925162|gb|AFW65094.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 774

 Score =  668 bits (1724), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 341/743 (45%), Positives = 460/743 (61%), Gaps = 40/743 (5%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +CD  L   +RA DLV R+T  EK+ Q+GD A GVPRLG+P Y+WW+EALHG++  G+  
Sbjct: 46  FCDVTLAPAQRAADLVSRLTAAEKIAQLGDQAPGVPRLGVPGYKWWNEALHGLATSGK-- 103

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTFWS 132
               G HFD+ V  ATSFP V+LT A+F++ LW +IGQ +  EARA++N+G A GLT WS
Sbjct: 104 ----GLHFDAAVRAATSFPQVLLTAAAFDDDLWLRIGQAIGREARALFNVGQAEGLTIWS 159

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PN+N+ RDPRWGR  ETPGEDP V  RYA+ +VRG+Q         +S S  L+ SACCK
Sbjct: 160 PNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQG--------NSSSSLLQTSACCK 211

Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           H  AYDL++W G  R+ F +RVTEQD+++TF  PF  CV E   S VMC+Y  +NG+P C
Sbjct: 212 HATAYDLEDWNGVARYSFVARVTEQDLEDTFNPPFRSCVVEAKASCVMCAYTAINGVPAC 271

Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
           A+  LL  T+RGDW   GY+ SDCD++  + ++ ++   T EDAVA  LKAGLD+DCG Y
Sbjct: 272 ANSDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYA-PTPEDAVAVSLKAGLDIDCGSY 330

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIE 369
                  A+QQGK+ E DID +L  LY V MRLG+FDG P+   Y  LG  +IC P+H  
Sbjct: 331 VQQHAAAAIQQGKLTEQDIDKALTNLYAVRMRLGHFDGDPRKNMYGVLGAADICTPEHRN 390

Query: 370 LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG 429
           LA EAA+ GIVLLKND G LPL+   + + A++GP+AN   A+I NY G PC  T+P+ G
Sbjct: 391 LALEAAQDGIVLLKNDGGILPLDRSTVTSAAVIGPNANDGMALIANYFGPPCESTTPLKG 450

Query: 430 FYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLL 489
             +Y   + +  GC    C + +    A+  A + D   +  GL    E+EGKDR  LLL
Sbjct: 451 LQSYVNDVRFLAGCNSAAC-DVAATDQAVALAGSEDYVFLFMGLSQKQESEGKDRTSLLL 509

Query: 490 PGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADV 549
           PG Q  LI  VADA+K PV LV++S G VDI FA++NPKI +ILW GYPG+ GG AIA V
Sbjct: 510 PGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPKIGAILWAGYPGQAGGLAIAKV 569

Query: 550 IFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSY 607
           +FG +NP GRLP+TWY   + K+P T M +R  P + +PGR+Y+F+ G  VY FGYGLSY
Sbjct: 570 LFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRADPTSGYPGRSYRFYQGNTVYKFGYGLSY 629

Query: 608 TQFKYKVA--------SSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           + F  ++         SS     ++     Q  D +Y V           I    C+  K
Sbjct: 630 STFSRRLVHGTSVPALSSTLLTGLRETMTPQDGDRSYHVDA---------IGTEGCEQLK 680

Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGI-AGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
           F   +EV+N G MDG   V+++ + P    G    Q+IG+    + AG++AK+ F ++ C
Sbjct: 681 FPAMVEVQNHGPMDGKHSVLMFLRWPNTKQGRPASQLIGFRSQHLKAGETAKLRFDISPC 740

Query: 719 KSLKIVDNAANSLLASGAHTILV 741
           K    V      ++  G+H ++V
Sbjct: 741 KHFSRVRADGRKVIDIGSHFLMV 763


>gi|357152329|ref|XP_003576084.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 779

 Score =  667 bits (1722), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 332/756 (43%), Positives = 463/756 (61%), Gaps = 42/756 (5%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           + +CD  LP   RA DLV R+TL EKV Q+GD A  VPRLG+P Y+WWSE LHG+SF G 
Sbjct: 47  YAFCDKALPVERRAADLVSRLTLAEKVSQLGDEADAVPRLGVPAYKWWSEGLHGLSFWGH 106

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
                 G HFD  V   TSFP V+LT ASF++ +W +IGQ + TEARA+YNLG A GLT 
Sbjct: 107 ------GMHFDGAVRAITSFPQVLLTAASFDQDIWYRIGQAIGTEARALYNLGQAQGLTI 160

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           WSPN+N+ RDPRWGR  ETPGEDP    +YA+ +V+GLQ          + +  L+ SAC
Sbjct: 161 WSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQG---------TSATTLQTSAC 211

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH  AYDL++W G  R++F+++VT QD+ +TF  PF+ CV EG  + VMC+Y  +NG+P
Sbjct: 212 CKHATAYDLEDWNGVVRYNFNAKVTLQDLADTFNPPFKSCVEEGKATCVMCAYTNINGVP 271

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
            CA   L+ +T +GDW  +GY+ SDCD++  + ++ ++   T ED VA  LKAGLDL+CG
Sbjct: 272 ACASSDLITKTFKGDWGLNGYVSSDCDAVALLRDAQRY-RATPEDTVAVALKAGLDLNCG 330

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQ 366
           +Y     M A+QQGK+ E D+D +L+ L+ V MRLG+FDG P+    Y +LG  ++C+P 
Sbjct: 331 NYTQVHGMSALQQGKMTEQDVDNALKNLFAVRMRLGHFDGDPRTSALYGSLGAADVCSPA 390

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
           H  LA EAA+ GIVLLKND G LPL+   + + A +G +AN   A+ GNY G PC  T+P
Sbjct: 391 HKNLALEAAQSGIVLLKNDAGILPLDPSAVASAAAIGHNANDPAALNGNYFGPPCETTTP 450

Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
           + G   Y K + +  GC    C   +    A+  A ++D  ++  GL    E EG DR  
Sbjct: 451 LQGLQGYVKNVKFLAGCDSAAC-GFAATGQAVTLASSSDYVILFMGLSQKEEQEGIDRTS 509

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           LLLPG Q  LI  VA A+K PV LV+++ G+VDI FAK+NPKI +ILW GYPG+ GG AI
Sbjct: 510 LLLPGKQQNLITAVASASKRPVILVLLTGGSVDITFAKSNPKIGAILWAGYPGQAGGLAI 569

Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
           A V+FG +NP GRLP+TWY   + K+P T M +R  P   +PGR+Y+F+ G  VY FG G
Sbjct: 570 ARVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRFYQGKTVYKFGDG 629

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC-----AAVLIDDV---KCK 656
           LSY++F  ++ SS  +         Q  + N   G           +   ++++    C 
Sbjct: 630 LSYSKFSRQLVSSTNT--------HQVPNTNLLTGLTARTATDGGMSYYHVEEIGVEGCD 681

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH--IKQVIGYERVFIAAGQSAKVGFT 714
             KF   +EV+N G MDG   VM++ + P   GT   + Q++G+    + AG+ A + F 
Sbjct: 682 KLKFPAVVEVQNHGPMDGKHSVMMFLRWPNSTGTGRPVSQLVGFRSQHLKAGEKASLTFD 741

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
           ++ C+           ++  G+H ++VG+    +SF
Sbjct: 742 VSPCEHFARAREDGKKVIDRGSHFLVVGKDEREISF 777


>gi|302141935|emb|CBI19138.3| unnamed protein product [Vitis vinifera]
          Length = 1411

 Score =  666 bits (1719), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 338/740 (45%), Positives = 461/740 (62%), Gaps = 55/740 (7%)

Query: 12   FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            + +C+  L   +RA DL+ R+TL EK+ Q+   A  +PRLG+P YEWWSEALHG+     
Sbjct: 710  YAFCNTTLRISQRASDLISRLTLDEKISQLISSAASIPRLGIPAYEWWSEALHGI----- 764

Query: 72   RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
                  G  F+  +  ATSFP VILT ASF+  LW +IGQ +  E RAMYN G A G+TF
Sbjct: 765  --RDRHGIRFNGTIRSATSFPQVILTAASFDAHLWYRIGQAIGIETRAMYNAGQAMGMTF 822

Query: 131  WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
            W+PNIN+ RDPRWGR  ETPGEDP V G+YA++YVRGLQ     +         L+ SAC
Sbjct: 823  WAPNINIFRDPRWGRGQETPGEDPVVAGKYAVSYVRGLQG----DTFEGGKVDVLQASAC 878

Query: 191  CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
            CKH+ AYDLDNW   DR+ FD+RVT QD+ +T+  PF  C+ EG  S +MC+YN VNG+P
Sbjct: 879  CKHFTAYDLDNWTSIDRYTFDARVTMQDLADTYQPPFRSCIEEGRASGLMCAYNLVNGVP 938

Query: 251  TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
             CAD  LL++T RG W F GYIVSDCD++  + +   +   + EDAVA VL AG+D+ CG
Sbjct: 939  NCADFNLLSKTARGQWGFDGYIVSDCDAVSLVHDVQGYAK-SPEDAVAIVLTAGMDVACG 997

Query: 311  DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
             Y       AV Q K+ E++ID +L  L+ V MRLG F+G+P+   + N+G + +C+ +H
Sbjct: 998  GYLQKHAKSAVSQKKLTESEIDRALLNLFTVRMRLGLFNGNPRKLPFGNIGPDQVCSTEH 1057

Query: 368  IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
              LA EAAR GIVLLKN +  LPL+ G   +LA++GP+ANAT  ++GNY G PC++ SP+
Sbjct: 1058 QTLALEAARSGIVLLKNSDRLLPLSKGETLSLAVIGPNANATDTLLGNYAGPPCKFISPL 1117

Query: 428  DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
             G  +Y     Y  GC D+ C + S I  A+D AK AD  V+V GLD + E E  DR+DL
Sbjct: 1118 QGLQSYVNNTMYHAGCNDVACSSAS-IENAVDVAKQADYVVLVMGLDQTQEREKYDRLDL 1176

Query: 488  LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
            +LPG Q +LI  VA AAK PV LV++  G VDI+FAK +  I SILW GYPGE GG AIA
Sbjct: 1177 VLPGKQEQLITGVAKAAKKPVVLVLLCGGPVDISFAKGSSNIGSILWAGYPGEAGGAAIA 1236

Query: 548  DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGL 605
            + IFG +NPGGRLP+TWY  +++KIP T M +R  P + +PGRT++F+ G  V+ FG GL
Sbjct: 1237 ETIFGDHNPGGRLPVTWYPKDFIKIPMTDMRMRPEPQSGYPGRTHRFYTGKTVFEFGNGL 1296

Query: 606  SYTQFKYKVAS-SPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
            SY+ + Y+  S +P  + +                 N+P    V                
Sbjct: 1297 SYSPYSYEFLSVTPNKLYL-----------------NQPSTTHV---------------- 1323

Query: 665  EVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKI 723
             VEN GKM G   V+++ K      G+ +KQ++G++ VF+ AG+S+ V F ++ C+ L  
Sbjct: 1324 -VENSGKMAGKHPVLLFVKQAKAGNGSPMKQLVGFQNVFLDAGESSNVEFILSPCEHLSR 1382

Query: 724  VDNAANSLLASGAHTILVGE 743
             +     ++  G H ++VG+
Sbjct: 1383 ANKDGLMVMEQGIHLLVVGD 1402



 Score =  628 bits (1619), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 315/607 (51%), Positives = 417/607 (68%), Gaps = 21/607 (3%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           + +C   LP P+R +DLV R+TL EK+ Q+ + A  +PRLG+P YEWWSEALHGV+  G 
Sbjct: 41  YHFCKTTLPIPDRVRDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVADAG- 99

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
                PG  F+  +  ATSFP VILT ASF+  LW +IG+ +  EARA+YN G   G+TF
Sbjct: 100 -----PGIRFNGTIRSATSFPQVILTAASFDVHLWYRIGRAIGVEARAVYNAGQTKGMTF 154

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQD--VEGVEYHRDSDSRPLKIS 188
           W+PNIN+ RDPRWGR  ETPGEDP V G YA++YVRG+Q   + G++   +     L+ S
Sbjct: 155 WAPNINIFRDPRWGRGQETPGEDPLVTGSYAVSYVRGVQGDCLRGLKRCGE-----LQAS 209

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKH+ AYDLD+W+G DRF FD+RVT QD+ +T+  PF  C+ EG  S +MC+YNRVNG
Sbjct: 210 ACCKHFTAYDLDDWKGIDRFKFDARVTMQDLADTYQPPFHRCIEEGRASGIMCAYNRVNG 269

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           +P+CAD  LL  T R  WNF GYI SDCD++  I +S+ F   T EDAV  VLKAG+D++
Sbjct: 270 VPSCADFNLLTNTARKRWNFQGYITSDCDAVSLIHDSYGFAK-TPEDAVVDVLKAGMDVN 328

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG Y  N T  AV Q K+ E+++D +L  L+ V MRLG F+G+P+   Y ++G N +C+ 
Sbjct: 329 CGTYLLNHTKSAVMQKKLPESELDRALENLFAVRMRLGLFNGNPKGQPYGDIGPNQVCSV 388

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H  LA +AAR GIVLLKN    LPL  G   +LA++GP+AN+ K +IGNY G PC++ +
Sbjct: 389 EHQTLALDAARDGIVLLKNSQRLLPLPKGKTMSLAVIGPNANSPKTLIGNYAGPPCKFIT 448

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+    +Y K   Y PGC  + C + S I  A++ A+ AD  V+V GLD + E E  DR+
Sbjct: 449 PLQALQSYVKSTMYHPGCDAVACSSPS-IEKAVEIAQKADYVVLVMGLDQTQEREAHDRL 507

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           DL+LPG Q +LI  VA+AAK PV LV++S G VDI+FAK +  I SILW GYPG  GG A
Sbjct: 508 DLVLPGKQQQLIICVANAAKKPVVLVLLSGGPVDISFAKYSNNIGSILWAGYPGGAGGAA 567

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGY 603
           IA+ IFG +NPGGRLP+TWY  ++ KIP T M +RP +N  +PGRTY+F+ G  V+ FGY
Sbjct: 568 IAETIFGDHNPGGRLPVTWYPQDFTKIPMTDMRMRPESNSGYPGRTYRFYTGEKVFEFGY 627

Query: 604 GLSYTQF 610
           GLSY+ +
Sbjct: 628 GLSYSTY 634


>gi|357156390|ref|XP_003577440.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 755

 Score =  666 bits (1718), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 340/758 (44%), Positives = 469/758 (61%), Gaps = 43/758 (5%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           + + +C+  LP  +RA DLV ++TL EKV Q+GD A GVPR G+P Y WWSE LHGVS  
Sbjct: 22  AQYAFCNRALPAEQRAADLVAKLTLEEKVSQLGDQAPGVPRFGVPGYNWWSEGLHGVSMW 81

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GL 128
           G       G HF+  V G T+FP V+LTTASF++S+W +IGQ + TEARAM+NLG A GL
Sbjct: 82  GH------GMHFNGAVRGVTTFPQVLLTTASFDDSIWYRIGQAIGTEARAMFNLGQADGL 135

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T WSPN+N+ RDPRWGR  ETPGEDP    +YA+ +VRGLQ          + +  L+ S
Sbjct: 136 TIWSPNVNIYRDPRWGRGQETPGEDPATASKYAVAFVRGLQG---------TSTTTLQTS 186

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKH  AYDLD+W    R++F+++VT QD++ETF  PF+ CV EG  + VMC+Y  VNG
Sbjct: 187 ACCKHATAYDLDDWNRIGRYNFNAKVTAQDLEETFNPPFKSCVVEGKATCVMCAYTSVNG 246

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           IP CAD  LL +TI+G+W  +GYI SDCD++  +  +    + T EDAVA  +KAGLD++
Sbjct: 247 IPACADSGLLTKTIKGEWGMNGYISSDCDAVALLYGTR--YSGTPEDAVAAAIKAGLDMN 304

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG----SPQYKNLGKNNICN 364
           CG++     M A+QQ K++E D+D +LR L+ + MRLG+FDG    SP Y  LG  ++C+
Sbjct: 305 CGNFSQVHGMAALQQRKMSEQDVDKALRNLFAIRMRLGHFDGDPLQSPLYGRLGAQDVCS 364

Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLN--TGNIKTLALVGPHANATKAMIGNYEGTPCR 422
           P H +LA EAA+ GIVLLKND   LPL+  T    + A++GP+AN   A++GNY G PC 
Sbjct: 365 PAHKDLALEAAQNGIVLLKNDAATLPLSRPTAASASFAVIGPNANEPGALLGNYFGPPCE 424

Query: 423 YTSPMDGFYA-YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
            T+P+      YSK + + PGC    C N +    A   A  +D T++  GL    E EG
Sbjct: 425 TTTPLQALQKFYSKNVRFVPGCDSAAC-NVADTYQASGLAATSDYTILFMGLSQKQEQEG 483

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
            DR  LLLPG Q  LI  VA AAK P+ LV+++ G VDI FAK NPKI +ILW GYPG+ 
Sbjct: 484 LDRTSLLLPGKQESLITAVAAAAKRPIILVLLTGGPVDITFAKFNPKIGAILWAGYPGQA 543

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVY 599
           GG AIA V+FG++NP GRLP+TWY   Y K+P   M +R  P   +PGR+Y+F+ G  VY
Sbjct: 544 GGLAIAKVLFGEHNPSGRLPVTWYPEEYTKVPMDDMRMRADPATGYPGRSYRFYKGNAVY 603

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA---VLIDDVK-- 654
            FGYGLSY++F  ++  +  S +   + +         +      C A    L++++   
Sbjct: 604 KFGYGLSYSKFSRQLVRNSSSNNRAPNTE--------LLAAAAVDCGASRYYLVEEIGGE 655

Query: 655 -CKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVG 712
            C+  KF   +EVEN G MDG + V+++ + P    G    Q++G+    + AG+ A V 
Sbjct: 656 VCERLKFPAVVEVENHGPMDGKQSVLLFLRWPTATEGRPASQLVGFRSQDLRAGEKASVS 715

Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
           F ++ C+           ++  G+H ++V E    +SF
Sbjct: 716 FDISPCEHFSRTTVDGTKVIDRGSHFLMVDEDEMEISF 753


>gi|413925166|gb|AFW65098.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 830

 Score =  665 bits (1716), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 338/753 (44%), Positives = 465/753 (61%), Gaps = 34/753 (4%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            P+C+ KLP  +RA DLV RMT  EK  Q+GD+A GVPRLG+P Y+WW+EALHGV+  G+
Sbjct: 96  LPFCNTKLPAAQRAADLVSRMTPAEKASQLGDVANGVPRLGVPSYKWWNEALHGVAISGK 155

Query: 72  RTNSPPGTHFD-SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLT 129
                 G H D   V  ATSFP V+LT ASFN++LW +IGQ    EARA YN+G A GLT
Sbjct: 156 ------GIHMDRGAVRSATSFPQVLLTAASFNDNLWFRIGQATGKEARAFYNIGQAEGLT 209

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
            WSPN+N+ RDPRWGR  ETPGEDP V  RYA  +VRGLQ   G   +  S    L  SA
Sbjct: 210 MWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQ---GSSSNTKSVPPVLLTSA 266

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKH  AYDL++W+G  R+ F + VT QD+ +TF  PF  CV +G  S VMC+Y  VNG+
Sbjct: 267 CCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMCAYTSVNGV 326

Query: 250 PTCADPKLLNQTIRGDWNFHG-YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           P+CA+  LL +T RG W   G Y+ +DCD++ +I+ + +F   T ED VA  LKAGLD+D
Sbjct: 327 PSCANADLLTKTFRGSWGLDGRYVAADCDAV-SIMRNSQFYRPTAEDTVATTLKAGLDID 385

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG Y     M A+Q+GK+ + D+D +++ L+   MRLG+FDG P+   Y NLG  +IC  
Sbjct: 386 CGPYVQQHAMAAIQKGKLTQQDVDKAVKNLFTTRMRLGHFDGDPKAHVYGNLGAAHICTQ 445

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H  LA EAA  GIVLLKN  G LPL  G++ + A++G +AN   A++GNY G PC  T+
Sbjct: 446 EHKNLALEAALDGIVLLKNSAGVLPLKRGSVASAAVIGHNANDVLALLGNYWGPPCAPTT 505

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+ G   Y K + +  GC    C N +  P A   A  +D+ ++  GL    E+EGKDR 
Sbjct: 506 PLQGIQGYVKNVRFLAGCHKAAC-NVAATPQAAALASTSDSVILFMGLSQEQESEGKDRT 564

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            LLLPG Q  LI  VA+AAK PV LV+++ G VDI FA+ NPKI +ILW GYPG+ GG A
Sbjct: 565 TLLLPGNQQSLITAVANAAKRPVILVLLTGGPVDITFAQANPKIGAILWAGYPGQAGGLA 624

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGL 605
           IA V+FG+ NP GRLP+TWY   + K+P T M +R   ++PGR+Y+F+ G  +Y FGYGL
Sbjct: 625 IAKVLFGEKNPSGRLPVTWYPEEFTKVPMTDMRMRSAGSYPGRSYRFYKGKTIYKFGYGL 684

Query: 606 SYTQFKYKVASS----PKSVDIKLDKDQQCR---DINYTVGTNKPPCAAVLIDDVKCKDY 658
           SY++F ++V ++      +  + L          +++Y V           I D  C+  
Sbjct: 685 SYSKFSHRVVTARNNPAHNTTLLLAAGHAATTEDNLSYHVDH---------IGDELCRQL 735

Query: 659 KFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNA 717
           KF   ++V+N G MDG    +++ + P    G   +Q++G++   I AG+ A + F ++ 
Sbjct: 736 KFLAVVKVQNHGPMDGKHTALMFLRWPNATDGRPARQLVGFQSQHIKAGEKAHLRFEVSP 795

Query: 718 CKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
           C+    V +    ++  G+H + VG+    +SF
Sbjct: 796 CEDFSRVRDDGRKVIDKGSHFLKVGKHELEISF 828


>gi|357489441|ref|XP_003615008.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355516343|gb|AES97966.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 798

 Score =  664 bits (1712), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 343/761 (45%), Positives = 475/761 (62%), Gaps = 48/761 (6%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            P+C+  L   +RAKD+V R+TL EK+ Q+ + A  +PRLG+P Y+WW EALHGV+  G+
Sbjct: 48  LPFCNLNLTITQRAKDIVSRLTLDEKISQLVNTAPSIPRLGIPSYQWWDEALHGVANAGK 107

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
                 G   +  V GATSFP VILT ASF+  LW +I + + TEAR +YN G A G+TF
Sbjct: 108 ------GIRLNGSVAGATSFPQVILTAASFDSKLWYQISKVIGTEARGVYNAGQAQGMTF 161

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ--DVEGVEYHRDSDSRPLKIS 188
           W+PNIN+ RDPRWGR  ET GEDP V  +Y ++YVRGLQ    EG +   D     LK S
Sbjct: 162 WAPNINIFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQGDSFEGGKLIGDR----LKAS 217

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRV----------------TEQDMQETFILPFEMCVN 232
           ACCKH+ AYDLDNW+G DRF FD++V                T QD+ +T+  PF  C+ 
Sbjct: 218 ACCKHFTAYDLDNWKGLDRFDFDAKVSFLFSMAYSPWMINYVTLQDLADTYQPPFHSCIV 277

Query: 233 EGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDT 292
           +G  S +MC+YNRVNG+P CAD  LL +T R  WNF+GYI SDC++++ I ++  +   T
Sbjct: 278 QGRSSGIMCAYNRVNGVPNCADYNLLTKTARQKWNFNGYITSDCEAVRIIYDNQGYAK-T 336

Query: 293 KEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP 352
            EDAVA VL+AG+D++CGDY T     AV Q K+  + ID +L  L+ + +RLG FDG+P
Sbjct: 337 PEDAVADVLQAGMDVECGDYLTKHAKAAVLQKKVPISQIDRALHNLFTIRIRLGLFDGNP 396

Query: 353 ---QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN-A 408
              QY  +G N +C+ ++++LA EAAR GIVLLKN    LPL    + TL ++GP+AN +
Sbjct: 397 TKLQYGRIGPNQVCSKENLDLALEAARSGIVLLKNTASILPLP--RVNTLGVIGPNANKS 454

Query: 409 TKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
           +K ++GNY G PCR    + GFY Y+   +Y  GC D     ++ I  A++ AK +D  +
Sbjct: 455 SKVVLGNYFGRPCRLVPILKGFYTYASQTHYRSGCLDGTKCASAEIDRAVEVAKISDYVI 514

Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
           +V GLD S E E +DR DL LPG Q ELIN VA A+K PV LV++  G VDI FAKNN K
Sbjct: 515 LVMGLDQSQERESRDRDDLELPGKQQELINSVAKASKKPVILVLLCGGPVDITFAKNNDK 574

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFP 586
           I  I+W GYPGE GGRA+A V+FG YNPGGRLP+TWY  +++KIP T M +R  P + +P
Sbjct: 575 IGGIIWAGYPGELGGRALAQVVFGDYNPGGRLPMTWYPKDFIKIPMTDMRMRADPSSGYP 634

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY+F+ GP VY FGYGLSY+ + Y        + +K +     +   +++  N     
Sbjct: 635 GRTYRFYTGPKVYEFGYGLSYSNYSYNF------ISVKNNNLHINQSTTHSILENSETIY 688

Query: 647 AVLIDDV---KCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVF 702
             L+ ++    CK    +  + + N G M G   V+++ KP  G  G  +KQ++G+E V 
Sbjct: 689 YKLVSELGEETCKTMSISVTLGITNTGSMAGKHPVLLFVKPKKGRNGNPVKQLVGFESVT 748

Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
           +  G   +VGF ++ C+ L   + +   ++  G H ++VGE
Sbjct: 749 VEGGGKGEVGFEVSVCEHLSRANESGVKVIEEGGHLLVVGE 789


>gi|225459350|ref|XP_002285805.1| PREDICTED: probable beta-D-xylosidase 7-like [Vitis vinifera]
          Length = 774

 Score =  663 bits (1710), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 345/741 (46%), Positives = 475/741 (64%), Gaps = 25/741 (3%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           + +C   LP P+R +DLV R+TL EK+ Q+ + A  +PRLG+P YEWWSEALHGV+  G 
Sbjct: 41  YHFCKTTLPIPDRVRDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVADAG- 99

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
                PG  F+  +  ATSFP VILT ASF+  LW +IG+ +  EARA+YN G   G+TF
Sbjct: 100 -----PGIRFNGTIRSATSFPQVILTAASFDVHLWYRIGRAIGVEARAVYNAGQTKGMTF 154

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQD--VEGVEYHRDSDSRPLKIS 188
           W+PNIN+ RDPRWGR  ETPGEDP V G YA++YVRG+Q   + G++   +     L+ S
Sbjct: 155 WAPNINIFRDPRWGRGQETPGEDPLVTGSYAVSYVRGVQGDCLRGLKRCGE-----LQAS 209

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKH+ AYDLD+W+G DRF FD+RVT QD+ +T+  PF  C+ EG  S +MC+YNRVNG
Sbjct: 210 ACCKHFTAYDLDDWKGIDRFKFDARVTMQDLADTYQPPFHRCIEEGRASGIMCAYNRVNG 269

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           +P+CAD  LL  T R  WNF GYI SDCD++  I +S+ F   T EDAV  VLKAG+D++
Sbjct: 270 VPSCADFNLLTNTARKRWNFQGYITSDCDAVSLIHDSYGFAK-TPEDAVVDVLKAGMDVN 328

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG Y  N T  AV Q K+ E+++D +L  L+ V MRLG F+G+P+   Y ++G N +C+ 
Sbjct: 329 CGTYLLNHTKSAVMQKKLPESELDRALENLFAVRMRLGLFNGNPKGQPYGDIGPNQVCSV 388

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H  LA +AAR GIVLLKN    LPL  G   +LA++GP+AN+ K +IGNY G PC++ +
Sbjct: 389 EHQTLALDAARDGIVLLKNSQRLLPLPKGKTMSLAVIGPNANSPKTLIGNYAGPPCKFIT 448

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+    +Y K   Y PGC  + C + S I  A++ A+ AD  V+V GLD + E E  DR+
Sbjct: 449 PLQALQSYVKSTMYHPGCDAVACSSPS-IEKAVEIAQKADYVVLVMGLDQTQEREAHDRL 507

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           DL+LPG Q +LI  VA+AAK PV LV++S G VDI+FAK +  I SILW GYPG  GG A
Sbjct: 508 DLVLPGKQQQLIICVANAAKKPVVLVLLSGGPVDISFAKYSNNIGSILWAGYPGGAGGAA 567

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGY 603
           IA+ IFG +NPGGRLP+TWY  ++ KIP T M +RP +N  +PGRTY+F+ G  V+ FGY
Sbjct: 568 IAETIFGDHNPGGRLPVTWYPQDFTKIPMTDMRMRPESNSGYPGRTYRFYTGEKVFEFGY 627

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           GLSY+ +  +     ++   KL  +Q      Y    +    +   +    C     +  
Sbjct: 628 GLSYSTYSCETIPVTRN---KLYFNQSSTAHVYENTDSIRYTSVAELGKELCDSNNISIS 684

Query: 664 IEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
           I V N G+M G   V+++  +    AG+ IKQ++ ++ V +  G+SA VGF +N C+   
Sbjct: 685 IRVRNDGEMAGKHSVLLFVRRLKASAGSPIKQLVAFQSVHLNGGESADVGFLLNPCEHFS 744

Query: 723 IVDNAANSLLASGAHTILVGE 743
             +     ++  G H ++VG+
Sbjct: 745 GPNKDGLMVIEEGTHFLVVGD 765


>gi|414586138|tpg|DAA36709.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 769

 Score =  662 bits (1709), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 336/744 (45%), Positives = 483/744 (64%), Gaps = 28/744 (3%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S +P+CDA L  P RA+ LV  +TL EK+ Q+ + A GVPRLG+P Y+WWSE+LHG++  
Sbjct: 35  SAYPFCDASLSIPARARALVSLLTLDEKIAQLSNTAGGVPRLGIPPYQWWSESLHGLA-- 92

Query: 70  GRRTNSPPGTHFDS-EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
               ++ PG +F S  V  AT FP VIL+TA+FN SLW+ + + V+TEA  M+N G AGL
Sbjct: 93  ----DNGPGVNFSSGPVRAATDFPQVILSTAAFNRSLWRAVAEAVATEALGMHNAGQAGL 148

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T+W+PNIN+ RDPRWGR  ET GEDP V   Y++ YV+G Q  EG E         +++S
Sbjct: 149 TYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEEGEEGR-------IRLS 201

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYD++ WEG  R+ F+++V  QD+++T+  PF+ C+ E   S +MC+YN+VNG
Sbjct: 202 ACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQVNG 261

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           +P CA   LL +T R +W F GYI SDCD++  I E+  +   + ED++A VLKAG+D++
Sbjct: 262 VPMCAHKDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTY-TKSGEDSIAIVLKAGMDIN 319

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD---GSPQYKNLGKNNICNP 365
           CG +    T  A+++GKI E DID +L  L+ V +RLG FD    +  +  LG N++C  
Sbjct: 320 CGSFLVRHTKSAIEKGKIQEEDIDRALFNLFSVQLRLGIFDKPSNNQWFSQLGPNSVCTK 379

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H ELAAEA RQG VLLKND+  LPL    ++ +A++GP AN   AM G+Y G PC  T+
Sbjct: 380 EHRELAAEAVRQGAVLLKNDHNFLPLKRSEVRHVAIIGPSANDAYAMGGDYTGVPCNPTT 439

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
            + G  AY+   ++APGC D  C +  +   A++AAK AD  V++AGL+L+ E E  DRV
Sbjct: 440 FLKGIQAYATQTSFAPGCKDASCNSTDLFGEAVEAAKRADIVVVIAGLNLTEEREDFDRV 499

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            LLLPG Q  LI+ +A  AK P+ LV++  G VD++FAK +P+I SILW+GYPGE GG+ 
Sbjct: 500 SLLLPGKQMGLIHAIASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVGGQV 559

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
           + +++FG+YNPGG+LPITWY  ++  IP T M +R  P   +PGRTY+F+ G VVY FGY
Sbjct: 560 LPEILFGEYNPGGKLPITWYPESFTAIPMTDMNMRADPSRGYPGRTYRFYTGDVVYGFGY 619

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQ--CRDINYTVGTNKPPCAAVLIDDV-KCKDYKF 660
           GLSY+++ Y ++S+PK + +    D     R   Y   T +    +V  +D+  C+   F
Sbjct: 620 GLSYSKYSYSISSAPKKITVSRSSDLGIISRKPAY---TRRDGLGSVKTEDIASCEALVF 676

Query: 661 TFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
           +  + V N G MDGS  V+++++    + G  IKQ++G+E V  AAG ++ V  T++ CK
Sbjct: 677 SVHVAVSNHGSMDGSHAVLLFARSKSSVPGFPIKQLVGFESVHTAAGSASNVEITVDPCK 736

Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
            +   +     +L  GAH + VG+
Sbjct: 737 QMSAANPEGKRVLLLGAHVLTVGD 760


>gi|357489431|ref|XP_003615003.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355516338|gb|AES97961.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 780

 Score =  661 bits (1706), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 339/744 (45%), Positives = 473/744 (63%), Gaps = 28/744 (3%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
            FP+C+  L   +RAKD+V R+TL EK+ Q+ + A  +PRLG+P Y+WW+EALHGVS++G
Sbjct: 45  SFPFCNLNLTITQRAKDIVSRLTLDEKISQLVNTAPAIPRLGIPSYQWWNEALHGVSYVG 104

Query: 71  RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLT 129
           +      G   +  +  ATSFP +IL  ASF+  LW +I + + TEAR +YN G A G+T
Sbjct: 105 K------GIRLNGSITAATSFPQIILIAASFDPKLWYRISKVIGTEARGVYNAGQAQGMT 158

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
           FW+PNIN+ RDPRWGR  ET GEDP V  +Y ++YVRGLQ  +  E  +    R LK SA
Sbjct: 159 FWAPNINIFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQG-DSFEGGKLIGGR-LKASA 216

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKH+ AYDL+NW+G +R+ FD++VT QD+ +T+   F  CV +G  S +MC+YNRVNG+
Sbjct: 217 CCKHFTAYDLENWKGVNRYVFDAKVTLQDLADTYQPSFHSCVVQGRSSGIMCAYNRVNGV 276

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           P CAD  LL  T R  WNF+GYI SDCD+++ I E   +   T ED VA VL+AG+D++C
Sbjct: 277 PNCADYNLLTNTARKKWNFNGYIASDCDAVRFIYEKQGYAK-TPEDVVADVLRAGMDVEC 335

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQ 366
           G+Y T     AV Q KI  + ID +L  L+ + +RLG FDG+P   QY  +G N +C+ +
Sbjct: 336 GNYMTKHAKSAVLQKKIPISQIDRALHNLFTIRIRLGLFDGNPTKLQYGRIGPNQVCSKE 395

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATK-AMIGNYEGTPCRYTS 425
           +++LA EAAR GIVLLKN    LPL    + TL ++GP+AN +   ++GNY G PC+  S
Sbjct: 396 NLDLALEAARSGIVLLKNTASILPLP--RVNTLGVIGPNANKSSIVLLGNYFGQPCKQVS 453

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
            + GFY Y+   +Y  GC D V   ++ I  A++ AK +D  ++V GLD S E E  DR 
Sbjct: 454 ILKGFYTYASQTHYRSGCTDGVKCASAEIDRAVEVAKISDYVILVMGLDQSQETETLDRD 513

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            L LPG Q +LIN VA A+K PV LVI+  G VDI FAKNN KI  I+W GYPGE GGRA
Sbjct: 514 HLELPGKQQKLINSVAKASKKPVILVILCGGPVDITFAKNNDKIGGIIWAGYPGELGGRA 573

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
           +A V+FG YNPGGRLP+TWY  +++KIP T M +R  P + +PGRTY+F+ GP VY FGY
Sbjct: 574 LAQVVFGDYNPGGRLPMTWYPKDFIKIPMTDMRMRADPSSGYPGRTYRFYTGPKVYEFGY 633

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYKF 660
           GLSY+ + Y   S  K+ +I +++        +++  N       L+ ++    CK    
Sbjct: 634 GLSYSNYSYNFISV-KNNNIHINQST-----THSILENSETIRYKLVSELGKKACKTMSI 687

Query: 661 TFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
           +  + + N G M G   V+++ KP  G  G  +KQ++G+E V +  G   +VGF ++ C+
Sbjct: 688 SVTLGITNTGSMAGKHPVLLFVKPKKGRNGNPVKQLVGFESVTVEGGGKGEVGFEVSVCE 747

Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
            L   + +   ++  G +  LVGE
Sbjct: 748 HLSRANESGVKVIEEGGYLFLVGE 771


>gi|357156904|ref|XP_003577615.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 767

 Score =  660 bits (1703), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 341/746 (45%), Positives = 464/746 (62%), Gaps = 37/746 (4%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S + +CDA LP  +RA DLV R+T  EKV Q+GD A GVPRLG+P Y+WW+EALHG++  
Sbjct: 34  SSYAFCDAALPVAQRAADLVSRLTAAEKVAQLGDEAAGVPRLGVPGYKWWNEALHGLATS 93

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GL 128
           G+      G HFD  V  ATSFP V LT A+F++ LW +IGQ +  EARA+YNLG A GL
Sbjct: 94  GK------GLHFDGAVRSATSFPQVCLTAAAFDDDLWFRIGQAIGREARALYNLGQAEGL 147

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T WSPN+N+ RDPRWGR  ETPGEDP    RYA+ +VRG+Q          + +  L+ S
Sbjct: 148 TMWSPNVNIYRDPRWGRGQETPGEDPTTASRYAVAFVRGMQG---------NSTSLLQAS 198

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKH  AYDL++W G  R++FD++VT QD+++TF  PF  CV +G  S VMC+Y  +NG
Sbjct: 199 ACCKHATAYDLEDWNGVARYNFDAKVTAQDLEDTFNPPFRSCVVDGKASCVMCAYTGING 258

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           +P CA+  LL +T+RGDW   GY  SDCD++  + ++ ++   + EDAVA  LKAGLD+D
Sbjct: 259 VPACANADLLTKTVRGDWGLDGYTASDCDAVAIMRDAQRYAQ-SPEDAVALALKAGLDID 317

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG Y       A+QQGKI E DID +L+ L+ + MRLG+FDG P+   Y  LG  +IC  
Sbjct: 318 CGTYMQQHAAAAIQQGKITEEDIDKALKNLFAIRMRLGHFDGDPRTNMYGGLGAADICTA 377

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H  LA +AA+ GIVLLKND G LPL+   + + A++GP+AN   A+I NY G PC  T+
Sbjct: 378 EHRSLALDAAQDGIVLLKNDAGILPLDRAAVASTAVIGPNANNPGALIANYFGPPCESTT 437

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+ G   Y K   +  GC+   C   +   AA   A  +D   +  GL    E+EG+DR 
Sbjct: 438 PLKGIQGYVKDARFLAGCSSTACDVATTDQAAA-LASTSDYVFLFMGLGQRQESEGRDRT 496

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            LLLPG Q  LI  VADAA+ PV LV++S G VD+ FA+ NPKI +ILW GYPG+ GG A
Sbjct: 497 SLLLPGKQQSLITAVADAAQRPVILVLLSGGPVDVTFAQTNPKIGAILWAGYPGQAGGLA 556

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
           IA V+FG +NP GRLP+TWY   +  +P T M +R  P N +PGR+Y+F+ G  VY FGY
Sbjct: 557 IARVLFGDHNPSGRLPVTWYPEEFTNVPMTDMRMRADPANGYPGRSYRFYQGKTVYKFGY 616

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL-------IDDVKCK 656
           GLSY+ +  ++ SS  S            D+  ++ T  P    +L       I    C+
Sbjct: 617 GLSYSSYSRRLLSSGTSTPAP------NADLLASLTTTMPSAENILGSYHVEQIGAQGCE 670

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
             KF   +EV+N G MDG + V++Y + P   AG   +Q+IG+++  + AG+ A + F +
Sbjct: 671 MLKFPAVVEVQNHGPMDGKQSVLMYLRWPNATAGRPERQLIGFKKEHLKAGEKAHIKFEI 730

Query: 716 NACKSLKIVDNAANSLLASGAHTILV 741
             C+ L  V    N ++  G+H + V
Sbjct: 731 RPCEHLSRVREDGNKVIDRGSHFLRV 756


>gi|253761872|ref|XP_002489310.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
 gi|241946958|gb|EES20103.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
          Length = 772

 Score =  659 bits (1700), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 331/740 (44%), Positives = 457/740 (61%), Gaps = 33/740 (4%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +CD  L   +RA DLV R+T  EK+ Q+GD A GVPRLG+P Y+WW+EALHG++  G+  
Sbjct: 43  FCDVTLSPAQRAADLVSRLTPAEKIAQLGDQATGVPRLGVPGYKWWNEALHGLATSGK-- 100

Query: 74  NSPPGTHFD--SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
               G HFD    V  ATSFP V+LT A+F++ LW +IGQ +  EARA++N+G A GLT 
Sbjct: 101 ----GLHFDVVGGVRAATSFPQVLLTAAAFDDDLWFRIGQAIGREARALFNVGQAEGLTI 156

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           WSPN+N+ RDPRWGR  ETPGEDP V  RYA+ +VRG+Q         +S S  L+ SAC
Sbjct: 157 WSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQG--------NSSSSLLQTSAC 208

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH  AYDL++W G  R+ F +RVT QD+++TF  PF  CV EG  S +MC+Y  +NG+P
Sbjct: 209 CKHATAYDLEDWNGVARYSFVARVTAQDLEDTFNPPFRSCVVEGKASCIMCAYTAINGVP 268

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
            CA+  LL  T+RGDW   GY+ SDCD++  + ++ ++   T EDAVA  LKAGLD+DCG
Sbjct: 269 ACANTDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYA-PTPEDAVAVSLKAGLDIDCG 327

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
            Y       A+QQGK+ E DID +L  L+ V MRLG+FDG P+   Y  L   +IC P+H
Sbjct: 328 SYIQQHATAAIQQGKLTELDIDKALVNLFAVRMRLGHFDGDPRKNMYGALSAADICTPEH 387

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
             LA EAA+ GIVLLKND G LPL+   + + A++GP++N   A+I NY G PC  T+P+
Sbjct: 388 RSLALEAAQDGIVLLKNDGGILPLDRSTVTSAAVIGPNSNDGMALIANYFGPPCESTTPL 447

Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
            G  +Y   + +  GC+   C + ++   A+  + + D   +  GL    E+EGKDR  L
Sbjct: 448 QGLQSYVNNVRFLAGCSSAAC-DVAVTDQAVVLSGSEDYVFLFMGLSQQQESEGKDRTSL 506

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
           LLPG Q  LI  VADA+K PV LV++S G VDI FA++NPKI +ILW GYPG+ GG AIA
Sbjct: 507 LLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPKIGAILWAGYPGQAGGLAIA 566

Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGL 605
            V+FG +NP GRLP+TWY  ++ K+P T M +R  P + +PGR+Y+F+ G  VY FGYGL
Sbjct: 567 KVLFGDHNPSGRLPMTWYPEDFTKVPMTDMRMRADPTSGYPGRSYRFYQGNAVYKFGYGL 626

Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYKFTF 662
           SY+ F  ++        +        R+     G       +  IDD+    C+  KF  
Sbjct: 627 SYSTFSSRLLYGTSMPALSSTVLAGLRETVTEEGDR-----SYHIDDIGTDGCEQLKFPA 681

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGI-AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
            +EV+N G MDG    +++ + P    G    Q+IG+    + AG++A + F ++ C+  
Sbjct: 682 MVEVQNHGPMDGKHSALMFLRWPNTNGGRPASQLIGFMSQHLKAGETANLRFDISPCEHF 741

Query: 722 KIVDNAANSLLASGAHTILV 741
             V      ++  G+H + V
Sbjct: 742 SRVRADGMKVIDIGSHFLTV 761


>gi|62701898|gb|AAX92971.1| beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|62733926|gb|AAX96035.1| beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|77550045|gb|ABA92842.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
           expressed [Oryza sativa Japonica Group]
 gi|125576900|gb|EAZ18122.1| hypothetical protein OsJ_33667 [Oryza sativa Japonica Group]
          Length = 771

 Score =  659 bits (1699), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 332/744 (44%), Positives = 459/744 (61%), Gaps = 29/744 (3%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S + +CDA+LP   RA DLV R+T  EKV Q+GD A GVPRLG+P Y+WWSE LHG+S+ 
Sbjct: 36  SGYAFCDARLPPARRAADLVSRLTAAEKVAQLGDEAGGVPRLGVPPYKWWSEGLHGLSYW 95

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GL 128
           G       G HF+  V   TSFP V+LT A+F++ LW +IGQ + TEARA+YNLG A GL
Sbjct: 96  GH------GMHFNGAVTAITSFPQVLLTAAAFDDRLWFRIGQAIGTEARALYNLGQAEGL 149

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T WSPN+N+ RDPRWGR  ETPGEDP    +YA+ +V+GLQ          S    L+ S
Sbjct: 150 TIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQG---------STPGTLQTS 200

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKH  AYDL+ W G  R++F+++VT QD+ +TF  PF+ CV +   S VMC+Y  +NG
Sbjct: 201 ACCKHATAYDLEEWNGVARYNFNAKVTAQDLADTFNPPFKSCVVDAKASCVMCAYTDING 260

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           +P CA   LL++T RG W   GY+ SDCD++  + ++ ++   T ED VA  +KAGLDL+
Sbjct: 261 VPACASSDLLSKTFRGQWGLDGYVSSDCDAVALLRDAQRYA-PTPEDTVAVAIKAGLDLN 319

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICN 364
           CG+Y     M A+QQGK+ E+D+D +L  L+ V MRLG+FDG P+    Y +LG  ++C 
Sbjct: 320 CGNYTQVHGMAALQQGKMRESDVDRALTNLFAVRMRLGHFDGDPRSNAAYGHLGAADVCT 379

Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
             H +LA EAA+ GIVLLKND GALPL+   +++ A++GP+AN   A+ GNY G PC  T
Sbjct: 380 QAHRDLALEAAQDGIVLLKNDAGALPLDRATVRSAAVIGPNANDPAALNGNYFGPPCETT 439

Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           +P+ G   Y   + +  GC    C   +    A   A ++D  ++  GL    E EG DR
Sbjct: 440 TPLQGVQRYISSVRFLAGCDSPAC-GFAATGQAAALASSSDQVIMFMGLSQDQEKEGLDR 498

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
             LLLPG Q  LI  VA AA+ PV LV+++ G VD+ FAKNNPKI +ILW GYPG+ GG 
Sbjct: 499 TSLLLPGKQQSLITAVASAARRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQAGGL 558

Query: 545 AIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
           AIA V+FG +NP GRLP+TWY   + +IP T M +R  P   +PGR+Y+F+ G  VY FG
Sbjct: 559 AIAKVLFGDHNPSGRLPVTWYPEEFTRIPMTDMRMRADPATGYPGRSYRFYQGNPVYKFG 618

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
           YGLSY++F  ++ ++ K    + +++     I    G          I +  C+  KF  
Sbjct: 619 YGLSYSKFSRRLVAAAKPR--RPNRNLLAGVIPKPAGDGGESYHVEEIGEEGCERLKFPA 676

Query: 663 QIEVENMGKMDGSEVVMVYSK-PPGIAGTH--IKQVIGYERVFIAAGQSAKVGFTMNACK 719
            +EV N G MDG   V+V+ + P   AG     +Q++G+    + AG+ A++   +N C+
Sbjct: 677 TVEVHNHGPMDGKHSVLVFVRWPNATAGASRPARQLVGFSSQHVRAGEKARLTMEINPCE 736

Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
            L         ++  G+H + VGE
Sbjct: 737 HLSRAREDGTKVIDRGSHFLKVGE 760


>gi|222629651|gb|EEE61783.1| hypothetical protein OsJ_16354 [Oryza sativa Japonica Group]
          Length = 771

 Score =  659 bits (1699), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 346/745 (46%), Positives = 468/745 (62%), Gaps = 76/745 (10%)

Query: 48  VPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWK 107
           +PRLG+P YEWWSEALHGVS++G      PGT F + VPGATSFP  ILT ASFN SL++
Sbjct: 45  LPRLGIPAYEWWSEALHGVSYVG------PGTRFSTLVPGATSFPQPILTAASFNASLFR 98

Query: 108 KIGQT------------------------------------------VSTEARAMYNLGN 125
            IG++                                          VSTEARAM+N+G 
Sbjct: 99  AIGESACNNTSQFFFSSKSPFSICIAMENLHCDFRSRLVRFYRGARVVSTEARAMHNVGL 158

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
           AGLTFWSPNIN+ RDPRWGR  ETPGEDP +  +YA+ YV GLQD  G        S  L
Sbjct: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGG-------GSDAL 211

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K++ACCKHY AYD+DNW+G +R+ FD+ V++QD+ +TF  PF+ CV +G+V+SVMCSYN+
Sbjct: 212 KVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNK 271

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           VNG PTCAD  LL+  IRGDW  +GYIVSDCDS+  +  +  +  +  EDA A  +K+GL
Sbjct: 272 VNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKN-PEDAAAITIKSGL 330

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNI 362
           DL+CG++    T+ AVQ GK++E+D+D ++   +IVLMRLG+FDG P+   + +LG  ++
Sbjct: 331 DLNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPRKLPFGSLGPKDV 390

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
           C   + ELA EAARQGIVLLKN  GALPL+  +IK++A++GP+ANA+  MIGNYEGTPC+
Sbjct: 391 CTSSNQELAREAARQGIVLLKN-TGALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCK 449

Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDLSVEAEG 481
           YT+P+ G  A    + Y PGC ++ C  NS+ + AA  AA +AD TV+V G D SVE E 
Sbjct: 450 YTTPLQGLGANVATV-YQPGCTNVGCSGNSLQLSAATQAAASADVTVLVVGADQSVERES 508

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
            DR  LLLPG Q +L++ VA+A++GPV LV+MS G  DI+FAK++ KI +ILWVGYP   
Sbjct: 509 LDRTSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSDKISAILWVGYPRRS 568

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVV 598
             R               LP+TWY A++  K+  T M +RP     +PGRTY+F+ G  V
Sbjct: 569 RWRRPRRHPLRIPQ--SWLPVTWYPASFADKVSMTDMRMRPDSSTGYPGRTYRFYTGDTV 626

Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
           Y FG GLSYT+F + + S+P+ V ++L +   C         +   C +V      C   
Sbjct: 627 YAFGDGLSYTKFAHSLVSAPEQVAVQLAEGHAC---------HTEHCFSVEAAGEHCGSL 677

Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
            F   + V N G M G   V ++S PP +     K ++G+E+V +  GQ+  V F ++ C
Sbjct: 678 SFDVHLRVRNAGGMAGGHTVFLFSSPPSVHSAPAKHLLGFEKVSLEPGQAGVVAFKVDVC 737

Query: 719 KSLKIVDNAANSLLASGAHTILVGE 743
           K L +VD   N  +A G+HT+ VG+
Sbjct: 738 KDLSVVDELGNRKVALGSHTLHVGD 762


>gi|224066929|ref|XP_002302284.1| predicted protein [Populus trichocarpa]
 gi|222844010|gb|EEE81557.1| predicted protein [Populus trichocarpa]
          Length = 742

 Score =  658 bits (1697), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 331/741 (44%), Positives = 467/741 (63%), Gaps = 60/741 (8%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +P+C  KLP  +R +DLV R+TL EKV Q+ D A  +PRLG+P YEWWSEALHGV+    
Sbjct: 44  YPFCQTKLPISQRVEDLVSRLTLDEKVSQLVDTAPAIPRLGIPAYEWWSEALHGVAL--- 100

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
           +T    G  F+  +  ATSFP VILT ASF+  LW +IGQ +  EAR +YN G A G+TF
Sbjct: 101 QTTVRQGIRFNGTIRFATSFPQVILTAASFDAHLWYRIGQVIGKEARGIYNAGQATGMTF 160

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           W+PNIN+ RDPRWGR  ETPGEDP V G+YA++YVRG+Q   G  +   +    L+ SAC
Sbjct: 161 WAPNINIFRDPRWGRGQETPGEDPLVAGKYAVSYVRGVQ---GDSFGGGTLGEQLQASAC 217

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH+ AYDLD W+G +RF FD+    QD+ +T+  PF+ C+ EG  S +MC+YNRVNG+P
Sbjct: 218 CKHFTAYDLDKWKGMNRFVFDA----QDLADTYQPPFQSCIQEGKASGIMCAYNRVNGVP 273

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
            CAD  LL++  RG W F+GYI SDCD++  I +   +   + EDAVA VLKAG+D++CG
Sbjct: 274 NCADYNLLSKKARGQWGFYGYITSDCDAVAIIHDDQGYAK-SPEDAVADVLKAGMDVNCG 332

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
           DY  N+T  AV++ K+ E++ID +L  L+ + MRLG F+G+P    Y N+  + +C+ +H
Sbjct: 333 DYLKNYTKSAVKKKKLPESEIDRALHNLFSIRMRLGLFNGNPTKQPYGNIAPDQVCSQEH 392

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
             LA +AA+ GIVLLKN +  LPL+    K+LA++GP+AN +  ++GNY G PC+  +P+
Sbjct: 393 QALALKAAQDGIVLLKNPDKLLPLSKLETKSLAVIGPNANNSTKLLGNYFGPPCKTVTPL 452

Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
            G   Y K   Y PGC+ + C + S I  A+  AK AD  ++V GLD + E E +DRVDL
Sbjct: 453 QGLQNYIKNTRYHPGCSRVACSSAS-INQAVKIAKGADQVILVMGLDQTQEKEEQDRVDL 511

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
           +LPG Q ELI  VA AAK PV LV+   G VD++FAK +  I SI+W GYPGE GG A+A
Sbjct: 512 VLPGKQRELITAVAKAAKKPVVLVLFCGGPVDVSFAKYDQNIGSIIWAGYPGEAGGTALA 571

Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGYGL 605
            +IFG +NPGGRLP+TWY  ++ K+P T M +RP   + +PGRTY+F++G  V+ FGYGL
Sbjct: 572 QIIFGDHNPGGRLPMTWYPQDFTKVPMTDMRMRPQLSSGYPGRTYRFYNGKKVFEFGYGL 631

Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK---CKDYKFTF 662
           SY+ + Y++AS  ++   KL      R  +  +  N       LI ++    C+  KFT 
Sbjct: 632 SYSNYSYELASDTQN---KL----YLRASSNQITKNSNTIRHKLISNIGKELCEKTKFTV 684

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
            + V+N G+M                                AG++A++ + ++ C+ L 
Sbjct: 685 TVRVKNHGEM--------------------------------AGENAEIQYELSPCEHLS 712

Query: 723 IVDNAANSLLASGAHTILVGE 743
             D+    ++  G+  +L+G+
Sbjct: 713 SPDDRGMMVMEEGSQFLLIGD 733


>gi|449508468|ref|XP_004163321.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 7-like
           [Cucumis sativus]
          Length = 783

 Score =  657 bits (1694), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 342/757 (45%), Positives = 471/757 (62%), Gaps = 39/757 (5%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            P+C   LP   RA+DLV R+TL EKV Q+ +    +PRLG+P YEWWSEALHGV+ +G 
Sbjct: 50  LPFCKTYLPIKLRARDLVSRLTLDEKVLQLVNTVPPIPRLGIPAYEWWSEALHGVANVGY 109

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
                 G   +  +  ATSFP VILT ASF+E+LW +IGQ + TEARA+YN G A G+TF
Sbjct: 110 ------GIRLNGTITAATSFPQVILTAASFDENLWYQIGQAIGTEARAVYNAGQAKGMTF 163

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQD--VEGVEYHRDSDSRPLKIS 188
           W+PNIN+ RDPRWGR  ETPGEDP + G+Y++ YVRG+Q   +EG +         LK S
Sbjct: 164 WTPNINIFRDPRWGRGQETPGEDPLMTGKYSVAYVRGIQGDAIEGGKL-----GNQLKAS 218

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKH+ AYDLD W G  R+ FD++VT QDM +T+  PFE CV EG  S +MC+YNRVNG
Sbjct: 219 ACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADTYQPPFESCVEEGKASGIMCAYNRVNG 278

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           +P+CAD  LL  T R  W F+GYI SDCD++  I ++  +     EDAVA VL+AG+D++
Sbjct: 279 VPSCADHHLLTATARKQWKFNGYITSDCDAVSIIHDAQGYAK-IPEDAVADVLRAGMDVN 337

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG Y    T  AV+  K+    ID +LR L+ V MRLG FDG+P    +  +G++ +C+ 
Sbjct: 338 CGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSVRMRLGLFDGNPTKLPFGQIGRDQVCSQ 397

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           QH  LA +AAR+GIVLLKN    LPL+  N  +LA++G + N  K + GNY G PC+  +
Sbjct: 398 QHQNLALQAAREGIVLLKNSAKLLPLSKSNTHSLAVIGHNGNDPKTLRGNYAGIPCKSAT 457

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P  G   Y K   Y  GC    C   + I  A+  AK+ D  V+V GLD + E E  DR 
Sbjct: 458 PFQGLNNYVKNTVYHRGCNYANC-TEATIYQAVKIAKSVDYVVLVMGLDQTQEREDFDRT 516

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           +L LPG Q +LI +VA AAK PV LVI+S G VDI+ AK N KI SILW GYPG+ GG A
Sbjct: 517 ELGLPGKQDKLIAEVAKAAKXPVILVILSGGPVDISSAKYNEKIGSILWAGYPGQAGGTA 576

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
           IA++IFG +NPGGRLP+TWY  +++K P T M +R      +PGRTY+F++GP VY FGY
Sbjct: 577 IAEIIFGDHNPGGRLPLTWYPHDFIKFPMTDMRMRADSSTGYPGRTYRFYNGPKVYEFGY 636

Query: 604 GLSYTQFKYKVASSPKSVDI----KLDKDQQCRD-INYTVGTNKPPCAAVLIDDVKCKDY 658
           GLSY+   Y+  S  +S  +    K  +  +  D ++Y + +         +D   C+  
Sbjct: 637 GLSYSNHIYEFTSVSESKLLLSHPKASQPAKNSDLVSYRLVSE--------LDKKFCESK 688

Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPG-IAGTHIKQVIGYERVFIAAGQSAKVGFTMNA 717
                + V N G+M G   V+++ KP   I G+ +KQ++G+++V I AG+  ++ F ++ 
Sbjct: 689 TVNVTVGVRNEGEMGGKHSVLLFIKPSKPINGSPVKQLVGFKKVEINAGERREIEFLVSP 748

Query: 718 CKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQL 754
           C  +         ++  G+++++VG+    V  PL +
Sbjct: 749 CDHISKASEEGLMIIEEGSYSLVVGD----VEHPLDI 781


>gi|414588273|tpg|DAA38844.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 775

 Score =  657 bits (1694), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 335/750 (44%), Positives = 462/750 (61%), Gaps = 28/750 (3%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +P+CD  LP   RA DLV R+T+ EKV Q+GD A GVPRLG+P Y+WWSE LHG++F G 
Sbjct: 41  YPFCDRSLPAARRAADLVSRLTVAEKVSQLGDEAAGVPRLGVPPYKWWSEGLHGLAFWGH 100

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
                 G  F+  V   TSFP V+LTTASF+ESLW +IGQ +  EARA+YNLG A GLT 
Sbjct: 101 ------GMRFNGTVSAVTSFPQVLLTTASFDESLWFRIGQAIGREARALYNLGQAEGLTI 154

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           WSPN+N+ RDPRWGR  ETPGEDP V  +YA+ +VRG+Q          + + PL+ SAC
Sbjct: 155 WSPNVNIFRDPRWGRGQETPGEDPAVASKYAVAFVRGIQGSNPAG----AAAAPLQASAC 210

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH  AYDL++W G  R++FD+RVT QD+ +TF  PF+ CV +G  S VMC+Y  +NG+P
Sbjct: 211 CKHATAYDLEDWNGVARYNFDARVTLQDLADTFNPPFQSCVVDGKASCVMCAYTVINGVP 270

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
            CA   LL +T RG W   GY+ SDCD++  + ++ ++   T ED VA  LKAGLDL+CG
Sbjct: 271 ACASSDLLTKTFRGAWGLDGYVSSDCDAVAIMRDAQRY-EPTPEDTVAVALKAGLDLNCG 329

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQ 366
            Y     M A+QQGK+ E D+D +L  L+ V MRLG+FDG P+    Y  LG  ++C   
Sbjct: 330 TYTQQHGMAAIQQGKMTEKDVDKALTNLFAVRMRLGHFDGDPRGNALYGRLGAADVCTAD 389

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
           H  LA EAA+ GIVLLKND G LPL+   + + A++G +AN    + GNY G  C  T+P
Sbjct: 390 HKNLALEAAQDGIVLLKNDAGILPLDRSAVGSAAVIGHNANDPLVLSGNYFGPACETTTP 449

Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
           ++G  +Y + + +  GC+   C   +    A   A +A+   +  GL    E EG DR  
Sbjct: 450 LEGLQSYVRNVRFLAGCSSAAC-GYAATGQAAALASSAEYVFLFMGLSQDQEKEGLDRTS 508

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           LLLPG Q  L+  VA AAK PV LV+++ G VDI FA++NPKI +ILW GYPG+ GG AI
Sbjct: 509 LLLPGKQQSLVTAVASAAKRPVVLVLLTGGPVDITFAQSNPKIGAILWAGYPGQAGGLAI 568

Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
           A V+FG +NP GRLP+TWY  ++ K+P T M +R  P   +PGRTY+F+ G  +Y FGYG
Sbjct: 569 ARVLFGDHNPSGRLPVTWYTEDFTKVPMTDMRMRADPATGYPGRTYRFYRGKTIYKFGYG 628

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD---VKCKDYKFT 661
           LSY++F  ++ +  K++             + +  T     +   +DD   V C+  KF 
Sbjct: 629 LSYSKFSRQLVTGDKNL-----APNTSLLAHLSAKTQHAATSYYHVDDIGTVGCEQLKFP 683

Query: 662 FQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
            ++EV N G MDG   V+++ + P    G  ++Q+IG+    I AG+ A V F ++ C+ 
Sbjct: 684 AEVEVLNHGPMDGKHSVLMFLRWPNATDGRPVRQLIGFRSQHIKAGEKANVRFHVSPCEH 743

Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVSF 750
                     ++  G+H ++VG+    +SF
Sbjct: 744 FSRTRADGKKVIDRGSHFLMVGKEELEISF 773


>gi|326517420|dbj|BAK00077.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 781

 Score =  657 bits (1694), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 339/747 (45%), Positives = 462/747 (61%), Gaps = 44/747 (5%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           + +CDA LP  +RA DLV R+T  EKV Q+GD A GVPRLG+P Y+WW+EALHG++  G+
Sbjct: 51  YAFCDATLPVAQRAADLVARLTTAEKVAQLGDEAAGVPRLGVPAYKWWNEALHGLATSGK 110

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
                 G HF+  V  ATSFP V LT A+F++ LW +IGQ +  EARA+YN+G A GLT 
Sbjct: 111 ------GLHFNGAVRSATSFPQVSLTAAAFDDDLWLRIGQAIGREARALYNVGQAEGLTM 164

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           WSPN+N+ RDPRWGR  ETPGEDP    RY + +V+GLQ          + S  L+ SAC
Sbjct: 165 WSPNVNIYRDPRWGRGQETPGEDPTTASRYGVAFVKGLQG-------NSTSSSLLQTSAC 217

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH  AYDL++W G  R++FD+RVT QD+++T+  PF  CV +G  S VMC+Y  +NG+P
Sbjct: 218 CKHATAYDLEDWGGVARYNFDARVTAQDLEDTYNPPFRSCVVDGKASCVMCAYTAINGVP 277

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
            CA+  LL  T+R DW   GY+ SDCD++  + ++ ++   T EDAVA  LKAGLD+DCG
Sbjct: 278 ACANSGLLTNTVRADWGLDGYVASDCDAVAIMRDAQRYA-PTPEDAVALALKAGLDIDCG 336

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
            Y       A+QQGKI E D+D +L+ L+ + MRLG+FDG P+   Y  L   +IC P+H
Sbjct: 337 TYMQQHAPAALQQGKITEDDVDKALKNLFAIRMRLGHFDGDPRANIYGGLNAAHICTPEH 396

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
             LA EAA+ GIVLLKND G LPL+   I + A++GP+AN    +IGNY G PC   +P+
Sbjct: 397 RSLALEAAQDGIVLLKNDAGILPLDRAAIASAAVIGPNANNPGLLIGNYFGPPCESVTPL 456

Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
            G   Y K + +  GC    C       AA   A ++D  ++  GL    E+EG+DR  L
Sbjct: 457 KGVQGYVKDVRFMAGCGSAACDVADTDQAAT-LAGSSDYVLLFMGLSQQQESEGRDRTSL 515

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
           LLPG Q  LI  VADAAK PV LV+++ G VD+ FAKNNPKI +ILW GYPG+ GG AIA
Sbjct: 516 LLPGQQQSLITAVADAAKRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQAGGLAIA 575

Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGL 605
            V+FG +NPGGRLP+TWY   + K+P T M +R  P   +PGR+Y+F+ G  VY FGYGL
Sbjct: 576 RVLFGDHNPGGRLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRFYQGETVYKFGYGL 635

Query: 606 SYTQFKYKVA--SSPKSVDIKLDKDQQCRDINYTVGTNKPPC-----AAVLIDDVK---C 655
           SY+ +  ++    +P +            D+   + T   P      A+  ++ +    C
Sbjct: 636 SYSSYSRRLLSSGTPNT------------DLLAGLSTMPTPAEEGGVASYHVEHIGARGC 683

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
           +  KF   +EVEN G MDG   V++Y +     AG   KQ+IG+ R  + AG+ A + F 
Sbjct: 684 EQLKFPAVVEVENHGPMDGKHSVLMYLRWANATAGRPAKQLIGFRRQHLKAGEKASLTFD 743

Query: 715 MNACKSLKIVDNAANSLLASGAHTILV 741
           ++ C+    V    N ++  G+H ++V
Sbjct: 744 ISPCEHFSRVRKDGNKVVDRGSHFLMV 770


>gi|449465962|ref|XP_004150696.1| PREDICTED: probable beta-D-xylosidase 7-like [Cucumis sativus]
          Length = 783

 Score =  657 bits (1694), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 342/757 (45%), Positives = 471/757 (62%), Gaps = 39/757 (5%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            P+C   LP   RA+DLV R+TL EKV Q+ +    +PRLG+P YEWWSEALHGV+ +G 
Sbjct: 50  LPFCKTYLPIKLRARDLVSRLTLDEKVLQLVNTVPPIPRLGIPAYEWWSEALHGVANVGY 109

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
                 G   +  +  ATSFP VILT ASF+E+LW +IGQ + TEARA+YN G A G+TF
Sbjct: 110 ------GIRLNGTITAATSFPQVILTAASFDENLWYQIGQAIGTEARAVYNAGQAKGMTF 163

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQD--VEGVEYHRDSDSRPLKIS 188
           W+PNIN+ RDPRWGR  ETPGEDP + G+Y++ YVRG+Q   +EG +         LK S
Sbjct: 164 WTPNINIFRDPRWGRGQETPGEDPLMTGKYSVAYVRGIQGDAIEGGKL-----GNQLKAS 218

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKH+ AYDLD W G  R+ FD++VT QDM +T+  PFE CV EG  S +MC+YNRVNG
Sbjct: 219 ACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADTYQPPFESCVEEGKASGIMCAYNRVNG 278

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           +P+CAD  LL  T R  W F+GYI SDCD++  I ++  +     EDAVA VL+AG+D++
Sbjct: 279 VPSCADHHLLTATARKQWKFNGYITSDCDAVSIIHDAQGYAK-IPEDAVADVLRAGMDVN 337

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG Y    T  AV+  K+    ID +LR L+ V MRLG FDG+P    +  +G++ +C+ 
Sbjct: 338 CGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSVRMRLGLFDGNPTKLPFGQIGRDQVCSQ 397

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           QH  LA +AAR+GIVLLKN    LPL+  N  +LA++G + N  K + GNY G PC+  +
Sbjct: 398 QHQNLALQAAREGIVLLKNSAKLLPLSKSNTHSLAVIGHNGNDPKTLRGNYAGIPCKSAT 457

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P  G   Y K   Y  GC    C   + I  A+  AK+ D  V+V GLD + E E  DR 
Sbjct: 458 PFQGLNNYVKNTVYHRGCNYANC-TEATIYQAVKIAKSVDYVVLVMGLDQTQEREDFDRT 516

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           +L LPG Q +LI +VA AAK PV LVI+S G VDI+ AK N KI SILW GYPG+ GG A
Sbjct: 517 ELGLPGKQDKLIAEVAKAAKRPVILVILSGGPVDISSAKYNEKIGSILWAGYPGQAGGTA 576

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
           IA++IFG +NPGGRLP+TWY  +++K P T M +R      +PGRTY+F++GP VY FGY
Sbjct: 577 IAEIIFGDHNPGGRLPLTWYPHDFIKFPMTDMRMRADSSTGYPGRTYRFYNGPKVYEFGY 636

Query: 604 GLSYTQFKYKVASSPKSVDI----KLDKDQQCRD-INYTVGTNKPPCAAVLIDDVKCKDY 658
           GLSY+   Y+  S  +S  +    K  +  +  D ++Y + +         +D   C+  
Sbjct: 637 GLSYSNHIYEFTSVSESKLLLSHPKASQPAKNSDLVSYRLVSE--------LDKKFCESK 688

Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPG-IAGTHIKQVIGYERVFIAAGQSAKVGFTMNA 717
                + V N G+M G   V+++ KP   I G+ +KQ++G+++V I AG+  ++ F ++ 
Sbjct: 689 TVNVTVGVRNEGEMGGKHSVLLFIKPSKPINGSPVKQLVGFKKVEINAGERREIEFLVSP 748

Query: 718 CKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQL 754
           C  +         ++  G+++++VG+    V  PL +
Sbjct: 749 CDHISKASEEGLMIIEEGSYSLVVGD----VEHPLDI 781


>gi|85813770|emb|CAJ65921.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
          Length = 704

 Score =  656 bits (1693), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 348/679 (51%), Positives = 454/679 (66%), Gaps = 52/679 (7%)

Query: 8   KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
            L+   +C+  +   +R  DLV+R+TL EK+  + + A  V RLG+P YEWWSEALHGVS
Sbjct: 48  SLASLGFCNTSIGINDRVVDLVKRLTLQEKIVFLVNSAGNVSRLGIPKYEWWSEALHGVS 107

Query: 68  FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIG-----QTVSTEARAMYN 122
           ++G      PGTHF  +V GATSFP VILT ASFN SL++ IG     Q VSTEARAMYN
Sbjct: 108 YVG------PGTHFSDDVAGATSFPQVILTAASFNTSLFEAIGKVYYTQVVSTEARAMYN 161

Query: 123 LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
           +G AGLTFWSPNIN+ RDPRWGR  ETPGEDP +  +Y   YV+GLQ  +      D D 
Sbjct: 162 VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSCYVKGLQQRD------DGDP 215

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRV-TEQDMQETFILPFEMCVNEGDVSSVMC 241
             LK++ACCKHY AYDLDNW+G+DR+HF++ V T+QDM +TF  PF+ CV +G+V+SVMC
Sbjct: 216 DKLKVAACCKHYTAYDLDNWKGSDRYHFNAVVVTKQDMDDTFQPPFKSCVIDGNVASVMC 275

Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGY-------IVSDCDSIQTIVESHKFLNDTKE 294
           SYN+VNG PTCADP LL+  IRG+WN +GY       IV+DCDS+    +S  +    +E
Sbjct: 276 SYNQVNGKPTCADPDLLSGVIRGEWNLNGYQWGCCRYIVTDCDSLDVFYKSQNYTKTPEE 335

Query: 295 DAVARVLKA-----GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
            A A +L       G+DL+CG +    T  AV+ G + E  ID ++   +  LMRLG+FD
Sbjct: 336 AAAAAILAGNSLVTGVDLNCGSFLGQHTEAAVKGGLVNEHAIDIAVSNNFATLMRLGFFD 395

Query: 350 GSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHA 406
           G P    Y  LG  ++C  ++ ELA EAARQGIVLLKN  G+LPL+   IK LA++GP+A
Sbjct: 396 GDPSKQLYGKLGPKDVCTAENQELAREAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNA 455

Query: 407 NATKAMIGNYEG-TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNAD 465
           N TK MIGNYEG TPC+YT+P+ G  A S    Y PGC+++ C + + +  A   A  AD
Sbjct: 456 NVTKTMIGNYEGGTPCKYTTPLQGLAA-SVATTYLPGCSNVAC-STAQVDDAKKLAAAAD 513

Query: 466 ATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
           ATV+V G DLS+EAE +DRVD+LLPG Q  LI  VA+ + GPV LVIMS G +D++FA+ 
Sbjct: 514 ATVLVMGADLSIEAESRDRVDVLLPGQQQLLITAVANVSCGPVILVIMSGGGMDVSFART 573

Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPG----GRLPITWYEANYV-KIPYTSMPLR 580
           N KI SILWVGYPGE GG AIAD+IFG YNP     GRLP+TWY  +YV K+P T+M +R
Sbjct: 574 NDKITSILWVGYPGEAGGAAIADIIFGYYNPSTHQPGRLPMTWYPQSYVDKVPMTNMNMR 633

Query: 581 --PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV 638
             P N +PGRTY+F+ G  VY FG GLSY+QF +++  +P+ V + L++   C       
Sbjct: 634 PDPSNGYPGRTYRFYTGETVYSFGDGLSYSQFTHELIQAPQLVYVPLEESHVC------- 686

Query: 639 GTNKPPCAAVLIDDVKCKD 657
             +   C +V+  +  C++
Sbjct: 687 --HSSECQSVVASEQTCQN 703


>gi|356548162|ref|XP_003542472.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
          Length = 778

 Score =  656 bits (1692), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 336/748 (44%), Positives = 470/748 (62%), Gaps = 36/748 (4%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           + +C+ KLP  +RA+DLV R+TL EK+ Q+ + A  +PRLG+P Y+WWSEALHGV+  G 
Sbjct: 42  YSFCNTKLPITKRAQDLVSRLTLDEKLAQLVNTAPAIPRLGIPSYQWWSEALHGVADAGF 101

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
                 G  F+  +  ATSFP VILT ASF+ +LW +I +T+  EARA+YN G A G+TF
Sbjct: 102 ------GIRFNGTIKSATSFPQVILTAASFDPNLWYQISKTIGREARAVYNAGQATGMTF 155

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           W+PNINV RDPRWGR  ET GEDP +  +Y + YVRGLQ   G  +     +  L+ SAC
Sbjct: 156 WAPNINVFRDPRWGRGQETAGEDPLMNAKYGVAYVRGLQ---GDSFEGGKLAERLQASAC 212

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH+ AYDLD W+G DRF FD+RVT QD+ +T+  PF+ C+ +G  S +MC+YNRVNG+P
Sbjct: 213 CKHFTAYDLDQWKGLDRFVFDARVTSQDLADTYQPPFQSCIEQGRASGIMCAYNRVNGVP 272

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
            CAD  LL +T R  W F GYI SDC ++  I E   +   T EDA+A V +AG+D++CG
Sbjct: 273 NCADFNLLTKTARQQWKFDGYITSDCGAVSIIHEKQGYAK-TAEDAIADVFRAGMDVECG 331

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
           DY T     AV Q K+  + ID +L+ L+ + +RLG FDG+P    +  +G N +C+ Q 
Sbjct: 332 DYITKHAKSAVFQKKLPISQIDRALQNLFSIRIRLGLFDGNPTKLPFGTIGPNEVCSKQS 391

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT-KAMIGNYEGTPCRYTSP 426
           ++LA EAAR GIVLLKN N  LPL   N  T+AL+GP+ANA+ K  +GNY G PC   + 
Sbjct: 392 LQLALEAARDGIVLLKNTNSLLPLPKTN-PTIALIGPNANASSKVFLGNYYGRPCNLVTL 450

Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
           + GF  Y+K + Y PGC D      + I  A++ AK  D  V+V GLD S E E  DR  
Sbjct: 451 LQGFEGYAKTV-YHPGCDDGPQCAYAQIEEAVEVAKKVDYVVLVMGLDQSQERESHDREY 509

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           L LPG Q ELI  VA AAK PV +V++  G VDI  AK + K+  ILW GYPGE GG A+
Sbjct: 510 LGLPGKQEELIKSVARAAKRPVVVVLLCGGPVDITSAKFDDKVGGILWAGYPGELGGVAL 569

Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
           A V+FG +NPGG+LPITWY  +++K+P T M +R  P + +PGRTY+F+ GP VY FGYG
Sbjct: 570 AQVVFGDHNPGGKLPITWYPKDFIKVPMTDMRMRADPASGYPGRTYRFYTGPKVYEFGYG 629

Query: 605 LSYTQFKYKVAS-SPKSVDIKLDK----DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           LSYT++ YK+ S S  ++ I         Q    I Y + +         + +  C+   
Sbjct: 630 LSYTKYSYKLLSLSHSTLHINQSSTHLMTQNSETIRYKLVSE--------LAEETCQTML 681

Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIA----GTHIKQVIGYERVFIAAGQSAKVGFTM 715
            +  + V N G + G   V+++ +   +     G  +KQ++G++ V + AG++ +VGF +
Sbjct: 682 LSIALGVTNRGNLAGKHPVLLFVRQGKVRNINNGNPVKQLVGFQSVKVNAGETVQVGFEL 741

Query: 716 NACKSLKIVDNAANSLLASGAHTILVGE 743
           + C+ L + + A + ++  G++  +VG+
Sbjct: 742 SPCEHLSVANEAGSMVIEEGSYLFIVGD 769


>gi|356552866|ref|XP_003544783.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
          Length = 776

 Score =  655 bits (1691), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 333/746 (44%), Positives = 471/746 (63%), Gaps = 33/746 (4%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +P+C+ +LP  +RA+DLV R+TL EK+ Q+ + A  +PRLG+P Y+WWSEALHGV+  G 
Sbjct: 41  YPFCNTRLPISKRAQDLVSRLTLDEKLAQLVNTAPAIPRLGIPSYQWWSEALHGVADAGF 100

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
                 G  F+  +  ATSFP VILT ASF+ +LW +I +T+  EARA+YN G A G+TF
Sbjct: 101 ------GIRFNGTIKSATSFPQVILTAASFDPNLWYQISKTIGKEARAVYNAGQATGMTF 154

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           W+PNINV RDPRWGR  ET GEDP +  +Y + YVRGLQ   G  +        L+ SAC
Sbjct: 155 WAPNINVFRDPRWGRGQETAGEDPLMNAKYGVAYVRGLQ---GDSFEGGKLGERLQASAC 211

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH+ AYDLD+W+G DRF +D+RVT QD+ +T+  PF+ C+ +G  S +MC+YNRVNG+P
Sbjct: 212 CKHFTAYDLDHWKGLDRFVYDARVTSQDLADTYQPPFQSCIEQGRASGIMCAYNRVNGVP 271

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
            CA+  LL +T R  W F GYI SDC ++ +I+   +    T EDA+A V +AG+D++CG
Sbjct: 272 NCANFNLLTKTARQQWKFDGYITSDCGAV-SIIHDEQGYAKTAEDAIADVFRAGMDVECG 330

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
           DY T     AV Q K+  + ID +L+ L+ + +RLG  DG+P    +  +G + +C+ Q 
Sbjct: 331 DYITKHGKSAVSQKKLPISQIDRALQNLFSIRIRLGLLDGNPTKLPFGTIGPDQVCSKQS 390

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT-KAMIGNYEGTPCRYTSP 426
           ++LA EAAR GIVLLKN N  LPL   N  T+AL+GP+ANA+ K  +GNY G PC   + 
Sbjct: 391 LQLALEAARDGIVLLKNTNSLLPLPKTN-PTIALIGPNANASSKVFLGNYYGRPCNLVTL 449

Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
           + GF  Y+K   Y PGC D      + I  A++ AK  D  V+V GLD S E E  DR  
Sbjct: 450 LQGFEGYAKDTVYHPGCDDGPQCAYAQIEGAVEVAKKVDYVVLVMGLDQSQERESHDREY 509

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           L LPG Q ELI  VA A+K PV LV++  G VDI  AK + K+  ILW GYPGE GG A+
Sbjct: 510 LGLPGKQEELIKSVARASKRPVVLVLLCGGPVDITSAKFDDKVGGILWAGYPGELGGVAL 569

Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
           A V+FG +NPGG+LPITWY  +++K+P T M +R  P + +PGRTY+F+ GP VY FGYG
Sbjct: 570 AQVVFGDHNPGGKLPITWYPKDFIKVPMTDMRMRADPASGYPGRTYRFYTGPKVYEFGYG 629

Query: 605 LSYTQFKYKVAS-SPKSVDIKLDK----DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           LSYT++ YK+ S S  ++ I         Q    I Y + +         + +  C+   
Sbjct: 630 LSYTKYSYKLLSLSHNTLHINQSSTHLTTQNSETIRYKLVSE--------LAEETCQTML 681

Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIA--GTHIKQVIGYERVFIAAGQSAKVGFTMNA 717
            +  + V N G M G   V+++ +   +   G  +KQ++G++ V + AG++ +VGF ++ 
Sbjct: 682 LSIALGVTNHGNMAGKHPVLLFVRQGKVRNNGNPVKQLVGFQSVKLNAGETVQVGFELSP 741

Query: 718 CKSLKIVDNAANSLLASGAHTILVGE 743
           C+ L + + A + ++  G++ +LVG+
Sbjct: 742 CEHLSVANEAGSMVIEEGSYLLLVGD 767


>gi|125534112|gb|EAY80660.1| hypothetical protein OsI_35838 [Oryza sativa Indica Group]
          Length = 771

 Score =  655 bits (1689), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 330/742 (44%), Positives = 458/742 (61%), Gaps = 29/742 (3%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           + +CDA+LP   RA DLV R+T  EKV Q+GD A GV RLG+P Y+WWSE LHG+S+ G 
Sbjct: 38  YAFCDARLPPARRAADLVSRLTAAEKVAQLGDEAGGVARLGVPPYKWWSEGLHGLSYWGH 97

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
                 G HF+  V   TSFP V+LT A+F++ LW +IGQ + TEARA+YNLG A GLT 
Sbjct: 98  ------GMHFNGAVTAITSFPQVLLTAAAFDDRLWFRIGQAIGTEARALYNLGQAEGLTI 151

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           WSPN+N+ RDPRWGR  ETPGEDP    +YA+ +V+GLQ          S    L+ SAC
Sbjct: 152 WSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQG---------STPGTLQTSAC 202

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH  AYDL+ W G  R++F+++VT QD+ +TF  PF+ CV +   S VMC+Y  +NG+P
Sbjct: 203 CKHATAYDLEEWNGVARYNFNAKVTAQDLADTFNPPFKSCVVDAKASCVMCAYTDINGVP 262

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
            CA   LL++T RG W   GY+ SDCD++  + ++ ++   T ED VA  +KAGLDL+CG
Sbjct: 263 ACASSDLLSKTFRGQWGLDGYVSSDCDAVALLRDAQRYA-PTPEDTVAVAIKAGLDLNCG 321

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQ 366
           +Y     M A+QQGK+ E+D+D +L  L+ V MRLG+FDG P+    Y +LG  ++C   
Sbjct: 322 NYTQVHGMAALQQGKMRESDVDRALTNLFAVRMRLGHFDGDPRSNAAYGHLGAADVCTQA 381

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
           H +LA EAA+ GIVLLKND GALPL+   +++ A++GP+AN   A+ GNY G PC  T+P
Sbjct: 382 HRDLALEAAQNGIVLLKNDAGALPLDRATVRSAAVIGPNANDPAALNGNYFGPPCETTTP 441

Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
           + G   Y   + +  GC    C   +    A   A ++D  ++  GL    E EG DR  
Sbjct: 442 LQGVQRYISSVRFLAGCDSPAC-GFAATGQAAALASSSDQVIMFMGLSQDQEKEGLDRTS 500

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           LLLPG Q  LI  VA AA+ PV LV+++ G VD+ FAKNNPKI +ILW GYPG+ GG AI
Sbjct: 501 LLLPGKQQSLITAVASAARRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQAGGLAI 560

Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
           A V+FG +NP GRLP+TWY   + +IP T M +R  P   +PGR+Y+F+ G  VY FGYG
Sbjct: 561 AKVLFGDHNPSGRLPVTWYPEEFTRIPMTDMRMRADPATGYPGRSYRFYQGNPVYKFGYG 620

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
           LSY++F  ++ ++ K    + +++     I    G          I +  C+  KF   +
Sbjct: 621 LSYSKFTRRLVAAAKPR--RPNRNLLAGVIPKPAGDGGESYHVEEIGEEGCERLKFPATV 678

Query: 665 EVENMGKMDGSEVVMVYSK-PPGIAGTH--IKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
           EV N G MDG   V+V+ + P   AG     +Q++G+    + AG+ A++   +N C+ L
Sbjct: 679 EVHNHGPMDGKHSVLVFVQWPNATAGASRPARQLVGFSSQHVRAGEKARLTMEINPCEHL 738

Query: 722 KIVDNAANSLLASGAHTILVGE 743
               +    ++  G+H + VGE
Sbjct: 739 SRARDDGTKVIDRGSHFLKVGE 760


>gi|326491679|dbj|BAJ94317.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 772

 Score =  654 bits (1688), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 331/746 (44%), Positives = 474/746 (63%), Gaps = 26/746 (3%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           + + +CD  LP+P RA+ LV  +TL EK+ Q+ + A GVPRLG+P YEWWSE+LHG++  
Sbjct: 36  NSYAFCDGSLPFPVRARALVSLLTLDEKIAQLSNTAAGVPRLGVPPYEWWSESLHGLA-- 93

Query: 70  GRRTNSPPGTHFDS-EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
               ++ PG +F S  V  AT FP VIL+ A+FN SLW+ + + V+ EARAM+N G AGL
Sbjct: 94  ----DNGPGVNFSSGPVAAATIFPQVILSAAAFNRSLWRAVAEAVAVEARAMHNAGQAGL 149

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T+W+PNINV RDPRWGR  ETPGEDP ++  Y++ YV+G Q     EY    + R + +S
Sbjct: 150 TYWAPNINVFRDPRWGRGQETPGEDPAMIAAYSVEYVKGFQG----EYGDGREGR-MMLS 204

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYDL+ W    R+ F++ V  QD ++T+  PF+ C+ EG  S +MCSYN+VNG
Sbjct: 205 ACCKHYIAYDLEKWGKFARYTFNAEVNAQDFEDTYEPPFKSCIQEGRASCLMCSYNQVNG 264

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           +P CA   LL Q IR +W F GYIVSDCD++  I E+  +   + ED+VA VLKAG+D++
Sbjct: 265 VPACARKDLL-QKIRDEWGFKGYIVSDCDAVAIIHENQTY-TSSDEDSVAIVLKAGMDVN 322

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG +    T  A+++GKI E DI+ +L  L+ V +RLG F+ + +   +  LG +N+C  
Sbjct: 323 CGSFLIRHTKSAIEKGKIQEEDINHALYNLFSVQLRLGLFEKANENQWFTRLGPSNVCTK 382

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H ELAAEA RQG VLLKNDN  LPL    +  +AL+G  AN    M G+Y G PC   +
Sbjct: 383 EHRELAAEAVRQGTVLLKNDNSFLPLKRSKVSHIALIGAAANDAYIMGGDYTGVPCDPIT 442

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
            + G  A+      A GC D+ C +      AI+AAK AD  V++AGL+L+ E+E  DRV
Sbjct: 443 FLKGMQAFVPQTTVAAGCKDVSCDSPDGFGEAIEAAKRADIVVVIAGLNLTQESEDLDRV 502

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            LLLPG Q +L+N +A   K P+ LVI   G VD+ FAK +P+I S+LW+GYPGE GG+ 
Sbjct: 503 TLLLPGRQQDLVNIIASVTKKPIVLVITGGGPVDVAFAKQDPRIASVLWIGYPGEVGGQV 562

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
           + +++FG+YNPGG+LP+TWY  ++  +P   M +R  P   +PGRTY+F+ G VVY FGY
Sbjct: 563 LPEILFGEYNPGGKLPMTWYPESFTAVPMNDMNMRADPSRGYPGRTYRFYTGEVVYGFGY 622

Query: 604 GLSYTQFKYKVASSPKSVDIKLD--KDQQCRDINYTVGTNKPPCAAVLIDDV-KCKDYKF 660
           GLSY+++ Y +  +P+ + +          R   Y   T +     V ++D+  C+   F
Sbjct: 623 GLSYSKYSYNIVQAPQRISLSHSPVPGLISRKPAY---TRRDGLDYVQVEDIASCESLVF 679

Query: 661 TFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
           +  I V N G MDGS  V+++++    + G  +KQ++G+ERV+ AAG S  V  T++ CK
Sbjct: 680 SVHISVANDGAMDGSHAVLLFARSKSSVPGFPLKQLVGFERVYTAAGSSKNVAITVDPCK 739

Query: 720 SLKIVDNAANSLLASGAHTILVGEGV 745
            +   +     +L  G+H ++VG+ V
Sbjct: 740 YMSAANTEGRRVLLLGSHHLMVGDEV 765


>gi|168046596|ref|XP_001775759.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672911|gb|EDQ59442.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 784

 Score =  654 bits (1688), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 339/748 (45%), Positives = 477/748 (63%), Gaps = 30/748 (4%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           FP+C+  +   +R +DL+ R+T+ EK++Q+ + A  V RLG+P Y+WW E LHGV+    
Sbjct: 32  FPFCNTSISDDDRVEDLISRLTIQEKIEQLVNTAANVSRLGIPPYQWWGEGLHGVAI--- 88

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
                P  +F    P ATSFP   L+  S+N +LW KIGQ VSTE RAMYN G +GLT+W
Sbjct: 89  ----SPSVYFGGATPAATSFPLPCLSVCSYNRTLWNKIGQVVSTEGRAMYNQGRSGLTYW 144

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR---PLKIS 188
           SPNIN+ RDPRWGR  ETPGEDP +   YA+++V+GLQ+ +  +    + SR    LKIS
Sbjct: 145 SPNINIARDPRWGRTQETPGEDPKLSSGYAVHFVKGLQEGDYDQNQPQAVSRGPRRLKIS 204

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKH+ A+DLD W+  DR HFDS+VT+QD+++T+   F+ CV EG  SSVMCSYNR+NG
Sbjct: 205 ACCKHFTAHDLDRWKDYDRDHFDSKVTQQDLEDTYNPSFKSCVKEGQSSSVMCSYNRLNG 264

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLN--DTKEDAVARVLKAGLD 306
           IP C   +LL  T+R  W F GYIVSDCD++  I   H ++N   T EDAV+ V+ AG+D
Sbjct: 265 IPMCTHYELLTLTVRNQWGFDGYIVSDCDAVALI---HDYINYAPTSEDAVSYVMLAGMD 321

Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNIC 363
           L+CG       + A+ +  I E  ID  LR L+ V MRLG FDG+P    Y +LG  ++C
Sbjct: 322 LNCGSTTLVHGLAALDKKLIWEGLIDMHLRNLFRVRMRLGMFDGNPSTLPYGSLGPEDMC 381

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
              +  LA EAARQ +VLLKN+  ALP    +   LA++G HA+AT+ M+GNYEG PC++
Sbjct: 382 TEDNQHLALEAARQSLVLLKNEKNALPWKKTHGLKLAVIGHHADATREMLGNYEGYPCKF 441

Query: 424 TSPMDGFYA----YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
            SP+ GF      +S  I++  GC+D  C++   I AA +AA  ADA V+V G+  + E 
Sbjct: 442 VSPLQGFAKVLSDHSPRISHERGCSDAACEDQFYIYAAKEAAAQADAVVLVLGISQAQEK 501

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKG-PVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
           EG+DR  LLLPG Q EL++ V +A+ G PV LV++S   +D++FA ++P+I+SI+W GYP
Sbjct: 502 EGRDRDSLLLPGRQMELVSSVVEASAGRPVVLVLLSGSPLDVSFANDDPRIQSIIWAGYP 561

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP--VNNFPGRTYKFFDGP 596
           G+ GG AIA+ IFG  NPGGRL  +WY  NY  I  ++M +RP     +PGRTY+FF   
Sbjct: 562 GQSGGEAIAEAIFGLVNPGGRLAQSWYYENYTNIDMSNMNMRPNASTGYPGRTYRFFTDT 621

Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
            ++ FG+GLSY+ FKY + S+P+S+     + Q C   +  V T+   C  +  +   CK
Sbjct: 622 PLWEFGHGLSYSDFKYTMVSAPQSIMAPHLRYQLCSS-DRAVMTSDLNC--LHYEKEACK 678

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
           +  F  ++ V N G + G   V+++SKPP  GI G  +KQ++ +ERV + AG   ++ F 
Sbjct: 679 ESSFHVRVWVINHGPLSGDHSVLLFSKPPSRGIDGIPLKQLVSFERVHLEAGAGQEILFK 738

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
           +N C+ L  V +     +  G HT++VG
Sbjct: 739 VNPCEDLGTVGDDGIRTVELGEHTLMVG 766


>gi|224128360|ref|XP_002320310.1| predicted protein [Populus trichocarpa]
 gi|222861083|gb|EEE98625.1| predicted protein [Populus trichocarpa]
          Length = 635

 Score =  645 bits (1664), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 316/646 (48%), Positives = 430/646 (66%), Gaps = 28/646 (4%)

Query: 111 QTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQD 170
           Q VS EARAM+N G AGLT+WSPN+N+ RDPRWGR  ETPGEDP VVG+YA +YVRGLQ 
Sbjct: 2   QVVSDEARAMFNGGVAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVVGKYAASYVRGLQG 61

Query: 171 VEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMC 230
                    SD   LK++ACCKH+ AYDLDNW G DRFHF++ V++QDM++TF +PF MC
Sbjct: 62  ---------SDGNRLKVAACCKHFTAYDLDNWNGVDRFHFNAEVSKQDMEDTFDVPFRMC 112

Query: 231 VNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLN 290
           V EG V+SVMCSYN+VNGIPTCADP LL +T+RG       +      ++ I+ S+  L 
Sbjct: 113 VKEGKVASVMCSYNQVNGIPTCADPNLLKKTVRGT------LFQTVTLLEFIMGSNTILQ 166

Query: 291 DTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
             ++     + +A LDLDCG +    T  AV++G + EA+I+ +L     V MRLG FDG
Sbjct: 167 PRRKQPRMLLKQASLDLDCGPFLGQHTEDAVKKGLLNEAEINNALLNTLTVQMRLGMFDG 226

Query: 351 SPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
            P    Y NLG N++C P H ELA EAARQGIVLLKN   +LPL+T    ++A+VGP++N
Sbjct: 227 EPSSQLYGNLGPNDVCTPAHQELALEAARQGIVLLKNHGPSLPLSTRRHLSVAIVGPNSN 286

Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADAT 467
            T  MIGNY G  C YT+P+ G   Y++ I +  GCAD+ C ++    AAIDAA+ ADAT
Sbjct: 287 VTATMIGNYAGLACGYTTPLQGIQRYAQTI-HRQGCADVACVSDQQFSAAIDAARQADAT 345

Query: 468 VIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
           V+V GLD S+EAE +DR  LLLPG Q EL++KVA A+KGP  LV+MS G +D++FA+N+P
Sbjct: 346 VLVMGLDQSIEAEFRDRTGLLLPGRQQELVSKVAAASKGPTILVLMSGGPIDVSFAENDP 405

Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN-- 584
           KI SI+W GYPG+ GG AI+DV+FG  NPGG+LP+TWY  +Y+  +P T+M +R   +  
Sbjct: 406 KIGSIVWAGYPGQAGGAAISDVLFGITNPGGKLPMTWYPQDYITNLPMTNMAMRSSKSKG 465

Query: 585 FPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPP 644
           +PGRTY+F+ G VVYPFG+G+SYT F + +AS+P  V + LD  +      +  G     
Sbjct: 466 YPGRTYRFYKGKVVYPFGHGISYTNFVHTIASAPTMVSVPLDGHR------HGSGNATIS 519

Query: 645 CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIA 704
             A+ +   +C       Q++V+N G MDG+  ++VYS+PP       KQ++ +E+V +A
Sbjct: 520 GKAIRVTHARCNRLSLGMQVDVKNTGSMDGTHTLLVYSRPPARHWAPHKQLVAFEKVHVA 579

Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
           AG   +VG  ++ CKSL +VD +    +  G H++ +G+    VS 
Sbjct: 580 AGTQQRVGINIHVCKSLSVVDGSGIRRIPMGEHSLHIGDVKHSVSL 625


>gi|297611657|ref|NP_001067709.2| Os11g0291000 [Oryza sativa Japonica Group]
 gi|255680005|dbj|BAF28072.2| Os11g0291000 [Oryza sativa Japonica Group]
          Length = 764

 Score =  645 bits (1663), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 331/754 (43%), Positives = 458/754 (60%), Gaps = 39/754 (5%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +CDA L   +RA DLV  +TL EKV Q+GD A GV RLG+P YEWWSE LHG+S  GR  
Sbjct: 31  FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGR-- 88

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTFWS 132
               G  F+  V   TSFP VILT A+F+  LW+++G+ V  EARA+YNLG A GLT WS
Sbjct: 89  ----GIRFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWS 144

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PN+N+ RDPRWGR  ETPGEDP    RYA+ +V GLQ + G            + SACCK
Sbjct: 145 PNVNIFRDPRWGRGQETPGEDPVTASRYAVAFVTGLQGIGG------------EASACCK 192

Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           H  AYDLD W    R+++DS+VT QD+++T+  PF+ CV EG  + +MC YN +NG+P C
Sbjct: 193 HATAYDLDYWNNVVRYNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIMCGYNSINGVPAC 252

Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
           A   LL + +R +W  +GY+ SDCD++ TI ++H +   + ED VA  +K G+D++CG+Y
Sbjct: 253 ASSDLLTKKVRQEWGMNGYVASDCDAVATIRDAHHY-TLSPEDTVAVSIKVGMDVNCGNY 311

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQHI 368
                M AVQ+G + E DID +L  L+ V MRLG+FDG P+    Y +LG  ++C+P H 
Sbjct: 312 TQVHAMAAVQKGNLTEKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHK 371

Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
            LA EAA+ GIVLLKND GALPL    + +LA++GP+A+   A+ GNY G PC  T+P+ 
Sbjct: 372 SLALEAAQDGIVLLKNDAGALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQ 431

Query: 429 GFYAY-SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
           G   Y      +  GC    C   +    A   A ++D  V+  GL    E +G DR  L
Sbjct: 432 GIKGYLGDRARFLAGCDSPACAVAATN-EAAALASSSDHVVLFMGLSQKQEQDGLDRTSL 490

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
           LLPG Q  LI  VA+AA+ PV LV+++ G VD+ FAK+NPKI +ILW GYPG+ GG AIA
Sbjct: 491 LLPGEQQGLITAVANAARRPVILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAGGLAIA 550

Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGL 605
            V+FG +NP GRLP+TWY   + K+P T M +R  P   +PGR+Y+F+ G  VY FGYGL
Sbjct: 551 KVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRFYQGNTVYNFGYGL 610

Query: 606 SYTQFKYKVASS---PKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYK 659
           SY++F  ++ SS     + ++ L      R      G +    ++ L+ ++   +C    
Sbjct: 611 SYSKFSRRMFSSFSTSNAGNLSLLAGVMAR----RAGDDGGGMSSYLVKEIGVERCSRLV 666

Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNAC 718
           F   +EV+N G MDG   V++Y + P  +G    +Q+IG+    +  G+ A V F ++ C
Sbjct: 667 FPAVVEVQNHGPMDGKHSVLMYLRWPTTSGGRPARQLIGFRSQHVKVGEKAMVSFEVSPC 726

Query: 719 KSLKIVDNAANSLLASGAHTILVGEGVGGVSFPL 752
           +    V      ++  GAH ++VG+     SF L
Sbjct: 727 EHFSWVGEDGERVIDGGAHFLMVGDEELETSFGL 760


>gi|32488698|emb|CAE03635.1| OSJNBb0003B01.27 [Oryza sativa Japonica Group]
          Length = 839

 Score =  644 bits (1660), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 319/646 (49%), Positives = 438/646 (67%), Gaps = 26/646 (4%)

Query: 105 LWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINY 164
           ++  I   VSTEARAM+N+G AGLTFWSPNIN+ RDPRWGR  ETPGEDP +  +YA+ Y
Sbjct: 204 MYNLIVLVVSTEARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGY 263

Query: 165 VRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFI 224
           V GLQD  G        S  LK++ACCKHY AYD+DNW+G +R+ FD+ V++QD+ +TF 
Sbjct: 264 VTGLQDAGG-------GSDALKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQ 316

Query: 225 LPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE 284
            PF+ CV +G+V+SVMCSYN+VNG PTCAD  LL+  IRGDW  +GYIVSDCDS+  +  
Sbjct: 317 PPFKSCVIDGNVASVMCSYNKVNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYN 376

Query: 285 SHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMR 344
           +  +  +  EDA A  +K+GLDL+CG++    T+ AVQ GK++E+D+D ++   +IVLMR
Sbjct: 377 NQHYTKN-PEDAAAITIKSGLDLNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMR 435

Query: 345 LGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLAL 401
           LG+FDG P+   + +LG  ++C   + ELA EAARQGIVLLKN  GALPL+  +IK++A+
Sbjct: 436 LGFFDGDPRKLPFGSLGPKDVCTSSNQELAREAARQGIVLLKN-TGALPLSAKSIKSMAV 494

Query: 402 VGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDA 460
           +GP+ANA+  MIGNYEGTPC+YT+P+ G  A    + Y PGC ++ C  NS+ + AA  A
Sbjct: 495 IGPNANASFTMIGNYEGTPCKYTTPLQGLGANVATV-YQPGCTNVGCSGNSLQLSAATQA 553

Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
           A +AD TV+V G D SVE E  DR  LLLPG Q +L++ VA+A++GPV LV+MS G  DI
Sbjct: 554 AASADVTVLVVGADQSVERESLDRTSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDI 613

Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPL 579
           +FAK++ KI +ILWVGYPGE GG A+AD++FG +NPGGRLP+TWY A++  K+  T M +
Sbjct: 614 SFAKSSDKISAILWVGYPGEAGGAALADILFGYHNPGGRLPVTWYPASFADKVSMTDMRM 673

Query: 580 RP--VNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
           RP     +PGRTY+F+ G  VY FG GLSYT+F + + S+P+ V ++L +   C      
Sbjct: 674 RPDSSTGYPGRTYRFYTGDTVYAFGDGLSYTKFAHSLVSAPEQVAVQLAEGHAC------ 727

Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIG 697
              +   C +V      C    F   + V N G M G   V ++S PP +     K ++G
Sbjct: 728 ---HTEHCFSVEAAGEHCGSLSFDVHLRVRNAGGMAGGHTVFLFSSPPSVHSAPAKHLLG 784

Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
           +E+V +  GQ+  V F ++ CK L +VD   N  +A G+HT+ VG+
Sbjct: 785 FEKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGD 830



 Score =  110 bits (275), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 54/103 (52%), Positives = 69/103 (66%), Gaps = 6/103 (5%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           +S + +CD       RA DL+ R+TL EKV  + +    +PRLG+P YEWWSEALHGVS+
Sbjct: 40  VSGYGFCDRTKSSAARAADLLGRLTLAEKVGFLVNKQAALPRLGIPAYEWWSEALHGVSY 99

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQ 111
           +G      PGT F + VPGATSFP  ILT ASFN SL++ IG+
Sbjct: 100 VG------PGTRFSTLVPGATSFPQPILTAASFNASLFRAIGE 136


>gi|357489463|ref|XP_003615019.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
 gi|355516354|gb|AES97977.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
          Length = 785

 Score =  643 bits (1659), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 338/747 (45%), Positives = 469/747 (62%), Gaps = 33/747 (4%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           + +C+  L   +RAKD+V R+TL EK+ Q+ + A  +PRLG+  Y+WWSEALHGV+  G+
Sbjct: 48  YTFCNLNLTTIQRAKDIVSRLTLDEKLAQLVNTAPAIPRLGIHSYQWWSEALHGVADYGK 107

Query: 72  --RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GL 128
             R N       +  +  AT FP VILT ASF+  LW +I + + TEARA+YN G A G+
Sbjct: 108 GIRLNG------NVTIKAATIFPQVILTAASFDSKLWYRISKVIGTEARAVYNAGQAEGM 161

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ--DVEGVEYHRDSDSRPLK 186
           TFW+PNIN+ RDPRWGR  ET GEDP V  +YA+++VRGLQ    EG + + D     LK
Sbjct: 162 TFWAPNINIFRDPRWGRGQETAGEDPLVSAKYAVSFVRGLQGDSFEGGKLNEDR----LK 217

Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
            SACCKH+ AYDLDNW+G DRF FD+ VT QD+ +T+  PF  C+ +G  S +MC+YNRV
Sbjct: 218 ASACCKHFTAYDLDNWKGVDRFDFDANVTLQDLADTYQPPFHSCIVQGRSSGIMCAYNRV 277

Query: 247 NGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD 306
           NGIP CAD  LL  T R  WNF+GYI SDC ++  I +   +     EDAVA VL+AG+D
Sbjct: 278 NGIPNCADYNLLTNTARKKWNFNGYITSDCSAVDIIHDRQGYAK-APEDAVADVLQAGMD 336

Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNIC 363
           ++CGDY+T+ +  AV Q K+  + ID +L  L+ + +RLG FDG P   +Y  +G N +C
Sbjct: 337 VECGDYFTSHSKSAVLQKKVPISQIDRALHNLFSIRIRLGLFDGHPTKLKYGKIGPNRVC 396

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT-KAMIGNYEGTPCR 422
           + Q++ +A EAAR GIVLLKN    LPL   +  ++ ++GP+AN++ + ++GNY G PC 
Sbjct: 397 SKQNLNIALEAARSGIVLLKNAASILPL-PKSTDSIVVIGPNANSSSQVVLGNYFGRPCN 455

Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
             + + GF  YS  + Y PGC+D     ++ I  A++ AK  D  V+V GLD S E+EG 
Sbjct: 456 LVTILQGFENYSDNLLYHPGCSDGTKCVSAEIDRAVEVAKVVDYVVLVMGLDQSQESEGH 515

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
           DR DL LPG Q ELIN VA A+K PV LV+   G VDI+FAK + KI  ILW GYPGE G
Sbjct: 516 DRDDLELPGKQQELINSVAKASKRPVILVLFCGGPVDISFAKVDDKIGGILWAGYPGELG 575

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYP 600
           G A+A V+FG YNPGGRLP+TWY  +++KIP T M +R  P + +PGRTY+F+ GP VY 
Sbjct: 576 GMALAQVVFGDYNPGGRLPMTWYPKDFIKIPMTDMRMRADPSSGYPGRTYRFYTGPKVYE 635

Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKD 657
           FGYGLSY+ + Y   S      +K +     +   Y++          L+ ++    CK 
Sbjct: 636 FGYGLSYSNYSYNFIS------VKNNNLHINQSTTYSILEKSQTIHYKLVSELGKKACKT 689

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
              +  + + N G M G   V+++ KP  G  G  +KQ++G+E V +  G   +VGF ++
Sbjct: 690 MSISVTLGITNTGSMAGKHPVLLFVKPKKGRNGNPVKQLVGFESVTVEGGGKGEVGFEVS 749

Query: 717 ACKSLKIVDNAANSLLASGAHTILVGE 743
            C+ L   + +   ++  G +  LVGE
Sbjct: 750 VCEHLSRANESGVKVIEEGGYLFLVGE 776


>gi|222618262|gb|EEE54394.1| hypothetical protein OsJ_01415 [Oryza sativa Japonica Group]
          Length = 776

 Score =  642 bits (1655), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 354/778 (45%), Positives = 475/778 (61%), Gaps = 76/778 (9%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
           RF +  + ++ FPYCDA LPY +R +DLV RMTL EKV  +GD A G PR+GLP Y    
Sbjct: 51  RFAAAGLDMAGFPYCDASLPYADRVRDLVGRMTLEEKVANLGDRAGGAPRVGLPRYCGGG 110

Query: 61  EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
                     RR           ++P       V+   A          G       + M
Sbjct: 111 RRCTACPTSARRDVVWRRRARRHQLPARHQQRRVVQRDAVARHRRRGVDGD------QGM 164

Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
           YNLG+A LT+WSPNINVVRDPRWGR  ETPGEDP+VVGRYA+N+VRG+QD++G      +
Sbjct: 165 YNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDIDGATTAASA 224

Query: 181 D------SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
                  SRP+K+S+CCKHYAA                                      
Sbjct: 225 AAATDAFSRPIKVSSCCKHYAA-------------------------------------- 246

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
               VMCSYNR+NG+P CAD +LL +T+R DW  HGYIVSDCDS++ +V   K+L  T  
Sbjct: 247 ---CVMCSYNRINGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTGV 303

Query: 295 DAVARVLKAGLDLDCG-------DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
           +A A  +KAGLDLDCG       D++T + + AV+QGK+ E+ +D +L  LY+ LMRLG+
Sbjct: 304 EATAAAMKAGLDLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLGF 363

Query: 348 FDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGP--H 405
           FDG P+ ++LG  ++C  +H ELAA+AARQG+VLLKND   LPL+   + ++AL G   H
Sbjct: 364 FDGIPELESLGAADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFGQLQH 423

Query: 406 ANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNAD 465
            NAT  M+G+Y G PCR  +P DG     KV++     A   C   S    A  AAK  D
Sbjct: 424 INATDVMLGDYRGKPCRVVTPYDGV---RKVVSSTSVHA---CDKGS-CDTAAAAAKTVD 476

Query: 466 ATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
           AT++VAGL++SVE E  DR DLLLP  Q   IN VA+A+  P+ LVIMSAG VD++FA++
Sbjct: 477 ATIVVAGLNMSVERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSFAQD 536

Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--V 582
           NPKI +++W GYPGEEGG AIADV+FGKYNPGGRLP+TWY+  YV KIP TSM LRP   
Sbjct: 537 NPKIGAVVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKNEYVSKIPMTSMALRPDAE 596

Query: 583 NNFPGRTYKFFDGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTN 641
           + +PGRTYKF+ G  V+YPFG+GLSYT F Y  A++   V +K+   + C+ + Y  G +
Sbjct: 597 HGYPGRTYKFYGGADVLYPFGHGLSYTNFTYASATAAAPVTVKVGAWEYCKQLTYKAGVS 656

Query: 642 KPP-CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG-IAGTHIKQVIGYE 699
            PP C AV +    C++ + +F + V N G  DG+ VV +Y+ PP  + G   KQ++ + 
Sbjct: 657 SPPACPAVNVASHACQE-EVSFAVTVANTGGRDGTHVVPMYTAPPAEVDGAPRKQLVAFR 715

Query: 700 RVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLN 757
           RV +AAG + +V F +N CK+  IV+  A +++ SG   +LVG+    +SFP+Q++L 
Sbjct: 716 RVRVAAGAAVEVAFALNVCKAFAIVEETAYTVVPSGVSRVLVGDDALSLSFPVQIDLQ 773


>gi|125534137|gb|EAY80685.1| hypothetical protein OsI_35867 [Oryza sativa Indica Group]
          Length = 779

 Score =  640 bits (1652), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 318/744 (42%), Positives = 448/744 (60%), Gaps = 33/744 (4%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           F +C+A LP  +RA DLV R+T  EKV Q+GD A GVPRLG+P+Y+WWSEALHG++  G+
Sbjct: 48  FAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVYKWWSEALHGLAISGK 107

Query: 72  RTNSPPGTHF-DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLT 129
                 G HF +     ATSFP VI T A+F++ LW +IGQ +  E RA YNLG A GL 
Sbjct: 108 ------GIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFYNLGQAEGLA 161

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
            WSPN+N+ RDPRWGR  ETPGEDP    +Y   +V+GLQ          S    L+ SA
Sbjct: 162 MWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQG---------SSLTNLQTSA 212

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKH  AYD++ W+G  R++F+++VT QD+ +T+  PF  CV +G  S +MC+Y  +NG+
Sbjct: 213 CCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLINGV 272

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           P CA   LL +T+RG+W   GY  SDCD++  + +S  F   T E+AVA  LKAGLD++C
Sbjct: 273 PACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTR-TAEEAVAVALKAGLDINC 331

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNP 365
           G Y       A+QQGK+ E D+D +L+ L+ + MRLG+FDG P+    Y  LG  ++C P
Sbjct: 332 GVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLGAADVCTP 391

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            H  LA EAAR+G+VLLKND   LPL    + + A++G +AN   A++GNY G PC  T+
Sbjct: 392 VHKALALEAARRGVVLLKNDARLLPLRAPTVSSAAVIGHNANDILALLGNYYGLPCETTT 451

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P  G   Y K   + PGC+   C + +    A   AK++D   +V GL    E EG DR 
Sbjct: 452 PFGGIQKYVKSAKFLPGCSSAAC-DVAATDQATALAKSSDYVFLVMGLSQKQEQEGLDRT 510

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            LLLPG Q  LI  VA A+K PV L++++ G VDI FA+ NPKI +ILW GYPG+ GG+A
Sbjct: 511 SLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQA 570

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
           IADV+FG++NP G+LP+TWY   + K   T M +R  P   +PGR+Y+F+ G  VY FGY
Sbjct: 571 IADVLFGEFNPSGKLPVTWYPEEFTKFTMTDMRMRPDPATGYPGRSYRFYKGKTVYKFGY 630

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYKF 660
           GLSY++F  ++ S   +         +         T     A   +D++   +C+  +F
Sbjct: 631 GLSYSKFACRIVSGAGNS----SSYGKAALAGLRAATTPEGDAVYRVDEIGDDRCERLRF 686

Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGI-AGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
              +EV+N G MDG   V+++ +      G  ++Q+IG+    +  G+  K+   ++ C+
Sbjct: 687 PVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLIGFRNQHLKVGEKKKLKMEISPCE 746

Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
            L         ++  G+H ++V E
Sbjct: 747 HLSRARVDGEKVIDRGSHFLMVEE 770


>gi|62734691|gb|AAX96800.1| Glycosyl hydrolase family 3 C terminal domain, putative [Oryza
           sativa Japonica Group]
 gi|77549994|gb|ABA92791.1| beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 853

 Score =  640 bits (1650), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 317/744 (42%), Positives = 447/744 (60%), Gaps = 33/744 (4%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           F +C+A LP  +RA DLV R+T  EKV Q+GD A GVPRLG+P+Y+WWSEALHG++  G+
Sbjct: 122 FAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVYKWWSEALHGLAISGK 181

Query: 72  RTNSPPGTHF-DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLT 129
                 G HF +     ATSFP VI T A+F++ LW +IGQ +  E RA YNLG A GL 
Sbjct: 182 ------GIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFYNLGQAEGLA 235

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
            WSPN+N+ RDPRWGR  ETPGEDP    +Y   +V+GLQ          S    L+ SA
Sbjct: 236 MWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQG---------SSLTNLQTSA 286

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKH  AYD++ W+G  R++F+++VT QD+ +T+  PF  CV +G  S +MC+Y  +NG+
Sbjct: 287 CCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLINGV 346

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           P CA   LL +T+RG+W   GY  SDCD++  + +S  F   T E+AVA  LKAGLD++C
Sbjct: 347 PACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHF-TRTAEEAVAVALKAGLDINC 405

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNP 365
           G Y       A+QQGK+ E D+D +L+ L+ + MRLG+FDG P+    Y  L   ++C P
Sbjct: 406 GVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAADVCTP 465

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            H  LA EAAR+G+VLLKND   LPL    + + A++G +AN   A++GNY G PC  T+
Sbjct: 466 VHKALALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLPCETTT 525

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P  G   Y K   + PGC+   C + +    A   AK++D   +V GL    E EG DR 
Sbjct: 526 PFGGIQKYVKSAKFLPGCSSAAC-DVAATDQATALAKSSDYVFLVMGLSQKQEQEGLDRT 584

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            LLLPG Q  LI  VA A+K PV L++++ G VDI FA+ NPKI +ILW GYPG+ GG+A
Sbjct: 585 SLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQA 644

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
           IADV+FG++NP G+LP+TWY   + K   T M +R  P   +PGR+Y+F+ G  VY FGY
Sbjct: 645 IADVLFGEFNPSGKLPVTWYPEEFTKFTMTDMRMRPDPATGYPGRSYRFYKGKTVYKFGY 704

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYKF 660
           GLSY++F  ++ S   +         +         T     A   +D++   +C+  +F
Sbjct: 705 GLSYSKFACRIVSGAGNS----SSYGKAALAGLRAATTPEGDAVYRVDEIGDDRCERLRF 760

Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGI-AGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
              +EV+N G MDG   V+++ +      G  ++Q+IG+    +  G+  K+   ++ C+
Sbjct: 761 PVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLIGFRNQHLKVGEKKKLKMEISPCE 820

Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
            L         ++  G+H ++V E
Sbjct: 821 HLSRARVDGEKVIDRGSHFLMVEE 844


>gi|115485163|ref|NP_001067725.1| Os11g0297300 [Oryza sativa Japonica Group]
 gi|113644947|dbj|BAF28088.1| Os11g0297300 [Oryza sativa Japonica Group]
          Length = 779

 Score =  638 bits (1646), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 317/744 (42%), Positives = 447/744 (60%), Gaps = 33/744 (4%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           F +C+A LP  +RA DLV R+T  EKV Q+GD A GVPRLG+P+Y+WWSEALHG++  G+
Sbjct: 48  FAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVYKWWSEALHGLAISGK 107

Query: 72  RTNSPPGTHF-DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLT 129
                 G HF +     ATSFP VI T A+F++ LW +IGQ +  E RA YNLG A GL 
Sbjct: 108 ------GIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFYNLGQAEGLA 161

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
            WSPN+N+ RDPRWGR  ETPGEDP    +Y   +V+GLQ          S    L+ SA
Sbjct: 162 MWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQG---------SSLTNLQTSA 212

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKH  AYD++ W+G  R++F+++VT QD+ +T+  PF  CV +G  S +MC+Y  +NG+
Sbjct: 213 CCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLINGV 272

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           P CA   LL +T+RG+W   GY  SDCD++  + +S  F   T E+AVA  LKAGLD++C
Sbjct: 273 PACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTR-TAEEAVAVALKAGLDINC 331

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNP 365
           G Y       A+QQGK+ E D+D +L+ L+ + MRLG+FDG P+    Y  L   ++C P
Sbjct: 332 GVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAADVCTP 391

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            H  LA EAAR+G+VLLKND   LPL    + + A++G +AN   A++GNY G PC  T+
Sbjct: 392 VHKALALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLPCETTT 451

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P  G   Y K   + PGC+   C + +    A   AK++D   +V GL    E EG DR 
Sbjct: 452 PFGGIQKYVKSAKFLPGCSSAAC-DVAATDQATALAKSSDYVFLVMGLSQKQEQEGLDRT 510

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            LLLPG Q  LI  VA A+K PV L++++ G VDI FA+ NPKI +ILW GYPG+ GG+A
Sbjct: 511 SLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQA 570

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
           IADV+FG++NP G+LP+TWY   + K   T M +R  P   +PGR+Y+F+ G  VY FGY
Sbjct: 571 IADVLFGEFNPSGKLPVTWYPEEFTKFTMTDMRMRPDPATGYPGRSYRFYKGKTVYKFGY 630

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYKF 660
           GLSY++F  ++ S   +         +         T     A   +D++   +C+  +F
Sbjct: 631 GLSYSKFACRIVSGAGNS----SSYGKAALAGLRAATTPEGDAVYRVDEIGDDRCERLRF 686

Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGI-AGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
              +EV+N G MDG   V+++ +      G  ++Q+IG+    +  G+  K+   ++ C+
Sbjct: 687 PVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLIGFRNQHLKVGEKKKLKMEISPCE 746

Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
            L         ++  G+H ++V E
Sbjct: 747 HLSRARVDGEKVIDRGSHFLMVEE 770


>gi|222629257|gb|EEE61389.1| hypothetical protein OsJ_15562 [Oryza sativa Japonica Group]
          Length = 771

 Score =  638 bits (1645), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 330/743 (44%), Positives = 458/743 (61%), Gaps = 25/743 (3%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S +P+C+A LP+P RA+ LV  +TL EK+ Q+  L +   R            + GV   
Sbjct: 36  SAYPFCNATLPFPARARALVSLLTLDEKIAQL--LQHRRGRPPPRRPAL--RVVVGVPST 91

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
              T  P  T     V  AT FP VIL+ A+FN SLW+   + ++ EARAM+N G AGLT
Sbjct: 92  ASATTGPGSTSPRGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAGLT 151

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
           FW+PNINV RDPRWGR  ETPGEDP VV  Y++ YV+G Q   G E         + +SA
Sbjct: 152 FWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGEEGR-------MMLSA 204

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKHY AYDL+ W G  R+ F+++V  QDM++T+  PF+ C+ EG  S +MCSYN+VNG+
Sbjct: 205 CCKHYIAYDLEKWRGFTRYTFNAKVNAQDMEDTYQPPFKSCIQEGRASCLMCSYNQVNGV 264

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           P CA   +L Q  R +W F GYI SDCD++  I E+  +   + ED++A VLKAG+D++C
Sbjct: 265 PACARKDIL-QRARDEWGFQGYITSDCDAVAIIHENQTY-TASDEDSIAVVLKAGMDINC 322

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
           G +    T  A+++GK+ E DI+ +L  L+ V +RLG+FD + +   +  LG NN+C  +
Sbjct: 323 GSFLIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCTTE 382

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
           H ELAAEA RQG VLLKNDNG LPL    +  +AL+GP AN    + G+Y G PC  T+ 
Sbjct: 383 HRELAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHSTTF 442

Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
           + G  AY     +A GC D+ C +      AI+AAK AD  V++AGL+L+ E E  DRV 
Sbjct: 443 VKGMQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDRVS 502

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           LLLPG Q +LI+ VA   K PV LV+M  G VD++FAK++P+I SILW+GYPGE GG  +
Sbjct: 503 LLLPGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGNVL 562

Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
            +++FGKYNPGG+LPITWY  ++  +P   M +R      +PGRTY+F+ G VVY FGYG
Sbjct: 563 PEILFGKYNPGGKLPITWYPESFTAVPMDDMNMRADASRGYPGRTYRFYTGDVVYGFGYG 622

Query: 605 LSYTQFKYKVASSPKSVDIKLDK--DQQCRDINYTVGTNKPPCAAVLIDDV-KCKDYKFT 661
           LSY+++ Y +  +PK + +      D   R   Y   T +     V ++D+  C+  +F 
Sbjct: 623 LSYSKYSYSILQAPKKISLSRSSVPDLISRKPAY---TRRDGVDYVQVEDIASCEALQFP 679

Query: 662 FQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
             I V N G MDGS  V+++ S  P   G+ IKQ++G+ERV  AAG+S  V  T++ CK 
Sbjct: 680 VHISVSNDGAMDGSHAVLLFASSKPSFPGSPIKQLVGFERVHTAAGRSTDVEITVDPCKL 739

Query: 721 LKIVDNAANSLLASGAHTILVGE 743
           +   +     +L  G H ++VG+
Sbjct: 740 MSFANTEGTRVLFLGTHVLMVGD 762


>gi|371917284|dbj|BAL44718.1| SlArf/Xyl3 [Solanum lycopersicum]
          Length = 777

 Score =  638 bits (1645), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 321/744 (43%), Positives = 460/744 (61%), Gaps = 27/744 (3%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S +P+C+A LP P+R  DLV R+T+ EK+ Q+ + A  +PRLG+  YEWWSE LHG+S  
Sbjct: 42  SSYPFCNAALPIPQRVNDLVSRLTVDEKILQLVNGAPEIPRLGISAYEWWSEGLHGISRH 101

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN-AGL 128
           G+      GT F+  +  AT FP +ILT +SF+E+LW +I Q +  EARA+YN G   G+
Sbjct: 102 GK------GTLFNGTIKAATQFPQIILTASSFDENLWYRIAQAIGREARAVYNAGQLKGI 155

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T W+PNIN++RDPRWGR  ETPGEDP +VG+Y + YVRGLQ  +  E  +  D   L+ S
Sbjct: 156 TLWAPNINILRDPRWGRGQETPGEDPMMVGKYGVAYVRGLQG-DSFEGGKLKDGH-LQTS 213

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKH+ A D+DNW    R+ FD++V +QD+ +++  PF+ CV +G  SSVMC+YN VNG
Sbjct: 214 ACCKHFIAQDMDNWHNFSRYTFDAQVLKQDLADSYEPPFKDCVEQGKASSVMCAYNLVNG 273

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           IP CA+  LL  T RG W   GYIVSDCD++  +     +  +  EDAVA  LKAG+D++
Sbjct: 274 IPNCANFDLLTTTARGKWGLQGYIVSDCDAVDKMYSEQHYAKEP-EDAVAATLKAGMDVN 332

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNP 365
           CG +   +T  A+++ K+ E+DID +L  L+ V MRLG F+G P   +Y ++    +C+ 
Sbjct: 333 CGSHLKTYTKSALEKQKVKESDIDRALHNLFSVRMRLGLFNGDPSKLEYGDISAAEVCSE 392

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H  LA EAAR G VLLKN N  LPL+     +LA++GP AN ++ ++GNYEG  C+  +
Sbjct: 393 EHRALAVEAARSGSVLLKNSNRLLPLSKMKTASLAVIGPKANDSEVLLGNYEGFSCKNVT 452

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
              G   Y     Y PGC  I C + + I  A++ AK AD  V+V GLD ++E E  DR 
Sbjct: 453 LFQGLQGYVANTMYHPGCDFINCTSPA-IDEAVNIAKKADYVVLVMGLDQTLEREKFDRT 511

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           +L LPG Q +LI  +A+AA  PV LV+M  G VD+ FAK+NPKI  ILWVGYPGE G  A
Sbjct: 512 ELGLPGMQEKLITSIAEAASKPVILVLMCGGPVDVTFAKDNPKIGGILWVGYPGEGGAAA 571

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGY 603
           +A ++FG++NPGGR P+TWY   + K+    M +RP ++  +PGRTY+F++GP V+ FGY
Sbjct: 572 LAQILFGEHNPGGRSPVTWYPKEFNKVAMNDMRMRPESSSGYPGRTYRFYNGPKVFEFGY 631

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK---CKDYKF 660
           GLSYT + Y  AS  K+  +        ++      T K     + + DV    C     
Sbjct: 632 GLSYTNYSYTFASVSKNQLL-------FKNPKINQSTEKGSVLNIAVSDVGPEVCNSAMI 684

Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPG-IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
           T ++ V+N G+M G   V+++ K    +     K +IG++ V + AG + +V F +  C+
Sbjct: 685 TVKVAVKNQGEMAGKHPVLLFLKHSSTVDEVPKKTLIGFKSVNLEAGANTQVTFDVKPCE 744

Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
                +     ++  G H +L+G+
Sbjct: 745 HFTRANRDGTLVIDEGKHFLLLGD 768


>gi|253761860|ref|XP_002489304.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
 gi|241946952|gb|EES20097.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
          Length = 750

 Score =  634 bits (1636), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 327/752 (43%), Positives = 451/752 (59%), Gaps = 46/752 (6%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +P+CD  LP   RA DLV R+T+ EKV Q+GD A GVPRLG+P Y+WWSE LHG++F G 
Sbjct: 30  YPFCDRSLPAARRAADLVSRLTVAEKVSQLGDEAAGVPRLGVPPYKWWSEGLHGLAFWGH 89

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
                 G  F+  V G TSFP V+LTTASF++ LW +IGQ +  EARA+YNLG A GLT 
Sbjct: 90  ------GMRFNGTVTGVTSFPQVLLTTASFDDGLWFRIGQAIGREARALYNLGQAEGLTI 143

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           WSPN+N+ RDPRWGR  ETPGEDP V  +YA+ +VRG+Q            + PL+ SAC
Sbjct: 144 WSPNVNIFRDPRWGRGQETPGEDPAVASKYAVAFVRGIQGSS-----AAGAAAPLQASAC 198

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH  AYDL++W G  R++FD+RVT QD+ +TF  PF+ CV +G  + VMC+Y  +NG+P
Sbjct: 199 CKHATAYDLEDWNGVARYNFDARVTAQDLADTFNPPFQSCVVDGKATCVMCAYTGINGVP 258

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
            CA   LL +T RG W   GY+ SDCD++  + ++ +++  T ED VA  LK        
Sbjct: 259 ACASSDLLTKTFRGAWGHDGYVSSDCDAVAIMHDAQRYV-PTPEDTVAVALK-------- 309

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQ 366
                  M A+QQGK+ E D+D +L  L+ V MRLG+FDG P+    Y +LG  ++C   
Sbjct: 310 ----EHGMAAIQQGKMTEKDVDKALTNLFAVRMRLGHFDGDPRGNALYGHLGAADVCTAD 365

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
           H  LA EAA+ GIVLLKND G LPL+   + + A++G +AN    + GNY G  C  T+P
Sbjct: 366 HKNLALEAAQDGIVLLKNDAGILPLDRSAMGSAAVIGHNANDALVLRGNYFGPACETTTP 425

Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
           + G  +Y   + +  GC+   C   +    A   A +++   +  GL    E EG DR  
Sbjct: 426 LQGVQSYVSNVRFLAGCSSAAC-GYAATGQAAALASSSEYVFLFMGLSQDQEKEGLDRTS 484

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           LLLPG Q  LI  VA AAK PV LV+++ G VDI FA++NPKI +ILW GYPG+ GG AI
Sbjct: 485 LLLPGKQQSLITAVASAAKRPVILVLLTGGPVDITFAQSNPKIGAILWAGYPGQAGGLAI 544

Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
           A V+FG +NP GRLP+TWY   + K+P T M +R  P N +PGR+Y+F+ G  +Y FGYG
Sbjct: 545 ARVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRADPANGYPGRSYRFYRGNTIYKFGYG 604

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL--IDDV---KCKDYK 659
           LSY++F  ++ +          K+Q    +     T K   A     +DD+    C+  +
Sbjct: 605 LSYSKFSRQLVTG--------GKNQLASLLAGLSATTKDDDATSYYHVDDIGADGCEQLR 656

Query: 660 FTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
           F  ++EV+N G MDG   V+++ + P    G  + Q+IG+    I AG+ A V F +  C
Sbjct: 657 FPAEVEVQNHGPMDGKHSVLMFLRWPNATDGRPVSQLIGFTSQHIKAGEKANVRFDVRPC 716

Query: 719 KSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
           +           ++  G+H ++VG+    VSF
Sbjct: 717 EHFSRARADGKKVIDRGSHFLMVGKEEVEVSF 748


>gi|357138088|ref|XP_003570630.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 1026

 Score =  631 bits (1628), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 318/620 (51%), Positives = 411/620 (66%), Gaps = 22/620 (3%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S +P+CD KLP  +RA DL  R+T+ EKV  +GD++ GVPRLG+P Y+WWSEALHGV+  
Sbjct: 34  SSYPFCDRKLPIGQRAADLASRLTVEEKVSLLGDVSPGVPRLGVPAYKWWSEALHGVA-- 91

Query: 70  GRRTNSPP---GTHFD-SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
               N+P    G  FD   V  ATSFP V++T ASFN  LW +IGQ +  EAR +YN G 
Sbjct: 92  ----NAPADRAGVRFDDGPVRAATSFPQVLVTAASFNPHLWYRIGQVIGREARGIYNSGQ 147

Query: 126 A-GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
           A GLTFW+PNINV RDPRWGR  ETPGEDP + G+YA  +VRG+Q   G       +S  
Sbjct: 148 AEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQ---GYGASGAVNSSG 204

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
           L+ SACCKH+ AYDL+NW G  RF F+++V+EQD+ +T+  PF  CV +G  S +MCSYN
Sbjct: 205 LEASACCKHFTAYDLENWNGVTRFAFNAKVSEQDLADTYNPPFRSCVEDGGASGIMCSYN 264

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           RVNG+PTCAD  LL++T RGDW F+GYI SDCD++  I +   +  +  EDAVA VLKAG
Sbjct: 265 RVNGVPTCADHNLLSKTARGDWRFNGYITSDCDAVAIIHDVQGYAKE-PEDAVADVLKAG 323

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYK---NLGKNN 361
           +D++CGDY     + A  QGKI E DID +L+ L+ + MRLG FDG+P+Y    N+G + 
Sbjct: 324 MDVNCGDYVQKHGVSAFHQGKITEQDIDRALQNLFAIRMRLGLFDGNPKYNRYGNIGADQ 383

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           +C  +H +LA EAA+ GIVLLKND G LPL    I +LA++G +AN  + + GNY G PC
Sbjct: 384 VCKKEHQDLALEAAQDGIVLLKNDAGTLPLPKQKISSLAVIGHNANDAQRLQGNYFGPPC 443

Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
              SP+     Y +   +  GC   VC N S I  A  AA  A+  V+  GLD   E E 
Sbjct: 444 ISVSPLQALQGYVRETKFVAGCNAAVC-NVSDIAGAAKAASEAEYVVLFMGLDQDQERED 502

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
            DR++L LPG Q  L+N VADAAK PV LV++  G VD+ FAK NPKI +I+W GYPG+ 
Sbjct: 503 LDRIELGLPGMQESLVNAVADAAKKPVVLVLLCGGPVDVTFAKGNPKIGAIIWAGYPGQA 562

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLR--PVNNFPGRTYKFFDGPVV 598
           GG AIA V+FG++NPGGRLP+TWY   Y   +  T M +R      +PGRTY+F+ G  V
Sbjct: 563 GGIAIAQVLFGEHNPGGRLPVTWYPKEYATAVAMTDMRMRADASTGYPGRTYRFYKGKTV 622

Query: 599 YPFGYGLSYTQFKYKVASSP 618
           Y FGYGLSY+++ +   S P
Sbjct: 623 YNFGYGLSYSKYSHSFVSKP 642


>gi|62701894|gb|AAX92967.1| beta-xylosidase, putative [Oryza sativa Japonica Group]
 gi|77550041|gb|ABA92838.1| Glycosyl hydrolase family 3 C terminal domain containing protein
           [Oryza sativa Japonica Group]
          Length = 793

 Score =  630 bits (1626), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 331/782 (42%), Positives = 458/782 (58%), Gaps = 67/782 (8%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +CDA L   +RA DLV  +TL EKV Q+GD A GV RLG+P YEWWSE LHG+S  GR  
Sbjct: 32  FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGR-- 89

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTFWS 132
               G  F+  V   TSFP VILT A+F+  LW+++G+ V  EARA+YNLG A GLT WS
Sbjct: 90  ----GIRFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWS 145

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PN+N+ RDPRWGR  ETPGEDP    RYA+ +V GLQ + G            + SACCK
Sbjct: 146 PNVNIFRDPRWGRGQETPGEDPVTASRYAVAFVTGLQGIGG------------EASACCK 193

Query: 193 HYAAYDLDNWEGNDRFHFDSR----------------------------VTEQDMQETFI 224
           H  AYDLD W    R+++DS+                            VT QD+++T+ 
Sbjct: 194 HATAYDLDYWNNVVRYNYDSKDGASTGKSGETSSQVEKKHGPYEKGYFAVTLQDLEDTYN 253

Query: 225 LPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE 284
            PF+ CV EG  + +MC YN +NG+P CA   LL + +R +W  +GY+ SDCD++ TI +
Sbjct: 254 PPFKSCVAEGKATCIMCGYNSINGVPACASSDLLTKKVRQEWGMNGYVASDCDAVATIRD 313

Query: 285 SHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMR 344
           +H +   + ED VA  +K G+D++CG+Y     M AVQ+G + E DID +L  L+ V MR
Sbjct: 314 AHHY-TLSPEDTVAVSIKVGMDVNCGNYTQVHAMAAVQKGNLTEKDIDRALVNLFAVRMR 372

Query: 345 LGYFDGSPQ----YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLA 400
           LG+FDG P+    Y +LG  ++C+P H  LA EAA+ GIVLLKND GALPL    + +LA
Sbjct: 373 LGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKNDAGALPLQPSAVTSLA 432

Query: 401 LVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY-SKVINYAPGCADIVCQNNSMIPAAID 459
           ++GP+A+   A+ GNY G PC  T+P+ G   Y      +  GC    C   +    A  
Sbjct: 433 VIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGCDSPACAVAATN-EAAA 491

Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
            A ++D  V+  GL    E +G DR  LLLPG Q  LI  VA+AA+ PV LV+++ G VD
Sbjct: 492 LASSSDHVVLFMGLSQKQEQDGLDRTSLLLPGEQQGLITAVANAARRPVILVLLTGGPVD 551

Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
           + FAK+NPKI +ILW GYPG+ GG AIA V+FG +NP GRLP+TWY   + K+P T M +
Sbjct: 552 VTFAKDNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRM 611

Query: 580 R--PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASS---PKSVDIKLDKDQQCRDI 634
           R  P   +PGR+Y+F+ G  VY FGYGLSY++F  ++ SS     + ++ L      R  
Sbjct: 612 RADPATGYPGRSYRFYQGNTVYNFGYGLSYSKFSRRMFSSFSTSNAGNLSLLAGVMAR-- 669

Query: 635 NYTVGTNKPPCAAVLIDDV---KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH 691
               G +    ++ L+ ++   +C    F   +EV+N G MDG   V++Y + P  +G  
Sbjct: 670 --RAGDDGGGMSSYLVKEIGVERCSRLVFPAVVEVQNHGPMDGKHSVLMYLRWPTTSGGR 727

Query: 692 -IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
             +Q+IG+    +  G+ A V F ++ C+    V      ++  GAH ++VG+     SF
Sbjct: 728 PARQLIGFRSQHVKVGEKAMVSFEVSPCEHFSWVGEDGERVIDGGAHFLMVGDEELETSF 787

Query: 751 PL 752
            L
Sbjct: 788 GL 789


>gi|195614824|gb|ACG29242.1| auxin-induced beta-glucosidase [Zea mays]
 gi|413920229|gb|AFW60161.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 655

 Score =  629 bits (1621), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 317/656 (48%), Positives = 416/656 (63%), Gaps = 27/656 (4%)

Query: 120 MYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEY--H 177
           MYN G AGLTFWSPN+N+ RDPRWGR  ETPGEDP V  RYA  YVRGLQ         H
Sbjct: 1   MYNGGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQQPYAAPNGGH 60

Query: 178 RDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVS 237
           R+     LK++ACCKH+ AYDLD W G DRFHF++ V  QD+++TF +PF  CV +G  +
Sbjct: 61  RNR----LKLAACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLEDTFNVPFRACVEDGRAA 116

Query: 238 SVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV 297
           SVMCSYN+VNG+PTCAD   L  TIRG W   GYIVSDCDS+        +   T EDA 
Sbjct: 117 SVMCSYNQVNGVPTCADAAFLRGTIRGRWGLDGYIVSDCDSVDVFFRDQHYTR-TPEDAA 175

Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---Y 354
           A  L+AGLDLDCG +   +   AV  GK+A+AD+D +L     V MRLG FDG P    +
Sbjct: 176 AATLRAGLDLDCGPFLALYAGSAVAAGKVADADVDAALLNTVTVQMRLGMFDGDPAAGPF 235

Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGA------LPLNTGNIKTLALVGPHANA 408
             LG  ++C  +H +LA +AARQG+VLLKN  GA      LPL     + +A+VGPHA+A
Sbjct: 236 GRLGPADVCTREHQDLALDAARQGVVLLKNRRGARHNRDVLPLRPAAHRVVAVVGPHADA 295

Query: 409 TKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
           T AMIGNY G PCRYT+P+ G  AY+  + +  GC D+ C+ N  I AA++AA+ ADATV
Sbjct: 296 TVAMIGNYAGKPCRYTTPLQGVAAYAARVAHQAGCTDVACRGNQPIAAAVEAARQADATV 355

Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
           +VAGLD  VEAEG DR  LLLPG Q ELI+ VA A+KGPV LV+MS G +DI FA+N+P+
Sbjct: 356 VVAGLDQRVEAEGLDRTTLLLPGRQAELISAVAKASKGPVILVLMSGGPIDIAFAQNDPR 415

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNF 585
           I  ILWVGYPG+ GG+AIADVIFG +NPG +LP+TWY  +Y+ K+P T+M +R  P   +
Sbjct: 416 IDGILWVGYPGQAGGQAIADVIFGHHNPGAKLPVTWYHQDYLQKVPMTNMAMRANPARGY 475

Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL--DKDQQCRDINYTVGTNKP 643
           PGRTY+F+ GP +YPFG+GLSYTQF + +A +P  + ++L           +    T   
Sbjct: 476 PGRTYRFYTGPTIYPFGHGLSYTQFTHTLAHAPTQLTVRLSGSGHSAASAASLLNATLAR 535

Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI------AGTHIKQVIG 697
           P  AV +   +C+       ++V N+G  DG+  V+VY   P        A    +Q++ 
Sbjct: 536 PVRAVRVAHARCEGLTVPVHVDVSNVGDRDGAHAVLVYHAAPSPSHAAPGADAPARQLVA 595

Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQ 753
           +E+V + AG  A+V   +  C  L + D      +  G H +++GE    VS  ++
Sbjct: 596 FEKVHVPAGGVARVEMRIGVCDRLSVADRNGVRRVPVGEHRLMIGELTHSVSLGVE 651


>gi|90399376|emb|CAJ86207.1| B1011H02.4 [Oryza sativa Indica Group]
          Length = 738

 Score =  628 bits (1620), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 327/744 (43%), Positives = 453/744 (60%), Gaps = 60/744 (8%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S +P+C+A LP+P RA+ LV  +TL EK+ Q+ + A G PRLG+P +EWWSE+LHGV   
Sbjct: 36  SAYPFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSESLHGV--- 92

Query: 70  GRRTNSPPGTHFDS-EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
               ++ PG +F S  V  AT FP VIL+ A+FN SLW+   + ++ EARAM+N G AGL
Sbjct: 93  ---CDNGPGVNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAGL 149

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFW+PNINV RDPRWGR  ETPGEDP VV  Y++ YV+G Q   G E         + +S
Sbjct: 150 TFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGEEGR-------MMLS 202

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYDL+ W G  R+ F+++V                                NG
Sbjct: 203 ACCKHYIAYDLEKWRGFTRYTFNAKV--------------------------------NG 230

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           +P CA   +L Q  R +W F GYI SDCD++  I E+  +   + ED++A VLKAG+D++
Sbjct: 231 VPACARKDIL-QRARDEWGFQGYITSDCDAVAIIHENQTY-TASDEDSIAVVLKAGMDIN 288

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG +    T  A+++GK+ E DI+ +L  L+ V +RLG+FD + +   +  LG NN+C  
Sbjct: 289 CGSFLIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCTT 348

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H ELAAEA RQG VLLKNDNG LPL    +  +AL+GP AN    + G+Y G PC  T+
Sbjct: 349 EHRELAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHSTT 408

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
            + G  AY     +A GC D+ C +      AI+AAK AD  V++AGL+L+ E E  DRV
Sbjct: 409 FVKGMQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDRV 468

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            LLLPG Q +LI+ VA   K PV LV+M  G VD++FAK++P+I SILW+GYPGE GG  
Sbjct: 469 SLLLPGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGNV 528

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
           + +++FGKYNPGG+LPITWY  ++  +P   M +R      +PGRTY+F+ G VVY FGY
Sbjct: 529 LPEILFGKYNPGGKLPITWYPESFTAVPMDDMNMRADASRGYPGRTYRFYTGDVVYGFGY 588

Query: 604 GLSYTQFKYKVASSPKSVDIKLDK--DQQCRDINYTVGTNKPPCAAVLIDDV-KCKDYKF 660
           GLSY+++ Y +  +PK + +      D   R   Y   T +     V ++D+  C+  +F
Sbjct: 589 GLSYSKYSYSILQAPKKISLSRSSVPDLISRKPAY---TRRDGVDYVQVEDIASCEALQF 645

Query: 661 TFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
              I V N G MDGS  V+++ S  P   G+ IKQ++G+ERV  AAG+S  V  T++ CK
Sbjct: 646 PVHISVSNDGAMDGSHAVLLFASSKPSFPGSPIKQLVGFERVHTAAGRSTDVEITVDPCK 705

Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
            +   +     +L  G H ++VG+
Sbjct: 706 LMSFANTEGTRVLFLGTHVLMVGD 729


>gi|318136853|gb|ADV41671.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Actinidia deliciosa
           var. deliciosa]
          Length = 634

 Score =  623 bits (1607), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 313/637 (49%), Positives = 422/637 (66%), Gaps = 29/637 (4%)

Query: 116 EARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
           EARAMYN G AGLTFWSPN+N+ RDPRWGR  ETPGEDP + G YA +YVRGLQ      
Sbjct: 2   EARAMYNGGMAGLTFWSPNVNIFRDPRWGRGQETPGEDPMLAGNYAASYVRGLQG----- 56

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
               +D   LK++ACCKHY AYDLDNW G DRFHF++RV++QD+++TF +PF  CV  G 
Sbjct: 57  ----NDGERLKVAACCKHYTAYDLDNWRGVDRFHFNARVSKQDIKDTFEIPFRECVLGGK 112

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKED 295
           V+SVMCSYN+VNGIPTCA+PKLL  TIRG W  +GYIVSDCDS+    E+  + +   E+
Sbjct: 113 VASVMCSYNQVNGIPTCANPKLLKGTIRGSWRLNGYIVSDCDSVGVFFENQHYTSK-PEE 171

Query: 296 AVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--- 352
           AVA  +KAGLDLDCG +    T  AV++G +++ +I+ +L       MRLG FDG P   
Sbjct: 172 AVAAAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTAQMRLGMFDGEPSAH 231

Query: 353 QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAM 412
           QY NLG  ++C P H +LA EAARQGIVLL+N   +LPL+    +T+A++GP+++ T  M
Sbjct: 232 QYGNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTM 291

Query: 413 IGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG 472
           IGNY G  C YT+P+ G   Y++ I+ A GC D+ C  N +  AA  AA+ ADATV+V G
Sbjct: 292 IGNYAGVACGYTTPLQGIGRYTRTIHQA-GCTDVHCNGNQLFGAAEAAARQADATVLVMG 350

Query: 473 LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
           LD S+EAE  DR   LLPG Q EL+++VA A++GP  LV+MS G +D+ FAKN+P+I +I
Sbjct: 351 LDQSIEAEFVDRAGPLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAI 410

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRT 589
           +WVGYPG+ GG AIADV+FG  NPGG+LP+TWY  NYV  +P T M +R  P   +PGRT
Sbjct: 411 IWVGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRT 470

Query: 590 YKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRD---INYTVGTNKPPCA 646
           Y+F+ GPVV+PFG GLSYT F + +A  P  V + L   +   +   ++  V  +   C 
Sbjct: 471 YRFYRGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKAVRVSHADCN 530

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
           A+   DV          ++V+N G MDG+  ++V++ PP       KQ++G+ ++ IAAG
Sbjct: 531 ALSPLDV---------HVDVKNTGSMDGTHTLLVFTSPPDGKWAASKQLVGFHKIHIAAG 581

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
              +V   ++ CK L +VD      +  G H + +G+
Sbjct: 582 SETRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGD 618


>gi|125535275|gb|EAY81823.1| hypothetical protein OsI_36995 [Oryza sativa Indica Group]
          Length = 885

 Score =  621 bits (1601), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 325/663 (49%), Positives = 432/663 (65%), Gaps = 28/663 (4%)

Query: 111 QTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQD 170
           Q VS E RAMYN G AGLTFWSPN+N+ RDPRWGR  ETPGEDP V  RYA  YVRGLQ 
Sbjct: 227 QAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQ- 285

Query: 171 VEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMC 230
                  +   S  LK++ACCKH+ AYDLDNW G DRFHF++ VT QD+++TF +PF  C
Sbjct: 286 ------QQQPSSGRLKLAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSC 339

Query: 231 VNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLN 290
           V +G  +SVMCSYN+VNG+PTCAD   L  TIR  W   GYIVSDCDS+  +  S +   
Sbjct: 340 VVDGRAASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVD-VFYSDQHYT 398

Query: 291 DTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
            T+EDAVA  L+AGLDLDCG +   +T GAV QGK+ + DID ++     V MRLG FDG
Sbjct: 399 RTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFDG 458

Query: 351 SPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIK-TLALVGPHA 406
            P    + +LG  ++C   H ELA EAARQGIVLLKND  ALPL+    +  +A+VGPHA
Sbjct: 459 DPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPHA 518

Query: 407 NATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNAD 465
            AT AMIGNY G PCRYT+P+ G   Y+    + PGC D+ C  +   I AA+DAA+ AD
Sbjct: 519 EATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRAD 578

Query: 466 ATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
           AT++VAGLD  +EAEG DR  LLLPG Q ELI+ VA A+KGPV LV+MS G +DI FA+N
Sbjct: 579 ATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQN 638

Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PV 582
           +PKI  ILW GYPG+ GG+AIADVIFG +NPGG+LP+TWY  +Y+ K+P T+M +R  P 
Sbjct: 639 DPKIAGILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDYLQKVPMTNMAMRANPA 698

Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
             +PGRTY+F+ GP ++PFG+GLSYT F + +A +P  + ++L         + +   N 
Sbjct: 699 KGYPGRTYRFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLTVRLAAHHAAASASASASLNA 758

Query: 643 PP----CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY--------SKPPGIAGT 690
                  AAV +   +C++ +    ++V N+G+ DG+  V+VY        ++     G 
Sbjct: 759 TARLSRAAAVRVAHARCEELRMPVHVDVRNVGERDGAHTVLVYAAAPASSAAEAAAGHGA 818

Query: 691 HIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
            ++Q++ +E+V + AG +A+V   ++ C  L + D      +  G H +++GE    V+ 
Sbjct: 819 PVRQLVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGVRRIPVGEHRLIIGELTHTVTI 878

Query: 751 PLQ 753
            L+
Sbjct: 879 ALE 881



 Score =  112 bits (281), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 59/115 (51%), Positives = 71/115 (61%), Gaps = 6/115 (5%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           +  P+C   LP   RA+DLV RMT  EKV+ + + A GVPRLG+  YEWWSEALHGVS  
Sbjct: 39  ATLPFCRRSLPARARARDLVARMTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDT 98

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
           G      PG  F    PGAT+FP VI T ASFN +LW+ IGQ  S+ +     LG
Sbjct: 99  G------PGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQFRSSLSSMDKTLG 147


>gi|77552476|gb|ABA95273.1| Beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 883

 Score =  620 bits (1600), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 324/661 (49%), Positives = 432/661 (65%), Gaps = 26/661 (3%)

Query: 111 QTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQD 170
           Q VS E RAMYN G AGLTFWSPN+N+ RDPRWGR  ETPGEDP V  RYA  YVRGLQ 
Sbjct: 227 QAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQ- 285

Query: 171 VEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMC 230
                  +   S  LK++ACCKH+ AYDLDNW G DRFHF++ VT QD+++TF +PF  C
Sbjct: 286 ------QQQPSSGRLKLAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSC 339

Query: 231 VNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLN 290
           V +G  +SVMCSYN+VNG+PTCAD   L  TIR  W   GYIVSDCDS+  +  S +   
Sbjct: 340 VVDGRAASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVD-VFYSDQHYT 398

Query: 291 DTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
            T+EDAVA  L+AGLDLDCG +   +T GAV QGK+ + DID ++     V MRLG FDG
Sbjct: 399 RTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFDG 458

Query: 351 SPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIK-TLALVGPHA 406
            P    + +LG  ++C   H ELA EAARQGIVLLKND  ALPL+    +  +A+VGPHA
Sbjct: 459 DPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPHA 518

Query: 407 NATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNAD 465
            AT AMIGNY G PCRYT+P+ G   Y+    + PGC D+ C  +   I AA+DAA+ AD
Sbjct: 519 EATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRAD 578

Query: 466 ATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
           AT++VAGLD  +EAEG DR  LLLPG Q ELI+ VA A+KGPV LV+MS G +DI FA+N
Sbjct: 579 ATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQN 638

Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PV 582
           +PKI  ILW GYPG+ GG+AIADVIFG +NPGG+LP+TWY  +Y+ K+P T+M +R  P 
Sbjct: 639 DPKIAGILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDYLQKVPMTNMAMRANPA 698

Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
             +PGRTY+F+ GP ++PFG+GLSYT F + +A +P  + ++L         + ++    
Sbjct: 699 KGYPGRTYRFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLTVRLSAHHAAASASASLNATA 758

Query: 643 --PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY--------SKPPGIAGTHI 692
                AAV +   +C++ +    ++V N+G+ DG+  V+VY        ++     G  +
Sbjct: 759 RLSRAAAVRVAHARCEELRMPVHVDVRNVGERDGAHTVLVYAAAPASSAAEAAAGHGAPV 818

Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPL 752
           +Q++ +E+V + AG +A+V   ++ C  L + D      +  G H +++GE    V+  L
Sbjct: 819 RQLVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGVRRIPVGEHRLIIGELTHTVTIAL 878

Query: 753 Q 753
           +
Sbjct: 879 E 879



 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 58/115 (50%), Positives = 71/115 (61%), Gaps = 6/115 (5%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           +  P+C   LP   RA+DLV R+T  EKV+ + + A GVPRLG+  YEWWSEALHGVS  
Sbjct: 39  ATLPFCRRSLPARARARDLVARLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDT 98

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
           G      PG  F    PGAT+FP VI T ASFN +LW+ IGQ  S+ +     LG
Sbjct: 99  G------PGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQFRSSLSSMDKTLG 147


>gi|359473427|ref|XP_002265788.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Vitis vinifera]
          Length = 464

 Score =  603 bits (1556), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 281/451 (62%), Positives = 350/451 (77%), Gaps = 3/451 (0%)

Query: 120 MYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD 179
           MYNLG+AGLTFWSPNINVVRD RWGR  ET  EDP++VG +A+NYVRGLQDVEG E   D
Sbjct: 1   MYNLGHAGLTFWSPNINVVRDTRWGRTQETSREDPFMVGEFAVNYVRGLQDVEGTENVTD 60

Query: 180 SDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
            +SRPLK+S+CCKHYAAYD+D+W   DR  FD+RV+EQDM+ETF+ PFE CV EGDVSSV
Sbjct: 61  LNSRPLKVSSCCKHYAAYDIDSWLNIDRHTFDARVSEQDMKETFVSPFERCVREGDVSSV 120

Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
           MCS+N++NGIP C+DP+LL   IR +W+ HGYIVSDC  ++ IV++  +LND+K DAVA+
Sbjct: 121 MCSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAK 180

Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK 359
            L+AGLDL+CG YYT+     V  GK+++ ++D +L+ +Y++LMR+GYFDG P Y++LG 
Sbjct: 181 TLQAGLDLECGHYYTDALNELVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPAYESLGL 240

Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
            +IC   HIELA EAARQGIVLLKND    PL  G  K LALVGPHANAT+ MIGNY G 
Sbjct: 241 KDICAADHIELAREAARQGIVLLKNDYEVFPLKPG--KKLALVGPHANATEVMIGNYAGL 298

Query: 420 PCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
           P +Y SP++ F A   V  Y  GC D  C N++    A +AAK+A+ T+I  G DLS+EA
Sbjct: 299 PRKYVSPLEAFSAIGNV-TYTTGCLDASCSNDTYFSEAKEAAKSAEVTIIFVGTDLSIEA 357

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
           E  DRVD LLPG QTELI +VA+ + GPV LV++S   +DI FAKNNP+I +ILWVG+PG
Sbjct: 358 EFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPG 417

Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYV 570
           E+GG AIADV+FGKYNPGGRLP+TWYEA+YV
Sbjct: 418 EQGGHAIADVVFGKYNPGGRLPVTWYEADYV 448


>gi|222615852|gb|EEE51984.1| hypothetical protein OsJ_33664 [Oryza sativa Japonica Group]
          Length = 753

 Score =  603 bits (1556), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 320/754 (42%), Positives = 448/754 (59%), Gaps = 50/754 (6%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +CDA L   +RA DLV  +TL EKV Q+GD A GV RLG+P YEWWSE LHG+S  GR  
Sbjct: 31  FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGR-- 88

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTFWS 132
               G  F+  V   TSFP VILT A+F+  LW+++G+ V  EARA+YNLG A GLT WS
Sbjct: 89  ----GIRFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWS 144

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PN+N+ RDP   R    PG+      R    +  G Q + G            + SACCK
Sbjct: 145 PNVNIFRDPSGTR----PGD-----ARRGPRH--GEQGIGG------------EASACCK 181

Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           H  AYDLD W    R+++DS+VT QD+++T+  PF+ CV EG  + +MC YN +NG+P C
Sbjct: 182 HATAYDLDYWNNVVRYNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIMCGYNSINGVPAC 241

Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
           A   LL + +R +W  +GY+ SDCD++ TI ++H +   + ED VA  +K G+D++CG+Y
Sbjct: 242 ASSDLLTKKVRQEWGMNGYVASDCDAVATIRDAHHY-TLSPEDTVAVSIKVGMDVNCGNY 300

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQHI 368
                M AVQ+G + E DID +L  L+ V MRLG+FDG P+    Y +LG  ++C+P H 
Sbjct: 301 TQVHAMAAVQKGNLTEKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHK 360

Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
            LA EAA+ GIVLLKND GALPL    + +LA++GP+A+   A+ GNY G PC  T+P+ 
Sbjct: 361 SLALEAAQDGIVLLKNDAGALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQ 420

Query: 429 GFYAY-SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
           G   Y      +  GC    C  ++   AA   A ++D  V+  GL    E +G DR  L
Sbjct: 421 GIKGYLGDRARFLAGCDSPACAVDATNEAAA-LASSSDHVVLFMGLSQKQEQDGLDRTSL 479

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
           LLPG Q  LI  VA+AA+ PV LV+++ G VD+ FAK+NPKI +ILW GYPG+ GG AIA
Sbjct: 480 LLPGEQQGLITAVANAARRPVILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAGGLAIA 539

Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGL 605
            V+FG +NP GRLP+TWY   + K+P T M +R  P   +PGR+Y+F+ G  VY FGYGL
Sbjct: 540 KVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRFYQGNTVYNFGYGL 599

Query: 606 SYTQFKYKVASS---PKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYK 659
           SY++F  ++ SS     + ++ L      R      G +    ++ L+ ++   +C    
Sbjct: 600 SYSKFSRRMFSSFSTSNAGNLSLLAGVMAR----RAGDDGGGMSSYLVKEIGVERCSRLV 655

Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNAC 718
           F   +EV+N G MDG   V++Y + P  +G    +Q+IG+    +  G+ A V F ++ C
Sbjct: 656 FPAVVEVQNHGPMDGKHSVLMYLRWPTTSGGRPARQLIGFRSQHVKVGEKAMVSFEVSPC 715

Query: 719 KSLKIVDNAANSLLASGAHTILVGEGVGGVSFPL 752
           +    V      ++  GAH ++VG+     SF L
Sbjct: 716 EHFSWVGEDGERVIDGGAHFLMVGDEELETSFGL 749


>gi|356510699|ref|XP_003524073.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Glycine max]
          Length = 613

 Score =  603 bits (1554), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 304/554 (54%), Positives = 392/554 (70%), Gaps = 25/554 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           ++ + +CD  L    R KDLV R+TL EK+  + + A  V RLG+P YEWWSEALHGVS 
Sbjct: 40  VAGYGFCDKSLGVEARVKDLVGRLTLQEKIGNLVNSAGDVSRLGIPRYEWWSEALHGVSN 99

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G       GT F + VPGATSFP  ILT ASFN SL++ IG+ VSTEA AMYN+G AGL
Sbjct: 100 VGL------GTRFSNVVPGATSFPMPILTAASFNTSLFEVIGRVVSTEAGAMYNVGLAGL 153

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           T+WSPNIN+ RDPRWGR LETPGEDP +  +YA  YV+GLQ  +G       D   LK++
Sbjct: 154 TYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQQTDG------GDPNKLKVA 207

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYD+D W+G  R+ F++ +T+QD+++TF  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 208 ACCKHYTAYDVDKWKGIQRYTFNAVLTKQDLEDTFQPPFKSCVIDGNVASVMCSYNKVNG 267

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCADP LL   +RG+W  +GY+VSDCDS++ + +   +   T E+A A  + AGLDL+
Sbjct: 268 KPTCADPDLLKGVVRGEWKLNGYMVSDCDSVEVLYKYQHY-TKTPEEAAAISILAGLDLN 326

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG +   +T GAV+QG I E+ I+ ++   +  LMRLG+FDG P+   Y NLG  ++C P
Sbjct: 327 CGRFLGQYTEGAVKQGLIDES-INNAVSNNFATLMRLGFFDGDPRKQPYGNLGPKDVCTP 385

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            + ELA EAARQGIV LKN   +LPLN   IK+LA++GP+ANAT+ MIGNYEG PC+Y S
Sbjct: 386 ANQELAREAARQGIVSLKNSPASLPLNAKAIKSLAVIGPNANATRVMIGNYEGIPCKYIS 445

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAK---NADATVIVAGLDLSVEAEGK 482
           P+ G  A+    +YA GC D+ C N    P   DA K   + DATVIV G  L++EAE  
Sbjct: 446 PLQGLTAFVPT-SYAAGCLDVRCPN----PVLDDAKKISASGDATVIVVGASLAIEAESL 500

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
           DRV++LLPG Q  L+ +VA+A+KGPV LVIMS G +D++FAK+N KI SILWVGYPGE G
Sbjct: 501 DRVNILLPGQQQLLVTEVANASKGPVILVIMSGGGMDVSFAKDNNKITSILWVGYPGEAG 560

Query: 543 GRAIADVIFGKYNP 556
           G AIADVIFG +NP
Sbjct: 561 GAAIADVIFGFHNP 574


>gi|37359708|dbj|BAC98299.1| LEXYL2 [Solanum lycopersicum]
          Length = 633

 Score =  597 bits (1540), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 298/642 (46%), Positives = 423/642 (65%), Gaps = 26/642 (4%)

Query: 109 IGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGL 168
           IG+ VSTE RAMYN+G AGLT+WSPN+N+ RDPRWGR  ET GEDP +  RY + YV+GL
Sbjct: 2   IGKVVSTEGRAMYNVGQAGLTYWSPNVNIYRDPRWGRGQETAGEDPTLSSRYGVAYVKGL 61

Query: 169 QDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFE 228
           Q  +      D     LK+++CCKHY AYD+D+W+G  R++F+++VT+QD+ +TF  PF+
Sbjct: 62  QQRD------DGKKDMLKVASCCKHYTAYDVDDWKGIQRYNFNAKVTQQDLDDTFNPPFK 115

Query: 229 MCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKF 288
            CV +G+V+SVMCSYN+V+G PTC D  LL   IRG W  +GYIV+DCDS+  +  +  +
Sbjct: 116 SCVLDGNVASVMCSYNQVDGKPTCGDYDLLAGVIRGQWKLNGYIVTDCDSLNEMYWAQHY 175

Query: 289 LNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
              T E+  A  L AGL L+CG +   +T GAV QG + E+ ID ++   +  LMRLG+F
Sbjct: 176 -TKTPEETAALSLNAGLGLNCGSWLGKYTQGAVNQGLVNESVIDRAVTNNFATLMRLGFF 234

Query: 349 DGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPH 405
           DG+P+   Y NLG  +IC   H ELA EAARQGIVLLKN  G+LPL+  +IK+LA++GP+
Sbjct: 235 DGNPKNQLYGNLGPKDICTEDHQELAREAARQGIVLLKNTAGSLPLSPKSIKSLAVIGPN 294

Query: 406 ANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNAD 465
           AN    M+G+YEG+PC+YT+P+DG  A    + Y  GC DI C   + +  A   A  AD
Sbjct: 295 ANLAYTMVGSYEGSPCKYTTPLDGLGASVSTV-YQQGC-DIACAT-AQVDNAKKVAAAAD 351

Query: 466 ATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
           A V+V G D ++E E KDR ++ LPG Q+ L+ +VA  +KGPV LVIMS G +D+ FA +
Sbjct: 352 AVVLVMGSDQTIERESKDRFNITLPGQQSLLVTEVASVSKGPVILVIMSGGGMDVKFAVD 411

Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PV 582
           NPK+ SILWVG+PGE GG A+ADV+FG +NPGGRLP+TWY  +YV K+  T+M +R  P 
Sbjct: 412 NPKVTSILWVGFPGEAGGAALADVVFGYHNPGGRLPMTWYPQSYVDKVDMTNMNMRADPK 471

Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
             FPGR+Y+F+ GP V+ FG GLSYTQ+K+ +  +PK V I L++   CR          
Sbjct: 472 TGFPGRSYRFYKGPTVFNFGDGLSYTQYKHHLVKAPKFVSIPLEEGHACRSTK------- 524

Query: 643 PPCAAV-LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERV 701
             C ++  +++  C +      ++V+N+GKM GS  V++++ PP +     K ++ ++++
Sbjct: 525 --CKSIDAVNEQGCNNLGLDIHLKVQNVGKMRGSHTVLLFTSPPSVHNAPQKHLLDFQKI 582

Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
            +       V F ++ CK L +VD   N  +A G H + +G+
Sbjct: 583 HLTPQSEGVVKFNLDVCKHLSVVDEVGNRKVALGLHVLHIGD 624


>gi|357489437|ref|XP_003615006.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355516341|gb|AES97964.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 685

 Score =  588 bits (1517), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 306/677 (45%), Positives = 423/677 (62%), Gaps = 22/677 (3%)

Query: 78  GTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTFWSPNIN 136
           G   +  +P ATSFP VILT ASF+  LW +I + + TEAR +YN G A G+ FW+PNIN
Sbjct: 2   GIILNGSIPAATSFPQVILTAASFDPKLWYQISKVIGTEARGVYNAGQAQGMNFWAPNIN 61

Query: 137 VVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAA 196
           + RDPRWGR  ET GEDP V  +Y ++YVRGLQ  +  E  +    R LK SACCKH+ A
Sbjct: 62  IFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQG-DSFEGGKLIGGR-LKASACCKHFTA 119

Query: 197 YDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPK 256
           YDL+NW+G +R+ FD++VT QD+ +T+   F  CV +G  S +MC+YNRVNG+P CAD  
Sbjct: 120 YDLENWKGVNRYVFDAKVTLQDLADTYQPSFHSCVVQGRSSGIMCAYNRVNGVPNCADYN 179

Query: 257 LLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNF 316
           LL  T R  WNF+GYI SDCD+++ I E   +   T ED VA VL+AG+DL+CG+Y T  
Sbjct: 180 LLTNTARKKWNFNGYIASDCDAVRFIYEKQGYAK-TPEDVVADVLRAGMDLECGNYMTKH 238

Query: 317 TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQHIELAAE 373
              AV Q KI  + ID +L  L+ + +RLG FDG+P   QY  +G N +C+ ++++LA E
Sbjct: 239 AKSAVLQKKIPISQIDRALHNLFTIRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALE 298

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATK-AMIGNYEGTPCRYTSPMDGFYA 432
           AAR GIVLLKN    LPL    + TL ++GP+AN +   ++GNY G PC+  S + GFY 
Sbjct: 299 AARSGIVLLKNTASILPLP--RVNTLGVIGPNANKSSIVLLGNYIGPPCKNVSILKGFYT 356

Query: 433 YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGF 492
           Y+   +Y  GC D     ++ I  A++ AK +D  ++V GLD S E E  DR  L LPG 
Sbjct: 357 YASQTHYHSGCTDGTKCASAEIDRAVEVAKISDYVILVMGLDQSQETETLDRDHLELPGK 416

Query: 493 QTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFG 552
           Q +LIN VA A+K PV LV++  G VDI FAKNN KI  I+W GYPGE GGRA+A V+FG
Sbjct: 417 QQKLINSVAKASKKPVILVLLCGGPVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFG 476

Query: 553 KYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSYTQF 610
            YNPGGRLP+TWY  +++KIP T M +R  P + +PGRTY+F+ GP VY FGYGLSY+ +
Sbjct: 477 DYNPGGRLPMTWYPKDFIKIPMTDMRMRADPSSGYPGRTYRFYTGPKVYEFGYGLSYSNY 536

Query: 611 KYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYKFTFQIEVE 667
            Y        + +K +     +   Y++  N       L+ ++    CK    +  + + 
Sbjct: 537 SYNF------ISVKNNNLHINQSTTYSILENSETINYKLVSELGEETCKTMSISVTLGIT 590

Query: 668 NMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDN 726
           N G M G   V+++ KP  G  G  +KQ++G+E V +  G   +VGF ++ C+ L   + 
Sbjct: 591 NTGSMAGKHPVLLFVKPKKGRNGNPVKQLVGFESVTVEGGGKGEVGFEVSVCEHLSRANE 650

Query: 727 AANSLLASGAHTILVGE 743
           +   ++  G +  LVG+
Sbjct: 651 SGVKVIEEGGYLFLVGQ 667


>gi|326431595|gb|EGD77165.1| beta-glucosidase [Salpingoeca sp. ATCC 50818]
          Length = 900

 Score =  583 bits (1502), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 323/760 (42%), Positives = 453/760 (59%), Gaps = 57/760 (7%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            P+C+  L Y +R +DL+ R+   +    + + A GV  L LP Y+WWSEALHGV     
Sbjct: 182 LPFCNTALSYDDRIRDLISRINDSDLPGLLVNSATGVEHLNLPAYQWWSEALHGVGH--- 238

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
                PG HF  +VP ATSFP VI T A+FN++L++KIG  +STEARAM N+  AG TFW
Sbjct: 239 ----SPGVHFGGDVPAATSFPQVIHTGATFNKTLYRKIGTVISTEARAMNNVQRAGNTFW 294

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
           +PNIN++RDPRWGR  ETPGEDP+  G YA N+V G QD E + Y        +K S+CC
Sbjct: 295 APNINIIRDPRWGRGQETPGEDPFATGEYAANFVSGFQDGEDMNY--------IKASSCC 346

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KH+  Y+L+NW G DR H+++  T+QD+ +T++  FE CV  G  S +MCSYN VNG+P+
Sbjct: 347 KHFFDYNLENWHGVDRHHYNAIATDQDIADTYLPSFEACVRYGRASGLMCSYNAVNGVPS 406

Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
           CA+  ++    R  W F GYI SDC ++  ++ SHKF  +T E  +  VL+AG+D DCG 
Sbjct: 407 CANGDIMTVMARESWGFDGYITSDCGAVADVLNSHKFTRNTSE-TIRAVLEAGMDTDCGS 465

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICNPQHIE 369
           +   +   A+Q+G +    ++T+L  L++V  RLG FD      Y N     +  P + +
Sbjct: 466 FVQQYLAKAMQEGVVPRELVNTALHRLFMVQFRLGLFDPVSKQPYTNYSVARVNTPANQQ 525

Query: 370 LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG 429
           LA EAA+QGIVLLKN N  LPL TG    +AL+GP+A+AT  M GNY+GT     SP+ G
Sbjct: 526 LALEAAQQGIVLLKNTNARLPLKTG--LHVALIGPNADATTVMQGNYQGTAPFLISPVRG 583

Query: 430 FYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLL 489
           F  YS  + YA GC D+ C++ S   AA+ AAK ADA V+V GLD   E+EG DR  + L
Sbjct: 584 FKNYSAAVTYAKGC-DVACKDTSGFDAAVAAAKEADAVVVVVGLDQGQESEGHDRTSITL 642

Query: 490 PGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADV 549
           PG Q +L+ +VA AAK P+ + +M+ GAVD++  K N  +  ILW GYPG+ GG+A+ADV
Sbjct: 643 PGHQEDLVAQVAAAAKSPIVVFVMTGGAVDLSTIKANKNVAGILWCGYPGQSGGQAMADV 702

Query: 550 IFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGYGLS 606
           +FG  +PGGRLP T Y  +YV         +RP   +  PGRTY+F+ G  VY +G GLS
Sbjct: 703 VFGAVSPGGRLPYTIYPGSYVDACSMLDNGMRPNKTSGNPGRTYRFYTGKPVYEYGTGLS 762

Query: 607 YTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT----- 661
           YT F Y +     ++D  L   Q                    + D K +++KF      
Sbjct: 763 YTSFSYHI-HYLNTMDTSLATVQ------------------TYVQDAK-QNHKFIRYDAP 802

Query: 662 ----FQIEVENMGKMDGSEVVMVYSKP--PGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
                ++ V N+G++ G++VV V+ +P  P   G  IK +IG+ERVF+  GQ   V F++
Sbjct: 803 EFTRVEVNVTNVGRVAGADVVQVFVEPKTPAELGAPIKTLIGFERVFLNPGQWTIVQFSV 862

Query: 716 NACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLN 755
           NA   L  VD +   +  +G   + +G     ++FP+ +N
Sbjct: 863 NA-HDLTFVDASGKRVARAGEWLVHIGHD-SRLTFPVHVN 900


>gi|163889365|gb|ABY48135.1| beta-D-xylosidase [Medicago truncatula]
          Length = 776

 Score =  579 bits (1492), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 307/757 (40%), Positives = 439/757 (57%), Gaps = 50/757 (6%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S +P+C+  LP   R   L+  +TL +K+ Q+ + A  +  LG+P Y+WWSEALHG++  
Sbjct: 37  SHYPFCNISLPISTRTTSLISLLTLSDKINQLSNTASSISHLGIPSYQWWSEALHGIATN 96

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
           G      PG +F+  V  AT+FP VI++ A+FN SLW  IG  V  E RAM+N+G AGL+
Sbjct: 97  G------PGVNFNGSVKSATNFPQVIVSAAAFNRSLWFLIGYAVGVEGRAMFNVGQAGLS 150

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEY---HRDSDSRPLK 186
           FW+PN+NV RDPRWGR  ETPGEDP V   YA+ +VRG+Q V+G++      DSD   L 
Sbjct: 151 FWAPNVNVFRDPRWGRGQETPGEDPMVGSAYAVEFVRGIQGVDGIKKVLNDHDSDDDGLM 210

Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
           +SACCKH+ AYDL+ W    R++F++ V       T+  PF  CV +G  S +MCSYN V
Sbjct: 211 VSACCKHFTAYDLEKWGEFSRYNFNAVVN------TYQPPFRGCVQQGKASCLMCSYNEV 264

Query: 247 NGIPTCADPKLLNQTIRGDWNFHGY-IVSDCDSIQTIVESHKFLNDTKEDAVARVLKA-- 303
           NG+P CA   LL   +R  W F G  I+     +  +  S K + +  +  +   LK   
Sbjct: 265 NGVPACASKDLLG-LVRNKWGFEGVGILPQTVMLWLLFLSIKSMQNLPKMLLLMFLKQVF 323

Query: 304 ---------GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ- 353
                     +D++CG +    T  A++QG + E D+D +L  L+ V MRLG F+G P+ 
Sbjct: 324 FYVFENLWFCMDINCGTFMLRHTESAIEQGLVKEEDLDRALFNLFSVQMRLGLFNGDPEK 383

Query: 354 --YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKA 411
             +  LG  ++C P+H +LA EAARQGIVLLKNDN  LPL+  +  +LA++GP A  T  
Sbjct: 384 GKFGKLGPQDVCTPEHKKLALEAARQGIVLLKNDNKFLPLDKKDRVSLAIIGPMA-TTSE 442

Query: 412 MIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVA 471
           + G Y G PC   S  DG   Y K I+YA GC+D+ C ++     AID AK AD  VIVA
Sbjct: 443 LGGGYSGIPCSPRSLYDGLKEYVKTISYAFGCSDVKCDSDDGFAVAIDIAKQADFVVIVA 502

Query: 472 GLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKS 531
           GLD ++E E  DRV LLLPG Q +L+++VA A+K PV LV+   G +D++FA++N  I S
Sbjct: 503 GLDTTLETEDLDRVSLLLPGKQMDLVSRVAAASKRPVILVLTGGGPLDVSFAESNQLITS 562

Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRT 589
           ILW+GYP +             ++  GRLP+TWY  ++  +P   M +R  P   +PGRT
Sbjct: 563 ILWIGYPVD-------------FDAAGRLPMTWYPESFTNVPMNDMGMRADPSRGYPGRT 609

Query: 590 YKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI-KLDKDQQCRDINYTVGTNKPPCAAV 648
           Y+F+ G  +Y FG+GLSY+ F Y+V S+P  + + K       R +   V  +      V
Sbjct: 610 YRFYTGSRIYGFGHGLSYSDFSYRVLSAPSKLSLSKTTNGGLRRSLLNKVEKDVFEVDHV 669

Query: 649 LIDDVK-CKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAG 706
            +D+++ C    F+  I V N+G MDGS VVM++SK P  I G+   Q++G  R+   + 
Sbjct: 670 HVDELQNCNSLSFSVHISVMNVGDMDGSHVVMLFSKWPKNIQGSPESQLVGPSRLHTVSN 729

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
           +S +     + C+     D     +L  G H + VG+
Sbjct: 730 KSIETSILADPCEHFSFADEQGKRILPLGNHILNVGD 766


>gi|348667575|gb|EGZ07400.1| xylosidase [Phytophthora sojae]
          Length = 751

 Score =  568 bits (1464), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 304/748 (40%), Positives = 441/748 (58%), Gaps = 73/748 (9%)

Query: 8   KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
           K+S  P+CD  LP   R  DLV R+ L + V  + + A   P + +P YEWW+EALHGV+
Sbjct: 28  KVSSLPFCDGSLPIDARVSDLVNRIPLEQAVGLLVNKASAAPSVNVPSYEWWNEALHGVA 87

Query: 68  FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
                    PG  F   +  ATSFP V+ T ASFN +L+ +I + +STEARA YN  NAG
Sbjct: 88  L-------SPGVTFKGPLTAATSFPQVLSTAASFNRTLFYQIAEAISTEARAFYNEKNAG 140

Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS-DSRPLK 186
           LTFW+PN+N+ RDPRWGR  ETPGEDPY+ G YA+ +VRGLQ  E +E H +  D++ LK
Sbjct: 141 LTFWTPNVNIFRDPRWGRGQETPGEDPYLTGEYAVAFVRGLQG-EAMEGHENKDDNKFLK 199

Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
           IS+CCKH++AY     +   R   D+ VT+QD  +T+   FE CV  G VSS+MCSYN V
Sbjct: 200 ISSCCKHFSAYS----QEVPRHRNDAIVTKQDQADTYFPAFEDCVKRGHVSSIMCSYNAV 255

Query: 247 NGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD 306
           NGIP+CAD  LL   +R  W F GYI SDC+++  ++  H F   + E   A  L AG+D
Sbjct: 256 NGIPSCADKGLLTDLVRNQWKFDGYITSDCEAVADVIYRHHF-TQSPEQTCATTLDAGMD 314

Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD-GSPQYKNLGKNNICNP 365
           L+CG++       A++QG ++   +  +L+  + V+MRLG F+ G+  + N+ K+ +   
Sbjct: 315 LNCGEFLRQHLSSAIEQGIVSTEMVHNALKNQFRVMMRLGMFEKGTQPFSNITKDAVDTA 374

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIK---TLALVGPHANATKAMIGNYEGTPCR 422
            H +LA EAARQ +VLLKN++  LPL T       +LAL+GPH NA+ A++GNY G P  
Sbjct: 375 AHRQLALEAARQSVVLLKNEDNTLPLATDVFSKDGSLALIGPHFNASTALLGNYFGIPSH 434

Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSMIP---AAIDAAKNADATVIVAGLDLSVEA 479
             +P+ G  +Y   + Y+ GC      +  ++P    AI+  K AD  V+  GLD S E 
Sbjct: 435 IVTPLKGVSSYVPNVAYSLGCK----VSGEVLPDFDEAIEVVKKADRVVVFMGLDQSQER 490

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
           E  DR  L LPGFQ  L+N++  AA  P+ LV++S G+VD++  KN+PK+ +I++ GY G
Sbjct: 491 EEIDRYHLKLPGFQIALLNRILAAASHPIVLVLISGGSVDLSLYKNHPKVGAIVFGGYLG 550

Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRP--VNNFPGRTYKFFDGP 596
           + GG+A+AD++FGKY+P GRL  T+Y+++YV  +P   M +RP  V   PGRTY+FF G 
Sbjct: 551 QAGGQALADMLFGKYSPAGRLTQTFYDSDYVNTMPIYDMHMRPTFVTGNPGRTYRFFSGA 610

Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
            VY FG+GLSYT F                  + CR            C A         
Sbjct: 611 PVYEFGFGLSYTTFH-----------------KACRS-----------CVA--------- 633

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERV-FIAAGQSAKVGF 713
               +F+I V N+G ++G + +++Y++PP  G  G  ++ ++ +ER   +  G++A   F
Sbjct: 634 ----SFEITVTNLGDVEGEDAILIYAEPPHAGEGGRPLRSLVAFERTALVTTGKTATADF 689

Query: 714 TMNACKSLKIVDNAANSLLASGAHTILV 741
            + A K+  + +   + ++  G  TI V
Sbjct: 690 CLEA-KAFALANAEGSWVVEQGNWTIHV 716


>gi|326513064|dbj|BAK03439.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 694

 Score =  565 bits (1455), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 294/648 (45%), Positives = 422/648 (65%), Gaps = 40/648 (6%)

Query: 104 SLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAIN 163
           +L +K+G  V+ +  A+  LG     +WS               ETPGEDP +  +YA+ 
Sbjct: 70  TLAEKVGFLVNKQP-ALGRLGIPAYEWWS---------------ETPGEDPLLASKYAVG 113

Query: 164 YVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETF 223
           YV GLQD  G     D     LK++ACCKHY AYD+DNW+G +R+ FD++V++QD+ +TF
Sbjct: 114 YVTGLQDA-GAGGVTDG---ALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTF 169

Query: 224 ILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV 283
             PF+ CV +G+V+SVMCSYN+VNG PTCAD  LL   IRGDW  +GYIVSDCDS+  ++
Sbjct: 170 QPPFKSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVD-VL 228

Query: 284 ESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLM 343
            + +    T E+A A  +K+GLDL+CG++    T+ AVQ G+++E D+D ++   +I+LM
Sbjct: 229 YTQQHYTKTPEEAAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLM 288

Query: 344 RLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLA 400
           RLG+FDG P+   + +LG  ++C   + ELA E ARQGIVLLKN +GALPL+  +IK++A
Sbjct: 289 RLGFFDGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVLLKN-SGALPLSAKSIKSMA 347

Query: 401 LVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAID 459
           ++GP+ANA+  MIGNYEGTPC+YT+P+ G  A    + Y PGC ++ C  NS+ +  A+ 
Sbjct: 348 VIGPNANASFTMIGNYEGTPCKYTTPLQGLGAKVNTV-YQPGCTNVGCSGNSLQLSTAVA 406

Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
           AA +AD TV+V G D S+E E  DR  LLLPG QT+L++ VA+A+ GPV LV+MS G  D
Sbjct: 407 AAASADVTVLVVGADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFD 466

Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMP 578
           I+FAK + KI +ILWVGYPGE GG A+AD++FG +NP GRLP+TWY A+Y   +  T M 
Sbjct: 467 ISFAKASDKIAAILWVGYPGEAGGAALADILFGSHNPSGRLPVTWYPASYADTVTMTDMR 526

Query: 579 LRP--VNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS-VDIKLDKDQQCRDIN 635
           +RP     +PGRTY+F+ G  V+ FG GLSYT+  + + S+P S V ++L +D  CR   
Sbjct: 527 MRPDTSTGYPGRTYRFYTGDTVFAFGDGLSYTKMSHSLVSAPPSYVSMRLAEDHPCR--- 583

Query: 636 YTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQV 695
                    CA+V      C D  F  +++V N G++ G+  V+++S PP       K +
Sbjct: 584 ------AEECASVEAAGDHCDDLAFDVKLQVRNAGEVAGAHSVLLFSSPPPAHNAPAKHL 637

Query: 696 IGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
           +G+E+V +A G++  V F ++ C+ L +VD      +A G HT+ VG+
Sbjct: 638 LGFEKVSLAPGEAGTVAFRVDVCRDLSVVDELGGRKVALGGHTLHVGD 685



 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 24/53 (45%), Positives = 34/53 (64%)

Query: 9  LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSE 61
          L+ + +C+ K     RA+DLV R+TL EKV  + +    + RLG+P YEWWSE
Sbjct: 46 LAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYEWWSE 98


>gi|293336530|ref|NP_001167905.1| uncharacterized protein LOC100381616 [Zea mays]
 gi|223944757|gb|ACN26462.1| unknown [Zea mays]
          Length = 630

 Score =  564 bits (1453), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 281/633 (44%), Positives = 407/633 (64%), Gaps = 21/633 (3%)

Query: 120 MYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD 179
           M+N G AGLT+W+PNIN+ RDPRWGR  ET GEDP V   Y++ YV+G Q  EG E    
Sbjct: 1   MHNAGQAGLTYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEEGEEGR-- 58

Query: 180 SDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
                +++SACCKHY AYD++ WEG  R+ F+++V  QD+++T+  PF+ C+ E   S +
Sbjct: 59  -----IRLSACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCL 113

Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
           MC+YN+VNG+P CA   LL +T R +W F GYI SDCD++  I E+  +   + ED++A 
Sbjct: 114 MCAYNQVNGVPMCAHKDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTY-TKSGEDSIAI 171

Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD---GSPQYKN 356
           VLKAG+D++CG +    T  A+++GKI E DID +L  L+ V +RLG FD    +  +  
Sbjct: 172 VLKAGMDINCGSFLVRHTKSAIEKGKIQEEDIDRALFNLFSVQLRLGIFDKPSNNQWFSQ 231

Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
           LG N++C  +H ELAAEA RQG VLLKND+  LPL    ++ +A++GP AN   AM G+Y
Sbjct: 232 LGPNSVCTKEHRELAAEAVRQGAVLLKNDHNFLPLKRSEVRHVAIIGPSANDAYAMGGDY 291

Query: 417 EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
            G PC  T+ + G  AY+   ++APGC D  C +  +   A++AAK AD  V++AGL+L+
Sbjct: 292 TGVPCNPTTFLKGIQAYATQTSFAPGCKDASCNSTDLFGEAVEAAKRADIVVVIAGLNLT 351

Query: 477 VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
            E E  DRV LLLPG Q  LI+ +A  AK P+ LV++  G VD++FAK +P+I SILW+G
Sbjct: 352 EEREDFDRVSLLLPGKQMGLIHAIASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLG 411

Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFD 594
           YPGE GG+ + +++FG+YNPGG+LPITWY  ++  IP T M +R  P   +PGRTY+F+ 
Sbjct: 412 YPGEVGGQVLPEILFGEYNPGGKLPITWYPESFTAIPMTDMNMRADPSRGYPGRTYRFYT 471

Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQ--CRDINYTVGTNKPPCAAVLIDD 652
           G VVY FGYGLSY+++ Y ++S+PK + +    D     R   Y   T +    +V  +D
Sbjct: 472 GDVVYGFGYGLSYSKYSYSISSAPKKITVSRSSDLGIISRKPAY---TRRDGLGSVKTED 528

Query: 653 V-KCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAK 710
           +  C+   F+  + V N G MDGS  V+++++    + G  IKQ++G+E V  AAG ++ 
Sbjct: 529 IASCEALVFSVHVAVSNHGSMDGSHAVLLFARSKSSVPGFPIKQLVGFESVHTAAGSASN 588

Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
           V  T++ CK +   +     +L  GAH + VG+
Sbjct: 589 VEITVDPCKQMSAANPEGKRVLLLGAHVLTVGD 621


>gi|340370206|ref|XP_003383637.1| PREDICTED: probable beta-D-xylosidase 5-like [Amphimedon
           queenslandica]
          Length = 728

 Score =  556 bits (1432), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 302/745 (40%), Positives = 426/745 (57%), Gaps = 58/745 (7%)

Query: 6   KVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHG 65
           K   + + YCD     PER  DL+ RMT+ +K+ Q+   A  +P L +P Y+WWSE LHG
Sbjct: 23  KAPFNTYKYCDYTQSIPERVNDLLSRMTILDKIPQLITSAPAIPSLDIPAYQWWSEGLHG 82

Query: 66  VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
           V+         PG HF    P ATSFP VI   A+FN SL   + Q +STEARA  N G 
Sbjct: 83  VA-------GSPGVHFGGNFPNATSFPQVIGLGATFNMSLVLAMAQVISTEARAFANGGQ 135

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
           AGLT+++PNIN+ RDPRWGR  ETPGEDPY+  +YA N+V+G+Q  EG +     D+R L
Sbjct: 136 AGLTYFAPNINIFRDPRWGRGQETPGEDPYLSSQYAANFVKGMQ--EGAD-----DTRYL 188

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K  A CKHYAAYDL+N+    R  F++ V++QD +ET+   F  CV EG V S+MCSYN 
Sbjct: 189 KTIATCKHYAAYDLENYLNLSRHTFNAIVSDQDFEETYFPAFRSCVEEGKVGSIMCSYNA 248

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           VNG+P+CA+  + N+  RG W F GY+VSDC +I  I+ SHK+ ++T +D VA  L+ G 
Sbjct: 249 VNGVPSCANDFINNEVARGKWGFEGYVVSDCGAISDIINSHKYTSNT-DDTVAAGLRGGC 307

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNIC 363
           DL+CG +Y++    A   G I + DID ++  L+   MRLG FD      +++   + + 
Sbjct: 308 DLNCGHFYSDHAQAAYDNGAITDDDIDRAMTRLFTYRMRLGMFDPPSMQPFRDYTNDKVD 367

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
             QH  LA +A+R+ IVLL+N+   LPL+    + +ALVGPH  A  AM GNY+GT    
Sbjct: 368 TKQHEALALDASRESIVLLQNNKDILPLSLTTHRKIALVGPHGQAQGAMQGNYKGTAPYL 427

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAK--NADATVIVAGLDLSVEAEG 481
            SPM G       + +A GC  + C   +         +  + +A + V GLD S E+EG
Sbjct: 428 ISPMQGLQDLGLSVTFAAGCTQVACPTIAGFSEVTKLVEEHSIEAIIAVIGLDESQESEG 487

Query: 482 KDRVDLLLPGFQTELINKVADAAKG--PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
            DR  L LPG Q +L+  +   A    P  +V+MS G VD++  K+     +ILW GYPG
Sbjct: 488 HDRTSLTLPGQQVQLLEDIKKKAVPGIPFIVVVMSGGPVDLSGVKD--IADAILWAGYPG 545

Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVV 598
           + GG+AIA+VI+GK NP GRLP+T+Y A+Y+ +IPYT+M +R     PGR+YKF+ G  V
Sbjct: 546 QSGGQAIAEVIYGKVNPSGRLPVTFYPASYINEIPYTNMSMRVP---PGRSYKFYTGTPV 602

Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
           +PFG+GLSYT F+ K  + P    +K   D    D+NY                      
Sbjct: 603 FPFGFGLSYTTFEMKWKNPPNVTHLKTTHD---VDVNY---------------------- 637

Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
               ++ V N GK  GS  V+ Y     + G  +K++ G++++++   QS  + F     
Sbjct: 638 ----EVVVTNAGKRSGSVSVLAYITST-VPGAPMKELFGFQKIYLKPEQSMTLSFVAEP- 691

Query: 719 KSLKIVDNAANSLLASGAHTILVGE 743
           K    VD      +  G + I +G+
Sbjct: 692 KVFTTVDKHGERKIRPGTYKITIGD 716


>gi|167525174|ref|XP_001746922.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774702|gb|EDQ88329.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1620

 Score =  544 bits (1402), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 300/738 (40%), Positives = 429/738 (58%), Gaps = 60/738 (8%)

Query: 11   DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
            +FP+C+A L    R +D++ R+++ +KV    + A      GLP Y+WWSEALHGV F  
Sbjct: 923  NFPFCNASLDLDTRIRDVISRLSIQDKVALTANTAGAAADAGLPAYQWWSEALHGVGF-- 980

Query: 71   RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
                  PG  F  +V  ATSFP VI T+ASFN++LW  IG T+STEARAM N+  AGLTF
Sbjct: 981  -----SPGVTFMGKVQAATSFPQVIHTSASFNKTLWHHIGMTISTEARAMNNVNQAGLTF 1035

Query: 131  WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
            W+PNIN++RDPRWGR  ETPGEDPY  G YA N+V G+Q+ E        D+R +K S+C
Sbjct: 1036 WAPNINIIRDPRWGRGQETPGEDPYATGLYAANFVPGMQEGE--------DTRYIKASSC 1087

Query: 191  CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
            CKH+  Y+L++W   DR HF++  T+QD+ +T++  FE CV  G  SS+MCSYN VNG+P
Sbjct: 1088 CKHFFDYNLEDWHNVDRHHFNAIATDQDIADTYLPAFESCVRFGRASSLMCSYNAVNGVP 1147

Query: 251  TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
            +CA+  ++    R  W F GYI SDC +++ +  +HK+ N T    V  VL AG+D+DCG
Sbjct: 1148 SCANADIMTTLAREAWGFDGYITSDCGAVEDVYSNHKYYNTTGA-TVNGVLSAGMDVDCG 1206

Query: 311  DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHI 368
             + +     A+  G +  A +D +L  L+ V  RLG FD +    Y NL  + +  P+H 
Sbjct: 1207 SFLSQHLADAIDSGDVTNATVDQALYNLFRVQFRLGMFDPAEDQPYLNLTTDAVNTPEHQ 1266

Query: 369  ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
            +LA EAARQG+ LL+N +  LPL+  +IK LAL+GP+ANAT  M GNY G      SP  
Sbjct: 1267 QLALEAARQGMTLLENRDSRLPLDASSIKQLALIGPNANATGVMQGNYNGKAPFLISPQQ 1326

Query: 429  GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
            G   Y                N ++   A+ AAK AD  V+V GLD + E+EG DR  + 
Sbjct: 1327 GVQQY--------------VSNVALELGAVTAAKAADTVVMVIGLDQTQESEGHDREIIA 1372

Query: 489  LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
            LPG Q EL+ +VA+A+  P+ +V+M+ GAVD+   K+   +         G+ GG+A+A+
Sbjct: 1373 LPGMQAELVAQVANASSSPIVVVVMTGGAVDLTPVKDLDNV---------GQAGGQALAE 1423

Query: 549  VIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGYGL 605
             +FG  NPGGRLP T Y A+ V ++      +RP   +  PGRTY+F+ G  VY +G GL
Sbjct: 1424 TLFGDNNPGGRLPYTLYPADLVNQVSMFDDGMRPNATSGNPGRTYRFYTGTPVYAYGTGL 1483

Query: 606  SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIE 665
            SYT F Y+ ++    V       ++ R      G       + + D+V  +DY     + 
Sbjct: 1484 SYTSFSYETSTPSLRVSA-----ERVRAWVAARGQT-----SFIRDEVDAEDY---ITVT 1530

Query: 666  VENMGKMDGSEVVMVYSK--PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKI 723
            V+N G + G++VV V+ K   PG  G  IK + G+ERVF+  G++  + F +     L +
Sbjct: 1531 VQNNGTVAGADVVQVFIKTTTPGADGNPIKSLCGFERVFLKPGETTSIQFPVTP-HDLSV 1589

Query: 724  VDNAANSLLASGAHTILV 741
            V++    +   G  T+ V
Sbjct: 1590 VNSRGERVAVPGTWTVEV 1607


>gi|320170454|gb|EFW47353.1| beta-xylosidase [Capsaspora owczarzaki ATCC 30864]
          Length = 779

 Score =  539 bits (1388), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 303/751 (40%), Positives = 439/751 (58%), Gaps = 59/751 (7%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L + P+C+  L + +RA DLV R+TL EK+ Q G  A GV RLG+  YEWWSEALHGV+ 
Sbjct: 32  LRNLPFCNPNLAWEQRADDLVGRLTLQEKISQFGTTAPGVARLGVNAYEWWSEALHGVA- 90

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVI--------LTTASFNESLWKKIGQTVSTEARAM 120
                   PG +F    P +T FP +I           A+FN      + Q +STEARA 
Sbjct: 91  ------ESPGVNFTGNTPVSTCFPQIIGNNCSSLSRVGATFNLDSVAAMAQVISTEARAF 144

Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
            N G+AGLT+++PNIN+ RDPRWGR  ETPGEDPY+  RY    V+ LQ+ E        
Sbjct: 145 ANAGHAGLTYFTPNINIFRDPRWGRGQETPGEDPYLTSRYVETLVQNLQNGE-------- 196

Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
           D+R LK+ A CKHY AYD+++W G DRFHF++ V++QD+ ETF+ PFE CV  G  +S+M
Sbjct: 197 DARYLKVVATCKHYTAYDMEDWGGIDRFHFNAVVSDQDLVETFMPPFEACVRVGKGASLM 256

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
           CSYN VNGIP+CAD  + N+  R  W F GYIVSDC +I  I  +H + N T+    A +
Sbjct: 257 CSYNAVNGIPSCADDFINNEIAREQWGFDGYIVSDCGAIDCIQYTHNYTNTTQATCAAGI 316

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLG 358
            + G DLDCGD+Y +  M A+    + EAD+D SLR L+   +RLG FD +    Y+ + 
Sbjct: 317 -QGGCDLDCGDFYQSHLMDAIGNATLHEADLDFSLRRLFGHRIRLGEFDAASIQPYRQIP 375

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
            + I + +H ELA + AR+ IVLL NDN  LP +   ++ LA++GP+A+  + ++GNY G
Sbjct: 376 VSAINSQEHQELALQIARESIVLLGNDNNTLPFSLATVRKLAIIGPNADDAETLLGNYYG 435

Query: 419 TPCRYTSPMDGFYAYSKV--INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
                 +P+ GF        I +  GC D+   + S   AA  AAK ADAT++V GL+ +
Sbjct: 436 DAPYLITPLKGFQQLDPTLSITFVKGC-DVNSTDTSGFVAAAAAAKAADATIVVVGLNQT 494

Query: 477 VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
           VE+E  DR  L+LPG Q ELI  +  AA+GPV LV+MS   +D++   +   +++ LW+G
Sbjct: 495 VESENLDRTTLVLPGVQAELILALTAAARGPVILVVMSGSPIDLSNVIH--PVRAALWIG 552

Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDG 595
           YPG+ GGRA+A+ +FG ++P GRLP T Y A+YV ++P T+M +R     PGRTY+F+ G
Sbjct: 553 YPGQAGGRALAEAVFGVFSPAGRLPFTVYPADYVNQLPMTNMDMRAG---PGRTYRFYTG 609

Query: 596 PVVYPFGYGLSYTQFKYK--VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
             ++ FG+GLSY+ F+Y    +SS  S             +       + P  AV     
Sbjct: 610 TPLFEFGHGLSYSTFQYTWSNSSSSSSSSATSQHSLSTAALAAQHLAARAPVEAV----- 664

Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSK----------PPGIAGTHIKQVIGYERVFI 703
                  +F++ V+N GKM   +VV+ ++               A   I+ ++G+ R+ +
Sbjct: 665 -------SFRVLVQNTGKMASDDVVLAFASFNASSIIDQSSSQFASPPIRSLVGFRRIHL 717

Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLAS 734
           A G S ++ F + + +  ++    A +L+ S
Sbjct: 718 APGASQEIFFAVTSSQLAQVDSTGAQTLVPS 748


>gi|301110280|ref|XP_002904220.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
 gi|262096346|gb|EEY54398.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
          Length = 709

 Score =  539 bits (1388), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 296/730 (40%), Positives = 420/730 (57%), Gaps = 71/730 (9%)

Query: 24  RAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
           R+   + R+ L + V  + + A   P + +P YEWW+EALHGV+         PG  F  
Sbjct: 7   RSLHCLTRIPLDQAVGLLVNKAAPAPSVNIPSYEWWNEALHGVAL-------SPGVTFKG 59

Query: 84  EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRW 143
            +  ATSFP V+ T ASFN SL+ +I   +STEARA +N  +AGLTFW+PN+N+ RDPRW
Sbjct: 60  SITAATSFPQVLSTAASFNRSLFYQIADVISTEARAFHNAKDAGLTFWTPNVNIFRDPRW 119

Query: 144 GRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWE 203
           GR  ETPGEDPY+ G YA+ +VRGLQ  EG+E     +S+ LKIS+CCKH++AY     +
Sbjct: 120 GRGQETPGEDPYLTGEYAVAFVRGLQG-EGMEGREVENSKFLKISSCCKHFSAYS----Q 174

Query: 204 GNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIR 263
              R   ++ VT+QD  +T+   FE CV  G VSS+MCSYN VNGIP+CAD  LL   +R
Sbjct: 175 EVPRHRNNAMVTKQDQADTYFPAFEDCVKRGHVSSIMCSYNAVNGIPSCADKGLLTDLVR 234

Query: 264 GDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQ 323
           G W F GYI SDC+++  +++ H +   + E   A  L AG+DL+CG++       A++Q
Sbjct: 235 GQWKFDGYIASDCEAVADVIDHHHY-TQSPEQTCATTLDAGMDLNCGEFLRQHLPKALEQ 293

Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLK 383
           G +    I  +L+  + VLMRLG F+    + N+ K+++    H +LA EAARQ IVLLK
Sbjct: 294 GIVTTEMIHNALKNQFRVLMRLGMFEKVEPFANITKDSVDTTMHRQLALEAARQSIVLLK 353

Query: 384 NDNGALPLNTGNI---KTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYA 440
           ND   LPL T +    ++LAL+GPH NA+ A++GNY G P    +P++G   +   + ++
Sbjct: 354 NDGNTLPLATKDFTRDRSLALIGPHFNASAALLGNYFGIPSHIVTPLEGISQFVPNVAHS 413

Query: 441 PGCADIVCQNNSMIP---AAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELI 497
            GC      +  ++P    AI  AK AD  ++  GLD S E E  DR  + LP FQ+ L+
Sbjct: 414 LGCK----VSGEVLPDFDDAIAVAKKADRLIVFVGLDQSQEREEIDRYHIGLPAFQSTLL 469

Query: 498 NKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG 557
            +V + A  P+  V++S G VD++  KN+PK+ +I++ GY G+ GG+A+ADV+FGKYNP 
Sbjct: 470 KRVLEVASHPIVFVVISGGCVDLSAYKNHPKVGAIVFGGYLGQAGGQALADVLFGKYNPS 529

Query: 558 GRLPITWYEANYVK-IPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKV 614
           G+LP T+Y++ YV  +    M +R  PV    GRTY+FF G  VY FG+GLSYT F    
Sbjct: 530 GKLPQTFYDSEYVNAMSIYDMHMRPTPVTGNSGRTYRFFTGVPVYEFGFGLSYTTFH--- 586

Query: 615 ASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDG 674
                                     N   C A             TF I V N G + G
Sbjct: 587 -------------------------KNCHACVA-------------TFNITVTNAGAISG 608

Query: 675 SEVVMVYSKPP--GIAGTHIKQVIGYERV-FIAAGQSAKVGFTMNACKSLKIVDNAANSL 731
            +V++ Y +PP  G  G  +K ++ +ER   IAAGQ A     + A K+  + + A N +
Sbjct: 609 EDVILTYVEPPLAGEGGRPLKSLVAFERTPLIAAGQRATAKICLEA-KAFALANEAGNWV 667

Query: 732 LASGAHTILV 741
           +  G  TI V
Sbjct: 668 VEPGNWTIHV 677


>gi|300121549|emb|CBK22068.2| unnamed protein product [Blastocystis hominis]
          Length = 690

 Score =  538 bits (1386), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 304/732 (41%), Positives = 422/732 (57%), Gaps = 72/732 (9%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           ERA+ LV  +TL EK+  MG  A  V RL +P Y+WWSEALHGV+       + PG  F 
Sbjct: 3   ERARALVAELTLAEKMSLMGHTASEVKRLNIPKYQWWSEALHGVA-------ASPGVVFQ 55

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPR 142
              P AT+FP V LT  SF++ L+  I   +STEAR M N   A LT+WSPN+NV RDPR
Sbjct: 56  EPTPFATAFPQVALTAQSFDKPLFHDIASIISTEARVMNNAERANLTYWSPNVNVYRDPR 115

Query: 143 WGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNW 202
           WGR  ETPGEDP++V  YA+ +VRGLQ+ E        D R LK+SACCKHY+AYDL+NW
Sbjct: 116 WGRGQETPGEDPFLVATYAVEFVRGLQEGE--------DPRYLKVSACCKHYSAYDLENW 167

Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
            G +RF FD+ V+++DM +TF +PFE CV +G VSS+MCSYN +NGIP CAD +LL  T 
Sbjct: 168 HGVERFEFDAIVSDRDMTDTFQVPFEQCVKKGHVSSLMCSYNAINGIPACADRELLYGTA 227

Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQ 322
           RG W F GYI SDC +I TI+ +H + NDT   A+  V +A  DLDCG +Y    + +V+
Sbjct: 228 RGGWGFEGYITSDCGAIDTIIYNHHYTNDTDTTAMLGV-RATCDLDCGGFYQQHILHSVE 286

Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELAAEAARQGIV 380
            G++ EA++D +L  L+ V MRLG FD   Q  Y + G + +   +H  +A  AAR+GI 
Sbjct: 287 SGRLKEAEVDDALANLFKVQMRLGLFDPVEQQVYTHYGLDKLNTKEHQAMALRAAREGIA 346

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYA 440
           LLKN N  LPL+  + K + ++GP+A     M+GNY G P               ++  A
Sbjct: 347 LLKNQNDFLPLSLKD-KHVVVMGPYAEDAGVMLGNYNGIP-------------EFIVTVA 392

Query: 441 PGCADIVCQNNSMIPA--AIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELIN 498
            G  + VC +  ++ +  A+   +  D  V+  GL+  +E EG DR DLLLP  Q  L++
Sbjct: 393 QGLRN-VCDHVDVVKSLEALSKLEGVDLIVVTVGLNQEIEREGLDREDLLLPASQRALLD 451

Query: 499 KVADAAKGPVTLVIMSAG-AVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG 557
            +      PV L ++S G +VDI+  + N  +  +L VGY G  GG+AIA+VI G  NP 
Sbjct: 452 GLLAQTDVPVVLTLLSGGGSVDISAYEQNEHVVGVLAVGYGGMFGGQAIAEVIVGDVNPS 511

Query: 558 GRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGYGLSYTQFKYKV 614
           GRL  T Y  +YV  + Y  M +RP     FPGRTY+FF GPV++PFG+GLSYT F +  
Sbjct: 512 GRLVNTMYYNDYVTNLDYFDMNMRPKEETGFPGRTYRFFAGPVIHPFGFGLSYTTFAH-- 569

Query: 615 ASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDG 674
                +V+I   ++ + R             +A+ ID            ++V N G   G
Sbjct: 570 -----AVEIGQMRNHRLR-------------SALAID----------VYVKVTNTGSRQG 601

Query: 675 SEVVMVYSKPP--GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL 732
            E V+++ K P  G  G  +K +  + RV +A G++  V F +   + L + +  A  +L
Sbjct: 602 DESVLLFVKSPLAGKQGYPLKSLADFSRVSLAPGETQTVHFVLGE-EQLHLANEQAKYVL 660

Query: 733 ASGAHTILVGEG 744
             G   + V E 
Sbjct: 661 LRGEWKVEVEEA 672


>gi|340370204|ref|XP_003383636.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 755

 Score =  526 bits (1356), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 298/740 (40%), Positives = 425/740 (57%), Gaps = 61/740 (8%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           + YC+      ER KDL+ R+T+ EK+ Q    A  + RL +P Y+WWSE LHG++    
Sbjct: 56  YLYCNYSASITERVKDLLSRLTVLEKMSQTATNASAIERLDIPAYDWWSECLHGLA---- 111

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
                PG  F++++  ATSFP VI   A+FN SL   +GQ +STEARA  N G +GLTF+
Sbjct: 112 ---QSPGVFFENDLTSATSFPQVIGLGATFNMSLVLAMGQVISTEARAFANNGQSGLTFF 168

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
           +PNIN+ RDPRWGR  ETPGEDPY+  +YA N+V+G+Q  EG E     D R LK  A C
Sbjct: 169 APNINIYRDPRWGRGQETPGEDPYLTSQYAANFVKGIQ--EGSE-----DRRYLKAIATC 221

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KHYAAY+L+ +    R +F++ V++QD++ET++  F+ CV EG V S+MCSYN +NG+P 
Sbjct: 222 KHYAAYNLERYLDVRRVNFNAIVSDQDLEETYLPAFKACVQEGQVGSIMCSYNAINGVPN 281

Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
           CA+  + N+  R  W F GYIVSDC +I  I   H + +DT    VA  LK G DL+CG 
Sbjct: 282 CANDFINNKIARDTWGFEGYIVSDCGAILDIQYKHNYTSDTN-ITVADALKGGCDLNCGH 340

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHI 368
           +Y  +   A     I E DID SL  L+   MRLG FD  P+   ++     ++  P+  
Sbjct: 341 FYEKYMEDAFDNSTITEEDIDKSLTRLFTSRMRLGMFD-PPEIQPFRQYSVKDVNTPEAQ 399

Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
           +LA  AAR+GIVLL+N    LPL+      +A +GP+A+AT  M GNY G      SP+ 
Sbjct: 400 DLALNAAREGIVLLQNKGSVLPLDIVKHSNIAAIGPNADATHIMQGNYHGIAPYLISPLQ 459

Query: 429 GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
           GF        Y  GC  + C +    P A+ A +  DA + V GL+ + E E  DR  + 
Sbjct: 460 GFSNLGINATYQIGCP-VACNDTEGFPDAVKAVQGVDAVIAVIGLNNTQEGESHDRTSIA 518

Query: 489 LPGFQTELINKV-ADAAKG-PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           LPG Q +L+ ++  +AAKG P+ +V+MS G+VD+   K+     +ILW GYPG+ GG+AI
Sbjct: 519 LPGHQEDLLLELKKNAAKGTPLIVVVMSGGSVDLTGVKD--IADAILWAGYPGQSGGQAI 576

Query: 547 ADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGL 605
           A+VI+GK NP GRLP+T+Y A+Y+ +IPYT+M +R     PGR+YKF+ G  V+PFG+GL
Sbjct: 577 AEVIYGKVNPSGRLPVTFYPASYINEIPYTNMSMRVP---PGRSYKFYTGTPVFPFGFGL 633

Query: 606 SYTQF--KYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           SYT F  K+K  S+ K   +K   D+    +NY                          +
Sbjct: 634 SYTTFEIKWKDTSTAKDYYLKTTHDEV---VNY--------------------------E 664

Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKI 723
             V N G   GS  V+ +     + G  +K++  ++++++   +S  V F     K    
Sbjct: 665 ATVTNSGSRPGSVSVLAFIT-SSVPGAPMKELFAFKKIYLEPTESVDVSFVAEP-KVFTT 722

Query: 724 VDNAANSLLASGAHTILVGE 743
           VD      +  GA+ I++G+
Sbjct: 723 VDIYGIRKIRPGAYKIIIGD 742


>gi|326488213|dbj|BAJ89945.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 525

 Score =  521 bits (1342), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 263/493 (53%), Positives = 352/493 (71%), Gaps = 17/493 (3%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+ + +C+ K     RA+DLV R+TL EKV  + +    + RLG+P YEWWSEALHGVS+
Sbjct: 46  LAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYEWWSEALHGVSY 105

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           +G      PGT F   VPGATSFP  ILT ASFN SL++ IG+ VSTEARAM+N+G AGL
Sbjct: 106 VG------PGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNVGLAGL 159

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
           TFWSPNIN+ RDPRWGR  ETPGEDP +  +YA+ YV GLQD  G     D     LK++
Sbjct: 160 TFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQDA-GAGGVTDG---ALKVA 215

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           ACCKHY AYD+DNW+G +R+ FD++V++QD+ +TF  PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 216 ACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNG 275

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            PTCAD  LL   IRGDW  +GYIVSDCDS+  ++ + +    T E+A A  +K+GLDL+
Sbjct: 276 KPTCADKDLLEGVIRGDWKLNGYIVSDCDSVD-VLYTQQHYTKTPEEAAAITIKSGLDLN 334

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
           CG++    T+ AVQ G+++E D+D ++   +I+LMRLG+FDG P+   + +LG  ++C  
Sbjct: 335 CGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTS 394

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            + ELA E ARQGIVLLKN +GALPL+  +IK++A++GP+ANA+  MIGNYEGTPC+YT+
Sbjct: 395 SNRELARETARQGIVLLKN-SGALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTT 453

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           P+ G  A    + Y PGC ++ C  NS+ +  A+ AA +AD TV+V G D S+E E  DR
Sbjct: 454 PLQGLGAKVNTV-YQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDR 512

Query: 485 VDLLLPGFQTELI 497
             LLLPG QT+L+
Sbjct: 513 TSLLLPGQQTQLV 525


>gi|125576920|gb|EAZ18142.1| hypothetical protein OsJ_33692 [Oryza sativa Japonica Group]
          Length = 618

 Score =  514 bits (1325), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 253/624 (40%), Positives = 366/624 (58%), Gaps = 25/624 (4%)

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
            WSPN+N+ RDPRWGR  ETPGEDP    +Y   +V+GLQ          S    L+ SA
Sbjct: 1   MWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQG---------SSLTNLQTSA 51

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKH  AYD++ W+G  R++F+++VT QD+ +T+  PF  CV +G  S +MC+Y  +NG+
Sbjct: 52  CCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLINGV 111

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           P CA   LL +T+RG+W   GY  SDCD++  + +S  F   T E+AVA  LKAGLD++C
Sbjct: 112 PACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTR-TAEEAVAVALKAGLDINC 170

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNP 365
           G Y       A+QQGK+ E D+D +L+ L+ + MRLG+FDG P+    Y  L   ++C P
Sbjct: 171 GVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAADVCTP 230

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            H  LA EAAR+G+VLLKND   LPL    + + A++G +AN   A++GNY G PC  T+
Sbjct: 231 VHKALALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLPCETTT 290

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P  G   Y K   + PGC+   C + +    A   AK++D   +V GL    E EG DR 
Sbjct: 291 PFGGIQKYVKSAKFLPGCSSAAC-DVAATDQATALAKSSDYVFLVMGLSQKQEQEGLDRT 349

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            LLLPG Q  LI  VA A+K PV L++++ G VDI FA+ NPKI +ILW GYPG+ GG+A
Sbjct: 350 SLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQA 409

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
           IADV+FG++NP G+LP+TWY   + K   T M +R  P   +PGR+Y+F+ G  VY FGY
Sbjct: 410 IADVLFGEFNPSGKLPVTWYPEEFTKFTMTDMRMRPDPATGYPGRSYRFYKGKTVYKFGY 469

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYKF 660
           GLSY++F  ++ S   +         +         T     A   +D++   +C+  +F
Sbjct: 470 GLSYSKFACRIVSGAGNS----SSYGKAALAGLRAATTPEGDAVYRVDEIGDDRCERLRF 525

Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGI-AGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
              +EV+N G MDG   V+++ +      G  ++Q+IG+    +  G+  K+   ++ C+
Sbjct: 526 PVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLIGFRNQHLKVGEKKKLKMEISPCE 585

Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
            L         ++  G+H ++V E
Sbjct: 586 HLSRARVDGEKVIDRGSHFLMVEE 609


>gi|340370208|ref|XP_003383638.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 732

 Score =  511 bits (1317), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 292/758 (38%), Positives = 420/758 (55%), Gaps = 80/758 (10%)

Query: 6   KVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHG 65
           K K   F YC+  LP  +R KDL+ RMTL EK+ Q+G+ A  + RL +P Y+WWSE LHG
Sbjct: 26  KTKFQSFSYCNYSLPISDRVKDLLSRMTLAEKITQLGNTAGSIDRLDIPAYQWWSEGLHG 85

Query: 66  VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
           V+         PG HF+     ATSFP VI T +SFN++L+ +I   +STEARA     N
Sbjct: 86  VA-------DSPGVHFNGMFHNATSFPQVITTASSFNKTLYHEIAAVMSTEARA---FAN 135

Query: 126 AGLTFWSPNINVV--------RDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYH 177
            G+ ++  +  ++        RDPRWGR  ETPGEDPY+  +YAI +V G Q        
Sbjct: 136 QGIVYFKQHQQLLSNYLLFYCRDPRWGRAQETPGEDPYLNSQYAIQFVTGAQ-------- 187

Query: 178 RDSDSRPLKISACCKHYAAYDLDNW-EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDV 236
              DS+ LK+   CKH+A YDL+++ +G  R  F++++T QD +ET+   F+ CV E +V
Sbjct: 188 --GDSKYLKVVTTCKHFAGYDLEDYVDGETRHSFNAKITPQDFEETYYPAFKACVEEANV 245

Query: 237 SSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDA 296
           +S+MCSYN VNG+P+CAD ++ N+  R  W F G+I SDC +I  I   H + N+T +D 
Sbjct: 246 ASIMCSYNEVNGVPSCADGQINNKLARDTWGFDGFIASDCGAIDDIQNKHHYTNNT-DDT 304

Query: 297 VARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--- 353
           VA  LK G DL+CG YY +    A   G I   +I+ +L  L+   M+LG FD  P+   
Sbjct: 305 VAAALKGGCDLNCGSYYQSHAQSAFLNGTITIGEINLALTRLFTARMKLGMFD-PPELQP 363

Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
           Y  +  + + + +H  LA  AAR+ IVLL+N+N  LPLN     T+A+VGPHA AT  M 
Sbjct: 364 YNAISPDVVNSLEHQALALNAARESIVLLQNNNDVLPLNFEKHSTIAVVGPHAMATDVMQ 423

Query: 414 GNYEGTPCRYTSPMDGF--YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVA 471
           GNY G      SP++GF       V+  A GC D+ C+       A D A  ADA + V 
Sbjct: 424 GNYNGVAPYLISPVEGFENLGIDSVLT-ASGC-DVNCEVTDGFQDAFDIAVKADAVIAVL 481

Query: 472 GLDLSVEAEGKDRVDLLLPGFQTELINKVADAAK-----GPVTLVIMSAGAVDINFAKNN 526
           GLD S E+EG DR DL LP  Q + +  + +  K      P+ +V+MS  +VD+   K +
Sbjct: 482 GLDQSHESEGHDREDLFLPNLQDKFVQDLKNTLKAAGTNAPLIVVVMSGSSVDLTVTKKH 541

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNF 585
               +ILW GYPG+ GG+AIA++I+GK NP GRLP+T+Y  +Y+  + +  M +R    +
Sbjct: 542 A--DAILWAGYPGQSGGQAIAEIIYGKVNPSGRLPVTFYPGSYIDLVAFRHMSMR---EY 596

Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
           PGRTYKF++    + FG GLSYT F Y   S P ++          R ++Y         
Sbjct: 597 PGRTYKFYNDTPDFSFGDGLSYTTF-YLEWSKPVNM-------SGVRSVSYPT------- 641

Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAA 705
                           + + V N GKM G+  V+ Y      +G   K++ G+E+VF+  
Sbjct: 642 --------------VVYNVTVTNTGKMPGAISVLAYISYNN-SGAPKKKLFGFEKVFLNP 686

Query: 706 GQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
            QS  V F  ++ K+   VD +    +  G + + +G+
Sbjct: 687 LQSVSVTFPADS-KAFSTVDKSGKRSVNPGDYHVTIGD 723


>gi|147857580|emb|CAN78858.1| hypothetical protein VITISV_030325 [Vitis vinifera]
          Length = 699

 Score =  508 bits (1308), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 276/645 (42%), Positives = 373/645 (57%), Gaps = 89/645 (13%)

Query: 104 SLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAIN 163
           S + ++ + VSTEARAMYN+G AGLTFWSPN+N+ +DPRWGR  ETPGEDP +  +YA  
Sbjct: 128 SKFMRLRKVVSTEARAMYNVGLAGLTFWSPNVNIFQDPRWGRGQETPGEDPLLSSKYASG 187

Query: 164 YVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETF 223
           YVRGLQ  +      D     LK++ACCKHY AYDLDNW+G D FHF++ VT QDM +TF
Sbjct: 188 YVRGLQQSD------DGSPDRLKVAACCKHYTAYDLDNWKGVDCFHFNAVVTNQDMDDTF 241

Query: 224 ILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV 283
             PF+ CV +G+V+SV+                              YIVSDCDS+    
Sbjct: 242 QPPFKSCVIDGNVASVI------------------------------YIVSDCDSVDVFY 271

Query: 284 ESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLM 343
            S  +   T E+A A+ + AGLDL+CG +    T  AV+ G + E+ +D ++   +  LM
Sbjct: 272 NSQHY-TKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLM 330

Query: 344 RLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLA 400
           RLG+FDG+P    Y  LG  ++C  +H E A EA RQGIV                    
Sbjct: 331 RLGFFDGNPSKAIYGKLGPKDVCTSEHQERAREAPRQGIV-------------------- 370

Query: 401 LVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDA 460
                          + GTPC+YT+P+ G  A      Y PGC+++ C   + I  A   
Sbjct: 371 ---------------FAGTPCKYTTPLQGLTALVAT-TYLPGCSNVAC-GTAQIDEAKKI 413

Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
           A  ADATV++ G+D S+EAEG+DRV++ LPG Q  LI +VA  +KG V LV+MS G  DI
Sbjct: 414 AAAADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKXSKGNVILVVMSGGGFDI 473

Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPL 579
           +FAKN+ KI SI WVGYPGE GG AIADVIFG YNP G+LP+TWY  +YV K+P T+M +
Sbjct: 474 SFAKNDDKITSIQWVGYPGEAGGAAIADVIFGFYNPSGKLPMTWYPQSYVDKVPMTNMNM 533

Query: 580 R--PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
           R  P + +PGRTY+F+ G  +Y FG GLSYTQF + +  +PKSV I +++   C      
Sbjct: 534 RPDPASGYPGRTYRFYTGETIYTFGDGLSYTQFNHHLVQAPKSVSIPIEEAHSC------ 587

Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIG 697
              +   C +V      C++  F   + V N G + GS  V ++S PP +  +  K ++G
Sbjct: 588 ---HSSKCKSVDAVQESCQNLAFDIHLRVNNAGNISGSHTVFLFSSPPSVHNSPQKHLLG 644

Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +E+VF+ A   A V F ++ CK L IVD      +A G H + VG
Sbjct: 645 FEKVFVTAKAKALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVG 689


>gi|340377241|ref|XP_003387138.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 733

 Score =  503 bits (1296), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 293/740 (39%), Positives = 414/740 (55%), Gaps = 70/740 (9%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           YC+ +L + +R KDL+ R+TL EK+ Q+G+ A  + RLG+P Y+WWSE LHGV+      
Sbjct: 37  YCNYRLSFKDRVKDLLSRLTLEEKISQLGNSASAIDRLGIPGYQWWSEGLHGVAV----- 91

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
              PG H    +   TSFP +I T +SFN+SL+ +IG+ VSTEAR   + G  GLT+++P
Sbjct: 92  --SPGLHLGGNLTCTTSFPQIITTASSFNKSLFYEIGEAVSTEARGFADNGQGGLTYFTP 149

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+VRDPRWGR  ET GEDPY+  +YA+N VRG Q          +DS   KI A CKH
Sbjct: 150 NINIVRDPRWGRGQETAGEDPYLTSQYAVNLVRGAQ---------GNDSEYKKIIATCKH 200

Query: 194 YAAYDLDNW-EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           +AAYDL+++  G+ R  F++ VT+QD++ET+   F  CV  G V S+MCSYN VNG+P+C
Sbjct: 201 FAAYDLESYINGDVRDSFNAEVTKQDLEETYFPAFRSCVTAGGVGSIMCSYNSVNGVPSC 260

Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
            D    N+  R  W F GY+VSDC +I  ++  H + + T  D VA  LK G DL+CG +
Sbjct: 261 VDGVFNNKIARNKWKFDGYLVSDCGAIDDVMNKHHYTS-TPTDTVAAGLKGGTDLNCGSF 319

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN---ICNPQHIE 369
           Y    M A   G I E DID ++  L+   MRLG FD  P+Y+     N   +   QH +
Sbjct: 320 YQTHAMDAFLNGSITEVDIDRAVGRLFTARMRLGLFD-LPKYQPYSYFNTDVVNTKQHQD 378

Query: 370 LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG 429
           LA +AAR+ IVLL+N NG LPL+  +   +A+VGP+  A   M G  +       SP+DG
Sbjct: 379 LALQAARESIVLLQN-NGKLPLSYEDHHKIAVVGPNILANVTMQGISQVIAPYLISPVDG 437

Query: 430 FYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLL 489
           F +    + Y+ GC D+ C        A    K+A A V V GLD  +E E  DR D+ L
Sbjct: 438 FKSKGLHVTYSLGC-DVKCIVTDGFHDAFKLVKDAKAVVAVMGLDQGIERETVDREDIFL 496

Query: 490 PGFQTELINKVADAAKG-----PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
           PG Q + +  + D         P+ +VIMS  +VD++ +K+     +ILWVGYPG+ GG+
Sbjct: 497 PGLQDKFLLGLRDTLTNLQSPVPLIVVIMSGSSVDLSESKS--LADAILWVGYPGQSGGQ 554

Query: 545 AIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
           AIA+VI+G+ NP GRLP+T+Y   Y+  + Y  M +R     PGRTY+F+    V+PFG+
Sbjct: 555 AIAEVIYGEVNPSGRLPLTFYPGEYIDLVAYRHMSMREP---PGRTYRFYTENPVFPFGH 611

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           GLSYT F+    +   +V                          V+ D V   D    F 
Sbjct: 612 GLSYTTFELSWTNKMNNV-----------------------TEIVISDSV---DINIDFD 645

Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKI 723
           I V N G + G+  V+ Y     I    ++++  +++VFI   +S K+        SL  
Sbjct: 646 ITVVNTGYLSGAVSVLGYVS-SNIPDAPLRELFDFDKVFIDKYESKKI--------SLFA 696

Query: 724 VDNAANSLLASGAHTILVGE 743
            ++A  ++   G   IL GE
Sbjct: 697 TNDAFTTVDEKGRRNILPGE 716


>gi|452989371|gb|EME89126.1| glycoside hydrolase family 3 protein [Pseudocercospora fijiensis
           CIRAD86]
          Length = 790

 Score =  503 bits (1295), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 295/745 (39%), Positives = 406/745 (54%), Gaps = 60/745 (8%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD       RAK L+   TL EK+   G  + GVPRLGL  YEWW EALHGV+       
Sbjct: 39  CDTAADPLTRAKALIAEFTLAEKINNTGSTSPGVPRLGLLPYEWWQEALHGVA------- 91

Query: 75  SPPGTHFD--SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
           S PG +F    E   ATSFP  IL  A+F++ L   +   +STEARA  N   AGL FW+
Sbjct: 92  SSPGVNFSVSGEFRYATSFPQPILMGAAFDDQLIHDVASVISTEARAFSNDDRAGLDFWT 151

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PNIN  +DPRWGR  ETPGEDPY +  Y  + +RGLQ  +   Y         K+ A CK
Sbjct: 152 PNINPFKDPRWGRGQETPGEDPYHLSSYVHSLIRGLQG-DNPSYK--------KVVATCK 202

Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           H+ AYD++NW GN R+  D+ +  QD+ E ++ PF  C  + +V + MCSYN +NG+PTC
Sbjct: 203 HFVAYDVENWNGNFRYQLDAHINSQDLVEYYMPPFRSCARDSNVGAFMCSYNSLNGVPTC 262

Query: 253 ADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           ADP LL   +R  WN+     ++ SDCDS+Q +   H + + ++E+A A  LKAG D++C
Sbjct: 263 ADPYLLQTVLREHWNWTAEEQWVTSDCDSVQNVFLYHNYAS-SREEAAAISLKAGTDINC 321

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP-QYKNLGKNNICNPQHI 368
           G YY      A +QG I E D+DTSL   Y  L+RLGYFDG    Y+NL  N++  P   
Sbjct: 322 GTYYQEHLPRAYEQGLINETDVDTSLIRQYGSLIRLGYFDGDRVPYRNLTWNDVSTPYAQ 381

Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
           +LA +AA  GI LLKND G LPL   N   +AL+G  ANAT  M+GNY G P  + SP+ 
Sbjct: 382 DLALKAATSGITLLKND-GILPLQITNGTKIALIGDWANATDQMLGNYHGIPPYFHSPLW 440

Query: 429 GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
                   + Y  G                 AA  +D  + + G+D  VEAE KDRV + 
Sbjct: 441 AAQQTGAEVTYVQGPGGQSDPTTYTWRPIWSAANKSDVIIYIGGMDERVEAEEKDRVSIA 500

Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
             G Q ++I ++AD    P  +V M  G++D +    NP I+++LW GYPG++GG+AI D
Sbjct: 501 WSGPQLDVIGQLADYYDKPTIVVQMGGGSLDSSPLVKNPNIRALLWGGYPGQDGGKAIFD 560

Query: 549 VIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGYGL 605
           ++ G   P GRLPIT Y A+Y+ K+P T   LRP   +  PGRTY + +   V+ FGYGL
Sbjct: 561 ILQGISAPAGRLPITQYRADYISKVPMTDTSLRPNATSGSPGRTYIWLNEEPVFEFGYGL 620

Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIE 665
            YT F   +             D +  D  Y++ +    C    +D    K    TF I+
Sbjct: 621 HYTNFTATI------------PDAESSDTTYSIDSLASDCTESYLDRCPFK----TFSID 664

Query: 666 VENMGKMDGSEVVMVYSKPPGIAGTH------IKQVIGYERVF-IAAG--QSAKVGFTMN 716
           V N G +    V + +     + G H       K+++ Y+R+  I AG  Q+A +  T+ 
Sbjct: 665 VTNTGSVTSDYVTLGF-----LTGAHGPEPCPNKRLVSYQRLHNITAGSTQTAALNLTLG 719

Query: 717 ACKSLKIVDNAANSLLASGAHTILV 741
              SL  VD+  N++L  G++ +LV
Sbjct: 720 ---SLSRVDDKGNTVLFPGSYALLV 741


>gi|40363751|dbj|BAD06320.1| putative beta-xylosidase [Triticum aestivum]
          Length = 573

 Score =  503 bits (1295), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 257/579 (44%), Positives = 361/579 (62%), Gaps = 17/579 (2%)

Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
           +S  L+ SACCKH+ AYDL+NW+G  RF FD++VTEQD+ +T+  PF+ CV +G  S +M
Sbjct: 1   NSSDLEASACCKHFTAYDLENWKGVTRFAFDAKVTEQDLADTYNPPFKSCVEDGGASGIM 60

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
           CSYNRVNG+PTCAD  LL++T RGDW+F+GYI SDCD++  I +   +     EDAVA V
Sbjct: 61  CSYNRVNGVPTCADHNLLSKTARGDWSFNGYITSDCDAVAIIHDVQGYAK-APEDAVADV 119

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYK---NL 357
           LKAG+D++CG Y     + A QQGKI   DID +LR L+ + MRLG F+G+P+Y    N+
Sbjct: 120 LKAGMDVNCGGYIQTHGVSAYQQGKITGEDIDRALRNLFAIRMRLGLFNGNPKYNRYGNI 179

Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
           G + +C  +H +LA +AA+ GIVLLKND GALPL+   + ++A++GP+ N    ++GNY 
Sbjct: 180 GADQVCKKEHQDLALQAAQDGIVLLKNDAGALPLSKSKVSSVAVIGPNGNNASLLLGNYF 239

Query: 418 GTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSV 477
           G PC   +P      Y K   +  GC   VC N S I  A+ AA +AD  V+  GLD + 
Sbjct: 240 GPPCISVTPFQALQGYVKDATFVQGCNAAVC-NVSNIGEAVHAASSADYVVLFMGLDQNQ 298

Query: 478 EAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGY 537
           E E  DR++L LPG Q  L+NKVADAAK PV LV++  G VD+ FAKNNPKI +I+W GY
Sbjct: 299 EREEVDRLELGLPGMQESLVNKVADAAKKPVILVLLCGGPVDVTFAKNNPKIGAIVWAGY 358

Query: 538 PGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDG 595
           PG+ GG AIA V+FG++NPGGRLP+TWY   +  +P T M +R  P   +PGRTY+F+ G
Sbjct: 359 PGQAGGIAIAQVLFGEHNPGGRLPVTWYPKEFTAVPMTDMRMRADPSTGYPGRTYRFYKG 418

Query: 596 PVVYPFGYGLSYTQFKYKVASS---PKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
             VY FGYGLSY+++ ++ AS    P S+   ++  +       TV  +     A     
Sbjct: 419 KTVYNFGYGLSYSKYSHRFASEGTKPPSMS-GIEGLKATASAAGTVSYDVEEMGA----- 472

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKV 711
             C   +F   + V+N G MDG   V+++ + P    G    Q+IG++ V + A ++A V
Sbjct: 473 EACDRLRFPAVVRVQNHGPMDGRHPVLLFLRWPNATDGRPASQLIGFQSVHLRADEAAHV 532

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
            F ++ CK           ++  G+H + VG+    +SF
Sbjct: 533 EFEVSPCKHFSRAAEDGRKVIDQGSHFVKVGDDEFELSF 571


>gi|440799679|gb|ELR20723.1| betaxylosidase [Acanthamoeba castellanii str. Neff]
          Length = 748

 Score =  501 bits (1289), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 286/750 (38%), Positives = 414/750 (55%), Gaps = 100/750 (13%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L D P+C+  L   +R  DLV R+TL + + QMG  A  VP LG+P Y WW+E LHGV  
Sbjct: 10  LKDLPFCNTSLTAGQRTDDLVSRLTLDQLIGQMGHQAPAVPSLGIPAYNWWTECLHGV-L 68

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
               TN P            TSFP      A+FN  L  K+ + +S EARA+ N G  GL
Sbjct: 69  TKCGTNCP------------TSFPAPCALGAAFNMKLIHKMARAISNEARALNNEGIGGL 116

Query: 129 TFWSPNI-----------------------NVVRDPRWGRVLETPGEDPYVVGRYAINYV 165
            FW+PNI                       ++ RDPRWGR +E PGEDP++  +Y  +++
Sbjct: 117 DFWAPNIKYSTQPTNKTRQESQLRNAMVCISINRDPRWGRNMEVPGEDPFMTAQYVAHFM 176

Query: 166 RGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFIL 225
           RGLQ+ E        DSR  ++   CKH+AAY L+ W+  DRF FD+ V++ D  ET++ 
Sbjct: 177 RGLQEGE--------DSRYPQVVGTCKHFAAYSLEAWKDYDRFMFDAIVSDYDFVETYLP 228

Query: 226 PFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES 285
            F+ C+ EG   S+MCSYN VNG+P+CA+  LL   +R  W+F GY+VSDCD++ TI  +
Sbjct: 229 AFKGCIVEGRARSIMCSYNSVNGVPSCANDFLLRTILRDSWSFDGYVVSDCDAVDTIYNN 288

Query: 286 HKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRL 345
           H F   T E A A  L AG DL+CGD+Y      A  +G++ E ++  +++ L+   M L
Sbjct: 289 HHF-TKTPEGACAVALHAGTDLNCGDFYQKHLGKAHSEGRVTEDEVRLAVKRLFRQRMEL 347

Query: 346 GYFDGSPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG 403
           G +D   +  YK    + + + +H +LA +AAR+ +VLL+N  G LPL   +++ +A++G
Sbjct: 348 GMWDPPAEQPYKQYPPSVVGSREHSDLALQAARESMVLLQNRRGVLPLRK-SVRRVAVIG 406

Query: 404 PHANATKAMIGNYEGTPCR------YTSPMDGFYAY--SKVINYAPGCADIVCQNNSMIP 455
           P+ANAT+ M+GNY G+ C         SP     A     ++ Y  GC D+   N + IP
Sbjct: 407 PNANATETMLGNYYGSRCHDGTYDCIVSPYLAIKAKLPQALVTYNLGC-DVDSTNTTGIP 465

Query: 456 AAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSA 515
            A+ AA+ AD  ++V GL+ SVE+EGKDRV + LPG Q  LI  +  A   P  +V+M  
Sbjct: 466 EAVKAAQAADVAIVVLGLNTSVESEGKDRVAITLPGMQDHLIKSIV-ATNTPTVVVMMHG 524

Query: 516 GAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG----------GRLPITWY 565
           GAV I + K+  ++  I+   YPGE GG+AIADV+FG YNPG          GRLP+T  
Sbjct: 525 GAVAIEWIKD--QVDGIVDAFYPGENGGQAIADVLFGDYNPGDNKTDGTTLLGRLPVTVL 582

Query: 566 EANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPV-VYPFGYGLSYTQFKYKVASSPKSVDI 623
            ANYV  +P T+M +R   N PGRTY+++ GP  ++ FG+GLSYT FK +  S+P+   +
Sbjct: 583 PANYVDMVPLTNMSMRASGNNPGRTYRYYTGPAPLWEFGFGLSYTTFKTEWLSTPQPSAL 642

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
           K                               +D   +F++ V N+G + G EVV+ +  
Sbjct: 643 K----------------------------SYARDEAVSFRVRVTNVGPVAGDEVVLAFVT 674

Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
                   +KQ+  +ERV +  G+S ++ F
Sbjct: 675 RDNADRGPLKQLFAFERVHLNPGESKEIFF 704


>gi|407922988|gb|EKG16078.1| Glycoside hydrolase family 3 [Macrophomina phaseolina MS6]
          Length = 800

 Score =  500 bits (1287), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 295/747 (39%), Positives = 421/747 (56%), Gaps = 52/747 (6%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L D   CD+      RA  LV+ +TL EK+   G+ + GVPRLG+P Y+WW+EALHGV+F
Sbjct: 35  LKDNLVCDSSATPLARATALVKELTLEEKLNNTGNTSPGVPRLGIPEYQWWNEALHGVAF 94

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
                      +F S    ATSFP  IL  A+F++ L  ++   VSTEARA  N G +GL
Sbjct: 95  TYPGQPMTESGNFSS----ATSFPQPILMGAAFDDELIYEVASVVSTEARAYSNGGRSGL 150

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
            +W+PNIN  +DPRWGR  ETPGEDP+ +  Y  N +RGL+  +   Y         KI 
Sbjct: 151 DYWTPNINPYKDPRWGRGQETPGEDPFHLASYVQNLIRGLEGNQNDPYK--------KIV 202

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           A CKH+  YD++NW GN R+ FD+++  +DM E ++ PF+ C  E  V + MCSYN VNG
Sbjct: 203 ATCKHFTGYDMENWNGNFRYQFDAQINMRDMVEYYMPPFQACAREAKVGAFMCSYNAVNG 262

Query: 249 IPTCADPKLLNQTIRGDWNFH---GYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           +PTCADP LL   +R  W ++    ++VSDCD+IQ +   H++  +++E AVA  L AG 
Sbjct: 263 VPTCADPWLLQTVLREHWGWNQEDQWVVSDCDAIQNVYLPHEWA-ESREQAVADTLNAGT 321

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG--SPQYKNLGKNNIC 363
           DL+CG YY  +  GA +QG I +  +D +L   Y  L++LGYFD   S  Y+ +G  ++ 
Sbjct: 322 DLNCGTYYQRYLPGAYEQGLINDTTLDRALTRTYSSLIKLGYFDNADSQPYRQIGWQDVN 381

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
           +    ELA +AA++GIVLLKND G LPL+   + ++AL+G  ANAT+ M GNY G     
Sbjct: 382 SQHAQELALKAAQEGIVLLKND-GLLPLSLDGVSSIALIGSWANATEQMQGNYAGVAPYL 440

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIP---AAIDAAKNADATVIVAGLDLSVEAE 480
            SP+         +NYA G +    Q+N       A   AA+N+D  ++V G+D  +E+E
Sbjct: 441 HSPLYAAEQLGVKVNYAEGAS----QSNPTTDQWGAEYTAAENSDVIIVVGGIDNDIESE 496

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
             DRV +   G Q ++I K+A   K PV +V M AG +D     +N  I ++LW GYPG+
Sbjct: 497 ELDRVAIAWSGPQLDMITKLATYGK-PVIVVQMGAGQLDSTPLVSNANISALLWGGYPGQ 555

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
           +GG A+ D+I G   P GRLPIT Y A Y K +  T M LRP +   GRTYK+++G  V+
Sbjct: 556 DGGTALFDIITGAVAPAGRLPITQYPARYTKEVAMTDMSLRPSSTSAGRTYKWYNGTAVF 615

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           PFG+GL YT F   + S P S              ++ +      C+A   D  K     
Sbjct: 616 PFGFGLHYTNFSAAIPSPPAS--------------SFAISDLVASCSAN--DTSKLDLCP 659

Query: 660 FT-FQIEVENMGKMDGSEVVMVYSKPPGIAGTHIK-QVIGYERVF-IAAG--QSAKVGFT 714
           FT   +++ N G      V + +         H K  ++ Y+R+  IAAG  Q+A++  T
Sbjct: 660 FTSLAVDIANDGTRASDFVALAFLTGEFGPSPHPKSSLVAYQRLHAIAAGETQTARLNLT 719

Query: 715 MNACKSLKIVDNAANSLLASGAHTILV 741
           +    SL  VD   + LL  G +++L+
Sbjct: 720 LG---SLVRVDENGDKLLYPGDYSVLI 743


>gi|78482949|emb|CAJ41429.1| beta (1,4)-xylosidase [Populus tremula x Populus alba]
          Length = 732

 Score =  500 bits (1287), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 303/755 (40%), Positives = 413/755 (54%), Gaps = 85/755 (11%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
           D P+C   LP   R  DL+ RMTL EKV  + + A  VPRLG+  YEWWSEALHGVS +G
Sbjct: 38  DLPFCQVNLPIHTRVNDLIGRMTLQEKVGLLVNNAAAVPRLGIKGYEWWSEALHGVSNVG 97

Query: 71  RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
                 PGT F    P ATSFP VI T ASFN +LW+ IG+ VS EARAM+N G AGLT+
Sbjct: 98  ------PGTKFGGAFPVATSFPQVITTAASFNATLWEAIGRVVSDEARAMFNGGVAGLTY 151

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           WSPN+     PRWGR  ETPGEDP VVG+YA +YVRGLQ  +G+          LK++AC
Sbjct: 152 WSPNVTYSVYPRWGRGQETPGEDPVVVGKYAASYVRGLQGSDGIR---------LKVAAC 202

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH+ AYDLDNW G DRFHF+++V++QDM +TF +PF MCV EG V+SVMCSYN+VNGIP
Sbjct: 203 CKHFTAYDLDNWNGVDRFHFNAKVSKQDMVDTFDVPFRMCVKEGKVASVMCSYNQVNGIP 262

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
           TCADP LL +T+RG W  +GYIVSDCDS         F +  +        KAGLDLDCG
Sbjct: 263 TCADPNLLKKTVRGQWRLNGYIVSDCDSFGVYYGQQHFTSPRRSS--LGCYKAGLDLDCG 320

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP-QYKNLGKNNICNPQHIE 369
            +       AV++    EA+I+ +        + LG FDGSP Q        +  P + +
Sbjct: 321 PFLVTHR-DAVKKAA-EEAEINNAWLKTLTFQISLGIFDGSPLQAVGDVVPTMGPPTNQD 378

Query: 370 LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHA--NATKAMIGNYEGTPCRYTSPM 427
           LA  A ++ + + KN    L           + GP A   +   M+GNYEG PC+Y  P+
Sbjct: 379 LAVNAPKR-LFIFKNRAFLL------YSPRHIFGPVALFKSLPFMLGNYEGLPCKYLFPL 431

Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
            G   +  ++ Y PGC++++C     + +A+D A +ADA V+V G D S+E EG DRVD 
Sbjct: 432 QGLAGFVSLL-YLPGCSNVICAVAD-VGSAVDLAASADAVVLVVGADQSIEREGHDRVDF 489

Query: 488 LLPGFQTELINKVADAAKGPVTLVIM----SAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
            LPG Q EL+ +VA AAKGPV LVIM    S G    N                  +  G
Sbjct: 490 YLPGKQQELVTRVAMAAKGPVLLVIMDLAISGGGCSYN------------------QVNG 531

Query: 544 RAIADVIFGK-------YNPGGRLP-ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
             I+DV  G         N  G +P I++  A +  + +T +   P  ++  + +KF   
Sbjct: 532 IPISDVCEGSSYRWPSFSNCHGYMPWISYSRAIWETLRFTKVNWVPTWSW-NKLHKF--- 587

Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
                   G  +++       +P+     L      R  N+  G         +ID +  
Sbjct: 588 --------GSHHSKCTDDGFGTPRRPPPWL------RKCNHFQGRQSELHMLDVIDSL-- 631

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
                  Q++V+N G MDG+  ++VY +PP       KQ++ +E+V +AAG   +VG  +
Sbjct: 632 ----LGMQVDVKNTGSMDGTHTLLVYFRPPARHWAPHKQLVAFEKVHVAAGTQQRVGINI 687

Query: 716 NACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
           + CKSL +VD +    +  G H++ +G+    VS 
Sbjct: 688 HVCKSLSVVDGSGIRRIPMGEHSLHIGDVKHSVSL 722


>gi|452846807|gb|EME48739.1| glycoside hydrolase family 3 protein [Dothistroma septosporum
           NZE10]
          Length = 802

 Score =  494 bits (1272), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 287/749 (38%), Positives = 403/749 (53%), Gaps = 46/749 (6%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L D   CD       RA  L+   TL EK+   G  + GVPRLGLP Y WW EALHGV+ 
Sbjct: 33  LKDNTVCDTTADPLTRATALINAFTLQEKLNNTGSTSPGVPRLGLPAYTWWQEALHGVA- 91

Query: 69  IGRRTNSPPGTHFDSEVPG--ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
                 S PG +F    P   ATSFP  IL  A+F++ L + +   +STEARA  N   A
Sbjct: 92  ------SSPGVNFSDSGPFRYATSFPQPILMGAAFDDDLIRDVATVISTEARAFNNDKRA 145

Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK 186
           GL FW+PNIN  +D RWGR  ETPGEDPY +  Y    + GLQ     +Y R        
Sbjct: 146 GLDFWTPNINPFKDSRWGRGQETPGEDPYHLSSYVAALIEGLQGSPDDKYKR-------- 197

Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
           + A CKH+ AYD+++W GN R+ FD++V+ QD+ E ++ PF+ C  + +V + MCSYN +
Sbjct: 198 VVATCKHFVAYDMESWNGNFRYQFDAQVSSQDLVEYYMPPFQQCARDSNVGAFMCSYNAL 257

Query: 247 NGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NG+PTCADP LL   +R  WN+     ++ SDCD++Q +   H + + T+E+A A  LKA
Sbjct: 258 NGVPTCADPWLLQTVLREKWNWTSEQQWVTSDCDAVQNVFLPHDYAS-TREEAAALSLKA 316

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNI 362
           G D++CG YY +    A  QG I   D+D SL   Y  L+RLGYFDG +  Y+NL  N++
Sbjct: 317 GTDINCGTYYQDHLPAAYDQGLINTTDLDISLIRQYSSLVRLGYFDGLAVPYRNLTWNDV 376

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
             P   +LA +AA +GI LLKND G LPL   N  ++AL+G  ANAT  M+GNY+G P  
Sbjct: 377 STPHAQQLAYKAAAEGITLLKND-GVLPLTISNGTSIALIGDWANATDQMLGNYDGIPPF 435

Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
           + SP+         +N+A G                 AA  +D  +   G+D SVE+EG 
Sbjct: 436 FHSPLYAAQQTGATVNFATGPGGQGDPTTDHWLPVWAAANKSDVIIYAGGIDNSVESEGM 495

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
           DRV L   G Q ++I ++A   K PV ++ M  G +D +   NNP + +++W GYPG++G
Sbjct: 496 DRVSLTWTGAQLDMIGQLAMYGK-PVIVLQMGGGQIDSSPLVNNPNVSALIWGGYPGQDG 554

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVY 599
           G A+ D+I G   P GRLP T Y A Y+ ++P T M LRP      PGRTY +++   V+
Sbjct: 555 GVALFDIIRGITAPAGRLPTTQYPAKYISQVPMTDMTLRPNSTTGSPGRTYIWYNENAVF 614

Query: 600 PFGYGLSYTQFKYKVASS-PKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
           P+G GL YT F   +  S P + D             Y + T    C A   D       
Sbjct: 615 PYGLGLHYTNFTAAIKPSFPSTYDSSSSNSGSAS---YDISTLTSNCTATYKDLCPFT-- 669

Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH------IKQVIGYERVFIAAGQSAKVG 712
             +F + + N G++    V + +     +AG H       K+++ Y+R+      S++  
Sbjct: 670 --SFSVSITNTGEIMSDYVTLGF-----LAGIHGPAPHPNKRLVSYQRLHNITAGSSQTA 722

Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILV 741
           +      SL  VD   N +L  G + +LV
Sbjct: 723 WLNLTLGSLARVDEMGNKVLYPGDYALLV 751


>gi|125576923|gb|EAZ18145.1| hypothetical protein OsJ_33695 [Oryza sativa Japonica Group]
          Length = 591

 Score =  493 bits (1268), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 245/593 (41%), Positives = 355/593 (59%), Gaps = 20/593 (3%)

Query: 156 VVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVT 215
           +  +YA+ +V+G+Q          + S  L+ SACCKH  AYDL++W G  R++F+++VT
Sbjct: 1   MASKYAVAFVKGMQG---------NSSAILQTSACCKHVTAYDLEDWNGVQRYNFNAKVT 51

Query: 216 EQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSD 275
            QD+++T+  PF  CV +   + +MC+Y  +NG+P CA+  LL +T+RGDW   GYI SD
Sbjct: 52  AQDLEDTYNPPFRSCVVDAKATCIMCAYTGINGVPACANADLLTKTVRGDWGLDGYIASD 111

Query: 276 CDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSL 335
           CD++  + ++ ++   T EDAVA  LKAGLD++CG Y       A+QQGK+ E DID +L
Sbjct: 112 CDAVAIMRDAQRY-TQTPEDAVAVALKAGLDMNCGTYMQQHATAAIQQGKLTEEDIDKAL 170

Query: 336 RFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPL 391
           + L+ + MRLG+FDG P+    Y  LG  +IC P+H  LA EAA  GIVLLKND G LPL
Sbjct: 171 KNLFAIRMRLGHFDGDPRSNSVYGGLGAADICTPEHRSLALEAAMDGIVLLKNDAGILPL 230

Query: 392 NTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNN 451
           +   + + A++GP+AN   A+IGNY G PC  T+P++G   Y K + +  GC    C   
Sbjct: 231 DRTAVASAAVIGPNANDGLALIGNYFGPPCESTTPLNGILGYIKNVRFLAGCNSAACDVA 290

Query: 452 SMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLV 511
           +   AA  A+ ++D   +  GL    E+EG+DR  LLLPG Q  LI  VADAAK PV LV
Sbjct: 291 ATDQAAAVAS-SSDYVFLFMGLSQKQESEGRDRTSLLLPGEQQSLITAVADAAKRPVILV 349

Query: 512 IMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK 571
           +++ G VD+ FA+ NPKI +ILW GYPG+ GG AIA V+FG +NPGGRLP+TWY   + K
Sbjct: 350 LLTGGPVDVTFAQTNPKIGAILWAGYPGQAGGLAIARVLFGDHNPGGRLPVTWYPEEFTK 409

Query: 572 IPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQ 629
           +P T M +R  P   +PGR+Y+F+ G  VY FGYGLSY+ +  ++ S  K  +   +   
Sbjct: 410 VPMTDMRMRADPATGYPGRSYRFYQGKTVYKFGYGLSYSSYSRQLVSGGKPAESYTNLLA 469

Query: 630 QCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG 689
             R    + G        +  D   C+  KF   +EV+N G MDG   V++Y + P   G
Sbjct: 470 SLRTTTTSEGDESYHIEEIGTDG--CEQLKFPAVVEVQNHGPMDGKHSVLMYLRWPNAKG 527

Query: 690 TH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
                Q+IG+    +  G+ A + F ++ C+    V      ++  G+H ++V
Sbjct: 528 GRPTTQLIGFRSQHLKVGEKANIRFDISPCEHFSRVRKDGKKVIDRGSHYLMV 580


>gi|344303941|gb|EGW34190.1| hypothetical protein SPAPADRAFT_65353 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 788

 Score =  492 bits (1266), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 290/736 (39%), Positives = 418/736 (56%), Gaps = 43/736 (5%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           C+  LP  +RAK +V+  T+ E +  MG+ + GV RLGLP Y+WWSEALHG   I R   
Sbjct: 61  CNPHLPTEQRAKAVVDLFTVDELIANMGNTSPGVERLGLPPYQWWSEALHG---IARSNF 117

Query: 75  SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
           +  G     E   ATSFP  IL   +FN  L+K++G  + TEARA  N+G AGL F+SPN
Sbjct: 118 TASG-----EYSHATSFPQPILMGGAFNNDLYKQVGNVIGTEARAFNNVGRAGLDFYSPN 172

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           IN  RD RWGR  E   E P +VG YA+NYV+GLQ   G++ +++ D+  L+++A CKH+
Sbjct: 173 INPFRDARWGRGQEVASESPVLVGNYALNYVQGLQG--GLDSNQNDDT--LQVAATCKHF 228

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
             YD+++W  + R  +++ +++QD+ + ++  F+ CV +   +  MCSYN VNG+P CA 
Sbjct: 229 VGYDMESWNQHSRLGYNAIISDQDLADFYLPTFQSCVRDAKAAGAMCSYNAVNGVPACAS 288

Query: 255 PKLLNQTIRGDWNFH-GYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              LN  +R  ++F  G I SDCD+I  +   H +  D    A A  +KAG+D++CGD Y
Sbjct: 289 EFFLNTVLRDGFDFQNGVIHSDCDAIYNVWNPHLYAQDLG-GAAADAIKAGVDVNCGDTY 347

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIEL 370
            N    A+    I E  I TS+   Y  L+RLGYFD SPQ   Y+    N++  PQ  +L
Sbjct: 348 QNNLGYALGNKTINENQIRTSVTRQYSNLIRLGYFD-SPQTNKYRKYDWNDVSTPQANQL 406

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A +AA +GI LLKND G LP N   ++ +A++GP ANAT  M+G+Y GTP    SP+ G 
Sbjct: 407 AYQAAVEGIALLKND-GTLPFNKQKVRKVAVIGPWANATTQMLGDYAGTPPYMISPLQGA 465

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
            +    + YA G   I   + S   AA++AAK ADA V   G+D SVE E  DR  L  P
Sbjct: 466 QSEGFQVEYALGT-QINTTDTSGYTAALNAAKGADAIVYFGGIDNSVENEALDRESLAWP 524

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
           G Q +L++K++   K P+ ++    G +D    KNN  + +I++ GYPG+ GG AI D++
Sbjct: 525 GNQLDLVSKLS-GLKKPLVVLQFGGGQIDDTEIKNNKNVNAIVYAGYPGQSGGTAIWDIL 583

Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
            GKY P GRL  T Y A+Y  ++P T M LRP   +PGRT+ +++G  VY FGYGL YT 
Sbjct: 584 SGKYAPAGRLTTTQYPASYADQVPMTDMTLRPRQGYPGRTFMWYNGEPVYEFGYGLHYTT 643

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
           F   +A++P+          Q  +I   V   K    +  +D         TF + ++N 
Sbjct: 644 FSASLANAPRG-------GHQSFNIEQVVAAAK---RSQYVDTGLIT----TFDVNIKNT 689

Query: 670 GKMDGSEVVMVYSKPPGIAGTHIKQV-IGYERVF-IAAG--QSAKVGFTMNACKSLKIVD 725
           GK       ++YSK     G H  ++ + ++++  I AG  Q+AK+  T+    SL   D
Sbjct: 690 GKTTSDYAALLYSKTTAGPGPHPNKILVSFDKLHQIHAGQTQTAKLPVTIG---SLLQTD 746

Query: 726 NAANSLLASGAHTILV 741
              N  L  G +T  V
Sbjct: 747 TNGNKWLYPGTYTFFV 762


>gi|393247584|gb|EJD55091.1| beta-xylosidase [Auricularia delicata TFB-10046 SS5]
          Length = 763

 Score =  489 bits (1260), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 286/744 (38%), Positives = 411/744 (55%), Gaps = 50/744 (6%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L D   C+    + +RAK L++  T  E V    + + GVPRLGLP Y+WWSEALHGV+ 
Sbjct: 31  LKDNLVCNTTANFMDRAKALIDEFTTEELVNNTVNGSPGVPRLGLPPYQWWSEALHGVA- 89

Query: 69  IGRRTNSPPGTHF---DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
                 + PG HF     +   ATSFP  IL  A+F++ L  ++   +STEARA  N G 
Sbjct: 90  -----GANPGVHFAPAGEDFDHATSFPQPILMGAAFDDELIHEVATVISTEARAFNNFGF 144

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
           +G+ F++PNIN  RDPRWGR  ETPGEDP  + RY    V  LQ   G   +        
Sbjct: 145 SGIDFFTPNINPFRDPRWGRGQETPGEDPLHISRYVFQLVTALQGGLGPSPY-------Y 197

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           KI A CKH+A YDL++WEG DRFHFD+ +T QD+ E +   F+ CV +  V SVMCSYN 
Sbjct: 198 KIVADCKHFAGYDLESWEGIDRFHFDAVITTQDLAEFYTPSFQSCVRDAKVGSVMCSYNS 257

Query: 246 VNGIPTCADPKLLNQTIRGDWNF-HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           VNG+P CA   LL   +R  +    G+I SDCD++Q +  +H F   T+ +A A  LKAG
Sbjct: 258 VNGVPACASSYLLQDIVRDFYGLGDGWITSDCDAVQNVFTTHNFTT-TQANASAISLKAG 316

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNN 361
            D+DCG+ Y      A+ QG + E D+  +L  LY  L+R GYFD SP+   ++ LG  +
Sbjct: 317 TDVDCGNVYAQSLGDALDQGLVEEDDLKQALVRLYGSLVRTGYFD-SPEEQPFRQLGWAD 375

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           +  P    LA  AA +GIVLLKND G LPL++ ++  + +VGP  NAT  M GNY G   
Sbjct: 376 VDTPASRRLALLAAEEGIVLLKND-GLLPLSSRDVPNVIMVGPWGNATTMMQGNYFGNAP 434

Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
              SP  GF      + +  G       + S    A+ AA + D  V V G D  VE E 
Sbjct: 435 YLVSPRQGFVDAGFNVTFFNGTVGTNGTDTSGFDEAVAAAGDTDLIVFVGGPDNVVERES 494

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
           +DR+++  PG Q +LI ++A   K P+ ++ M AG VD  + K +  I +++W GYPG+ 
Sbjct: 495 RDRINITWPGVQLDLIKELAGVGK-PMIVLQMGAGQVDDTWLKESDAINALIWGGYPGQS 553

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
           GG A+A+++ GK  P  RLPIT Y  +Y+ +P T M +RP N+ PGRTYK+F G  ++ F
Sbjct: 554 GGTALANIVTGKTAPAARLPITQYPEDYISLPMTDMNVRPSNSSPGRTYKWFTGEPIFEF 613

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           G+GL Y++F +  A  P +              ++ +G       A     V    +  T
Sbjct: 614 GFGLHYSKFDFAWAEEPPA--------------SFAIGD----LVANASSPVDLATFH-T 654

Query: 662 FQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF---IAAGQSAKVGFTMNA 717
           FQ+ V N+G +    V M++ +   G +   +K+++GY R+    + A  +A V  T+  
Sbjct: 655 FQVNVTNLGPVASDFVAMLFGNTTAGPSPAPLKELVGYTRLTNIPVGATVTASVPVTLG- 713

Query: 718 CKSLKIVDNAANSLLASGAHTILV 741
             ++   D   NS+L  G +++ +
Sbjct: 714 --TIARADEDGNSVLFPGQYSVWL 735


>gi|389748262|gb|EIM89440.1| hypothetical protein STEHIDRAFT_182874, partial [Stereum hirsutum
           FP-91666 SS1]
          Length = 772

 Score =  488 bits (1257), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 302/745 (40%), Positives = 414/745 (55%), Gaps = 44/745 (5%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGV-S 67
           L D   C+    + +RA  L+E   L + V    + + GV RLGLP Y+WW+EALHGV S
Sbjct: 33  LRDNLVCNTTAHFVDRATSLIEEFNLTDLVNNTVNGSPGVDRLGLPPYQWWNEALHGVGS 92

Query: 68  FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
             G    S P  +F S    ATSFP  IL  A+FN+SL   I   +STEARA  N   AG
Sbjct: 93  SPGVNWGSGPDANFTS----ATSFPAPILLGATFNDSLIASIADVISTEARAFNNFNYAG 148

Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
           LTF++PNIN  RDPRWGR  ETPGEDPY + RY   YV GLQ     + +        K+
Sbjct: 149 LTFFTPNINPFRDPRWGRGQETPGEDPYHLSRYVYQYVVGLQGGLSPDPY-------YKV 201

Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
            A CKH  AYD++NWEGNDR  F++ VT QD+ E +   F+ C+ +   +S MCSYN VN
Sbjct: 202 LANCKHVLAYDVENWEGNDRTGFNAVVTTQDLSEFYTPSFQGCLRDAQGASAMCSYNAVN 261

Query: 248 GIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           G+P+CA   +L   +R  W      G+I  DC ++Q I + H +  DT  +A A  + AG
Sbjct: 262 GVPSCASSYILKDLVRDFWGLGEREGWITGDCGAVQNIYQPHGY-TDTLVNATAVAMDAG 320

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNI 362
            DLDCGD Y+     AV +G I    I T+L  LY  L+RLGYFD + Q  Y++   +N+
Sbjct: 321 TDLDCGDVYSPNLWTAVVEGLITAGQIQTALIRLYGSLIRLGYFDPAEQQPYRSFDWSNV 380

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
             P   +LA  AA QGIVLL+ND G LPL+T N+K +AL+GP ANAT ++ GNY G    
Sbjct: 381 NTPSSQDLAYNAAVQGIVLLEND-GLLPLST-NVKNIALIGPMANATLSLQGNYAGIAPF 438

Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
             SP   F      + +A G   I   +NS    A++AA+ AD  V V G+D S+EAEG+
Sbjct: 439 VISPQQAFETAGYNVTFAFGTG-ISNSDNSGYSEALEAAQGADVVVFVGGIDNSIEAEGQ 497

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
           DR  +  PG Q +LI ++ +  K P+ +V M  G  D +  K N  + ++LW GYPG+ G
Sbjct: 498 DRTSIEWPGSQLDLIGQLGELGK-PLVVVRMGGGQCDDSTLKANATVNALLWAGYPGQSG 556

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP-VNNFPGRTYKFFDGPVVYP 600
           G A+ D+I GK +P GRLP+T Y ++YV +I  T M +RP  +  PGRTYK++ G  +YP
Sbjct: 557 GTALVDIISGKQSPSGRLPVTQYPSSYVSEIDMTDMAIRPNSSGSPGRTYKWYTGAPIYP 616

Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
           FGYG+ YT F+   + S  +           +DI      NK    A    D +  D   
Sbjct: 617 FGYGIHYTTFRLAWSDSSSTT-------YNIQDI--VSSANKSGGFA----DTEILD--- 660

Query: 661 TFQIEVENMGKMDGSEVV--MVYSKPPGIAGTHIKQVIGYERV-FIAAGQSAKVGFTMNA 717
           TF + V N G    S+ V  +  +   G +   +++++GY RV  I  G +A     +  
Sbjct: 661 TFSLLVTNTGSNYTSDYVALLFANSTSGPSPAPLQELVGYTRVPHITPGGTATAELNV-T 719

Query: 718 CKSLKIVDNAANSLLASGAHTILVG 742
             S+  VD   N +L  G + + VG
Sbjct: 720 LGSISRVDENGNWILYPGTYNLWVG 744


>gi|409041356|gb|EKM50841.1| glycoside hydrolase family 3 protein [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 764

 Score =  486 bits (1250), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 299/741 (40%), Positives = 414/741 (55%), Gaps = 42/741 (5%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L D   C+       RA  LV+ +TL E V    + + GVPRLGLP Y WWSEALHGV+ 
Sbjct: 32  LKDNLVCNPSADPTSRANALVDALTLEELVNNTVNASPGVPRLGLPPYNWWSEALHGVAL 91

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
                 S PG+ F S    ATSFP  I+  A+F++ L   I   +STEARA  N G AGL
Sbjct: 92  SPGTNFSVPGSPFSS----ATSFPQPIILGATFDDDLVTSIATVISTEARAFNNAGRAGL 147

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
            F++PNIN  +DPRWGR  ETPGEDP+ + +Y    V GLQ     + +        K+ 
Sbjct: 148 DFFTPNINPFKDPRWGRGQETPGEDPFHIAQYVYQLVTGLQGGLSPDPY-------YKVI 200

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           A CKH+A YDL+NWEGN R  F++ ++ QD+ E +   F+ CV +  V SVMCSYN VNG
Sbjct: 201 ADCKHFAGYDLENWEGNSRMAFNAIISTQDLAEYYTPSFQSCVRDAHVGSVMCSYNAVNG 260

Query: 249 IPTCADPKLLNQTIRGDWNF-HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL 307
           IP+CA+  LL   IRG +    G+I SDCD++  I   H++   T  +A A  LKAG D+
Sbjct: 261 IPSCANSYLLQDIIRGHFGLGDGWITSDCDAVANIFSPHQYTT-TLVNASAVALKAGTDV 319

Query: 308 DCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNP 365
           DCG  Y+   + AV Q  + E DI  S+  LY  L+RLGYFD   +  ++ LG +++  P
Sbjct: 320 DCGTTYSQTLVDAVDQNLVTEDDIKNSMIRLYRSLVRLGYFDSPAEQPFRQLGWSDVNTP 379

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
               LA  AA +G+ LLKND G LPL++  IK +ALVGP ANAT  M GNY+G      S
Sbjct: 380 SSQALALTAAEEGVTLLKND-GTLPLSSA-IKRIALVGPWANATTQMQGNYQGIAPFLVS 437

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+         + +A G A I   ++S   AA+ A + ADA +   G+D ++E+EG DR 
Sbjct: 438 PLQALQDAGFQVTFANGTA-INSTDDSGFAAAVSAVQVADAVIYAGGIDETIESEGNDRE 496

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            +  PG Q +L++++A   K P  ++ M  G VD +  K+N  + +++W GYPG+ GG A
Sbjct: 497 IITWPGNQLDLVSQLAAVGK-PFVVLQMGGGQVDSSSLKSNKAVNALIWGGYPGQSGGAA 555

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYG 604
           I +++ GK  P GRLPIT Y A+YV +IP T M LRP    PGRTYK+F G  ++ FG+G
Sbjct: 556 IVNILTGKIAPAGRLPITQYPADYVNEIPMTDMALRPNGTSPGRTYKWFTGTPIFGFGFG 615

Query: 605 LSYTQFKYKVASSPKS--VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
           L YT F    A +P S      L  +     +++   TN  P               FTF
Sbjct: 616 LHYTTFSLDWAPTPPSSFAISTLVSEANTAGVSF---TNLAPL--------------FTF 658

Query: 663 QIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERV-FIAAGQSAKVGFTMNACKS 720
           ++ V+N GK+    V +++S    G     +KQ++ Y RV  IA GQ+      +    S
Sbjct: 659 RVNVKNTGKVGSDYVALLFSNTTAGPQPAPLKQLVSYTRVKGIAPGQTETAELKVT-LGS 717

Query: 721 LKIVDNAANSLLASGAHTILV 741
           +  +D   +S L  G + I V
Sbjct: 718 IARIDENGDSALYPGRYNIWV 738


>gi|398403795|ref|XP_003853364.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
 gi|339473246|gb|EGP88340.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
          Length = 785

 Score =  485 bits (1248), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 289/746 (38%), Positives = 403/746 (54%), Gaps = 66/746 (8%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD       RA  L+   T+ EK+   G  A GVPRLGLP Y WW EALHGV+       
Sbjct: 39  CDFTADPLTRATALIAAFTIEEKINNTGSTAPGVPRLGLPAYTWWQEALHGVA------- 91

Query: 75  SPPGTHFD--SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
             PG +F    +   ATSFP  IL  A+F++ L K +   +STEARA  N   +GL +W+
Sbjct: 92  QSPGVNFSDSGDFRYATSFPQPILMGAAFDDDLIKDVATVISTEARAFNNDARSGLDYWT 151

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PNIN  +D RWGR  ETPGEDPY +  Y  + + GLQ           D +  K+ A CK
Sbjct: 152 PNINPFKDSRWGRGQETPGEDPYHLSSYVKSLIAGLQ----------GDGKYKKVVATCK 201

Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           H+ AYDL+ W GN R+ FD  V  Q++ E ++ PF+ C  + +V + MCSYN +NGIPTC
Sbjct: 202 HFVAYDLETWNGNFRYQFDPHVGSQELVEYYMPPFQACARDANVGAFMCSYNSLNGIPTC 261

Query: 253 ADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           ADP LL   +R  WN+     ++ SDCDSIQ +   H++ + T+E+AVA  LKAG D++C
Sbjct: 262 ADPYLLQTILREHWNWTSEEQWVTSDCDSIQNVYLPHEYTS-TREEAVAVSLKAGTDVNC 320

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP-QYKNLGKNNICNPQHI 368
           G YY  F  GA+  G + E DID +L   Y  L+RLGYFDG+  +Y++L   ++  P   
Sbjct: 321 GTYYQEFLPGALSLGLVTEKDIDMALIRQYSSLVRLGYFDGTAVEYRSLSWKDVSTPYAQ 380

Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
           +LA +AA +GI LLKND G LPL       +A++G  ANAT+ M+GNY+G P    SP+ 
Sbjct: 381 QLALKAAVEGITLLKND-GILPLAITKDTKIAVIGDWANATEQMLGNYDGIPPYLHSPLW 439

Query: 429 GFYAYSKVINYA---PGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
                   + Y+    G  D    N   I  A+D    AD  +   G+D  VEAEG DRV
Sbjct: 440 AAQQTGANVTYSGNPGGQGDPTTNNWLHIWTAVD---EADVILFAGGIDNGVEAEGMDRV 496

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            +   G Q ++I ++A   K PV +  M    VD     NN  I ++LW GYPG++GG A
Sbjct: 497 SIAWTGAQLDVIGQLASRGK-PVIVAQMGTNGVDSTPLLNNQNISALLWGGYPGQDGGVA 555

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
           + D+I GK  P GRLP T Y A+Y+ K+P T M LRP     FPGRTY +++   V+ FG
Sbjct: 556 LLDIIQGKSAPAGRLPTTQYPASYISKVPMTDMHLRPNSTTGFPGRTYMWYNEKPVFEFG 615

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
           YGL YT F   ++ +  +              ++++      C    +D     D K   
Sbjct: 616 YGLHYTNFSATISPTDTT--------------SFSIADLTKDCTEHYMDRCPFADMK--- 658

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTH------IKQVIGYERVF-IAAGQSAKVGFTM 715
            I V N G +    V + +     +AG H       K+++ Y+R+  I AG S      +
Sbjct: 659 -IAVTNTGNVTSDYVTLGF-----LAGEHGPAPCPNKRLVNYQRLHNITAGASQTTSLNL 712

Query: 716 NACKSLKIVDNAANSLLASGAHTILV 741
               SL  VD+  N++L  G++ +L+
Sbjct: 713 T-LASLARVDDMGNTVLYPGSYALLI 737


>gi|389748500|gb|EIM89677.1| glycoside hydrolase family 3 protein [Stereum hirsutum FP-91666
           SS1]
          Length = 770

 Score =  484 bits (1247), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 296/752 (39%), Positives = 417/752 (55%), Gaps = 46/752 (6%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           C+    + +RAK LV  MTL E V    + + GVPRLGLP YEWWSEALHGV+       
Sbjct: 36  CNTSANFLDRAKALVNAMTLEEMVNNTVNTSPGVPRLGLPPYEWWSEALHGVA------- 88

Query: 75  SPPGTHFDS--EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
           S PG  F++  +  GATSFP  IL +A+F++ L   +  T+STEARA  N  ++GL F++
Sbjct: 89  SSPGVTFETSGDFSGATSFPEPILMSAAFDDDLIFSVASTISTEARAFGNTNHSGLDFFT 148

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PNIN  +DPRWGR  ETPGEDP    RY    + GLQ   G        S   KI A CK
Sbjct: 149 PNINPFKDPRWGRGQETPGEDPLHTSRYVYQLITGLQGGVG-------PSPYYKIIADCK 201

Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           H+AAYDL+NWEGN+R  F++ V+ QD+ E +   F+ CV +  V SVMCSYN VNG+P C
Sbjct: 202 HFAAYDLENWEGNNRMAFNAIVSTQDLAEFYTPSFQSCVRDAKVGSVMCSYNAVNGVPAC 261

Query: 253 ADPKLLNQTIRGDWNFHG--YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
             P LL   +R  +      +I SDCD++  I + H +   T  +A A  L AG D+DCG
Sbjct: 262 GSPYLLQDLVRDYFELGNDTWITSDCDAVGNIFDPHNYTT-TLTNASAVALLAGTDVDCG 320

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICNPQHI 368
             Y+     AV +G ++++D++ +L  LY  L+RLGYFD   S  Y+ LG +++  P   
Sbjct: 321 TSYSETLGEAVSEGLVSKSDVERALVRLYGSLVRLGYFDPEDSVPYRALGASDVNTPAAQ 380

Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
            LA  AA +GIVLLKND G LPL++ N+  +AL+GP ANAT  M GNYEG      SP+D
Sbjct: 381 TLAYTAAVEGIVLLKND-GLLPLSS-NVSHIALIGPWANATTQMQGNYEGIAPLLISPLD 438

Query: 429 GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
           GF +    +++  G   I   + S    A+  A  AD  V + G+D +VEAEG+DR  + 
Sbjct: 439 GFTSAGFNVSFTNGTT-ISGNSTSGFADALSMASAADVIVYIGGIDDTVEAEGQDRTSIT 497

Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
            PG Q ELI ++    K P  ++ M  G VD    K N  + ++LW GYPG+ GG+A+AD
Sbjct: 498 WPGNQLELIGELGAFGK-PFVVIQMGGGQVDDTELKANSSVNALLWGGYPGQAGGKALAD 556

Query: 549 VIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNF--PGRTYKFFDGPVVYPFGYGL 605
           +I G   P GRL  T Y A+YV ++  T M +RP N+   PGRTYK++ G  V+ FG+GL
Sbjct: 557 IITGVQAPAGRLTTTQYPASYVDQVAMTDMSVRPSNSTGSPGRTYKWYTGTPVFEFGFGL 616

Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIE 665
            YT F  + A    +    +   Q       +  +      + ++D         TF ++
Sbjct: 617 HYTTFDVEWAEGSPAASYSI---QDLVASANSSSSAVAHVDSAILD---------TFTVQ 664

Query: 666 VENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERV-FIAAGQSAKVGFTMNACKSLKI 723
           V N G +    V +++S    G +   +++++ Y RV  I  G SA     +    ++  
Sbjct: 665 VTNTGNVTSDYVALLFSNTTAGPSPAPLQELVSYARVKGITPGVSATASLNVT-LGTIAR 723

Query: 724 VDNAANSLLASGAHTILV---GEGVGGVSFPL 752
           VD   NS++  G + + V   G+     SF L
Sbjct: 724 VDEDGNSIIYPGVYNLWVDTTGQAKAVTSFEL 755


>gi|396473219|ref|XP_003839293.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
 gi|312215862|emb|CBX95814.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
          Length = 789

 Score =  483 bits (1242), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 291/749 (38%), Positives = 404/749 (53%), Gaps = 68/749 (9%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           C+      +RAK LV   TL EK+      + GVPRLG+P Y+WWSE LHG++       
Sbjct: 35  CNTSASPLDRAKSLVTLYTLEEKINATSSGSPGVPRLGIPPYQWWSEGLHGIA------- 87

Query: 75  SPPGTHFDS---EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
             P T+F +   E   +TSFP  IL  A+F++ L   + + +STEARA  N    GL FW
Sbjct: 88  -GPYTNFSTSGIEYSYSTSFPQPILMGAAFDDHLITDVAKVISTEARAFNNANRTGLDFW 146

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
           +PNIN  RDPRWGR  ETPGED + +  Y    + GLQ      Y R        + A C
Sbjct: 147 TPNINPFRDPRWGRGQETPGEDAFHLSSYVKALIAGLQGETTDPYKR--------VVATC 198

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KH+A YD+++W GN R+ FD+++++QD+ E ++ PF+ CV + +V + MCSYN VNG+PT
Sbjct: 199 KHFAGYDIEDWNGNLRYQFDAQISQQDLVEYYLQPFQACV-QANVGAFMCSYNAVNGVPT 257

Query: 252 CADPKLLNQTIRGDW---NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           CADP LL   +R  W   N   ++ SDCD++Q I   H++ + T+E AVA  L AG DLD
Sbjct: 258 CADPYLLQTILREHWGWTNEEQWVTSDCDAVQNIYLPHQW-SATREQAVADALIAGTDLD 316

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQ 366
           CG Y      GA  QG + E  +D +L   Y  L+RLG+FD +    Y+  G +++    
Sbjct: 317 CGTYMQEHLPGAFAQGLVNENVLDQALVRQYSSLVRLGWFDDAADQPYRQFGWDSVATDA 376

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
              LA  AA +GIVLLKND G LPL+  +  +L + G  ANAT  ++GNY G P    SP
Sbjct: 377 SQALARRAAVEGIVLLKND-GVLPLSIDSSVSLGVFGDWANATSQLLGNYAGVPTYLHSP 435

Query: 427 MDGFYAYSKVINYA----PGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
           +      +  INYA     G  D      S +  AI     +D  + + G+D S+E EG 
Sbjct: 436 LWALQQENLTINYAGGNPGGQGDPTTNRWSSLSGAI---ATSDILIYIGGIDNSIEEEGH 492

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
           DR  L   G Q ++I ++A   K P  +V+M  G +D     NN  I +ILW GYPG++G
Sbjct: 493 DRTSLAWTGAQLDVIFQLAATGK-PTIVVVMGGGQIDSAPLANNANISAILWAGYPGQDG 551

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
           G AI D++ GK  P GRLP T Y A+Y   +P T M LRP  N PGRTYK+++G   Y F
Sbjct: 552 GPAIVDILTGKSPPAGRLPQTQYPASYTSLVPMTDMGLRPSENNPGRTYKWYNGTATYEF 611

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           G+GL YT F   V S  +      D    C++           CA   +D          
Sbjct: 612 GHGLHYTNFSATVTSPMQQSYRIADLMSTCKN---ATSITLERCAFTSVD---------- 658

Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTH------IKQVIGYERVF-IAAGQS--AKVG 712
             I V N G +    V + Y     I+G+H       K ++GY+R+F IAAG S  A++ 
Sbjct: 659 --ISVTNTGAVASDYVTLCY-----ISGSHGPAPHPKKSLVGYQRLFGIAAGASDTARID 711

Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILV 741
            T+   +SL  VD   N +L  G ++++V
Sbjct: 712 LTL---ESLARVDEVGNKVLYPGEYSLMV 737


>gi|115436902|ref|XP_001217674.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|121734342|sp|Q0CB82.1|BXLB_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|114188489|gb|EAU30189.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 765

 Score =  482 bits (1241), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 286/703 (40%), Positives = 393/703 (55%), Gaps = 57/703 (8%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD  L    RA+ L+  MTL EK+      + GVPRLGLP Y WWSEALHGV+ 
Sbjct: 37  LSKNAVCDTTLDPVTRAQALLAAMTLEEKINNTQYNSPGVPRLGLPAYNWWSEALHGVA- 95

Query: 69  IGRRTNSPPGTHF--DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
                   PG HF        ATSFP+ I   A+F++ L K+I   + TE RA  N G+A
Sbjct: 96  ------GSPGVHFADSGNFSYATSFPSPITLGAAFDDDLVKQIATVIGTEGRAFGNAGHA 149

Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK 186
           GL +W+PNIN  RDPRWGR  ETPGEDP+   RY  + + GLQD  G E       +P K
Sbjct: 150 GLDYWTPNINPYRDPRWGRGQETPGEDPFHTSRYVYHLIDGLQDGIGPE-------KP-K 201

Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
           I A CKH+A YD+++WEGN+R+ FD+ +++QDM E +  PF+ C  +  V +VMCSYN V
Sbjct: 202 IVATCKHFAGYDIEDWEGNERYAFDAVISDQDMAEYYFPPFKTCTRDAKVDAVMCSYNSV 261

Query: 247 NGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NGIPTCADP LL   +R  W + G   ++ SDC +I  I + HK++      A A  + A
Sbjct: 262 NGIPTCADPWLLQTVLREHWEWEGVGHWVTSDCGAIDNIYKDHKYVA-DGAHAAAVAVNA 320

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNN 361
           G DLDCG  Y  F   A+ QG +    +D +L  LY  L++LGYFD +    Y+++G ++
Sbjct: 321 GTDLDCGSVYPQFLGSAISQGLLGNRTLDRALTRLYSSLVKLGYFDPAADQPYRSIGWSD 380

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           +  P   +LA  AA +G VLLKND G LPL      T+A+VGP+ANAT  + GNYEGT  
Sbjct: 381 VATPDAEQLAHTAAVEGTVLLKND-GTLPLKKNG--TVAIVGPYANATTQLQGNYEGTAK 437

Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
              + +         + YAPG   I   + S    A++AAK +D  +   G+D  VEAE 
Sbjct: 438 YIHTMLSAAAQQGYKVKYAPGTG-INSNSTSGFEQALNAAKGSDLVIYFGGIDHEVEAEA 496

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
            DR  +  PG Q +LI +++D  K P+ +V    G VD +   +N  +  +LW GYP + 
Sbjct: 497 LDRTSIAWPGNQLDLIQQLSDLKK-PLVVVQFGGGQVDDSSLLSNAGVNGLLWAGYPSQA 555

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
           GG A+ D++ GK  P GRLP+T Y   YV ++P T M LRP  + PGRTY+++D  V+ P
Sbjct: 556 GGAAVFDILTGKTAPAGRLPVTQYPEEYVDQVPMTDMNLRPGPSNPGRTYRWYDKAVI-P 614

Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
           FGYG+ YT F                 D   +  NY         AAV  ++   +    
Sbjct: 615 FGYGMHYTTF-----------------DVSWKRKNY----GPYNTAAVKAENAVLE---- 649

Query: 661 TFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERV 701
           TF ++V+N GK+    V +V+  +   G     IK ++GY+RV
Sbjct: 650 TFSLQVKNTGKVTSDYVALVFLTTTDAGPKPYPIKTLVGYQRV 692


>gi|297740661|emb|CBI30843.3| unnamed protein product [Vitis vinifera]
          Length = 401

 Score =  479 bits (1234), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 240/423 (56%), Positives = 303/423 (71%), Gaps = 38/423 (8%)

Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLL 382
           QGK  E D+DTSLR LYIVL ++G+FDG P Y++L K ++C  +HIELAA+AARQGIVLL
Sbjct: 2   QGKAREEDVDTSLRNLYIVLTQVGFFDGIPSYESLDKKDLCTKEHIELAADAARQGIVLL 61

Query: 383 KNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPG 442
           KN N  LPL+   +K LAL+GPHANAT  M+GNY G PC+Y+SP+DGF AY KV  Y  G
Sbjct: 62  KNINETLPLDPAKLKNLALIGPHANATIEMLGNYAGVPCQYSSPLDGFSAYGKV-TYEMG 120

Query: 443 CADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVAD 502
           C ++ C N + I  A++A+KNADAT+++ GLD +VE EG DR DLLLPG+QTELI +V  
Sbjct: 121 CNNVTCDNKTFIMPAVEASKNADATILLVGLDKTVEGEGLDRNDLLLPGYQTELILQVIV 180

Query: 503 AAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPI 562
           A+KGP+ LVIMS  AVDI+F+K + ++K+ILW GYPGEEGGRAIADV++GKYNPGGRLP+
Sbjct: 181 ASKGPIILVIMSGSAVDISFSKTDDRVKAILWAGYPGEEGGRAIADVVYGKYNPGGRLPL 240

Query: 563 TWYEANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
           TW++ +Y+  +P TSM LRPVNN+PGRTYKFF+G VVYPFG+GLSYT+F Y + SS    
Sbjct: 241 TWHQNDYLSMLPMTSMSLRPVNNYPGRTYKFFNGSVVYPFGHGLSYTKFNYTLRSS---- 296

Query: 622 DIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY 681
                                         ++ CKD+ F   IEV+N+G   G+EVV+VY
Sbjct: 297 ------------------------------NMSCKDH-FELDIEVKNIGAKHGNEVVLVY 325

Query: 682 SKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
           SKPP GI GTH KQVIG++RVF+ AG S  V F  N CKSL IV   A  LL SG H I+
Sbjct: 326 SKPPTGIVGTHAKQVIGFKRVFVPAGGSQNVKFEFNVCKSLGIVGYNAYKLLPSGEHKII 385

Query: 741 VGE 743
           +G+
Sbjct: 386 IGD 388


>gi|226491558|ref|NP_001146416.1| uncharacterized protein LOC100279996 [Zea mays]
 gi|223975771|gb|ACN32073.1| unknown [Zea mays]
          Length = 507

 Score =  479 bits (1233), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 240/510 (47%), Positives = 332/510 (65%), Gaps = 18/510 (3%)

Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
           MCSYN+VNG PTCAD  LL+  IRGDW  +GYI SDCDS+  +  +  +   T EDA A 
Sbjct: 1   MCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHY-TKTPEDAAAI 59

Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKN 356
            +KAGLDL+CG +    T+ AVQ GK++E+D+D ++    + LMRLG+FDG P+   + N
Sbjct: 60  SIKAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGN 119

Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
           LG +++C P + ELA EAARQGIVLLKN  G LPL+  +IK++A++GP+ANA+  MIGNY
Sbjct: 120 LGPSDVCTPSNQELAREAARQGIVLLKN-TGKLPLSATSIKSMAVIGPNANASFTMIGNY 178

Query: 417 EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDL 475
           EGTPC+YT+P+ G  A    + Y PGC ++ C  NS+ + AA  AA +AD TV+V G D 
Sbjct: 179 EGTPCKYTTPLQGLGANVATV-YQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQ 237

Query: 476 SVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
           S+E E  DR  LLLPG Q +L++ VA+A+ GP  LV+MS G  DI+FAK++ KI +ILWV
Sbjct: 238 SIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWV 297

Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFF 593
           GYPGE GG AIADV+FG +NP GRLP+TWY  ++ K+P T M +R  P   +PGRTY+F+
Sbjct: 298 GYPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTKVPMTDMRMRPDPSTGYPGRTYRFY 357

Query: 594 DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
            G  VY FG GLSYT F + + S+PK + ++L +   C             C +V  +  
Sbjct: 358 TGDTVYAFGDGLSYTSFAHHLVSAPKQLALQLAEGHACL---------TEQCPSVEAEGA 408

Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
            C+   F   + V N G+  G   V ++S PP +     K ++G+E+V +  GQ+  V F
Sbjct: 409 HCEGLAFDVHLRVRNAGERSGGHTVFLFSSPPAVHNAPAKHLLGFEKVSLEPGQAGVVAF 468

Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVGE 743
            ++ CK L +VD   N  +A G+HT+ VG+
Sbjct: 469 KVDVCKDLSVVDELGNRKVALGSHTLHVGD 498


>gi|62321271|dbj|BAD94481.1| beta-xylosidase [Arabidopsis thaliana]
          Length = 523

 Score =  478 bits (1231), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 247/520 (47%), Positives = 335/520 (64%), Gaps = 15/520 (2%)

Query: 231 VNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLN 290
           V +G+V+SVMCSYN+VNG PTCADP LL+  IRG+W  +GYIVSDCDS+  + ++  +  
Sbjct: 3   VVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQHYTK 62

Query: 291 DTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
              E A   +L AGLDL+CG +    T  AV+ G + EA ID ++   ++ LMRLG+FDG
Sbjct: 63  TPAEAAAISIL-AGLDLNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLGFFDG 121

Query: 351 SPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
           +P+   Y  LG  ++C   + ELAA+AARQGIVLLKN  G LPL+  +IKTLA++GP+AN
Sbjct: 122 NPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKN-TGCLPLSPKSIKTLAVIGPNAN 180

Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADAT 467
            TK MIGNYEGTPC+YT+P+ G  A +    Y PGC+++ C     +  A   A  AD +
Sbjct: 181 VTKTMIGNYEGTPCKYTTPLQGL-AGTVSTTYLPGCSNVACAVAD-VAGATKLAATADVS 238

Query: 468 VIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
           V+V G D S+EAE +DRVDL LPG Q EL+ +VA AAKGPV LVIMS G  DI FAKN+P
Sbjct: 239 VLVIGADQSIEAESRDRVDLRLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAKNDP 298

Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNN 584
           KI  ILWVGYPGE GG AIAD+IFG+YNP G+LP+TWY  +YV K+P T M +RP   + 
Sbjct: 299 KIAGILWVGYPGEAGGIAIADIIFGRYNPSGKLPMTWYPQSYVEKVPMTIMNMRPDKASG 358

Query: 585 FPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDIN-YTVGTNKP 643
           +PGRTY+F+ G  VY FG GLSYT+F + +  +P  V + L+++  CR     ++    P
Sbjct: 359 YPGRTYRFYTGETVYAFGDGLSYTKFSHTLVKAPSLVSLGLEENHVCRSSECQSLDAIGP 418

Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFI 703
            C     + V      F   I+V N G  +G   V +++ PP I G+  K ++G+E++ +
Sbjct: 419 HCE----NAVSGGGSAFEVHIKVRNGGDREGIHTVFLFTTPPAIHGSPRKHLVGFEKIRL 474

Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
              + A V F +  CK L +VD      +  G H + VG+
Sbjct: 475 GKREEAVVRFKVEICKDLSVVDEIGKRKIGLGKHLLHVGD 514


>gi|344302281|gb|EGW32586.1| hypothetical protein SPAPADRAFT_51129 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 788

 Score =  478 bits (1229), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 288/741 (38%), Positives = 412/741 (55%), Gaps = 41/741 (5%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L D   C+  LP  +RAK +V+  T+ E +  MG+ + GV RLGLP Y+WWSE LHG   
Sbjct: 55  LKDNDVCNPYLPNNQRAKAVVDLFTVDELIANMGNTSPGVERLGLPPYQWWSEGLHG--- 111

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
           I R   +  G     E   ATSFP  IL   +FN  L+K++G  + TEARA  N+G AGL
Sbjct: 112 IARSNFTASG-----EYSHATSFPQPILMGGAFNSDLYKQVGNVIGTEARAFNNVGRAGL 166

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
            ++SPNIN  +DPRWGR  E   E P +VG YA+NYV+GLQ   G++ + + D+  L+++
Sbjct: 167 DYYSPNINPFKDPRWGRGQEVASESPVLVGNYALNYVQGLQG--GIDSNPNDDT--LQVA 222

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           A CKH+A YD+++W+ + R  +++ +++QD+ + +   F+ CV +   +  MCSYN +NG
Sbjct: 223 ATCKHFAGYDMESWKQHSRLGYNAIISDQDLADYYFPTFQSCVRDAKAAGAMCSYNAING 282

Query: 249 IPTCADPKLLNQTIRGDWNFH-GYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL 307
           IP CA    L   IR  ++F  G I SDCDS+ +I   H ++ D    A A  +KAG+D+
Sbjct: 283 IPVCASEFFLGTVIREGFDFQNGVIHSDCDSLYSIWNPHLYVQDLGA-AAADGIKAGVDV 341

Query: 308 DCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICN 364
           +CGD Y N    A+    I E  I  S+   Y  L+RLGYFD SPQ   Y+    +++  
Sbjct: 342 NCGDTYQNNLGYALGNKTINEDQIRASVTRQYSNLIRLGYFD-SPQTNKYRTYNWSDVST 400

Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
            Q  +LA +AA +GI LLKND G LP N   +K +A++GP ANAT  M+G+Y GTP    
Sbjct: 401 SQANQLAYQAAVEGITLLKND-GTLPFNKDKVKNVAVIGPWANATTDMLGDYAGTPPYLI 459

Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           SP+ G       + YA G   I     +   AA++AAK ADA V   G+D S+E E  DR
Sbjct: 460 SPLQGAQDSGFKVQYAYGT-QINTTLTTNYTAALNAAKGADAIVYFGGIDNSIENEALDR 518

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
             L  PG Q +L++K++   K P+ +V   AG VD    KNN  + SI++ GYPG+ GG 
Sbjct: 519 ESLAWPGNQLDLVSKLSGLNK-PLVVVQFGAGQVDDTEIKNNNNVNSIVYAGYPGQSGGT 577

Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
           AI DV+ G Y P GRL  T Y A+Y  ++P T M LRP + +PGRT+ +++G  VY FGY
Sbjct: 578 AIWDVLNGIYAPAGRLSTTQYPASYADQVPMTDMTLRPRDGYPGRTFMWYNGEPVYEFGY 637

Query: 604 GLSYTQFKYKVASS-PKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
           GL YT F   +A++ PK      + DQ      +    +       LI          TF
Sbjct: 638 GLHYTTFSVSLANAPPKGAPQSFNIDQ------FIAAKSSQYVDTSLIT---------TF 682

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQV-IGYERVF-IAAGQSAKVGFTMNACKS 720
            + ++N GK+      ++YS      G H  ++ + ++++  I  GQ       +    S
Sbjct: 683 DVNIKNTGKVTSDYAALLYSNTTSGPGPHPNKILVSFDKLHQIHPGQIQTASLPV-TIGS 741

Query: 721 LKIVDNAANSLLASGAHTILV 741
           L   D   N  L  GA+T  V
Sbjct: 742 LLQTDTNGNKWLYPGAYTFFV 762


>gi|291167620|dbj|BAI82526.1| 1,4-beta-D-xylosidase [Aureobasidium pullulans var. melanogenum]
          Length = 805

 Score =  478 bits (1229), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 288/754 (38%), Positives = 409/754 (54%), Gaps = 57/754 (7%)

Query: 3   ESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEA 62
           + +   LS+   CD       RAK LV   T+ EK+   G+ + GVPRLGLP+Y+WW EA
Sbjct: 32  DCVNGPLSNNTVCDKSADPVARAKALVAAFTVAEKLNLTGNNSPGVPRLGLPVYQWWQEA 91

Query: 63  LHGVSFIGRRTNSPPGTHFDS--EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
           LHGV+       S PG  F++  +   ATSFP  IL  A+F+++L + + + VSTEARA 
Sbjct: 92  LHGVA-------SSPGVTFNATGQFDSATSFPQPILMGAAFDDALIQSVAEVVSTEARAF 144

Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
            N G AGL FW+PNIN  RDPRWGR  ETPGEDPY +  Y  + + GLQ  E        
Sbjct: 145 NNYGRAGLDFWTPNINPYRDPRWGRGQETPGEDPYHLSSYVHSLIMGLQGGE-------- 196

Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
           D    KI+A CKH+A YD+++W GN R+  D ++ ++D+ E ++  F  C  + +V + M
Sbjct: 197 DPEIRKITATCKHFAGYDIESWNGNLRYQNDVQIPQRDLVEYYLPSFRSCARDSNVGAFM 256

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDW---NFHGYIVSDCDSIQTIVESHKFLNDTKEDAV 297
           C+Y+ +NG+PTCADP LLN  +R  W   N   ++ SDCDSIQ I   H F +DT++ A 
Sbjct: 257 CTYSALNGVPTCADPWLLNDVLREHWGWTNEEQWVTSDCDSIQNIFLPHNF-SDTRQGAA 315

Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKN 356
           A  L AG DLDCG YY +    A  QG I +  +D +L  LY  L+R GYFDG +  Y+N
Sbjct: 316 AAALNAGTDLDCGTYYQHHLPLAYSQGLINQTTVDQALVRLYTSLVRTGYFDGPNAMYRN 375

Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
           L  +++      +LA +AA +G+VLLKND G LPL+  N   +AL+G  ANAT  M GNY
Sbjct: 376 LTWSDVGTTHAQQLALQAAEEGMVLLKND-GLLPLSISNGTKIALIGSWANATTQMQGNY 434

Query: 417 EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
            G P    SP+         + YA G                 AA+ AD  + + G+D+S
Sbjct: 435 YGVPTYLHSPLYAAQQTGAQVFYAQGPGGQGDPTTDHWLPVWTAAEKADIIIYIGGVDIS 494

Query: 477 VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
           VEAEG DR D+   G Q ++I ++A   K P+ L  M    +D     NN  I +++W G
Sbjct: 495 VEAEGMDREDINWTGAQLDIIGELAMYGK-PMVLAQM-GDQLDNTPIVNNANISALIWGG 552

Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRP--VNNFPGRTYKFF 593
           YPG++GG A+ ++I GK  P GRLP+T Y A+Y+  IP T M LRP      PGRTYK++
Sbjct: 553 YPGQDGGVALFNIITGKTAPAGRLPVTQYPAHYIADIPMTDMTLRPNATTGSPGRTYKWY 612

Query: 594 DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
           +G  V+ FGYG+ YT+F   ++   KS              +Y + +    C     D  
Sbjct: 613 NGTAVFEFGYGMHYTKFSADISPMSKS--------------SYDISSLLSGCNETYKDRC 658

Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH------IKQVIGYERVFIAAGQ 707
             +    +  + V N G +      + +     IAG         K ++ Y+R+   AG 
Sbjct: 659 AFE----SISVNVHNTGNVTSDYAALGF-----IAGQFGPSPYPKKSLVNYQRLHNIAGG 709

Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
           S++         SL  VD+  N+ L  G + +++
Sbjct: 710 SSQTATLNLTLGSLSRVDDHGNTYLYPGDYALMI 743


>gi|115436096|ref|NP_001042806.1| Os01g0296700 [Oryza sativa Japonica Group]
 gi|113532337|dbj|BAF04720.1| Os01g0296700, partial [Oryza sativa Japonica Group]
          Length = 522

 Score =  475 bits (1223), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 254/527 (48%), Positives = 351/527 (66%), Gaps = 23/527 (4%)

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           +NG+P CAD +LL +T+R DW  HGYIVSDCDS++ +V   K+L  T  +A A  +KAGL
Sbjct: 1   INGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTGVEATAAAMKAGL 60

Query: 306 DLDCG-------DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLG 358
           DLDCG       D++T + + AV+QGK+ E+ +D +L  LY+ LMRLG+FDG P+ ++LG
Sbjct: 61  DLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLGFFDGIPELESLG 120

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGP--HANATKAMIGNY 416
             ++C  +H ELAA+AARQG+VLLKND   LPL+   + ++AL G   H NAT  M+G+Y
Sbjct: 121 AADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFGQLQHINATDVMLGDY 180

Query: 417 EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
            G PCR  +P DG     KV++     A   C   S    A  AAK  DAT++VAGL++S
Sbjct: 181 RGKPCRVVTPYDGV---RKVVSSTSVHA---CDKGS-CDTAAAAAKTVDATIVVAGLNMS 233

Query: 477 VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
           VE E  DR DLLLP  Q   IN VA+A+  P+ LVIMSAG VD++FA++NPKI +++W G
Sbjct: 234 VERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSFAQDNPKIGAVVWAG 293

Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFF 593
           YPGEEGG AIADV+FGKYNPGGRLP+TWY+  YV KIP TSM LRP   + +PGRTYKF+
Sbjct: 294 YPGEEGGTAIADVLFGKYNPGGRLPLTWYKNEYVSKIPMTSMALRPDAEHGYPGRTYKFY 353

Query: 594 DGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPP-CAAVLID 651
            G  V+YPFG+GLSYT F Y  A++   V +K+   + C+ + Y  G + PP C AV + 
Sbjct: 354 GGADVLYPFGHGLSYTNFTYASATAAAPVTVKVGAWEYCKQLTYKAGVSSPPACPAVNVA 413

Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG-IAGTHIKQVIGYERVFIAAGQSAK 710
              C++ + +F + V N G  DG+ VV +Y+ PP  + G   KQ++ + RV +AAG + +
Sbjct: 414 SHACQE-EVSFAVTVANTGGRDGTHVVPMYTAPPAEVDGAPRKQLVAFRRVRVAAGAAVE 472

Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLN 757
           V F +N CK+  IV+  A +++ SG   +LVG+    +SFP+Q++L 
Sbjct: 473 VAFALNVCKAFAIVEETAYTVVPSGVSRVLVGDDALSLSFPVQIDLQ 519


>gi|403412992|emb|CCL99692.1| predicted protein [Fibroporia radiculosa]
          Length = 760

 Score =  475 bits (1222), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 296/742 (39%), Positives = 406/742 (54%), Gaps = 57/742 (7%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD       RA  L+   TL EK+   G+ + GVPRLGLP Y+WW EALHGV+       
Sbjct: 34  CDTSASPVARATALIGLFTLEEKINNTGNTSPGVPRLGLPAYQWWQEALHGVA------- 86

Query: 75  SPPGTHF--DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
             PG  F    E   ATSFP  IL  A+F++ L  ++   VSTEARA  N   +GL FW+
Sbjct: 87  ESPGVIFAETGEYSYATSFPQPILMGAAFDDELINQVATIVSTEARAFNNANRSGLDFWT 146

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PNIN  +DPRWGR  ETPGEDP+ +  Y  N + GLQ     EY R        I A CK
Sbjct: 147 PNINPFKDPRWGRGQETPGEDPFHLQSYVYNLITGLQGGLDPEYKR--------IVATCK 198

Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           HYA YDL+NWEGN R+ FD+ ++ QD+ E +   FE C  + +V + MCSYN VNG+P+C
Sbjct: 199 HYAGYDLENWEGNVRYGFDALISIQDLSEFYTRSFETCARDANVGAFMCSYNAVNGVPSC 258

Query: 253 ADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           A+  LL   +RG WN+     +I SDCD+IQ I E H +   T+E  VA  L AG DLDC
Sbjct: 259 ANSYLLQDILRGHWNWTSDDQWITSDCDAIQNIYEPH-YYAPTRELTVADALNAGADLDC 317

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQH 367
           G YY      A  +G  AE+ +D +L   Y  L++LGYFD +    Y+ +G  N+  P+ 
Sbjct: 318 GTYYPENLGAAYDEGLFAESTLDRALIRQYASLVKLGYFDPAENQPYRQIGWANVSTPEA 377

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
            ELA  AA +GI L+KND G LPL+  +IK+LAL+GP ANAT  M GNY G P    SP+
Sbjct: 378 EELAYRAAVEGITLIKND-GTLPLSP-SIKSLALIGPWANATTQMQGNYYGQPPYLISPL 435

Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
               A +  + Y+PG   +     S  PAA  AA+ ADA + + G+D +VEAE  DR  L
Sbjct: 436 MAAEALNYTVYYSPGPG-VDDPTTSSFPAAFAAAQAADAIIYIGGIDTTVEAEAMDRYTL 494

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
             PG Q + I++++   K P+ ++ M  G VD +    N  + +++W GYPG+ GG A+ 
Sbjct: 495 DWPGVQPDFIDQLSQFGK-PLVVLQMGGGQVDDSCLLPNTNVNALIWGGYPGQSGGTALM 553

Query: 548 DVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLS 606
           D+I G   P GRLP T Y  +YV ++  T M LRP    PGRTY ++ G  +  FG+GL 
Sbjct: 554 DIIVGNAAPAGRLPTTQYPLDYVYQVAMTDMSLRPSATNPGRTYMWYTGTPIVEFGFGLH 613

Query: 607 YTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEV 666
           YT F  ++ S P +            DI   VG     C  V   D+   +   ++ + V
Sbjct: 614 YTNFSAEL-SQPSA---------PSYDIASLVGA----CEGVAHLDLCAFE---SYTVNV 656

Query: 667 ENMG-KMDGSEVVMVYSKPPGIAGTH------IKQVIGYERVFIAAGQSAKVGFTMNACK 719
            N+G K+    V +++     +AG H       K +  Y+R+   A  S++         
Sbjct: 657 TNIGSKVTSDYVALLF-----VAGEHGPAPIPNKVLAAYDRLHTIAPLSSQQATLNLTLG 711

Query: 720 SLKIVDNAANSLLASGAHTILV 741
           SL  VD   N +L  G +T+++
Sbjct: 712 SLSRVDEYGNRVLYPGEYTLIL 733


>gi|336377735|gb|EGO18896.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 766

 Score =  474 bits (1220), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 299/755 (39%), Positives = 416/755 (55%), Gaps = 59/755 (7%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD  L    RA  +V+  T+ E +      + GVPRLGLP Y+WWSE LHGV+       
Sbjct: 37  CDTSLDPISRATAVVDLFTIDELINNTVSTSPGVPRLGLPPYQWWSEGLHGVA------- 89

Query: 75  SPPGTHFDS--EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
             PG +F +  E   ATSFP  I+  A+F++ L K +G  V  E R+  N G AGL FW+
Sbjct: 90  DSPGVNFSASGEFSYATSFPQPIIMGAAFDDELIKSVGAIVGMEGRSFNNYGRAGLDFWT 149

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP-LKISACC 191
           PNIN  +DPRWGR  ETPGEDPY + +Y  N V+GLQ           D +P  ++ + C
Sbjct: 150 PNINPFKDPRWGRGQETPGEDPYHLAQYVYNLVQGLQG--------GLDPKPYYQVISTC 201

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KH+AAYDL++W+GN R+ FD+ VT QD+ E ++  F+ C  +  V + MCSYN VNGIP+
Sbjct: 202 KHFAAYDLEDWDGNYRYGFDAIVTTQDLSEYYLPSFQSCYRDAKVGAAMCSYNAVNGIPS 261

Query: 252 CADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           CA+  LL   +R  W F    ++ SDCD++  I + H +   T E+AVA  LKAG D+DC
Sbjct: 262 CANTYLLQSILRDFWGFAEDRWVTSDCDAVDNIYDPHNY-TKTPEEAVADALKAGTDIDC 320

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS--PQYKNLGKNNICNPQH 367
           G +Y+ +  GA  Q  I E ++  +L   Y  L+RLGYFD +    Y+    NN+  PQ 
Sbjct: 321 GTFYSEYLPGAYNQSLITETELRQALIRQYASLVRLGYFDPTDIQPYRQYNWNNVDTPQA 380

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
            +LA +AA +GIVLLKND G LPL++ +IK +AL+GP  NAT  M GNY G      SP+
Sbjct: 381 QQLAYQAAAEGIVLLKND-GTLPLSS-DIKNIALIGPWGNATGEMQGNYYGVAPYLISPL 438

Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
            G  A    + Y  G  +I   + S   AAI AA+ AD  +   G+D +VE+EG DR  +
Sbjct: 439 MGAVATGYNVTYVFGT-NITSNDTSGFAAAIAAAQGADVVIYAGGIDETVESEGNDRNYI 497

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
             PG Q +L+ ++A   K P+ +V    G VD    K N  + ++LW GYPG+ GG A+ 
Sbjct: 498 TWPGNQLDLVGELAAVGK-PLVVVQFGGGQVDDTSLKANSTVNALLWAGYPGQSGGSALF 556

Query: 548 DVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLS 606
           D+I GK  P GRLP+T Y A+YV +IP T M LRP    PGRTYK++ G  +Y FGYGL 
Sbjct: 557 DIISGKVAPAGRLPVTQYPADYVYEIPMTDMDLRPNATSPGRTYKWYTGTPIYDFGYGLH 616

Query: 607 YTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEV 666
           YT F YK A +P S               Y + T           D+   D   TF + V
Sbjct: 617 YTTFSYKWAKAPSST--------------YNIQTLVQSGNLYSYLDLAPFD---TFTVNV 659

Query: 667 ENMGKMDGSEVVMVYSKPPGIAGTH------IKQVIGYERVF-IAAGQSAKVGFTMNACK 719
            N G +      +++     + GT+       K +I Y R+  IA+G +A V   +    
Sbjct: 660 TNTGNVTSDFASLLF-----VNGTYGPSPYPNKSLITYARLHDIASGDTASVALGV-TLG 713

Query: 720 SLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQL 754
           S+   D   N  L  G + + + + +G +++  QL
Sbjct: 714 SIARADTYGNMWLYPGTYQVTL-DTLGVLTYQFQL 747


>gi|336365124|gb|EGN93476.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 732

 Score =  474 bits (1219), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 300/754 (39%), Positives = 413/754 (54%), Gaps = 61/754 (8%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD  L    RA  +V+  T+ E +      + GVPRLGLP Y+WWSE LHGV+       
Sbjct: 22  CDTSLDPISRATAVVDLFTIDELINNTVSTSPGVPRLGLPPYQWWSEGLHGVA------- 74

Query: 75  SPPGTHFDS--EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
             PG +F +  E   ATSFP  I+  A+F++ L K +G  V  E R+  N G AGL FW+
Sbjct: 75  DSPGVNFSASGEFSYATSFPQPIIMGAAFDDELIKSVGAIVGMEGRSFNNYGRAGLDFWT 134

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP-LKISACC 191
           PNIN  +DPRWGR  ETPGEDPY + +Y  N V+GLQ           D +P  ++ + C
Sbjct: 135 PNINPFKDPRWGRGQETPGEDPYHLAQYVYNLVQGLQG--------GLDPKPYYQVISTC 186

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KH+AAYDL++W+GN R+ FD+ VT QD+ E ++  F+ C  +  V + MCSYN VNGIP+
Sbjct: 187 KHFAAYDLEDWDGNYRYGFDAIVTTQDLSEYYLPSFQSCYRDAKVGAAMCSYNAVNGIPS 246

Query: 252 CADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           CA+  LL   +R  W F    ++ SDCD++  I + H +   T E+AVA  LKAG D+DC
Sbjct: 247 CANTYLLQSILRDFWGFAEDRWVTSDCDAVDNIYDPHNY-TKTPEEAVADALKAGTDIDC 305

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS--PQYKNLGKNNICNPQH 367
           G +Y+ +  GA  Q  I E ++  +L   Y  L+RLGYFD +    Y+    NN+  PQ 
Sbjct: 306 GTFYSEYLPGAYNQSLITETELRQALIRQYASLVRLGYFDPTDIQPYRQYNWNNVDTPQA 365

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
            +LA +AA +GIVLLKND G LPL++ +IK +AL+GP  NAT  M GNY G      SP+
Sbjct: 366 QQLAYQAAAEGIVLLKND-GTLPLSS-DIKNIALIGPWGNATGEMQGNYYGVAPYLISPL 423

Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
            G  A    + Y  G  +I   + S   AAI AA+ AD  +   G+D +VE+EG DR  +
Sbjct: 424 MGAVATGYNVTYVFGT-NITSNDTSGFAAAIAAAQGADVVIYAGGIDETVESEGNDRNYI 482

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
             PG Q +L+ ++A   K P+ +V    G VD    K N  + ++LW GYPG+ GG A+ 
Sbjct: 483 TWPGNQLDLVGELAAVGK-PLVVVQFGGGQVDDTSLKANSTVNALLWAGYPGQSGGSALF 541

Query: 548 DVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLS 606
           D+I GK  P GRLP+T Y A+YV +IP T M LRP    PGRTYK++ G  +Y FGYGL 
Sbjct: 542 DIISGKVAPAGRLPVTQYPADYVYEIPMTDMDLRPNATSPGRTYKWYTGTPIYDFGYGLH 601

Query: 607 YTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEV 666
           YT F YK A +P S               Y + T           D+   D   TF + V
Sbjct: 602 YTTFSYKWAKAPSST--------------YNIQTLVQSGNLYSYLDLAPFD---TFTVNV 644

Query: 667 ENMGKMDGSEVVMVYSKPPGIAGTH------IKQVIGYERVF-IAAGQSAKVGFTMNACK 719
            N G +      +++     + GT+       K +I Y R+  IA+G +A V   +    
Sbjct: 645 TNTGNVTSDFASLLF-----VNGTYGPSPYPNKSLITYARLHDIASGDTASVALGV-TLG 698

Query: 720 SLKIVDNAANSLLASGAHTI---LVGEGVGGVSF 750
           S+   D   N  L  G + +    +G  VG  +F
Sbjct: 699 SIARADTYGNMWLYPGTYQVTLDTLGNSVGANTF 732


>gi|409079878|gb|EKM80239.1| hypothetical protein AGABI1DRAFT_120267 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 786

 Score =  471 bits (1211), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 289/754 (38%), Positives = 412/754 (54%), Gaps = 46/754 (6%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L   P CD+      RA+ L++  T  E +Q   + + GVPRLGLP YEWWSEALHGV  
Sbjct: 32  LKSTPVCDSAKDPATRAQSLIQMFTDDELIQNGDNASPGVPRLGLPPYEWWSEALHGVGH 91

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
                 +P G     +   ATSFP  I+  A+F++ L K +   VSTEARA  N G AGL
Sbjct: 92  SPGVVFAPSG-----DFSSATSFPQPIVIGAAFDDDLVKAVANVVSTEARAFNNFGRAGL 146

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP-LKI 187
            +++PNIN  +DPRWGR  ETPGEDP+ + +Y  + V GLQ   G+      D  P +K+
Sbjct: 147 NYFTPNINPFKDPRWGRGQETPGEDPFHLSQYVYHLVDGLQG--GI------DPWPYIKV 198

Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
           +A CKH+AAYDL+NWEG DRFHFD++V++QD+ E ++ PF+ CV +   +SVMCSYN VN
Sbjct: 199 AADCKHFAAYDLENWEGIDRFHFDAQVSQQDLSEYYLPPFQSCVRDAKAASVMCSYNSVN 258

Query: 248 GIPTCADPKLLNQTIRGDWNFHG--YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           G+P CA   LL   +R  W F    ++ SDC ++  I +SH F     E A A  LKAG 
Sbjct: 259 GVPACASTYLLQDILRDAWGFDDDRWVTSDCWALDKIFDSHNFTRSFAE-AAAISLKAGT 317

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNIC 363
           D+DCG  + +    A+ Q  I+  D+  +    Y  L+RLGYFD   S  Y+    +++ 
Sbjct: 318 DIDCGSTFADHLPAALNQSLISRDDLTRAFIRQYTSLIRLGYFDPSDSQTYRQFDWSDVN 377

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
            P+   L+  AA +G+VLLKND G LPL     KT+A++GP+ NAT +M GNY G     
Sbjct: 378 TPEAQALSRRAAVEGLVLLKND-GLLPLAPDG-KTIAIIGPYTNATSSMQGNYFGNAPII 435

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
           TSP  G       +  A G   +   +++    AI+ AK AD  V V G+D ++E EG D
Sbjct: 436 TSPFQGAQDVGFKVVSAAGTT-VNGTSSAGFAEAINTAKAADVVVFVGGIDNTLEREGLD 494

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R  +  PG Q +L+  +A   K P+ +V    G VD      N K+++I+W GYPG+ GG
Sbjct: 495 RSSISWPGNQLDLVKDLASLGK-PLIVVQFGGGQVDDTEILANKKVQAIIWAGYPGQSGG 553

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
            AI D+I G   P GRLP+T Y A+Y  ++  T M LRP ++ PGRTYK++  PV+  +G
Sbjct: 554 TAIFDIIVGSTAPAGRLPVTQYPADYTHQVRMTDMSLRPSSHNPGRTYKWYKTPVL-EYG 612

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
           +GL +T F +     P +   + D  +  R             +     D+   D   TF
Sbjct: 613 HGLHFTTFDFSWQRQPAA---EYDIQELIR------------ASHSKFLDLAHFD---TF 654

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVF-IAAGQSAKVGFTMNACKS 720
           +I V N G +    V +++       G H IK ++ Y RV  I  G SA +   +    S
Sbjct: 655 EICVRNTGNITSDYVGLLFLSGNTGPGPHPIKSLVAYSRVHDIQGGTSATLTLKVT-LGS 713

Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVSFPLQL 754
           +  VD   +  L  G + +++    G ++ P +L
Sbjct: 714 VARVDKNGDLWLFPGPYRLVLDTKDGVLTHPFRL 747


>gi|426198356|gb|EKV48282.1| hypothetical protein AGABI2DRAFT_67675 [Agaricus bisporus var.
           bisporus H97]
          Length = 763

 Score =  469 bits (1207), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 288/754 (38%), Positives = 412/754 (54%), Gaps = 46/754 (6%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L   P CD+      RA+ L++  T  E +Q   + + GVPRLGLP YEWWSEALHGV  
Sbjct: 32  LKSTPVCDSTKDPATRAQSLIQMFTDDELIQNGDNASPGVPRLGLPPYEWWSEALHGVGH 91

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
                 +P G     +   ATSFP  I+  A+F++ L K +   VSTEARA  N G AGL
Sbjct: 92  SPGVVFAPSG-----DFSSATSFPQPIVIGAAFDDDLVKAVANVVSTEARAFNNFGRAGL 146

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP-LKI 187
            +++PNIN  +DPRWGR  ETPGEDP+ + +Y  + V GLQ   G+      D  P +K+
Sbjct: 147 NYFTPNINPFKDPRWGRGQETPGEDPFHLSQYVYHLVDGLQG--GI------DPWPYIKV 198

Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
           +A CKH+AAYDL+NWEG DRFHFD++V++QD+ E ++ PF+ CV +   +SVMCSYN VN
Sbjct: 199 AADCKHFAAYDLENWEGIDRFHFDAQVSQQDLSEYYLPPFQSCVRDAKAASVMCSYNSVN 258

Query: 248 GIPTCADPKLLNQTIRGDWNFHG--YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           G+P CA   LL   +R  W F    ++ SDC ++  I +SH F     E A A  LKAG 
Sbjct: 259 GVPACASTYLLQDILRDAWGFDDDRWVTSDCWALDKIFDSHNFTRSFAE-AAAISLKAGT 317

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNIC 363
           D+DCG  + +    A+ Q  I+  D+  +    Y  L+RLGYFD   S  Y+    +++ 
Sbjct: 318 DIDCGSTFADHLPAALNQSLISRDDLTRAFIRQYTSLIRLGYFDPSHSQTYRQFDWSDVN 377

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
            P+   L+  AA +G+VLLKND G LPL     KT+A++GP+ NAT +M GNY G     
Sbjct: 378 TPEAQALSRRAAVEGLVLLKND-GLLPLAPDG-KTIAIIGPYTNATSSMQGNYFGNAPFI 435

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
           TSP  G       +  A G   +   +++    AI+ A+ AD  V V G+D ++E EG D
Sbjct: 436 TSPFQGAQDVGFKVVSAAGTI-VNGTSSAGFAEAINTARAADVVVFVGGIDNTLEREGLD 494

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R  +  PG Q +L+  +A   K P+ +V    G VD      N K+++I+W GYPG+ GG
Sbjct: 495 RSSISWPGNQLDLVKDLASLGK-PLIVVQFGGGQVDDTEILANEKVQAIIWAGYPGQSGG 553

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
            AI D+I G   P GRLP+T Y A+Y  ++  T M LRP ++ PGRTYK++  PV+  +G
Sbjct: 554 TAIFDIIVGATAPAGRLPVTQYPADYTHQVRMTDMSLRPSSHNPGRTYKWYKTPVL-EYG 612

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
           +GL +T F +     P +   + D  +  R             +     D+   D   TF
Sbjct: 613 HGLHFTTFDFSWQRQPAA---EYDIQELIR------------ASHSKFLDLAHFD---TF 654

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVF-IAAGQSAKVGFTMNACKS 720
           +I V N G +    V +++       G H IK ++ Y RV  I  G SA +   +    S
Sbjct: 655 EICVRNTGNITSDYVGLLFLSGNSGPGPHPIKSLVAYSRVHDIQGGTSATLTLKVT-LGS 713

Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVSFPLQL 754
           +  VD   +  L  G + +++    G ++ P +L
Sbjct: 714 VARVDKNGDLWLFPGPYRLVLDTKDGVLTHPFRL 747


>gi|242216161|ref|XP_002473890.1| beta-xylosidase [Postia placenta Mad-698-R]
 gi|220726990|gb|EED80923.1| beta-xylosidase [Postia placenta Mad-698-R]
          Length = 741

 Score =  468 bits (1204), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 294/738 (39%), Positives = 400/738 (54%), Gaps = 51/738 (6%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD      ERA  L+   TL EK+   G+ A GVPRLGLP Y+WW EALHGV+       
Sbjct: 34  CDTSATPLERATALISLFTLEEKINNTGNTAPGVPRLGLPAYQWWQEALHGVA------- 86

Query: 75  SPPGTHF--DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
             PG  F    E   ATSFP  IL  A+F+++L   +   VSTEARA  N   +G+ FW+
Sbjct: 87  ESPGVIFAPSGEYSYATSFPQPILMGAAFDDALINHVATIVSTEARAFNNANRSGIDFWT 146

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PNIN  +DPRWGR  ETPGEDP+ +  Y  N + GLQ     EY R        I A CK
Sbjct: 147 PNINPFKDPRWGRGQETPGEDPFHLQSYVYNLITGLQGGLDPEYKR--------IVATCK 198

Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           H+AAYDL+NWEGN R+ FD+ V+ QD+ E +   F  C  + +V S MCSYN VNG+P+C
Sbjct: 199 HFAAYDLENWEGNVRYGFDALVSLQDLSEFYTRSFRTCARDANVGSFMCSYNAVNGVPSC 258

Query: 253 ADPKLLNQTIRGDW---NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           A+  LL   +R  W   N   YI SDCD+IQ I E H +   T+ + VA  L AG DLDC
Sbjct: 259 ANSYLLQDILRDHWGWTNEDQYITSDCDAIQNIYEPH-YYTATRAETVADALNAGTDLDC 317

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS--PQYKNLGKNNICNPQH 367
           G+YY      A  QG   E+ ++ +L   Y  L++LGYFD +    Y+ +G  N+  P+ 
Sbjct: 318 GEYYPENLGAAYDQGLFTESTLNRALIRQYAALVKLGYFDPADIQPYRQIGWANVSTPEA 377

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
            ELA  AA +GI LLKND G LPL+  +IKT+AL+GP ANAT  M GNY G      SP+
Sbjct: 378 EELAYTAAVEGITLLKND-GTLPLSP-SIKTIALIGPWANATTQMQGNYYGVAPYLISPL 435

Query: 428 DGFYAYSKVINYA--PGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
                    + Y+  PG  D      S  PAA  AA+ ADA +   G+D++VEAE  DR 
Sbjct: 436 MAAEELGFTVYYSAGPGVDD---PTTSSFPAAFAAAEAADAIIYAGGIDITVEAEAMDRY 492

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            L  PG Q + I++++   K P+ ++    G +D +    NP + +++W GYPG+ GG+A
Sbjct: 493 TLDWPGVQPDFIDQLSLLGK-PLIVLQFGGGQIDDSALLPNPGVNALVWGGYPGQSGGKA 551

Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYG 604
           I D+I G   P GRLPIT Y  +YV ++  T M LRP    PGRTY ++ G  +  FG+G
Sbjct: 552 IMDIIVGNAAPAGRLPITQYPLDYVYQVAMTDMSLRPSPTNPGRTYMWYTGTPIVEFGFG 611

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
           L YT F   ++              Q    +Y + T    C+ V   D+ C    +T   
Sbjct: 612 LHYTTFTASLS--------------QPSAPSYDIATLVSLCSGVAHPDL-CPFASYT--A 654

Query: 665 EVENMGKMDGSEVV--MVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
            V N G    S+ V  +  +   G A    K ++ Y+R+   A  +++         SL 
Sbjct: 655 NVTNTGSSVTSDFVSLLFLAGEHGPAPYPNKVLVAYDRLHAIAPLASQTTTLNLTLGSLS 714

Query: 723 IVDNAANSLLASGAHTIL 740
            VD+  N++L  G +T++
Sbjct: 715 RVDDYGNTILYPGEYTLI 732


>gi|392590128|gb|EIW79457.1| glycoside hydrolase family 3 protein [Coniophora puteana RWD-64-598
           SS2]
          Length = 770

 Score =  468 bits (1203), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 268/617 (43%), Positives = 359/617 (58%), Gaps = 29/617 (4%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD  L   +RA  LVE  T+ E +    + + GVPRLGLP Y+WWSE LHGV+       
Sbjct: 37  CDTSLNATQRAAALVELFTVEELINNTVNGSPGVPRLGLPAYQWWSEGLHGVA------- 89

Query: 75  SPPGTHFDSEVPG--ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
             PG +F +  P   ATSFP  I+ +A+F+++L K +G  V  E R+  N G+AGL FW+
Sbjct: 90  DSPGVNFSTSGPFSYATSFPQPIVMSAAFDDALIKAVGGVVGMEGRSFNNYGHAGLDFWT 149

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PNIN  +DPRWGR  ETPGEDPY + +Y  N ++GLQ     E +        ++ A CK
Sbjct: 150 PNINPFKDPRWGRGQETPGEDPYHIAQYVYNLIQGLQGGVNPEPY-------FQVVATCK 202

Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           H+A YDL++WE N R+ FD+ +T QD+ E ++  F+ C  +    + MCSYN VNGIPTC
Sbjct: 203 HFAGYDLEDWENNFRYGFDALITTQDLSEFYLPSFQSCYRDAQAGASMCSYNAVNGIPTC 262

Query: 253 ADPKLLNQTIRGDWNFHG--YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
           AD  LL   +R  WNF    ++ SDCD+++ I   H +     + A A  L+AG DLDCG
Sbjct: 263 ADTYLLQDILRDYWNFDETRWVTSDCDAVENIYNPHNY-TALPQQAAADALRAGTDLDCG 321

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHI 368
            +YT +   A  Q  I E ++  +L   Y  L+RLGYFD + Q  Y+  G +N+  P   
Sbjct: 322 TFYTEYLPLAYNQSLITETELRAALTRQYASLVRLGYFDPAAQQPYRQYGWSNVDTPYAQ 381

Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
           +LA  AA +GI LLKND G LPL +  +K +AL+GP ANAT  M GNY G      SP+ 
Sbjct: 382 QLAYTAATEGITLLKND-GTLPLPS-TLKNIALIGPWANATNQMQGNYFGVAPYLVSPLQ 439

Query: 429 GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
           G  A    + Y  G  +I   + +   AAI AA+ ADA V   G+D++VEAE  DR ++ 
Sbjct: 440 GALAAGYNVTYVFGT-NITSNSTAGFAAAIAAAREADAVVYAGGIDVTVEAEAMDRYNVT 498

Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
            PG Q +LI ++A   K P  +     G VD    K N  + S++W GYPG+ GG+A+ D
Sbjct: 499 WPGNQLQLIGELAALGK-PFVVAQFGGGQVDDTEIKANASVNSLIWAGYPGQSGGQALFD 557

Query: 549 VIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN---FPGRTYKFFDGPVVYPFGYG 604
           +I GK  P GRL  T Y A+YV +IP T M LRP  N    PGRTYK++ G  VY FGYG
Sbjct: 558 IISGKVAPAGRLVTTQYPADYVYEIPMTDMNLRPNANGTTSPGRTYKWYTGAPVYEFGYG 617

Query: 605 LSYTQFKYKVASSPKSV 621
           L YT F Y    +P S 
Sbjct: 618 LHYTNFTYTWTKAPAST 634


>gi|297039776|gb|ADH95739.1| beta-xylosidase [Aspergillus fumigatus]
          Length = 771

 Score =  466 bits (1199), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 301/764 (39%), Positives = 410/764 (53%), Gaps = 85/764 (11%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD  L    RA+ LV  MT  EKV      + GVPRLGLP Y WWSEALHGV+ 
Sbjct: 37  LSKLAVCDTSLDVTTRAQSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWSEALHGVA- 95

Query: 69  IGRRTNSPPGTHFDSEVPG--ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
                   PG  F    P   ATSFP  IL  A+F++ L K++   VSTE RA  N G +
Sbjct: 96  ------GSPGVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAGRS 149

Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK 186
           GL FW+PNIN  RD RWGR  ETPGEDP  V RY  + V GLQ+  G        + P K
Sbjct: 150 GLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQNGIG-------PANP-K 201

Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
           + A CKH+AAY L++W G  R  F++ V+ QD+ E ++ PF+ C  +  V +VMCSYN +
Sbjct: 202 VVATCKHFAAYGLEDWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDARVDAVMCSYNAL 261

Query: 247 NGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NG+P CAD  LL   +R  W +     +I SDC +I  I   H F   T  +A A  L A
Sbjct: 262 NGVPACADSYLLQTILREHWKWDEPGRWITSDCGAIDDIYNGHNFTT-TPAEAAATALNA 320

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNN 361
           G DLDCG  +  +   A  +G  +   +D +L  LY   ++LGYFD +    Y+++G  +
Sbjct: 321 GTDLDCGTVFPKYLGQAADEGLYSNQTLDRALVRLYSSFVKLGYFDPAEDQPYRSIGWTD 380

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           +  P    LA +AA +GIVLLKND   LPL      TLAL+GP+ANATK M GNYEG P 
Sbjct: 381 VDTPAVEALAHKAAGEGIVLLKNDK-TLPLKAKG--TLALIGPYANATKQMQGNYEG-PA 436

Query: 422 RYTSPMDGFYAYSKV---INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
           +Y   +   +A ++    + YA G A I   + +   AA+ AAK AD  V   G+D ++E
Sbjct: 437 KYIRTL--LWAATQAGYDVKYAAGTA-INTNSTAGFDAALSAAKQADVVVYAGGIDNTIE 493

Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
           AEG+DR  +  PG Q  LI++++   K P+ +V    G VD +   +NP++ ++LW GYP
Sbjct: 494 AEGRDRTTIAWPGNQVNLIDQLSKIGK-PLVVVQFGGGQVDDSSLLSNPRVNALLWAGYP 552

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPV 597
            +EGG AI D++ GK  P GRLP+T Y A+YV ++P T M LRP +N PGRTY+++D  V
Sbjct: 553 SQEGGSAIFDILTGKTAPAGRLPVTQYPADYVNQVPMTDMALRPGSNTPGRTYRWYDKAV 612

Query: 598 VYPFGYGLSYTQFKYK--------------VASSPKSVDIKLDKDQQCRDINYTVGTNKP 643
           + PFG+GL YT FK                V+ SPK+V I    D+   D          
Sbjct: 613 L-PFGFGLHYTTFKISWPRRALGPYNTAALVSRSPKNVPI----DRAAFD---------- 657

Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP--PGIAGTHIKQVIGYERV 701
                            TF I+V N GK     V +++ K    G     +K ++GY R 
Sbjct: 658 -----------------TFHIQVTNTGKTTSDYVALLFLKTIDAGPKPYPLKTLVGYTRA 700

Query: 702 -FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
             I  G+   V   ++     +  +N  + +L  G +T+ V  G
Sbjct: 701 KQIKPGEKRSVDIEVSLGSLARTAEN-GDLVLYPGRYTLEVDVG 743


>gi|402225863|gb|EJU05924.1| hypothetical protein DACRYDRAFT_113532 [Dacryopinax sp. DJM-731
           SS1]
          Length = 778

 Score =  466 bits (1199), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 285/744 (38%), Positives = 402/744 (54%), Gaps = 44/744 (5%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L++   CD+ L    RA+ LV  +T+ EK     + + GVPRLGLP Y WWSE LHGV+ 
Sbjct: 36  LANTTVCDSALDPLTRARALVGMLTMAEKFNNTVNASPGVPRLGLPPYNWWSEGLHGVAS 95

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
               T +P G +F      ATSFP  IL  A+F+++L   I   +STEARA  N  ++GL
Sbjct: 96  SPGVTFAPAGQNFSY----ATSFPEPILMGAAFDDNLIYDIATIISTEARAFNNFNHSGL 151

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
            FW+PNIN VRDPRWGR LETPGEDP+ +  Y    V GLQ           D +  K+ 
Sbjct: 152 DFWTPNINPVRDPRWGRSLETPGEDPFHLASYVAKLVTGLQ-------FGGDDPKYQKLV 204

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           A CKHYA YDL+NW G  R+ FD+ ++ QD+ E F+ PF+ C  + +V+SVMCSYN VNG
Sbjct: 205 ATCKHYAGYDLENWGGYARYGFDAVISNQDLVEYFLPPFQTCARDVNVTSVMCSYNAVNG 264

Query: 249 IPTCADPKLLNQTIRGDWNFH--------GYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
           IP+CA+  LL   +R  W +          Y+ SDCD++  I   H +   T E AVA  
Sbjct: 265 IPSCANDYLLQSLLRTYWGWEPDSESLNAHYVTSDCDAVSNIYYPHNY-TITPEQAVAVS 323

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLG 358
           LKAG DLDCG +Y  +   + +QG   + DID +L   Y  L  LGYFD +    Y+   
Sbjct: 324 LKAGTDLDCGTFYAEWLPSSYEQGLFHQTDIDRALIRSYAALFLLGYFDPAEGQIYRQYN 383

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
             NI      +LA  AA +GI LLKN +  LPL +  +  +AL+GP ANAT  M GNY+G
Sbjct: 384 WANINTDYAQQLAYTAAWEGITLLKNIDDMLPLPS-TMTNIALIGPWANATTQMQGNYQG 442

Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
                 SP+         + Y  G  +I   + +   AA+ AA+ AD T+ + G+D++VE
Sbjct: 443 IAPFLHSPLYALQQRGINVTYVLGT-NITSNSTAGFAAALAAAQTADLTLYIGGIDITVE 501

Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
           AE  DRV++  PG Q +LI ++A+ +   + +  M  G +D      NPK+  +LW GYP
Sbjct: 502 AEAMDRVNITWPGNQLDLIAQLANVSTH-LIVYQMGGGQIDDTVLLENPKVHGLLWGGYP 560

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPV 597
           G++GG A+ D+++G   P GRLP++ Y AN++ ++P T M L P    PGRTYK++ G +
Sbjct: 561 GQDGGTAMIDILYGSRAPAGRLPLSQYPANFINEVPMTDMRLHPALGTPGRTYKWYSGDL 620

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           V PFGYGL YT F                KD   R  +     N+   ++  +D    K 
Sbjct: 621 VLPFGYGLHYTTFAKAAL-----------KDHSPRSSDIATLVNEAKQSSAWLD----KA 665

Query: 658 YKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF-IAAGQSAKVGFTM 715
           +   F  EV N G +    V + Y +   G A      ++ Y R+  +  G++  V F +
Sbjct: 666 FFDVFAAEVTNTGSLTSDYVALGYLTGEFGPAPYPKSSLVSYTRLSQVTPGETQVVNFDL 725

Query: 716 NACKSLKIVDNAANSLLASGAHTI 739
               S+   D   +  L  G +T+
Sbjct: 726 T-LGSIARADYYGDLYLYPGTYTL 748


>gi|426198365|gb|EKV48291.1| hypothetical protein AGABI2DRAFT_219902 [Agaricus bisporus var.
           bisporus H97]
          Length = 767

 Score =  466 bits (1198), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 295/758 (38%), Positives = 421/758 (55%), Gaps = 54/758 (7%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD       RAK L++  T  E +Q   +++ GVPRLG+P Y+WWSEALHGV+ 
Sbjct: 32  LSSTAVCDPTKAPAARAKTLIQMFTDEELMQNTDNVSPGVPRLGVPSYQWWSEALHGVA- 90

Query: 69  IGRRTNSPPGTHF--DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
                   PG  F    E   ATSFP  I+  ++F+  L K +   +STEARA  N   A
Sbjct: 91  ------GSPGVSFAPSGEFSSATSFPQSIVLGSTFDIDLVKAVATVISTEARAFNNFHRA 144

Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP-L 185
           GL +++PNIN  +DPRWGR  ETPGEDP+ V +Y  + + GLQ   G+      D RP  
Sbjct: 145 GLDYFTPNINPFKDPRWGRGQETPGEDPFHVSQYVYSLIDGLQG--GI------DPRPYF 196

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K++A CKHYAAYDLD+WEG DRFHFD++V+ QD+ E ++  F+ CV +  V+SVMCSYN 
Sbjct: 197 KVAADCKHYAAYDLDSWEGIDRFHFDAKVSLQDLSEYYLPSFQSCVRDAKVASVMCSYNS 256

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHG--YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           VNGIP CA+P LL   +R  W F    ++ SDCD+I  I  +H F  DT  +AVA  LKA
Sbjct: 257 VNGIPACANPYLLQDILRDFWGFDDDRWVTSDCDAIGNIFTTHNF-TDTFAEAVADALKA 315

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
           G D+DCG  Y+     A+ Q  I   D++ +L   Y  LMRLGYFD   S   + L  ++
Sbjct: 316 GTDVDCGTSYSTHLPDALNQSLITRDDLERALTRQYTSLMRLGYFDPPESQPLRQLAWSD 375

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           +  P    LA  AA +G+VLLKND G LP++    KT+A++GP+ANATK M GNY GT  
Sbjct: 376 VNKPDAQALAHTAAVEGLVLLKND-GFLPVSASG-KTIAIIGPYANATKDMQGNYFGTAP 433

Query: 422 RYTSPMDGFY--AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
              +P  G     +++V++ A     I   + +   AAI  A ++D  +   G++ S+E+
Sbjct: 434 FIVTPFQGAVDAGFNEVVSAA--GTSINGTSEADFAAAIAVANSSDIIIFAGGINNSIES 491

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
           E KDR+ +   G Q  L+ ++A   K PV +V    G +D +   +N  +++++W GYPG
Sbjct: 492 EAKDRLTIAWTGNQLSLVKQLASLGK-PVVVVQFGGGQLDDSDLLDNDAVRAVIWAGYPG 550

Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVV 598
           + GG AI DVI G   P GRL +T Y  ++V ++  T M LRP +  PGRTYK++ G  V
Sbjct: 551 QSGGTAIFDVITGAVAPAGRLSVTQYPEDFVNQVGMTDMALRPGSANPGRTYKWYTGRPV 610

Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
             FG+GL +T F +     P        +    + + +T     P        D+   D 
Sbjct: 611 LEFGHGLHFTTFDFSWRGRPG-------RKYNIQHLLHTADKKFP--------DLIPLD- 654

Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMN 716
             TF + + N G +    V +++ +   G A    K ++ + R   I AG SA V   +N
Sbjct: 655 --TFHVNIRNTGNITSDYVALLFLRSNAGFAPHPKKSLVSFARAHRIDAGSSATVDLGVN 712

Query: 717 ACKSLKIVDNAANSLLASGAHTIL--VGEGVGGVSFPL 752
              S+  VD   +S L +G + ++  +G+GV   SF L
Sbjct: 713 -LGSIARVDEHGDSWLFAGDYQLVLDIGDGVLSHSFSL 749


>gi|451992719|gb|EMD85198.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
           C5]
          Length = 781

 Score =  466 bits (1198), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 287/742 (38%), Positives = 400/742 (53%), Gaps = 57/742 (7%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD       RAK LV   TL EK+    + A GV RLG+P Y+WW+E LHG++       
Sbjct: 37  CDPSASTLARAKSLVALYTLEEKINATSNSAPGVARLGVPPYQWWNEGLHGIA------- 89

Query: 75  SPPGTHF--DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
             P T F    +   +TSFP  IL  A+F++ L  ++ + +STEARA  N    GL FW+
Sbjct: 90  -GPFTSFAKQGDYSYSTSFPQPILMGAAFDDDLITEVAKVISTEARAFNNANRTGLDFWT 148

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PNIN  RDPRWGR  ETPGED Y +  Y    + GLQ      Y R        + A CK
Sbjct: 149 PNINPFRDPRWGRGQETPGEDSYHLSSYVKALIHGLQGNATDPYRR--------VVATCK 200

Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           HYA YD++NW GN R+  D ++++QD+ E ++ PFE CV + +V + MCSYN VNG P C
Sbjct: 201 HYAGYDIENWNGNLRYQNDVQISQQDLVEYYLAPFEACV-QANVGAFMCSYNAVNGAPPC 259

Query: 253 ADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           ADP LL   +R  W +     ++ SDCD+IQ +   H++ + T+E A A  L AG DLDC
Sbjct: 260 ADPYLLQTVLREHWGWSSDDHWVTSDCDAIQNVYLPHQW-SSTREGAAADSLNAGTDLDC 318

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
           G Y      GAV+QG   E  +D +L   Y  L++LGYFD +P+   Y+ LG + +    
Sbjct: 319 GTYLQTHLPGAVKQGLTDETTLDKALIRQYSSLIKLGYFD-APENQPYRQLGFDAVATSA 377

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
              LA +AA +GIVLLKND G LP+N G+ K + + G  ANAT  + GNY G     TSP
Sbjct: 378 SQALALKAAEEGIVLLKND-GVLPINLGS-KQVGIYGDWANATSQLQGNYFGVAKFLTSP 435

Query: 427 MDGFYAYSKVINYA----PGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
           +         + YA     G  D      S +   I     +D  + V G+D  VE+E +
Sbjct: 436 LMALQNLGVDVKYAGNLPGGQGDPTTGAWSSLSGVI---TTSDVHIWVGGIDNGVESEDR 492

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
           DR  L L G Q ++I ++AD  K PV +VIM  G +D +    NPKI ++LW GYPG++G
Sbjct: 493 DRSWLTLTGGQLDVIGQLADTGK-PVIVVIMGGGQIDTSPLIRNPKISAVLWAGYPGQDG 551

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
           G AI +++ GK  P GRLP T Y + YV ++P T M +RP +  PGRTYK++ G  ++ F
Sbjct: 552 GTAIVNILTGKAAPAGRLPQTQYPSKYVSEVPMTDMAMRPSDKNPGRTYKWYTGEPIFEF 611

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           GYGL YT F   + + PK      D  + C   N T G  +           +C     T
Sbjct: 612 GYGLHYTNFSASITNQPKQSYAISDLVKGC---NSTGGFLE-----------RCPFTGIT 657

Query: 662 FQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACK 719
             + V+N GK+    V + + +   G      K ++ Y+R+F IAAG S+     +    
Sbjct: 658 --VSVQNTGKISSDYVTLGFLTGSFGPKPYPKKSLVAYDRLFNIAAGSSSTATLNLT-LA 714

Query: 720 SLKIVDNAANSLLASGAHTILV 741
           SL  VD + N +L  G + + +
Sbjct: 715 SLARVDESGNKVLYPGDYELQI 736


>gi|413919687|gb|AFW59619.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 451

 Score =  465 bits (1197), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 228/422 (54%), Positives = 299/422 (70%), Gaps = 18/422 (4%)

Query: 3   ESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEA 62
           ++    L+ + +C+       RA DLV R+TL EKV  + D    +PRLG+PLYEWWSEA
Sbjct: 43  DASNATLASYGFCNRSAAAAARAADLVSRLTLAEKVGFLVDKQAALPRLGVPLYEWWSEA 102

Query: 63  LHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
           LHGVS++G      PGT F   VPGATSFP  ILT ASFN +L++ IG+ VS EARAM+N
Sbjct: 103 LHGVSYVG------PGTRFSPLVPGATSFPQPILTAASFNATLFRAIGEVVSNEARAMHN 156

Query: 123 LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
           +G AGLTFWSPNIN+ RDPRWGR  ETPGEDP +  +YA+ YV GLQ          S +
Sbjct: 157 VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQGAV-------SGA 209

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             LK++ACCKHY AYD+DNW+G +R+ FD+ V++QD+ +TF  PF+ CV +G+V+SVMCS
Sbjct: 210 GALKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVVDGNVASVMCS 269

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           YN+VNG PTCAD  LL+  IRGDW  +GYI SDCDS+  +  +  +   T EDA A  +K
Sbjct: 270 YNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHY-TKTPEDAAAISIK 328

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGK 359
           AGLDL+CG +    T+ AVQ GK++E+D+D ++    + LMRLG+FDG P+   + NLG 
Sbjct: 329 AGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNLGP 388

Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
           +++C P + ELA EAARQGIVLLKN  G LPL+  +IK++A++GP+ANA+  MIGNYEGT
Sbjct: 389 SDVCTPSNQELAREAARQGIVLLKN-TGKLPLSATSIKSMAVIGPNANASFTMIGNYEGT 447

Query: 420 PC 421
            C
Sbjct: 448 SC 449


>gi|449531013|ref|XP_004172482.1| PREDICTED: beta-D-xylosidase 1-like, partial [Cucumis sativus]
          Length = 534

 Score =  465 bits (1196), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 250/543 (46%), Positives = 340/543 (62%), Gaps = 19/543 (3%)

Query: 219 MQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDS 278
           +++T+ +PF+ CV EG V+SVMCSYN+VNG PTCADP LL  TIRG W   GYIVSDCDS
Sbjct: 1   LEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDLLKNTIRGAWGLDGYIVSDCDS 60

Query: 279 IQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFL 338
           +  + +S  F   T E+A A  +KAGLDLDCG +    T  AV +G + E D++ +L  L
Sbjct: 61  VGVLYDSQHF-TPTPEEAAASTIKAGLDLDCGPFLAVHTATAVGRGLLKEVDLNNALANL 119

Query: 339 YIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGN 395
             V MRLG FDG P    Y NLG  ++C P H  LA EAARQGIVLL+N  GALPL+   
Sbjct: 120 LSVQMRLGMFDGEPAAQPYGNLGPKDVCTPAHKHLALEAARQGIVLLQNRAGALPLSPTR 179

Query: 396 IKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIP 455
            +T+A++GP+++AT  MIGNY G  C YT+P+ G   Y K I +A GCA++ C  + +I 
Sbjct: 180 HRTVAVIGPNSDATVTMIGNYAGVACEYTTPVQGISKYVKTI-HAKGCANVACVGDQLIG 238

Query: 456 AAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSA 515
            A  AA+ ADA V+V GLD S+EAE +DR  +LLPG Q EL+ ++  A KGP  +V+MS 
Sbjct: 239 EAEAAARVADAAVVVVGLDQSIEAESRDRNGVLLPGKQEELVRRIGLACKGPTVVVLMSG 298

Query: 516 GAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPY 574
           G +D++FAKN+ KI  ILWVGYPG+ GG AIADV+FG  NPGG+LP+TWY  +Y+ K+P 
Sbjct: 299 GPIDVSFAKNDGKISGILWVGYPGQAGGAAIADVLFGATNPGGKLPMTWYPQSYLAKVPM 358

Query: 575 TSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCR 632
           T+M LR  P   +PGRTY+F+ GPVV+PFG+GLSY++F    A +P    I L       
Sbjct: 359 TNMGLRPDPSTGYPGRTYRFYKGPVVFPFGFGLSYSKFSQSFAEAP--TKISLPLSSLSP 416

Query: 633 DINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI 692
           + + TV  +   CA+V               I+V+N G +DGS  ++V+S  P    +  
Sbjct: 417 NSSATVKVSHTDCASV---------SDLPIMIDVKNTGTVDGSHTILVFSTVPNQTWSPE 467

Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPL 752
           K +IG+E+V + AG   +V   ++ C  L  VD      +  G H + +G+    +S   
Sbjct: 468 KHLIGFEKVHLIAGSQKRVRIGIHVCDHLSRVDEFGTRRIPMGEHKLHIGDLTHSISLQA 527

Query: 753 QLN 755
            L 
Sbjct: 528 DLQ 530


>gi|409079872|gb|EKM80233.1| hypothetical protein AGABI1DRAFT_57801 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 767

 Score =  465 bits (1196), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 295/758 (38%), Positives = 420/758 (55%), Gaps = 54/758 (7%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD       RA  L++  T  E +Q   +++ GVPRLG+P Y+WWSEALHGV+ 
Sbjct: 32  LSSTAVCDPTKAPAARATTLIQMFTDEELMQNTDNVSPGVPRLGVPSYQWWSEALHGVA- 90

Query: 69  IGRRTNSPPGTHF--DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
                   PG  F    E   ATSFP  I+  ++F+  L K +   +STEARA  N   A
Sbjct: 91  ------GSPGVSFAPSGEFSSATSFPQSIVLGSTFDIDLVKAVATVISTEARAFNNFHRA 144

Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP-L 185
           GL +++PNIN  +DPRWGR  ETPGEDP+ V +Y  + + GLQ   G+      D RP  
Sbjct: 145 GLDYFTPNINPFKDPRWGRGQETPGEDPFHVSQYVYSLIDGLQG--GI------DPRPYF 196

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K++A CKHYAAYDLD+WEG DRFHFD++V+ QD+ E ++  F+ CV +  V+SVMCSYN 
Sbjct: 197 KVAADCKHYAAYDLDSWEGIDRFHFDAKVSLQDLSEYYLPSFQSCVRDAKVASVMCSYNS 256

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHG--YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           VNGIP CA+P LL   +R  W F    ++ SDCD+I  I  +H F  DT  +AVA  LKA
Sbjct: 257 VNGIPACANPYLLQDILRDFWGFDDDRWVTSDCDAIGNIFTTHNF-TDTFAEAVADALKA 315

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
           G D+DCG  Y+     A+ Q  I   D++ +L   Y  LMRLGYFD   S   + L  ++
Sbjct: 316 GTDVDCGTSYSTHLPDALNQSLITRDDLERALTRQYTSLMRLGYFDPPESQPLRQLAWSD 375

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           +  P    LA  AA +G+VLLKND G LP++    KT+A++GP+ANATK M GNY GT  
Sbjct: 376 VNKPDAQALAHTAAVEGLVLLKND-GFLPVSASG-KTIAIIGPYANATKDMQGNYFGTAP 433

Query: 422 RYTSPMDGFY--AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
              +P  G     +++V++ A     I   + +   AAI  A ++D  +   G++ S+E+
Sbjct: 434 FIVTPFQGAVDAGFNEVVSAA--GTSINGTSEADFAAAIAVANSSDIIIFAGGINNSIES 491

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
           E KDR+ +   G Q  L+ ++A   K PV +V    G +D +   +N  +++++W GYPG
Sbjct: 492 EAKDRLTIAWTGNQLSLVKQLASLGK-PVVVVQFGGGQLDDSDLLDNDAVRAVIWAGYPG 550

Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVV 598
           + GG AI DVI G   P GRL +T Y  ++V ++  T M LRP +  PGRTYK++ G  V
Sbjct: 551 QSGGTAIFDVITGAVAPAGRLSVTQYPEDFVNQVGMTDMALRPGSANPGRTYKWYTGRPV 610

Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
             FG+GL +T F +     P        +    + + +T     P        D+   D 
Sbjct: 611 LEFGHGLHFTTFDFSWRGRPG-------RKYNIQHLLHTADKKFP--------DLIPLD- 654

Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMN 716
             TF + + N G +    V +++ K   G A    K ++ + R   I AG SA V   +N
Sbjct: 655 --TFHVNIRNTGNITSDYVALLFLKSNAGFAPHPKKSLVSFARAHRIDAGSSATVDLGVN 712

Query: 717 ACKSLKIVDNAANSLLASGAHTIL--VGEGVGGVSFPL 752
              S+  VD   +S L +G + ++  +G+GV   SF L
Sbjct: 713 -LGSIARVDEHGDSWLFAGDYQLVLDIGDGVLSHSFSL 749


>gi|121797681|sp|Q2TYT2.1|BXLB_ASPOR RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|83775471|dbj|BAE65591.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 797

 Score =  464 bits (1194), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 289/748 (38%), Positives = 407/748 (54%), Gaps = 54/748 (7%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD  L    RAK LV  MTL EK+      + G PRLGLP Y WW+EALHGV+ 
Sbjct: 56  LSKNNVCDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVA- 114

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
            G   +     +F      ATSFP  IL  A+F++ L K++   +STEARA  N G+AGL
Sbjct: 115 EGHGVSFSDSGNFSY----ATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGL 170

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
            +W+PNIN  RDPRWGR  ETPGEDP  + RY  + V GLQD  G E       RP K+ 
Sbjct: 171 DYWTPNINPFRDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPE-------RP-KVV 222

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           A CKH+AAYDL+NWEG +R+ FD+ V+ QD+ E ++  F+ C  +  V +VMCSYN +NG
Sbjct: 223 ATCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNG 282

Query: 249 IPTCADPKLLNQTIRGDWNFH---GYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           IPTCAD  LL   +R  W +     ++  DC +I  I   H ++      A A  L AG 
Sbjct: 283 IPTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVA-DGAHAAAAALNAGT 341

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNIC 363
           DLDCG  +  +   A+QQG      ++ +L  LY  L++LGYFD +    Y+++G N + 
Sbjct: 342 DLDCGSVFPEYLGSALQQGLYNNQTLNNALIRLYSSLVKLGYFDPADDQPYRSIGWNEVF 401

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
            P   ELA +A  +GIV+LKND G LPL +    T+A++GP ANAT  + GNYEG P   
Sbjct: 402 TPAAEELAHKATVEGIVMLKND-GTLPLKSNG--TVAIIGPFANATTQLQGNYEGPPKYI 458

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
            + +         + ++ G  DI   +++    AI AAK AD  +   G+D ++E E +D
Sbjct: 459 RTLIWAAVHNGYKVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQD 517

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R  ++ PG Q +LI +++D  K P+ +V    G VD +    N  + ++LW GYP + GG
Sbjct: 518 RTTIVWPGNQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGG 576

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
            A+ D++ GK  P GRLP+T Y A+YV ++P T M LRP +N PGRTY+++D  V+ PFG
Sbjct: 577 AAVFDILTGKSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSNNPGRTYRWYDKAVL-PFG 635

Query: 603 YGLSYTQFKYK---VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           +GL YT F          P + D            +   GT   P    L D        
Sbjct: 636 FGLHYTTFNVSWNHAEYGPYNTD------------SVASGTTNAPVDTELFD-------- 675

Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGT--HIKQVIGYERV-FIAAGQSAKVGFTMN 716
            TF I V N G +    + +++    G+      IK ++GY R   I  GQS +V   ++
Sbjct: 676 -TFSITVTNTGNVASDYIALLFLTADGVGPEPYPIKTLVGYSRAKGIEPGQSQQVKLDVS 734

Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEG 744
                +  +N  + +L  G++ + V  G
Sbjct: 735 VGSVARTAEN-GDLVLYPGSYKLEVDVG 761


>gi|317158006|ref|XP_001826724.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
          Length = 776

 Score =  464 bits (1194), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 289/748 (38%), Positives = 407/748 (54%), Gaps = 54/748 (7%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD  L    RAK LV  MTL EK+      + G PRLGLP Y WW+EALHGV+ 
Sbjct: 35  LSKNNVCDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVA- 93

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
            G   +     +F      ATSFP  IL  A+F++ L K++   +STEARA  N G+AGL
Sbjct: 94  EGHGVSFSDSGNFSY----ATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGL 149

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
            +W+PNIN  RDPRWGR  ETPGEDP  + RY  + V GLQD  G E       RP K+ 
Sbjct: 150 DYWTPNINPFRDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPE-------RP-KVV 201

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           A CKH+AAYDL+NWEG +R+ FD+ V+ QD+ E ++  F+ C  +  V +VMCSYN +NG
Sbjct: 202 ATCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNG 261

Query: 249 IPTCADPKLLNQTIRGDWNFH---GYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           IPTCAD  LL   +R  W +     ++  DC +I  I   H ++      A A  L AG 
Sbjct: 262 IPTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVA-DGAHAAAAALNAGT 320

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNIC 363
           DLDCG  +  +   A+QQG      ++ +L  LY  L++LGYFD +    Y+++G N + 
Sbjct: 321 DLDCGSVFPEYLGSALQQGLYNNQTLNNALIRLYSSLVKLGYFDPADDQPYRSIGWNEVF 380

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
            P   ELA +A  +GIV+LKND G LPL +    T+A++GP ANAT  + GNYEG P   
Sbjct: 381 TPAAEELAHKATVEGIVMLKND-GTLPLKSNG--TVAIIGPFANATTQLQGNYEGPPKYI 437

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
            + +         + ++ G  DI   +++    AI AAK AD  +   G+D ++E E +D
Sbjct: 438 RTLIWAAVHNGYKVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQD 496

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R  ++ PG Q +LI +++D  K P+ +V    G VD +    N  + ++LW GYP + GG
Sbjct: 497 RTTIVWPGNQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGG 555

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
            A+ D++ GK  P GRLP+T Y A+YV ++P T M LRP +N PGRTY+++D  V+ PFG
Sbjct: 556 AAVFDILTGKSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSNNPGRTYRWYDKAVL-PFG 614

Query: 603 YGLSYTQFKYK---VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           +GL YT F          P + D            +   GT   P    L D        
Sbjct: 615 FGLHYTTFNVSWNHAEYGPYNTD------------SVASGTTNAPVDTELFD-------- 654

Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGT--HIKQVIGYERV-FIAAGQSAKVGFTMN 716
            TF I V N G +    + +++    G+      IK ++GY R   I  GQS +V   ++
Sbjct: 655 -TFSITVTNTGNVASDYIALLFLTADGVGPEPYPIKTLVGYSRAKGIEPGQSQQVKLDVS 713

Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEG 744
                +  +N  + +L  G++ + V  G
Sbjct: 714 VGSVARTAEN-GDLVLYPGSYKLEVDVG 740


>gi|70986056|ref|XP_748529.1| beta-xylosidase [Aspergillus fumigatus Af293]
 gi|74668295|sp|Q4WFI6.1|BXLB_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|296439536|sp|B0Y0I4.1|BXLB_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|66846158|gb|EAL86491.1| beta-xylosidase, putative [Aspergillus fumigatus Af293]
 gi|159128339|gb|EDP53454.1| beta-xylosidase [Aspergillus fumigatus A1163]
          Length = 771

 Score =  464 bits (1193), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 303/764 (39%), Positives = 412/764 (53%), Gaps = 85/764 (11%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD  L    RA+ LV  MT  EKV      + GVPRLGLP Y WWSEALHGV+ 
Sbjct: 37  LSKLAVCDTSLDVTTRAQSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWSEALHGVA- 95

Query: 69  IGRRTNSPPGTHFDSEVPG--ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
                   PG  F    P   ATSFP  IL  A+F++ L K++   VSTE RA  N G +
Sbjct: 96  ------GSPGVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAGRS 149

Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK 186
           GL FW+PNIN  RD RWGR  ETPGEDP  V RY  + V GLQ+  G        + P K
Sbjct: 150 GLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQNGIG-------PANP-K 201

Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
           + A CKH+AAYDL++W G  R  F++ V+ QD+ E ++ PF+ C  +  V +VMCSYN +
Sbjct: 202 VVATCKHFAAYDLEDWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDARVDAVMCSYNAL 261

Query: 247 NGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NG+P CAD  LL   +R  W +     +I SDC +I  I   H F   T  +A A  L A
Sbjct: 262 NGVPACADSYLLQTILREHWKWDEPGRWITSDCGAIDDIYNGHNFTT-TPAEAAATALNA 320

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNN 361
           G DLDCG  +  +   A  +G  +   +D +L  LY  L++LGYFD +    Y+++G  +
Sbjct: 321 GTDLDCGTVFPKYLGQAADEGLYSNQTLDRALVRLYSSLVKLGYFDPAEDQPYRSIGWTD 380

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           +  P    LA +AA +GIVLLKND   LPL      TLAL+GP+ANATK M GNYEG P 
Sbjct: 381 VDTPAAEALAHKAAGEGIVLLKNDK-TLPLKAKG--TLALIGPYANATKQMQGNYEG-PA 436

Query: 422 RYTSPMDGFYAYSKV---INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
           +Y   +   +A ++    + YA G A I   + +   AA+ AAK AD  V   G+D ++E
Sbjct: 437 KYIRTL--LWAATQAGYDVKYAAGTA-INTNSTAGFDAALSAAKQADVVVYAGGIDNTIE 493

Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
           AEG+DR  +  PG Q  LI++++   K P+ +V    G VD +   +NP++ ++LW GYP
Sbjct: 494 AEGRDRTTIAWPGNQVNLIDQLSKIGK-PLVVVQFGGGQVDDSSLLSNPRVNALLWAGYP 552

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPV 597
            +EGG AI D++ GK  P GRLP+T Y A+YV ++P T M LRP +N PGRTY+++D  V
Sbjct: 553 SQEGGSAIFDILTGKTAPAGRLPVTQYPADYVNQVPMTDMALRPGSNTPGRTYRWYDKAV 612

Query: 598 VYPFGYGLSYTQFKYK--------------VASSPKSVDIKLDKDQQCRDINYTVGTNKP 643
           + PFG+GL YT FK                V+ SPK+V I    D+   D          
Sbjct: 613 L-PFGFGLHYTTFKISWPRRALGPYNTAALVSRSPKNVPI----DRAAFD---------- 657

Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP--PGIAGTHIKQVIGYERV 701
                            TF I+V N GK     V +++ K    G     +K ++GY R 
Sbjct: 658 -----------------TFHIQVTNTGKTTSDYVALLFLKTTDAGPKPYPLKTLVGYTRA 700

Query: 702 -FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
             I  G+   V   ++     +  +N  + +L  G +T+ V  G
Sbjct: 701 KQIKPGEKRSVDIEVSLGSLARTAEN-GDLVLYPGRYTLEVDVG 743


>gi|238508313|ref|XP_002385353.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
 gi|296439537|sp|B8NYD8.1|BXLB_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|220688872|gb|EED45224.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
          Length = 776

 Score =  462 bits (1189), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 289/748 (38%), Positives = 407/748 (54%), Gaps = 54/748 (7%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD  L    RAK LV  MTL EK+      + G PRLGLP Y WW+EALHGV+ 
Sbjct: 35  LSKNNVCDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVA- 93

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
            G   +     +F      ATSFP  IL  A+F++ L K++   +STEARA  N G+AGL
Sbjct: 94  EGHGVSFSDSGNFSY----ATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGL 149

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
            +W+PNIN  RDPRWGR  ETPGEDP  + RY  + V GLQD  G E       RP K+ 
Sbjct: 150 DYWTPNINPFRDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPE-------RP-KVV 201

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           A CKH+AAYDL+NWEG +R+ FD+ V+ QD+ E ++  F+ C  +  V +VMCSYN +NG
Sbjct: 202 ATCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNG 261

Query: 249 IPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           IPTCAD  LL   +R  W +     ++  DC +I  I   H ++      A A  L AG 
Sbjct: 262 IPTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVA-DGAHAAAAALNAGT 320

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNIC 363
           DLDCG  +  +   A+QQG      ++ +L  LY  L++LGYFD +    Y+++G N + 
Sbjct: 321 DLDCGSVFPEYLRSALQQGLYNNQTLNNALIRLYSSLVKLGYFDPADDQPYRSIGWNEVF 380

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
            P   ELA +A  +GIV+LKND G LPL +    T+A++GP ANAT  + GNYEG P   
Sbjct: 381 TPAAEELAHKATVEGIVMLKND-GTLPLKSNG--TVAIIGPFANATTQLQGNYEGPPKYI 437

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
            + +         + ++ G  DI   +++    AI AAK AD  +   G+D ++E E +D
Sbjct: 438 RTLIWAAVHNGYKVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQD 496

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R  ++ PG Q +LI +++D  K P+ +V    G VD +    N  + ++LW GYP + GG
Sbjct: 497 RTTIVWPGNQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGG 555

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
            A+ D++ GK  P GRLP+T Y A+YV ++P T M LRP +N PGRTY+++D  V+ PFG
Sbjct: 556 AAVFDILTGKSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSNNPGRTYRWYDKAVL-PFG 614

Query: 603 YGLSYTQFKYK---VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           +GL YT F          P + D            +   GT   P    L D        
Sbjct: 615 FGLHYTTFNVSWNHAEYGPYNTD------------SVASGTTNAPVDTELFD-------- 654

Query: 660 FTFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMN 716
            TF I V N G +    + +++  +   G     IK ++GY R   I  GQS +V   ++
Sbjct: 655 -TFSITVTNTGNVASDYIALLFLTADRVGPEPYPIKTLVGYSRAKGIEPGQSQQVKLDVS 713

Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEG 744
                +  +N  + +L  G++ + V  G
Sbjct: 714 VGSVARTAEN-GDLVLYPGSYKLEVDVG 740


>gi|391864313|gb|EIT73609.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
          Length = 797

 Score =  462 bits (1189), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 289/748 (38%), Positives = 406/748 (54%), Gaps = 54/748 (7%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD  L    RAK LV  MTL EK+      + G PRLGLP Y WW+EALHGV+ 
Sbjct: 56  LSKNNVCDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVA- 114

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
            G   +     +F      ATSFP  IL  A+F++ L K++   +STEARA  N G+AGL
Sbjct: 115 EGHGVSFSDSGNFSY----ATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGL 170

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
            +W+PNIN  RDPRWGR  ETPGEDP  + RY  + V GLQD  G E       RP K+ 
Sbjct: 171 DYWTPNINPFRDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPE-------RP-KVV 222

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           A CKH+AAYDL+NWEG +R+ FD+ V+ QD+ E ++  F+ C  +  V +VMCSYN +NG
Sbjct: 223 ATCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNG 282

Query: 249 IPTCADPKLLNQTIRGDWNFH---GYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           IPTCAD  LL   +R  W +     ++  DC +I  I   H ++      A A  L AG 
Sbjct: 283 IPTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVA-DGAHAAAAALNAGT 341

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNIC 363
           DLDCG  +  +   A+QQG      +  +L  LY  L++LGYFD +    Y+++G N + 
Sbjct: 342 DLDCGSVFPEYLGSALQQGLYNNQTLYNALIRLYSSLVKLGYFDPADDQPYRSIGWNEVF 401

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
            P   ELA +A  +GIV+LKND G LPL +    T+A++GP ANAT  + GNYEG P   
Sbjct: 402 TPAAEELAHKATVEGIVMLKND-GTLPLKSNG--TVAIIGPFANATTQLQGNYEGPPKYI 458

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
            + +         + ++ G  DI   +++    AI AAK AD  +   G+D ++E E +D
Sbjct: 459 RTLIWAAVHNGYKVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQD 517

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R  ++ PG Q +LI +++D  K P+ +V    G VD +    N  + ++LW GYP + GG
Sbjct: 518 RTTIVWPGNQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGG 576

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
            A+ D++ GK  P GRLP+T Y A+YV ++P T M LRP +N PGRTY+++D  V+ PFG
Sbjct: 577 AAVFDILTGKSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSNNPGRTYRWYDKAVL-PFG 635

Query: 603 YGLSYTQFKYK---VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           +GL YT F          P + D            +   GT   P    L D        
Sbjct: 636 FGLHYTTFNVSWNHAEYGPYNTD------------SVASGTTNAPVDTELFD-------- 675

Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGT--HIKQVIGYERV-FIAAGQSAKVGFTMN 716
            TF I V N G +    + +++    G+      IK ++GY R   I  GQS +V   ++
Sbjct: 676 -TFSITVTNTGNVASDYIALLFLTADGVGPEPYPIKTLVGYSRAKGIEPGQSQQVKLDVS 734

Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEG 744
                +  +N  + +L  G++ + V  G
Sbjct: 735 VGSVARTAEN-GDLVLYPGSYKLEVDVG 761


>gi|451849522|gb|EMD62825.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
          Length = 849

 Score =  462 bits (1188), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 278/731 (38%), Positives = 390/731 (53%), Gaps = 53/731 (7%)

Query: 24  RAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHF-- 81
           RAK LV   TL EK+    + A GV RLG+P Y+WW+E LHG++         P T F  
Sbjct: 114 RAKSLVALYTLEEKINATSNSAPGVARLGIPPYQWWNEGLHGIA--------GPFTSFAK 165

Query: 82  DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
             +   +TSFP  IL  A+F+++L  ++   +STEARA  N+   GL FW+PNIN  RDP
Sbjct: 166 QGDYSYSTSFPQPILMGAAFDDNLITEVANVISTEARAFNNVNRTGLDFWTPNINPFRDP 225

Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
           RWGR  ETPGED Y +  Y    + GLQ  E   Y R        + A CKHYA YD++N
Sbjct: 226 RWGRGQETPGEDSYHLSSYVKALIHGLQGNETDPYRR--------VVATCKHYAGYDIEN 277

Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
           W GN R+  D ++++QD+ E ++ PFE CV + +V + MCSYN VNG P CADP +L   
Sbjct: 278 WNGNLRYQNDVQISQQDLVEYYLAPFEACV-QANVGAFMCSYNAVNGAPPCADPYMLQTV 336

Query: 262 IRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTM 318
           +R  W +     ++ SDCDSIQ +   H++ + T+E A A  L AG DLDCG Y  +   
Sbjct: 337 LREHWGWSSDEHWVTSDCDSIQNVYLPHQW-SSTREGAAADSLNAGTDLDCGTYLQSHLP 395

Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICNPQHIELAAEAAR 376
           GAV+QG   E  +D +L   Y  L++LGYFD   +  Y+ LG + +       LA +AA 
Sbjct: 396 GAVKQGLTNETTLDNALIRQYSSLIKLGYFDIPENQPYRQLGFDAVATSASQALALKAAE 455

Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV 436
           +GIVLLKND G LP+N G+ K + + G  ANAT  + GNY G     TSP          
Sbjct: 456 EGIVLLKND-GVLPINFGS-KNVGIYGDWANATSQLQGNYFGVAKFLTSPYMALEKLGVN 513

Query: 437 INYA----PGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGF 492
           + YA     G  D    +   +   I     +D  + V G+D  +E+E +DR  L L G 
Sbjct: 514 VRYAGNLPGGQGDPTTGSWPRLSGVI---TTSDVHIWVGGMDNGIESEDRDRSWLTLTGS 570

Query: 493 QTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFG 552
           Q ++I ++AD  K PV ++IM  G +D +    NPKI ++LW GYPG++GG AI +++ G
Sbjct: 571 QLDVIGQLADTGK-PVIVIIMGGGQIDTSPLIKNPKISAVLWAGYPGQDGGTAIVNILTG 629

Query: 553 KYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFK 611
           K  P GRLP T Y   YV ++P T M +RP N  PGRTYK++ G  ++ FGYGL YT F 
Sbjct: 630 KAAPAGRLPQTQYLYKYVSEVPMTDMAMRPSNKNPGRTYKWYTGKPIFEFGYGLHYTNFS 689

Query: 612 YKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGK 671
             + + PK      D  + C     + G     C    I+            + V+N GK
Sbjct: 690 ASITNQPKQSYAISDLVKGCN----STGGFLERCPFTGIN------------VSVQNTGK 733

Query: 672 MDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS 730
                V + + +   G      K ++ Y+R+F  A  S+          SL  VD + N 
Sbjct: 734 TSSDYVTLGFLTGSFGPKPYPKKSLVAYDRLFNIAASSSSTATLNLTLASLARVDESGNK 793

Query: 731 LLASGAHTILV 741
           +L  G + + +
Sbjct: 794 VLYPGDYELQI 804


>gi|119473971|ref|XP_001258861.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
 gi|292495290|sp|A1DJS5.1|XYND_NEOFI RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|119407014|gb|EAW16964.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
          Length = 771

 Score =  461 bits (1186), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 299/761 (39%), Positives = 405/761 (53%), Gaps = 79/761 (10%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD  L    RA+ LV  MT  EKV      + GVPRLGLP Y WWSEALHGV+ 
Sbjct: 37  LSKLAVCDTSLDVTTRARSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWSEALHGVA- 95

Query: 69  IGRRTNSPPGTHFDSEVPG--ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
                   PG  F    P   ATSFP  IL  A+F++ L K++   VSTE RA  N G A
Sbjct: 96  ------GSPGVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAGRA 149

Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK 186
           GL FW+PNIN  RD RWGR  ETPGEDP  V RY  + V GLQ+  G        + P K
Sbjct: 150 GLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQNGIG-------PANP-K 201

Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
           + A CKH+AAYDL++W G  R  F++ V+ QD+ E ++ PF+ C  +  V +VMCSYN +
Sbjct: 202 VVATCKHFAAYDLEDWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDAKVDAVMCSYNAL 261

Query: 247 NGIPTCADPKLLNQTIRGDWNFH---GYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NG+P CAD  LL   +R  W +     +I  DC +I  I   H +   T  +A A  L A
Sbjct: 262 NGVPACADSYLLQTILREHWKWDEPGHWITGDCGAIDDIYNGHNY-TKTPAEAAATALNA 320

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNN 361
           G DLDCG  +  +   A  +G      +D +L  LY  L++LGYFD +    Y+++G  +
Sbjct: 321 GTDLDCGTVFPKYLGQAADEGLYTNKTLDKALVRLYSSLVKLGYFDPAEDQPYRSIGWKD 380

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           + +P    LA +AA +GIVLLKND   LPL      TLAL+GP+ANATK M GNYEG P 
Sbjct: 381 VDSPAAEALAHKAAVEGIVLLKNDK-TLPLKAKG--TLALIGPYANATKQMQGNYEGPPK 437

Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
              + +         + Y  G A I   + +   AA+ AAK AD  V   G+D ++EAEG
Sbjct: 438 YIRTLLWAATQAGYDVKYVAGTA-INANSTAGFDAALSAAKQADVVVYAGGIDNTIEAEG 496

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
            DR  ++ PG Q +LI++++   K P+ +V    G VD +   +NP + ++LW GYP +E
Sbjct: 497 HDRTTIVWPGNQLDLIDQLSKIGK-PLVVVQFGGGQVDDSSLLSNPHVNALLWTGYPSQE 555

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
           GG AI D++ GK  P GRLP+T Y A+YV ++P T M LRP +N PGRTY+++D  V+ P
Sbjct: 556 GGSAIFDILTGKTAPAGRLPVTQYPADYVNQVPLTDMALRPGSNTPGRTYRWYDKAVL-P 614

Query: 601 FGYGLSYTQFKYK--------------VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           FG+GL YT FK                V+ SPK+V I    D+   D             
Sbjct: 615 FGFGLHYTTFKISWPRRALGPYDTAALVSRSPKNVPI----DRAAFD------------- 657

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP--PGIAGTHIKQVIGYERV-FI 703
                         TF I+V N GK     V +++ K    G     +K ++GY R   I
Sbjct: 658 --------------TFHIQVTNTGKTTSDYVALLFLKTIDAGPKPYPLKTLVGYTRAKQI 703

Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
             G+   V   ++     +  +N  + +L  G +T+ V  G
Sbjct: 704 KPGEKRSVDIKVSLGSLARTAEN-GDLVLYPGRYTLEVDVG 743


>gi|83774566|dbj|BAE64689.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 822

 Score =  460 bits (1184), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 280/741 (37%), Positives = 407/741 (54%), Gaps = 50/741 (6%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L   P CD  L   ER   LV+ +TL EK+  + D + G  RLGLP YEWWSEA HGV  
Sbjct: 74  LCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGV-- 131

Query: 69  IGRRTNSPPGTHFDSEVPG---ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
                 S PG  F S+      ATSFP  ILT ASF+++L +KI + +  E RA  N G 
Sbjct: 132 -----GSAPGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGF 186

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
           +G  FW+PNIN  RDPRWGR  ETPGEDP V   Y  N+V GLQ           D +  
Sbjct: 187 SGFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQ---------GDDPKNK 237

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           ++ A CKHYA YDL+      R+  +   T+QD+ + F+ PF+ CV + DV S+MCSYN 
Sbjct: 238 QVIATCKHYAVYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNS 293

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           V+GIP CA+  LL++ +R  WNF+    Y+VSDC ++  I + H F  DT+E A +  L 
Sbjct: 294 VSGIPACANEYLLSEVLRKHWNFNSDYHYVVSDCGAVTDIWQYHNF-TDTEEAAASVALN 352

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNI 362
           AG+DL+CG  Y      ++   + +   +D SL  LY  L  +G+FDG  +Y  L  +++
Sbjct: 353 AGVDLECGSSYLKLNE-SLAANQTSVKVMDQSLARLYSALFTVGFFDGG-KYDKLDFSDV 410

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGN-IKTLALVGPHANATKAMIGNYEGTPC 421
             P    LA EAA +G+ LLKND+  LPL++ +  K++A++GP ANAT  M G+Y G   
Sbjct: 411 STPDAQALAYEAAVEGMTLLKNDD-LLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAP 469

Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
              SP++ F      +NYA G A +  QN S    A+ AA  +D  + + G+D S+E+E 
Sbjct: 470 YLISPLEAFGDSRWKVNYALGTA-MNNQNTSGFEEALAAANKSDLIIYLGGIDNSLESET 528

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
            DR  L  PG Q +LI  ++  +K P+ +V    G VD +    N  I++++W GYP + 
Sbjct: 529 LDRTSLTWPGNQLDLITSLSKLSK-PLVVVQFGGGQVDDSDILKNKDIQALVWAGYPSQS 587

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
           GG A+ DV+ GK +P GRLP+T Y A+Y  ++    + LRP +++PGRTYK++ G  V P
Sbjct: 588 GGTALLDVLVGKRSPAGRLPVTQYPASYADQVNIFDINLRPTDSYPGRTYKWYTGKPVLP 647

Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
           FGYGL YT+F +    +       L+++   +D+   V + +      + D+        
Sbjct: 648 FGYGLHYTKFMFDWEKT-------LNREYNIQDL---VASCRNSSGGPINDNTPLT---- 693

Query: 661 TFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
           T ++ V+N+G      V +++  SK  G A    K ++ Y R+   A  S +V       
Sbjct: 694 TVKVRVKNVGHKTSDYVSLLFLSSKNAGPAPRPNKSLVSYVRLLNIARGSDQVAELPLTL 753

Query: 719 KSLKIVDNAANSLLASGAHTI 739
            SL   D   + ++  G + I
Sbjct: 754 GSLARADENGSLVIFPGRYKI 774


>gi|317156541|ref|XP_001825822.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
          Length = 882

 Score =  459 bits (1182), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 280/741 (37%), Positives = 407/741 (54%), Gaps = 50/741 (6%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L   P CD  L   ER   LV+ +TL EK+  + D + G  RLGLP YEWWSEA HGV  
Sbjct: 134 LCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGV-- 191

Query: 69  IGRRTNSPPGTHFDSEVPG---ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
                 S PG  F S+      ATSFP  ILT ASF+++L +KI + +  E RA  N G 
Sbjct: 192 -----GSAPGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGF 246

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
           +G  FW+PNIN  RDPRWGR  ETPGEDP V   Y  N+V GLQ           D +  
Sbjct: 247 SGFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQ---------GDDPKNK 297

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           ++ A CKHYA YDL+      R+  +   T+QD+ + F+ PF+ CV + DV S+MCSYN 
Sbjct: 298 QVIATCKHYAVYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNS 353

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           V+GIP CA+  LL++ +R  WNF+    Y+VSDC ++  I + H F  DT+E A +  L 
Sbjct: 354 VSGIPACANEYLLSEVLRKHWNFNSDYHYVVSDCGAVTDIWQYHNF-TDTEEAAASVALN 412

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNI 362
           AG+DL+CG  Y      ++   + +   +D SL  LY  L  +G+FDG  +Y  L  +++
Sbjct: 413 AGVDLECGSSYLKLNE-SLAANQTSVKVMDQSLARLYSALFTVGFFDGG-KYDKLDFSDV 470

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGN-IKTLALVGPHANATKAMIGNYEGTPC 421
             P    LA EAA +G+ LLKND+  LPL++ +  K++A++GP ANAT  M G+Y G   
Sbjct: 471 STPDAQALAYEAAVEGMTLLKNDD-LLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAP 529

Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
              SP++ F      +NYA G A +  QN S    A+ AA  +D  + + G+D S+E+E 
Sbjct: 530 YLISPLEAFGDSRWKVNYALGTA-MNNQNTSGFEEALAAANKSDLIIYLGGIDNSLESET 588

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
            DR  L  PG Q +LI  ++  +K P+ +V    G VD +    N  I++++W GYP + 
Sbjct: 589 LDRTSLTWPGNQLDLITSLSKLSK-PLVVVQFGGGQVDDSDILKNKDIQALVWAGYPSQS 647

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
           GG A+ DV+ GK +P GRLP+T Y A+Y  ++    + LRP +++PGRTYK++ G  V P
Sbjct: 648 GGTALLDVLVGKRSPAGRLPVTQYPASYADQVNIFDINLRPTDSYPGRTYKWYTGKPVLP 707

Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
           FGYGL YT+F +    +       L+++   +D+   V + +      + D+        
Sbjct: 708 FGYGLHYTKFMFDWEKT-------LNREYNIQDL---VASCRNSSGGPINDNTPLT---- 753

Query: 661 TFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
           T ++ V+N+G      V +++  SK  G A    K ++ Y R+   A  S +V       
Sbjct: 754 TVKVRVKNVGHKTSDYVSLLFLSSKNAGPAPRPNKSLVSYVRLLNIARGSDQVAELPLTL 813

Query: 719 KSLKIVDNAANSLLASGAHTI 739
            SL   D   + ++  G + I
Sbjct: 814 GSLARADENGSLVIFPGRYKI 834


>gi|238492365|ref|XP_002377419.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
 gi|220695913|gb|EED52255.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
          Length = 775

 Score =  459 bits (1182), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 281/741 (37%), Positives = 404/741 (54%), Gaps = 50/741 (6%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L   P CD  L   ER   LV+ +TL EK+  + D + G  RLGLP YEWWSEA HGV  
Sbjct: 27  LCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGVG- 85

Query: 69  IGRRTNSPPGTHFDSEVPG---ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
                 S PG  F S+      ATSFP  ILT ASF+++L +KI + +  E R   N G 
Sbjct: 86  ------SAPGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRVFGNNGF 139

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
           +G  FW+PNIN  RDPRWGR  ETPGEDP V   Y  N+V GLQ           D +  
Sbjct: 140 SGFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQ---------GDDPKNK 190

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           ++ A CKHYA YDL+      R+  +   T+QD+ E F+ PF+ CV + DV S+MCSYN 
Sbjct: 191 QVIATCKHYAVYDLE----TGRYGNNYNPTQQDLSEYFLAPFKTCVRDTDVGSIMCSYNS 246

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           V+GIP CA+  LL++ +R  WNF+    Y+VSDC ++  I + H F  DT+E A +  L 
Sbjct: 247 VSGIPACANEYLLDEVLRKHWNFNSDYHYVVSDCGAVTDIWQYHNF-TDTEEAAASVALN 305

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNI 362
           AG+DL+CG  Y      ++   + +   +D SL  LY  L  +G+FDG  +Y  L  +++
Sbjct: 306 AGVDLECGSSYLKLNE-SLAANQTSVKVMDQSLARLYSALFTVGFFDGG-KYDKLDFSDV 363

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGN-IKTLALVGPHANATKAMIGNYEGTPC 421
             P    LA EAA +G+ LLKND+  LPL++ +  K++A++GP ANAT  M G+Y G   
Sbjct: 364 STPDAQALAYEAAVEGMTLLKNDD-LLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAP 422

Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
              SP++ F      +NYA G A I  QN S    A+ AA  +D  + + G+D S+E+E 
Sbjct: 423 YLISPLEAFGDSRWKVNYALGTA-INNQNTSGFEEALAAANKSDLIIYLGGIDNSLESET 481

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
            DR  L  PG Q +LI  ++  +K P+ +V    G VD +    N  I++++W GYP + 
Sbjct: 482 LDRTSLAWPGNQLDLITSLSKLSK-PLVVVQFGGGQVDDSAILKNKDIQALVWAGYPSQS 540

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
           GG A+ DV+ GK +P GRLP+T Y A+Y  ++    + LRP + +PGRTYK++ G  V P
Sbjct: 541 GGTALLDVLVGKRSPAGRLPVTQYPASYADQVNIFDINLRPTDLYPGRTYKWYTGKPVLP 600

Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
           FGYGL YT+F +    +       L+++   +D+   V + +      + D+        
Sbjct: 601 FGYGLHYTKFMFDWEKT-------LNREYNIQDL---VASCRNSSGGPINDNTPLT---- 646

Query: 661 TFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
           T +  V+N+G      V +++  SK  G A    K ++ Y R+   A  S +V       
Sbjct: 647 TVKARVKNVGHKTSDYVSLLFLSSKNAGPAPRPNKSLVSYVRLLNIARGSDQVAELPLTL 706

Query: 719 KSLKIVDNAANSLLASGAHTI 739
            SL   D   + ++  G + I
Sbjct: 707 GSLARADENGSLVIFPGRYKI 727


>gi|340519849|gb|EGR50086.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
          Length = 796

 Score =  459 bits (1180), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 292/756 (38%), Positives = 410/756 (54%), Gaps = 64/756 (8%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS-FIGRRT 73
           CD      ERA  +V+ MTL EKV  +G  A G  RLGLP Y+W +EALHGV+   G + 
Sbjct: 75  CDTTKSIAERAAAIVKPMTLNEKVANVGSSASGSARLGLPAYQWQNEALHGVAGSTGVQF 134

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
            SP G +F +    ATSFP  IL +A+F+++L K +   +STEARA  N G AGL FW+P
Sbjct: 135 QSPLGANFSA----ATSFPMPILLSAAFDDALVKSVATAISTEARAFANYGFAGLDFWTP 190

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN  RDPRWGR +ETPGED + +  Y +  V GLQ     +++R          + CKH
Sbjct: 191 NINPFRDPRWGRGMETPGEDAFRIQGYVLALVDGLQGGIDPDFYR--------TLSTCKH 242

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           +AAYD++    N R   +   T+QDM + ++  FE CV +  V+S+MC+YN V+G+P CA
Sbjct: 243 FAAYDIE----NGRTANNLSPTQQDMADYYLPMFETCVRDAKVASIMCAYNAVDGVPACA 298

Query: 254 DPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
           D  LL   +R  + F     Y+VSDCD+++ + + H +  +  + A A  + AG DLDCG
Sbjct: 299 DSYLLQDVLRDTYGFTEDFNYVVSDCDAVENVFDPHHYAANLTQ-AAAMSINAGTDLDCG 357

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIEL 370
             Y N    +VQ G   EA +D SL  LY  L+++GYFD   +Y +LG  N+   Q   L
Sbjct: 358 SSY-NVLNASVQAGLTTEATLDKSLIRLYSALVKVGYFDQPAEYNSLGWGNVNTTQSQAL 416

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A +AA +G+ LLKND G LPL+   +  +A++GP AN T  M GNY GT     +P+  F
Sbjct: 417 AHDAATEGMTLLKND-GTLPLSR-TLSNVAVIGPWANVTTQMQGNYAGTAPLLVNPLSVF 474

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
               + + YA G A I  Q+ S   AA+ AA ++D  V + G+D+SVE EG DR  +  P
Sbjct: 475 QQKWRNVKYAQGTA-INSQDTSGFNAALSAASSSDVIVYLGGIDISVENEGFDRSSITWP 533

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
           G Q  LI+++A+  K P+ +V    G +D +   +N K+ SILW GYPG++GG AI DV+
Sbjct: 534 GNQLNLISQLANLGK-PLVIVQFGGGQIDDSALLSNSKVNSILWAGYPGQDGGNAIFDVL 592

Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
            G   P GRLP+T Y ANYV       M LRP N  PGRTY ++ G  V PFGYGL YT 
Sbjct: 593 TGANPPAGRLPVTQYPANYVNNNNIQDMNLRPSNGIPGRTYAWYTGTPVLPFGYGLHYTN 652

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
           F     S+                   T G++     A L+++        TF   V N+
Sbjct: 653 FSLSFQSTK------------------TAGSD----IATLVNNAGSNKDLATFATIVVNV 690

Query: 670 GKMDGSE--------VVMVYSKPPGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACKS 720
               G          ++ + S   G A    KQ+  Y RV  +  G + ++  T+N   S
Sbjct: 691 KNTGGKANLASDYVGLLFLKSTNAGPAPHPNKQLAAYGRVRNVGVGATQQLTLTVN-LGS 749

Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
           L   D   +  +  GA+T+++      V+ PL  N 
Sbjct: 750 LARADTNGDRWIYPGAYTLIL-----DVNGPLTFNF 780


>gi|391865040|gb|EIT74331.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
          Length = 822

 Score =  459 bits (1180), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 281/741 (37%), Positives = 404/741 (54%), Gaps = 50/741 (6%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L   P CD  L   ER   LV+ +TL EK+  + D + G  RLGLP YEWWSEA HGV  
Sbjct: 74  LCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGVG- 132

Query: 69  IGRRTNSPPGTHFDSEVPG---ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
                 S PG  F S+      ATSFP  ILT ASF+++L +KI + +  E RA  N G 
Sbjct: 133 ------SAPGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGF 186

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
           +G  FW+PNIN  RDPRWGR  ETPGEDP V   Y  N+V GLQ           D +  
Sbjct: 187 SGFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQ---------GDDPKNK 237

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           ++ A CKHYA YDL+      R+  +   T+QD+ + F+ PF+ CV + DV S+MCSYN 
Sbjct: 238 QVIATCKHYAVYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNS 293

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           V+GIP CA+  LL++ +R  WNF+    Y+VSDC ++  I + H F  DT+E A +  L 
Sbjct: 294 VSGIPACANEYLLDEVLRKHWNFNSDYYYVVSDCGAVTDIWQYHNF-TDTEEAAASVALN 352

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNI 362
           AG+DL+CG  Y      ++   + +   +D SL  LY  L  +G+FDG  +Y  L  +++
Sbjct: 353 AGVDLECGSSYLKLNE-SLAANQTSVKVMDRSLARLYSALFTVGFFDGG-KYDKLDFSDV 410

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLN-TGNIKTLALVGPHANATKAMIGNYEGTPC 421
             P    LA EAA +G+ LLKND+  LPL+     K++A++GP ANAT  M G+Y G   
Sbjct: 411 STPDAQALAYEAAVEGMTLLKNDD-LLPLDFPHKYKSVAVIGPFANATTQMQGDYSGDAP 469

Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
              SP++ F      +NYA G A I  QN S    A+ AA  +D  + + G+D S+E+E 
Sbjct: 470 YLISPLEAFGDSRWKVNYALGTA-INNQNTSGFEEALAAANKSDLIIYLGGIDNSLESET 528

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
            DR  L  PG Q +LI  ++  +K P+ +V    G VD +    N  I++++W GYP + 
Sbjct: 529 LDRTSLAWPGNQLDLITSLSKLSK-PLVVVQFGGGQVDDSAILKNKDIQALVWAGYPSQS 587

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
           GG A+ DV+ GK +P GRLP+T Y A+Y  ++    + LRP +++PGRTYK++ G  V P
Sbjct: 588 GGTALLDVLVGKRSPAGRLPVTQYPASYADQVNIFDINLRPTDSYPGRTYKWYTGKPVLP 647

Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
           FGYGL YT+F +    +       L+++   +D+   V + +      + D+        
Sbjct: 648 FGYGLHYTKFMFDWEKT-------LNREYNIQDL---VASCRNSSGGPINDNTPLT---- 693

Query: 661 TFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
           T +  V+N+G      V +++  SK  G A    K ++ Y R+   A  S +V       
Sbjct: 694 TVKARVKNVGHKTSDYVSLLFLSSKNAGPAPRPNKSLVSYVRLLNIARGSDQVAELPLTL 753

Query: 719 KSLKIVDNAANSLLASGAHTI 739
            SL   D   + ++  G + I
Sbjct: 754 GSLARADENGSLVIFPGRYKI 774


>gi|332982588|ref|YP_004464029.1| glycoside hydrolase [Mahella australiensis 50-1 BON]
 gi|332700266|gb|AEE97207.1| glycoside hydrolase family 3 domain protein [Mahella australiensis
           50-1 BON]
          Length = 714

 Score =  457 bits (1175), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 281/753 (37%), Positives = 400/753 (53%), Gaps = 99/753 (13%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D  L + +RAKDLV RMTLPEK+ QM   A  +PRL +P Y WW+E LHGV+  G   
Sbjct: 13  YKDVSLSFEDRAKDLVSRMTLPEKISQMIYDAPAIPRLDIPAYNWWNECLHGVARAGI-- 70

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--------N 125
                         AT FP  I   A+FN  L  K+ + +S EARA ++           
Sbjct: 71  --------------ATVFPQAIAMAATFNPELIHKVAEAISDEARAKHHEAVRNGDRGIY 116

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLTFWSPNIN+ RDPRWGR  ET GEDPY+  R  + +V+GLQ           D + L
Sbjct: 117 KGLTFWSPNINIFRDPRWGRGHETYGEDPYLTSRMGVAFVKGLQG---------DDPKYL 167

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K+ A  KHYA +   +   + R  FD+RV+++D++ET++  FE CV EG   S+M +YNR
Sbjct: 168 KVVATPKHYAVH---SGPESQRHSFDARVSQKDLRETYLPAFEECVKEGKAVSIMGAYNR 224

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
            NG P CA   LL   +R +W F GY+VSDC +I  I   HK +  T  ++ A  +  G 
Sbjct: 225 TNGEPCCASKTLLKDILRDEWGFDGYVVSDCGAIDDIHMHHK-VTKTAAESAALAVNNGC 283

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNIC 363
           +L+CG  Y  +   AV+QG I+E  ID ++  L+   MRLG FD     +Y ++  +   
Sbjct: 284 ELNCGKTY-EYLCQAVEQGLISEETIDQAVIKLFTARMRLGMFDPPEMVRYAHIPYDVND 342

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
           +P+H ELA E ARQ IVLLKND   LPL+   +KT+A++GP+A+    ++ NY GTP +Y
Sbjct: 343 SPEHRELALETARQSIVLLKNDENILPLSK-KLKTIAVIGPNADDLDVLLANYFGTPSKY 401

Query: 424 TSPMDGFYAY----SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
            +P++G        +KV+ YA GC ++   +      A++ A+ AD  ++  GL   +E 
Sbjct: 402 VTPLEGIKNKVSPDTKVL-YAKGC-EVTGNSVDGFDEAVNIAEMADIVIMCLGLSPRIEG 459

Query: 480 E---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK 530
           E         G DR+ + LPG Q +L+  +    K P+ LV+++  A+ IN+A  +  + 
Sbjct: 460 EEGDVADSDGGGDRLHIDLPGMQEQLLETIYGTGK-PIVLVLLNGSAIAINWAHEH--VP 516

Query: 531 SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTY 590
           +I+   YPGEEGG AIADV+FG YNP GRLPIT+  +     P+T        N  GRTY
Sbjct: 517 AIIEAWYPGEEGGTAIADVLFGDYNPAGRLPITFVRSLDDLPPFTDY------NMKGRTY 570

Query: 591 KFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
           ++F+   +YPFGYGLSYT FKY                      N  +   + P    L 
Sbjct: 571 RYFEKEPLYPFGYGLSYTSFKYS---------------------NLRLSAMRLPAGNNL- 608

Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSA 709
                        ++VEN GK+ G EVV +Y S         ++Q+ G + + +  GQ  
Sbjct: 609 ----------DINVDVENTGKLAGREVVQLYISDVEASVEVPMRQLCGIQCITLEPGQKQ 658

Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            V FT+   + + + D     +L  G   I VG
Sbjct: 659 TVSFTVEP-QHMSLFDYDGKRILEPGQFIIAVG 690


>gi|242813865|ref|XP_002486253.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
 gi|218714592|gb|EED14015.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
          Length = 893

 Score =  456 bits (1173), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 285/750 (38%), Positives = 414/750 (55%), Gaps = 55/750 (7%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L   P CD  L    RAK LV+ MT  EKVQ   + + G  RLGLP Y+WW+EALHGV+ 
Sbjct: 159 LCSNPICDTSLDPLTRAKGLVDAMTFEEKVQNTQNGSPGAARLGLPAYQWWNEALHGVAG 218

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
               T  P G         ATSFP  IL +A+F+++L K++G  VS E RA  N GNAGL
Sbjct: 219 SPGVTFQPSG-----NFSYATSFPQPILMSAAFDDALIKEVGTVVSIEGRAFNNYGNAGL 273

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
            FW+PNIN  RDPRWGR  ETPGEDPY + RY  N V GLQ+  G+     + + P ++ 
Sbjct: 274 DFWTPNINPFRDPRWGRGQETPGEDPYHIARYVYNLVDGLQN--GI-----APANP-RVV 325

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           A CKH+A YD+++WEGN R+ F++ ++ QD+ E ++ PF+ C  +  V ++MCSYN VNG
Sbjct: 326 ATCKHFAGYDIEDWEGNSRYGFNAIISTQDLSEYYLPPFKSCARDAQVDAIMCSYNAVNG 385

Query: 249 IPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           IPTCAD  LL+  +R  WN++    ++ SDCD++  I   H++ + +   A A  L AG 
Sbjct: 386 IPTCADSYLLDTILRDHWNWNQTGHWVTSDCDAVDNIYSDHRYTS-SLAAAAADALNAGT 444

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICN 364
           +LDCG   +N    A  Q     A ++++L +LY  L+RLG+FD    QY +LG +++  
Sbjct: 445 NLDCGTTMSNNLAAAAAQDLFKNATLNSALVYLYSSLVRLGWFDSEDSQYSSLGWSDVGT 504

Query: 365 PQHIELAAEAARQGIVLLKNDN-GALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
               +LA  AA +GIVLLKND+   LPL+  + +T+AL+GP+ANAT  + GNY GTP   
Sbjct: 505 TASQQLANRAAVEGIVLLKNDHKKVLPLSQ-HGQTIALIGPYANATTQLQGNYYGTPAYI 563

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
            + + G       + Y  G   I   + S   AA+ AAK AD  +   G+D S+EAE  D
Sbjct: 564 RTLVWGAEQMGYTVQYEAGTG-INSTDTSGFAAAVAAAKTADIVIYAGGIDNSIEAEAMD 622

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R  +   G Q +LI++++   K P+ ++    G +D +    N  + ++LW GYP + GG
Sbjct: 623 RNTIAWTGNQLQLIDQLSQVGK-PLVVLQFGGGQLDDSALLQNENVNALLWCGYPSQTGG 681

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
           +A+ D++ G+  P GRLP+T Y ANY   IP T M LRP  + PGRTY+++D  V+ PFG
Sbjct: 682 QAVFDILTGQSAPAGRLPVTQYPANYTNAIPMTDMSLRPNGSTPGRTYRWYDDAVI-PFG 740

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT- 661
           +GL YT F                      D ++      P   A L+       Y+ T 
Sbjct: 741 FGLHYTTF----------------------DASWADKKFGPYNTASLVAKASKSKYQDTA 778

Query: 662 ----FQIEVENMGKMDGSEVVMVYSKP--PGIAGTHIKQVIGYERV-FIAAGQSAKVGFT 714
               F + V+N GK+    V ++++     G     IK +I Y R   I  G++  V   
Sbjct: 779 PFDSFHVNVKNTGKVTSDFVALLFASTDNAGPKPYPIKTLISYARASSIKPGETRTVSID 838

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVGEG 744
           +      +   N  + +L  G++T+ +  G
Sbjct: 839 VTIGSIARTATN-GDLVLYPGSYTLQLDVG 867


>gi|296439595|sp|A1CCL9.2|BXLB_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
          Length = 771

 Score =  452 bits (1164), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 287/704 (40%), Positives = 396/704 (56%), Gaps = 51/704 (7%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD       RA+ LV+ M+  EKV      A GVPRLGLP Y WWSEALHGV+ 
Sbjct: 37  LSKLAVCDTSRDVTTRAQSLVDAMSFAEKVNNTQYEAPGVPRLGLPAYNWWSEALHGVA- 95

Query: 69  IGRRTNSPPGTHFDSEVPG--ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
                   PG HF    P   ATSF   IL  ASF++ L K++   V TE RA  N G A
Sbjct: 96  ------GAPGVHFADSGPFSYATSFAQPILLGASFDDELVKQVATVVGTEGRAFGNAGRA 149

Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK 186
           GL +W+PNIN  RDPRWGR  ETPGEDP  V RY  + V GLQ   G        +RP +
Sbjct: 150 GLDYWTPNINPFRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQGGIG-------PARP-Q 201

Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
           I+A CKH+AAYD+++W G  R  FD+RV+ QD+ E ++  F+ CV +  V +VMCSYN +
Sbjct: 202 IAATCKHFAAYDMEDWNGVSRHEFDARVSTQDLAEFYLPSFKSCVRDAQVDAVMCSYNAL 261

Query: 247 NGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NG+PTCADP LL   +R  W++     ++VSDC +I  I   H +   T  +A A  L A
Sbjct: 262 NGVPTCADPYLLQTLLREHWDWDQPGHWVVSDCGAIDDIYIGHNY-TKTGAEAAAVALNA 320

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNN 361
           G DLDCG  +      A +QG      +D +L  LY  L++LGYFD + +  Y ++G  +
Sbjct: 321 GTDLDCGTVFPKHLGEAAEQGLYTNQTLDRALVRLYSSLVKLGYFDPAEKQPYGSIGWKD 380

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           +  P   +LA +AA +GIVLLKND   LPL      TLAL+GP+ANATK M GNY+G P 
Sbjct: 381 VDTPAAEQLAHKAAVEGIVLLKNDQ-TLPLKAKG--TLALIGPYANATKQMQGNYQGPP- 436

Query: 422 RYTSPMD-GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
           +Y   ++     +   + Y+PG A I   + +   AA+ AAK+AD  +   G+D ++E+E
Sbjct: 437 KYIRTLEWAATQHGYQVQYSPGTA-INNSSTAGFAAALAAAKDADVVLYAGGIDNTIESE 495

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
             DR  +  PG Q  LI+++++  K P+ ++    G VD      NP + ++LW GYP +
Sbjct: 496 TLDRTTITWPGNQLSLISELSNLHK-PLIVIQFGGGQVDDTPLLTNPHVNALLWAGYPSQ 554

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
           EGG AI D++ GK  P GRLPIT Y A Y  ++P T M LR   + PGRTY+++D  VV 
Sbjct: 555 EGGAAIFDILTGKAAPAGRLPITQYPAAYTAQVPMTEMGLRAGGDNPGRTYRWYDKAVV- 613

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           PFG+GL YT F           ++  D+ +     N     N+ P  + +  D    D  
Sbjct: 614 PFGFGLHYTSF-----------EVSWDRGR-LGPYNTAALVNRAPGGSHV--DRALFD-- 657

Query: 660 FTFQIEVENMGKMDGSEVVMVYSKP--PGIAGTHIKQVIGYERV 701
            TF+++V+N G +    V +++ K    G     +K ++GY RV
Sbjct: 658 -TFRVQVQNTGTVTSDYVALLFVKTEDAGPEPYPLKTLVGYTRV 700


>gi|121712174|ref|XP_001273702.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
 gi|119401854|gb|EAW12276.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
          Length = 803

 Score =  452 bits (1162), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 287/704 (40%), Positives = 396/704 (56%), Gaps = 51/704 (7%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD       RA+ LV+ M+  EKV      A GVPRLGLP Y WWSEALHGV+ 
Sbjct: 69  LSKLAVCDTSRDVTTRAQSLVDAMSFAEKVNNTQYEAPGVPRLGLPAYNWWSEALHGVA- 127

Query: 69  IGRRTNSPPGTHFDSEVPG--ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
                   PG HF    P   ATSF   IL  ASF++ L K++   V TE RA  N G A
Sbjct: 128 ------GAPGVHFADSGPFSYATSFAQPILLGASFDDELVKQVATVVGTEGRAFGNAGRA 181

Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK 186
           GL +W+PNIN  RDPRWGR  ETPGEDP  V RY  + V GLQ   G        +RP +
Sbjct: 182 GLDYWTPNINPFRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQGGIG-------PARP-Q 233

Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
           I+A CKH+AAYD+++W G  R  FD+RV+ QD+ E ++  F+ CV +  V +VMCSYN +
Sbjct: 234 IAATCKHFAAYDMEDWNGVSRHEFDARVSTQDLAEFYLPSFKSCVRDAQVDAVMCSYNAL 293

Query: 247 NGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NG+PTCADP LL   +R  W++     ++VSDC +I  I   H +   T  +A A  L A
Sbjct: 294 NGVPTCADPYLLQTLLREHWDWDQPGHWVVSDCGAIDDIYIGHNY-TKTGAEAAAVALNA 352

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNN 361
           G DLDCG  +      A +QG      +D +L  LY  L++LGYFD + +  Y ++G  +
Sbjct: 353 GTDLDCGTVFPKHLGEAAEQGLYTNQTLDRALVRLYSSLVKLGYFDPAEKQPYGSIGWKD 412

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           +  P   +LA +AA +GIVLLKND   LPL      TLAL+GP+ANATK M GNY+G P 
Sbjct: 413 VDTPAAEQLAHKAAVEGIVLLKNDQ-TLPLKAKG--TLALIGPYANATKQMQGNYQGPP- 468

Query: 422 RYTSPMD-GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
           +Y   ++     +   + Y+PG A I   + +   AA+ AAK+AD  +   G+D ++E+E
Sbjct: 469 KYIRTLEWAATQHGYQVQYSPGTA-INNSSTAGFAAALAAAKDADVVLYAGGIDNTIESE 527

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
             DR  +  PG Q  LI+++++  K P+ ++    G VD      NP + ++LW GYP +
Sbjct: 528 TLDRTTITWPGNQLSLISELSNLHK-PLIVIQFGGGQVDDTPLLTNPHVNALLWAGYPSQ 586

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
           EGG AI D++ GK  P GRLPIT Y A Y  ++P T M LR   + PGRTY+++D  VV 
Sbjct: 587 EGGAAIFDILTGKAAPAGRLPITQYPAAYTAQVPMTEMGLRAGGDNPGRTYRWYDKAVV- 645

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           PFG+GL YT F           ++  D+ +     N     N+ P  + +  D    D  
Sbjct: 646 PFGFGLHYTSF-----------EVSWDRGR-LGPYNTAALVNRAPGGSHV--DRALFD-- 689

Query: 660 FTFQIEVENMGKMDGSEVVMVYSKP--PGIAGTHIKQVIGYERV 701
            TF+++V+N G +    V +++ K    G     +K ++GY RV
Sbjct: 690 -TFRVQVQNTGTVTSDYVALLFVKTEDAGPEPYPLKTLVGYTRV 732


>gi|358397360|gb|EHK46735.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
           206040]
          Length = 865

 Score =  450 bits (1158), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 260/607 (42%), Positives = 359/607 (59%), Gaps = 27/607 (4%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS-FIGRRT 73
           CD  L   ERA  +V+ MTL EKV  +G  A G  RLGLP Y+W +EALHGV+   G + 
Sbjct: 144 CDTTLSMAERAAAIVKPMTLDEKVANVGSSASGSARLGLPAYQWQNEALHGVAGSTGVQF 203

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
            SP G +F +    ATSFP  IL +A+F+++L + +   +STEARA  N G AGL FW+P
Sbjct: 204 QSPLGANFSA----ATSFPMPILLSAAFDDALVQNVATAISTEARAFANYGFAGLDFWTP 259

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN  RDPRWGR +ETPGED + +  Y +  + GLQ     ++ R        I A CKH
Sbjct: 260 NINPFRDPRWGRGMETPGEDAFRIQGYVLALISGLQGGINPDFFR--------IIATCKH 311

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           +AAYD++N    +  +     T+QDM + ++  FE CV +  V SVMC+YN V+GIP CA
Sbjct: 312 FAAYDIENGRTGNNLN----PTQQDMADYYLPMFETCVRDAKVGSVMCAYNAVDGIPACA 367

Query: 254 DPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
              LL   +R  + F     Y+VSDCD++  + + H + ++  E A A  L AG DLDCG
Sbjct: 368 SEYLLQDVLRDGFGFTEDFNYVVSDCDAVDNVFDPHHYASNLTE-AAALSLNAGTDLDCG 426

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIEL 370
             Y N    +V+    +EA ++ SL  LY  L+++GYFD   +YK+L   N+   Q+  L
Sbjct: 427 SSY-NVLNASVEAALTSEAALNQSLVRLYSALIKVGYFDQPSEYKSLSWANVNTTQNQAL 485

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A +AA  G+ LLKND G LPL+   +  +A++GP  NAT  M GNY GT     +P+D F
Sbjct: 486 AHDAATGGMTLLKND-GTLPLSR-TLSNVAIIGPWVNATTQMQGNYAGTAPFLVNPLDVF 543

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
                 + YA G A I  Q+ S   AA+ AA ++D  V + G+D++VE EG DR  ++ P
Sbjct: 544 QQKWGNVKYAQGTA-INSQDTSGFSAALSAASSSDVIVYLGGIDITVENEGFDRGSIVWP 602

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
           G Q +LI+++A+  K P+ +V    G +D +   +NP ++SILW GYPG++GG A+ DV+
Sbjct: 603 GNQLDLISQLANLGK-PLVIVQFGGGQIDDSSLLSNPNVRSILWAGYPGQDGGNAVFDVL 661

Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
            G   P GRLPIT Y A+Y+       M LRP N  PGRTY ++ G  V PFGYGL YT 
Sbjct: 662 TGANPPAGRLPITQYPASYINNNNIQDMNLRPSNGIPGRTYAWYTGTPVLPFGYGLHYTN 721

Query: 610 FKYKVAS 616
           F     S
Sbjct: 722 FSVSFQS 728


>gi|302683012|ref|XP_003031187.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
 gi|300104879|gb|EFI96284.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
          Length = 752

 Score =  450 bits (1158), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 293/756 (38%), Positives = 406/756 (53%), Gaps = 59/756 (7%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+  P CDA L + ERA+ LVE  T+PE +    + A+GVPRLGLP YEWW+EALHGV  
Sbjct: 30  LASNPVCDASLGHVERARALVEEFTVPEMINNTVNAAFGVPRLGLPPYEWWNEALHGVGL 89

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
                 SP    F+ E   ATSFP  I   ++F+++L   +G  +STEARA  N G AGL
Sbjct: 90  ------SPGVVFFEPEPAVATSFPMPINMGSAFDDALMLAMGDVISTEARAFSNAGRAGL 143

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
            +W+PNIN  +DPRWGR  ETPGEDP    RY  + V GLQ   G+      D   LK++
Sbjct: 144 DYWTPNINPFKDPRWGRGAETPGEDPLHAARYVRSLVEGLQG--GI------DPPSLKVA 195

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           A CKH+AAYDL+NW G  R+ FD+ VT QD+ E +  PF  CV +   +S MCSYN VNG
Sbjct: 196 AACKHWAAYDLENWGGVTRYAFDAVVTPQDLAEYYAPPFRSCVRDARAASAMCSYNAVNG 255

Query: 249 IPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD 306
           +P CA P LL   +R  W      ++ SDC ++  + + H +  D   +A    LKAG D
Sbjct: 256 VPACASPYLLKTVLRDAWGLAEDRWVTSDCGAVGNVYDPHGYTEDLV-NASTVSLKAGTD 314

Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNIC 363
           L+CG  YT +   A  +G I E D+  +L  LY  L+ LGYFD +P+   Y+ +   ++ 
Sbjct: 315 LNCGTNYTQYLPEAYDRGLIDEDDLKAALTRLYASLVWLGYFD-APEDQPYRQITWADVN 373

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATK-AMIGNYEGTPCR 422
            P+   LA  AA +  VLLKND G LPL T +  +LAL+GP ANA+   M+GNY G P  
Sbjct: 374 TPEAQALAYTAAIKSFVLLKND-GTLPL-TDSTLSLALIGPMANASALQMLGNYFGIPPF 431

Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
             +P+ GF      + Y  G  ++   +     AA+ AA+ AD  + V G+D ++E E K
Sbjct: 432 VIAPLQGFLDAGFNVTYVLGT-NVTGNDAGSFDAAVAAAEAADVVIYVGGIDNTLEMEEK 490

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
           DR ++  P  Q  L++ +    K P+ +V M  G +D    K +  + +ILW GYPG+ G
Sbjct: 491 DRTEISWPDNQLALLSALEGVGK-PLVVVQMGGGQLDDTPLKESDAVNAILWAGYPGQSG 549

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNF--PGRTYKFFDGPVVY 599
           G AIAD + GK  P GRL        YV ++  T M LRP N    PGRTYK++ G  VY
Sbjct: 550 GTAIADTVTGKVAPAGRL--------YVDEVAMTDMTLRPDNATGNPGRTYKWYTGTPVY 601

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           P+GYGL YT      AS         D  + C  I    G      A+  +D        
Sbjct: 602 PYGYGLHYTNISVAWAS---------DAPEACYSIQDLTGE-----ASGFVDLAPLD--- 644

Query: 660 FTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNA 717
            TF++ V N G +    V +++ S   G A   IK+++ Y R   +  G S +V   +  
Sbjct: 645 -TFRVTVTNEGDIASDFVALLFVSTQAGPAPAPIKEMVAYARASDVQPGNSTEVELEVT- 702

Query: 718 CKSLKIVDNAANSLLASGAHTILVG-EGVGGVSFPL 752
             +L   D + ++ L  G + +    +G   +SF L
Sbjct: 703 LGALARTDESGDASLYPGKYELTFDYDGALSLSFEL 738


>gi|358382857|gb|EHK20527.1| hypothetical protein TRIVIDRAFT_192759 [Trichoderma virens Gv29-8]
          Length = 860

 Score =  450 bits (1157), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 290/756 (38%), Positives = 408/756 (53%), Gaps = 64/756 (8%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS-FIGRRT 73
           CD       RA  +V+ MTL EKV  +G  A G  RLGLP Y+W +EALHGV+   G + 
Sbjct: 139 CDTTKSIAARAAAIVKPMTLNEKVANVGSSASGSGRLGLPAYQWQNEALHGVAGSTGVQF 198

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
            SP G +F +    ATSFP  IL +A+F+++L + +   +STEARA  N G AGL FW+P
Sbjct: 199 QSPLGANFSA----ATSFPMPILLSAAFDDALVQSVATAISTEARAFANYGFAGLDFWTP 254

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN  RDPRWGR +ETPGED + +  Y ++ + GLQ     ++ R   +        CKH
Sbjct: 255 NINPFRDPRWGRGMETPGEDAFRIQGYVLSLINGLQGGIDPDFFRTIST--------CKH 306

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           +AAYD++    N R   +   T+QDM + ++  FE CV +  V S+MC+YN VNG+P CA
Sbjct: 307 FAAYDIE----NGRTANNLSPTQQDMADYYLPMFETCVRDAKVGSIMCAYNSVNGVPACA 362

Query: 254 DPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
           D  LL   +R  + F     Y+VSDCD+++ + + H +  +  + A A  L AG DLDCG
Sbjct: 363 DSYLLQSVLRDGYGFTEDFNYVVSDCDAVENVYDPHHYAANLTQ-AAAMSLNAGTDLDCG 421

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIEL 370
             Y N    +VQ G   EA +D SL  LY  L+++G+FD   +Y +LG  N+   Q   L
Sbjct: 422 SSY-NVLNASVQAGMTTEATLDKSLIRLYSALIKVGWFDQPAKYSSLGWGNVNTTQTRAL 480

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A +AA  G+ LLKND G LPL+   ++ +A++GP  NAT  + GNY GT     +P+  F
Sbjct: 481 AHDAATGGMTLLKND-GTLPLSP-TLQNVAVIGPWVNATTQLQGNYAGTAPVLVNPLTVF 538

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
               + + YA G A I  Q+ S   AAI AA ++D  V + G+D+SVE EG DR  +  P
Sbjct: 539 QQKWRNVKYAQGTA-INSQDTSGFNAAISAASSSDVIVYLGGIDISVENEGFDRTAITWP 597

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
           G Q  LI+++A+  K P+ +V    G +D +   +N K+ SILW GYPG+EGG A+ DV+
Sbjct: 598 GNQLSLISQLANLGK-PLVIVQFGGGQIDDSSLLSNSKVNSILWAGYPGQEGGNALFDVL 656

Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
            G   P GRLPIT Y ANYV       M LRP  + PGRTY ++ G  V PFGYGL YT 
Sbjct: 657 TGANPPAGRLPITQYPANYVNNNNIQDMNLRPSGSIPGRTYAWYTGTPVLPFGYGLHYTN 716

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
           F     S+  S                  GT+     A ++++      + TF   V N+
Sbjct: 717 FSVSFQSTKTS------------------GTD----VATIVNNAGSNKDRATFATLVVNV 754

Query: 670 GKMDGSE--------VVMVYSKPPGIAGTHIKQVIGYERV-FIAAGQSAKVGFTMNACKS 720
               G          ++ + S   G A    KQ+  Y RV  +  G + ++  T+N   S
Sbjct: 755 KNTGGKANLASDYVGLLFLKSTNAGPAPHPNKQLAAYGRVKKVGVGATQQLTLTVN-LGS 813

Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
           L   D   +  +  GA+T+ +      V+ PL  N 
Sbjct: 814 LARADTNGDRWVYPGAYTLTL-----DVNGPLTFNF 844


>gi|242786966|ref|XP_002480909.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
 gi|218721056|gb|EED20475.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
          Length = 757

 Score =  450 bits (1157), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 247/604 (40%), Positives = 356/604 (58%), Gaps = 35/604 (5%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +R K L++ +TL EK+  + D + G  RLGLP YEWW+EA HGV        S PG  F 
Sbjct: 24  QRVKSLIDSLTLEEKILNLVDASAGSERLGLPSYEWWNEATHGV-------GSAPGVQF- 75

Query: 83  SEVPG----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVV 138
           +E P     ATSFP  ILT ASF+++L ++I   +  E RA  N G +G  FW+PNIN  
Sbjct: 76  TEKPVNFSYATSFPAPILTAASFDDALVREIASVIGREGRAFGNNGFSGFDFWAPNINPF 135

Query: 139 RDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYD 198
           RDPRWGR  ETPGED +VV  Y  N++ GLQ           D    ++ A CKHYAAYD
Sbjct: 136 RDPRWGRGQETPGEDSFVVQSYIRNFIPGLQ---------GDDPEDKQVIATCKHYAAYD 186

Query: 199 LDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLL 258
           L+      R+  D   T+QD+ + F+ PF+ CV +  V S+MC+YN V+GIPTCA   LL
Sbjct: 187 LE----TGRYGNDYNPTQQDLADYFLAPFKTCVRDTGVGSIMCAYNAVDGIPTCASEYLL 242

Query: 259 NQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
           +Q +R  WNF   + Y+VSDC ++  I + H F  DT+E A +  L AG+DL+CG  Y  
Sbjct: 243 DQVLRKHWNFTADYNYVVSDCGAVTDIWQYHNF-TDTEEAAASVSLNAGVDLECGSSYLK 301

Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAA 375
                       +A +D +L  LY  L  +G+FDG  +Y  LG  ++  P+   LA EAA
Sbjct: 302 LNESLAANQTTVQA-LDQALTRLYSALFTVGFFDGG-KYTALGFADVSTPEAQSLAYEAA 359

Query: 376 RQGIVLLKNDNGALPLNTGN-IKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYS 434
            +G+ LLKND   LP+ + +  K++AL+GP ANAT  M G+Y G P    SP++ F  + 
Sbjct: 360 VEGMTLLKNDKRLLPIRSSHKYKSVALIGPFANATTQMQGDYSGIPPFLISPLEAFKGHD 419

Query: 435 KVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQT 494
             +NYA G   I  Q  +   +A+ AA+ +D  + + G+D S+EAE  DR  L  PG Q 
Sbjct: 420 WEVNYAMGTG-INNQTTTGFASALAAAEKSDLVIYLGGIDNSIEAETLDRTSLTWPGNQL 478

Query: 495 ELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKY 554
           +L+ +++   K P+ +V    G +D +    N  +++++W GYP + GG A+ DV+ GK 
Sbjct: 479 DLVTQLSKLHK-PLIVVQFGGGQLDDSALLQNEGVQALVWAGYPSQSGGSALLDVLLGKR 537

Query: 555 NPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYK 613
           +  GRLP+T Y A+Y  ++    + +RP +++PGRTYK++ G  V PFGYGL YT+F+++
Sbjct: 538 SIAGRLPVTQYPASYADQVSIFDINIRPNDSYPGRTYKWYTGMPVVPFGYGLHYTKFEFE 597

Query: 614 VASS 617
            A +
Sbjct: 598 WAQT 601


>gi|421077748|ref|ZP_15538711.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           JBW45]
 gi|392524151|gb|EIW47314.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           JBW45]
          Length = 750

 Score =  449 bits (1155), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 268/765 (35%), Positives = 409/765 (53%), Gaps = 103/765 (13%)

Query: 8   KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
           ++  F Y D  L + +RAKDLV RMTL EKV QM  ++  +PRLG+P Y WWSEALHGV+
Sbjct: 26  RMEIFDYQDETLSFEQRAKDLVSRMTLEEKVTQMVYISPAIPRLGVPAYNWWSEALHGVA 85

Query: 68  FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN-- 125
             G                 AT FP  I   A+F+E L   + + +S E RA ++     
Sbjct: 86  RAGV----------------ATVFPQAIGLAATFDEKLIHDVAEVISIEGRAKFHEFQRK 129

Query: 126 ------AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD 179
                  GLTFWSPN+N+ RDPRWGR  ET GEDPY+ GR  +++++GLQ          
Sbjct: 130 GDHGIYKGLTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQG--------- 180

Query: 180 SDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
            D + L+ +AC KH+A +       ++R  FD+ V+ +D++ET++  F+ CV E +V +V
Sbjct: 181 QDKKYLRAAACAKHFAVHSGPE---SERHSFDAVVSPKDLRETYLPAFKECVKEANVEAV 237

Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
           M +YNRVNG P C    LL +T+R +W F G++VSDC +I+   E+H+  +   E +VA 
Sbjct: 238 MGAYNRVNGEPCCGSNMLLKETLRQEWGFTGHVVSDCWAIKDFHENHRVTSSAPE-SVAL 296

Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNL 357
            L  G DL+CG+ Y N  + A Q+G + E  I+T++  L +  M+LG FD +    Y N+
Sbjct: 297 ALNNGCDLNCGNMYLNLLI-AYQEGLVTEEAINTAVTRLMLTRMKLGLFDTAENVPYTNI 355

Query: 358 G-KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
           G   N C  +H E A E +++ +VLLKN+N  LPL+   I ++A++GP+AN+ +A+ GNY
Sbjct: 356 GFHQNDCQ-EHREFALEVSKKTLVLLKNENNLLPLDRNTISSIAVIGPNANSREALTGNY 414

Query: 417 EGTPCRYTSPMDGF---YAYSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADAT 467
            GT   Y + ++G         +++YA GC      A+ + +       A+  A+ AD  
Sbjct: 415 CGTASNYITVLEGIREAVGKDTIVSYAQGCHLYRDKAENLGEARDRFAEAVSTAERADIV 474

Query: 468 VIVAGLDLSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAV 518
           V+  GLD S+E E           D++ L LPG Q EL+  +    K P+ LV+++  A+
Sbjct: 475 VMCMGLDASIEGEEGDVSNEYASGDKLGLNLPGLQQELLEVIYQTGK-PIILVLLAGSAL 533

Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
            + +A    K+ +I+   YPG EGG+A+A  IFG+Y+P G+LPIT+Y        +T   
Sbjct: 534 AVTWAAE--KVPAIIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYRTTEELPEFTDYS 591

Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV 638
           ++       RTY++     +YPFGYGL YT F Y+         ++L++ Q       + 
Sbjct: 592 MK------NRTYRYMTKEALYPFGYGLGYTTFAYR--------QLQLNRTQ------ISA 631

Query: 639 GTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIG 697
           G N           V+C        + V+N G     E V +Y K         I ++ G
Sbjct: 632 GEN-----------VQCS-------VLVKNTGNFASDETVQLYIKDVKASVEVPILELQG 673

Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            ++V +  G   +V FT+   + L +++   N +L  GA  I VG
Sbjct: 674 IQKVHLLPGTEQEVFFTLTP-RQLALINEEGNCILEPGAFEIYVG 717


>gi|395334835|gb|EJF67211.1| beta-xylosidase [Dichomitus squalens LYAD-421 SS1]
          Length = 774

 Score =  449 bits (1155), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 284/738 (38%), Positives = 401/738 (54%), Gaps = 35/738 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS+   CD       RA  L++  T  E      + + GVPRLGLP Y WWSE LHGV+ 
Sbjct: 35  LSNNTVCDTSKDPITRATALIDLWTDEELTNNTVNASPGVPRLGLPAYNWWSEGLHGVAQ 94

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
               T +P G         ATSFP  IL  A+F++ L + +   VSTE RA  N+G AGL
Sbjct: 95  SPGVTFAPSG-----NFSYATSFPQPILMGAAFDDPLIQAVASVVSTEGRAFNNVGRAGL 149

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP-LKI 187
            +W+PNIN  +DPRWGR  ETPGEDP+ +  Y  N + GLQ           D  P  K+
Sbjct: 150 DYWTPNINPFKDPRWGRGQETPGEDPFHLQGYVYNLILGLQG--------GLDPTPYFKV 201

Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
            A CKH+AAYD+DNWEGN R+ F++ VT+QD+ E ++  F+ CV +  V+SVMCSYN VN
Sbjct: 202 VADCKHFAAYDMDNWEGNVRYGFNAVVTQQDLSEYYLPSFQTCVRDAKVASVMCSYNAVN 261

Query: 248 GIPTCADPKLLNQTIRGDWNFHG--YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           GIP+CA+  LL   +R  W F    ++ SDCD++Q I   H +  D    A A  L AG 
Sbjct: 262 GIPSCANSFLLQDILRDYWGFDDTRWVTSDCDAVQNIYTPHNY-TDNPAQAAADALLAGT 320

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNIC 363
           D+DCG + + +   A+ QG +   D+  +    Y  L+RLGYFD   S  Y+ LG +++ 
Sbjct: 321 DIDCGTFSSTYLPDALSQGLVNATDLKRAAIRQYASLVRLGYFDPPESQPYRQLGWSDVN 380

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
            P+  +LA  AA +G+VLLKND G LPL+  +++ LAL+GP ANAT  M GNY G     
Sbjct: 381 TPEAQQLAHTAAVEGMVLLKND-GTLPLSK-HVRKLALIGPWANATTLMQGNYAGIAPYL 438

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
            SP+ G       + Y  G       + S   AA+ AAK ADA +   GLD +VE E  D
Sbjct: 439 ISPLLGAQQAGFDVEYVFGTNVTTTNDTSGFAAAVAAAKRADAVIFAGGLDETVEREEVD 498

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R+++  PG Q +L+ ++A   K P+ +     G +D +  K+   + +I+W GYPG+ GG
Sbjct: 499 RLNVTWPGNQLDLVAELASVGK-PLIVAQFGGGQLDDSALKSKRSVNAIIWGGYPGQSGG 557

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
            A+ D++ GK  P GRLPIT Y A Y  ++P T M LRP    PGRTYK++ G  V+ FG
Sbjct: 558 TALFDILTGKAAPAGRLPITQYPAEYANQVPMTDMTLRPSATNPGRTYKWYTGTPVFEFG 617

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
           +GL YT F +  AS+  +     +       I+  + +     A +   D+   D   TF
Sbjct: 618 FGLHYTTFSFAWASNAHA-----NTPAASYSIDALMASGNKSAAFL---DLAPLD---TF 666

Query: 663 QIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
            + V N GKM    V +++ S   G A    KQ++ Y RV   A + + +        ++
Sbjct: 667 AVRVTNTGKMTSDYVALLFASGTFGPAPHPNKQLVAYTRVHGVAPKQSTIAELTVTLGAI 726

Query: 722 KIVDNAANSLLASGAHTI 739
              D +    +  G +T+
Sbjct: 727 ARADESGAKWVYPGTYTL 744


>gi|302683060|ref|XP_003031211.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
 gi|300104903|gb|EFI96308.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
          Length = 761

 Score =  447 bits (1150), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 290/750 (38%), Positives = 404/750 (53%), Gaps = 49/750 (6%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD  L + ERA+ LVE +T+ E +      A GVPRLGLP Y WW+EALHGV+       
Sbjct: 35  CDTSLGHVERARALVEELTVAEMINNTVHTAPGVPRLGLPPYNWWNEALHGVAASPGVVF 94

Query: 75  SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
           + PG  F S    ATSFP  I   ++F+++L   +G   STEARA  N G AGL +W+PN
Sbjct: 95  TSPGEEFSS----ATSFPMPINMGSAFDDALMLAVGNVTSTEARAFNNAGLAGLDYWTPN 150

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           IN  +DPRWGR  ETPGEDP    RY    V GLQ   G+      D   LK++A CKH+
Sbjct: 151 INPFKDPRWGRGAETPGEDPLHAARYVRTLVEGLQG--GI------DPPSLKVAADCKHW 202

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           AAYDL++W G  R+ FD+ VT QD+ E +  PF+ CV +   +SVMCSYN VNG+P CA 
Sbjct: 203 AAYDLEDWGGVARYAFDAVVTPQDLAEYYSPPFKSCVRDARAASVMCSYNAVNGVPACAS 262

Query: 255 PKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
           P LL   +R  W      ++ SDCD++  + + H +  D   +  A  LKAG DLDCG  
Sbjct: 263 PYLLKTVLRDAWGLAEDRWVTSDCDAVGNVYDPHGYTEDFV-NGSAVSLKAGSDLDCGTT 321

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIE 369
           Y+ +   A  +G I E D+  +L  LY  L+ LGYFD +P+   Y+ +   ++  P    
Sbjct: 322 YSQYLPEAYDRGLIDEDDLKAALTRLYASLVWLGYFD-APEDQPYRQISWADVNTPAAQA 380

Query: 370 LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI-GNYEGTPCRYTSPMD 428
           LA  AA +  VLLKND G LPL   ++ ++AL+GP ANA+   + GNY G P    +P+ 
Sbjct: 381 LAYTAAIESFVLLKND-GTLPLTDSSL-SIALIGPMANASAVQLQGNYNGIPPFAIAPLQ 438

Query: 429 GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
           GF      + Y  G  ++   +   I  A+ AA+ AD  + V G+D +VE E KDR ++ 
Sbjct: 439 GFLDAGFNVTYVLGT-NVTGNDADDIDGAVAAAEAADVVIYVGGIDSTVEEEAKDRTEIS 497

Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
            P  Q  L++ + +A K P+ +V M  G +D    K +  + +ILW GYPG+ GG AIAD
Sbjct: 498 WPDNQLALLSALEEAGK-PLVVVQMGGGQLDDTPLKESDAVNAILWAGYPGQSGGTAIAD 556

Query: 549 VIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNF--PGRTYKFFDGPVVYPFGYGL 605
            + GK  P GRL IT Y A+YV  +  T M LRP N+   PGRTYK++ G  VYP+GYGL
Sbjct: 557 TVMGKVAPAGRLSITQYPASYVDAVAMTDMTLRPDNSTGNPGRTYKWYTGTPVYPYGYGL 616

Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIE 665
            YT F    AS         D  + C  I     +         +D         TF++ 
Sbjct: 617 HYTNFSVAWAS---------DAPEACYSIQDLTSSADGFVDLAPLD---------TFRVT 658

Query: 666 VENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACKSLKI 723
           V N G +    V +++ S   G A   +K+++ Y R   +  G S  V   +    +L  
Sbjct: 659 VTNDGDVASDFVALLFVSTQAGPAPAPMKELVAYARASDVQPGDSTDVDLEVT-LGALAR 717

Query: 724 VDNAANSLLASGAHTILVG-EGVGGVSFPL 752
            D + ++ L  G + +    +G   +SF L
Sbjct: 718 SDESGDASLYPGDYELTFDYDGALSLSFEL 747


>gi|156062754|ref|XP_001597299.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980]
 gi|154696829|gb|EDN96567.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 758

 Score =  446 bits (1147), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 262/617 (42%), Positives = 357/617 (57%), Gaps = 35/617 (5%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L++   CD       RA  LV   TL EK+   G+ + GVPR+GLP Y+WW+EALHG+++
Sbjct: 28  LANNTVCDTTADPYTRATALVSLFTLAEKINNTGNTSPGVPRIGLPAYQWWNEALHGIAY 87

Query: 69  IGRRTNSPPGTHF---DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
                    GTHF    S    ATSFP  IL  A+F+++L   +   +STEARA  N   
Sbjct: 88  ---------GTHFAAAGSNYSYATSFPQPILMGAAFDDALIHDVASQISTEARAFSNANR 138

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GL FW+PNIN  +DPRWGR  ETPGEDP+ V  Y    V GLQ   G+      D  P 
Sbjct: 139 YGLNFWTPNINPYKDPRWGRGQETPGEDPFHVSSYVNALVTGLQG--GL------DDLPY 190

Query: 186 KIS-ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
           K   A CKHYA YDL+N  G  R+ FD+ +  QD+++ ++  F+ C  + +V S+MCSYN
Sbjct: 191 KKGVATCKHYAGYDLENGGGIQRYAFDAIINSQDLRDYYLPSFQQCARDSNVQSIMCSYN 250

Query: 245 RVNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
            VNG+PTCAD  LL   +R  W +     ++ SDCD++Q I +SH + + T E A A  L
Sbjct: 251 AVNGVPTCADDWLLQSLLREHWGWVEEDQWVTSDCDAVQNIWDSHNYTS-TPEQAAADAL 309

Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGK 359
            AG DLDCG ++  +   A  Q     + +D SL   Y  L+RLGYFD +    Y+ LG 
Sbjct: 310 NAGTDLDCGGFWPTYLGSAYNQSLYNISTLDRSLTRRYASLVRLGYFDPASIQPYRQLGW 369

Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
           +++  P   +LA +AA  GIVLLKND G LPL + NI  +AL+GP ANAT  M GNY G 
Sbjct: 370 SDVSTPSAEQLALQAAEDGIVLLKND-GILPLPS-NITNVALIGPWANATTQMQGNYYGQ 427

Query: 420 PCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
                SP+         + Y  G ADI   N +   AAI AAK AD  + + G+D S+EA
Sbjct: 428 APYLHSPLIAAQNAGFHVTYVQG-ADIDSTNTTEFTAAIAAAKKADVIIYIGGIDNSIEA 486

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-VDINFAKNNPKIKSILWVGYP 538
           E KDR  +  P  Q  L+N++A+ +   + L+I   G  +D +    N  +  I+W GYP
Sbjct: 487 EAKDRKTIAWPSSQISLVNQLANLS---IPLIISQMGTMIDSSSLLTNRGVNGIIWAGYP 543

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPV 597
           G++GG AI +++ GK  P GRLPIT Y ++YV ++   +M L P  N PGRTYK+F+G  
Sbjct: 544 GQDGGTAIFNILTGKTAPAGRLPITQYPSDYVNEVSMNNMNLHPGANNPGRTYKWFNGTS 603

Query: 598 VYPFGYGLSYTQFKYKV 614
           ++ FG+GL YT F  K+
Sbjct: 604 IFDFGFGLHYTTFNAKI 620


>gi|189203341|ref|XP_001938006.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187985105|gb|EDU50593.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 761

 Score =  446 bits (1147), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 274/731 (37%), Positives = 382/731 (52%), Gaps = 52/731 (7%)

Query: 24  RAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD- 82
           RA+ LV   TL EK+      A GVPRLG+P Y+WWSE LHG++         P T+F  
Sbjct: 10  RAQSLVALYTLEEKINATSSGAPGVPRLGVPPYQWWSEGLHGIA--------GPYTNFSD 61

Query: 83  -SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
             E   +TSFP  IL  A+F++ L   + + +STEARA  N    GL FW+PNIN  RDP
Sbjct: 62  SGEWSYSTSFPQPILMGAAFDDDLITDVAKVISTEARAFNNANRTGLDFWTPNINPFRDP 121

Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
           RWGR  ETPGED Y +  Y    + GLQ      Y R        + A CKH+A YD+++
Sbjct: 122 RWGRGQETPGEDAYHLSSYVQALIHGLQGESTDPYKR--------VVATCKHFAGYDVED 173

Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
           W GN R+  D ++T+Q++ E ++ PF+ CV + +V + MCSYN VNG P CADP LL   
Sbjct: 174 WNGNLRYQNDVQITQQELVEYYLAPFQACV-QANVGAFMCSYNAVNGAPPCADPYLLQTI 232

Query: 262 IRGDW---NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTM 318
           +R  W   N   ++  DCD++Q +   H++ + T+  A A  L AG D+ CG Y      
Sbjct: 233 LREHWGWTNEEQWVTGDCDAVQNVYLPHQW-SPTRAGAAADSLVAGTDVTCGTYMQEHLP 291

Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELAAEAAR 376
            A QQ  + E+ +D +L   Y  L+RLGYFD S    Y+ LG + +       LA  AA 
Sbjct: 292 AAFQQKLLNESSLDQALIRQYSSLVRLGYFDASENQPYRQLGFDAVATNASQALARRAAA 351

Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV 436
           +GIVLLKND G LPL+  +  T+ L G  ANAT  ++GNY G      SP+         
Sbjct: 352 EGIVLLKND-GTLPLSLDSSVTVGLFGDWANATSQLLGNYAGVATYLHSPLYALEQTGVK 410

Query: 437 INYA----PGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGF 492
           INYA     G  D      S +     A   +D  + V G+D SVE EG+DR  L   G 
Sbjct: 411 INYAGGNPGGQGDPTTNRWSNL---YGAYSTSDVLIYVGGIDNSVEEEGRDRGYLTWTGA 467

Query: 493 QTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFG 552
           Q ++I ++AD  K PV +V+   G +D +   NNP I +I+W GYPG++GG AI D+I G
Sbjct: 468 QLDVIGQLADTGK-PVIVVVTGGGQIDSSPLVNNPNISAIMWAGYPGQDGGSAIIDIIGG 526

Query: 553 KYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFK 611
           K  P GRLP T Y ANY   +   +M LRP  N PGRTYK+++G   + FGYG+ YT F 
Sbjct: 527 KTAPAGRLPQTQYPANYTAAVSMMNMNLRPGENSPGRTYKWYNGSATFEFGYGMHYTNF- 585

Query: 612 YKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGK 671
                   S +I     Q     +Y + +    C +      +C     +  ++V N G 
Sbjct: 586 --------SAEITTQMQQ-----SYAISSLASGCNSTGGFLERCP--FASVNVQVHNTGN 630

Query: 672 MDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS 730
           +    + + Y +   G A    K ++ Y+R+   AG +           SL  VD   N 
Sbjct: 631 VTSDYITLGYMAGTFGPAPHPRKTLVSYKRLHSIAGGATSTATLNLTLASLARVDEHGNK 690

Query: 731 LLASGAHTILV 741
           +L  G +++ +
Sbjct: 691 VLYPGDYSLQI 701


>gi|343172466|gb|AEL98937.1| beta-xylosidase, partial [Silene latifolia]
 gi|343172468|gb|AEL98938.1| beta-xylosidase, partial [Silene latifolia]
          Length = 374

 Score =  446 bits (1146), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 215/390 (55%), Positives = 270/390 (69%), Gaps = 19/390 (4%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLGL  YEWWSEALHGVS +G      PGT F    P ATSFP VI T ASFN SLW+ I
Sbjct: 1   RLGLQGYEWWSEALHGVSNVG------PGTKFQGAFPAATSFPQVITTAASFNASLWQAI 54

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           GQ VS EARAMYN G AGLT+WSPN+N+ RDPRWGR  ETPGEDP +  +YA +YV GLQ
Sbjct: 55  GQAVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLSAQYAASYVTGLQ 114

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
              G           LK++ACCKHY AYDLDNW G DRFHF+++V++QD+++T+ +PF+ 
Sbjct: 115 GNYGNR---------LKVAACCKHYTAYDLDNWNGMDRFHFNAKVSKQDLEDTYNVPFKA 165

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
           CV EG V+SVMCSYN+VNG PTCADP +L  TIRG W+ +GYIVSDCDS+  + +   + 
Sbjct: 166 CVLEGKVASVMCSYNQVNGKPTCADPDILRNTIRGQWHLNGYIVSDCDSVGVLYDDQHYT 225

Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
             T E+A A  + AGLDLDCG +    T GA++QG + EA ++ +L     V MRLG FD
Sbjct: 226 R-TPEEAAADTINAGLDLDCGPFLAVHTEGAIRQGLVTEAAVNQALANTITVQMRLGMFD 284

Query: 350 GSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHA 406
           G P    + NLG  ++C P H +LA +AAR+GIVLLKN  G+LPL+T   + +A++GP+A
Sbjct: 285 GEPSAQPFGNLGPRDVCTPAHQDLALQAAREGIVLLKNQVGSLPLSTVRHRNIAVIGPNA 344

Query: 407 NATKAMIGNYEGTPCRYTSPMDGFYAYSKV 436
            AT  MIGNY G  C YTSP+ G   Y++ 
Sbjct: 345 QATTTMIGNYAGIACGYTSPLQGISRYART 374


>gi|330934749|ref|XP_003304687.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
 gi|311318569|gb|EFQ87188.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
          Length = 798

 Score =  444 bits (1141), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 278/748 (37%), Positives = 398/748 (53%), Gaps = 55/748 (7%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L +   CD       RAK LV   TL EK+      A GVPRLG+P Y+WW+E LHG++ 
Sbjct: 31  LKNVTICDPSASPLARAKSLVALYTLEEKINATSSGAPGVPRLGVPPYQWWNEGLHGIA- 89

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
            G  TN    +H   E   +TSFP  IL  A+F++ L  ++ + +STEARA  N    GL
Sbjct: 90  -GPYTNF---SHSGVEWSYSTSFPQPILMGAAFDDDLITEVAKVISTEARAFNNANRTGL 145

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
            FW+PNIN  RDPRWGR  ETPGED Y +  Y    + GLQ      Y R        + 
Sbjct: 146 DFWTPNINPFRDPRWGRGQETPGEDAYHLSSYVQALIHGLQGEATDPYKR--------VV 197

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           A CKH+A YD+++W GN R+  D ++T+QD+ E ++ PF+ CV + +V + MCSYN VNG
Sbjct: 198 ATCKHFAGYDVEDWNGNLRYQNDVQITQQDLVEYYLAPFQACV-QANVGAFMCSYNAVNG 256

Query: 249 IPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
            P CADP LL   +R  W ++    ++  DCD++Q +   H++ + T+  A A  L AG 
Sbjct: 257 APPCADPYLLQTILREHWGWNKEEQWVTGDCDAVQNVYFPHQW-SSTRAGAAADSLVAGT 315

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNI 362
           D+ CG Y       A +Q  + E+ +D +L   Y  L+RLGYFD +P+   Y+ LG + +
Sbjct: 316 DITCGTYMQEHLPAAFRQKLLNESSLDLALIRQYSSLVRLGYFD-APENQPYRQLGFDAV 374

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
                  LA  AA +GIVLLKND G LPL+  +  T+ L G  ANAT  ++GNY G    
Sbjct: 375 ATNASQALARRAAAEGIVLLKND-GTLPLSLDSSMTVGLFGDWANATTQLLGNYAGVATY 433

Query: 423 YTSPMDGFYAYSKVINYA----PGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
             SP+         INYA     G  D      S +     A   +D  + V G+D  VE
Sbjct: 434 LHSPLYALKQTGVKINYAGGKPGGQGDPTTNRWSNL---YGAYSTSDVLIYVGGIDNGVE 490

Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
            EG DR  L   G Q ++I ++A+  K PV +V+   G +D +   NNP I +I+W GYP
Sbjct: 491 EEGHDRGYLTWTGPQLDVIGQLAETGK-PVIVVVTGGGQIDSSPLVNNPNISAIMWAGYP 549

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPV 597
           G++GG AI D+I GK  P GRLP T Y A+Y   +   +M LRP  N PGRTYK+++G  
Sbjct: 550 GQDGGSAIIDIISGKTAPAGRLPQTQYPASYAAAVSMMNMNLRPGENNPGRTYKWYNGSA 609

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           V+ FGYG+ YT F   +++          + QQ    +Y + +    C +      +C  
Sbjct: 610 VFEFGYGMHYTNFSAAIST----------QMQQ----SYAISSLASGCNSTGGFLERCP- 654

Query: 658 YKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAG---QSAKVGF 713
              +  ++V N GK+    V + Y +   G A    K ++ Y+R+   AG    +AK+  
Sbjct: 655 -FASVDVQVHNTGKVTSDYVTLGYMAGTFGPAPHPRKTLVSYKRLHNIAGGATSTAKLNL 713

Query: 714 TMNACKSLKIVDNAANSLLASGAHTILV 741
           T+    S+  VD   N +L  G +++ +
Sbjct: 714 TL---ASVARVDEYGNKVLYPGHYSLQI 738


>gi|392962219|ref|ZP_10327666.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           DSM 17108]
 gi|392452977|gb|EIW29882.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           DSM 17108]
          Length = 724

 Score =  443 bits (1140), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 263/761 (34%), Positives = 405/761 (53%), Gaps = 103/761 (13%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           F Y D  L + +RAKDLV RMT+ EKV QM   +  + RLG+P Y WWSEALHGV+  G 
Sbjct: 4   FDYQDETLSFEQRAKDLVSRMTIEEKVTQMVYSSPAISRLGIPAYNWWSEALHGVARAGV 63

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN------ 125
                           AT FP  I   A+F+E L   + + +S EARA ++         
Sbjct: 64  ----------------ATVFPQAIGLAATFDEKLIYDVAEIISIEARAKFHEFQRKGDHG 107

Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
              GLTFWSPN+N+ RDPRWGR  ET GEDPY+ GR  +++++GLQ           D +
Sbjct: 108 IYKGLTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQG---------QDKK 158

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            L+ +AC KH+A +       ++R  FD+ V+ +D++ET++  F+ CV E +V +VM +Y
Sbjct: 159 YLRAAACAKHFAVHSGPE---SERHRFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAY 215

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NRVNG P C    LL +T+R +W F G++VSDC +I+   E+H+  +   E +VA  L  
Sbjct: 216 NRVNGEPCCGSNILLKETLRQEWGFTGHVVSDCWAIKDFHENHRVTSSAPE-SVALALNN 274

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLG-KN 360
           G DL+CG+ Y N  + A Q+G + E  I+T++  L +  M+LG FD +    Y N+G   
Sbjct: 275 GCDLNCGNMYLNLLI-AYQEGLVTEEAINTAVTRLMLTRMKLGLFDAAENVPYTNIGFHQ 333

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
           N C  +H E A E +++ +VLLKN+N  LPL+   I ++A++GP+AN+ +A+ GNY GT 
Sbjct: 334 NDCQ-EHREFALEVSKKTLVLLKNENHLLPLDRNTISSIAVIGPNANSREALTGNYFGTA 392

Query: 421 CRYTSPMDGF---YAYSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVA 471
             Y + ++G         +++YA GC      A+ + +       A+  A+ AD  V+  
Sbjct: 393 SNYITVLEGIREAVGKDTMVSYAQGCHLYRDKAENLGEERDRFAEAVSTAERADLVVMCM 452

Query: 472 GLDLSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINF 522
           GLD S+E E           D++ L LPG Q EL+  +    K P+ LV+++  A+ + +
Sbjct: 453 GLDASIEGEEGDVSNEYASGDKLGLNLPGLQQELLEVIYKTGK-PIILVLLAGSALAVTW 511

Query: 523 AKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPV 582
           A    K+ +I+   YPG EGG+A+A  IFG+Y+P G+LPIT+Y        +T   ++  
Sbjct: 512 AAE--KVPAIIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYRTTEELPEFTDYSMK-- 567

Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
                RTY++     +YPFGYGL YT F Y+         ++L++ + C           
Sbjct: 568 ----NRTYRYMTKEALYPFGYGLGYTTFAYR--------QLQLNRTKICAG--------- 606

Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERV 701
                   ++V+C        I V+N G     E V +Y K         I  + G +++
Sbjct: 607 --------ENVQCS-------ILVKNTGNFASDETVQLYIKDVKASVEVPIWALQGIQKI 651

Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            +  G   ++ FT+ + + L +++   N +L  G   I VG
Sbjct: 652 HLLPGAEQEISFTLTS-RQLALINEKGNCILEPGIFEIYVG 691


>gi|392560759|gb|EIW53941.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
           SS1]
          Length = 783

 Score =  442 bits (1138), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 282/737 (38%), Positives = 402/737 (54%), Gaps = 36/737 (4%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD       RA  L+   T  E      + + GVPRLGLP Y WWSE LHGV+     T 
Sbjct: 41  CDITKDPITRATALIGLWTDEELTSNTVNASPGVPRLGLPAYNWWSEGLHGVAQSPGVTF 100

Query: 75  SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
           +P G         ATSFP  IL  A+F+++L + I   VSTE RA  N G AGL +W+PN
Sbjct: 101 APSG-----NFSHATSFPQPILMGAAFDDTLIQAIATIVSTEGRAFNNAGRAGLDYWTPN 155

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP-LKISACCKH 193
           IN  +DPRWGR  ETPGEDP+ + +Y  N + GLQ           D +P  K+ A CKH
Sbjct: 156 INPFKDPRWGRGQETPGEDPFHLSQYVYNLILGLQG--------GLDPKPYFKVVADCKH 207

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           +AAYDL+NWEG  R  FD+ V++QD+ E ++ PF+ CV +  V+SVMCSYN VNGIP+CA
Sbjct: 208 FAAYDLENWEGIVRNGFDAIVSQQDLSEFYLPPFQTCVRDAKVASVMCSYNAVNGIPSCA 267

Query: 254 DPKLLNQTIRGDWNFHG--YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
           +  LL   +R  W F    ++ SDCD+++ I+  HK+  D    A A  L AG D+DCG 
Sbjct: 268 NSFLLQDVLRDHWGFTDDRWVTSDCDAVENILTPHKYTTD-PAQAAADALLAGTDIDCGT 326

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICNPQHIE 369
           + + +   A+Q+G +   D+  +    Y  L+RLGYFD   +  Y+ LG +++  PQ  +
Sbjct: 327 FSSTYLPEALQRGLVNSTDLRRAAIRQYASLVRLGYFDDPAAQPYRQLGWSDVNTPQAQQ 386

Query: 370 LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG 429
           LA  AA +GIVLLKND G LP +  +++ LAL+GP ANAT  + G+Y G      SP+ G
Sbjct: 387 LAHTAAVEGIVLLKND-GVLPFSK-HVRKLALIGPWANATSLLQGSYIGVAPYLVSPLQG 444

Query: 430 FYAYSKVINYAPGCADIVCQNN-SMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
                  + Y  G  ++  QN+ S   AA+ A + ADA V   GLD +VE EG DR+++ 
Sbjct: 445 AQEAGFEVEYVLGT-NVTTQNDMSGFAAAVAAVRRADAVVFAGGLDETVECEGTDRLNVT 503

Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
            PG Q +L+ ++    K P+ +     G +D    K++  + +I+W GYPG+ GG A+ D
Sbjct: 504 WPGNQLDLVAELERVGK-PLIVAQFGGGQLDDTALKHSKAVNAIIWGGYPGQSGGTALFD 562

Query: 549 VIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSY 607
           ++ GK  P GRLPIT Y A Y K +P T M LRP    PGRTYK++ G  V+ FG+GL Y
Sbjct: 563 ILTGKAAPAGRLPITQYPAAYTKQVPMTDMSLRPSATNPGRTYKWYSGTPVFEFGFGLHY 622

Query: 608 TQFKYKVASSPKSVDI----KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           T F +  A+   +  +          +   I+  V   +   A +   D+   D   TF 
Sbjct: 623 TTFVFSWAAPSAAAAVDSTASFGSLAKSYSISQLVAHGQESTAFL---DLAPLD---TFA 676

Query: 664 IEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
           + V N G++    V +++ S   G A    KQ++ Y RV   A + + V        ++ 
Sbjct: 677 VRVTNTGRVASDYVALLFVSGAFGPAPHPKKQLVAYTRVHGLAPRGSTVAQLPVTLGAIA 736

Query: 723 IVDNAANSLLASGAHTI 739
             D      +  G +T+
Sbjct: 737 RADKNGEKWVHPGTYTL 753


>gi|392596548|gb|EIW85871.1| hypothetical protein CONPUDRAFT_80240 [Coniophora puteana
           RWD-64-598 SS2]
          Length = 770

 Score =  441 bits (1134), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 258/616 (41%), Positives = 348/616 (56%), Gaps = 27/616 (4%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD  L   +RA  L++  T+ E +    + A GVPRLGLP YEWWSE LHGV+     T 
Sbjct: 37  CDTSLNATQRAAALIDLFTVDELIVNTVNWAPGVPRLGLPAYEWWSEGLHGVANSAGVTW 96

Query: 75  SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
           S  G         ATSFP  IL +A+F+++L K +G  +  E RA  N G+AGL FW+PN
Sbjct: 97  SITG-----PFSYATSFPQPILMSAAFDDALIKAVGGVIGMEGRAFNNYGHAGLDFWTPN 151

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP-LKISACCKH 193
           IN  +DPRWGR  ETPGEDPY + +Y  N ++GLQ           D  P  ++ A CKH
Sbjct: 152 INPFKDPRWGRGQETPGEDPYHIAQYVYNLIQGLQG--------GLDPEPYFQVVATCKH 203

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           +A YDL++W+ N R+ +++ ++ QD+ E ++  F+ C  +    + MCSYN +NGIPTCA
Sbjct: 204 FAGYDLEDWDFNYRYGYNAIISTQDLSEYYLPSFQSCYRDAFAGASMCSYNAINGIPTCA 263

Query: 254 DPKLLNQTIRGDWNFHG--YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
           D  LL   +RG W F    ++  DCDS++ I + H +     + A A  LKAG D+DCG 
Sbjct: 264 DTYLLQDILRGFWGFDQTRWVTGDCDSVEDIYDFHHY-TALPQQAAADALKAGSDIDCGI 322

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIE 369
           +YT +   A  +  I E D+  +L   Y  L+RLGYFD + +  Y+    +N+      E
Sbjct: 323 FYTTWLPLAYTESLITEQDLRAALTRQYASLVRLGYFDPASEQPYRQYNWSNVDTSYAQE 382

Query: 370 LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG 429
           LA  AA +GI LLKND G LP ++  IK +AL+GP   AT  M GNY G      SP  G
Sbjct: 383 LAYTAAVEGITLLKND-GTLPFSSA-IKNIALIGPWTFATTQMQGNYYGNAPYLISPYQG 440

Query: 430 FYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLL 489
                  I+Y     ++         AA  AA+ ADA V V G+D +VEAE  DR D+  
Sbjct: 441 AQLAGYNISYVLET-NVTSNTTDGYAAAFTAAQGADAIVFVGGIDNTVEAEAMDRNDITW 499

Query: 490 PGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADV 549
           P FQ  LI ++    K P+ +V    G VD      NP + ++LW GYPG+ GG+A+ D+
Sbjct: 500 PAFQLWLIGELGKLGK-PLVVVQFGGGQVDDTEINANPDVNALLWGGYPGQSGGQALFDI 558

Query: 550 IFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN---FPGRTYKFFDGPVVYPFGYGL 605
           I GK  P GRL  T Y A+YV +IP T+M LRP  N    PGRTYK++ G  VY FGYGL
Sbjct: 559 ISGKVAPAGRLVSTQYPADYVNEIPMTNMNLRPDANGTTSPGRTYKWYTGTPVYEFGYGL 618

Query: 606 SYTQFKYKVASSPKSV 621
            YT F Y    +P + 
Sbjct: 619 HYTNFTYAWTKAPAAT 634


>gi|421060771|ref|ZP_15523202.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           B3]
 gi|421065248|ref|ZP_15527033.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A12]
 gi|421073214|ref|ZP_15534285.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A11]
 gi|392444242|gb|EIW21677.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A11]
 gi|392454445|gb|EIW31278.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           B3]
 gi|392459366|gb|EIW35779.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A12]
          Length = 724

 Score =  441 bits (1133), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 268/761 (35%), Positives = 404/761 (53%), Gaps = 103/761 (13%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           F Y D  L + +RAKDLV RMTL EKV QM  ++  +PRLG+P Y WWSEALHGV+  G 
Sbjct: 4   FAYQDETLSFEQRAKDLVSRMTLEEKVTQMVYISPAIPRLGVPAYNWWSEALHGVARAGV 63

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN------ 125
                           AT FP  I   A+F+E L   + + +S E RA ++         
Sbjct: 64  ----------------ATVFPQAIGLAATFDEKLIFNVAEVISIEGRAKFHEFQRKGDHG 107

Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
              GLTFWSPN+N+ RDPRWGR  ET GEDPY+ GR  +++++GLQ           D +
Sbjct: 108 IYKGLTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQG---------QDKK 158

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            L+ +AC KH+A +       ++R  FD+ V+ +D++ET++  F+ CV E +V +VM +Y
Sbjct: 159 YLRAAACAKHFAVHSGPE---SERHSFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAY 215

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NRVNG P C    LL +T+R +W F G++VSDC +I+   E+H+  +   E +VA  L  
Sbjct: 216 NRVNGEPCCGSNMLLKETLRREWGFTGHVVSDCWAIKDFHENHRVTSSAPE-SVAMALNN 274

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLG-KN 360
           G DL+CG+ Y N  + A Q+G + E  I+T++  L +  M+LG FD +    Y  +G   
Sbjct: 275 GCDLNCGNMYLNLLI-AYQEGLVTEEAINTAVTRLMLTRMKLGLFDTAENVPYTKIGFHQ 333

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
           N C  +H E A E +++ +VLLKN+N  LPL+   I ++A++GP+AN+ +A+ GNY GT 
Sbjct: 334 NDCQ-EHREFALEVSKKTLVLLKNENNLLPLDRNTISSIAVIGPNANSREALTGNYCGTA 392

Query: 421 CRYTSPMDGF---YAYSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVA 471
             Y + ++G         +++YA GC      A+ + +       A+  A+ AD  V+  
Sbjct: 393 SNYITVLEGIREAVGKDTMVSYAQGCHLYRDKAENLGEARDRFAEAVSTAERADIVVMCM 452

Query: 472 GLDLSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINF 522
           GLD S+E E           D++ L LPG Q EL+  +    K P+ LV+++  A+ + +
Sbjct: 453 GLDASIEGEEGDVSNEYASGDKLGLNLPGLQQELLEVIYQTGK-PIILVLLAGSALAVTW 511

Query: 523 AKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPV 582
           A    KI +I+   YPG EGG+A+A  IFG+Y+P G+LPIT+Y        +T   ++  
Sbjct: 512 AAE--KIPAIIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYRTTEELPEFTDYSMK-- 567

Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
                RTY++     +YPFGYGL YT F Y+         ++L++ Q       +VG N 
Sbjct: 568 ----NRTYRYMTKEALYPFGYGLGYTTFAYR--------QLQLNRTQ------ISVGENV 609

Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERV 701
               +VL                V+N G     E V +Y K         I  + G ++V
Sbjct: 610 Q--GSVL----------------VKNTGNFASDETVQLYIKDVKASVEVPIWALQGIQKV 651

Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            +  G   +V FT+   + L +++   N +L  G   I VG
Sbjct: 652 HLLPGTEQEVFFTLTP-RQLALINEEGNCILEPGVFEIYVG 691


>gi|224068498|ref|XP_002302758.1| predicted protein [Populus trichocarpa]
 gi|222844484|gb|EEE82031.1| predicted protein [Populus trichocarpa]
          Length = 462

 Score =  440 bits (1132), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 217/455 (47%), Positives = 302/455 (66%), Gaps = 13/455 (2%)

Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLG 358
           +A LDLDCG +    T  AV++G + EA+I+ +L     V MRLG FDG P    Y NLG
Sbjct: 5   QASLDLDCGPFLGQHTEDAVRKGLLTEAEINNALLNTLTVQMRLGMFDGEPSSKPYGNLG 64

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
             ++C P H ELA EAARQGIVLLKN    LPL+T + +++A++GP++N T  MIGNY G
Sbjct: 65  PTDVCTPAHQELALEAARQGIVLLKNHGPPLPLSTRHHQSVAIIGPNSNVTVTMIGNYAG 124

Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
             C YT+P+ G   Y+K I Y  GCAD+ C ++    AA+DAA+ ADATV+V GLD S+E
Sbjct: 125 VACGYTTPLQGIGRYAKTI-YQQGCADVACVSDQQFVAAMDAARQADATVLVMGLDQSIE 183

Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
           AE +DR +LLLPG Q ELI+KVA A+KGP  LV+MS G +D++FA+N+PKI  I+W GYP
Sbjct: 184 AESRDRTELLLPGRQQELISKVAAASKGPTILVLMSGGPIDVSFAENDPKIGGIVWAGYP 243

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDG 595
           G+ GG AI+DV+FG  NPGG+LP+TWY  +YV  +P T+M +RP   N +PGRTY+F+ G
Sbjct: 244 GQAGGAAISDVLFGTTNPGGKLPMTWYPQDYVTNLPMTNMAMRPSKSNGYPGRTYRFYKG 303

Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
            VVYPFG+G+SYT F + +AS+P  V + LD  +Q    N T+        A+ +   +C
Sbjct: 304 KVVYPFGHGISYTNFVHTIASAPTMVSVPLDGHRQASR-NATISGK-----AIRVTHARC 357

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
               F  Q++V+N G MDG+  ++VYSKPP      +KQ++ +E+V +AAG   +VG  +
Sbjct: 358 NRLSFGVQVDVKNTGSMDGTHTLLVYSKPPAGHWAPLKQLVAFEKVHVAAGTQQRVGINV 417

Query: 716 NACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
           + CK L +VD +    +  GAH++ +G+    VS 
Sbjct: 418 HVCKFLSVVDRSGIRRIPMGAHSLHIGDVKHSVSL 452


>gi|212531051|ref|XP_002145682.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
 gi|210071046|gb|EEA25135.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
          Length = 799

 Score =  437 bits (1125), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 273/722 (37%), Positives = 390/722 (54%), Gaps = 62/722 (8%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L D   CD    Y +RA+ L+   TL E +    +   GVPRLGLP YE WSE LHG+  
Sbjct: 58  LKDNIVCDTSANYVDRAEGLIALFTLEELINNTQNSGPGVPRLGLPPYEVWSEGLHGLD- 116

Query: 69  IGRRTNSPPGTHF---DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
                      HF     E   ATSFP  IL+ A+ N +L  +I   ++T+ARA  N+G 
Sbjct: 117 ---------RAHFVKSGDEWTWATSFPMPILSMAALNRTLINQIASIIATQARAFNNVGR 167

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDP-YVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
            GL  ++PNIN  R P WGR  ETPGED  ++   YA  Y+ GLQ   G+      D   
Sbjct: 168 YGLDAYAPNINGFRSPLWGRGQETPGEDANFLTSSYAYEYITGLQG--GI------DPDN 219

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
           LKI+A  KH+A YDL+NW GN R  FD+R+T+QD+ E +   F          S MCSYN
Sbjct: 220 LKIAATAKHFAGYDLENWGGNSRLGFDARITQQDLAEYYTPQFLAASRYAKARSFMCSYN 279

Query: 245 RVNGIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
            VN IP+C+   LL   +R  W+F  +GY+ SDCD++  +   H + ++ +  A A  L+
Sbjct: 280 SVNAIPSCSSSFLLQTLLREQWDFPEYGYVSSDCDAVYNVFNPHGYASN-QSSAAAESLR 338

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP-QYKNLGKNN 361
           AG D+DCG  Y+     +  +G +   +I+ S+  LY  L++LGYFDG   +Y+ LG N+
Sbjct: 339 AGTDIDCGQTYSWHLNQSFIEGSVTRGEIERSILRLYSNLVKLGYFDGDKNEYRQLGWND 398

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           +       ++ EAA +GIVLLKND G LPL + N+K++ALVGP ANATK + GNY GT  
Sbjct: 399 VVTTDAWNISYEAAVEGIVLLKND-GVLPL-SKNVKSVALVGPWANATKQLQGNYFGTAP 456

Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
              +P+ G       +NYA G  +I          A+ AAK +D  V + G+D ++EAEG
Sbjct: 457 YLITPLQGASDAGYKVNYALGT-NISGNTTDGFANALSAAKKSDVIVYLGGIDNTIEAEG 515

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
            DR+++  P  Q +LI +++   K P+ ++ M  G VD +  K+N K+ +++W GYPG+ 
Sbjct: 516 TDRMNVTWPRNQLDLIQQLSQTGK-PLVVLQMGGGQVDSSSIKSNSKVNALIWGGYPGQS 574

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRP-VNNFPGRTYKFFDGPVVY 599
           GG+AI D++ GK  P GRL  T Y A Y  + P T M LRP   + PG+TY ++ G  VY
Sbjct: 575 GGKAIFDILKGKRAPAGRLVSTQYPAEYATQFPATDMSLRPDGKSNPGQTYMWYIGKPVY 634

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
            FGYGL YT FK            KL       DI+  V + + P             Y+
Sbjct: 635 EFGYGLFYTTFKETAK--------KLGSSSSSFDISEIVSSPRSPS------------YE 674

Query: 660 FTFQI-------EVENMGKMDGSEVVMVYSKP--PGIAGTHIKQVIGYERV-FIAAGQSA 709
           ++  +        ++N GK       M+++     G A    K ++GY+R+  I  G+SA
Sbjct: 675 YSELVPFLNVTATIKNTGKTASPYTAMLFANTTNAGPAPYPNKWLVGYDRLPSIEPGKSA 734

Query: 710 KV 711
            +
Sbjct: 735 DL 736


>gi|442803736|ref|YP_007371885.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
           DSM 8532]
 gi|442739586|gb|AGC67275.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
           DSM 8532]
          Length = 715

 Score =  437 bits (1123), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 273/759 (35%), Positives = 404/759 (53%), Gaps = 104/759 (13%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D    + ERAKDLV RMT+ EKV QM   +  + RLG+P Y WW+EALHGV+  G   
Sbjct: 7   YLDPSYSFEERAKDLVSRMTIEEKVSQMLYNSPAIERLGIPAYNWWNEALHGVARAGT-- 64

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   A+F+E L  K+   +STE RA Y+  +        
Sbjct: 65  --------------ATMFPQAIGMAATFDEELIYKVADVISTEGRAKYHASSKKGDRGIY 110

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLTFWSPNIN+ RDPRWGR  ET GEDPY+  R  + +V+GLQ          +  + L
Sbjct: 111 KGLTFWSPNINIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQG---------NHPKYL 161

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K +AC KH+A +       + R  F++ V+++D+ ET++  F+  V E  V SVM +YNR
Sbjct: 162 KAAACAKHFAVHSGPE---SLRHEFNAVVSKKDLYETYLPAFKALVQEAKVESVMGAYNR 218

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
            NG P C    LL+  +RG+W F G++VSDC +I+     H  +  T  ++ A  ++ G 
Sbjct: 219 TNGEPCCGSKTLLSDILRGEWGFKGHVVSDCWAIRDF-HMHHHVTATAPESAALAVRNGC 277

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNIC 363
           DL+CG+ + N  + A+++G I E +ID ++  L I  M+LG FD   Q  Y ++  + + 
Sbjct: 278 DLNCGNMFGNLLI-ALKEGLITEEEIDRAVTRLMITRMKLGMFDPEDQVPYASISYDFVD 336

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
             +H ELA + A++ IVLLKND G LPL+   I+++A++GP+A++ +A+IGNYEGT   Y
Sbjct: 337 CKEHRELALDVAKKSIVLLKND-GLLPLDRKKIRSIAVIGPNADSRQALIGNYEGTASEY 395

Query: 424 TSPMDGFYAYSK---VINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
            + +DG    +     I Y+ GC       + + +    I  A+  A++AD  ++  GLD
Sbjct: 396 VTVLDGIREMAGDDVRIYYSVGCHLYKDRVENLGEPGDRIAEAVTCAEHADVVIMCLGLD 455

Query: 475 LSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
            ++E E           D+ DL LPG Q EL+  V    K P+ LV+++  A+ + +A  
Sbjct: 456 STIEGEEMHESNIYGSGDKPDLNLPGQQQELLEAVYATGK-PIVLVLLTGSALAVTWADE 514

Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
           +  I +IL   YPG  GGRAIA V+FG+ NP G+LP+T+Y        +T   +      
Sbjct: 515 H--IPAILNAWYPGALGGRAIASVLFGETNPSGKLPVTFYRTTEELPDFTDYSME----- 567

Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
             RTY+F     +YPFG+GLSYT F Y         D+KL KD        T+   +   
Sbjct: 568 -NRTYRFMKNEALYPFGFGLSYTTFDYS--------DLKLSKD--------TIRAGE--- 607

Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIK--QVIGYERVFI 703
                         F   ++V N GKM G EVV VY K    A   +   Q+ G +RV +
Sbjct: 608 -------------GFNVSVKVTNTGKMAGEEVVQVYIKDLE-ASWRVPNWQLSGMKRVRL 653

Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            +G++A++ F +   + L +V +   S++  G   I VG
Sbjct: 654 ESGETAEITFEIRP-EQLAVVTDEGKSVIEPGEFEIYVG 691


>gi|292495634|sp|A1CND4.2|XYND_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
          Length = 792

 Score =  435 bits (1119), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 273/754 (36%), Positives = 399/754 (52%), Gaps = 48/754 (6%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD      +RA  L+   TL E V   G+ + GVPRLGLP Y+ W+EALHG+  
Sbjct: 57  LSKTIVCDTLTSPYDRAAALISLFTLEELVNATGNTSPGVPRLGLPPYQVWNEALHGLD- 115

Query: 69  IGRRTNSPPGTHFDSE--VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
                      +F  E     +TSFP  ILT ++ N +L  ++   +ST+ RA  N G  
Sbjct: 116 ---------RAYFTDEGQFSWSTSFPMPILTMSALNRTLINQVASIISTQGRAFSNAGRY 166

Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSRPL 185
           GL  +SPNIN  R P WGR  ETPGED Y +   YA  Y+ G+Q   GV      D + L
Sbjct: 167 GLDVYSPNINSFRHPVWGRGQETPGEDAYCLSSAYAYEYITGIQG--GV------DPKSL 218

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K+ A  KHYA YD++NW+G+ R   D  +T+QD+ E +   F +   +  V SVMCSYN 
Sbjct: 219 KLVATAKHYAGYDIENWDGHSRLGNDMNITQQDLSEYYTPQFLVAARDAKVRSVMCSYNA 278

Query: 246 VNGIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           VNG+P+CA+   L   +R  + F   GYI SDCDS   +   H++  +    A A  ++A
Sbjct: 279 VNGVPSCANSFFLQTLLRDTFGFVEDGYISSDCDSAYNVFNPHEYAANVSS-AAADSIRA 337

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNI 362
           G D+DCG  Y  +   AV Q  ++ ADI+  +  LY  LMRLGYFDG S  Y+NL  N++
Sbjct: 338 GTDIDCGTTYQYYFDEAVDQNLLSRADIERGVIRLYSNLMRLGYFDGNSSAYRNLTWNDV 397

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
                  ++ E   +G VLLKND G LPL+  +I+++ALVGP  N +  + GNY G    
Sbjct: 398 VTTNSWNISYEV--EGTVLLKND-GTLPLSE-SIRSIALVGPWMNVSTQLQGNYFGPAPY 453

Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
             SP+D F      +NYA G  +I   +      A+ AAK +DA +   G+D S+EAE  
Sbjct: 454 LISPLDAFRDSHLDVNYAFGT-NISSNSTDGFSKALSAAKKSDAIIFAGGIDNSLEAETL 512

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
           DR+++  PG Q ELI++++   K P+ ++ M  G VD +  K+N  + S++W GYPG+ G
Sbjct: 513 DRMNITWPGKQLELIDQLSQLGK-PLIVLQMGGGQVDSSLLKSNKNVNSLIWGGYPGQSG 571

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
           G+A+ D+I GK  P GRL +T Y A Y  + P T M LRP  N PG+TY ++ G  VY F
Sbjct: 572 GQALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGNNPGQTYMWYTGTPVYEF 631

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           G+GL YT F+   A +     +K+      +D+       +P    + ++ +        
Sbjct: 632 GHGLFYTTFRVSHARA-----VKIKPTYNIQDL-----LAQPHPGYIHVEQMPF----LN 677

Query: 662 FQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
           F +++ N GK       M+++    G A    K ++G++R+      ++K+        S
Sbjct: 678 FTVDITNTGKASSDYTAMLFANTTAGPAPYPKKWLVGFDRLPTLGPSTSKLMTIPVTINS 737

Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVSFPLQL 754
           +   D   N +L  G + + +      V  PL L
Sbjct: 738 MARTDELGNRVLYPGKYELALNNE-RSVVLPLSL 770


>gi|164429277|ref|XP_958209.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
 gi|16945419|emb|CAB91343.2| related to xylan 1, 4-beta-xylosidase [Neurospora crassa]
 gi|157073010|gb|EAA28973.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
          Length = 774

 Score =  435 bits (1119), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 272/751 (36%), Positives = 399/751 (53%), Gaps = 57/751 (7%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+    CDA L  P+RA  LV  MT  EK+Q +   + G PR+GLP Y WWSEALHGV++
Sbjct: 36  LASLKVCDATLSPPQRAAALVAAMTTEEKLQNLVSKSKGAPRIGLPAYNWWSEALHGVAY 95

Query: 69  IGRRTNSPPGTHF---DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
                   PGT F   D     +TSFP  +L  A+F++ L +K+G+ + TE RA  N G 
Sbjct: 96  A-------PGTQFRSGDGPFNSSTSFPMPLLMAATFDDELIEKVGEVIGTEGRAFGNAGF 148

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
           +G  +W+PN+N  +DPRWGR  ETPGED   + RYA + +RGLQ   G    R       
Sbjct: 149 SGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIRGLQ---GPLPER------- 198

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           ++ A CKHYAA D ++W G+ R  FD++VT QD+ E ++ PF+ C  +  V S+MCSYN 
Sbjct: 199 RVVATCKHYAANDFEDWNGSTRHDFDAKVTLQDLAEYYLSPFQQCARDSKVGSIMCSYNA 258

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           VNG+P CA+  L+   +R  WN+     YI SDC+++  I  +H +   T  +  A   +
Sbjct: 259 VNGVPACANTYLMQTILREHWNWTAPGNYITSDCEAVLDIFANHHYAK-TNAEGTALAFE 317

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNN 361
           AG D  C    ++   GA  QG + ++ +D +L  LY  L+R+GYFDG+  +Y +LG  +
Sbjct: 318 AGTDSSCEYESSSDIPGAWTQGLLEQSTVDRALTRLYEGLVRVGYFDGNHSEYASLGWKD 377

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNG-ALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
           + +P+  E+A + A +GIVLLKND    L L T     LA++G  AN  K + G Y G P
Sbjct: 378 VNSPKSQEVALQTAVEGIVLLKNDQTLPLGLKTDPKSKLAMIGFWANDPKTLSGGYSGKP 437

Query: 421 CRYTSPMDGFYAYSKVINYAPG-CADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
               SP+    A    +  A G        N++   AA++AA++A+  +   GLD S   
Sbjct: 438 AFEHSPVYAAEAMGFNVTTAGGPVLQNSTSNDTWTQAALEAAQDANYILYFGGLDTSAAG 497

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
           E KDR  +  P  Q +LI  +    K P+ +V M    +D         + SILW  +PG
Sbjct: 498 ETKDRTTINWPEAQLQLIKTLTKLGK-PLVVVQM-GDQLDNTPLLATKTVNSILWANWPG 555

Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVV 598
           ++GG A+  ++ G  +P GRLP+T Y ANY   +P T M LRP +  PGRTY+++    V
Sbjct: 556 QDGGTAVMQILTGLKSPAGRLPVTQYPANYTAAVPMTDMNLRPSDRLPGRTYRWYPT-AV 614

Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
            PFG+GL YT F+ K+A+    + I+ D   +C   N     N  P    L         
Sbjct: 615 QPFGFGLHYTTFQAKIAAPLPRLAIQ-DLLSRCGGDN----ANAYPDTCALP-------- 661

Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH---IKQVIGYERVF-IAAGQ--SAKVG 712
               ++EV N G      VV+ +    G AG     IK ++ Y R+  ++ G   +A + 
Sbjct: 662 --PLKVEVTNSGNRSSDYVVLAFLA--GDAGPRPYPIKTLVSYTRLRDVSPGHKTTAHLE 717

Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVGE 743
           +T+     +   D   N++L  G +T+ V E
Sbjct: 718 WTLG---DIARYDEQGNTVLYPGTYTVTVDE 745


>gi|347832625|emb|CCD48322.1| glycoside hydrolase family 3 protein [Botryotinia fuckeliana]
          Length = 772

 Score =  435 bits (1119), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 266/625 (42%), Positives = 364/625 (58%), Gaps = 34/625 (5%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L++   CD       RA  L+   TL EKV   G+ + GVPR+GLP YEWW+EALHG++ 
Sbjct: 28  LANNTVCDTSSDPYTRAAALISLFTLAEKVNNTGNTSPGVPRIGLPSYEWWNEALHGIA- 86

Query: 69  IGRRTNSPPGTHF---DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
                   PGT F    S    +TSFP  IL  A+F++ L  K+   VSTEARA  N+  
Sbjct: 87  ------RSPGTTFAATGSNYSYSTSFPQPILMGATFDDELIHKVATQVSTEARAFNNVNR 140

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GL FW+PNIN  +DPRWGR  ETPGEDP+    Y    + GLQ   G+      D  P 
Sbjct: 141 FGLNFWTPNINPYKDPRWGRGQETPGEDPFHTSSYVNALITGLQG--GL------DDLPY 192

Query: 186 KIS-ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
           K   A CKH+A YDL++ +G  R+ FD+ +  QD+++ ++ PF+ C  + +V SVMCSYN
Sbjct: 193 KKGVATCKHFAGYDLESSDGAIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSVMCSYN 252

Query: 245 RVNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
            +NG+PTCAD  LL   +R  W +     ++ SDCD+++ I + H +   T E + A  L
Sbjct: 253 AMNGVPTCADDWLLQTLLREHWGWTEEDQWVTSDCDAVKNIWDYHNY-TLTPEQSAADAL 311

Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGK 359
            AG DLDCG ++  +   A  QG    + +D SL   Y  L+RLGYFD      Y+ L  
Sbjct: 312 NAGTDLDCGTFWPTYLGSAYDQGLYDISTLDRSLARRYASLVRLGYFDPPSVQPYRQLNW 371

Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
           +N+  P   +LA +AA  GIVLLKND G LPL++ NI  +AL+GP ANATK M GNY GT
Sbjct: 372 DNVSTPAAQQLALQAAEDGIVLLKND-GILPLSS-NITNVALIGPLANATKQMQGNYYGT 429

Query: 420 PCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
                SP+         + Y  G ADI  QN +   AAI AA++AD  + V G+D S+EA
Sbjct: 430 APYLRSPLIAAQNAGFKVTYVQG-ADIDSQNTTDFSAAISAAQSADLVIYVGGIDNSIEA 488

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-VDINFAKNNPKIKSILWVGYP 538
           E  DR  +  P  Q  LIN++A+ +     L+I   G  +D +   +N  + ++LW GYP
Sbjct: 489 EEIDRTSISWPSSQLSLINQLANLS---TPLIISQMGCMIDSSSLLSNTGVNALLWAGYP 545

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPV 597
           G++GG AI +++ GK  P GRLPIT Y +NYV ++  T M L+P    PGRTYK+++G  
Sbjct: 546 GQDGGTAIFNILTGKTAPAGRLPITQYPSNYVNQVTMTDMNLQPSRFNPGRTYKWYNGEP 605

Query: 598 VYPFGYGLSYTQFKYKVA-SSPKSV 621
           V+ +GYGL YT F  K+  SSP + 
Sbjct: 606 VFEYGYGLQYTTFDAKITPSSPNNT 630


>gi|223945397|gb|ACN26782.1| unknown [Zea mays]
          Length = 516

 Score =  435 bits (1118), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 228/521 (43%), Positives = 317/521 (60%), Gaps = 17/521 (3%)

Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
           MCSYNRVNG+PTCAD  LL+ T R DW F+GYI SDCD++  I ++  +   T EDAVA 
Sbjct: 1   MCSYNRVNGVPTCADYNLLSTTARQDWGFYGYITSDCDAVAIIHDAQGYAK-TAEDAVAD 59

Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKN 356
           VLKAG+D++CG Y  +    A+QQGKI E DI+ +L  L+ V MRLG F+G P+   Y +
Sbjct: 60  VLKAGMDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGD 119

Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGA--LPLNTGNIKTLALVGPHANATKAMIG 414
           +G + +C  +H +LA EAA+ GIVLLKND GA  LPL+  N+ +LA++G +AN    + G
Sbjct: 120 IGPDQVCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRG 179

Query: 415 NYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
           NY G PC   +P+     Y K  ++  GC    C N + IP A+ AA +AD+ V+  GLD
Sbjct: 180 NYFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAAC-NVTTIPEAVQAASSADSVVLFMGLD 238

Query: 475 LSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
              E E  DR+DL LPG Q  LI  VA+AAK PV LV++  G VD++FAK NPKI +ILW
Sbjct: 239 QDQEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILW 298

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKF 592
            GYPGE GG AIA V+FG++NPGGRLP+TWY  ++ ++P T M +R  P   +PGRTY+F
Sbjct: 299 AGYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPQDFTRVPMTDMRMRADPATGYPGRTYRF 358

Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
           + GP V+ FGYGLSY+++ ++ A+ P             + +  T G          I  
Sbjct: 359 YRGPTVFNFGYGLSYSKYSHRFATKPPPT----SNVAGLKAVEATAG-GMASYDVEAIGS 413

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI---AGTHIKQVIGYERVFIAAGQSA 709
             C   KF   + V+N G MDG   V+V+ + P     +G    Q+IG++ + + A Q+A
Sbjct: 414 ETCDRLKFPAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSLHLRATQTA 473

Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
            V F ++ CK           ++  G+H ++VGE    +SF
Sbjct: 474 HVEFEVSPCKHFSRATEDGRKVIDQGSHFVMVGEDEFEMSF 514


>gi|375150455|ref|YP_005012896.1| Beta-glucosidase [Niastella koreensis GR20-10]
 gi|361064501|gb|AEW03493.1| Beta-glucosidase [Niastella koreensis GR20-10]
          Length = 711

 Score =  434 bits (1116), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 275/747 (36%), Positives = 389/747 (52%), Gaps = 100/747 (13%)

Query: 20  PYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGT 79
           P   R  DL+ ++TLPEK+  +G  +  V RLG+P Y WW+EALHGV+  G         
Sbjct: 23  PMEARVNDLLHQLTLPEKISLLGYRSKEVERLGIPAYNWWNEALHGVARAGV-------- 74

Query: 80  HFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFW 131
                   AT FP  I   A+FN+ L K+    +STEARA YNL  A        GLTFW
Sbjct: 75  --------ATVFPQAIGMAATFNDDLLKEAATVISTEARAKYNLSLAQGRHLQYMGLTFW 126

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
           SPNIN+ RDPRWGR  ET GEDP++       +V+GLQ          +D R LK SAC 
Sbjct: 127 SPNINIFRDPRWGRGQETYGEDPFLTAHMGTAFVKGLQ---------GNDPRYLKASACA 177

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KH+A +   +   N R  F++ V E+D++ET++  F   V+ G V SVMC+YNRVN  P 
Sbjct: 178 KHFAVH---SGPENGRHTFNAIVDEKDLRETYLYAFHALVDAG-VESVMCAYNRVNDQPC 233

Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
           C+   LLN  +R +W F G++V+DC ++  I   HK +    E A A  +KAG++LDC +
Sbjct: 234 CSGNFLLNSILRNEWKFKGHVVTDCGALDDIFMRHKVMPSGVEVAAA-AIKAGVNLDCSN 292

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD---GSPQYKNLGKNNICNPQHI 368
                   AV+Q  + E DID+SL  L    ++LG++D    +P YK  G +++ N  H 
Sbjct: 293 VLQKDVEKAVEQKLLNEKDIDSSLAHLLRTQIKLGFYDDPTANPFYK-YGADSVANTAHA 351

Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
            LA   A+Q +VLLKN N  LPL+      + +VG ++ +  A++GNY G   R  S ++
Sbjct: 352 TLARAMAQQSMVLLKNSNQLLPLDKKKYPAIMVVGTNSASMDALLGNYHGVSNRAVSFVE 411

Query: 429 GFYAYSKV---INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL---------DLS 476
           G          + Y  G       N++     I AA NAD TV V GL         D  
Sbjct: 412 GITNAVDAGTRVEYDQGSD----YNDTTHFGGIWAAGNADITVAVIGLTPVYEGEEGDAF 467

Query: 477 VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
           + A+G D+ D+ LP      +  +  A K P+  VI +  AVDI+  +  P   +IL   
Sbjct: 468 LAAKGGDKPDMSLPAAHIAFMKALRKANKKPIIAVITAGSAVDISAIE--PYADAILLAW 525

Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
           YPGE+GG A+AD++FGK +P GRLP+T+Y++      +  +P        GRTY++F+G 
Sbjct: 526 YPGEQGGNALADILFGKVSPAGRLPVTFYQS------FADVPAYDNYAMKGRTYRYFNGK 579

Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
           V YPFGYGLSYT F Y+    P   +I+  KD                            
Sbjct: 580 VQYPFGYGLSYTSFAYEWQQMP--ANIRTAKDS--------------------------- 610

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
               +F I+V+N G MDG EVV VY + P +    +K++  ++RV + AG    V  T+ 
Sbjct: 611 ---VSFSIKVKNTGSMDGDEVVQVYVEYPAVERMPLKELKAFKRVHVKAGGEETVQLTIP 667

Query: 717 ACKSLKIVDNAANSL-LASGAHTILVG 742
           A   L+  D A +S  L  G++ I  G
Sbjct: 668 AS-DLQKWDLATSSWKLYPGSYNIFAG 693


>gi|392570764|gb|EIW63936.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
           SS1]
          Length = 781

 Score =  434 bits (1115), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 277/732 (37%), Positives = 392/732 (53%), Gaps = 28/732 (3%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD       RA  L+   T  E      + + GVPRLGLP Y WWSE LHGV+     T 
Sbjct: 41  CDVTKDPITRATALISIWTDEELTNNTVNASPGVPRLGLPAYNWWSEGLHGVAQSPGVTF 100

Query: 75  SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
           +P G         ATSFP  IL  A+F++ L + I   VSTE RA  N G AGL +W+PN
Sbjct: 101 APSG-----NFSYATSFPQPILMGAAFDDPLIQAIATIVSTEGRAFNNAGRAGLDYWTPN 155

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP-LKISACCKH 193
           IN  +DPRWGR  ETPGEDP+ + +Y  N + GLQ           D +P  K+ A CKH
Sbjct: 156 INPFKDPRWGRGQETPGEDPFHLSQYVYNLILGLQG--------GLDPKPYFKVVADCKH 207

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           +AAYD+DNWEG  R+ F++ V++QD+ E ++ PF+ CV +  V+SVMCSYN VNGIP+CA
Sbjct: 208 FAAYDMDNWEGVVRYGFNAVVSQQDLSEFYLPPFQTCVRDAKVASVMCSYNAVNGIPSCA 267

Query: 254 DPKLLNQTIRGDWNFHG--YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
           +  LL   +R  W F    ++ SDCD++Q I   H +  D    A A  L AG D+DCG 
Sbjct: 268 NSFLLQDVLRDHWGFTDDRWVTSDCDAVQNIFTPHNYTTD-PAQAAADALLAGTDIDCGT 326

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICNPQHIE 369
           + + +   A+Q+G +   D+  +    Y  L+RLGYFD   +  Y+ LG +++   Q  +
Sbjct: 327 FSSTYLPEALQRGLVNSTDLRRAAIRQYASLVRLGYFDDPAAQPYRQLGWSDVNTLQAQQ 386

Query: 370 LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG 429
           LA  AA +G+VLLKND G LPL+   ++ LAL+GP ANAT+ + GNY G      SP+ G
Sbjct: 387 LAHTAAVEGMVLLKND-GLLPLSK-RVRKLALIGPWANATRLLQGNYFGIAPYLVSPVQG 444

Query: 430 FYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLL 489
                  + Y  G       + S   AA+ AAK ADA V   GLD +VE E  DR+++  
Sbjct: 445 AQQAGFEVEYVFGTNVTTRNDTSGFAAAVAAAKRADAVVFAGGLDETVEREEIDRLNVTW 504

Query: 490 PGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADV 549
           PG Q +L+ ++    K P+ +     G +D    K +  + +I+W GYPG+ GG A+ D+
Sbjct: 505 PGNQLDLVAELERVGK-PLIVAQFGGGQLDNTALKRSKAVNAIIWGGYPGQSGGTALFDI 563

Query: 550 IFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYT 608
           + GK  P GRLPIT Y A Y  ++P T M LRP    PGRTYK++ G  V+ FG+GL YT
Sbjct: 564 LTGKAAPAGRLPITQYPAAYAEQVPMTDMTLRPSATNPGRTYKWYSGTPVFEFGFGLHYT 623

Query: 609 QFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVEN 668
            F +  A+   + D         +  + +        +A  +D         TF + V N
Sbjct: 624 TFAFAWAAPGAAADSTASFGGPAKSYSISQLVAHGQESAAFLDLAPLD----TFAVRVTN 679

Query: 669 MGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
            GK+    V +++ S   G A    K ++ Y R+   A + + VG       ++   D  
Sbjct: 680 TGKVASDYVALLFVSGSFGPAPHPKKTLVAYTRIHGLAPRGSTVGQLPVTLGAIARADEN 739

Query: 728 ANSLLASGAHTI 739
               +  G +T+
Sbjct: 740 GEKWVHPGTYTL 751


>gi|220927661|ref|YP_002504570.1| glycoside hydrolase [Clostridium cellulolyticum H10]
 gi|219997989|gb|ACL74590.1| glycoside hydrolase family 3 domain protein [Clostridium
           cellulolyticum H10]
          Length = 712

 Score =  433 bits (1114), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 267/759 (35%), Positives = 393/759 (51%), Gaps = 106/759 (13%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D  L + ERA DLV RMTL EK  Q+   A  V RLG+P Y WW+EALHGV+  G   
Sbjct: 6   YLDKSLSFKERAVDLVSRMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   A F++   +KI   ++TE RA YN  +        
Sbjct: 64  --------------ATVFPQAIGLAAIFDDEFLEKIADVIATEGRAKYNESSKKGDRDIY 109

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            G+TFWSPN+N+ RDPRWGR  ET GEDPY+  R  + +V+GLQ           D + L
Sbjct: 110 KGITFWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQ----------GDGKYL 159

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K +AC KH+A +   +   +DR HF++  +++DM ET++  FE  V E  V SVM +YNR
Sbjct: 160 KSAACAKHFAVH---SGPEDDRHHFNAVASQKDMYETYLPAFEALVKEAKVESVMGAYNR 216

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
            NG P      LL   +R DW F G++VSDC +I+   E H  +  T  ++VA  LK G 
Sbjct: 217 TNGEPCNGSKTLLKDILRDDWGFDGHVVSDCWAIKDFHEGHG-VTKTPTESVALALKNGC 275

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNP 365
           DL+CG+ Y    + A+++GKI E DID +   L    M+LG FD   ++  +      + 
Sbjct: 276 DLNCGNMYL-LILLALKEGKITEEDIDRAAIRLMTTRMKLGMFDDDCEFDKIPYEVNDSI 334

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H +L+ EAAR+ +VLLKN NG LPL++  IK +A++GP+A+++ A+  NY GTP    +
Sbjct: 335 EHNKLSLEAARKSMVLLKN-NGLLPLDSKKIKNIAVIGPNADSSLALRANYSGTPSHNIT 393

Query: 426 PMDGFYA---------YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
            +DG  +         YS   +      + + Q +  +  A+  A+ +D  V+  GLD S
Sbjct: 394 ILDGVRSRVSEDTRVWYSLGSHLFMNREEDLAQPDDRLKEAVSMAERSDVVVLCLGLDAS 453

Query: 477 VEAE-----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
           VE E           G D+ DL LP  Q  L+N V    K P  + ++S  A+ I  A +
Sbjct: 454 VEGEQNDQGTVILDAGGDKADLNLPESQRNLLNAVLATGK-PTIVALLSGSALSIGDAAD 512

Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
             K  +I+   YPG +GG A A++IFG Y+P GRLP+T+Y++     P+    +      
Sbjct: 513 --KAAAIVQCWYPGSKGGLAFAEMIFGDYSPAGRLPVTFYKSTEELPPFEDYSME----- 565

Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
             RTYKF  G  +YPFG+GLSYT F+Y     P++V+                       
Sbjct: 566 -NRTYKFMKGEALYPFGFGLSYTNFEYSNIVCPQAVN----------------------- 601

Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI--KQVIGYERVFI 703
                          +  ++V+N G +D  EVV VY K    A   +    + G++R+F+
Sbjct: 602 ----------NGESLSVSVDVQNAGSVDSDEVVQVYIKDME-ASVRVPNHSLCGFKRIFL 650

Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            +G+   V F +++ +++ IVD      + +G  T+ VG
Sbjct: 651 KSGEKKTVTFEIDS-RAMTIVDEEGKRYIENGDFTLYVG 688


>gi|376259588|ref|YP_005146308.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
 gi|373943582|gb|AEY64503.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
          Length = 712

 Score =  433 bits (1113), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 270/759 (35%), Positives = 397/759 (52%), Gaps = 106/759 (13%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D  L + ERA DLV RMTL EK  Q+   A  V RLG+P Y WW+EALHGV+  G   
Sbjct: 6   YLDKSLSFKERAADLVSRMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   A F++   +KI   ++TE RA YN  NA       
Sbjct: 64  --------------ATVFPQAIGMAAIFDDEFLEKIADVIATEGRAKYN-ENAKKGDRDI 108

Query: 127 --GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             G+TFWSPN+N+ RDPRWGR  ET GEDPY+  R  + +V+GLQ           D + 
Sbjct: 109 YKGITFWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQ----------GDGKY 158

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
           LK +AC KH+A +   +   +DR HFD+ V+++D+ ET++  FE  V E  V SVM +YN
Sbjct: 159 LKTAACAKHFAVH---SGPEDDRHHFDAVVSQKDLYETYLPAFEALVKEAKVESVMGAYN 215

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           R NG P      LL   +R  W F G++VSDC +I+   E H  +  T  ++VA  LK+G
Sbjct: 216 RTNGEPCNGSKTLLKDILRDGWGFDGHVVSDCWAIKDFHEGHG-VTKTPTESVALALKSG 274

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICN 364
            DL+CG+ Y    + A+++G+I E DID +   L    MRLG FD   ++  +      +
Sbjct: 275 CDLNCGNMYL-LILLALKEGRITEEDIDRAAIRLMTTRMRLGMFDDDCEFDKIPYELNDS 333

Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
            +H +L+ EAA++ +VLLKND G LPL++  IK +A++GP+A+++ A+  NY GTP +  
Sbjct: 334 VEHNKLSLEAAKKSMVLLKND-GLLPLDSKKIKNIAVIGPNADSSLALRANYSGTPSQNI 392

Query: 425 SPMDGF---YAYSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDL 475
           + +DG     +    + Y+ G        + + Q +  +  A+  A+ +D  V+  GLD 
Sbjct: 393 TILDGIRKRVSEDTRVWYSVGSHLFMNREEDLAQPDDRLKEAVSVAERSDVVVLCLGLDA 452

Query: 476 SVEAE-----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAK 524
           SVE E           G D+ DL LP  Q  L+N V    K P  + ++S  A+ I  A 
Sbjct: 453 SVEGEQNDQGTVILDAGGDKADLNLPESQRNLLNAVLATGK-PTIVALLSGSALSIGDAA 511

Query: 525 NNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNN 584
           +  K  +I+   YPG  GG A A++IFG Y+P GRLP+T+Y++     P+    +     
Sbjct: 512 D--KAAAIVQCWYPGSRGGLAFAEMIFGDYSPAGRLPVTFYKSTEELPPFADYSME---- 565

Query: 585 FPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPP 644
              RTYKF  G  +YPFG+GLSYT F+Y     P++V+                G N   
Sbjct: 566 --NRTYKFMKGEALYPFGFGLSYTNFEYSNIVCPQNVN---------------NGEN--- 605

Query: 645 CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIK-QVIGYERVFI 703
                           +  ++V+N G +D  EVV VY K    +    K  + G++R+ +
Sbjct: 606 ---------------LSVSVDVQNAGSVDSDEVVQVYIKDMDASVRVPKYSLCGFKRIHL 650

Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            +G+   V F +++  ++ IVD A    + +G  T+ VG
Sbjct: 651 KSGEKKTVTFEIDS-NAMTIVDEAGKRYIENGEFTLYVG 688


>gi|336471692|gb|EGO59853.1| hypothetical protein NEUTE1DRAFT_99999 [Neurospora tetrasperma FGSC
           2508]
 gi|350292807|gb|EGZ74002.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
          Length = 770

 Score =  433 bits (1113), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 267/750 (35%), Positives = 401/750 (53%), Gaps = 55/750 (7%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+    CD  L  P+RA  LV  MT  EK+Q +   + G PR+GLP Y WWSEALHGV++
Sbjct: 36  LASLKVCDVTLSPPQRAAALVAAMTTEEKLQNLVSKSKGAPRIGLPAYNWWSEALHGVAY 95

Query: 69  IGRRTNSPPGTHF---DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
                   PGT F   D     +TSFP  +L  A+F++ L +K+G+ + TE RA  N G 
Sbjct: 96  A-------PGTQFWSGDGPFNASTSFPMPLLMAATFDDELIEKVGEVIGTEGRAFGNAGF 148

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
           +G  +W+PN+N  +DPRWGR  ETPGED   + RYA + +RGLQ            +R  
Sbjct: 149 SGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIRGLQ----------GPARER 198

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           ++ A CKHYAA D ++W G+ R  F+++VT QD+ E ++ PF+ C  +  V S+MCSYN 
Sbjct: 199 RVVATCKHYAANDFEDWNGSTRHDFNAKVTLQDLAEYYLSPFQQCARDSKVGSIMCSYNA 258

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           VNG+P CA+  L+   +R  WN+     YI SDC+++  I  +H +  +T  +  A   +
Sbjct: 259 VNGVPACANTYLMQTILREHWNWTAPGNYITSDCEAVLDISANHHYA-ETNAEGTALAFE 317

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNN 361
           AG+D  C    ++   GA  QG + ++ +D +L+ +Y  L+R+GYFDG+  +Y +LG  +
Sbjct: 318 AGIDSSCEYESSSDIPGAWTQGLLEQSTVDRALKRIYEGLVRVGYFDGNHSEYASLGWKD 377

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLN--TGNIKTLALVGPHANATKAMIGNYEGT 419
           + +P+  E+A +AA +GIVLLKND   LPL+  T     LA++G  AN  K + G Y G 
Sbjct: 378 VNSPKSQEVALQAAVEGIVLLKNDK-TLPLDLRTDPKSKLAMIGFWANDPKTLSGGYSGK 436

Query: 420 PCRYTSPMDGFYAYSKVINYAPG-CADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
           P    SP+    A    +  A G        N++   AA++AAK+A+  +   G D S  
Sbjct: 437 PAFEHSPVYAAQAMGFSVTTAGGPVLQNSTSNDTWTQAALEAAKDANYILYFGGQDTSAA 496

Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
            E KDR  +  P  Q +LI  ++   K P+ +V M    +D         + +ILW  + 
Sbjct: 497 GETKDRTTINWPEAQLQLITTLSKLGK-PLVVVQM-GDQLDNTPLLAAKAVNAILWANWL 554

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPV 597
           G++GG A+  ++ G  NP GRLP+T Y ANY   +P T M LRP +  PGRTY+++    
Sbjct: 555 GQDGGTAVMQILTGLKNPAGRLPVTQYPANYTAAVPMTDMNLRPSDKLPGRTYRWYPT-A 613

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           V PFG+GL YT F+ K+A     + I+ D   +C   N     N  P    L        
Sbjct: 614 VQPFGFGLHYTTFQTKIAVPLPRLAIQ-DLLSRCGGDN----ANAYPDTCALP------- 661

Query: 658 YKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF-IAAGQ--SAKVGF 713
                ++EV N G      VV+ + +   G     IK ++ Y R+  ++ G   +A + +
Sbjct: 662 ---PLKVEVTNSGNRSSDYVVLAFLAGDVGPKPYPIKTLVSYTRLRDLSPGHKTTAHLKW 718

Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVGE 743
           T+     +   D   N++L  G +T+ V E
Sbjct: 719 TLG---DIARYDEQGNTVLYPGTYTVTVDE 745


>gi|378730020|gb|EHY56479.1| beta-glucosidase, variant [Exophiala dermatitidis NIH/UT8656]
 gi|378730021|gb|EHY56480.1| beta-glucosidase [Exophiala dermatitidis NIH/UT8656]
          Length = 783

 Score =  433 bits (1113), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 283/762 (37%), Positives = 409/762 (53%), Gaps = 65/762 (8%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS+   C+      +RAK LV  +T  EK    G+ + GVPRLGL  Y+WW EALHGV+ 
Sbjct: 29  LSNNTVCNTNASVADRAKALVAALTNEEKFNLTGNTSPGVPRLGLYSYQWWQEALHGVA- 87

Query: 69  IGRRTNSPPGTHFDS--EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
                 S PG +F +  +   ATSFP  IL +A+F+++L   +   VSTEARA  N+  +
Sbjct: 88  ------SSPGVNFSTSGDFSHATSFPQPILMSAAFDDALINAVATVVSTEARAFNNVNRS 141

Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK 186
           GL FW+PNIN  +DPRWGR  ETPGED + +  Y    + GLQ   G+       + P+K
Sbjct: 142 GLDFWTPNINPYKDPRWGRGQETPGEDTFHLKSYVAALIDGLQG--GL-------NPPIK 192

Query: 187 -ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
            + A CKH+ AYDL++W   DR++FD+ V+ QD+ E ++ PF+ C  +  V S+MCSYN 
Sbjct: 193 KVIATCKHFVAYDLEDWITTDRYNFDAIVSTQDLAEYYMQPFQTCARDARVGSIMCSYNA 252

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           +NG+PTCADP +L   +R  WN+     Y+ SDCD+IQ I   H +   T+E AVA  L 
Sbjct: 253 MNGVPTCADPYILQTVLREHWNWTDDGQYVTSDCDAIQNIYAPH-YYEPTREQAVADALT 311

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
           AG DL+CG YY      A  +G   +  ID ++  LY  L++LGYFD   +  Y++L  +
Sbjct: 312 AGTDLNCGTYYQTHLPAAFSEGLFNQTVIDQTITRLYSALIKLGYFDPPSATPYRSLNWS 371

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLN--TGNIKTLALVGPHANATKAMIGNYEG 418
           ++  P    LA +AA +GIVLLKND G LPL+  T    T+A++G  ANAT  M GNY G
Sbjct: 372 DVSTPAAEALALKAAEEGIVLLKND-GLLPLSFPTDKNTTVAIIGGWANATTTMQGNYFG 430

Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAA------IDAAKNADATVIVAG 472
                 SP+   YA  ++ N      + V      +P        + AA  AD  +I  G
Sbjct: 431 IAPYLHSPL---YALQQLPN-----INAVYGGGFGVPTTDGWDELLGAAGEADLIIIADG 482

Query: 473 LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
           L  S E+E  DR  +       ++IN++  +  G  T+ +     +D     NNP I ++
Sbjct: 483 LTTSDESESNDRYTIGWQPAAIDIINQL--SGMGKPTVFLQMGDQLDNTPLLNNPNISAL 540

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRT 589
           +W GYPG  GG A+ +++ GK  P GRLP+T Y A+YV ++  T M LRP   +  PGRT
Sbjct: 541 IWGGYPGMAGGDALINILTGKAAPAGRLPVTQYPADYVNQVNMTDMELRPNATSGNPGRT 600

Query: 590 YKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
           YK+++  V+ PFGYGL YT F    ++  ++      + Q     N + G       + L
Sbjct: 601 YKWYNNAVL-PFGYGLHYTNFSVAASAQGQA------QTQSGPSSNSSQGQGTSYNISSL 653

Query: 650 IDDVKCKDYKF-------TFQIEVENMGKMDGSEVVMV--YSKPPGIAGTHIKQVIGYER 700
           +       Y +       +F + V N G    S+ V +   S   G     IKQ++ Y+R
Sbjct: 654 VSSCDRSQYAYLDLCPFESFNVNVTNTGSKLASDFVALGFISGSYGPQPYPIKQLVAYQR 713

Query: 701 VF-IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
           +F I+AG SA     +    SL   D   N++L  G + +L+
Sbjct: 714 LFNISAGASATATLNL-TLGSLARHDENGNAVLYPGDYGLLI 754


>gi|348604625|dbj|BAK96214.1| beta-xylosidase [Acremonium cellulolyticus]
          Length = 797

 Score =  430 bits (1106), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 250/615 (40%), Positives = 352/615 (57%), Gaps = 26/615 (4%)

Query: 3   ESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEA 62
           + I   L D   CD    Y +RA+ L+   TL E +    + A GVPRLGLP Y+ WSEA
Sbjct: 52  DCINGPLKDNIVCDTSANYVDRAEGLIALFTLEELINNTQNTAPGVPRLGLPPYQVWSEA 111

Query: 63  LHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
           LHG+      T+         E   ATSFP  IL+ A+ N +L  +I   + T+ARA  N
Sbjct: 112 LHGLDRANFATSG-------DEWTWATSFPMPILSMAALNRTLINQIAGIIGTQARAFNN 164

Query: 123 LGNAGLTFWSPNINVVRDPRWGRVLETPGEDP-YVVGRYAINYVRGLQDVEGVEYHRDSD 181
            G  GL  ++PNIN  R P WGR  ETPGED  ++   YA  Y+ GLQ   GV      D
Sbjct: 165 AGRYGLDAYAPNINGFRSPLWGRGQETPGEDANFLSSSYAYEYITGLQG--GV------D 216

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
              LK+ A  KH+A YDL+NW GN R  FD+ +T+QD+ E +   F          S MC
Sbjct: 217 PDHLKVVATAKHFAGYDLENWGGNSRLGFDASITQQDLAEYYTPQFLAASRYAKARSFMC 276

Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
           SYN VNG+P+C+   LL   +R +W+F  +GY+ SDCD++  +   H + ++ +  A A 
Sbjct: 277 SYNSVNGVPSCSSSFLLQTLLRDNWDFPEYGYVSSDCDAVYNVFNPHGYASN-QSAAAAD 335

Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLG 358
            L+AG D+DCG  Y      +  +G +   +I+ S+  LY  L++LGYFDG   +Y+ LG
Sbjct: 336 SLRAGTDIDCGQTYPWNLNQSFIEGSVTRGEIERSIVRLYSNLVKLGYFDGDKSEYRQLG 395

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
            N++       ++ EAA +GIVLLKND G LPL + ++K++AL+GP ANAT+ + GNY G
Sbjct: 396 WNDVVTTDAWNISYEAAVEGIVLLKND-GILPL-SKHVKSIALIGPWANATEQLQGNYYG 453

Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
           T     +P+ G       +NYA G  +I+         A+ AAK +D  V + G+D ++E
Sbjct: 454 TAPYLITPLQGASDAGYKVNYALGT-NILGNTTEGFADALSAAKKSDVIVYLGGIDNTIE 512

Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
           AEG DR+++  PG Q +LI +++   K P+ ++ M  G VD +  K N K+ +++W GYP
Sbjct: 513 AEGTDRMNVTWPGNQLDLIQQLSQTGK-PLVVLQMGGGQVDSSSIKANSKVNALVWGGYP 571

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVN-NFPGRTYKFFDGP 596
           G+ GG AI D++ GK  P GRL  T Y A Y  + P T M LRP   + PG+TY ++ G 
Sbjct: 572 GQSGGTAIFDILSGKRVPAGRLVTTQYPAEYATQFPATDMNLRPDGASNPGQTYMWYTGT 631

Query: 597 VVYPFGYGLSYTQFK 611
            VY FGYGL YT FK
Sbjct: 632 PVYDFGYGLFYTTFK 646


>gi|115387056|ref|XP_001210069.1| predicted protein [Aspergillus terreus NIH2624]
 gi|114191067|gb|EAU32767.1| predicted protein [Aspergillus terreus NIH2624]
          Length = 908

 Score =  430 bits (1106), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 276/713 (38%), Positives = 394/713 (55%), Gaps = 57/713 (7%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD  L   ER   LV+ +TL EK+  + D A G  RLGLP YEWW+EA HGV        
Sbjct: 163 CDTSLSIAERVNSLVKSLTLEEKILNLVDAAAGSTRLGLPFYEWWNEATHGV-------G 215

Query: 75  SPPGTHFDSEVPG---ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
           S PG  F S+      ATSFP  IL  ASF+ +L +KI + +  E RA  N G +G  FW
Sbjct: 216 SAPGVQFTSKPANFSYATSFPAPILIAASFDNALIRKIAEVIGKEGRAFANNGFSGFDFW 275

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
           +PNIN  RDPRWGR  ETPGED +V   Y  N++ GLQ           D +  ++ A C
Sbjct: 276 APNINGFRDPRWGRGQETPGEDTFVAQNYIRNFIPGLQ---------GDDPKNKQVIATC 326

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KHYA YDL+      R+  +   T+QD+ + F+ PF+ CV + DV S+MCSYN V+GIP 
Sbjct: 327 KHYAVYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPA 382

Query: 252 CADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           CA+  LL++ +R  W F+    Y+VSDC+++  I + H F  DT+E A A  L AG+DL+
Sbjct: 383 CANEYLLDEVLRKHWGFNADYHYVVSDCNAVTDIWQYHNF-TDTEEAAAAVALNAGVDLE 441

Query: 309 CGDYY--TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQ 366
           CG  Y   N ++ A Q    A   +D SL  LY  L  +G+FDG  +Y +L  +++  P 
Sbjct: 442 CGSSYLKLNESLAANQTSVKA---MDQSLARLYSALFTIGFFDGG-KYDHLDFSDVSIPA 497

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGN-IKTLALVGPHANATKAMIGNYEGTPCRYTS 425
              LA EAA +G+ LLKND G LPL++ +  K++A++GP ANAT  M G Y G      S
Sbjct: 498 AQALAYEAAVEGMTLLKND-GLLPLHSQHKYKSVAVIGPFANATTQMQGGYSGNAPYLIS 556

Query: 426 PMDGFYA-YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           P+  F + +   +NYA G A I  QN +   A++ AAK +D  V + G+D S+E+E  DR
Sbjct: 557 PLVAFESDHRWKVNYAVGTA-INDQNTTGFEASLAAAKKSDLIVYLGGIDNSIESETIDR 615

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
             L  PG Q +LI  +++ +K P+ +V    G VD +    N  I++++W GYP + GG 
Sbjct: 616 TSLAWPGNQLDLIKSLSNLSK-PMVVVQFGGGQVDDSALLENKDIQALIWAGYPSQSGGT 674

Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPF 601
           A+ D++ GK +P GRLP+T Y A+Y  +I    + LRP   ++ PGRTYK++ G  V PF
Sbjct: 675 ALLDILVGKRSPAGRLPVTQYPASYADQINIFDINLRPNSKDSHPGRTYKWYTGKPVIPF 734

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           G+GL YT+FK+                ++  +  Y++      C       +K      T
Sbjct: 735 GHGLHYTKFKFGW--------------EETLNREYSIQELVASCQRSSGGPIKDNTPFTT 780

Query: 662 FQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVFIAAGQSAKVG 712
            +  V N+G      V +++  SK  G A    K ++ Y+R+   A  S +V 
Sbjct: 781 VKARVRNVGHETSDYVSLLFLSSKNAGPAPRPNKSLVSYKRLHNIAPGSDRVA 833


>gi|310792973|gb|EFQ28434.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Glomerella graminicola M1.001]
          Length = 728

 Score =  430 bits (1105), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 244/589 (41%), Positives = 344/589 (58%), Gaps = 34/589 (5%)

Query: 32  MTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHF--DSEVPG-A 88
           M++ EKV+ + D + GV  LGLP + WW+E LHGV F        PG  F  DSE  G A
Sbjct: 1   MSVEEKVRNLVDASAGVKSLGLPPHGWWNEGLHGVGF-------SPGVLFAQDSEPFGYA 53

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLE 148
           TSFP  ILT ASF++ L+  IGQ +  E RA  N G AG  FW+PN+N  RDPRWGR  E
Sbjct: 54  TSFPLPILTAASFDDDLFNAIGQVIGREGRAFSNYGYAGFNFWTPNMNAFRDPRWGRGQE 113

Query: 149 TPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRF 208
           TPGED  VV  Y  +YV GLQ          SD     I A CKH+AAYD++     + +
Sbjct: 114 TPGEDVLVVSNYVQSYVTGLQ---------GSDPTDKVIIAACKHFAAYDIETARRANNY 164

Query: 209 HFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNF 268
           +     T+QD+Q+ ++  F  CV +  V +VMCSYN V+GIP C+   LL + +R  W F
Sbjct: 165 N----PTQQDLQDYYLPAFRRCVRDSHVGTVMCSYNSVDGIPACSSEYLLKEVLRDTWGF 220

Query: 269 ---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGK 325
              + ++VSDC ++  +   H F N T++DA +  + AG DL+CG  Y +   G++   +
Sbjct: 221 TNDYQFVVSDCGAVTDVWLLHNFTN-TEQDAASVSMAAGTDLECGSSYLHLN-GSLADKQ 278

Query: 326 IAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKND 385
           + +  +D +L  LY  L  +GYFDGS  + +LG +++      ++A EAAR G+ LLKND
Sbjct: 279 VTQERVDEALTRLYKALFTVGYFDGS-SHSSLGWSDVSTIDAQQIACEAARAGMTLLKND 337

Query: 386 NGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV-INYAPGCA 444
            G LPL  G  K++AL+GP ANAT  M GNY G      SP+  F   S + +NYA G  
Sbjct: 338 -GVLPLADGKYKSVALIGPFANATTQMQGNYFGRAPFVRSPLWAFTQQSSLQVNYAAGT- 395

Query: 445 DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAA 504
           DI   ++S    A+ AAKN+D  +   G+D ++EAE  DRV +  PG Q +LI++++   
Sbjct: 396 DINSTSDSGFADALAAAKNSDIVIFCGGIDTTIEAETLDRVSITWPGNQLDLISQLSMLG 455

Query: 505 KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW 564
           K P+ +     G VD     +N  + ++ W G PG+ GG A+ D++ GK +  GRLP T 
Sbjct: 456 K-PLVVAQFGGGQVDDTALVDNANVNALFWAGLPGQAGGLAMYDLVVGKASFAGRLPTTQ 514

Query: 565 YEANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKY 612
           Y A+Y   +   ++ LRP   FPGRTYK++ G  V+PFG+GL YT+F +
Sbjct: 515 YPASYADLVSIFNINLRPNGTFPGRTYKWYIGEPVFPFGFGLHYTKFNF 563


>gi|2791278|emb|CAA93248.1| beta-xylosidase [Trichoderma reesei]
 gi|340519464|gb|EGR49702.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
          Length = 797

 Score =  429 bits (1103), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 277/760 (36%), Positives = 405/760 (53%), Gaps = 57/760 (7%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD+   Y ERA+ L+   TL E +    +   GVPRLGLP Y+ W+EALHG+    R   
Sbjct: 63  CDSSAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLD---RANF 119

Query: 75  SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
           +  G  F+     ATSFP  ILTTA+ N +L  +I   +ST+ARA  N G  GL  ++PN
Sbjct: 120 ATKGGQFE----WATSFPMPILTTAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPN 175

Query: 135 INVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           +N  R P WGR  ETPGED + +   Y   Y+ G+Q   GV      D   LK++A  KH
Sbjct: 176 VNGFRSPLWGRGQETPGEDAFFLSSAYTYEYITGIQG--GV------DPEHLKVAATVKH 227

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           +A YDL+NW    R  FD+ +T+QD+ E +   F          S+MC+YN VNG+P+CA
Sbjct: 228 FAGYDLENWNNQSRLGFDAIITQQDLSEYYTPQFLAAARYAKSRSLMCAYNSVNGVPSCA 287

Query: 254 DPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
           +   L   +R  W F   GY+ SDCD++  +   H +    +  A A  L+AG D+DCG 
Sbjct: 288 NSFFLQTLLRESWGFPEWGYVSSDCDAVYNVFNPHDYA-SNQSSAAASSLRAGTDIDCGQ 346

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELA 371
            Y      +   G+++  +I+ S+  LY  L+RLGYFD   QY++LG  ++       ++
Sbjct: 347 TYPWHLNESFVAGEVSRGEIERSVTRLYANLVRLGYFDKKNQYRSLGWKDVVKTDAWNIS 406

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            EAA +GIVLLKND G LPL+   ++++AL+GP ANAT  M GNY G      SP++   
Sbjct: 407 YEAAVEGIVLLKND-GTLPLSK-KVRSIALIGPWANATTQMQGNYYGPAPYLISPLEAAK 464

Query: 432 AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPG 491
                +N+  G  +I   + +    AI AAK +DA + + G+D ++E EG DR D+  PG
Sbjct: 465 KAGYHVNFELGT-EIAGNSTTGFAKAIAAAKKSDAIIYLGGIDNTIEQEGADRTDIAWPG 523

Query: 492 FQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIF 551
            Q +LI ++++  K P+ ++ M  G VD +  K+N K+ S++W GYPG+ GG A+ D++ 
Sbjct: 524 NQLDLIKQLSEVGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILS 582

Query: 552 GKYNPGGRLPITWYEANYV-KIPYTSMPLRP-VNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
           GK  P GRL  T Y A YV + P   M LRP   + PG+TY ++ G  VY FG GL YT 
Sbjct: 583 GKRAPAGRLVTTQYPAEYVHQFPQNDMNLRPDGKSNPGQTYIWYTGKPVYEFGSGLFYTT 642

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
           FK  +AS PKS+              YT     P                FTF+  ++N 
Sbjct: 643 FKETLASHPKSLKFNTSSILSAPHPGYTYSEQIP---------------VFTFEANIKNS 687

Query: 670 GKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACKSLKIVDN 726
           GK +     M++ +    G A    K ++G++R+  I  G S+K+   +    +L  VD+
Sbjct: 688 GKTESPYTAMLFVRTSNAGPAPYPNKWLVGFDRLADIKPGHSSKLSIPI-PVSALARVDS 746

Query: 727 AANSLLASGAHTI-------------LVGEGVGGVSFPLQ 753
             N ++  G + +             LVGE V   ++PL+
Sbjct: 747 HGNRIVYPGKYELALNTDESVKLEFELVGEEVTIENWPLE 786


>gi|322512556|gb|ADX05682.1| putative carbohydrate-active enzyme [uncultured organism]
          Length = 717

 Score =  429 bits (1103), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 259/763 (33%), Positives = 406/763 (53%), Gaps = 106/763 (13%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           ++D  + D    + ERA+ LV  MTL EKV Q    A  + RLG+P Y +W+EALHGV+ 
Sbjct: 1   MTDKAWLDETKTFEERAQALVCEMTLEEKVFQTLFNAPAIERLGVPAYNYWNEALHGVAR 60

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-- 126
            G                 AT FP  I   ASF+E L  ++  T+STEARA +N+     
Sbjct: 61  AGV----------------ATVFPQAIGLAASFDEELLGQVADTISTEARAKFNMQQKFG 104

Query: 127 ------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
                 GLTFWSPN+N+ RDPRWGR  ET GEDP++ GR  ++++RG+Q           
Sbjct: 105 DRDIYKGLTFWSPNVNIFRDPRWGRGHETFGEDPFLSGRLGVSFIRGMQG---------D 155

Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
           D R +K++AC KH+A +       + R  F++ V+EQD++ET++  F  CV E  V +VM
Sbjct: 156 DERYMKVAACAKHFAVHSGPE---DQRHSFNAVVSEQDLRETYLPAFHACVTEAGVEAVM 212

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
            +YNR NG   C   KLL   +RG+W F G++ SDC +++   E H  +   +E+ VA  
Sbjct: 213 GAYNRTNGEACCGSKKLLVDILRGEWGFRGHVTSDCWALKDFHEFH-MVTKNQEETVALA 271

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLG 358
           + +G DL+CG+ Y +  + AV+ G + E+ ID ++  L+   M+LG FD S +  Y  +G
Sbjct: 272 MNSGCDLNCGNLYVHL-LQAVRDGLVEESVIDRAVTRLFTTRMKLGLFDRSEEVPYNGIG 330

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
            + +    + +L  EA+R+ + LLKN +G LPL+   ++T+ +VGP+A+  KA++GNYEG
Sbjct: 331 YDRVDTEANRKLNREASRRTVCLLKNADGLLPLDISKLRTIGVVGPNADNRKALVGNYEG 390

Query: 419 TPCRYTSPMDGFYAYS----KVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATV 468
           T   Y + +DG    +    +V+ Y+ GC         + Q N  I  A   A+ +D  +
Sbjct: 391 TASEYVTVLDGIRELAGDDVRVV-YSEGCHLFRDRVQGLGQPNDRIAEARAVAELSDVVI 449

Query: 469 IVAGLDLSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
            V GLD  +E E           D+ +L LPG Q E++  + ++ K PV LV++   A+ 
Sbjct: 450 AVMGLDPGLEGEEGDQGNEFASGDKPNLELPGLQGEVLKALVESGK-PVVLVLLGGSALA 508

Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
           I +A+ +  + +IL   YPG +GGRA+ADV+FG+  P G+LP+T+Y  +     +T   +
Sbjct: 509 IPWAEEH--VPAILDAWYPGAQGGRAVADVLFGRACPEGKLPVTFYRTSEELPAFTDYSM 566

Query: 580 RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
           +       RTY++   P +YPFGYGLSYT ++    ++  SVD                 
Sbjct: 567 K------NRTYRYMKQPALYPFGYGLSYTSWELTNTTAEGSVD----------------- 603

Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYE 699
                      D V C+         + N G M G++ V VY K P   G +  Q+ G  
Sbjct: 604 -----------DGVVCRAV-------LRNTGAMAGAQTVQVYVKAPLATGPN-AQLKGLR 644

Query: 700 RVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           ++ +  G+SA+V  +++  ++  + +     +L  G + I +G
Sbjct: 645 KIRLQPGESAEVAISLDK-EAFGVYNEKGLRVLLPGEYKIYIG 686


>gi|291537442|emb|CBL10554.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
           M50/1]
          Length = 710

 Score =  429 bits (1102), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 259/754 (34%), Positives = 392/754 (51%), Gaps = 109/754 (14%)

Query: 21  YPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTH 80
           Y +RA +LV +MTL EKV Q    A  V RL +  Y WW+EALHGV+  G          
Sbjct: 13  YRKRAAELVGKMTLEEKVAQTLYQAPAVERLNIKAYNWWNEALHGVARAGT--------- 63

Query: 81  FDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG--------LTFWS 132
                  AT FP  I   A+F+E L +++G  VSTEARA +N+   G        LTFW+
Sbjct: 64  -------ATVFPQAIGLAATFDEDLLEQVGDAVSTEARAKFNMQQEGKDTDIYKGLTFWA 116

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PN+N+ RDPRWGR  ET GEDPY+  R  + Y+ GLQ           D   LK +AC K
Sbjct: 117 PNVNIFRDPRWGRGHETFGEDPYLTSRLGVRYIEGLQG---------HDENYLKAAACAK 167

Query: 193 HYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           H+A +      G +  R  FD+ VTEQD++ET++  FE CV EG V +VM +YNR NG+P
Sbjct: 168 HFAVHS-----GPEAVRHEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMGAYNRTNGVP 222

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
            C + +LL   +R +W F G++ SDC +I+   E H  +  T  ++VA  +  G DL+CG
Sbjct: 223 CCGNKRLLIDILRKEWGFSGHVTSDCWAIRDFHEGH-HVTGTAIESVAMAMNNGCDLNCG 281

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHI 368
             +  F + AV+QG + E  +D ++  L++  M+LG FD   +  Y  +      + +  
Sbjct: 282 TLF-GFLVQAVRQGLVKEERLDEAVTNLFMARMKLGVFDKKEENPYDKIPYLAADSREMK 340

Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
           +L    AR+ +VLLKN    LPL+   IKT+ ++GP+A++ +A++GNYEGT  RY + ++
Sbjct: 341 KLNEAVARRTVVLLKNKEHILPLDKNKIKTIGVIGPNADSRRALVGNYEGTASRYITVLE 400

Query: 429 GFYAY---SKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
           G   Y      + Y+ GC         + Q N  +   +   K +D  V V GLD  +E 
Sbjct: 401 GIEDYVGDDVRVLYSEGCHLYKDRTSNLAQENDRMSEVLGVCKESDVVVAVLGLDAGIEG 460

Query: 480 E---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK 530
           E           D+ DL LPG Q E++       K PV LV++S  A+ +N+A  +  + 
Sbjct: 461 EEGDAGNEYGSGDKPDLNLPGLQEEILEAAVSCGK-PVILVLLSGSALAVNWA--DEHVD 517

Query: 531 SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTY 590
           +I+   YPG  GG AIAD++FG+ NP G+LP+T+Y           +P     +  GRTY
Sbjct: 518 AIVQGWYPGARGGAAIADILFGEANPEGKLPVTFYRTT------EELPDFEDYSMQGRTY 571

Query: 591 KFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
           ++ +   +YPFGYGLSYT++ Y+        +++  + +       T+G           
Sbjct: 572 RYMEQEALYPFGYGLSYTEYAYQ--------NVRFLEQEPVVSEGVTIG----------- 612

Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH--IKQVIGYERVFIAAGQS 708
                        + V+N GKMDG+E V VY K       H  +K+++   ++ + AG+ 
Sbjct: 613 -------------LSVKNTGKMDGTETVQVYVKAEHSKMPHGQLKKIV---KLPLCAGEE 656

Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            ++   + + ++  + D     +L SG   I VG
Sbjct: 657 KEINIRLES-EAFMLYDENGEKILPSGHFEIFVG 689


>gi|240146254|ref|ZP_04744855.1| beta-glucosidase [Roseburia intestinalis L1-82]
 gi|257201613|gb|EEU99897.1| beta-glucosidase [Roseburia intestinalis L1-82]
 gi|291539969|emb|CBL13080.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
           XB6B4]
          Length = 710

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 259/754 (34%), Positives = 392/754 (51%), Gaps = 109/754 (14%)

Query: 21  YPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTH 80
           Y +RA +LV +MTL EKV Q    A  V RL +  Y WW+EALHGV+  G          
Sbjct: 13  YRKRAAELVGKMTLEEKVAQTLYQAPAVERLNIKAYNWWNEALHGVARAGT--------- 63

Query: 81  FDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG--------LTFWS 132
                  AT FP  I   A+F+E L +++G  VSTEARA +N+   G        LTFW+
Sbjct: 64  -------ATVFPQAIGLAATFDEDLLEQVGDAVSTEARAKFNMQQEGKDTDIYKGLTFWA 116

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PN+N+ RDPRWGR  ET GEDPY+  R  + Y+ GLQ           D   LK +AC K
Sbjct: 117 PNVNIFRDPRWGRGHETFGEDPYLTSRLGVRYIEGLQG---------HDENYLKAAACAK 167

Query: 193 HYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           H+A +      G +  R  FD+ VTEQD++ET++  FE CV EG V +VM +YNR NG+P
Sbjct: 168 HFAVHS-----GPEAVRHEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMGAYNRTNGVP 222

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
            C + +LL   +R +W F G++ SDC +I+   E H  +  T  ++VA  +  G DL+CG
Sbjct: 223 CCGNKRLLIDILRKEWGFSGHVTSDCWAIRDFHEGH-HVTGTAIESVAMAMNNGCDLNCG 281

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHI 368
             +  F + AV+QG + E  +D ++  L++  M+LG FD   +  Y  +      + +  
Sbjct: 282 TLF-GFLVQAVRQGLVKEERLDEAVTNLFMARMKLGVFDKKEENPYDKIPYLAADSREMK 340

Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
           +L    AR+ +VLLKN    LPL+   IKT+ ++GP+A++ +A++GNYEGT  RY + ++
Sbjct: 341 KLNEAVARRTVVLLKNKEHILPLDKNKIKTVGVIGPNADSRRALVGNYEGTASRYITVLE 400

Query: 429 GFYAY---SKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
           G   Y      + Y+ GC         + Q N  +   +   K +D  V V GLD  +E 
Sbjct: 401 GIEDYVGDDVRVLYSEGCHLYKDRTSNLAQENDRMSEVLGVCKESDVVVAVLGLDAGIEG 460

Query: 480 E---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK 530
           E           D+ DL LPG Q E++       K PV LV++S  A+ +N+A  +  + 
Sbjct: 461 EEGDAGNEYGSGDKPDLNLPGLQEEILEAAVSCGK-PVILVLLSGSALAVNWA--DEHVD 517

Query: 531 SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTY 590
           +I+   YPG  GG AIAD++FG+ NP G+LP+T+Y           +P     +  GRTY
Sbjct: 518 AIVQGWYPGARGGAAIADILFGEANPEGKLPVTFYRTT------EELPDFEDYSMQGRTY 571

Query: 591 KFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
           ++ +   +YPFGYGLSYT++ Y+        +++  + +       T+G           
Sbjct: 572 RYMEQEALYPFGYGLSYTEYAYQ--------NVRFLEQEPVVSEGVTIG----------- 612

Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH--IKQVIGYERVFIAAGQS 708
                        + V+N GKMDG+E V VY K       H  +K+++   ++ + AG+ 
Sbjct: 613 -------------LSVKNTGKMDGTETVQVYVKAEHSKMPHGQLKKIV---KLPLCAGEE 656

Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            ++   + + ++  + D     +L SG   I VG
Sbjct: 657 KEINIRLES-EAFMLYDENGEKILPSGHFEIFVG 689


>gi|297738404|emb|CBI27605.3| unnamed protein product [Vitis vinifera]
          Length = 581

 Score =  428 bits (1100), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 213/401 (53%), Positives = 271/401 (67%), Gaps = 46/401 (11%)

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
           DVEG E   D +SRPLK+S+CCKHYA YD+D+W           V+EQDM+ETF  PFE 
Sbjct: 4   DVEGTENVTDLNSRPLKVSSCCKHYATYDIDSW---------LNVSEQDMKETFFSPFE- 53

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
                                            R +W+ HGYIVSDC  ++ IV++  +L
Sbjct: 54  ---------------------------------RDEWDLHGYIVSDCYGLEVIVDNQNYL 80

Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
           N++K DAVA+ L+AGLDL+CG YYT+    +V  GK+++ ++D +L+ +Y++LMR+GYFD
Sbjct: 81  NESKVDAVAKTLQAGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFD 140

Query: 350 GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT 409
           G P Y++LG  +IC   HIELA EAARQGIVLLKND   LPL  G  K L LVGPHANAT
Sbjct: 141 GIPAYESLGLKDICAADHIELAREAARQGIVLLKNDYEVLPLKPG--KKLVLVGPHANAT 198

Query: 410 KAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVI 469
           + MIGNY G P +Y SP++ F A   V  YA GC D  C N++    A +AAK A+ T+I
Sbjct: 199 EVMIGNYAGLPYKYVSPLEAFSAIGNV-TYATGCLDASCSNDTYFSEAKEAAKFAEVTII 257

Query: 470 VAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
             G DLS+EAE  DRVD LLPG QTELI +VA+ + GPV LV++S   +DI FAKNNP+I
Sbjct: 258 FVGTDLSIEAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRI 317

Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV 570
            +ILWVG+PGE+GG AIADV+FGKYNPGGRLP+TWYEA+YV
Sbjct: 318 SAILWVGFPGEQGGHAIADVVFGKYNPGGRLPVTWYEADYV 358


>gi|334187562|ref|NP_196532.2| Glycosyl hydrolase family protein [Arabidopsis thaliana]
 gi|332004052|gb|AED91435.1| Glycosyl hydrolase family protein [Arabidopsis thaliana]
          Length = 526

 Score =  427 bits (1097), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 222/483 (45%), Positives = 305/483 (63%), Gaps = 22/483 (4%)

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEAD 330
           YIVSDCDS+  +  S  +   T E+A A+ + AGLDL+CG +  N T  AV++G I EA 
Sbjct: 45  YIVSDCDSLGILYGSQHY-TKTPEEAAAKSILAGLDLNCGSFLGNHTENAVKKGLIDEAA 103

Query: 331 IDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNG 387
           I+ ++   +  LMRLG+FDG+P+   Y  LG  ++C  ++ ELA E ARQGIVLLKN  G
Sbjct: 104 INKAISNNFATLMRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSAG 163

Query: 388 ALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIV 447
           +LPL+   IKTLA++GP+AN TK MIGNYEG  C+YT+P+ G         Y  GC ++ 
Sbjct: 164 SLPLSPSAIKTLAVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNVT 223

Query: 448 CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGP 507
           C    +  A   AA +ADATV+V G D ++E E  DR+DL LPG Q EL+ +VA AA+GP
Sbjct: 224 CTEADLDSAKTLAA-SADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGP 282

Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
           V LVIMS G  DI FAKN+ KI SI+WVGYPGE GG AIADVIFG++NP G+LP+TWY  
Sbjct: 283 VVLVIMSGGGFDITFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNPSGKLPMTWYPQ 342

Query: 568 NYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIK 624
           +YV K+P T+M +RP   N + GRTY+F+ G  VY FG GLSYT F +++  +PK V + 
Sbjct: 343 SYVEKVPMTNMNMRPDKSNGYLGRTYRFYIGETVYAFGDGLSYTNFSHQLIKAPKFVSLN 402

Query: 625 LDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD-----YKFTFQIEVENMGKMDGSEVVM 679
           LD+ Q CR          P C ++      C+        F  Q++V N+G  +G+E V 
Sbjct: 403 LDESQSCRS---------PECQSLDAIGPHCEKAVGERSDFEVQLKVRNVGDREGTETVF 453

Query: 680 VYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
           +++ PP + G+  KQ++G+E++ +   +   V F ++ CK L +VD      LA G H +
Sbjct: 454 LFTTPPEVHGSPRKQLLGFEKIRLGKKEETVVRFKVDVCKDLGVVDEIGKRKLALGHHLL 513

Query: 740 LVG 742
            VG
Sbjct: 514 HVG 516


>gi|23304843|emb|CAD48309.1| beta-xylosidase B [Clostridium stercorarium]
          Length = 715

 Score =  425 bits (1093), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 271/759 (35%), Positives = 400/759 (52%), Gaps = 104/759 (13%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D    + ERAKDLV RMT+ EKV QM   +  + RLG+P Y WW+EALHGV+  G   
Sbjct: 7   YLDPSYSFEERAKDLVSRMTIEEKVSQMLYNSPAIERLGIPAYNWWNEALHGVARAGT-- 64

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   A+F+E L  K+   +STE RA Y+  +        
Sbjct: 65  --------------ATMFPQAIGMAATFDEELIYKVADVISTEGRAKYHASSKKGDRGIY 110

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLTFWSPNIN+ RDPRWGR  ET GEDPY+  R  + +V+GLQ          +  + L
Sbjct: 111 KGLTFWSPNINIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQG---------NHPKYL 161

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K    CK+   + +     + R  F++ V+++D+ ET++  F+  V E  V SVM +YNR
Sbjct: 162 KAGGMCKNILPFTV--VPESLRHEFNAVVSKKDLYETYLPAFKALVQEAKVESVMGAYNR 219

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
            NG P C    LL+  +RG+W F G++VSDC +I+     H  +  T  ++ A  ++ G 
Sbjct: 220 TNGEPCCGSKTLLSDILRGEWGFKGHVVSDCWAIRDF-HMHHHVTATAPESAALAVRNGC 278

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNIC 363
           DL+CG+ + N  + A+++G I E +ID ++  L I  M+LG FD   Q  Y ++     C
Sbjct: 279 DLNCGNMFGNLLI-ALKEGLITEEEIDRAVTRLMITRMKLGMFDPEDQVPYASISSFVDC 337

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
             +H ELA + A++ IVLLKND G LPL+   I+++A++GP+A++ +A+IGNYEGT   Y
Sbjct: 338 K-EHRELALDVAKKSIVLLKND-GLLPLDRKKIRSIAVIGPNADSRQALIGNYEGTASEY 395

Query: 424 TSPMDGFYAYSK---VINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
            + +DG    +     I Y+ GC       + + +    I  A+  A++AD  ++  GLD
Sbjct: 396 VTVLDGIREMAGDDVRIYYSVGCHLYKDRVENLGEPGDRIAEAVTCAEHADVVIMCLGLD 455

Query: 475 LSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
            ++E E           D+ DL LPG Q EL+  V    K P+ LV+++  A+ + +A  
Sbjct: 456 STIEGEEMHESNIYGSGDKPDLNLPGQQQELLEAVYATGK-PIVLVLLTGSALAVTWADE 514

Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
           +  I +IL   YPG  GGRAIA V+FG+ NP G+LP+T+Y        +T   +      
Sbjct: 515 H--IPAILNAWYPGALGGRAIASVLFGETNPSGKLPVTFYRTTEELPDFTDYSME----- 567

Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
             RTY+F     +YPFG+GLSYT F Y         D+KL KD        T+   +   
Sbjct: 568 -NRTYRFMKNEALYPFGFGLSYTTFDYS--------DLKLSKD--------TIRAGE--- 607

Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIK--QVIGYERVFI 703
                         F   ++V N GKM G EVV VY K    A   +   Q+ G +RV +
Sbjct: 608 -------------GFNVSVKVTNTGKMAGEEVVQVYIKDLE-ASWRVPNWQLSGMKRVRL 653

Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            +G++A++ F +   + L +V +   S++  G   I VG
Sbjct: 654 ESGETAEITFEIRP-EQLAVVTDEGKSVIEPGEFEIYVG 691


>gi|326202986|ref|ZP_08192853.1| glycoside hydrolase family 3 domain protein [Clostridium
           papyrosolvens DSM 2782]
 gi|325987063|gb|EGD47892.1| glycoside hydrolase family 3 domain protein [Clostridium
           papyrosolvens DSM 2782]
          Length = 712

 Score =  425 bits (1093), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 266/758 (35%), Positives = 394/758 (51%), Gaps = 104/758 (13%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D  L + ERA DLV +MTL EK  Q+   A  V RLG+P Y WW+EALHGV+  G   
Sbjct: 6   YLDKSLSFKERAADLVSKMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   A F++   +KI   ++TE RA YN           
Sbjct: 64  --------------ATVFPQAIGMAAMFDDEFLEKIADVIATEGRAKYNESAKKGDRDIY 109

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            G+TFWSPN+N+ RDPRWGR  ET GEDPY+  R  + +V+GLQ           D + L
Sbjct: 110 KGITFWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQ----------GDGKYL 159

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K +AC KHYA +   +   +DR  FD+ V+++D+ ET++  FE  V E  V S+M +YNR
Sbjct: 160 KTAACAKHYAVH---SGPEDDRHFFDAIVSQKDLYETYLPAFEALVKEAKVESIMGAYNR 216

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
            NG P      LL   +R  W F G++VSDC +I+   E H  +  T  ++VA  LK+G 
Sbjct: 217 TNGEPCNGSKTLLKDILRDGWGFDGHVVSDCWAIKDFHEGHG-VTKTPTESVALALKSGC 275

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNP 365
           DL+CG+ Y    + A+++G I E DID +   L    M+LG FD   ++ N+      + 
Sbjct: 276 DLNCGNMYL-LILLALKEGLITEEDIDRAAIRLMTTRMKLGMFDDDCEFDNIPYELNDSA 334

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H +++ EAA++ +VLLKND G LPL++  IK +A++GP+A+++ A+  NY GTP +  +
Sbjct: 335 EHNKISLEAAKKSMVLLKND-GLLPLDSKKIKNVAVIGPNADSSLALRANYSGTPSQNVT 393

Query: 426 PMDGF---YAYSKVINYAPGCA------DIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
            ++G     + +  + YA G        + + Q +  +  A+ AA+ +D  V+  GLD S
Sbjct: 394 IIEGIRKRVSENTRVWYAMGSHLFLNRDEDLAQPDDRLKEAVSAAERSDVVVLCLGLDAS 453

Query: 477 VEAE-----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
           VE E           G D+ DL LP  Q  L+N V    K P  + ++S  A+ I  A +
Sbjct: 454 VEGEQNDQGTVILDAGGDKADLNLPESQRNLLNAVLATGK-PTIVALLSGSALSIGDAAD 512

Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
             K  +I+   YPG  GG A A++IFG Y+P GRLP+T+Y++     P+    +      
Sbjct: 513 --KAAAIVQCWYPGAIGGLAFAEMIFGDYSPAGRLPVTFYKSTEELPPFADYSME----- 565

Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
             RTYKF  G  +YPFG+GLSYT F+Y     P++V+                G N    
Sbjct: 566 -NRTYKFMKGDALYPFGFGLSYTSFEYSNMVCPQTVN---------------NGEN---- 605

Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIK-QVIGYERVFIA 704
                          +  ++V+N G +D  EVV VY K    +    K  + G++R+ + 
Sbjct: 606 --------------LSVSVDVQNTGSVDSDEVVQVYIKDMDASVRVPKYSLCGFKRIHLK 651

Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +G+   V F + A  ++ IVD A    + +G  T+  G
Sbjct: 652 SGEKKTVTFEV-ASNAMSIVDEAGKRHIENGEFTLYAG 688


>gi|323447708|gb|EGB03620.1| hypothetical protein AURANDRAFT_72703 [Aureococcus anophagefferens]
          Length = 744

 Score =  425 bits (1093), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 270/744 (36%), Positives = 381/744 (51%), Gaps = 114/744 (15%)

Query: 5   IKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALH 64
           +       P+CDA L    RA D V RMT+PEK+  +      +  LGLP Y WWSEA  
Sbjct: 30  LNATFEALPFCDATLAIDLRAADAVSRMTIPEKIDALDTKTGPIASLGLPAYNWWSEASS 89

Query: 65  GVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
           GV  +G R    P T F        ++P  + T  SFN +LW+  G  +  EARA+ N G
Sbjct: 90  GV--MGSR----PTTKF--------AYP--VTTAMSFNRTLWRATGAAIGREARALMNAG 133

Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
            A  T+W+P +N+ R+PRWGR +E PGEDPY+ G YA  +V G Q      YH       
Sbjct: 134 AAYSTYWAPVVNLAREPRWGRNIEVPGEDPYLTGEYATEFVGGFQAAPEDPYH------- 186

Query: 185 LKISACCKHYAAYDLDNW-----EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
           L+ SACCKHY A +L+N      E  DR H DS VT++D+ +++++PF+ CV +G VSS+
Sbjct: 187 LQASACCKHYVANELENTRQPDGEQWDRQHVDSNVTQRDLVDSYMVPFQACVEKGKVSSL 246

Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
           MCSYN VNG+P+CA+  LL    R  W+F GYI SDCD+   + ++H +   T E+AVA 
Sbjct: 247 MCSYNAVNGVPSCANDWLLRTVARDAWHFDGYITSDCDADSNVYDAHHYAA-TPEEAVAD 305

Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS-------- 351
           VLKAG D+DC  +       A+ +G I EAD+D  L  L+ V +RLG+FD S        
Sbjct: 306 VLKAGTDVDCQSFVGQHARSALDKGLITEADMDARLVNLFKVRLRLGHFDLSFDAAKPRG 365

Query: 352 PQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKA 411
           P  +      +C+  H++ + E   Q   LLKND GALPL      T A+VGP+A  +KA
Sbjct: 366 PLDEIDADAVVCSDAHLDASMEGLAQSATLLKND-GALPLKPSG--TAAVVGPNALLSKA 422

Query: 412 MIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVA 471
             G Y         P D                                   ADA V+  
Sbjct: 423 DAGYY--------GPTDA----------------------------------ADAVVLAV 440

Query: 472 GLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDIN--FAKNNPKI 529
           G DL+  AEGKD   ++    Q ELI+ VA A+  PV +V+ SA  +D+    A+++ K+
Sbjct: 441 GTDLTWAAEGKDATSIVFTAAQLELIDAVATASATPVVVVVFSATPLDLTPLLARSDGKV 500

Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR-------- 580
            +++ VG P     + + D+++G+ +  GR   T Y A Y  +I      +R        
Sbjct: 501 GAVVHVGQPSVT-VKGLGDLLYGRRSFAGRAVQTVYPAAYADQISIFDFNMRPGPSAFAR 559

Query: 581 ----------PVNNFPGRTYKFF-DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQ 629
                     P    PGRTY+F+ D PVV PFG+GLSYT F Y V S+P +VD  L   +
Sbjct: 560 PDCATNESACPRGTNPGRTYRFYVDEPVV-PFGFGLSYTTFAYAVRSAPTTVD--LAPLR 616

Query: 630 QCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP--GI 687
                      +  P    L DD        T+ ++V N G +D  +VV+ +  PP  G+
Sbjct: 617 AAYAGVAAARGDGGPAFLSLHDDAAAA----TYAVDVTNTGDIDADDVVLGFVTPPGAGV 672

Query: 688 AGTHIKQVIGYERVFIAAGQSAKV 711
            G  +K++ G+ERV + AG++  V
Sbjct: 673 DGVPLKELFGFERVHVKAGETKTV 696


>gi|398406144|ref|XP_003854538.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
 gi|339474421|gb|EGP89514.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
          Length = 884

 Score =  425 bits (1092), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 265/711 (37%), Positives = 382/711 (53%), Gaps = 44/711 (6%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS-FIGRRT 73
           CD  L   +R   L+ +MT+ EK   + D A G+PR+GLP YEWW+EALHGV+   G   
Sbjct: 146 CDTSLSQDDRIAALISQMTVEEKATNLVDGALGLPRIGLPPYEWWNEALHGVAGSRGVSF 205

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
           +SP G+ F      ATSFP  IL  A+F++ L   +   +  EARA  N  ++G  FW+P
Sbjct: 206 DSPNGSDFSY----ATSFPLPILMGAAFDDPLIYDVASIIGKEARAFANYAHSGYDFWTP 261

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           N+N   DPRWGR LE P ED +   RY  + V GLQ  +    H+       +I A CKH
Sbjct: 262 NMNTFLDPRWGRGLEVPTEDSFHAQRYVASLVPGLQGGKEKTDHK-------QIIATCKH 314

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           +A YD++     +R   +   T QD+ E ++  F+ CV + +V S+MCSYN V G+P CA
Sbjct: 315 FAVYDVE----TNRHAQNYEPTPQDLGEYYLPAFKTCVRDVNVGSIMCSYNAVYGVPACA 370

Query: 254 DPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
               L   +R  WNF   + Y+ SDC++++ I   H F  DT+  A A  L AG D +CG
Sbjct: 371 SEYFLQDVLRDQWNFNEPYHYVTSDCEAVKDIWTPHNF-TDTEPAAAAVALNAGTDTNCG 429

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIEL 370
             Y      +V      EA +D SL  LY  L  +GYFDG P+Y  L   ++  P     
Sbjct: 430 TSYLQLNT-SVANNWTTEAQMDISLTRLYNALFTVGYFDGQPEYDGLSFADVSTPFAQAT 488

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A  AA +GI LLKND G LPL   +  ++AL+GP ANAT  M G Y+G      SP+   
Sbjct: 489 AYRAASEGITLLKND-GLLPLKK-SYNSVALIGPWANATTQMQGIYQGIAPYLVSPLAAA 546

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
            A    I++  G A I   N +   +A+ AA++AD  +   G+D S+E E +DR  +  P
Sbjct: 547 QAQWGHISFTNGTA-INSTNTTGFASALSAARDADVIIYAGGIDSSIEKESRDRTSISWP 605

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
           G Q +L+ ++++  K P+ +V    G VD +    N  + S++W GYPG++GG A+ DV+
Sbjct: 606 GNQLDLVQQLSELGK-PLVVVQFGGGQVDDSALLRNKNVNSLVWAGYPGQDGGSALIDVL 664

Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
            GK +P GRL IT Y A+Y+ +I      LRP ++ PGRTYK+++   V PFGYGL YT 
Sbjct: 665 VGKQSPAGRLTITQYPADYINQISLFDPNLRPSDSSPGRTYKWYNKEPVLPFGYGLHYTT 724

Query: 610 FKYKVASSPK-SVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVEN 668
           F++  A +P+ S DI    D      +YT    K   +                 I+V N
Sbjct: 725 FEFDWAKAPQASYDIASLVDSTA---SYTTSPKKNDASPWT-----------ELSIKVHN 770

Query: 669 MGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVF-IAAGQSAKVGFTMN 716
            G +    V +V+ + P  G A    K +  Y R+  ++AG SA++ F+++
Sbjct: 771 SGSLGSDYVGLVFLRTPNAGPAPYPNKWLASYARLHGLSAGASAELSFSLS 821


>gi|76160898|gb|ABA40420.1| Xld [Aspergillus fumigatus]
          Length = 792

 Score =  424 bits (1091), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 267/737 (36%), Positives = 391/737 (53%), Gaps = 41/737 (5%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD      +RA  LV   T  E V   G+ + GVPRLGLP Y+ WSEALHG+  
Sbjct: 57  LSKTLVCDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLD- 115

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
              R N       + E   ATSFP  ILT ++ N +L  +I   ++T+ RA  N+G  GL
Sbjct: 116 ---RANFTD----EGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGL 168

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSRPLKI 187
             ++PNIN  R   WGR  ETPGED Y +   YA  Y+ G+Q   GV      D   LK+
Sbjct: 169 DVYAPNINAFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQG--GV------DPEHLKL 220

Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
            A  KHYA YDL+NW+G+ R   D  +T+Q++ E +   F +   +  V SVMCSYN VN
Sbjct: 221 VATAKHYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVN 280

Query: 248 GIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           G+P+CA+   L   +R  + F   GY+ SDCDS   +   H+F  +    A A  ++AG 
Sbjct: 281 GVPSCANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANIT-GAAADSIRAGT 339

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICN 364
           D+DCG  Y  +   A  + ++  A+I+  +  LY  L+RLGYFDG+   Y++L  N++  
Sbjct: 340 DIDCGTTYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSVYRDLTWNDVVT 399

Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
                ++ EAA +GIVLLKND G LPL   +++++AL+GP  N T  + GNY G      
Sbjct: 400 TDAWNISYEAAVEGIVLLKND-GTLPL-AKSVRSVALIGPWMNVTTQLQGNYFGPAPYLI 457

Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           SP++ F      +NYA G  +I   +      A+ AAK +D  +   G+D ++EAE  DR
Sbjct: 458 SPLNAFQNSDFDVNYAFGT-NISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDR 516

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
           +++  PG Q +LI++++   K P+ ++ M  G VD +  K+N  + S++W GYPG+ GG+
Sbjct: 517 MNITWPGNQLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQ 575

Query: 545 AIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
           A+ D+I GK  P GRL +T Y A Y  + P T M LRP  N PG+TY ++ G  VY FG+
Sbjct: 576 ALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGNNPGQTYMWYTGTPVYEFGH 635

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           GL YT F    AS P +      KD+   +I   +    P  A V    +        F 
Sbjct: 636 GLFYTTFH---ASLPGT-----GKDKTSFNIQDLLTQPHPGFANVEQMPL------LNFT 681

Query: 664 IEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
           + + N GK+      M+++    G A    K ++G++R+       ++         S+ 
Sbjct: 682 VTITNTGKVASDYTAMLFANTTAGPAPYPNKWLVGFDRLASLEPHRSQTMTIPVTIDSVA 741

Query: 723 IVDNAANSLLASGAHTI 739
             D A N +L  G + +
Sbjct: 742 RTDEAGNRVLYPGKYEL 758


>gi|70996610|ref|XP_753060.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
 gi|74672055|sp|Q4WRB0.1|XYND_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|66850695|gb|EAL91022.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
          Length = 792

 Score =  424 bits (1091), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 267/737 (36%), Positives = 391/737 (53%), Gaps = 41/737 (5%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD      +RA  LV   T  E V   G+ + GVPRLGLP Y+ WSEALHG+  
Sbjct: 57  LSKTLVCDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLD- 115

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
              R N       + E   ATSFP  ILT ++ N +L  +I   ++T+ RA  N+G  GL
Sbjct: 116 ---RANFTD----EGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGL 168

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSRPLKI 187
             ++PNIN  R   WGR  ETPGED Y +   YA  Y+ G+Q   GV      D   LK+
Sbjct: 169 DVYAPNINAFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQG--GV------DPEHLKL 220

Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
            A  KHYA YDL+NW+G+ R   D  +T+Q++ E +   F +   +  V SVMCSYN VN
Sbjct: 221 VATAKHYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVN 280

Query: 248 GIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           G+P+CA+   L   +R  + F   GY+ SDCDS   +   H+F  +    A A  ++AG 
Sbjct: 281 GVPSCANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANIT-GAAADSIRAGT 339

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICN 364
           D+DCG  Y  +   A  + ++  A+I+  +  LY  L+RLGYFDG+   Y++L  N++  
Sbjct: 340 DIDCGTTYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSVYRDLTWNDVVT 399

Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
                ++ EAA +GIVLLKND G LPL   +++++AL+GP  N T  + GNY G      
Sbjct: 400 TDAWNISYEAAVEGIVLLKND-GTLPL-AKSVRSVALIGPWMNVTTQLQGNYFGPAPYLI 457

Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           SP++ F      +NYA G  +I   +      A+ AAK +D  +   G+D ++EAE  DR
Sbjct: 458 SPLNAFQNSDFDVNYAFGT-NISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDR 516

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
           +++  PG Q +LI++++   K P+ ++ M  G VD +  K+N  + S++W GYPG+ GG+
Sbjct: 517 MNITWPGNQLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQ 575

Query: 545 AIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
           A+ D+I GK  P GRL +T Y A Y  + P T M LRP  N PG+TY ++ G  VY FG+
Sbjct: 576 ALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGNNPGQTYMWYTGTPVYEFGH 635

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           GL YT F    AS P +      KD+   +I   +    P  A V    +        F 
Sbjct: 636 GLFYTTFH---ASLPGT-----GKDKTSFNIQDLLTQPHPGFANVEQMPL------LNFT 681

Query: 664 IEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
           + + N GK+      M+++    G A    K ++G++R+       ++         S+ 
Sbjct: 682 VTITNTGKVASDYTAMLFANTTAGPAPYPNKWLVGFDRLASLEPHRSQTMTIPVTIDSVA 741

Query: 723 IVDNAANSLLASGAHTI 739
             D A N +L  G + +
Sbjct: 742 RTDEAGNRVLYPGKYEL 758


>gi|292495282|sp|B0XP71.1|XYND_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|159131796|gb|EDP56909.1| beta-xylosidase XylA [Aspergillus fumigatus A1163]
          Length = 792

 Score =  424 bits (1091), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 267/737 (36%), Positives = 391/737 (53%), Gaps = 41/737 (5%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD      +RA  LV   T  E V   G+ + GVPRLGLP Y+ WSEALHG+  
Sbjct: 57  LSKTLVCDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLD- 115

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
              R N       + E   ATSFP  ILT ++ N +L  +I   ++T+ RA  N+G  GL
Sbjct: 116 ---RANFTD----EGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGL 168

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSRPLKI 187
             ++PNIN  R   WGR  ETPGED Y +   YA  Y+ G+Q   GV      D   LK+
Sbjct: 169 DVYAPNINAFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQG--GV------DPEHLKL 220

Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
            A  KHYA YDL+NW+G+ R   D  +T+Q++ E +   F +   +  V SVMCSYN VN
Sbjct: 221 VATAKHYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVN 280

Query: 248 GIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           G+P+CA+   L   +R  + F   GY+ SDCDS   +   H+F  +    A A  ++AG 
Sbjct: 281 GVPSCANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANIT-GAAADSIRAGT 339

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICN 364
           D+DCG  Y  +   A  + ++  A+I+  +  LY  L+RLGYFDG+   Y++L  N++  
Sbjct: 340 DIDCGTTYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSVYRDLTWNDVVT 399

Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
                ++ EAA +GIVLLKND G LPL   +++++AL+GP  N T  + GNY G      
Sbjct: 400 TDAWNISYEAAVEGIVLLKND-GTLPL-AKSVRSVALIGPWMNVTTQLQGNYFGPAPYLI 457

Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           SP++ F      +NYA G  +I   +      A+ AAK +D  +   G+D ++EAE  DR
Sbjct: 458 SPLNAFQNSDFDVNYAFGT-NISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDR 516

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
           +++  PG Q +LI++++   K P+ ++ M  G VD +  K+N  + S++W GYPG+ GG+
Sbjct: 517 MNITWPGNQLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQ 575

Query: 545 AIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
           A+ D+I GK  P GRL +T Y A Y  + P T M LRP  N PG+TY ++ G  VY FG+
Sbjct: 576 ALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGNNPGQTYMWYTGTPVYEFGH 635

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           GL YT F    AS P +      KD+   +I   +    P  A V    +        F 
Sbjct: 636 GLFYTTFH---ASLPGT-----GKDKTSFNIQDLLTQPHPGFANVEQMPL------LNFT 681

Query: 664 IEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
           + + N GK+      M+++    G A    K ++G++R+       ++         S+ 
Sbjct: 682 VTITNTGKVASDYTAMLFANTTAGPAPYPNKWLVGFDRLASLEPHRSQTMTIPVTIDSVA 741

Query: 723 IVDNAANSLLASGAHTI 739
             D A N +L  G + +
Sbjct: 742 RTDEAGNRVLYPGKYEL 758


>gi|347531439|ref|YP_004838202.1| beta-glucosidase [Roseburia hominis A2-183]
 gi|345501587|gb|AEN96270.1| beta-glucosidase [Roseburia hominis A2-183]
          Length = 716

 Score =  423 bits (1087), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 262/760 (34%), Positives = 393/760 (51%), Gaps = 114/760 (15%)

Query: 19  LPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPG 78
           L   E AK LVE+MTL EK+ QM   +  + RL +P Y WW+EALHGV+  G        
Sbjct: 3   LETKEYAKRLVEQMTLEEKISQMRYESPAIERLHIPAYNWWNEALHGVARSGV------- 55

Query: 79  THFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTF 130
                    AT FP  I   A+F+E L +KIG  VSTE RA +   +         GLTF
Sbjct: 56  ---------ATMFPQAIALAATFDEELIEKIGDVVSTEGRAKFEAYSGRGDRGIYKGLTF 106

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           W+PNIN+ RDPRWGR  ET GEDP +  +    Y+RG+Q        +D D   LK +AC
Sbjct: 107 WAPNINIFRDPRWGRGHETYGEDPCLTAKLGCAYIRGIQG-------KDPDH--LKAAAC 157

Query: 191 CKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
            KH+A +      G +  R  FD++V+  D+ +T++  F+ CV +  V +VM +YNRVNG
Sbjct: 158 AKHFAVHS-----GPEALRHEFDAKVSLHDLYDTYLYAFKRCVKDAGVEAVMGAYNRVNG 212

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
            P C    LL   +R  + F G++VSDC +I    E H  +  T E++ A  +  G DL+
Sbjct: 213 EPACGSKTLLQDILREQFGFEGHVVSDCWAILDFHEHHH-VTKTVEESAAMAVNHGCDLN 271

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICNPQH 367
           CG  +   +  A +QG + E  I  ++  L  V +RLG  +  P  Y N+  + +  P+H
Sbjct: 272 CGKAFLYLSR-ACEQGLVEEKTITEAVERLMDVRIRLGMMEDYPSPYANIPYDVVECPEH 330

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
           I L+ EA+++ +VLLKNDN  LPL    + T+A++GP+AN+  A++GNYEGT  RY +P+
Sbjct: 331 IALSLEASKRSMVLLKNDNHFLPLKQEQVHTIAVIGPNANSRAALVGNYEGTSSRYITPL 390

Query: 428 DGFYAYS---KVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
           +G   Y+     + YA GC       + + +       A+ AA+ AD  V+  GLD  +E
Sbjct: 391 EGIQEYTGEKTRVLYAQGCHLYKDQVEFLGEPKDRFKEALIAAERADVIVMCLGLDAGIE 450

Query: 479 AE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
            E           D++ L LPG Q EL+  VA   K P+ L +++  A+D+++A+ + +I
Sbjct: 451 GEEGDAGNEYASGDKLGLKLPGLQQELLEAVAAVGK-PIVLTVLAGSALDLSWAQEHAQI 509

Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRT 589
           ++IL   YPG  GG+AIA+ +FG+++P G+LP+T+YE       +T   +       GRT
Sbjct: 510 RAILDCWYPGARGGKAIAEALFGEFSPCGKLPVTFYEGTEFLPDFTDYSM------AGRT 563

Query: 590 YKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
           Y++ D  V+YPFGYGL+Y+Q +Y  A +               D+    G  +P      
Sbjct: 564 YRYTDRHVLYPFGYGLTYSQIRYSDAHA---------------DVT-DFGILEP------ 601

Query: 650 IDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK-------PPGIAGTHIKQVIGYERVF 702
                      T  + VEN G     E V VY +        PG       Q+ G   V 
Sbjct: 602 ----------VTVHVTVENTGTYPVQEAVQVYVRFSEREAYDPGY------QLKGIRSVA 645

Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +  G+  +V  T++  +   ++      L+  G++ I VG
Sbjct: 646 LECGEKKEVCITLSP-RDFALISEEGKCLVHPGSYEIAVG 684


>gi|449303062|gb|EMC99070.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
           10762]
          Length = 786

 Score =  423 bits (1087), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 267/743 (35%), Positives = 387/743 (52%), Gaps = 44/743 (5%)

Query: 5   IKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALH 64
           +   LS  P C+  L   +RA  LV+  TL E     G+ A GVPRLGLP YE W+EALH
Sbjct: 50  VNSTLSTTPVCNRSLSAWDRAHALVQLFTLEELANNTGNTAPGVPRLGLPAYEVWNEALH 109

Query: 65  GVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
           G+S     TN   GT        ATSFP+ IL+ AS N +L  +IG  +ST+ RA  N G
Sbjct: 110 GISHGHFATN---GTW-----SWATSFPSPILSMASMNRTLINQIGDIISTQGRAFSNAG 161

Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSR 183
             GL  ++PNIN  R P WGR  ETPGED + +   YA  Y+ G+Q  +       + + 
Sbjct: 162 RYGLDSYAPNINGFRSPVWGRGQETPGEDAFFLSSLYAYEYITGMQGGK-------APAV 214

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
           P K+ A  KH+A YD++NW  N R   D  +T+QD+   +   F   +       +MCSY
Sbjct: 215 P-KLVAVPKHFAGYDIENWNNNSRLGLDVNITQQDLAGYYTPQFRSAIQNAKALGLMCSY 273

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNF-HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           N VNG+P+C++   L    R  W F +G++ SDCD++  +   H +  +T   AVA  L+
Sbjct: 274 NAVNGVPSCSNSFFLQTLARDTWGFGNGFVSSDCDAVYNVYNPHGYAANTT-GAVADSLR 332

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNN 361
           AG D+DCG  Y  + + A   G ++  DI+ +L   Y  L+  GYFDG S  Y+NLG N+
Sbjct: 333 AGTDIDCGTSYPFYLVPAFNAGLVSRNDIELALTRYYSGLVMQGYFDGNSSLYRNLGWND 392

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           +       ++ EAA +GI LLKND G LPL+  + +++AL+GP ANAT  + GNY     
Sbjct: 393 VLTTDAWNISYEAAVEGITLLKND-GTLPLSK-STRSVALIGPWANATLQLQGNYYAAAP 450

Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
              SP+  F A    +N+  G   I   N S    AI  A+ +D  +   G+D S+EAEG
Sbjct: 451 YLISPLQAFRASGMTVNFVNGTT-ISSTNTSGFAEAITLAQQSDVIIYAGGIDNSIEAEG 509

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
            DR ++  PG Q +LI +++   K P+ ++ M  G VD +  KNN K+ +++W GYPG+ 
Sbjct: 510 LDRQNITWPGNQLDLIYQLSQVGK-PLVVLQMGGGQVDSSALKNNSKVNALVWGGYPGQS 568

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
           GG+A+ D+I G   P GRL  T Y A+Y       +M + PVN   G+TY ++ G  VYP
Sbjct: 569 GGQALFDIIMGNRAPAGRLVTTQYPASYATSFNQLNMNMAPVNGSLGQTYMWYTGTPVYP 628

Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
           FG+GL YT F       P +              N T     P      +++V   D+ F
Sbjct: 629 FGHGLFYTNFTTTSTMGPVTT------------YNLTSIFAAPHPGYEFVEEVPIMDFNF 676

Query: 661 TFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYER-VFIAAGQSAKVGFTMNAC 718
                V N G+       M++ S   G     IK ++G +R   I  G  A V   +   
Sbjct: 677 I----VNNTGRTASDWSGMLFASTTSGPTPRPIKWLVGIDREAIIVPGGLASVTIKV-PV 731

Query: 719 KSLKIVDNAANSLLASGAHTILV 741
            +L   D   N ++  G++++++
Sbjct: 732 GALARADANGNLVVYPGSYSLML 754


>gi|380293100|gb|AFD50200.1| beta-xylosidase [Hypocrea orientalis]
          Length = 797

 Score =  422 bits (1085), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 278/760 (36%), Positives = 403/760 (53%), Gaps = 57/760 (7%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD+   Y ERA+ L+   TL E +    +   GVPRLGLP Y+ W+EALHG+    R   
Sbjct: 63  CDSSAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLD---RANF 119

Query: 75  SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
           +  G  F+     ATSFP  ILTTA+ N +L  +I   +ST+ARA  N G  GL  ++PN
Sbjct: 120 ATKGGQFE----WATSFPMPILTTAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPN 175

Query: 135 INVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           +N  R P WGR  ETPGED + +   Y   Y+ G+Q   GV      D   LK++A  KH
Sbjct: 176 VNGFRSPLWGRGQETPGEDAFFLSSAYTYEYITGIQG--GV------DPEQLKVAATVKH 227

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           +A YDL+NW    R  FD+ +T+QD+ E +   F          S+MCSYN VNG+P+CA
Sbjct: 228 FAGYDLENWNNQSRLGFDAIITQQDLSEYYTPQFLAAARYAKSRSLMCSYNSVNGVPSCA 287

Query: 254 DPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
           +   L   +R  W F   GY+ SDCD++  +   H +    +  A A  L+AG D+DCG 
Sbjct: 288 NSFFLQTLLRESWGFPEWGYVSSDCDAVYNVFNPHDYA-SNQSSAAASSLRAGTDIDCGQ 346

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELA 371
            Y      +   G++   +I+ S+  LY  L+RLGYFD   QY++LG  ++       ++
Sbjct: 347 TYPWHLNESFVAGEVTRGEIERSVTRLYANLVRLGYFDKKNQYRSLGWKDVVKTDAWNIS 406

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            EAA +GIVLLKND G LPL+   ++++AL+GP ANAT  M GNY G      SP++   
Sbjct: 407 YEAAVEGIVLLKND-GTLPLSK-KVRSIALIGPWANATTQMQGNYFGPAPYLISPLEAAK 464

Query: 432 AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPG 491
                +N+  G  +I   + +    AI AAK +DA V + G+D ++E EG DR D+  PG
Sbjct: 465 KAGYHVNFELGT-EIAGNSTAGFAKAIAAAKKSDAIVYLGGIDNTIEQEGADRTDIAWPG 523

Query: 492 FQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIF 551
            Q +LI ++++  K P+ ++ M  G VD +  K+N K+ S++W GYPG+ GG A+ D++ 
Sbjct: 524 NQLDLIKQLSEVGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILS 582

Query: 552 GKYNPGGRLPITWYEANYV-KIPYTSMPLRP-VNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
           GK  P GRL  T Y A YV + P   M LRP   + PG+TY ++ G  VY FG GL YT 
Sbjct: 583 GKRAPAGRLITTQYPAEYVHQFPQNDMNLRPDGKSNPGQTYIWYTGKPVYEFGSGLFYTT 642

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
           FK  +AS PK +              YT     P                FTF+  ++N 
Sbjct: 643 FKETLASHPKCLKFNTSSILSAPHPGYTYSEQIP---------------VFTFEANIKNS 687

Query: 670 GKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACKSLKIVDN 726
           GK +     M++ +    G A    K ++G++R+  I  G S+K+   +    +L  VD+
Sbjct: 688 GKTESPYTAMLFVRTSNAGPAPYPNKWLVGFDRLADIKPGHSSKLSIPI-PVSALARVDS 746

Query: 727 AANSLLASGAHTI-------------LVGEGVGGVSFPLQ 753
             N ++  G + +             LVGE V   ++PL+
Sbjct: 747 YGNRIVYPGKYELALNTDESVKLEFELVGEEVTIENWPLE 786


>gi|171678585|ref|XP_001904242.1| hypothetical protein [Podospora anserina S mat+]
 gi|170937362|emb|CAP62020.1| unnamed protein product [Podospora anserina S mat+]
          Length = 800

 Score =  421 bits (1081), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 267/757 (35%), Positives = 400/757 (52%), Gaps = 64/757 (8%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    C+  L  PERA  LV  +T  EK+Q +   + G PR+GLP Y WWSEALHGV++
Sbjct: 34  LSTNQVCNTTLSPPERAAALVAALTPEEKLQNIVSKSLGAPRIGLPAYNWWSEALHGVAY 93

Query: 69  IGRRTNSPPGTHF---DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
                   PGT F   D     +TSFP  +L  A+F++ L +KI + +  E RA  N G 
Sbjct: 94  A-------PGTQFWQGDGPFNSSTSFPMPLLMAATFDDELLEKIAEVIGIEGRAFGNAGF 146

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
           +GL +W+PN+N  +DPRWGR  ETPGED  +V RYA   ++GL   EG    ++      
Sbjct: 147 SGLDYWTPNVNPFKDPRWGRGSETPGEDVLLVKRYAAAMIKGL---EGPVPEKER----- 198

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           ++ A CKHYAA D ++W G  R +F+++++ QDM E + +PF+ CV +  V S+MC+YN 
Sbjct: 199 RVVATCKHYAANDFEDWNGATRHNFNAKISLQDMAEYYFMPFQQCVRDSRVGSIMCAYNA 258

Query: 246 VNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           VNG+P+CA P LL   +R  WN+   + YI SDC+++  +  +HK+   T  +  A   +
Sbjct: 259 VNGVPSCASPYLLQTILREHWNWTEHNNYITSDCEAVLDVSLNHKYAA-TNAEGTAISFE 317

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNN 361
           AG+D  C    ++   GA  QG + E+ +D +L  LY  ++R GYFDG    Y +LG  +
Sbjct: 318 AGMDTSCEYEGSSDIPGAWSQGLLKESTVDRALLRLYEGIVRAGYFDGKQSLYSSLGWAD 377

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALP----LNTGNIKTLALVGPHANATKAMIGNYE 417
           +  P   +L+ +AA  G VLLKND G LP    L+    K +A++G  ++A   + G Y 
Sbjct: 378 VNKPSAQKLSLQAAVDGTVLLKND-GTLPLSDLLDKSRPKKVAMIGFWSDAKDKLRGGYS 436

Query: 418 GTPCRYTSPMDGFYAYSKV-INYAPGCADI----VCQNNSMIPAAIDAAKNADATVIVAG 472
           GT     +P    YA S++ I ++     I    +  N S    A+ AAK+AD  +   G
Sbjct: 437 GTAAYLHTPA---YAASQLGIPFSTASGPILHSDLASNQSWTDNAMAAAKDADYILYFGG 493

Query: 473 LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAG-AVDINFAKNNPKIKS 531
           +D S   E KDR DL  PG Q  LIN +   +K    L+++  G  +D     +NPKI +
Sbjct: 494 IDTSAAGETKDRYDLDWPGAQLSLINLLTTLSK---PLIVLQMGDQLDNTPLLSNPKINA 550

Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPV--NNFPGR 588
           ILW  +PG++GG A+ +++ G  +P GRLP+T Y +N+ + +P T M LRP   N+  GR
Sbjct: 551 ILWANWPGQDGGTAVMELVTGLKSPAGRLPVTQYPSNFTELVPMTDMALRPSAGNSQLGR 610

Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
           TY+++  P V  FG+GL YT F  K      +V I +D+  +  D  Y            
Sbjct: 611 TYRWYKTP-VQAFGFGLHYTTFSPKFGKKFPAV-IDVDEVLEGCDDKY------------ 656

Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGT--HIKQVIGYERVFIAAG 706
            +D     D      + VEN G      V + +   PG+      IK +  + R+    G
Sbjct: 657 -LDTCPLPD----LPVVVENRGNRTSDYVALAFVSAPGVGPGPWPIKTLGAFTRLRGVKG 711

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
              + G       +L   D   N+++  G + + + E
Sbjct: 712 GEKREGGLKWNLGNLARHDEEGNTVVYPGKYEVSLDE 748


>gi|238483831|ref|XP_002373154.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
 gi|292495283|sp|B8MYV0.1|XYND_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|220701204|gb|EED57542.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
          Length = 797

 Score =  419 bits (1076), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 260/744 (34%), Positives = 386/744 (51%), Gaps = 46/744 (6%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD      +RA  LV  +T  E V    +  +G PR+GLP Y+ W+EALHGV+ 
Sbjct: 57  LSKTLVCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVA- 115

Query: 69  IGRRTNSPPGTHFDSEVPG----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
                      H D    G    +TSFP  I T A+ N +L  +I   +ST+ RA  N G
Sbjct: 116 -----------HADFSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAG 164

Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSR 183
             GL  +SPNIN  R P WGR  ETPGED Y +   YA  Y+ G+Q   GV      D+ 
Sbjct: 165 RYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQG--GV------DAN 216

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
           PLK+ A  KHYA YD++NW+ + R   D ++T+QD+ E +   F +   +  V SVMCSY
Sbjct: 217 PLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSY 276

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
           N VNG+P+C++   L   +R  ++F   GY+  DC ++  +   H +  + +  A A  +
Sbjct: 277 NAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATN-ESSAAADSI 335

Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKN 360
           +AG D+DCG  Y      +    +++  D++  +  LY  L+R GYFDG +  Y+N+  +
Sbjct: 336 RAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDGKTSPYRNITWS 395

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGN-IKTLALVGPHANATKAMIGNYEGT 419
           ++ +     L+ EAA Q IVLLKND G LPL T +  KT+AL+GP ANAT  M+GNY G 
Sbjct: 396 DVVSTNAQNLSYEAAAQSIVLLKND-GILPLTTSSSTKTIALIGPWANATTQMLGNYYGP 454

Query: 420 PCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
                SP+  F      I Y  G       +++    A+  AK AD  +   G+D ++E 
Sbjct: 455 APYLISPLQAFQDSEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLET 514

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
           E +DR ++  P  Q  LI K+AD  K P+ ++ M  G VD +  KNN  + +++W GYPG
Sbjct: 515 EAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYPG 573

Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKI-PYTSMPLRPVNNFPGRTYKFFDGPVV 598
           + GG+A+AD+I GK  P  RL  T Y A Y ++ P   M LRP  + PG+TY ++ G  V
Sbjct: 574 QSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGSNPGQTYMWYTGTPV 633

Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
           Y FG+GL YT F    ++   +      K++   +I+  +G  +P     L++ +     
Sbjct: 634 YEFGHGLFYTNFTASASAGSGT------KNRTSFNIDEVLG--RPHPGYKLVEQMPL--- 682

Query: 659 KFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNA 717
              F ++V+N G        M + +   G A    K ++G++R+      SAK       
Sbjct: 683 -LNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEPGSAKTMVIPVT 741

Query: 718 CKSLKIVDNAANSLLASGAHTILV 741
             SL   D   N +L  G + + +
Sbjct: 742 VDSLARTDEEGNRVLYPGRYEVAL 765


>gi|367046937|ref|XP_003653848.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
 gi|347001111|gb|AEO67512.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
          Length = 923

 Score =  418 bits (1074), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 278/748 (37%), Positives = 389/748 (52%), Gaps = 47/748 (6%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L   P C+  LP  +R + LV ++TL EK+  + D A G  R+GLP YEWWSEALHGV+ 
Sbjct: 159 LCSSPACNTSLPIADRVRWLVGQLTLQEKITNLVDGASGSARVGLPPYEWWSEALHGVAA 218

Query: 69  I-GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
             G     P GT F      ATSFP  I  +A+F++ L  +I   V  E RA  N G +G
Sbjct: 219 SPGVTFAGPNGTAFSY----ATSFPMPITISAAFDDDLVSQIAAVVGREGRAFANHGLSG 274

Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
             FW+PNIN  RDPRWGR  ETPGED + + +Y  + + GLQ          SD    +I
Sbjct: 275 FDFWTPNINPFRDPRWGRGPETPGEDAFRIQQYIRHLIPGLQ---------GSDPLDKQI 325

Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
            A CKHYA YD++      R+ +D      D+ E ++ PF+ CV +  + SVMCSYN V+
Sbjct: 326 IATCKHYAVYDVE----TGRYEYDYDPQPHDLAEYYLAPFKTCVRDVGIGSVMCSYNAVD 381

Query: 248 GIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           GIP CA   LL   +R  W F   + Y+VSDCD+++ I   H F  D+   A A  L AG
Sbjct: 382 GIPACASEYLLQSVLRDHWGFTEPYQYVVSDCDAVRFIYSPHNF-TDSPAAAAAVALNAG 440

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICN 364
            DL+CG  Y N    ++      EA +D +L  LY  L  +G+FDGS +Y  LG + +  
Sbjct: 441 TDLECGSTYLNLNQ-SLASNMTTEAALDRALTRLYTALHTIGFFDGSARYGGLGWDAVGT 499

Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
                LA +AA  G VLLKN+   LPL++  ++ LA++GP ANAT  M GNY G      
Sbjct: 500 GDAQVLAYQAAVDGAVLLKNEKSLLPLDSKRLRKLAVIGPWANATTQMQGNYFGQAAYLV 559

Query: 425 SPMDGFYAYSKVIN--YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
           SP+  F +     N  +A G   I   + +   AA+ AAK ADA V + G+D SVE+E  
Sbjct: 560 SPLAAFQSAWGADNVLFANGTG-IAGNSTAGFAAALAAAKAADAVVFLGGVDNSVESESL 618

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
           DR  +  PG Q +LI ++A   K P+ +V    G +D +    NP++ ++LW GYPG+ G
Sbjct: 619 DRTAISWPGNQLDLIAQLAAVGK-PLVVVQCGGGQLDDSALLANPRVGALLWAGYPGQAG 677

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPV--------NNFPGRTYKFF 593
           G AIAD++ GK  P GRLP+T Y A+Y  ++      LRP         + FPGRTYK++
Sbjct: 678 GAAIADLLTGKQAPAGRLPVTQYAASYTSEVSLFDPSLRPRRSGGSKSHSTFPGRTYKWY 737

Query: 594 DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
            G  V PFGYGL YT F+   A  P+          +  DI      N    ++      
Sbjct: 738 TGKPVLPFGYGLHYTTFRTAWADEPRG---------RAYDIAGLFPANTTTTSSAFSAAD 788

Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVF-IAAGQSAKVG 712
                  +  +     G  D   ++ + ++  G A    K ++GY R   +A G SA++ 
Sbjct: 789 TYPVLNVSVTVTNTGRGASDYVGLLFLRTRNAGPAPYPNKWLVGYARARGLAPGSSARLE 848

Query: 713 FTMNACKSLKIVDNAANSLLASGAHTIL 740
             + A  SL   D     ++  G + +L
Sbjct: 849 LAV-ALGSLARADEDGRRVVYPGDYELL 875


>gi|67902828|ref|XP_681670.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
 gi|74592887|sp|Q5ATH9.1|BXLB_EMENI RecName: Full=Exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|40747867|gb|EAA67023.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
 gi|259484335|tpe|CBF80465.1| TPA: beta-1,4-xylosidase (Eurofung) [Aspergillus nidulans FGSC A4]
          Length = 763

 Score =  417 bits (1073), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 273/745 (36%), Positives = 395/745 (53%), Gaps = 57/745 (7%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS+ P CD  L   ERAK LV  +TL EK+   G  A G  RLGLP Y WW+EALHGV+ 
Sbjct: 33  LSELPICDTSLSPLERAKSLVSALTLEEKINNTGHEAAGSSRLGLPAYNWWNEALHGVA- 91

Query: 69  IGRRTNSPPGTHFDS--EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
                    G  F+   +   ATSFP  I+  A+FN++L +++ + +STEARA  N  +A
Sbjct: 92  ------EKHGVSFEESGDFSYATSFPAPIVLGAAFNDALIRRVAEIISTEARAFSNSDHA 145

Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK 186
           G+ +W+PN+N  +DPRWGR  ETPGEDP    RY   +V GLQ         D   +P K
Sbjct: 146 GIDYWTPNVNPFKDPRWGRGQETPGEDPLHCSRYVKEFVGGLQG--------DDPEKP-K 196

Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
           + A CKH AAYDL+ W G  RF FD++V+  D+ E ++ PF+ C  +  V + MCSYN +
Sbjct: 197 VVATCKHLAAYDLEEWGGVSRFEFDAKVSAVDLLEYYLPPFKTCAVDASVGAFMCSYNAL 256

Query: 247 NGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NG+P CAD  LL   +R  W + G   ++  DC +++ I   H ++    E A A  L A
Sbjct: 257 NGVPACADRYLLQTVLREHWGWEGPGHWVTGDCGAVERIQTYHHYVESGPE-AAAAALNA 315

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD---GSPQYKNLGKN 360
           G+DLDCG +  ++   A +QG I+   +D +L  LY  L++LGYFD   G P  ++LG +
Sbjct: 316 GVDLDCGTWLPSYLGEAERQGLISNETLDAALTRLYTSLVQLGYFDPAEGQP-LRSLGWD 374

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
           ++   +  ELA   A QG VLLKN +  LPL      TLAL+GP  N T  +  NY G  
Sbjct: 375 DVATSEAEELAKTVAIQGTVLLKNIDWTLPLKANG--TLALIGPFINFTTELQSNYAGPA 432

Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
               + ++        +  APG  ++   +      A+  A  ADA +   G+D +VE E
Sbjct: 433 KHIPTMIEAAERLGYNVLTAPGT-EVNSTSTDGFDDALAIAAEADALIFFGGIDNTVEEE 491

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
             DR  +  PG Q ELI ++A+  + P+T+V    G VD +    +  + +I+W GYP +
Sbjct: 492 SLDRTRIDWPGNQEELILELAELGR-PLTVVQFGGGQVDDSALLASAGVGAIVWAGYPSQ 550

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
            GG  + DV+ GK  P GRLPIT Y  +YV ++P T M L+P  + PGRTY++++  V+ 
Sbjct: 551 AGGAGVFDVLTGKAAPAGRLPITQYPKSYVDEVPMTDMNLQPGTDNPGRTYRWYEDAVL- 609

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           PFG+GL YT F    A   K      D     R  N          ++ ++D        
Sbjct: 610 PFGFGLHYTTFNVSWA---KKAFGPYDAATLARGKN---------PSSNIVD-------- 649

Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERV-FIAAGQSAKVGFTMN 716
            TF + V N G +    V +V++  P  G     IK ++GY R   I  G++ KV   + 
Sbjct: 650 -TFSLAVTNTGDVASDYVALVFASAPELGAQPAPIKTLVGYSRASLIKPGETRKVDVEVT 708

Query: 717 ACKSLKIVDNAANSLLASGAHTILV 741
                +  ++    +L  G +T+LV
Sbjct: 709 VAPLTRATED-GRVVLYPGEYTLLV 732


>gi|310797011|gb|EFQ32472.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Glomerella graminicola M1.001]
          Length = 767

 Score =  416 bits (1069), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 267/743 (35%), Positives = 394/743 (53%), Gaps = 57/743 (7%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD  L  PERA  LV+ +T+ EK+Q +   A G PR+GLP Y WWSEALHGV++      
Sbjct: 43  CDRTLSPPERAAALVKALTVEEKLQNLVSKAQGAPRIGLPAYNWWSEALHGVAYA----- 97

Query: 75  SPPGTHF---DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
             PGT+F   D E   +TS+P  +L  A+F++ L ++IG  +  EARA  N G AGL +W
Sbjct: 98  --PGTYFPEGDVEFNSSTSYPMPLLMAAAFDDELIEQIGAAIGIEARAWGNAGWAGLDYW 155

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
           +PN+N  +DPRWGR  ETPGED   V RYA    RGL      E  R        + + C
Sbjct: 156 TPNVNPFKDPRWGRGSETPGEDVLRVKRYAEYITRGLDGPVPGEQRR--------VISTC 207

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KHYA  D ++W G  R  FD+++T QD+ E +++PF+ C  +  V S+MC+YN VNG+P+
Sbjct: 208 KHYAGNDFEDWNGTSRHDFDAKITAQDLAEYYLMPFQQCARDSKVGSIMCAYNAVNGVPS 267

Query: 252 CADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           CA+  LL   +R  WN+   + Y+ SDC+++  +  +HK+   T     A   +AG+D  
Sbjct: 268 CANEYLLQNILREHWNWTEHNNYVTSDCEAVLDVSANHKYA-PTNAAGTAICFEAGMDTS 326

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICNPQH 367
           C    ++   GA  QG + E  +D +L  LY  L+R GYFDG    Y  LG  ++ + + 
Sbjct: 327 CEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDGHEAIYAKLGWKDVNSAEA 386

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
             LA +AA +GIVLLKN NG LPL+      +A++G  A+A   + G Y G      +P 
Sbjct: 387 QSLALQAAVEGIVLLKN-NGTLPLDLKPSHKVAMIGFWADAPDKLQGGYSGRAAHLHTP- 444

Query: 428 DGFYAYSKVINYAPGCADIVCQNNS---MIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
             + A    ++       ++ +NN+      AA++AA+ AD  +   GLD S   E  DR
Sbjct: 445 -AYAARQLGLDITLASGPVLQRNNASDNWTAAALEAAEGADYILYFGGLDTSAAGETLDR 503

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
            DL  P  Q  LI K++   K P+ + ++     D    + + ++ SILW  +PG++GG 
Sbjct: 504 TDLEWPEAQLMLIKKLSALGK-PLVVNLLGDQLDDTPLLQLD-EVSSILWANWPGQDGGV 561

Query: 545 AIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
           AI  +I G+ +P GRLP+T Y +NY   IP TSM LRP + +PGRTY+++D P+   FG+
Sbjct: 562 AIMKLITGEKSPAGRLPVTQYPSNYTDLIPMTSMDLRPTSQYPGRTYRWYDKPIKR-FGF 620

Query: 604 GLSYTQFKYKVASS-PKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
           GL YT FK +V  + PK++ I         D+      +   C A               
Sbjct: 621 GLHYTTFKAEVGGAFPKTLRIA--------DLVGCGNEHPDTCPAP------------PL 660

Query: 663 QIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACKS 720
            + + N G      V + Y S   G     IK +  Y+R+  +A G++A V         
Sbjct: 661 PVSITNTGNRTSDYVALAYLSGEYGPRPYPIKTLSAYKRLRDVAPGETATVDLAWT-LGD 719

Query: 721 LKIVDNAANSLLASGAHTILVGE 743
           +   D   N++L  G +TI + E
Sbjct: 720 IARHDEQGNTVLYPGEYTITIDE 742


>gi|154313073|ref|XP_001555863.1| hypothetical protein BC1G_05538 [Botryotinia fuckeliana B05.10]
          Length = 755

 Score =  416 bits (1068), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 260/624 (41%), Positives = 353/624 (56%), Gaps = 49/624 (7%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L++   CD       RA  L+   TL EKV   G+ + GVPR+GLP YEWW+EALHG++ 
Sbjct: 28  LANNTVCDTSSDPYTRAAALISLFTLAEKVNNTGNTSPGVPRIGLPSYEWWNEALHGIA- 86

Query: 69  IGRRTNSPPGTHF---DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
                   PGT F    S    +TSFP  IL  A+F++ L  K+   VSTEARA  N+  
Sbjct: 87  ------RSPGTTFAATGSNYSYSTSFPQPILMGATFDDELIHKVATQVSTEARAFNNVNR 140

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GL FW+PNIN  +DPRWGR  ETPGEDP+    Y    + GLQ   G+      D  P 
Sbjct: 141 FGLNFWTPNINPYKDPRWGRGQETPGEDPFHTSSYVNALITGLQG--GL------DDLPY 192

Query: 186 KIS-ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
           K   A CKH+A YDL+N +G  R+ FD+ +  QD+++ ++ PF+ C  + +V SVMCSYN
Sbjct: 193 KKGVATCKHFAGYDLENSDGAIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSVMCSYN 252

Query: 245 RVNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
            +NG+PTCAD  LL   +R  W +     ++ SDCD+++ I + H +   T E + A  L
Sbjct: 253 AMNGVPTCADDWLLQTLLREHWGWTEEDQWVTSDCDAVKNIWDYHNY-TLTPEQSAADAL 311

Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGK 359
            AG DLDCG ++  +   A  QG    + +D SL   Y  L+RLGYFD      Y+ L  
Sbjct: 312 NAGTDLDCGTFWPTYLGSAYDQGLYDISTLDRSLARRYASLVRLGYFDPPSVQPYRQLNW 371

Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
           +N+  P   +LA +AA  GIVLLKND G LPL++ NI  +AL+GP ANATK M GNY GT
Sbjct: 372 DNVSTPAAQQLALQAAEDGIVLLKND-GILPLSS-NITNVALIGPLANATKQMQGNYYGT 429

Query: 420 PCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
                SP+         + Y  G ADI  QN +   AAI AA++AD  + V G+D S+EA
Sbjct: 430 APYLRSPLIAAQNAGFKVTYVQG-ADIDSQNTTDFSAAISAAQSADLVIYVGGIDNSIEA 488

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
           E       +L    T LI             +      +D +   +N  + ++LW GYPG
Sbjct: 489 EE------ILANLSTPLI-------------ISQMGCMIDSSSLLSNTGVNALLWAGYPG 529

Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVV 598
           ++GG AI +++ GK  P GRLPIT Y +NYV ++  T M L+P    PGRTYK+++G  V
Sbjct: 530 QDGGTAIFNILTGKTAPAGRLPITQYPSNYVNQVTMTDMNLQPSRFNPGRTYKWYNGEPV 589

Query: 599 YPFGYGLSYTQFKYKVA-SSPKSV 621
           + +GYGL YT F  K+  SSP + 
Sbjct: 590 FEYGYGLQYTTFDAKITPSSPNNT 613


>gi|343428088|emb|CBQ71612.1| related to Beta-xylosidase [Sporisorium reilianum SRZ2]
          Length = 698

 Score =  415 bits (1067), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 250/630 (39%), Positives = 344/630 (54%), Gaps = 42/630 (6%)

Query: 7   VKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGV 66
           + LS  P CD  L +  RA  LV + T  E +    + A GVPRLG+P Y+WW+EALHGV
Sbjct: 27  LPLSTLPVCDTSLDFYTRATSLVAQFTTAELINNTVNHAPGVPRLGIPQYQWWTEALHGV 86

Query: 67  SFIGRRTNSPPGTHFDSEVPG----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
           +         PG +F+ +  G    ATSFP VI   A+F+++L++ +   ++ E RA  N
Sbjct: 87  A-------RSPGVNFNPDAAGEFGCATSFPQVINLGATFDDALYEAVAAHIANETRAFSN 139

Query: 123 LGNAGLTFWSP-NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
            G AGL  +SP NIN  RDPRWGR  ET GEDP  + RYA+  VRGLQ   G     +++
Sbjct: 140 AGRAGLNMYSPLNINAFRDPRWGRGQETVGEDPLHLSRYAVRVVRGLQ---GPAAQDEAN 196

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
            R L ++A CKHY AYDL+   G +R+ FD+ V+ QD+ +  +  F  CV +G  +++M 
Sbjct: 197 PR-LTLAATCKHYLAYDLEASAGVERYQFDALVSNQDLADLHLPQFRACVRDGGATTLMT 255

Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVA 298
           SYN VNG+P  A    L    R  W     H Y+ SDCD++  + ++H +  D    A A
Sbjct: 256 SYNAVNGVPPSASKYYLETLARDTWGLDKHHNYVTSDCDAVANVYDAHHYAADYVHAAAA 315

Query: 299 RVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKN 356
             L AG DLDCG  Y +    A+ Q     A I  ++  +Y  L+RLGYFD +     + 
Sbjct: 316 S-LNAGTDLDCGATYRDSLAAALAQNLTDVATIRRAVTRMYGSLVRLGYFDAAEAQPLRQ 374

Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
           LG  ++  P   +LA EAA   I LLKN    LPL     KT+AL+GP+ NAT A+ GNY
Sbjct: 375 LGWKDVNAPAAQKLAYEAAAASITLLKNRQSTLPLRETAGKTIALIGPYTNATFALRGNY 434

Query: 417 EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAID---------AAKNADAT 467
            G      +P D   A  +  +     A IV  N + I    D          AK+AD  
Sbjct: 435 AGPSPLVITPFD---AARRTFS----DAHIVSANGTSIAGPYDTATASAALATAKSADII 487

Query: 468 VIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVI-MSAGAVDINFAKNN 526
           V   G+D +VE E  DR D+  P  Q  LI ++  AA G V +V+    G VD    K +
Sbjct: 488 VYAGGIDPTVEGESLDRRDIAWPANQLRLIQEL--AALGKVLVVVQFGGGQVDGALLKGD 545

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNF 585
             + +++W GYPG+ G  A+ D++ GK  P GRLPIT Y ANY   +  T+M LRP   +
Sbjct: 546 DGVGALVWAGYPGQSGALALMDILAGKRAPAGRLPITQYPANYTHALRETTMALRPTATY 605

Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVA 615
           PGRTYK++ G   +PFG+GL YT F+  +A
Sbjct: 606 PGRTYKWYTGTPTFPFGFGLHYTTFRASIA 635


>gi|292495285|sp|B6EY09.1|XYND_ASPJA RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|211970990|dbj|BAG82824.1| 1,4-beta-D-xylosidase [Aspergillus japonicus]
          Length = 804

 Score =  414 bits (1063), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 270/746 (36%), Positives = 394/746 (52%), Gaps = 53/746 (7%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD+     +RA  LV   TL E +   G+ + GVPRLGLP Y+ WSEALHG++   R   
Sbjct: 60  CDSTASPYDRAAALVSLFTLEELIANTGNTSPGVPRLGLPPYQVWSEALHGLA---RANF 116

Query: 75  SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
           +  G +       ATSFP+ IL+ A+FN +L  +I   +ST+ RA  N G  GL  +SPN
Sbjct: 117 TDNGAY-----SWATSFPSPILSAAAFNRTLINQIASIISTQGRAFNNAGRFGLDVYSPN 171

Query: 135 INVVRDPRWGRVLETPGEDPY-VVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           IN  R P WGR  ETPGED Y +   YA  Y+ G+Q     E+        LK++A  KH
Sbjct: 172 INTFRHPVWGRGQETPGEDAYTLTAAYAYEYITGIQGGVNPEH--------LKLAATAKH 223

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           +A YD++NW+ + R   D  +T+QD+ E +   F +   +  V S MCSYN VNG+P+C+
Sbjct: 224 FAGYDIENWDNHSRLGNDVNITQQDLAEYYTPQFLVAARDAHVHSFMCSYNAVNGVPSCS 283

Query: 254 DPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
           +   L   +R  ++F  HGY+  DC ++  +   H +  + +  A A  + AG D+DCG 
Sbjct: 284 NTFFLQTLLRDTFSFVDHGYVSGDCGAVYGVFNPHGYAAN-EPSAAADAILAGTDIDCGT 342

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG----SPQYKNLGKNNICNPQH 367
            Y      ++  G +A  DI+     LY  L+ LGYFDG    S  Y++LG  ++     
Sbjct: 343 SYQYHFNESITTGAVARDDIERGFIRLYANLVELGYFDGNSSSSNPYRSLGWPDVQKTDA 402

Query: 368 IELAAEAARQGIVLLKNDNGALPLNT---GNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
             ++ EAA +GIVLLKND G LPL +   G  K++AL+GP ANAT  + GNY G      
Sbjct: 403 WNISYEAAVEGIVLLKND-GTLPLASPSEGKNKSIALIGPWANATTQLQGNYYGDAPYLI 461

Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           SP+D F A    ++YAPG  +I   + +   AA+ AA+ AD  V + G+D ++EAE +DR
Sbjct: 462 SPVDAFTAAGYTVHYAPGT-EISTNSTANFSAALSAARAADTIVFLGGIDNTIEAEAQDR 520

Query: 485 VDLLLPGFQTELINKVA--DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
             +  PG Q ELI+++A   +   P+ +  M  G VD +  K+N K+ ++LW GYPG+ G
Sbjct: 521 SSIAWPGNQLELISQLAAQKSDDQPLVVYQMGGGQVDSSALKSNAKVNALLWGGYPGQSG 580

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRP--VNNFPGRTYKFFDGPVVY 599
           G A+ D++ G   P GRL  T Y A Y +      M LRP      PG+TY ++ G  VY
Sbjct: 581 GLALRDILTGARAPAGRLTTTQYPAAYAESFSALDMNLRPNETTQNPGQTYMWYTGEPVY 640

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
            FG+GL YT F    ASS ++   K           YT        AA        +   
Sbjct: 641 AFGHGLFYTTFN---ASSAQAAKTK-----------YTFNITDLTSAAHPDTTTVGQRTL 686

Query: 660 FTFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVFIAA--GQSAKVGFTM 715
           F F   + N G+ D     +VY  +   G +    K ++G++R+   A  G +A++   +
Sbjct: 687 FNFTASITNSGQRDSDYTALVYANTSTAGPSPYPNKWLVGFDRLAAVAKEGGTAELNVPV 746

Query: 716 NACKSLKIVDNAANSLLASGAHTILV 741
            A   L  VD A N++L  G + + +
Sbjct: 747 -AVDRLARVDEAGNTVLFPGRYEVAL 771


>gi|255957137|ref|XP_002569321.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211591032|emb|CAP97251.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 791

 Score =  413 bits (1062), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 259/701 (36%), Positives = 371/701 (52%), Gaps = 44/701 (6%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD      +RA  L+   T  E V   G++   +PRLGLP Y+ W+EALHG+  
Sbjct: 55  LSKTMVCDTTAKPHDRAAALIAMFTFEELVNSTGNVMPAIPRLGLPPYQVWNEALHGLD- 113

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
              R N    T F  +   ATSFP+ ILT A+ N +L  +IG  VST+ RA  N G  GL
Sbjct: 114 ---RANL---TEF-GDYSWATSFPSPILTMAALNRTLINQIGGIVSTQGRAFNNGGRYGL 166

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
             +SPNIN  R P WGR  ETPGED  +   Y + Y+ GLQ           D + LK++
Sbjct: 167 DVYSPNINSFRHPVWGRGQETPGEDIQLCSVYGLEYITGLQG--------GLDPKELKLA 218

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           A  KH+A YD++NW  + R   D  ++  D    +   F   V +  V SVM SYN VNG
Sbjct: 219 ATAKHFAGYDIENWGNHSRLGNDMSISAFDFASYYAPQFVTAVRDARVHSVMASYNAVNG 278

Query: 249 IPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD 306
           +P  A+  LL   +R  WNF   GY+ SDCDS+  +   H + +     A   + +AG D
Sbjct: 279 VPASANSFLLQTLLRDTWNFVEDGYVSSDCDSVYNVFNPHGYASSASLAAAKSI-QAGTD 337

Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNICNP 365
           +DCG  Y  +   +  QG+I+ ++I+ +    Y  L+ LGYFDG + +Y++L  +++   
Sbjct: 338 IDCGATYQLYLNQSFTQGEISRSEIERAATRFYSNLVSLGYFDGDNSKYRDLDWSDVVAT 397

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
               ++ EAA +GIVLLKND G LPL+  +  ++AL+GP AN T  M GNY G     T 
Sbjct: 398 DAWNISYEAAVEGIVLLKND-GTLPLSK-DTHSVALIGPWANVTTTMQGNYYGAAPYLTG 455

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+    A    +NYA G  +I  +  S   AA+ AA+ +D  +   G+D SVEAEG DR 
Sbjct: 456 PLAALQASDLDVNYAFGT-NISSETTSGFEAALSAARKSDVVIFAGGIDNSVEAEGVDRE 514

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            +  PG Q +LI ++++  K P+ ++ M  G VD +  K N  + S++W GYPG+ GG A
Sbjct: 515 TITWPGNQLQLIEQLSELGK-PLVVLQMGGGQVDSSSLKANKNVNSLVWGGYPGQSGGPA 573

Query: 546 IADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYG 604
           I D++ GK  P GRL +T Y A Y ++ P T M LRP  + PG+TY ++ G  VY FG+G
Sbjct: 574 ILDILTGKRAPAGRLTVTQYPAEYALQFPATDMSLRPKGSNPGQTYMWYTGKPVYEFGHG 633

Query: 605 LSYTQFKYKVASSPKS---VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           L YT F+  +A+S  +       + K     +  Y V           I+ V   +Y   
Sbjct: 634 LFYTTFETSLANSHGANNGASFDIVKLLSRSNAGYNV-----------IEQVPFMNYT-- 680

Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTHI-KQVIGYERV 701
             IEVEN G +      M +         H  K ++G++R+
Sbjct: 681 --IEVENTGTVTSDYTAMAFVNTKAGPSPHPNKWLVGFDRL 719


>gi|60729621|pir||JC7966 xylan 1,4-beta-xylosidase (EC 3.2.1.37) - Talaromyces emersonii
 gi|21326570|gb|AAL32053.2|AF439746_1 beta-xylosidase [Rasamsonia emersonii]
          Length = 796

 Score =  413 bits (1061), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 245/599 (40%), Positives = 340/599 (56%), Gaps = 28/599 (4%)

Query: 24  RAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
           RA+ LV   TL E +    + A GVPRLGLP Y+ W+EALHG+    R   S  G     
Sbjct: 73  RAEALVSLFTLEELINNTQNTAPGVPRLGLPQYQVWNEALHGLD---RANFSDSG----- 124

Query: 84  EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRW 143
           E   ATSFP  IL+ ASFN +L  +I   ++T+ARA  N G  GL  ++PNIN  R P W
Sbjct: 125 EYSWATSFPMPILSMASFNRTLINQIASIIATQARAFNNAGRYGLDSYAPNINGFRSPLW 184

Query: 144 GRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNW 202
           GR  ETPGED + +   YA  Y+ GLQ   GV      D   +KI A  KH+A YDL+NW
Sbjct: 185 GRGQETPGEDAFFLSSAYAYEYITGLQG--GV------DPEHVKIVATAKHFAGYDLENW 236

Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
               R   ++ +T+QD+ E +   F          S+MCSYN VNG+P+C++   L   +
Sbjct: 237 GNVSRLGSNAIITQQDLSEYYTPQFLASARYAKTRSLMCSYNAVNGVPSCSNSFFLQTLL 296

Query: 263 RGDWNF--HGYIVSDCDSIQTIVESHKF-LNDTKEDAVARVLKAGLDLDCGDYYTNFTMG 319
           R  +NF   GY+ SDCD++  +   H + LN  +  A A  L AG D+DCG         
Sbjct: 297 RESFNFVDDGYVSSDCDAVYNVFNPHGYALN--QSGAAADSLLAGTDIDCGQTMPWHLNE 354

Query: 320 AVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICNPQHIELAAEAARQG 378
           +  +  ++  DI+ SL  LY  L+RLGYFDG+   Y+NL  N++       ++ EAA +G
Sbjct: 355 SFYERYVSRGDIEKSLTRLYANLVRLGYFDGNNSVYRNLNWNDVVTTDAWNISYEAAVEG 414

Query: 379 IVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVIN 438
           I LLKND G LPL +  ++++AL+GP ANAT  M GNY GTP    SP++   A    +N
Sbjct: 415 ITLLKND-GTLPL-SKKVRSIALIGPWANATVQMQGNYYGTPPYLISPLEAAKASGFTVN 472

Query: 439 YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELIN 498
           YA G  +I   +      AI AAK +D  +   G+D ++EAEG+DR DL  PG Q +LI 
Sbjct: 473 YAFGT-NISTDSTQWFAEAISAAKKSDVIIYAGGIDNTIEAEGQDRTDLKWPGNQLDLIE 531

Query: 499 KVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGG 558
           +++   K P+ ++ M  G VD +  K N  + +++W GYPG+ GG A+ D++ GK  P G
Sbjct: 532 QLSKVGK-PLVVLQMGGGQVDSSSLKANKNVNALVWGGYPGQSGGAALFDILTGKRAPAG 590

Query: 559 RLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVAS 616
           RL  T Y A Y  + P   M LRP  + PG+TY ++ G  VY FG+GL YT+F+   A+
Sbjct: 591 RLVSTQYPAEYATQFPANDMNLRPNGSNPGQTYIWYTGTPVYEFGHGLFYTEFQESAAA 649


>gi|333379783|ref|ZP_08471502.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
           22836]
 gi|332884929|gb|EGK05184.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
           22836]
          Length = 737

 Score =  413 bits (1061), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 258/757 (34%), Positives = 388/757 (51%), Gaps = 97/757 (12%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
           ++P+ +  L   ER  DLV ++TL EKV QM +    + RL +P Y WW+E LHG   IG
Sbjct: 24  NYPFQNTNLSIDERVNDLVSKLTLEEKVAQMLNNTPAIERLNIPAYNWWNECLHG---IG 80

Query: 71  RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA---- 126
           R          D +V   T FP  I   A++N+ L K++   +S E RA+YN   +    
Sbjct: 81  RT---------DYKV---TVFPQAIGMAAAWNKELMKEVASAISDEGRAIYNDATSKGNR 128

Query: 127 ----GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
               GLT+W+PNIN+ RDPRWGR  ET GEDP++ G    ++V GLQ           D+
Sbjct: 129 EIYYGLTYWTPNINIFRDPRWGRGQETYGEDPFLTGVLGKSFVAGLQG---------DDT 179

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
           + LK +AC KHYA +   +   N R  F++ VT+ D+ +T++  F   V E  V+ VMC+
Sbjct: 180 KYLKAAACAKHYAVH---SGPENTRHTFNTFVTDYDLWDTYLPAFRNLVVEAKVAGVMCA 236

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           YN  NG P C +  L+ + +R  WNF GY+ SDC +I    + HK   D K  A A  + 
Sbjct: 237 YNAYNGEPCCGNNFLMQEILREKWNFTGYVTSDCGAIDDFYQHHKTHPDAKY-AAADAVY 295

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKN 360
            G D+DCG+      + AV+ G I E  ID SL+ L+ +  RLG FD +   +Y  +  +
Sbjct: 296 NGTDIDCGNEAYKALVDAVKTGIITEKQIDISLKRLFTIRFRLGMFDPAENVKYSQISTS 355

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + + +H +LA +  R+ IVLLKN+N  LPL +  +K +A+VGP+AN   +++GNY G P
Sbjct: 356 VLESQKHKDLALKITRESIVLLKNENNTLPL-SKKLKKVAVVGPNANNEVSVLGNYNGFP 414

Query: 421 CRYTSPMDGFYAYSK--VINYAPGCADIVCQNNSM--IPAAIDAAKNADATVIVAGLDLS 476
               +P +      K   + Y  G   +    NS   + A +   K+ D  + V G+   
Sbjct: 415 TEIVTPYEAVKQKLKGAEVIYEKGIDFVTPSTNSKEEVSALVKRLKDVDVVIFVGGISPE 474

Query: 477 VEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           +E E          G DR  + LP  QT+ + K   A K P   V+M+  A+   +   N
Sbjct: 475 LEGEEMPVKIEGFTGGDRTSIKLPKIQTDFM-KALVAEKIPTVFVMMTGSAIATEWESQN 533

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
             I +I+   Y G++ G AIADV+FG YNP G+LP+T+Y  +      + +P        
Sbjct: 534 --IPAIVNAWYGGQDAGTAIADVLFGDYNPSGKLPVTFYAKD------SDLPAFNSYEMK 585

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
            RTY++F+G V+YPFGYGLSYT+F+Y     P ++D                G N     
Sbjct: 586 NRTYRYFNGEVLYPFGYGLSYTKFEYSPIQVPSTID---------------TGNNA---- 626

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAA 705
                            + ++N GK++G EVV +Y   P   G   +  + G+ RV + A
Sbjct: 627 --------------KVSVSIKNTGKVEGEEVVQLYISYPDTKGQKPLYALKGFNRVSLKA 672

Query: 706 GQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           G+S  V F ++  + L +VD+A    +++G   I +G
Sbjct: 673 GESKTVEFNLSP-RELGLVDDAGILKVSAGKRKIFIG 708


>gi|121809149|sp|Q4AEG8.1|XYND_ASPAW RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|73486695|dbj|BAE19756.1| beta-xylosidase [Aspergillus awamori]
          Length = 804

 Score =  413 bits (1061), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 262/703 (37%), Positives = 374/703 (53%), Gaps = 49/703 (6%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD      +RA  L+   TL E +   G+   GV RLGLP Y+ WSEALHG+    R   
Sbjct: 69  CDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD---RANF 125

Query: 75  SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
           S  G +       ATSFP  ILTTA+ N +L  +I   +ST+ RA  N G  GL  ++PN
Sbjct: 126 SDSGAY-----NWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPN 180

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           IN  R P WGR  ETPGED  +   YA  Y+ G+Q         D +S  LK++A  KHY
Sbjct: 181 INTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQG-------PDPESN-LKLAATAKHY 232

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A YD++NW  + R   D  +T+QD+ E +   F +   +  V SVMC+YN VNG+P CAD
Sbjct: 233 AGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPACAD 292

Query: 255 PKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
              L   +R  + F  HGY+ SDCD+   I   H + + ++  A A  + AG D+DCG  
Sbjct: 293 SYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDCGTT 351

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-----YKNLGKNNICNPQH 367
           Y      ++  G ++  DI+  +  LY  L++ GYFD +       Y++L  +++     
Sbjct: 352 YQWHLNESITAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLETDA 411

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTG----NIKTLALVGPHANATKAMIGNYEGTPCRY 423
             ++ +AA QGIVLLKN N  LPL       +  T+AL+GP ANAT  ++GNY G     
Sbjct: 412 WNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYM 471

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
            SP   F      +N+A G   I   + S   AA+ AA++AD  +   G+D ++EAE  D
Sbjct: 472 ISPRAAFEEAGYKVNFAEGTG-ISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALD 530

Query: 484 RVDLLLPGFQTELINKVADAA-KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
           R  +  PG Q +LI K+A AA K P+ ++ M  G VD +  KNN K+ ++LW GYPG+ G
Sbjct: 531 RESIAWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTKVSALLWGGYPGQSG 590

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
           G A+ D+I GK NP GRL  T Y A+Y  + P T M LRP  + PG+TYK++ G  VY F
Sbjct: 591 GFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDNPGQTYKWYTGEAVYEF 650

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLD-KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
           G+GL YT F  + +S+  + ++KL+ +D   R         + P                
Sbjct: 651 GHGLFYTTFA-ESSSNTTTKEVKLNIQDILSRTHEELASITQLPV--------------L 695

Query: 661 TFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERV 701
            F   + N GK++     MV++     G A    K ++G++R+
Sbjct: 696 NFTANIRNTGKLESDYTAMVFANTSDAGPAPYPKKWLVGWDRL 738


>gi|169767016|ref|XP_001817979.1| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
 gi|121805502|sp|Q2UR38.1|XYND_ASPOR RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|83765834|dbj|BAE55977.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 798

 Score =  412 bits (1060), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 259/743 (34%), Positives = 384/743 (51%), Gaps = 47/743 (6%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD      +RA  LV  +T  E V    +  +G PR+GLP Y+ W+EALHGV+ 
Sbjct: 57  LSKTLVCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVA- 115

Query: 69  IGRRTNSPPGTHFDSEVPG----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
                      H D    G    +TSFP  I T A+ N +L  +I   +ST+ RA  N G
Sbjct: 116 -----------HADFSDAGGFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAG 164

Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSR 183
             GL  +SPNIN  R P WGR  ETPGED Y +   YA  Y+ G+Q   GV      D+ 
Sbjct: 165 RYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQG--GV------DAN 216

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
           PLK+ A  KHYA YD++NW+ + R   D ++T+QD+ E +   F +   +  V SVMCSY
Sbjct: 217 PLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSY 276

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
           N VNG+P+C++   L   +R  ++F   GY+  DC ++  +   H +  + +  A A  +
Sbjct: 277 NAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATN-ESSAAADSI 335

Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKN 360
           +AG D+DCG  Y      +    +++  D++  +  LY  L+R GYFDG +  Y+N+  +
Sbjct: 336 RAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDGKTSPYRNITWS 395

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPL--NTGNIKTLALVGPHANATKAMIGNYEG 418
           ++ +     L+ EAA Q IVLLKND G LPL   + + KT+AL+GP ANAT  M+GNY G
Sbjct: 396 DVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQMLGNYYG 454

Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
                 SP+  F      I Y  G       +++    A+  AK AD  +   G+D ++E
Sbjct: 455 PAPYLISPLQAFQDSEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLE 514

Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
            E +DR ++  P  Q  LI K+AD  K P+ ++ M  G VD +  KNN  + +++W GYP
Sbjct: 515 TEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYP 573

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI-PYTSMPLRPVNNFPGRTYKFFDGPV 597
           G+ GG+A+AD+I GK  P  RL  T Y A Y ++ P   M LRP  + PG+TY ++ G  
Sbjct: 574 GQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGSNPGQTYMWYTGTP 633

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           VY FG+GL YT F    ++S  +      K++   +I+  +G  +P     L++ +    
Sbjct: 634 VYEFGHGLFYTNFTASASASSGT------KNRTSFNIDEVLG--RPHLGYKLVEQMPL-- 683

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI-KQVIGYERVFIAAGQSAKVGFTMN 716
               F ++V+N G        M +         H  K ++G++R+      SAK      
Sbjct: 684 --LNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEPGSAKTMVIPV 741

Query: 717 ACKSLKIVDNAANSLLASGAHTI 739
              SL   D   N +L  G + +
Sbjct: 742 TVDSLARTDEEGNRVLYPGRYEV 764


>gi|367028614|ref|XP_003663591.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
 gi|347010860|gb|AEO58346.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
          Length = 760

 Score =  412 bits (1060), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 270/743 (36%), Positives = 395/743 (53%), Gaps = 58/743 (7%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD       RA  LV  M   EK+  + + + GV RLGL  Y+WW+EALHGV+    R  
Sbjct: 39  CDTSASPGARAAALVSVMNNNEKLANLVNNSPGVSRLGLSAYQWWNEALHGVAH--NR-- 94

Query: 75  SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
              G  +  E   AT FP  I T+A+F+++L ++IG  +STEARA  N G A L FW+PN
Sbjct: 95  ---GITWGGEFSAATQFPQAITTSATFDDALIEQIGTIISTEARAFANNGRAHLDFWTPN 151

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           +N  RDPRWGR  ETPGED +   ++A  +V+G+Q       HR        + A CKHY
Sbjct: 152 VNPFRDPRWGRGHETPGEDAFKNKKWAEAFVKGMQGPGPT--HR--------VIATCKHY 201

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           AAYDL+N     RF+FD++V+ QD+ E ++ PF+ C  +  V S+MCSYN VN IP CA+
Sbjct: 202 AAYDLENSGSTTRFNFDAKVSTQDLAEYYLPPFQQCARDSKVGSIMCSYNAVNEIPACAN 261

Query: 255 PKLLNQTIRGDWNF---HGYIVSDCDSIQTIVES---HKFLNDTKEDAVARVLKAGLDLD 308
           P L++  +R  WN+   H YIVSDCD++  +  +   H++   +   A+   L+AG D  
Sbjct: 262 PYLMDTILRKHWNWTDEHQYIVSDCDAVYYLGNANGGHRY-KPSYAAAIGASLEAGCDNM 320

Query: 309 CGDYYTNFT----MGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNIC 363
           C  + T  T      A   G+ ++  +DT++      L+  GYFDG    Y+NL   ++ 
Sbjct: 321 C--WATGGTAPDPASAFNSGQFSQTTLDTAILRQMQGLVLAGYFDGPGGMYRNLSVADVN 378

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNT-GNIKTLALVGPHANATKAMIGNYEGTPCR 422
                + A +AA  GIVLLKND G LPL+  G+   +A++G  ANA   M+G Y G+P  
Sbjct: 379 TQTAQDTALKAAEGGIVLLKND-GILPLSVNGSNFQVAMIGFWANAADKMLGGYSGSPPF 437

Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
              P+    +    +NY  G    + Q N    AA++AA+ ++A V   G+D +VE E +
Sbjct: 438 NHDPVTAARSMGITVNYVNGP---LTQPNGDTSAALNAAQKSNAVVFFGGIDNTVEKESQ 494

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
           DR  +  P  Q  LI ++A+  K PV +V+     VD     + P +++ILW GYPG++G
Sbjct: 495 DRTSIEWPSGQLALIRRLAETGK-PV-IVVRLGTHVDDTPLLSIPNVRAILWAGYPGQDG 552

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
           G A+  +I G  +P GRLP T Y ++Y  + P+T+M LRP +++PGRTY+++    V+PF
Sbjct: 553 GTAVVKIITGLASPAGRLPATVYPSSYTSQAPFTNMALRPSSSYPGRTYRWYSN-AVFPF 611

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           G+GL YT F   V   P S  I  D    C D            +   +D         +
Sbjct: 612 GHGLHYTNFSVSVRDFPASFAIA-DLLASCGD------------SVAYLDLCPFP----S 654

Query: 662 FQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
             + V N G      V + + S   G +   IK +  Y+RVF       +V       +S
Sbjct: 655 VSLNVTNTGTRVSDYVALGFLSGDFGPSPHPIKTLATYKRVFNIEPGETQVAELDWKLES 714

Query: 721 LKIVDNAANSLLASGAHTILVGE 743
           L  VD   N +L  G +T+LV +
Sbjct: 715 LVRVDEKGNRVLYPGTYTLLVDQ 737


>gi|116197206|ref|XP_001224415.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
 gi|88181114|gb|EAQ88582.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
          Length = 735

 Score =  412 bits (1060), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 261/724 (36%), Positives = 394/724 (54%), Gaps = 66/724 (9%)

Query: 47  GVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLW 106
           GV RLGL  Y+WW+EALHGV+    R     G  +  +   AT FP  I ++A+F++ L 
Sbjct: 47  GVSRLGLSAYQWWNEALHGVAH--NR-----GITWGGQFSAATQFPQAITSSAAFDDHLI 99

Query: 107 KKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVR 166
           ++IG  +STEARA  N G A L FW+PN+N  RDPRWGR  ETPGED +   ++A  +V+
Sbjct: 100 ERIGVIISTEARAFANNGRAHLDFWTPNVNPFRDPRWGRGHETPGEDAFRNKKWAEAFVQ 159

Query: 167 GLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILP 226
           G+Q  E    HR        + A CKHYAAYDL+N     RF+FD++V+ QD+ E ++ P
Sbjct: 160 GMQGTEST--HR--------VIATCKHYAAYDLENSGSTTRFNFDAKVSTQDLAEYYLPP 209

Query: 227 FEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIV 283
           F+ C  +  V S+MCSYN VNG+P CA P L++  +R  WN+   + Y+VSDCD++  + 
Sbjct: 210 FQQCARDSKVGSIMCSYNAVNGVPACASPYLMDTILRKHWNWTDQNQYVVSDCDAVYYLG 269

Query: 284 ES---HKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTM----GAVQQGKIAEADIDTSLR 336
            +   H++   +   A+   L+AG D  C  + T  T      A    +  +A +D ++ 
Sbjct: 270 NANGGHRY-KSSYAAAIGASLEAGCDNMC--WATGGTTPDPASAFNSRQFTQATLDKAML 326

Query: 337 FLYIVLMRLGYFDG-SPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGN 395
                L++ GYFDG +  Y+NL   ++      + A +AA +GIVLLKNDN  LPL  G 
Sbjct: 327 RQMQGLVKAGYFDGPNSLYRNLTAADVNTQVARDTALKAAEEGIVLLKNDN-ILPLTLGG 385

Query: 396 IKT-LALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMI 454
             T +A++G  ANA   M+G Y G+P     P+    +    +NY  G    + Q N+  
Sbjct: 386 SNTQVAMIGFWANAADKMLGGYSGSPPFSHDPVTAARSMGITVNYVNGP---LTQTNADT 442

Query: 455 PAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMS 514
            AA++AA+ +   +   G+D +VE E +DR  +  P  Q  +I ++A   K PV +V M 
Sbjct: 443 SAAVNAAQKSSVVIFFGGIDNTVEKESQDRTSIAWPSGQLTMIQRLAQTGK-PVIVVRMG 501

Query: 515 AGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIP 573
              VD     + P +K+ILW GYPG++GG A+ ++I G  +P GRLP+T Y ++Y  + P
Sbjct: 502 T-HVDDTPLLSIPNVKAILWAGYPGQDGGTAVMNLITGLASPAGRLPVTVYPSSYTNQAP 560

Query: 574 YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRD 633
           YT+M LRP +++PGRTY+++  P V+PFG+GL YT F       P +  I  D    C+ 
Sbjct: 561 YTNMALRPSSSYPGRTYRWYKDP-VFPFGHGLHYTNFSVAPLDFPATFSIA-DLLASCKG 618

Query: 634 INYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHI 692
           + Y      P                 +  + V N G      VV+ + +   G     I
Sbjct: 619 VTYLELCPFP-----------------SVSVSVTNTGSRASDYVVLGFLAGDFGPTPRPI 661

Query: 693 KQVIGYERVF-IAAG--QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE-GVGGV 748
           K +  Y+RVF +  G  QSA++ + +   +SL  VD   N +L  G +T+L+ +  +  +
Sbjct: 662 KSLATYKRVFDVQPGKTQSAELDWKL---ESLARVDGKGNRVLYPGTYTLLLDQPTLANI 718

Query: 749 SFPL 752
           +F L
Sbjct: 719 TFTL 722


>gi|253579611|ref|ZP_04856880.1| glycoside hydrolase, family 3 domain-containing protein
           [Ruminococcus sp. 5_1_39B_FAA]
 gi|251849112|gb|EES77073.1| glycoside hydrolase, family 3 domain-containing protein
           [Ruminococcus sp. 5_1_39BFAA]
          Length = 706

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 268/759 (35%), Positives = 384/759 (50%), Gaps = 114/759 (15%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           ++A+ LV +MTL EK  Q+   A  V RLG+P Y +W+EALHGV+  G            
Sbjct: 13  KKAEKLVSQMTLLEKASQLKYDAAPVKRLGVPAYNYWNEALHGVARAGV----------- 61

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
                AT FP  I   A F++   KK+G  ++TE RA YN  +A        GLTFWSPN
Sbjct: 62  -----ATMFPQAIAMAAVFDDEEMKKVGDIIATEGRAKYNAYSAKEDRDIYKGLTFWSPN 116

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           +N+ RDPRWGR  ET GEDPY+  R  + +V G+Q           D   +K +AC KHY
Sbjct: 117 VNIFRDPRWGRGHETYGEDPYLTSRLGVKFVEGIQ----------GDGPVMKAAACAKHY 166

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A +   +   + R  FD++ + +DM ET++  FE  V E DV +VM +YNR NG P CA 
Sbjct: 167 AVH---SGPESLRHEFDAQASMKDMWETYLPAFEALVTEADVEAVMGAYNRTNGEPCCAH 223

Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
             L+   +RG W F G+  SDC +I+   E H  +  T   + A  L AG DL+CG+ Y 
Sbjct: 224 KYLMEDVLRGKWKFEGHYTSDCWAIRDFHE-HHMVTSTPRQSAAMALNAGCDLNCGNTYL 282

Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEA 374
           +  MGA Q G + E  I  S   L      LG FDGS +Y  +  + +   +HI+ A + 
Sbjct: 283 HM-MGAYQDGLVTEEKITESAVRLLTTRYLLGLFDGS-EYDKIPYSVVECKEHIDEALKM 340

Query: 375 ARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYS 434
           AR+  VLLKND G LP++   + T+ ++GP+A++  A+IGNY GT   Y + ++G    +
Sbjct: 341 ARKSCVLLKND-GVLPIDKTKVNTIGVIGPNADSRAALIGNYHGTSSEYITVLEGIREEA 399

Query: 435 K---VINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----- 480
                I Y+ GC       + +  +   I  A+  A+N+D  ++  GL+ ++E E     
Sbjct: 400 GDDVRILYSQGCDLYKDKVENLAWDQDRISEAVITAENSDVVILCVGLNETLEGEEGDTG 459

Query: 481 ----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
                 D+VDL LP  Q ELI KV    K P  +V+M+  A+D+N+A++N     IL   
Sbjct: 460 NSDASGDKVDLHLPKVQEELIEKVTAVGK-PTIVVLMAGSAIDLNYAQDN--CNGILLAW 516

Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
           YPG  GGRAIAD++FGK +P G+LPIT+Y+          MP     +   RTY++ +  
Sbjct: 517 YPGARGGRAIADLLFGKESPSGKLPITFYK------DLEGMPEFTDYSMKNRTYRYMEKE 570

Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
            +YPFGYGL+Y+      A     V  + D                              
Sbjct: 571 ALYPFGYGLTYSDTCVTEAEVVGEVSAESD------------------------------ 600

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSK----PPGIAGTHIKQVIGYERVFIAAGQSAKVG 712
                 +  V+N G +D  EVV VY K    P  +    +    G++RV + AG+   V 
Sbjct: 601 ---IVLKATVKNNGTVDTDEVVQVYIKDLDSPLAVRNYSL---CGFKRVSLKAGEEKSVE 654

Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFP 751
           FT++  K++ IVD   N  +A G H  L      GVS P
Sbjct: 655 FTISN-KAMNIVDEDGNRYIA-GKHFRL----FAGVSQP 687


>gi|292495281|sp|C0STH4.1|XYND_ASPAC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|225878711|dbj|BAH30675.1| beta-xylosidase [Aspergillus aculeatus]
          Length = 805

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 270/746 (36%), Positives = 391/746 (52%), Gaps = 52/746 (6%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD+     +RA  LV   TL E +   G+ + GVPRLGLP Y+ WSEALHG   +GR   
Sbjct: 60  CDSTASPYDRAAALVSLFTLEELIANTGNTSPGVPRLGLPPYQVWSEALHG---LGRANF 116

Query: 75  SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
           +  G        G  SFP+ IL+ A+FN +L  +I   +ST+ RA  N G  GL  +SPN
Sbjct: 117 TDNGALH----AGRPSFPSPILSAAAFNRTLINQIASIISTQGRAFNNAGRFGLDVYSPN 172

Query: 135 INVVRDPRWGRVLETPGEDPYVV-GRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           IN  R P WGR  ETPGED Y +   YA  Y+ G+Q     E+        LK++A  KH
Sbjct: 173 INTFRHPVWGRGQETPGEDAYTLTAAYAYEYITGIQGGVNPEH--------LKLAATAKH 224

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           +A YD++NW+ + R   D  +T+QD+ E +   F +   +  V S MCSYN VNG+P+C+
Sbjct: 225 FAGYDIENWDNHSRLGNDVNITQQDLAEYYTPQFLVAARDAHVHSFMCSYNAVNGVPSCS 284

Query: 254 DPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
           +   L   +R  ++F  HGY+  DC ++  +   H +  + +  A A  + AG D+DCG 
Sbjct: 285 NTFFLQTLLRDTFSFVDHGYVSGDCGAVYGVFNPHGYAAN-EPSAAADAILAGTDIDCGT 343

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG----SPQYKNLGKNNICNPQH 367
            Y      ++  G +A  DI+     LY  L+ LGYFDG    S  Y++LG  ++     
Sbjct: 344 SYQYHFNESITTGAVARDDIERGFIRLYANLVELGYFDGNSSSSNPYRSLGWPDVQKTDA 403

Query: 368 IELAAEAARQGIVLLKNDNGALPLNT---GNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
             ++ EAA +GIVLLKND G LPL +   G  K++AL+GP ANAT  + GNY G      
Sbjct: 404 WNISYEAAVEGIVLLKND-GTLPLASPSEGKNKSIALIGPWANATTQLQGNYYGDAPYLI 462

Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           SP+D F A    ++YAPG  +I   + +   AA+ AA+ AD  V + G+D ++EAE +DR
Sbjct: 463 SPVDAFTAAGYTVHYAPGT-EISTNSTANFSAALSAARAADTIVFLGGIDNTIEAEAQDR 521

Query: 485 VDLLLPGFQTELINKVA--DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
             +  PG Q ELI+++A   +   P+ +  M  G VD +  K N K+ ++LW GYPG+ G
Sbjct: 522 SSIAWPGNQLELISQLAAQKSDDQPLVVYQMGGGQVDSSSLKFNAKVNALLWGGYPGQSG 581

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRP--VNNFPGRTYKFFDGPVVY 599
           G A+ D++ G   P GRL  T Y A Y +      M LRP      PG+TY ++ G  VY
Sbjct: 582 GLALRDILTGARAPAGRLTTTQYPAAYAESFSALDMNLRPNETTQNPGQTYMWYTGEPVY 641

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
            FG+GL YT F    ASS ++   K           YT        AA        +   
Sbjct: 642 AFGHGLFYTTFN---ASSAQAAKTK-----------YTFNITDLTSAAHPDTTTVGQRTL 687

Query: 660 FTFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVFIAA--GQSAKVGFTM 715
           F F   + N G+ D     +VY  +   G +    K ++G++R+   A  G +A++   +
Sbjct: 688 FNFTASITNSGQRDSDYTALVYANTSTAGPSPYPNKWLVGFDRLAAVAKEGGTAELNVPV 747

Query: 716 NACKSLKIVDNAANSLLASGAHTILV 741
            A   L  VD A N++L  G + + +
Sbjct: 748 -AVDRLARVDEAGNTVLFPGRYEVAL 772


>gi|413919686|gb|AFW59618.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 475

 Score =  411 bits (1056), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 207/450 (46%), Positives = 291/450 (64%), Gaps = 17/450 (3%)

Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKN 356
           V  AGLDL+CG +    T+ AVQ GK++E+D+D ++    + LMRLG+FDG P+   + N
Sbjct: 28  VAAAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGN 87

Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
           LG +++C P + ELA EAARQGIVLLKN  G LPL+  +IK++A++GP+ANA+  MIGNY
Sbjct: 88  LGPSDVCTPSNQELAREAARQGIVLLKN-TGKLPLSATSIKSMAVIGPNANASFTMIGNY 146

Query: 417 EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDL 475
           EGTPC+YT+P+ G  A    + Y PGC ++ C  NS+ + AA  AA +AD TV+V G D 
Sbjct: 147 EGTPCKYTTPLQGLGANVATV-YQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQ 205

Query: 476 SVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
           S+E E  DR  LLLPG Q +L++ VA+A+ GP  LV+MS G  DI+FAK++ KI +ILWV
Sbjct: 206 SIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWV 265

Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFF 593
           GYPGE GG AIADV+FG +NP GRLP+TWY  ++ K+P T M +R  P   +PGRTY+F+
Sbjct: 266 GYPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTKVPMTDMRMRPDPSTGYPGRTYRFY 325

Query: 594 DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
            G  VY FG GLSYT F + + S+PK + ++L +   C             C +V  +  
Sbjct: 326 TGDTVYAFGDGLSYTSFAHHLVSAPKQLALQLAEGHACLTEQ---------CPSVEAEGA 376

Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
            C+   F   + V N G+  G   V ++S PP +     K ++G+E+V +  GQ+  V F
Sbjct: 377 HCEGLAFDVHLRVRNAGERSGGHTVFLFSSPPAVHNAPAKHLLGFEKVSLEPGQAGVVAF 436

Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVGE 743
            ++ CK L +VD   N  +A G+HT+ VG+
Sbjct: 437 KVDVCKDLSVVDELGNRKVALGSHTLHVGD 466


>gi|194400335|gb|ACF61038.1| beta-xylosidase [Aspergillus awamori]
          Length = 804

 Score =  410 bits (1055), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 261/702 (37%), Positives = 378/702 (53%), Gaps = 47/702 (6%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD      +RA  L+   TL E +   G+   GV RLGLP Y+ WSEALHG+    R   
Sbjct: 69  CDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD---RANF 125

Query: 75  SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
           S  G++       ATSFP  ILTTA+ N +L  +I   +ST+ RA  N G  GL  ++PN
Sbjct: 126 SDSGSY-----NWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPN 180

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           IN  R P WGR  ETPGED  +   YA  Y+ G+Q         D DS  LK++A  KHY
Sbjct: 181 INTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQG-------PDPDSN-LKLAATAKHY 232

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A YD++NW  + R   D  +T+QD+ E +   F +   +  V SVMC+YN VNG+P CAD
Sbjct: 233 AGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACAD 292

Query: 255 PKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
              L   +R  + F  HGY+ SDCD+   I   H + + ++  A A  + AG D+DCG  
Sbjct: 293 SYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDCGTT 351

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-----YKNLGKNNICNPQH 367
           Y      ++  G ++  DI+  +  LY  L++ GYFD +       Y++L  +++     
Sbjct: 352 YQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDA 411

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTG----NIKTLALVGPHANATKAMIGNYEGTPCRY 423
             ++ +AA QGIVLLKN N  LPL       +  T+AL+GP ANAT  ++GNY G     
Sbjct: 412 WNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYM 471

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
            SP   F      +N+A G   I   + S   AA+ AA++AD  +   G+D ++EAE  D
Sbjct: 472 ISPRAAFEEAGYKVNFAEGTG-ISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAEALD 530

Query: 484 RVDLLLPGFQTELINKVADAA-KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
           R  +  PG Q +LI K+A +A   P+ ++ M  G VD +  KNN  + ++LW GYPG+ G
Sbjct: 531 RESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPGQSG 590

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
           G A+ D+I GK NP GRL  T Y A+Y  + P T M LRP  + PG+TYK++ G  VY F
Sbjct: 591 GFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDNPGQTYKWYTGEAVYEF 650

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           G+GL YT F  + +S+  + ++KL+     +DI   +       A++    V        
Sbjct: 651 GHGLFYTTFA-ESSSNTTTKEVKLN----IQDI---LSQTHEELASITQLPV------LN 696

Query: 662 FQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERV 701
           F   ++N GK++     MV++     G A   +K ++G++R+
Sbjct: 697 FTANIKNTGKLESDYTAMVFANTSDAGPAPYPVKWLVGWDRL 738


>gi|336425135|ref|ZP_08605165.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336013044|gb|EGN42933.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 705

 Score =  410 bits (1054), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 255/749 (34%), Positives = 383/749 (51%), Gaps = 105/749 (14%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           E+A +LV +MTL EK  Q+   A  +PRLG+P Y WW+EALHGV+  G            
Sbjct: 9   EKAHELVSQMTLEEKASQLRYDAPAIPRLGVPTYNWWNEALHGVARAGV----------- 57

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN--------AGLTFWSPN 134
                ATSFP  I   A+F++ L K +G  V+ E RA YN  +         GLTFWSPN
Sbjct: 58  -----ATSFPQAIAMAAAFDDELLKTVGDAVAAEGRAKYNEYSRHDDRDIYKGLTFWSPN 112

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           +N+ RDPRWGR  ET GEDPY+  R  + YV GLQ  +        D   +K +AC KH+
Sbjct: 113 VNIFRDPRWGRGHETYGEDPYLTSRLGVAYVEGLQGSQ--------DDDFMKTAACAKHF 164

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A +   +   + R  FD++ +++DM ET++  FE CV E  V +VM +YNR NG P C  
Sbjct: 165 AVH---SGPESVRHEFDAQASKKDMYETYLPAFEACVKEAGVEAVMGAYNRTNGEPCCGS 221

Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
           P L+   +R +W+F G+ VSDC +I      H  +  T E++ A  LK+G D++CG  Y 
Sbjct: 222 PTLIQNILREEWDFQGHYVSDCWAIADF-HMHHMVTKTPEESAALALKSGCDVNCGVTYL 280

Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEA 374
           +  + A QQG + E +I  +   L+     LG FD + +Y ++    +   +H+ELA + 
Sbjct: 281 HL-LKAYQQGLVTEEEITQAAERLFTTRFLLGCFDKN-EYDDIPYEVVECKEHLELAQKM 338

Query: 375 ARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG---FY 431
           A++ +VLLKND G LPLN   +KT+ ++GP+A++   ++GNY GT  RY + ++G   F 
Sbjct: 339 AKESMVLLKND-GILPLNKDGLKTIGVIGPNADSRTPLVGNYHGTSSRYITLLEGIQDFV 397

Query: 432 AYSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----- 480
                + Y+ GC       + +      I  A+  A+++D  V+  GLD ++E E     
Sbjct: 398 GEDVRVYYSEGCHIYKDRVEGLGWKQDRISEALTVAEHSDVVVLCLGLDENLEGEEGDTG 457

Query: 481 ----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
                 D+ DL LP  Q EL+  VA   K PV L +MS  A+D+ FA  +  + +IL V 
Sbjct: 458 NSYASGDKKDLELPESQRELLEAVAGCGK-PVVLCMMSGSAIDMQFAAEH--VNAILQVW 514

Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
           YPG  GG+A A+++FG  +P G+LP+T+Y+           P     +  GRTY++ +  
Sbjct: 515 YPGARGGKAAAEILFGACSPSGKLPVTFYK------DLEGFPAFEDYSMKGRTYRYLEKE 568

Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
            +YPFGYGL+Y Q   K A    +V+                                 +
Sbjct: 569 PLYPFGYGLTYGQVCVKAAELTGAVE---------------------------------E 595

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSK---PPGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
             + T +  VEN GK D  +V+ VY K          H   +  ++RV +  G+ A++  
Sbjct: 596 GKELTIKAMVENSGKYDTDDVIQVYIKDLDSKNAVPNH--SLCAFKRVSLKKGEKAEILL 653

Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVG 742
            +   ++L  VD      + S    + VG
Sbjct: 654 KV-PYEALMAVDEEGKKYVDSSHFVLSVG 681


>gi|297745533|emb|CBI40698.3| unnamed protein product [Vitis vinifera]
          Length = 461

 Score =  410 bits (1054), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 205/383 (53%), Positives = 267/383 (69%), Gaps = 12/383 (3%)

Query: 120 MYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD 179
           MYN+G AGLTFWSPN+N+ RDPRWGR  ETPGEDP +  +YA  YVRGLQ  +      D
Sbjct: 1   MYNVGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVRGLQQSD------D 54

Query: 180 SDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
                LKI+ACCKHY AYDLDNW+G DRFHF++ VT+QDM +TF  PF+ CV +G+V+SV
Sbjct: 55  GSPDRLKIAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTFQPPFKSCVIDGNVASV 114

Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
           MCSYN+VNG P CADP LL+  +RG+W  +GYIVSDCDS+     S  +   T E+A A+
Sbjct: 115 MCSYNQVNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYNSQHY-TKTPEEAAAK 173

Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKN 356
            + AGLDL+CG +    T  AV+ G + E+ +D ++   +  LMRLG+FDG+P    Y  
Sbjct: 174 AILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIYGK 233

Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
           LG  ++C  +H ELA EAARQGI+LLKN  G+LPL+   IKTLA++GP+AN TK MIGNY
Sbjct: 234 LGPKDVCTLEHQELAREAARQGIMLLKNSKGSLPLSPTAIKTLAIIGPNANVTKTMIGNY 293

Query: 417 EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
           EGTPC+YT+P+ G  A      Y  GC+++ C + + I  A   A  ADATV++ G+D S
Sbjct: 294 EGTPCKYTTPLQGLMALV-ATTYLSGCSNVAC-STAQIDEAKKIAAAADATVLIVGIDQS 351

Query: 477 VEAEGKDRVDLLLPGFQTELINK 499
           +EAEG+DRV++ LPG Q  LI +
Sbjct: 352 IEAEGRDRVNIQLPGQQPLLITE 374


>gi|242771939|ref|XP_002477942.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
 gi|218721561|gb|EED20979.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
          Length = 797

 Score =  410 bits (1054), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 253/631 (40%), Positives = 358/631 (56%), Gaps = 31/631 (4%)

Query: 3   ESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEA 62
           + I   L D   C+  + Y ERA+ L+   TL E +    + A GVPRLGLP Y+ WSE 
Sbjct: 52  DCINGPLKDNIVCNTSVNYVERAEGLISLFTLEELINNTQNSAPGVPRLGLPPYQVWSEG 111

Query: 63  LHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
           LHG+     R N         E   ATSFP  IL+ A+ N +L  +I   ++T+ARA  N
Sbjct: 112 LHGLD----RAN---WAKSGEEWKWATSFPMPILSMAALNRTLINQIASIIATQARAFNN 164

Query: 123 LGNAGLTFWSPNINVVRDPRWGRVLETPGEDP-YVVGRYAINYVRGLQDVEGVEYHRDSD 181
           +G  GL  ++PNIN  R P WGR  ETPGED  ++   YA  Y+ GLQ   GV      D
Sbjct: 165 VGRYGLDAYAPNINGFRSPLWGRGQETPGEDAGFLSSSYAYEYITGLQG--GV------D 216

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
              LKI A  KH+A YDL+NW  N R  FD+ +T+QD+ E +   F          S MC
Sbjct: 217 PEHLKIVATAKHFAGYDLENWNNNSRLGFDASITQQDLAEYYTPQFLAASRYAKARSFMC 276

Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
           SYN VNG+P+C+   LL   +R +W+F  +GY+ SDCD+   +   H +  +    A A 
Sbjct: 277 SYNSVNGVPSCSSSFLLQTLLRENWDFPDYGYVSSDCDAAYNVFNPHGYAINISA-AAAD 335

Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLG 358
            L+AG D+DCG  Y  +   +  +G +   +I+ SL  LY  L++LGYFDG+  +Y+ LG
Sbjct: 336 SLRAGTDIDCGQTYPWYLNQSFIEGSVTRGEIERSLIRLYSNLVKLGYFDGNQSEYRQLG 395

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
            N++       ++ EAA +GIVLLKND G LPL+   +K++A++GP ANAT+ + GNY G
Sbjct: 396 WNDVVATDAWNISYEAAVEGIVLLKND-GVLPLSE-KLKSVAVIGPWANATQQLQGNYFG 453

Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
                 +P+         +NYA G  +I+        AA+ AAK +D  + + G+D ++E
Sbjct: 454 PAPYLITPLQAARDAGYKVNYAFGT-NILGNTTDGFAAALSAAKKSDVIIYLGGIDNTIE 512

Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
           AEG DR+++  PG Q +LI +++   K P+ ++ M  G VD +  K+N  + +++W GYP
Sbjct: 513 AEGTDRMNVTWPGNQLDLIQQLSQTGK-PLVVLQMGGGQVDSSSLKSNNNVNALVWGGYP 571

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRP-VNNFPGRTYKFFDGP 596
           G+ GG+AI D++ GK  P GRL  T Y A Y  + P T M LRP   + PG+TY ++ G 
Sbjct: 572 GQSGGKAIFDILSGKRAPAGRLVTTQYPAEYATQFPATDMNLRPDGKSNPGQTYIWYTGK 631

Query: 597 VVYPFGYGLSYTQFK---YKVASSPKSVDIK 624
            VY FGY L YT FK    K+ASS  S DI 
Sbjct: 632 PVYEFGYALFYTTFKETAEKLASS--SFDIS 660


>gi|391872736|gb|EIT81831.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
          Length = 798

 Score =  410 bits (1053), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 258/743 (34%), Positives = 383/743 (51%), Gaps = 47/743 (6%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD      +RA  LV  +T  E V    +  +G PR+GLP Y+ W+EALHGV+ 
Sbjct: 57  LSKTLVCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVA- 115

Query: 69  IGRRTNSPPGTHFDSEVPG----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
                      H D    G    +TSFP  I T A+ N +L  +I   +ST+ RA  N G
Sbjct: 116 -----------HADFSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAG 164

Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSR 183
             GL  +SPNIN  R P WGR  ETPGED Y +   YA  Y+ G+Q   GV      D+ 
Sbjct: 165 RYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQG--GV------DAN 216

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
           PLK+ A  KHYA YD++NW+ + R   D ++T+QD+ E +   F +   +  V SVMCSY
Sbjct: 217 PLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSY 276

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
           N VNG+P+C++   L   +R  ++F   GY+  DC ++  +   H +  + +  A A  +
Sbjct: 277 NAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATN-ESSAAADSI 335

Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKN 360
           +AG D+DCG  Y      +    +++  D++  +  LY  L+R GYFDG +  Y+N+  +
Sbjct: 336 RAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDGKTSPYRNITWS 395

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPL--NTGNIKTLALVGPHANATKAMIGNYEG 418
           ++ +     L+ EAA Q IVLLKND G LPL   + + KT+AL+GP ANAT  M+GNY G
Sbjct: 396 DVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQMLGNYYG 454

Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
                 SP+  F      I Y  G       +++    A+  AK AD  +   G+D ++E
Sbjct: 455 PAPYLISPLQAFQDSEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLE 514

Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
            E +DR ++  P  Q  LI K+AD  K P+ ++ M  G VD +  KNN  + +++W GYP
Sbjct: 515 TEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYP 573

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI-PYTSMPLRPVNNFPGRTYKFFDGPV 597
           G+ GG+A+AD+I GK  P  RL  T Y A Y ++ P   M LRP  + PG+TY ++ G  
Sbjct: 574 GQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGSNPGQTYMWYTGTP 633

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           VY FG+GL YT F    ++   +      K++   +I+  +G  +P     L++ +    
Sbjct: 634 VYEFGHGLFYTNFTASASAGSGT------KNRTSFNIDEVLG--RPHPGYKLVEQMPL-- 683

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI-KQVIGYERVFIAAGQSAKVGFTMN 716
               F ++V+N G        M +         H  K ++G++R+      SAK      
Sbjct: 684 --LNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEPGSAKTMVIPV 741

Query: 717 ACKSLKIVDNAANSLLASGAHTI 739
              SL   D   N +L  G + +
Sbjct: 742 TVDSLARTDEEGNRVLYPGRYEV 764


>gi|358385386|gb|EHK22983.1| glycoside hydrolase family 3 protein [Trichoderma virens Gv29-8]
          Length = 795

 Score =  410 bits (1053), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 275/759 (36%), Positives = 398/759 (52%), Gaps = 57/759 (7%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD+   Y ERA+ L+   TL E +    +   GVPRLGLP Y+ W+EALHG+    R   
Sbjct: 63  CDSSAGYAERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLD---RANF 119

Query: 75  SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
           +  G  F      ATSFP  IL+ A+ N +L  +I   +ST+ARA  N G  GL  ++PN
Sbjct: 120 ATKGGQFQ----WATSFPMPILSMAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPN 175

Query: 135 INVVRDPRWGRVLETPGEDPYVV-GRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           IN  R P WGR  ETPGED  V+   Y   Y+ G+Q   GV      D   LKI+A  KH
Sbjct: 176 INGFRSPLWGRGQETPGEDANVLTSAYTYEYITGMQG--GV------DPENLKIAATAKH 227

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           +A YDL+NW    R  FD+ +T+QD+ E +   F          S MC+YN VNG+P+CA
Sbjct: 228 FAGYDLENWNNQSRLGFDAIITQQDLSEYYTPQFLAASRYAKSHSFMCAYNSVNGVPSCA 287

Query: 254 DPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
           +   L   +R  W F   GY+ SDCD++  +   H +    +  A A  L+AG D+DCG 
Sbjct: 288 NSFFLQTLLRESWGFPEWGYVSSDCDAVYNVWNPHDYA-SNQSSAAASSLRAGTDIDCGQ 346

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELA 371
            Y      +   G+++  +I+ S+  LY  L+RLGYFD   +Y++LG  ++       ++
Sbjct: 347 TYPWHLNESFVAGEVSRGEIERSVTRLYANLVRLGYFDKKNEYRSLGWKDVVKTDAWNIS 406

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            EAA +GIVLLKND G LPL+   ++++AL+GP ANAT  M GNY G      SP++   
Sbjct: 407 YEAAVEGIVLLKND-GTLPLSK-KVRSIALIGPWANATTQMQGNYFGAAPYLISPLEAAK 464

Query: 432 AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPG 491
                +N+  G  +    + +    AI AAK +DA +   G+D +VE EG DR D+  PG
Sbjct: 465 KAGYQVNFELGT-ETASTSTAGFAKAIAAAKKSDAIIFAGGIDNTVEQEGADRTDIAWPG 523

Query: 492 FQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIF 551
            Q +LI ++++  K P+ ++ M  G VD +  K+N K+ S++W GYPG+ GG A+ D++ 
Sbjct: 524 NQLDLIKQLSELGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILS 582

Query: 552 GKYNPGGRLPITWYEANYV-KIPYTSMPLRP-VNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
           GK  P GRL  T Y A+YV + P   M LRP   + PG+TY ++ G  VY FG G+ YT 
Sbjct: 583 GKRAPAGRLVSTQYPADYVHQFPQNDMNLRPDGKSNPGQTYIWYTGKPVYQFGDGIFYTT 642

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
           FK  ++ S K +   +          YT     P                 TF   +EN 
Sbjct: 643 FKETLSGSSKGLKFNVSSVLAAPHPGYTYSEQTP---------------VLTFTANIENS 687

Query: 670 GKMDG--SEVVMVYSKPPGIAGTHIKQVIGYERV-FIAAGQSAKVGFTMNACKSLKIVDN 726
           GK D   S ++ V +   G A    K ++G++R+  I  G S+K+   +    +L  VD+
Sbjct: 688 GKTDSPYSAMLFVRTANAGPAPYPNKWLVGFDRLATIKPGHSSKLSIPI-PVSALARVDS 746

Query: 727 AANSLLASGAHTI-------------LVGEGVGGVSFPL 752
             N ++  G + +             LVGE V   ++PL
Sbjct: 747 LGNRIVYPGKYELALNTDESIKLEFELVGEEVTIENWPL 785


>gi|436410475|gb|AGB57183.1| beta-xylosidase [Aspergillus sp. BCC125]
          Length = 804

 Score =  409 bits (1052), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 262/702 (37%), Positives = 378/702 (53%), Gaps = 47/702 (6%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD      +RA  L+   TL E +   G+   GV RLGLP Y+ WSEALHG+    R   
Sbjct: 69  CDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD---RANF 125

Query: 75  SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
           S  G++       ATSFP  ILTTA+ N +L  +I   +ST+ RA  N G  GL  ++PN
Sbjct: 126 SDLGSY-----NWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPN 180

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           IN  R P WGR  ETPGED  +   YA  Y+ G+Q         D DS  LK++A  KHY
Sbjct: 181 INTFRHPVWGRGQETPGEDVSLAAIYAYEYITGIQG-------PDPDSN-LKLAATAKHY 232

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A YD++NW  + R   D  +T+QD+ E +   F +   +  V SVMC+YN VNG+P CAD
Sbjct: 233 AGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACAD 292

Query: 255 PKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
              L   +R  + F  HGY+ SDCD+   I   H + + ++  A A  + AG D+DCG  
Sbjct: 293 SYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDCGTT 351

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-----YKNLGKNNICNPQH 367
           Y      ++  G ++  DI+  +  LY  L++ GYFD +       Y++L  +++     
Sbjct: 352 YQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDA 411

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTG----NIKTLALVGPHANATKAMIGNYEGTPCRY 423
             ++ +AA QGIVLLKN N  LPL       +  T+AL+GP ANAT  ++GNY G     
Sbjct: 412 WNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYM 471

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
            SP   F      +N+A G   I   + S   AA+ AA++AD  +   G+D ++EAE  D
Sbjct: 472 ISPRAAFEEAGYNVNFAEGTG-ISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALD 530

Query: 484 RVDLLLPGFQTELINKVADAA-KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
           R  +  PG Q +LI K+A +A   P+ ++ M  G VD +  KNN  + ++LW GYPG+ G
Sbjct: 531 RESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSG 590

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
           G A+ D+I GK NP GRL  T Y A+Y  + P T M LRP  + PG+TYK++ G  VY F
Sbjct: 591 GFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDNPGQTYKWYTGEAVYEF 650

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           G+GL YT F  + +S+  + +IKL+     +DI   +       A++    V        
Sbjct: 651 GHGLFYTTFA-ESSSNTTTREIKLN----IQDI---LSQTHEDLASITQLPV------LN 696

Query: 662 FQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERV 701
           F   ++N GK++     MV++     G A   +K ++G++R+
Sbjct: 697 FTANIKNTGKVESDYTAMVFANTSDAGPAPYPVKWLVGWDRL 738


>gi|292495632|sp|Q0CMH8.2|XYND_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
          Length = 793

 Score =  409 bits (1052), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 249/614 (40%), Positives = 349/614 (56%), Gaps = 26/614 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD      +RA  LV   TL E V   G+   GVPRLGLP Y+ WSE+LHGV  
Sbjct: 57  LSKTLVCDKSARPHDRAAALVSMFTLEELVNNTGNTGTGVPRLGLPKYQVWSESLHGV-- 114

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
              R N       + +   ATSFP  ILT A+ N +L  +IG  +ST+ARA  N+G  GL
Sbjct: 115 --YRANWAS----EGDYSWATSFPQPILTMAALNRTLIHQIGDILSTQARAFSNVGRYGL 168

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPY-VVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
             ++PNIN  R P WGR  ETPGED Y +   YA  Y+ G+Q   GV      D   LK+
Sbjct: 169 DTYAPNINSFRHPVWGRGQETPGEDAYYLASTYAYEYITGIQG--GV------DPETLKL 220

Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
            A  KHYA YD++NW+G+ R   D ++T+QD+ E +   F +   +  V SVMCSYN VN
Sbjct: 221 VATAKHYAGYDIENWDGHSRLGNDMQITQQDLSEYYTPQFLVSARDAKVHSVMCSYNAVN 280

Query: 248 GIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           G+P+C++   L   +R  + F   GY+  DC ++      H++  + +  A A  ++AG 
Sbjct: 281 GVPSCSNSFFLQTLLRETFGFVEDGYVSGDCGAVYNAFNPHEYAAN-ESSASADSIRAGT 339

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNICN 364
           D+DCG  Y      A  +G+I+  DI+  +  LY  L+RLGYFDG S QY++L  +++  
Sbjct: 340 DIDCGTSYQYHFTNAFDEGEISRQDIERGVIRLYTNLVRLGYFDGNSSQYRDLTWSDVQT 399

Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
                ++ EAA +G VLLKND G LPL   +I+++AL+GP ANAT  M GNY G     T
Sbjct: 400 TDAWNISHEAAVEGTVLLKND-GTLPL-ADSIRSVALIGPWANATTQMQGNYYGPAPYLT 457

Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           SP+    A    ++YA G  +I     +    A+ AA+ ADA +   G+D ++E E  DR
Sbjct: 458 SPLAALEASDLDVHYAFGT-NISSTTTAGFADALAAARKADAIIFAGGIDNTIEGEALDR 516

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
           +++  PG Q +LIN+++   K P+ ++ M  G VD +  K+N  + ++LW GYPG+ GG 
Sbjct: 517 MNITWPGNQLDLINQLSALGK-PLVVLQMGGGQVDSSALKHNTNVSALLWGGYPGQSGGT 575

Query: 545 AIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
           A+ D+I G   P GRL  T Y A Y  + P   M LRP    PG+TY ++ G  VY FG+
Sbjct: 576 ALLDIIRGVRAPAGRLVTTQYPAGYATQFPAIDMGLRPNGTNPGQTYMWYTGTPVYEFGH 635

Query: 604 GLSYTQFKYKVASS 617
           GL YT F+ K AS+
Sbjct: 636 GLFYTTFEAKRAST 649


>gi|150019484|ref|YP_001311738.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
           8052]
 gi|149905949|gb|ABR36782.1| glycoside hydrolase, family 3 domain protein [Clostridium
           beijerinckii NCIMB 8052]
          Length = 709

 Score =  409 bits (1052), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 260/748 (34%), Positives = 384/748 (51%), Gaps = 104/748 (13%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           E+AK+LV +MTL EK +Q+   +  V RL +P Y WW+E LHGV+  G            
Sbjct: 14  EKAKELVGKMTLEEKAEQLTYKSSAVKRLNVPRYNWWNEGLHGVARAGT----------- 62

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
                AT FP  I   A F++ L   I + +STE RA YN  +         G+TFWSPN
Sbjct: 63  -----ATVFPQAIGLAAMFDDELLNYIAKVISTEGRAKYNENSKKDDRDIYKGITFWSPN 117

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           +N+ RDPRWGR  ET GEDPY+  R  + +V+GLQ           + + LK +AC KH+
Sbjct: 118 VNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQ----------GEGKYLKAAACAKHF 167

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A +     EG  R  FD+ V+++D+ ET++  FE CV EGDV +VM +YNR NG P C  
Sbjct: 168 AVHS--GPEGL-RHEFDAVVSKKDLYETYLPAFEACVKEGDVEAVMGAYNRTNGEPCCGS 224

Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
             LL   +RG WNF G++VSDC +I      H+ +  T  ++ A  +K G DL+CG+ Y 
Sbjct: 225 KTLLRDILRGKWNFKGHVVSDCWAIADFHLHHR-VTSTATESAALAMKNGCDLNCGNVYL 283

Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLG-KNNICNPQHIELAAE 373
              + A ++G + E DI T+   L    +RLG FD   +Y  +  + N C  +H EL+ +
Sbjct: 284 QLLL-AYKEGLVTEEDITTAAERLMATRIRLGMFDEECEYNKIPYELNDCK-EHNELSLK 341

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY-- 431
           AAR  +VLLKN NG LPLN  N+K++A++GP+A++   + GNY GT  RY + ++G +  
Sbjct: 342 AARNSMVLLKN-NGILPLNKNNLKSIAVIGPNADSQIMLKGNYSGTASRYITVLEGIHEA 400

Query: 432 -AYSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE---- 480
                 + Y+ GC       + + + N  +  AI  A+ +D  ++  GLD ++E E    
Sbjct: 401 VGEDVRVYYSEGCHLFRDRVEELAEPNDRLKEAISIAERSDVAILCLGLDSTIEGEQGDA 460

Query: 481 -----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
                  D+  L LPG Q EL+ K+ +    PV LVI +  A+  N A++  K  +IL  
Sbjct: 461 GNSEGAGDKASLNLPGRQQELLEKIIETGT-PVILVIGAGSALTFNNAED--KCSAILDA 517

Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
            YPG  GGRA+AD+IFGK +P G+LPIT+Y           +P     +   RTY++   
Sbjct: 518 WYPGSRGGRAVADLIFGKCSPSGKLPITFYRNT------KDLPEFIDYSMKDRTYRYMSC 571

Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
             +YPFGYGL+Y+  K      P   D+K D                        +DV+ 
Sbjct: 572 ESLYPFGYGLTYSTVKLSELHVP---DVKSD-----------------------FEDVE- 604

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
                   +++ N G  D  EV+  Y K            + G++RV +  G+S K+   
Sbjct: 605 ------VSVKITNTGNFDIEEVIQCYIKDLESKYAVRNHSLAGFKRVRLKIGES-KIAKM 657

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
                S ++V++    +L S    + VG
Sbjct: 658 KIKKSSFEVVNDDGERILDSKRFKLFVG 685


>gi|145230215|ref|XP_001389416.1| exo-1,4-beta-xylosidase xlnD [Aspergillus niger CBS 513.88]
 gi|74626559|sp|O00089.2|XYND_ASPNG RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|292495287|sp|A2QA27.1|XYND_ASPNC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|2181180|emb|CAB06417.1| xylosidase [Aspergillus niger]
 gi|134055533|emb|CAK37179.1| xylosidase xlnD-Aspergillus niger
 gi|350638468|gb|EHA26824.1| hypothetical protein ASPNIDRAFT_205670 [Aspergillus niger ATCC
           1015]
          Length = 804

 Score =  409 bits (1052), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 262/702 (37%), Positives = 376/702 (53%), Gaps = 47/702 (6%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD      +RA  L+   TL E +   G+   GV RLGLP Y+ WSEALHG+    R   
Sbjct: 69  CDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD---RANF 125

Query: 75  SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
           S  G +       ATSFP  ILTTA+ N +L  +I   +ST+ RA  N G  GL  ++PN
Sbjct: 126 SDSGAY-----NWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPN 180

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           IN  R P WGR  ETPGED  +   YA  Y+ G+Q         D +S  LK++A  KHY
Sbjct: 181 INTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQG-------PDPESN-LKLAATAKHY 232

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A YD++NW  + R   D  +T+QD+ E +   F +   +  V SVMC+YN VNG+P CAD
Sbjct: 233 AGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPACAD 292

Query: 255 PKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
              L   +R  + F  HGY+ SDCD+   I   H + + ++  A A  + AG D+DCG  
Sbjct: 293 SYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDCGTT 351

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-----YKNLGKNNICNPQH 367
           Y      ++  G ++  DI+  +  LY  L++ GYFD +       Y++L  +++     
Sbjct: 352 YQWHLNESIAAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLETDA 411

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTG----NIKTLALVGPHANATKAMIGNYEGTPCRY 423
             ++ +AA QGIVLLKN N  LPL       +  T+AL+GP ANAT  ++GNY G     
Sbjct: 412 WNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYM 471

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
            SP   F      +N+A G   I   + S   AA+ AA++AD  +   G+D ++EAE  D
Sbjct: 472 ISPRAAFEEAGYKVNFAEGTG-ISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALD 530

Query: 484 RVDLLLPGFQTELINKVADAA-KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
           R  +  PG Q +LI K+A AA K P+ ++ M  G VD +  KNN  + ++LW GYPG+ G
Sbjct: 531 RESIAWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSG 590

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
           G A+ D+I GK NP GRL  T Y A+Y  + P T M LRP  + PG+TYK++ G  VY F
Sbjct: 591 GFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDNPGQTYKWYTGEAVYEF 650

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           G+GL YT F  + +S+  + ++KL+     +DI   +       A++    V        
Sbjct: 651 GHGLFYTTFA-ESSSNTTTKEVKLN----IQDI---LSQTHEDLASITQLPV------LN 696

Query: 662 FQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERV 701
           F   + N GK++     MV++     G A    K ++G++R+
Sbjct: 697 FTANIRNTGKLESDYTAMVFANTSDAGPAPYPKKWLVGWDRL 738


>gi|290889355|gb|ADD69953.1| xylosidase HistTag [synthetic construct]
          Length = 810

 Score =  409 bits (1052), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 262/702 (37%), Positives = 376/702 (53%), Gaps = 47/702 (6%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD      +RA  L+   TL E +   G+   GV RLGLP Y+ WSEALHG+    R   
Sbjct: 69  CDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD---RANF 125

Query: 75  SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
           S  G +       ATSFP  ILTTA+ N +L  +I   +ST+ RA  N G  GL  ++PN
Sbjct: 126 SDSGAY-----NWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPN 180

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           IN  R P WGR  ETPGED  +   YA  Y+ G+Q         D +S  LK++A  KHY
Sbjct: 181 INTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQG-------PDPESN-LKLAATAKHY 232

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A YD++NW  + R   D  +T+QD+ E +   F +   +  V SVMC+YN VNG+P CAD
Sbjct: 233 AGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPACAD 292

Query: 255 PKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
              L   +R  + F  HGY+ SDCD+   I   H + + ++  A A  + AG D+DCG  
Sbjct: 293 SYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDCGTT 351

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-----YKNLGKNNICNPQH 367
           Y      ++  G ++  DI+  +  LY  L++ GYFD +       Y++L  +++     
Sbjct: 352 YQWHLNESIAAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLETDA 411

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTG----NIKTLALVGPHANATKAMIGNYEGTPCRY 423
             ++ +AA QGIVLLKN N  LPL       +  T+AL+GP ANAT  ++GNY G     
Sbjct: 412 WNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYM 471

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
            SP   F      +N+A G   I   + S   AA+ AA++AD  +   G+D ++EAE  D
Sbjct: 472 ISPRAAFEEAGYKVNFAEGTG-ISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALD 530

Query: 484 RVDLLLPGFQTELINKVADAA-KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
           R  +  PG Q +LI K+A AA K P+ ++ M  G VD +  KNN  + ++LW GYPG+ G
Sbjct: 531 RESIAWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSG 590

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
           G A+ D+I GK NP GRL  T Y A+Y  + P T M LRP  + PG+TYK++ G  VY F
Sbjct: 591 GFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDNPGQTYKWYTGEAVYEF 650

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           G+GL YT F  + +S+  + ++KL+     +DI   +       A++    V        
Sbjct: 651 GHGLFYTTFA-ESSSNTTTKEVKLN----IQDI---LSQTHEDLASITQLPV------LN 696

Query: 662 FQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERV 701
           F   + N GK++     MV++     G A    K ++G++R+
Sbjct: 697 FTANIRNTGKLESDYTAMVFANTSDAGPAPYPKKWLVGWDRL 738


>gi|354508473|gb|AER26905.1| beta-xylosidase 3 [synthetic construct]
          Length = 778

 Score =  409 bits (1051), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 260/702 (37%), Positives = 378/702 (53%), Gaps = 47/702 (6%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD      +RA  L+   TL E +   G+   GV RLGLP Y+ WSEALHG+    R   
Sbjct: 43  CDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD---RANF 99

Query: 75  SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
           S  G++       ATSFP  ILTTA+ N +L  +I   +ST+ RA  N G  GL  ++PN
Sbjct: 100 SDSGSY-----NWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPN 154

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           IN  R P WGR  ETPGED  +   YA  Y+ G+Q         D DS  LK++A  KHY
Sbjct: 155 INTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQG-------PDPDSN-LKLAATAKHY 206

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A YD++NW  + R   D  +T+QD+ E +   F +   +  V SVMC+YN V+G+P CAD
Sbjct: 207 AGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVDGVPACAD 266

Query: 255 PKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
              L   +R  + F  HGY+ SDCD+   I   H + + ++  A A  + AG D+DCG  
Sbjct: 267 SYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDCGTT 325

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-----YKNLGKNNICNPQH 367
           Y      ++  G ++  DI+  +  LY  L++ GYFD +       Y++L  +++     
Sbjct: 326 YQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDA 385

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTG----NIKTLALVGPHANATKAMIGNYEGTPCRY 423
             ++ +AA QGIVLLKN N  LPL       +  T+AL+GP ANAT  ++GNY G     
Sbjct: 386 WNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYM 445

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
            SP   F      +N+A G   I   + S   AA+ AA++AD  +   G+D ++EAE  D
Sbjct: 446 ISPRAAFEEAGYKVNFAEGTG-ISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAEALD 504

Query: 484 RVDLLLPGFQTELINKVADAA-KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
           R  +  PG Q +LI K+A +A   P+ ++ M  G VD +  KNN  + ++LW GYPG+ G
Sbjct: 505 RESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPGQSG 564

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
           G A+ D+I GK NP GRL  T Y A+Y  + P T M LRP  + PG+TYK++ G  VY F
Sbjct: 565 GFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDNPGQTYKWYTGEAVYEF 624

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           G+GL YT F  + +S+  + ++KL+     +DI   +       A++    V        
Sbjct: 625 GHGLFYTTFA-ESSSNTTTKEVKLN----IQDI---LSQTHEELASITQLPV------LN 670

Query: 662 FQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERV 701
           F   ++N GK++     MV++     G A   +K ++G++R+
Sbjct: 671 FTANIKNTGKLESDYTAMVFANTSDAGPAPYPVKWLVGWDRL 712


>gi|3135209|dbj|BAA28267.1| beta-xylosidase A [Aspergillus oryzae]
          Length = 798

 Score =  409 bits (1051), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 259/743 (34%), Positives = 385/743 (51%), Gaps = 47/743 (6%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD      +RA  LV  +T  E V    +  +G PR+GLP Y+ W+EALHGV+ 
Sbjct: 57  LSKTLVCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVA- 115

Query: 69  IGRRTNSPPGTHFDSEVPG----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
                      H D    G    +TSFP  I T A+ N +L  +I   +ST+ RA  N G
Sbjct: 116 -----------HADFSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAG 164

Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSR 183
             GL  +SPNIN  R P WGR  ETPGED Y +   YA  Y+ G+Q   GV      D+ 
Sbjct: 165 RYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQG--GV------DAN 216

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
           PLK+ A  KHYA YD++NW+ + R   D ++T+QD+ E +   F +   +  V SVMCSY
Sbjct: 217 PLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSY 276

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
           N VNG+P+C++   L   +R  ++F   GY+  DC ++  +   H +  + +  A A  +
Sbjct: 277 NAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATN-ESSAAADSI 335

Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKN 360
           +AG D+DCG  Y      +    +++  D++  +  LY  L+R GYFDG +  Y+N+  +
Sbjct: 336 RAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVIRLYASLIRAGYFDGKTSPYRNITWS 395

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPL--NTGNIKTLALVGPHANATKAMIGNYEG 418
           ++ +     L+ EAA Q IVLLKND G LPL   + + KT+AL+GP ANAT  M+GNY G
Sbjct: 396 DVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQMLGNYYG 454

Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
                 SP+  F      I Y  G       +++    A+  AK AD  +   G+D ++E
Sbjct: 455 PAPYLISPLQAFQDSEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLE 514

Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
            E +DR ++  P  Q  LI K+AD  K P+ ++ M  G VD +  KNN  + +++W GYP
Sbjct: 515 TEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYP 573

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI-PYTSMPLRPVNNFPGRTYKFFDGPV 597
           G+ GG+A+AD+I GK  P  RL  T Y A Y ++ P   M LRP  + PG+TY ++ G  
Sbjct: 574 GQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGSNPGQTYMWYTGTP 633

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           VY FG+GL YT F    ++   +      K++   +I+  +G  +P     L++ +    
Sbjct: 634 VYEFGHGLFYTNFTASASAGSGT------KNRTSFNIDEVLG--RPHPGYKLVEQMPL-- 683

Query: 658 YKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
               F ++V+N G        M + +   G A    K ++G++R+      SAK      
Sbjct: 684 --LNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEPGSAKTMVIPV 741

Query: 717 ACKSLKIVDNAANSLLASGAHTI 739
              SL   D   N +L  G + +
Sbjct: 742 TVDSLARTDEEGNRVLYPGRYEV 764


>gi|2723496|dbj|BAA24107.1| beta-1,4-xylosidase [Aspergillus oryzae]
          Length = 798

 Score =  409 bits (1051), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 259/743 (34%), Positives = 385/743 (51%), Gaps = 47/743 (6%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD      +RA  LV  +T  E V    +  +G PR+GLP Y+ W+EALHGV+ 
Sbjct: 57  LSKTLVCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVA- 115

Query: 69  IGRRTNSPPGTHFDSEVPG----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
                      H D    G    +TSFP  I T A+ N +L  +I   +ST+ RA  N G
Sbjct: 116 -----------HADFSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAG 164

Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSR 183
             GL  +SPNIN  R P WGR  ETPGED Y +   YA  Y+ G+Q   GV      D+ 
Sbjct: 165 RYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQG--GV------DAN 216

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
           PLK+ A  KHYA YD++NW+ + R   D ++T+QD+ E +   F +   +  V SVMCSY
Sbjct: 217 PLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSY 276

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
           N VNG+P+C++   L   +R  ++F   GY+  DC ++  +   H +  + +  A A  +
Sbjct: 277 NAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATN-ESSAAADSI 335

Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKN 360
           +AG D+DCG  Y      +    +++  D++  +  LY  L+R GYFDG +  Y+N+  +
Sbjct: 336 RAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVIRLYASLIRAGYFDGKTSPYRNITWS 395

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPL--NTGNIKTLALVGPHANATKAMIGNYEG 418
           ++ +     L+ EAA Q IVLLKND G LPL   + + KT+AL+GP ANAT  M+GNY G
Sbjct: 396 DVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQMLGNYYG 454

Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
                 SP+  F      I Y  G       +++    A+  AK AD  +   G+D ++E
Sbjct: 455 PAPYLISPLQAFQDSEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLE 514

Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
            E +DR ++  P  Q  LI K+AD  K P+ ++ M  G VD +  KNN  + +++W GYP
Sbjct: 515 TEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYP 573

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI-PYTSMPLRPVNNFPGRTYKFFDGPV 597
           G+ GG+A+AD+I GK  P  RL  T Y A Y ++ P   M LRP  + PG+TY ++ G  
Sbjct: 574 GQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGSNPGQTYMWYTGTP 633

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           VY FG+GL YT F    ++   +      K++   +I+  +G  +P     L++ +    
Sbjct: 634 VYEFGHGLFYTNFTASASAGSGT------KNRTSFNIDEVLG--RPHPGYKLVEQMPL-- 683

Query: 658 YKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
               F ++V+N G        M + +   G A    K ++G++R+      SAK      
Sbjct: 684 --LNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEPGSAKTMVIPV 741

Query: 717 ACKSLKIVDNAANSLLASGAHTI 739
              SL   D   N +L  G + +
Sbjct: 742 TVDSLARTDEEGNRVLYPGRYEV 764


>gi|4235093|gb|AAD13106.1| beta-xylosidase [Aspergillus niger]
          Length = 804

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 260/702 (37%), Positives = 378/702 (53%), Gaps = 47/702 (6%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD      +RA  L+   TL E +   G+   GV RLGLP Y+ WSEALHG+    R   
Sbjct: 69  CDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD---RANF 125

Query: 75  SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
           S  G++       ATSFP  ILTTA+ N +L  +I   +ST+ RA  N G  GL  ++PN
Sbjct: 126 SDSGSY-----NWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPN 180

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           IN  R P WGR  ETPGED  +   YA  Y+ G+Q         D DS  LK++A  KHY
Sbjct: 181 INTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQG-------PDPDSN-LKLAATAKHY 232

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A YD++NW  + R   D  +T+QD+ E +   F +   +  V SVMC+YN V+G+P CAD
Sbjct: 233 AGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVDGVPACAD 292

Query: 255 PKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
              L   +R  + F  HGY+ SDCD+   I   H + + ++  A A  + AG D+DCG  
Sbjct: 293 SYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDCGTT 351

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-----YKNLGKNNICNPQH 367
           Y      ++  G ++  DI+  +  LY  L++ GYFD +       Y++L  +++     
Sbjct: 352 YQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDA 411

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTG----NIKTLALVGPHANATKAMIGNYEGTPCRY 423
             ++ +AA QGIVLLKN N  LPL       +  T+AL+GP ANAT  ++GNY G     
Sbjct: 412 WNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYM 471

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
            SP   F      +N+A G   I   + S   AA+ AA++AD  +   G+D ++EAE  D
Sbjct: 472 ISPRAAFEEAGYKVNFAEGTG-ISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAEALD 530

Query: 484 RVDLLLPGFQTELINKVADAA-KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
           R  +  PG Q +LI K+A +A   P+ ++ M  G VD +  KNN  + ++LW GYPG+ G
Sbjct: 531 RESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPGQSG 590

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
           G A+ D+I GK NP GRL  T Y A+Y  + P T M LRP  + PG+TYK++ G  VY F
Sbjct: 591 GFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDNPGQTYKWYTGEAVYEF 650

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           G+GL YT F  + +S+  + ++KL+     +DI   +       A++    V        
Sbjct: 651 GHGLFYTTFA-ESSSNTTTKEVKLN----IQDI---LSQTHEELASITQLPV------LN 696

Query: 662 FQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERV 701
           F   ++N GK++     MV++     G A   +K ++G++R+
Sbjct: 697 FTANIKNTGKLESDYTAMVFANTSDAGPAPYPVKWLVGWDRL 738


>gi|336261464|ref|XP_003345521.1| hypothetical protein SMAC_07509 [Sordaria macrospora k-hell]
 gi|380088197|emb|CCC13872.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 762

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 275/756 (36%), Positives = 399/756 (52%), Gaps = 85/756 (11%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+    CDA L  P+RA  LV  MT  EK+Q +   + G PR+GLP Y WWSEALHGV++
Sbjct: 43  LASLKVCDATLSPPQRAAALVAAMTTEEKLQNLVSKSKGAPRIGLPAYNWWSEALHGVAY 102

Query: 69  IGRRTNSPPGTHFDS---EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
                   PGT F S       +TSFP  +L  A+F++ L +++G+ +  E RA  N G 
Sbjct: 103 A-------PGTQFRSGNGTFNSSTSFPMPLLMAATFDDELIERVGEVIGIEGRAFGNAGF 155

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
           +G  +W+PN+N  +DPRWGR  ETPGED   + RYA + +RGL   EG    R+      
Sbjct: 156 SGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIRGL---EGPVRERER----- 207

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           +I A CKHYAA D ++W G+ R  F+++VT QD+ E ++ PF+ C  +  V S+MCSYN 
Sbjct: 208 RIVATCKHYAANDFEDWNGSTRHDFNAKVTLQDLAEYYLSPFQQCARDSKVGSIMCSYNA 267

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           VNG+P CA+  L+   +R  WN+     YI SDC+++  I  +H +   T  +  A   +
Sbjct: 268 VNGVPACANTYLMQTILRDHWNWTAPGNYITSDCEAVLDISANHHYAK-TNAEGTALAFE 326

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNN 361
           AG+D  C    ++  +GA  QG + ++ +D +LR LY  L+++GYFDG+  +Y +LG N+
Sbjct: 327 AGIDSSCEYEGSSDILGAWTQGLLKQSTVDRALRRLYEGLVQVGYFDGNRSEYASLGWNH 386

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPL---NTGNIKTLALVGPHANATKAMIGNYEG 418
           +  P+  E+A +AA +GIVLLKND   LPL     G    LA++G  AN  K + G Y G
Sbjct: 387 VNRPKSQEVALQAAVEGIVLLKNDK-TLPLGVKKNGPKLKLAMIGFWANDPKTLSGGYSG 445

Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQN----NSMIPAAIDAAKNADATVIVAGLD 474
           TP    SP+    A    +  A G    V QN    ++   AA+ AAK+A+  +   G D
Sbjct: 446 TPAFEHSPVYATQAMGFKVTTAGGP---VLQNSTSKDTWTQAALAAAKDANYILYFGGQD 502

Query: 475 LSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
            S   E KDR  +  P  Q +LI  ++   K P+ +V M    +D      +  I SILW
Sbjct: 503 TSAAGETKDRTTINWPEAQLQLITDLSKLGK-PLVVVQM-GDQLDNTPLLASKAINSILW 560

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFF 593
             +P                 P GRLP+T Y ANY   +P T M LRP +  PGRTY+++
Sbjct: 561 ANWP----------------VPAGRLPVTQYHANYTAAVPMTDMTLRPSDKLPGRTYRWY 604

Query: 594 DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
             P V PFG+GL YT FK K+   P+   IK D   +C +  Y      PP         
Sbjct: 605 PTP-VQPFGFGLHYTTFKTKIVRLPR-FAIK-DLLSRCGNA-YPDTCGLPP--------- 651

Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH---IKQVIGYERVF-IAAGQ-- 707
                    ++EV N GK     VV+ + K  G  G     IK ++ Y R+  ++ G+  
Sbjct: 652 --------LKVEVTNTGKRSSDYVVLAFLK--GDVGPKPYPIKTLVSYTRLRDLSPGRKT 701

Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
           +A + +T+     +   D   N++L  G +T++V E
Sbjct: 702 TAHLDWTLG---DIARYDEQGNTVLYPGTYTVIVDE 734


>gi|115397385|ref|XP_001214284.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
 gi|114192475|gb|EAU34175.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
          Length = 776

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 249/614 (40%), Positives = 349/614 (56%), Gaps = 26/614 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD      +RA  LV   TL E V   G+   GVPRLGLP Y+ WSE+LHGV  
Sbjct: 75  LSKTLVCDKSARPHDRAAALVSMFTLEELVNNTGNTGTGVPRLGLPKYQVWSESLHGV-- 132

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
              R N       + +   ATSFP  ILT A+ N +L  +IG  +ST+ARA  N+G  GL
Sbjct: 133 --YRANWAS----EGDYSWATSFPQPILTMAALNRTLIHQIGDILSTQARAFSNVGRYGL 186

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPY-VVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
             ++PNIN  R P WGR  ETPGED Y +   YA  Y+ G+Q   GV      D   LK+
Sbjct: 187 DTYAPNINSFRHPVWGRGQETPGEDAYYLASTYAYEYITGIQG--GV------DPETLKL 238

Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
            A  KHYA YD++NW+G+ R   D ++T+QD+ E +   F +   +  V SVMCSYN VN
Sbjct: 239 VATAKHYAGYDIENWDGHSRLGNDMQITQQDLSEYYTPQFLVSARDAKVHSVMCSYNAVN 298

Query: 248 GIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           G+P+C++   L   +R  + F   GY+  DC ++      H++  + +  A A  ++AG 
Sbjct: 299 GVPSCSNSFFLQTLLRETFGFVEDGYVSGDCGAVYNAFNPHEYAAN-ESSASADSIRAGT 357

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNICN 364
           D+DCG  Y      A  +G+I+  DI+  +  LY  L+RLGYFDG S QY++L  +++  
Sbjct: 358 DIDCGTSYQYHFTNAFDEGEISRQDIERGVIRLYTNLVRLGYFDGNSSQYRDLTWSDVQT 417

Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
                ++ EAA +G VLLKND G LPL   +I+++AL+GP ANAT  M GNY G     T
Sbjct: 418 TDAWNISHEAAVEGTVLLKND-GTLPL-ADSIRSVALIGPWANATTQMQGNYYGPAPYLT 475

Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           SP+    A    ++YA G  +I     +    A+ AA+ ADA +   G+D ++E E  DR
Sbjct: 476 SPLAALEASDLDVHYAFGT-NISSTTTAGFADALAAARKADAIIFAGGIDNTIEGEALDR 534

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
           +++  PG Q +LIN+++   K P+ ++ M  G VD +  K+N  + ++LW GYPG+ GG 
Sbjct: 535 MNITWPGNQLDLINQLSALGK-PLVVLQMGGGQVDSSALKHNTNVSALLWGGYPGQSGGT 593

Query: 545 AIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
           A+ D+I G   P GRL  T Y A Y  + P   M LRP    PG+TY ++ G  VY FG+
Sbjct: 594 ALLDIIRGVRAPAGRLVTTQYPAGYATQFPAIDMGLRPNGTNPGQTYMWYTGTPVYEFGH 653

Query: 604 GLSYTQFKYKVASS 617
           GL YT F+ K AS+
Sbjct: 654 GLFYTTFEAKRAST 667


>gi|291518645|emb|CBK73866.1| Beta-glucosidase-related glycosidases [Butyrivibrio fibrisolvens
           16/4]
          Length = 713

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 257/751 (34%), Positives = 390/751 (51%), Gaps = 107/751 (14%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RAK+LV +MT+ EK  QM   A  + RLG+P Y WW+EALHGV+  G            
Sbjct: 7   KRAKELVSQMTIEEKCSQMLHHAEAIDRLGIPKYCWWNEALHGVARAG------------ 54

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
                AT FP  I   A+F+E L +K+    STE RA YN            GLT+W+PN
Sbjct: 55  ----DATVFPQAIGLGATFDEELVEKVADVTSTEGRAKYNEFTKHGDRDIYKGLTYWAPN 110

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           +N+ RDPRWGR  ET GEDPY+ G+  + YVRGLQ         D    P K +AC KH+
Sbjct: 111 VNIFRDPRWGRGHETYGEDPYLTGQLGMAYVRGLQG--------DDLDNP-KSAACAKHF 161

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A +     E   R HFD++V +QD+ +T++  F+  V +  V +VM +YNRVNG P C  
Sbjct: 162 AVHSGPEAE---RHHFDAKVNDQDLYDTYLYAFKRLVKDAKVEAVMGAYNRVNGEPACGS 218

Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
            +LL   +RGDW F G++VSDC +I+   E+HK      E A A  +  G DL+CG  Y 
Sbjct: 219 KRLLKDILRGDWGFEGHVVSDCWAIRDFHENHKVTGCEVESA-ALAVNNGCDLNCGCVYE 277

Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF-DGSPQYKNLGKNNICNPQHIELAAE 373
              + A +   + E  I  S+  L  + +RLG   +   +Y ++    +   +H ELA E
Sbjct: 278 KL-LYAYKANLVTEETITESVERLIELRLRLGTLPERRSKYDDIPYEVVECKEHKELAIE 336

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
           AA++ +VLLKND G LPL    IKT+ ++GP++N+  A++GNYEG    Y + ++G   Y
Sbjct: 337 AAKRSMVLLKND-GLLPLKKDEIKTIGVIGPNSNSRMALVGNYEGISSEYITVLEGIQQY 395

Query: 434 ---SKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE---- 480
                 + ++ G         ++ +       A+  A+++D  V+  GLD ++E E    
Sbjct: 396 VGDDVRVFHSDGTPLWKDRMHVLSEARDTFAEAMAVAEHSDVVVLAMGLDSTIEGEEGDA 455

Query: 481 -----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
                  D+  L LPG Q EL+ K+    K PV L++++  A+D+++A  N  + +I+  
Sbjct: 456 GNEFGSGDKKGLKLPGLQQELLEKITAIGK-PVVLLVLAGSAMDLSWANEN--VNAIMHC 512

Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
            YPG  GG+AIA V+FG+ +P G+LP+T+Y+++    P+    +       GRTY++F G
Sbjct: 513 WYPGARGGKAIAQVLFGEDSPSGKLPLTFYKSDADLPPFEDYSME------GRTYRYFKG 566

Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
             +YPFGYGLSY+  +Y  A                  I+ T G          I D   
Sbjct: 567 TPLYPFGYGLSYSDIQYSNAG-----------------IDKTEGA---------IGD--- 597

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSK----PPGIAGTHIKQVIGYERVFIAAGQSAKV 711
              KFT ++ V+N G     E V VY K       +A   ++++    +V +  G+S +V
Sbjct: 598 ---KFTVKVTVKNAGDYKAHETVQVYVKDVEASTRVANCSLRKIA---KVELLPGESKEV 651

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
              ++A +   I+D   + ++  G   + VG
Sbjct: 652 SLELSA-RDFAIIDEKGHCIVEPGKFKVFVG 681


>gi|288870210|ref|ZP_06113312.2| beta-glucosidase [Clostridium hathewayi DSM 13479]
 gi|288868024|gb|EFD00323.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
          Length = 730

 Score =  407 bits (1047), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 226/617 (36%), Positives = 347/617 (56%), Gaps = 68/617 (11%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           E+A+ LV++MTL EKV Q  + A  + RLG+  Y WW+E LHGV+  G            
Sbjct: 23  EKAEYLVKQMTLEEKVFQTMNQAPAIERLGIKAYNWWNEGLHGVARAGV----------- 71

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN--------AGLTFWSPN 134
                AT FP  I   A+F+E L + +G+ VSTEARA Y++           GLT W+PN
Sbjct: 72  -----ATIFPQAIGLAATFDEDLIETVGEAVSTEARAKYHMQQRYGDTDIYKGLTLWAPN 126

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           IN+ RDPRWGR  ET GEDP++  R  I Y+RGLQ          S  + LK +AC KH+
Sbjct: 127 INIFRDPRWGRGHETYGEDPWLTSRLGIRYIRGLQG---------SHEKYLKTAACVKHF 177

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A +         R  FD+ V+E+D++ET++  FE CV +GDV +VM +YNRVNG+P C +
Sbjct: 178 AVHSGPE---ELRHSFDAEVSEKDLRETYLPAFEACVKDGDVEAVMGAYNRVNGVPCCGN 234

Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
             LL   +R +W FHG++VSDC +I+   E H  + D+  ++V+  +  G DL+CG+ +T
Sbjct: 235 EYLLETILRKEWGFHGHVVSDCWAIKDFHEGHG-VTDSPVESVSMAMNHGCDLNCGNLFT 293

Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIELA 371
            + + AV++GK+ E  +D ++  L+   ++LG      +   Y  +    + +P   +L 
Sbjct: 294 -YLIQAVKEGKVKEERLDEAVIRLFTTRLKLGALGKMEEDDPYAGISYLEVDSPAMKKLN 352

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
             AA + +VLLKN  G LP++T   KT+ ++GP+A++ +A++GNYEGT   Y + ++G  
Sbjct: 353 RSAAGKSVVLLKNTEGLLPIDTKRYKTIGVIGPNADSRRALVGNYEGTASEYVTVLEGIR 412

Query: 432 AYSK---VINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE-- 480
             ++    + Y+ GC         +   N  +       + +D  +   GLD ++E E  
Sbjct: 413 EAAEPEARVLYSEGCHLYKSNVSGLGARNDRLSEVKGICRESDIVIACMGLDSTLEGEQG 472

Query: 481 -------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
                  G D+ DL+LPG Q +++    D+ K PV LV+++  A+ + +A  +  + +IL
Sbjct: 473 DTGNIYAGGDKPDLMLPGLQQKILETAYDSGK-PVVLVLLAGSAMAVTWADEH--LPAIL 529

Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
              YPG EGGR +ADV+FG  NP GRLP+T+Y        +T+  +       GRTY+F 
Sbjct: 530 TAWYPGAEGGRGVADVLFGTVNPEGRLPVTFYRTTEELPDFTNYSME------GRTYRFM 583

Query: 594 DGPVVYPFGYGLSYTQF 610
               +YPFG+GLSYT+F
Sbjct: 584 KQKALYPFGFGLSYTEF 600


>gi|336435507|ref|ZP_08615222.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
           1_4_56FAA]
 gi|336000960|gb|EGN31106.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
           1_4_56FAA]
          Length = 717

 Score =  407 bits (1047), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 264/756 (34%), Positives = 396/756 (52%), Gaps = 97/756 (12%)

Query: 16  DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
           D +    ++A++LV++MTL EK  Q+   A  +PRL +P Y WW+E+LHGV+  G     
Sbjct: 5   DVRKRARKQAEELVDQMTLMEKASQLRYDAPAIPRLHIPAYNWWNESLHGVARGGT---- 60

Query: 76  PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--------NAG 127
                       AT FP  I   ASF+  + ++IG+ ++ E RA YN            G
Sbjct: 61  ------------ATVFPQAIGLAASFDREMLEEIGEAIALEGRAKYNAAVKLDDRDIYKG 108

Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
           LTFW+PN+N+ RDPRWGR  ET GEDPY+  R  ++Y+RGLQ           D   +K 
Sbjct: 109 LTFWAPNVNIFRDPRWGRGHETYGEDPYLSSRLGVSYIRGLQ----------GDGETMKA 158

Query: 188 SACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           +AC KH+A +      G +  R  FD+ V+E+D++ET++  F+ CV EG V +VM +YN 
Sbjct: 159 AACAKHFAVHS-----GPEALRHEFDAEVSEKDLRETYLPAFQACVQEGHVEAVMGAYNC 213

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           VNG P C    LL + +R +W F G++VSDC +I+   E+H  +  T   + A  ++AG 
Sbjct: 214 VNGEPCCGSETLLKKILREEWGFDGHVVSDCWAIKDFHENH-LVTGTPVQSAALAMEAGC 272

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNP 365
           DL+CG  Y +  + A Q+G + EA I  +   L+     LG FDGS +Y ++    +   
Sbjct: 273 DLNCGVTYLHL-VHACQEGLVTEAQITEAAIRLFTTRFLLGMFDGS-EYDSVPYTVVECK 330

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H +L+  AAR+ IVLLKN NG LPL+   +KT+ ++GP+A++ KA+IGNY GT   Y +
Sbjct: 331 EHRDLSERAARESIVLLKN-NGILPLDREKLKTIGIIGPNADSRKALIGNYHGTSSEYIT 389

Query: 426 PMDG---FYAYSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
            ++G          I Y+ GC       + + +    +  A   A+ +D  ++  GLD +
Sbjct: 390 VLEGVRRLVGDEVRILYSDGCHLYENKTENLAREQDRLSEARIVARESDVVILCLGLDET 449

Query: 477 VEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
           +E E           D+VDL LP  Q  L+  VA   K P  L +M+   +D++FA+ + 
Sbjct: 450 LEGEEGDTGNSYASGDKVDLRLPKSQRMLMEAVA-MEKKPTVLCLMAGSDIDLSFAEKHF 508

Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG 587
                LW  YPG  GG A AD++FGK +P G+LPIT+YE+  V   +    +R      G
Sbjct: 509 DAIVDLW--YPGAYGGAAAADILFGKCSPSGKLPITFYESLEVLPSFEDYSMR------G 560

Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
           RTY++ +    YPFGYGL+YT+ K +      +     +KD +      T G N    AA
Sbjct: 561 RTYRYLEQKAQYPFGYGLTYTKMKIRNVWLENA-----EKDMK----EVTDGENAE--AA 609

Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAG 706
           V++    C         EVEN G MD  EV+ +Y +       T    + G+ER+F+  G
Sbjct: 610 VIV----CA--------EVENCGGMDSQEVLQIYIRDTESEHETPHPHLAGFERIFVEKG 657

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
               V   +N   +  +VD +      SG + I  G
Sbjct: 658 VKKLVKIPVNR-SAFTVVDESGRRFTDSGKYEIFAG 692


>gi|358393086|gb|EHK42487.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
           206040]
          Length = 794

 Score =  407 bits (1046), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 267/735 (36%), Positives = 389/735 (52%), Gaps = 45/735 (6%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD+   Y ERA+ L+   TL E +    +   GVPRLGLP Y+ W+EALHG+    R   
Sbjct: 64  CDSTAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLD---RANF 120

Query: 75  SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
           +  G  F+      TSFP  IL+ A+ N +L  +I   +ST+ARA  N G  GL  ++PN
Sbjct: 121 ATKGGEFE----WGTSFPMPILSMAALNRTLIHQIADIISTQARAFSNNGRYGLDVYAPN 176

Query: 135 INVVRDPRWGRVLETPGEDPYVV-GRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           IN  R P WGR  ETPGED  V+   Y   Y+ G+Q   GV      D   LKI+A  KH
Sbjct: 177 INGFRSPLWGRGQETPGEDANVLTSAYTYEYITGMQG--GV------DPENLKIAATAKH 228

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           +A YDL+N+    R  FD+ +T+QD+ E +   F          S MC+YN VNG+P+C+
Sbjct: 229 FAGYDLENYNNQSRLGFDAIITQQDLSEYYTPQFLAASRYAKSHSFMCAYNSVNGVPSCS 288

Query: 254 DPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
           +   L   +R  W F  +GY+ SDCD+I  +   H + N ++  A A  LKAG D+DCG 
Sbjct: 289 NSFFLQTLLRESWGFPEYGYVSSDCDAIYNVWNPHNYAN-SQSSAAADSLKAGTDIDCGQ 347

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELA 371
            Y      +   G ++  +I+ S+  LY  L+RLGYFD   +Y++LG  ++       ++
Sbjct: 348 TYPWHLNESFVAGTVSRGEIERSVTRLYANLVRLGYFDKKNEYRSLGWKDVVKTDAWNIS 407

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            EAA +GIVLLKND G LPL +  ++++AL+GP  NAT+ + GNY GT     SP+    
Sbjct: 408 YEAAVEGIVLLKND-GTLPL-SKKVRSIALIGPWVNATEQLQGNYFGTAPYLISPLQAAK 465

Query: 432 AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPG 491
                +NY  G   I  Q  +    AI AAK +DA + + G+D ++E EG DR D+  PG
Sbjct: 466 KAGYEVNYELGTG-INNQTTAGFAKAIAAAKKSDAIIFIGGIDNTIEQEGADRTDIAWPG 524

Query: 492 FQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIF 551
            Q +LI ++++  K P+ ++ M  G VD +  K+N K+ S++W GYPG+ GG A+ D++ 
Sbjct: 525 NQLDLIKQLSEVGK-PLVVLQMGGGQVDSSSIKSNKKVNSLVWGGYPGQSGGYALFDILS 583

Query: 552 GKYNPGGRLPITWYEANYV-KIPYTSMPLRP-VNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
           GK  P GRL  T Y A YV +     M LRP     PG+TY ++ G  VY FG GL YT 
Sbjct: 584 GKRAPAGRLVSTQYPAEYVHQFAQNDMNLRPDGKKNPGQTYIWYTGKPVYQFGDGLFYTT 643

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
           FK  +    K   +K +  Q        +G   P         V      FTF   ++N 
Sbjct: 644 FKETLG---KQSTLKFNASQ-------ILGAGHPGYTYSEQTPV------FTFTANIQNS 687

Query: 670 GKMDG--SEVVMVYSKPPGIAGTHIKQVIGYERV-FIAAGQSAKVGFTMNACKSLKIVDN 726
           GK     S +  V +   G      K ++G++R+  I  G S+ +   +    +L  VD+
Sbjct: 688 GKTASPYSAMAFVRTSNAGPKPYPNKWLVGFDRLATIKPGHSSTLSIPI-PLNALSRVDS 746

Query: 727 AANSLLASGAHTILV 741
             N ++  G + +++
Sbjct: 747 NGNKIVYPGKYELVL 761


>gi|410628680|ref|ZP_11339398.1| beta-glucosidase [Glaciecola mesophila KMM 241]
 gi|410151684|dbj|GAC26167.1| beta-glucosidase [Glaciecola mesophila KMM 241]
          Length = 732

 Score =  407 bits (1045), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 263/757 (34%), Positives = 396/757 (52%), Gaps = 99/757 (13%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           + + +L +  RA+ LV  MT+ EK+ Q+      +PRL +P Y WW+EALHG++  G+  
Sbjct: 30  WFNPELSFETRAQALVNAMTIDEKITQLSHSTPAIPRLEVPQYNWWNEALHGIARNGK-- 87

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY----NLGN---- 125
                         AT FP  I   A+F+  L +++   +S EARA Y    ++GN    
Sbjct: 88  --------------ATIFPQAIGLGATFDPELAQEVANAISDEARAKYAIAQSIGNQGQY 133

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
           AGLTFW+PN+N+ RDPRWGR  ET GEDP +  +    +V+GLQ           D + L
Sbjct: 134 AGLTFWTPNVNIFRDPRWGRGQETYGEDPLLTSQMGTAFVKGLQG---------DDPKYL 184

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K +   KH+A +       + R  FD   +++D+ ET++  FE  V +  V+ VMC+YN 
Sbjct: 185 KSAGVAKHFAVHSGPE---SLRHQFDVEPSKKDLYETYLPAFEALVTQAKVAGVMCAYNG 241

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           V G P+CA   LL + ++  W F+GY+VSDC ++      HK  ++  E A A  L+AG+
Sbjct: 242 VYGQPSCASEFLLGEMLKKKWQFNGYVVSDCGALHDFHSGHKVTHNRVESA-ALALRAGV 300

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNIC 363
           DL+CG  Y      A ++G I ++ ID  L+ L ++  RLG FD S    +  +G+  I 
Sbjct: 301 DLNCGFTYEKSLKAAFEEGLITQSLIDQRLKNLLMIRFRLGLFDPSELNPHNAIGQEVIH 360

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
           + +HIELA + A + IVLLKN+   LPL+  +IK   + GP A ++  ++GNY G     
Sbjct: 361 SLEHIELARKVAAKSIVLLKNEKQVLPLSK-DIKVPYVTGPFAASSDMLMGNYYGISDSL 419

Query: 424 TSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE-- 478
            + ++G     +    +NY  G        N +   A + AK ADA + V G+   +E  
Sbjct: 420 VTVLEGIAGKVSLGSSLNYRAGALPFHSNINPL-NWAPEVAKTADAVIAVVGISADMEGE 478

Query: 479 -------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKS 531
                  A+  DRV + LP  Q + + ++A+  KGP+ LV+ +   VDI  ++ +P   +
Sbjct: 479 EVDAIASADRGDRVAITLPQNQVDYVKQLAENKKGPLILVVAAGSPVDI--SELDPLADA 536

Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF--PGRT 589
           ILW+ YPGE+GG A+ADVIFG  NP G LP+T     +VK   T   L P +++   GRT
Sbjct: 537 ILWIWYPGEQGGNAVADVIFGDTNPSGHLPLT-----FVK---TIDDLPPFDDYTMTGRT 588

Query: 590 YKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
           YKF     +YPFG+GLSYTQFK+   S  K         Q+  +IN +V           
Sbjct: 589 YKFLKKLPLYPFGFGLSYTQFKFGKLSLSKRA------PQEGENINISV----------- 631

Query: 650 IDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQS 708
                          EVEN   +DG  VV VY  P   +    I  +  ++RV I A + 
Sbjct: 632 ---------------EVENSTALDGETVVQVYLSPQVPLKNEAITNLKAFKRVHIGAYEK 676

Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
             + FT+   K+L  V++A  ++  SGA+T+ VG+ +
Sbjct: 677 RLIEFTIEG-KNLYRVNDAGENVWPSGAYTLAVGDSL 712


>gi|367053033|ref|XP_003656895.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
 gi|347004160|gb|AEO70559.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
          Length = 758

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 273/755 (36%), Positives = 391/755 (51%), Gaps = 70/755 (9%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDL------AYGVPRLGLPLYEWWSEALHGVSF 68
           CD     P+RA  LVE M + EK+  + +       + G PRLGLP YEWWSEALHGV+ 
Sbjct: 11  CDTTASPPKRAAALVEAMNITEKLANLVEYVMARSSSKGAPRLGLPPYEWWSEALHGVA- 69

Query: 69  IGRRTNSPPGTHFD---SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
                 + PG  F+        ATSF   I  +A+F++ L +K+   +STEARA  N G+
Sbjct: 70  ------ASPGVSFNWSGGPFSYATSFANPITLSAAFDDELVQKVADVISTEARAFANAGS 123

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
           AGL FW+PNIN  RDPRWGR  ETPGEDP  +  Y  + +RGL+  E ++          
Sbjct: 124 AGLDFWTPNINPWRDPRWGRGSETPGEDPVRIKGYVRSLLRGLEGEESIK---------- 173

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K+ A CKHYAAYDL+ W    R+ FD+ V+ QD+ E ++ PF+ C  +  V S+MCSYN 
Sbjct: 174 KVIATCKHYAAYDLERWHNITRYEFDAIVSLQDLSEYYLPPFQQCARDSKVGSIMCSYNS 233

Query: 246 VNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIV-ESHKFLNDTKEDAVARVL 301
           +NG P CA+  L++  +R  W +   + YI SDC++I+  + + H F     E A A   
Sbjct: 234 LNGTPACANTYLMDDILRKHWRWTEDNNYITSDCNAIKDFLPDEHNFTQTAAEAAAAAYT 293

Query: 302 KAGL---DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD---GSPQYK 355
                  ++     YT+  +GA  Q  ++E  ID +LR LY  L+R GYFD    SP Y+
Sbjct: 294 AGTDTVCEVAGSPPYTD-VVGAYDQKLLSEEVIDRALRRLYEGLVRAGYFDPASASP-YR 351

Query: 356 NLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGN 415
           ++G +++   +   LA ++A  G+VLLKND G LP+     KT+AL+G  A+ T++M+G 
Sbjct: 352 DIGWSDVNTAEAQALALQSASDGLVLLKND-GTLPIKLEG-KTVALIGHWASGTRSMLGG 409

Query: 416 YEGTPCRYTSPMDGFYAYSKVINYAPG-CADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
           Y G P  Y SP+      +    YA G  A      ++    A+ AA  +D  +   GLD
Sbjct: 410 YSGIPPYYHSPVYAAGQLNLTYKYASGPVAPASAARDTWTADALSAANKSDVILYFGGLD 469

Query: 475 LSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAG-AVDINFAKNNPKIKSIL 533
            SV +E KDR  +  P  Q  LI  +A   K    LV++  G  VD      NP + +IL
Sbjct: 470 QSVASEDKDRDSIAWPPAQLTLIQTLAGLGK---PLVVIQLGDQVDDTPLLTNPNVSAIL 526

Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTY 590
           W GYPG+ GG A+ + I G   P GRLP+T Y ++Y  ++P T M LR  P +  PGRTY
Sbjct: 527 WAGYPGQSGGTAVLNAITGVSPPAGRLPVTQYPSSYTSQLPLTDMSLRPDPASGRPGRTY 586

Query: 591 KFF-DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
           ++      V PFGYGL YT F                +    ++   T      PC    
Sbjct: 587 RWLPRNATVLPFGYGLHYTNFT--------------ARPNPAQNFTLTPSALLAPCKLAH 632

Query: 650 IDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVF-IAAG 706
            D   C    +   +EV N G      V +V+  ++  G     +K ++ Y R+  IA G
Sbjct: 633 RD--LCP-LPYPVTVEVTNTGARTSDYVGLVFATTRDAGPPPHPLKTLVAYARLRGIAPG 689

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
           ++A+    + A   L  VD A N +L  G +  ++
Sbjct: 690 RTARAQVQV-ALGDLARVDAAGNRVLYPGRYGFVL 723


>gi|389632743|ref|XP_003714024.1| beta-xylosidase [Magnaporthe oryzae 70-15]
 gi|351646357|gb|EHA54217.1| beta-xylosidase [Magnaporthe oryzae 70-15]
          Length = 847

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 265/763 (34%), Positives = 397/763 (52%), Gaps = 75/763 (9%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI-GRRT 73
           CD      ERA  LV+ M L EK++ + + + G PR+GLP YEWWSEALHGV+   G   
Sbjct: 96  CDQAATPAERAAGLVDIMELDEKLENLVNKSPGAPRIGLPAYEWWSEALHGVAKSPGVTF 155

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
           N   G  F S    ATSF   I+ +A+F++ L + +   +STEARA  N G AGL +W+P
Sbjct: 156 NKSSGAAFSS----ATSFSNPIVLSAAFDDELVEAVATQISTEARAFSNAGLAGLDWWTP 211

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN  +DPRWGR +ETPGED   + +Y    +RGL+          SD    K+ A CKH
Sbjct: 212 NINPYKDPRWGRGMETPGEDALRISKYVKALLRGLE---------GSDPTTRKMVANCKH 262

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV------- 246
           YAA DL+ W G  R++FD+ VT QD+ E ++  F+ C  + +V S MC+YN +       
Sbjct: 263 YAANDLERWNGVTRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGSFMCAYNAMSIKGKDL 322

Query: 247 --NGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
             NG P CA   L+N  +R  W +   + +I SDC+++  +   H + +DT+E+A     
Sbjct: 323 SWNGTPVCASKYLMNDILREHWGWKEHNNWITSDCNAVLHMWNQHHW-SDTREEAAGSAY 381

Query: 302 KAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLG 358
            AG D  C   +Y      GA  +G + E  +D +L+ LY  L+R GYFDG    Y+N+ 
Sbjct: 382 TAGTDTVCEVSNYDKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRAGYFDGPDAPYRNIT 441

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNI----KTLALVGPHANATKAMIG 414
             ++  P+  +LA  +A +G+VL KN NG LP+    +    KT+AL+G   +  + M+G
Sbjct: 442 WADVNTPEARKLAHRSAVEGMVLTKN-NGVLPIKLEELQKKGKTVALIGNWVDNGEQMLG 500

Query: 415 NYEGTPCRYTSPMDGFYAYS-KVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL 473
            Y G      +P+    A + K++             +S    A++AA  AD  +   G+
Sbjct: 501 TYSGIAPFRNTPLAAAKALNLKMVTAGGPVNQSTGSRDSWTRPALNAAIQADVVLYFGGI 560

Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
           DLSVEAE +DR  L  P  Q +L++ +  +A G  T+V+     +D     +N  I +I+
Sbjct: 561 DLSVEAEDRDRYSLAWPSAQAKLLSDI--SALGKPTVVVQLGTMLDDTALLDNKNISAII 618

Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNF------- 585
           W GYPG++GG A  D+I GK  P GRLP+T Y A Y  ++P T M +RP  +        
Sbjct: 619 WAGYPGQDGGTAAFDIITGKTAPSGRLPVTQYPAKYANQVPMTDMEVRPSKDTKGGAASN 678

Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
           PGRTY+++D   V+PFG+GL +T F   VA S  S     D +  C+   +         
Sbjct: 679 PGRTYRWYD-EAVHPFGFGLHFTNFTTSVAVSSSSAISTSDLESGCKSEKH--------- 728

Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH------IKQVIGYE 699
               ID  KC  +  + ++ V N    DG      Y+    + G +      +K ++ Y 
Sbjct: 729 ----ID--KCS-FPSSLEVSVTN----DGKSTTSSYAALAFVRGEYGPKPYPLKTLVAYG 777

Query: 700 RVF-IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
           ++  IA GQ+ KV   +      +  +N  + +L  G + +LV
Sbjct: 778 KLHDIAPGQTKKVKLELTLGDLARTAEN-GDLVLYPGKYEVLV 819


>gi|440472411|gb|ELQ41274.1| beta-xylosidase [Magnaporthe oryzae Y34]
 gi|440484691|gb|ELQ64724.1| beta-xylosidase [Magnaporthe oryzae P131]
          Length = 792

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 265/763 (34%), Positives = 397/763 (52%), Gaps = 75/763 (9%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI-GRRT 73
           CD      ERA  LV+ M L EK++ + + + G PR+GLP YEWWSEALHGV+   G   
Sbjct: 41  CDQAATPAERAAGLVDIMELDEKLENLVNKSPGAPRIGLPAYEWWSEALHGVAKSPGVTF 100

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
           N   G  F S    ATSF   I+ +A+F++ L + +   +STEARA  N G AGL +W+P
Sbjct: 101 NKSSGAAFSS----ATSFSNPIVLSAAFDDELVEAVATQISTEARAFSNAGLAGLDWWTP 156

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN  +DPRWGR +ETPGED   + +Y    +RGL+          SD    K+ A CKH
Sbjct: 157 NINPYKDPRWGRGMETPGEDALRISKYVKALLRGLE---------GSDPTTRKMVANCKH 207

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV------- 246
           YAA DL+ W G  R++FD+ VT QD+ E ++  F+ C  + +V S MC+YN +       
Sbjct: 208 YAANDLERWNGVTRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGSFMCAYNAMSIKGKDL 267

Query: 247 --NGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
             NG P CA   L+N  +R  W +   + +I SDC+++  +   H + +DT+E+A     
Sbjct: 268 SWNGTPVCASKYLMNDILREHWGWKEHNNWITSDCNAVLHMWNQHHW-SDTREEAAGSAY 326

Query: 302 KAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLG 358
            AG D  C   +Y      GA  +G + E  +D +L+ LY  L+R GYFDG    Y+N+ 
Sbjct: 327 TAGTDTVCEVSNYDKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRAGYFDGPDAPYRNIT 386

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNI----KTLALVGPHANATKAMIG 414
             ++  P+  +LA  +A +G+VL KN NG LP+    +    KT+AL+G   +  + M+G
Sbjct: 387 WADVNTPEARKLAHRSAVEGMVLTKN-NGVLPIKLEELQKKGKTVALIGNWVDNGEQMLG 445

Query: 415 NYEGTPCRYTSPMDGFYAYS-KVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL 473
            Y G      +P+    A + K++             +S    A++AA  AD  +   G+
Sbjct: 446 TYSGIAPFRNTPLAAAKALNLKMVTAGGPVNQSTGSRDSWTRPALNAAIQADVVLYFGGI 505

Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
           DLSVEAE +DR  L  P  Q +L++ +  +A G  T+V+     +D     +N  I +I+
Sbjct: 506 DLSVEAEDRDRYSLAWPSAQAKLLSDI--SALGKPTVVVQLGTMLDDTALLDNKNISAII 563

Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNF------- 585
           W GYPG++GG A  D+I GK  P GRLP+T Y A Y  ++P T M +RP  +        
Sbjct: 564 WAGYPGQDGGTAAFDIITGKTAPSGRLPVTQYPAKYANQVPMTDMEVRPSKDTKGGAASN 623

Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
           PGRTY+++D   V+PFG+GL +T F   VA S  S     D +  C+   +         
Sbjct: 624 PGRTYRWYD-EAVHPFGFGLHFTNFTTSVAVSSSSAISTSDLESGCKSEKH--------- 673

Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH------IKQVIGYE 699
               ID  KC  +  + ++ V N    DG      Y+    + G +      +K ++ Y 
Sbjct: 674 ----ID--KCS-FPSSLEVSVTN----DGKSTTSSYAALAFVRGEYGPKPYPLKTLVAYG 722

Query: 700 RVF-IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
           ++  IA GQ+ KV   +      +  +N  + +L  G + +LV
Sbjct: 723 KLHDIAPGQTKKVKLELTLGDLARTAEN-GDLVLYPGKYEVLV 764


>gi|261368518|ref|ZP_05981401.1| beta-glucosidase [Subdoligranulum variabile DSM 15176]
 gi|282569400|gb|EFB74935.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Subdoligranulum variabile DSM 15176]
          Length = 717

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 231/639 (36%), Positives = 343/639 (53%), Gaps = 73/639 (11%)

Query: 21  YPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTH 80
           Y ERA+ LV +MTL EK+ QM   A  +PRLG+P Y WW+E +HGV   G          
Sbjct: 11  YRERARALVAQMTLKEKISQMLSWAPAIPRLGIPAYNWWNEGIHGVGRAGT--------- 61

Query: 81  FDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWS 132
                  AT FP  I   ASF+E L  ++G+ V  EAR  YN+  +        GLT W+
Sbjct: 62  -------ATVFPQAIGLAASFDEDLLGQVGEAVGVEARGKYNMYRSYQDRDIYKGLTIWA 114

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PN+N+ RDPRWGR  ET GEDPY+  R  + +V G+Q           D   L+ +AC K
Sbjct: 115 PNVNIFRDPRWGRGHETYGEDPYLTSRLGVRFVEGMQG---------DDPDYLRAAACAK 165

Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           H+A +       + R +FD++V++QD+ ET++  F   V E  V +VM +YNR NG P C
Sbjct: 166 HFAVHSGPE---DQRHYFDAKVSQQDLWETYLPAFRALVKEAGVEAVMGAYNRTNGEPCC 222

Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
               LL   +RG WNF G++ SDC +I+   E H  +     D+VA  +  G DL+CGD 
Sbjct: 223 GSKTLLVDILRGKWNFQGHVTSDCWAIKDFHEGH-MVTSGPVDSVALAVNNGCDLNCGDL 281

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIEL 370
           Y  +   AV +GK+ E  ID SL  L+   M+LG FD   +  Y  +G + + + +   L
Sbjct: 282 YA-YLEEAVAEGKVKEETIDRSLVRLFTTRMKLGMFDAEEKVPYNKIGYDAVDSREMQAL 340

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
             E A + +VLLKN+N  LPL+   +  +A+VGP+A+  KA++GNYEGT  RY + +DG 
Sbjct: 341 NLEVAEKILVLLKNENHTLPLDKSKLHRVAVVGPNADNRKALVGNYEGTASRYVTVLDGI 400

Query: 431 YAY---SKVINYAPGC---ADIV---CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE- 480
             Y      + Y+ GC   AD +    ++N +I          D  +   GLD  +E E 
Sbjct: 401 QEYLGEDVQVRYSEGCHLYADKIQGLAKSNELISEVRGVCAECDVVICCLGLDAGLEGEE 460

Query: 481 --------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
                     D+  L LPG Q  ++    ++ K PV +V++S  A+ +  A+      ++
Sbjct: 461 GDQGNQFASGDKQSLSLPGNQESVLKACIESGK-PVVVVVLSGSALALGTAQEGA--AAV 517

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
           L   YPG +GGRA+A  +FG+ NP G+LP+T+Y ++     +T   ++      GRTY++
Sbjct: 518 LQAWYPGAQGGRAVARALFGECNPQGKLPVTFYHSDEDLPAFTDYAMK------GRTYRY 571

Query: 593 FDGPVVYPFGYGLSYTQFKYKVASS------PKSVDIKL 625
            +   +YPFGYGLSY+ F ++ A +      P  VD+++
Sbjct: 572 MEKEPLYPFGYGLSYSHFTFRDAKADAAQIGPDGVDVRV 610


>gi|330947691|ref|XP_003306937.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
 gi|311315273|gb|EFQ84970.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
          Length = 756

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 260/724 (35%), Positives = 380/724 (52%), Gaps = 58/724 (8%)

Query: 32  MTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSF 91
           M   EK++ +   + GV RLGLP Y WW EALHGV+         PG +F      ATSF
Sbjct: 52  MQTQEKLENLVSKSKGVARLGLPAYNWWGEALHGVA-------GAPGINFTGSYRTATSF 104

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPG 151
           P  +L +A+F++ L  +I   +  EARA  N G A + FW+P+IN  RDPRWGR  ETPG
Sbjct: 105 PMPLLMSAAFDDDLIHQIAIVIGNEARAFGNGGIAPVDFWTPDINPFRDPRWGRGSETPG 164

Query: 152 EDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFD 211
           ED   +  Y  + + GL   EG +  R       KI A CKHY  YD++NW G DR HFD
Sbjct: 165 EDILRIKGYTKSLLSGL---EGDKAQR-------KIIATCKHYVGYDVENWNGTDRHHFD 214

Query: 212 SRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNF--- 268
           +++T QD+ E F+ PF+ C  +  V S MCSYN VNG+PTCAD  +L   +R  WN+   
Sbjct: 215 AKITTQDLAEYFMPPFQQCARDSKVGSFMCSYNAVNGVPTCADTYVLEDILRKHWNWTDS 274

Query: 269 HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAE 328
           + YI SDC++++ I   HK++  T ++A A     G+DL C    T+   GA  QG +  
Sbjct: 275 NNYITSDCEAVKDISLRHKYVA-TLQEATAIAFNNGMDLSCEYSGTSDIPGAFSQGLLNV 333

Query: 329 ADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNG 387
           + ID +L   Y  L+  GYFDG +  Y +LG  +I  P+  +L  + A +G+ LLKND+ 
Sbjct: 334 SVIDRALTRQYEGLVHAGYFDGAAATYAHLGVQDINTPEAQKLVLQVAAEGLTLLKNDD- 392

Query: 388 ALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV-INYAPGCADI 446
            LPL+  +   +A+VG  AN T  + G Y G      +P+   YA +K+ ++ A     I
Sbjct: 393 TLPLSLKSGSKVAMVGFWANTTSKLSGIYSGPAPYLHTPV---YAGNKLGLDMAVATGPI 449

Query: 447 VCQN---NSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADA 503
           +  +   ++    A++AAK +D  +   GLD S  AEG DR D+  P  Q +LI K+  A
Sbjct: 450 LQTSGAADNWTTTALNAAKKSDFILYFGGLDPSAAAEGSDRTDISWPSAQIDLITKL--A 507

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
           A G   +VI     VD         + S++W  +PG++GG A+  VI G++   GRLPIT
Sbjct: 508 ALGKPLVVIALGDMVDHTPILKMKGVNSLIWANWPGQDGGTAVMQVITGEHAIAGRLPIT 567

Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPK-SVD 622
            Y A Y ++    M +RP  N PGRTY++++   V PFG+GL YT+F  K  SS   +V+
Sbjct: 568 QYPAEYTQLSMLDMNMRPGGNNPGRTYRWYN-ESVQPFGFGLHYTKFAAKFGSSSGLTVN 626

Query: 623 IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYS 682
           I+        DI  +   + P    V              ++ V N G      + + + 
Sbjct: 627 IQ--------DIMKSCTKDHPDLCDVP-----------PIEVAVTNEGNRTSDFIALAFI 667

Query: 683 KPPGIAGTH---IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
           K  G  G     +K ++ Y R+   +G   K+        +L  VD + N +   G +T+
Sbjct: 668 K--GEVGPKPYPLKTLVSYARLRDISGSQTKMASLALTLGALSRVDQSGNLVAYPGEYTL 725

Query: 740 LVGE 743
           L+ E
Sbjct: 726 LLDE 729


>gi|358365439|dbj|GAA82061.1| beta-xylosidase [Aspergillus kawachii IFO 4308]
          Length = 788

 Score =  405 bits (1040), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 257/689 (37%), Positives = 373/689 (54%), Gaps = 47/689 (6%)

Query: 28  LVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPG 87
           L+   TL E +   G+   GV RLGLP Y+ WSEALHG+    R   S  G++       
Sbjct: 66  LISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD---RANFSDSGSY-----NW 117

Query: 88  ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVL 147
           ATSFP  ILTTA+ N +L  +I   +ST+ RA  N G  GL  ++PNIN  R P WGR  
Sbjct: 118 ATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPNINTFRHPVWGRGQ 177

Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
           ETPGED  +   YA  Y+ G+Q         D DS  LK++A  KHYA YD++NW  + R
Sbjct: 178 ETPGEDVSLAAVYAYEYITGIQG-------PDPDSN-LKLAATAKHYAGYDIENWHNHSR 229

Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
              D  +T+QD+ E +   F +   +  V SVMC+YN VNG+P CAD   L   +R  + 
Sbjct: 230 LGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACADSYFLQTLLRDTFG 289

Query: 268 F--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGK 325
           F  HGY+ SDCD+   I   H + + ++  A A  + AG D+DCG  Y      ++  G 
Sbjct: 290 FVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDCGTTYQWHLNESITAGD 348

Query: 326 IAEADIDTSLRFLYIVLMRLGYFDGSPQ-----YKNLGKNNICNPQHIELAAEAARQGIV 380
           ++  DI+  +  LY  L++ GYFD +       Y++L  +++       ++ +AA QGIV
Sbjct: 349 LSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIV 408

Query: 381 LLKNDNGALPLNTG----NIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV 436
           LLKN N  LPL       +  T+AL+GP ANAT  ++GNY G      SP   F      
Sbjct: 409 LLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYK 468

Query: 437 INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTEL 496
           +N+A G   I   + S   AA+ AA++AD  +   G+D ++EAE  DR  +  PG Q +L
Sbjct: 469 VNFAEGTG-ISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDL 527

Query: 497 INKVADAA-KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYN 555
           I K+A +A   P+ ++ M  G VD +  KNN  + ++LW GYPG+ GG A+ D+I GK N
Sbjct: 528 IQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPGQSGGFALRDIITGKKN 587

Query: 556 PGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKV 614
           P GRL  T Y A+Y  + P T M LRP  + PG+TYK++ G  VY FG+GL YT F  + 
Sbjct: 588 PAGRLVTTQYPASYAEEFPATDMNLRPEGDNPGQTYKWYTGEAVYEFGHGLFYTTFA-ES 646

Query: 615 ASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDG 674
           +S+  + ++KL+     +DI   +       A++    V        F   ++N GK++ 
Sbjct: 647 SSNTTTKEVKLN----IQDI---LSQTHEELASITQLPV------LNFTANIKNTGKLES 693

Query: 675 SEVVMVYSKPP--GIAGTHIKQVIGYERV 701
               MV++     G A   +K ++G++R+
Sbjct: 694 DYTAMVFANTSDAGPAPYPVKWLVGWDRL 722


>gi|225878709|dbj|BAH30674.1| beta-xylosidase [Aspergillus aculeatus]
          Length = 785

 Score =  405 bits (1040), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 251/693 (36%), Positives = 376/693 (54%), Gaps = 42/693 (6%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD      +RA  L   MTL E +   G+    +PRLGLP Y+ W+EALHG+ ++   T 
Sbjct: 62  CDRTASAHDRAAALTSMMTLEELMNSTGNRIPAIPRLGLPPYQIWNEALHGL-YLANFTE 120

Query: 75  SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
           S P +        +TSFP+ ILT A+ N +L  +I Q ++T+ RA  N G  GL  +SPN
Sbjct: 121 SGPFSW-------STSFPSPILTMATLNRTLIHQIAQIIATQGRAFNNAGRYGLNAFSPN 173

Query: 135 INVVRDPRWGRVLETPGEDPYVV-GRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           IN  R P WGR  ETPGED   +   YA  Y+ GLQ           ++   KI A  KH
Sbjct: 174 INAFRHPVWGRGQETPGEDANCLCSAYAYEYITGLQ----------GNATNPKIIATAKH 223

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           YA YD++NW    RF  D  +T+QD+ E F   F + V +  V SVM SYN VNG+P+ A
Sbjct: 224 YAGYDIENWRQRSRFGNDLNITQQDLAEYFTPQFVVAVRDAQVRSVMPSYNAVNGVPSSA 283

Query: 254 DPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
           +  LL   +R  W F   GY+ SDCD++  +   H +  +    A A  L+AG D+DCG 
Sbjct: 284 NTFLLQTLVRDSWGFIQDGYMASDCDAVYNVFNPHGYAANLSS-ASAMSLRAGTDIDCGI 342

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNICNPQHIEL 370
            Y      ++ QG+I+ ++I+ ++   Y  L+  GYFDG    Y++L  +++       +
Sbjct: 343 SYLTTLNESLTQGQISRSEIERAVTRFYSNLVSAGYFDGPDAPYRDLSWSDVVRTNRWNV 402

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A EAA  G+VLLKND G LPL + +++ +AL+GP ANAT+ M GNY G     TSP+   
Sbjct: 403 AYEAAVAGVVLLKND-GVLPL-SKSVQRVALIGPWANATEQMQGNYHGVAPYLTSPLAAV 460

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
            A    +NYA G  +I     +   AA+ AA+ +D  +   G+D ++EAE  DR ++  P
Sbjct: 461 QASGLEVNYAFGT-NITSNVTNCFAAALAAAEKSDIIIFAGGIDNTLEAEELDRANITWP 519

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
           G Q ELI+++ +  K P+ ++ M  G VD +  K + K+ ++LW GYPG+ GG+A+ D++
Sbjct: 520 GNQLELIHRLGELGK-PLVVLQMGGGQVDSSALKASEKVGALLWGGYPGQAGGQALWDIL 578

Query: 551 FGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
            G+  P GRL  T Y A Y ++ P T M LRP  + PG+TY ++ G  VY FG+GL YT 
Sbjct: 579 TGQRAPAGRLTTTQYPAEYALQFPATDMSLRPRGDNPGQTYMWYTGEPVYAFGHGLFYTT 638

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
           F   +A   +       + ++  DI   +   +P     L++ +        F ++V N 
Sbjct: 639 FATALAGPGQ-------EPERSFDIGALLA--RPHAGYNLVEQLPF----LNFTVKVTNT 685

Query: 670 GKMDGSEVVMVYSKPPGIAGTHI-KQVIGYERV 701
           G++      M ++        H  K ++G++R+
Sbjct: 686 GEVISDYTAMAFANTTAGPRPHPNKWLVGFDRI 718


>gi|425780840|gb|EKV18836.1| Beta-xylosidase XylA [Penicillium digitatum PHI26]
 gi|425783077|gb|EKV20946.1| Beta-xylosidase XylA [Penicillium digitatum Pd1]
          Length = 792

 Score =  405 bits (1040), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 260/753 (34%), Positives = 388/753 (51%), Gaps = 45/753 (5%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD      +RA  L    TL E V   G++   VPRLGLP Y+ WSEALHG+  
Sbjct: 56  LSKTIVCDTTAKPHDRAAALTSMFTLEELVNSTGNVIPAVPRLGLPPYQVWSEALHGLD- 114

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
              R N      +      ATSFP+ IL  A+ N +L  +IG+ +ST+ RA  N G  GL
Sbjct: 115 ---RANLTESGDYS----WATSFPSPILIMAALNRTLINQIGEIISTQGRAFNNGGRYGL 167

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
             ++PNIN  R P WGR  ETPGED  +   Y + Y+ G+Q           + R LK++
Sbjct: 168 DVYAPNINSFRHPVWGRGQETPGEDVQLCSIYGVEYITGIQG--------GLNPRDLKLA 219

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           A  KH+A YDL+NW  + R   +  ++  D+   +   F   V +  V SVM SYN VNG
Sbjct: 220 ATAKHFAGYDLENWGNHSRLGNNVAISSFDLASYYTPQFITAVRDARVHSVMSSYNAVNG 279

Query: 249 IPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD 306
           +P+ A+  LL   +R  WNF   GY+ SDCD++  +   H + +     A   + +AG D
Sbjct: 280 VPSSANSFLLQTLLRETWNFVEDGYVSSDCDAVFNVFNPHGYASSASLAAAKSI-QAGTD 338

Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNICNP 365
           +DCG  Y  +   ++   +I+ ++I+ ++   Y  L+ LGYFDG + +Y++L   ++   
Sbjct: 339 IDCGATYQLYLNESLSHDEISRSEIERAVTRFYSTLVSLGYFDGDNSKYRHLHWPDVVAT 398

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
               ++ EAA +GIVLLKND G LPL + N +++AL+GP AN T  + GNY G     T 
Sbjct: 399 DAWNISYEAAVEGIVLLKND-GTLPL-SNNTRSVALIGPWANVTTTLQGNYYGAAPYLTG 456

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+    A +  +NYA G  +I   + S   AA+ AA  ++  +   G+D +VEAEG DR 
Sbjct: 457 PLAALQASNLDVNYAFGT-NISSDSTSGFEAALSAAGKSEVIIFAGGIDNTVEAEGVDRE 515

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            +  PG Q +LI +++   K P+ ++ M  G VD +  K N  + S++W GYPG+ GG A
Sbjct: 516 SITWPGNQLQLIEQLSKLGK-PLVVLQMGGGQVDSSSLKANKNVNSLVWGGYPGQSGGPA 574

Query: 546 IADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYG 604
           I D++ GK  P GRL +T Y A Y ++ P T M LRP  N PG+TY ++ G  VY FG+G
Sbjct: 575 ILDILTGKRAPAGRLTVTQYPAEYALQFPATDMSLRPKGNNPGQTYMWYTGKPVYEFGHG 634

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
           L YT FK  +A             +     +     ++P     +++ +   +Y     +
Sbjct: 635 LFYTTFKVSLA--------HFHGAENGTSFDIVQLLSRPNAGYSVVEQIPFINYT----V 682

Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKI 723
           EV N G +      M +         H  K ++G++R+    G S +   TM    +L  
Sbjct: 683 EVMNTGNVTSDYTAMAFVNTKAGPSPHPNKWLVGFDRL---GGISPRTTQTMTIPITLDN 739

Query: 724 V---DNAANSLLASGAHTI-LVGEGVGGVSFPL 752
           V   D   N ++  G + + L  E    +SF L
Sbjct: 740 VARTDERGNRIVYPGKYELTLNNERSAVLSFTL 772


>gi|388857998|emb|CCF48443.1| related to Beta-xylosidase [Ustilago hordei]
          Length = 782

 Score =  404 bits (1039), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 257/759 (33%), Positives = 395/759 (52%), Gaps = 66/759 (8%)

Query: 9   LSDFP-YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
           LS  P  CD  +P+  RA  LV + T  E +    + A GVPRLG+P Y+WW+EALHGV+
Sbjct: 30  LSKIPDICDPTIPFYTRATSLVNQFTTEELLNNTINYAPGVPRLGIPNYQWWTEALHGVA 89

Query: 68  FIGRRTNSPPGTHFD-----SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
                    PG +FD     +E   AT FP  I   A+F++ L+++I   +++E RA  N
Sbjct: 90  -------KSPGVNFDLSDPHAEFTSATQFPQTINLGATFDDDLYQQIASVIASEVRAYNN 142

Query: 123 LGNAGLTFWSP-NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
            G AGL  +SP NIN  RDPRWGR  ET GEDP  + R+A++ V GLQ   G     +++
Sbjct: 143 AGKAGLNLYSPLNINCFRDPRWGRGQETVGEDPLHMSRFAVSIVHGLQ---GPHAQNEAE 199

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
              L ++A CKH+ AYDL+ ++  +R+ FD+ V++QD+ +  +  F  CV +G  +++M 
Sbjct: 200 GNKLTVAATCKHFLAYDLEQYDRGERYQFDAIVSKQDLSDFHLPQFRACVRDGGATTLMT 259

Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVA 298
           SYN VN +P  A    L    R  W     H Y+ SDCD++  + + H++  +  E A A
Sbjct: 260 SYNAVNNVPPSASKYYLQTLARQAWGLDKTHNYVTSDCDAVANVYDGHRYAQNYVE-AAA 318

Query: 299 RVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKN 356
           + + AG DLDCG  Y+     A++Q     A I  ++  +Y  L+RLGYFD   S   + 
Sbjct: 319 KSINAGTDLDCGATYSENLGAALKQKLTDIATIRRAVIRMYASLVRLGYFDDPASQPLRQ 378

Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
           L   ++ +P    LA  +A   I LLKN +  LP+     K +A++GP+ N + +  GNY
Sbjct: 379 LTWKDVNSPSSQRLAYTSALSSITLLKNLDSTLPIKQKPTK-IAIIGPYTNVSTSFSGNY 437

Query: 417 EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNS-----MIPA-AIDAAK---NADAT 467
            G P  +   M   +A S+V       A IV  N +      IP+ A DA K   +AD+ 
Sbjct: 438 AG-PAAFNMTM--VHAASQVFP----DAKIVWVNGTDISGPYIPSDAQDAVKLTSDADSV 490

Query: 468 VIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADA----AKGPVTLVIMSAGAVDINFA 523
           V   G+D S+E E  DR D+  P  Q  LI++++ +     K  + +V    G +D    
Sbjct: 491 VFAGGIDASIERESHDRKDIAWPPNQLRLIHELSQSRKKDKKSKLVVVQFGGGQLDGASL 550

Query: 524 KNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPV 582
           K++  + +++W GYPG+    A+ D++ GK  P GRLP+T Y A+Y+  +P ++M LRP 
Sbjct: 551 KSDDAVGALVWAGYPGQSASLAVWDILAGKAVPAGRLPVTQYPASYIDGLPESAMSLRPK 610

Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
             +PGRTYK++ G   YPFG+GL YT F   +A              Q   I  T     
Sbjct: 611 AGYPGRTYKWYKGVPTYPFGHGLHYTTFSASLAKP------------QPYAIPTTPAAKG 658

Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERV 701
           P    V  + +   D     Q  ++N GK+      +++++   G A    K ++GY +V
Sbjct: 659 P--EGVHAEHISVAD----VQANIKNTGKVASDYTALLFARHSNGPAPYPRKTLVGYTKV 712

Query: 702 F-IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
             ++AG+ + V   +    +L   D   N  L  G++ +
Sbjct: 713 KNLSAGEESSVTIKITQA-ALARADEEGNQFLYPGSYQL 750


>gi|255690205|ref|ZP_05413880.1| xylosidase/arabinosidase [Bacteroides finegoldii DSM 17565]
 gi|260624224|gb|EEX47095.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
            finegoldii DSM 17565]
          Length = 1425

 Score =  404 bits (1038), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 257/756 (33%), Positives = 381/756 (50%), Gaps = 97/756 (12%)

Query: 12   FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            +P+ + +L   +R  DLV R+TL EKV+QM + A  + RLG+P Y WW+E LHGV   GR
Sbjct: 712  YPFRNPQLSIEQRVDDLVSRLTLEEKVRQMLNNAPAIKRLGIPAYNWWNECLHGV---GR 768

Query: 72   RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
                             T FP  I   AS+N+ L K++  +++ E RA+YN         
Sbjct: 769  TKYH------------VTVFPQAIGMAASWNDVLMKEVASSIADEGRAIYNDAQKRGDYS 816

Query: 127  ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
                LT+W+PNIN+ RDPRWGR  ET GEDPY+  +    +V GLQ           D R
Sbjct: 817  QYHALTYWTPNINIFRDPRWGRGQETYGEDPYLTSKIGKAFVLGLQG---------DDPR 867

Query: 184  PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
             LK SAC KHYA +        +R  F+S V+  D+ +T++  F   V + +VS VMC+Y
Sbjct: 868  YLKASACAKHYAVHSGPE---KNRHSFNSDVSTYDLWDTYLPAFRTLVVDANVSGVMCAY 924

Query: 244  NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
            N   G P C +  L+   +R  WNF GY+ SDC +I  I   HK   D    A   V   
Sbjct: 925  NAFKGQPCCGNDLLMQSILRDKWNFKGYVTSDCGAIDDIFNHHKAHPDAATAAADAVFH- 983

Query: 304  GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNN 361
            G DLDCG       + AV+ G I E  +D S++ L+ +  RLG FD + Q  Y ++  + 
Sbjct: 984  GTDLDCGQSAYLALVKAVKNGIITEKQLDVSVKRLFTIRFRLGLFDPAEQVDYAHIPISV 1043

Query: 362  ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
            +   +H +LA + AR+ +VLLKND   LPL    +K + ++GP+A+   A++GNY G P 
Sbjct: 1044 LECKKHQDLAKQLARESMVLLKNDR-LLPLQKNKLKKVVVMGPNADCKDALLGNYNGHPS 1102

Query: 422  RYTSPMDGFYAYSKVIN---YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
            R  +P+       K +    Y  G   I   +   +   ++ AK ADA + + G+   +E
Sbjct: 1103 RMLTPLQAIRERLKGVAEVVYVSGIDYINTVSEDELKRYVNQAKGADAVIFIGGISPRLE 1162

Query: 479  AE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINF-AKNNP 527
             E          G DR  + LP  QT+L+  +  A + P   V+M+  A+ I + AK+ P
Sbjct: 1163 GEEMSVNKDGFDGGDRTSIALPTVQTQLMKALV-AGRIPTVFVMMTGSALAIPWEAKHVP 1221

Query: 528  KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG 587
               +IL   Y G+ GG AIADV+FG YNP G+LP+T+Y  +      + +P     +  G
Sbjct: 1222 ---AILNAWYGGQYGGEAIADVLFGDYNPSGKLPVTFYAKD------SDLPDFESYDMQG 1272

Query: 588  RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
            RTY++F G  +YPFGYGLSYT F+Y     P + +                         
Sbjct: 1273 RTYRYFKGKALYPFGYGLSYTDFRYSSLKMPTACN------------------------- 1307

Query: 648  VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAG 706
                     D +    + V+N GKMDG EVV +Y S P       +  + G++R+++ AG
Sbjct: 1308 -------TTDKEIPVTVTVKNTGKMDGEEVVQLYVSHPDKKILVPVTALKGFKRIYLKAG 1360

Query: 707  QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            ++ ++ F++++ + L  VD      +  G   I VG
Sbjct: 1361 EAKQITFSLSS-EDLSCVDENGIRKVLPGTVKIQVG 1395


>gi|329745495|gb|AEB98984.1| xylosidase precursor [synthetic construct]
          Length = 804

 Score =  404 bits (1038), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 261/702 (37%), Positives = 377/702 (53%), Gaps = 47/702 (6%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD      +RA  L+   TL E +   G+   GV RLGLP+Y+ WSEALHG+    R   
Sbjct: 69  CDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPVYQVWSEALHGLD---RANF 125

Query: 75  SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
           S  G++       ATSFP  ILTTA+ N +L  +I   +ST+ RA  N G  GL  ++PN
Sbjct: 126 SDSGSY-----NWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPN 180

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           IN  R P  GR  ETPGED  +   YA  Y+ G+Q         D DS  LK++A  KHY
Sbjct: 181 INTFRHPVRGRGQETPGEDVSLAAVYAYEYITGIQG-------PDPDSN-LKLAATAKHY 232

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A YD++NW  + R   D  +T+QD+ E +   F +   +  V SVMC+YN VNG+P CAD
Sbjct: 233 AGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACAD 292

Query: 255 PKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
              L   +R  + F  HGY+ SDCD+   I   H + + ++  A A  + AG D+DCG  
Sbjct: 293 SYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDCGTT 351

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-----YKNLGKNNICNPQH 367
           Y      ++  G ++  DI+  +  LY  L++ GYFD +       Y++L  +++     
Sbjct: 352 YQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDA 411

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTG----NIKTLALVGPHANATKAMIGNYEGTPCRY 423
             ++ +AA QGIVLLKN N  LPL       +  T+AL+GP ANAT  ++GNY G     
Sbjct: 412 WNISYQAATQGIVLLKNSNKVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYM 471

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
            SP   F      +N+A     I   N S   AA+ AA++AD  +   G+D ++EAE  D
Sbjct: 472 ISPRVAFEEAGYNVNFAERTG-ISSTNTSGFAAALSAAQSADVIIYAGGIDNTLEAEALD 530

Query: 484 RVDLLLPGFQTELINKVADAA-KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
           R  +  PG Q +LI K+A +A   P+ ++ M  G VD +  KNN  + ++LW GYPG+ G
Sbjct: 531 RESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSG 590

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
           G A+ D+I GK NP GRL  T Y A+Y  + P T M LRP  + PG+TYK++ G  VY F
Sbjct: 591 GFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDNPGQTYKWYTGEAVYEF 650

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           G+GL YT F  + +S+  + +IKL+     +DI   +       A++    V        
Sbjct: 651 GHGLFYTTFA-ESSSNTTTREIKLN----IQDI---LSQTHEDLASITQLPV------LN 696

Query: 662 FQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERV 701
           F   ++N GK++     MV++     G A   +K ++G++R+
Sbjct: 697 FTANIKNTGKVESDYTAMVFANTSDAGPAPYPVKWLVGWDRL 738


>gi|443893988|dbj|GAC71176.1| hypothetical protein PANT_1d00031 [Pseudozyma antarctica T-34]
          Length = 759

 Score =  404 bits (1038), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 262/747 (35%), Positives = 388/747 (51%), Gaps = 65/747 (8%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD  L Y  RA  LV   T  E +    + A GVPRLG+P Y+WW+EALHGV+ 
Sbjct: 30  LSANAVCDTSLDYWTRATSLVAEFTTQELINNTINTAPGVPRLGIPPYQWWTEALHGVA- 88

Query: 69  IGRRTNSPPGTHF--DSEVP--GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
                   PG +F  D E P   AT+FP +I   A+F+++L++++   ++ E RA  N G
Sbjct: 89  ------GSPGVNFADDVEAPYGSATNFPQIINLGATFDDALYEQVATHIANETRAFNNAG 142

Query: 125 NAGLTFWSP-NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
            AGL  +SP NIN  RDPRWGR  ET GEDP  + RYA+  V+GLQ     E        
Sbjct: 143 KAGLNMYSPLNINCFRDPRWGRGQETTGEDPLHMSRYAVKMVQGLQGPNQDE-------- 194

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            L+++A CKHY AYDL+ W+G +R+ FD++V+ Q++ E ++  F  CV +G   ++M SY
Sbjct: 195 -LRLAATCKHYLAYDLEKWDGVERYQFDAQVSRQELAEFYLPQFRACVRDGKAVTLMTSY 253

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
           N VN +P  A    L    R +W     H Y+ SDCD++  + + H +  D+   A A  
Sbjct: 254 NAVNNVPPSASRYYLETLARKEWGLDKKHNYVTSDCDAVANVFDGHHYA-DSYVQAAADS 312

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD---GSPQYKNL 357
           + AG DL+CG  Y++    A++Q       I T++  +Y   +RLG FD   G P  + L
Sbjct: 313 INAGTDLNCGATYSDNLGQALEQNLTDVETIRTAVARMYASQVRLGLFDPKQGQP-LREL 371

Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
           G  ++      +LA  +A   + LLKN NG LP++ G  K +A++GP++NAT A+ GNY 
Sbjct: 372 GWEHVNTKAAQDLAYSSAAASVTLLKN-NGTLPVD-GATK-VAVIGPYSNATFALRGNYA 428

Query: 418 GT-PCRYTSPMDGFYAYSK-VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL 475
           G  P   T        +S+  I+ A G       N++   AA+  AK AD  +   G+D 
Sbjct: 429 GPGPFAITMTEAAQRVFSQATISSANGTTISGTYNHTDAEAAMQLAKEADLVIFAGGIDP 488

Query: 476 SVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
           ++E+E  DR  +  P  Q +LI+ +   AK  + +V    G +D    K +  I ++LW 
Sbjct: 489 TIESEELDRATIAWPPNQLQLIHALGGMAK-KMAVVQFGGGQIDGASIKADGNIGALLWA 547

Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNFPGRTYKFFD 594
           GYPG+ G  A+ DVI G   P GRLPIT Y A Y+  +  T+M LRP   +PGRTYK++ 
Sbjct: 548 GYPGQSGALAVMDVIAGNTAPAGRLPITQYPAEYIDGLAETTMALRPNATYPGRTYKWYS 607

Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
           G   YP+ +GL YT+FK ++A                +   YT+ T          + V 
Sbjct: 608 GTPTYPYAHGLHYTEFKAELA----------------QPAPYTIAT----AGYAEFERVA 647

Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI-KQVIGYERV-FIAAGQSAKVG 712
                 T Q  + N G+       +V+++       H  K ++GY++V  IA G+S  V 
Sbjct: 648 ------TVQATITNAGQRTSDYAALVFARHTNGPAPHPNKTLVGYKKVKAIAPGESRSVE 701

Query: 713 FTMNACKSLKIVDNAANSLLASGAHTI 739
             +    +L   D   N +L  G + +
Sbjct: 702 VEITQA-ALARGDEEGNLVLYPGKYEL 727


>gi|189201569|ref|XP_001937121.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187984220|gb|EDU49708.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 756

 Score =  404 bits (1037), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 260/724 (35%), Positives = 379/724 (52%), Gaps = 58/724 (8%)

Query: 32  MTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSF 91
           M   EK+  +   + GV RLGLP Y WW EALHGV+         PG +F      ATSF
Sbjct: 52  MQTQEKLDNLVSKSKGVARLGLPAYNWWGEALHGVA-------GAPGINFTGPYRTATSF 104

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPG 151
           P  +L +A+F++ L  +I   +  EARA  N G A + FW+P+IN  RDPRWGR  ETPG
Sbjct: 105 PMPLLMSAAFDDDLIHQIAIVIGNEARAFGNGGIAPVDFWTPDINPFRDPRWGRGSETPG 164

Query: 152 EDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFD 211
           ED   +  Y  + + GL   EG +  R       KI A CKHY  YD+++W G DR  FD
Sbjct: 165 EDILRIKGYTKSLLSGL---EGDKAQR-------KIIATCKHYVGYDMEDWNGTDRHSFD 214

Query: 212 SRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNF--- 268
           +++T QD+ E F+ PF+ C  +  V S MCSYN VNG+PTCAD  +L   +R  WN+   
Sbjct: 215 AKITTQDLAEYFMPPFQQCARDSKVGSFMCSYNAVNGVPTCADTYVLEDILRKHWNWTDS 274

Query: 269 HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAE 328
           + YI SDC++++ I   HK++  T ++A A     G+DL C    ++   GA  QG +  
Sbjct: 275 NNYITSDCEAVKDISLRHKYVA-TLQEATAIAFNNGMDLSCEYSGSSDIPGAFSQGLLNV 333

Query: 329 ADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNG 387
           + ID +L   Y  L+  GYFDG +  Y NLG  +I  P+  +L  + A +G+ LLKND+ 
Sbjct: 334 SVIDRALTRQYEGLVHAGYFDGAAATYANLGVQDINTPEAQKLVLQVAAEGLTLLKNDD- 392

Query: 388 ALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV-INYAPGCADI 446
            LPL+  +   +A+VG  AN +  + G Y G      +P+   YA +K+ ++ A     I
Sbjct: 393 TLPLSLKSGSKVAMVGFWANDSSKLSGIYSGPAPYLHNPV---YAGNKLGLDMAVATGPI 449

Query: 447 VCQN---NSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADA 503
           + ++   ++    A+DAAK +D  +   GLD S  AEG DR D+  P  Q +LI K+  A
Sbjct: 450 LQKSGAADNWTTKALDAAKKSDTILYFGGLDPSAAAEGSDRTDISWPSAQIDLITKL--A 507

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
           A G   +VI     VD     N   + S++W  +PG++GG A+  VI G++   GRLPIT
Sbjct: 508 ALGKPLVVIALGDMVDHMPILNMKGVNSLIWANWPGQDGGTAVMQVITGEHAIAGRLPIT 567

Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVAS-SPKSVD 622
            Y A Y ++    M LRP  N PGRTY++++   V PFG+GL YT+F  K  S S  +V+
Sbjct: 568 QYPAKYTQLSMLDMNLRPGGNNPGRTYRWYN-ESVQPFGFGLHYTKFAAKFGSNSSLTVN 626

Query: 623 IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYS 682
           I+        DI  +   + P    V              ++ V N G      + + + 
Sbjct: 627 IQ--------DIMKSCTKDHPDLCDVP-----------PIEVAVTNKGNRTSDFIALAFI 667

Query: 683 KPPGIAGTH---IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
           K  G  G     +K ++ Y R+   +G   K         +L  VD + N +   G +T+
Sbjct: 668 K--GEVGPKPYPLKTLVSYARLRDISGSQTKTASLALTLGTLSRVDQSGNLVAYPGEYTL 725

Query: 740 LVGE 743
           L+ E
Sbjct: 726 LLDE 729


>gi|359409694|ref|ZP_09202159.1| Beta-glucosidase [Clostridium sp. DL-VIII]
 gi|357168578|gb|EHI96752.1| Beta-glucosidase [Clostridium sp. DL-VIII]
          Length = 723

 Score =  403 bits (1035), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 254/748 (33%), Positives = 390/748 (52%), Gaps = 107/748 (14%)

Query: 25  AKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSE 84
           AK+LV +MTL EK +Q+   +  V RL +P Y WW+E LHGV+  G              
Sbjct: 29  AKELVAKMTLQEKAEQLTYNSPAVKRLNIPEYNWWNEGLHGVARAGT------------- 75

Query: 85  VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPNIN 136
              AT FP  I   A F+E    K+   ++TE RA YN  +         GLT+WSPN+N
Sbjct: 76  ---ATVFPQAIGLAAMFDEEFLGKVAGIIATEGRAKYNENSKKEDRDIYKGLTYWSPNVN 132

Query: 137 VVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAA 196
           + RDPRWGR  ET GEDPY+  R  + +V+GLQ           D + LK+SAC KH+A 
Sbjct: 133 IFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQ----------GDGKYLKLSACAKHFAV 182

Query: 197 YDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPK 256
           +   +   + R  F++ V+++D+ ET++  FE CV E +V SVM +YNR NG P C    
Sbjct: 183 H---SGPESLRHEFNAVVSQKDLHETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKA 239

Query: 257 LLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNF 316
           LL   +RG W F G++VSDC ++      HK +  T  ++VA  ++ G DL+CG+ Y N 
Sbjct: 240 LLKDILRGKWGFKGHVVSDCWALADFHMHHK-VTSTATESVALAIENGCDLNCGNMYLNL 298

Query: 317 TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLG-KNNICNPQHIELAAEAA 375
            + A ++G + E  I T+   L     +LG FD   +Y  +  + N C  +H +++ EA+
Sbjct: 299 LL-AYKEGLVTEEQITTAAERLMTTRFKLGMFDEDCEYNQIPYEVNDCK-EHNQVSLEAS 356

Query: 376 RQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY-- 433
           R+ +VLLKN NG LPL+   +K +A++GP+AN+   + GNY GT  +YT+ +DG +    
Sbjct: 357 RKSMVLLKN-NGILPLDKSKLKAVAVIGPNANSEIMLKGNYSGTASKYTTILDGIHDVLD 415

Query: 434 -SKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE------ 480
               + Y+ GC       + + + +  +  A+  A+ AD  ++  GLD ++E E      
Sbjct: 416 DDVRVYYSEGCHLYKEKVEDLARRDDRLAEAVSVAERADVVILCLGLDSTIEGEQGDAGN 475

Query: 481 ---GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGY 537
                D++DL LPG Q EL+ KV +  K PV +V+ +   + +N A+   +  +IL   Y
Sbjct: 476 GYGAGDKLDLNLPGIQQELLEKVLETGK-PVVVVLGTGSGLTLNGAEE--RCAAILNAWY 532

Query: 538 PGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP-YTSMPLRPVNNFPGRTYKFFD-G 595
           PG  GG A AD++FGK +P G+LP+T+Y+ +  K+P +T   ++      GRTY++ D  
Sbjct: 533 PGSHGGTAAADILFGKCSPSGKLPVTFYK-DTDKLPEFTDYAMK------GRTYRYMDES 585

Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
             +YPFGYGL+Y+  +      P +V  + D                       ID    
Sbjct: 586 NCLYPFGYGLTYSTVELSNLQVP-AVRGEFDG----------------------ID---- 618

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFT 714
                   +E+EN G  D  EVV  Y K        +   + G++RV +  G+S  V   
Sbjct: 619 ------ISVEIENTGSYDIEEVVQCYIKDLESKYAVLNHSLAGFKRVSLKKGESKTVTMK 672

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
           +N  ++ + VD+A   +L S    + VG
Sbjct: 673 LNR-RAFEAVDDAGERILDSKKFKLFVG 699


>gi|410617070|ref|ZP_11328046.1| beta-glucosidase [Glaciecola polaris LMG 21857]
 gi|410163339|dbj|GAC32184.1| beta-glucosidase [Glaciecola polaris LMG 21857]
          Length = 731

 Score =  403 bits (1035), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 253/757 (33%), Positives = 385/757 (50%), Gaps = 99/757 (13%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           + D  + + +RA  LV  MT+ EK+ Q+      + RL +P Y WW+EALHG++  G+  
Sbjct: 29  WFDPDISFAQRANLLVNAMTVDEKIAQLSHATPAIARLNVPQYNWWNEALHGIARNGK-- 86

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY----NLGN---- 125
                         AT FP  I   A+F+  L  ++   +S EARA Y    ++GN    
Sbjct: 87  --------------ATIFPQAIGLAATFDPDLAHQVASAISDEARAKYAIAQSIGNQGQY 132

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
           AGLTFW+PN+N+ RDPRWGR  ET GEDP++  +    +V+GLQ           D + L
Sbjct: 133 AGLTFWTPNVNIFRDPRWGRGQETYGEDPFLTAQMGTAFVKGLQG---------DDPKYL 183

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K +   KH+A +   +   + R HFD   +++D+ ET++  FE  V +  V+ VMC+YN 
Sbjct: 184 KSAGVAKHFAVH---SGPESLRHHFDVEPSQKDLYETYLPAFEALVTQAKVAGVMCAYNA 240

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           VNG P CA  +LL+  ++  W FHGYIVSDC ++      HK      E A A  L++G+
Sbjct: 241 VNGEPACASAQLLDGILKKQWGFHGYIVSDCGALNDFQAGHKVTKSGPESA-ALALQSGV 299

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNIC 363
           +L+CG  Y +F   A++Q  +    ID  L  L ++  +LG+FD  G   Y  +  + I 
Sbjct: 300 NLNCGSTYEHFLKAALEQNLVPLELIDQRLTQLLMIRFQLGFFDPAGLNPYNEVTPDVIH 359

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
           +P+HI L+ + AR+ IVLLKNDN  LPL+  +IK   + GP A ++  +IGNY G     
Sbjct: 360 SPEHINLSRDVARKSIVLLKNDNHVLPLSK-DIKVPYVTGPFAASSDMLIGNYYGISDSL 418

Query: 424 TSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPA--AIDAAKNADATVIVAGLDLSVE 478
            S ++G     +    +NY  G       +N++ P   A   AK ADA + V G+   +E
Sbjct: 419 VSVLEGIAGKVSLGSSLNYRSGSLPF---HNNINPLNWAPQVAKTADAVIAVVGVSADME 475

Query: 479 ---------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
                    A+  DRV + LP  Q + + ++A   KGP+ LV+ +   VDI  +   P  
Sbjct: 476 GEEVDAIASADRGDRVAITLPQNQVDYVKQLAAHKKGPLILVVAAGSPVDI--SDLEPLA 533

Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRT 589
            +ILW+ YPGE+GG A+ADV+FG  NP G LP+T+ ++     P+    +       GRT
Sbjct: 534 DAILWIWYPGEQGGNAVADVLFGDTNPSGHLPLTFVKSIDDLPPFDDYAMT------GRT 587

Query: 590 YKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
           YKF +   +YPFG+G SYT+F +                      + TV   K       
Sbjct: 588 YKFLEKAPLYPFGFGRSYTEFSFN---------------------DLTVSQGK------- 619

Query: 650 IDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG-IAGTHIKQVIGYERVFIAAGQS 708
                 +    T  +EVEN G + G  VV  Y  P   +    I  +  ++R+ +A  ++
Sbjct: 620 ----AIEGEALTLSVEVENRGDIAGETVVQAYLSPIARMNNEAISSLKSFKRIHLAPKET 675

Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
             V  T+   K L  V+NA  ++   G +++ VG+ +
Sbjct: 676 RWVELTIQG-KDLYQVNNAGETVWPQGRYSLAVGDSL 711


>gi|291525508|emb|CBK91095.1| Beta-glucosidase-related glycosidases [Eubacterium rectale DSM
           17629]
          Length = 714

 Score =  402 bits (1033), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 230/626 (36%), Positives = 349/626 (55%), Gaps = 68/626 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           E AK LV +MT+ EK+ QM   +  + RLG+P Y WW+EALHGV+  G            
Sbjct: 7   EYAKKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------- 55

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
                AT FP  I   A+F+  L +KIG  VSTE R  +N  +         GLTFW+PN
Sbjct: 56  -----ATVFPQAIGLAATFDTDLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPN 110

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           +N+ RDPRWGR  ET GEDPY+ G+    Y+RGLQ           D   LK +AC KH+
Sbjct: 111 VNIFRDPRWGRGHETYGEDPYLTGKLGCAYIRGLQG---------DDPDHLKSAACAKHF 161

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A +   +     R  FD++ ++ DM +T++  F+ CV +  V +VM +YNRVNG P C  
Sbjct: 162 AVH---SGPEAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGS 218

Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
             LL   +R ++ F G++VSDC +I    E H  + DT E++ A  +  G DL+CG  + 
Sbjct: 219 RTLLKDILRDEFGFEGHVVSDCWAILDFHE-HHHVTDTVEESAAMAVNNGCDLNCGSAFL 277

Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICNPQHIELAAE 373
           +    A  +G +++  I  ++  L  V +RLG     P  Y+++    +   +H+EL+ E
Sbjct: 278 HLK-DAYDKGLVSDEAITAAVERLMEVRIRLGMMKDYPSPYEDISYEVVECKEHVELSVE 336

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
           AAR+ +VLLKN +  LPL+  N+KT+A++GP+AN+  A+IGNY GT  RY +P++G   Y
Sbjct: 337 AARRSLVLLKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQY 396

Query: 434 ----SKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE--- 480
               ++V+ YA GC         + +       A+  A+ +D  V+  GLD ++E E   
Sbjct: 397 LGEDTRVL-YAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGD 455

Query: 481 ------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
                   D++ L+LPG Q EL+  VA   K PV LV+ +  A+D+++A+ +  + +I+ 
Sbjct: 456 AGNEYASGDKLGLMLPGLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAEEH--VDAIID 512

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
             YPG  GG+A+A+ IFG+Y+P G+LP+T+Y+         ++P     +   RTY++ +
Sbjct: 513 SWYPGARGGKAVAEAIFGEYSPSGKLPVTFYQGT------ENLPEFTDYSMAHRTYRYTN 566

Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKS 620
             V+YPFGYGL Y +  Y   S  K+
Sbjct: 567 ENVLYPFGYGLHYGETNYDGMSVDKA 592


>gi|238923424|ref|YP_002936940.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
 gi|238875099|gb|ACR74806.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
          Length = 714

 Score =  402 bits (1033), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 230/626 (36%), Positives = 349/626 (55%), Gaps = 68/626 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           E AK LV +MT+ EK+ QM   +  + RLG+P Y WW+EALHGV+  G            
Sbjct: 7   EYAKKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------- 55

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
                AT FP  I   A+F+  L +KIG  VSTE R  +N  +         GLTFW+PN
Sbjct: 56  -----ATVFPQAIGLAATFDTDLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPN 110

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           +N+ RDPRWGR  ET GEDPY+ G+    Y+RGLQ           D   LK +AC KH+
Sbjct: 111 VNIFRDPRWGRGHETYGEDPYLTGKLGCAYIRGLQG---------DDPDHLKSAACAKHF 161

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A +   +     R  FD++ ++ DM +T++  F+ CV +  V +VM +YNRVNG P C  
Sbjct: 162 AVH---SGPEAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGS 218

Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
             LL   +R ++ F G++VSDC +I    E H  + DT E++ A  +  G DL+CG  + 
Sbjct: 219 RTLLKDILRDEFGFEGHVVSDCWAILDFHE-HHHVTDTVEESAAMAVNNGCDLNCGSAFL 277

Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICNPQHIELAAE 373
           +    A  +G +++  I  ++  L  V +RLG     P  Y+++    +   +H+EL+ E
Sbjct: 278 HLK-DAYDKGMVSDEAITAAVERLMEVRIRLGMMKDYPSPYEDISYEVVECKEHVELSVE 336

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
           AAR+ +VLLKN +  LPL+  N+KT+A++GP+AN+  A+IGNY GT  RY +P++G   Y
Sbjct: 337 AARRSLVLLKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQY 396

Query: 434 ----SKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE--- 480
               ++V+ YA GC         + +       A+  A+ +D  V+  GLD ++E E   
Sbjct: 397 LGEDTRVL-YAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGD 455

Query: 481 ------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
                   D++ L+LPG Q EL+  VA   K PV LV+ +  A+D+++A+ +  + +I+ 
Sbjct: 456 AGNEYASGDKLGLMLPGLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAEEH--VDAIID 512

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
             YPG  GG+A+A+ IFG+Y+P G+LP+T+Y+         ++P     +   RTY++ +
Sbjct: 513 SWYPGARGGKAVAEAIFGEYSPNGKLPVTFYQGT------ENLPEFTDYSMAHRTYRYTN 566

Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKS 620
             V+YPFGYGL Y +  Y   S  K+
Sbjct: 567 ENVLYPFGYGLHYGETNYDGLSVDKA 592


>gi|291528382|emb|CBK93968.1| Beta-glucosidase-related glycosidases [Eubacterium rectale M104/1]
          Length = 714

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 230/626 (36%), Positives = 349/626 (55%), Gaps = 68/626 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           E AK LV +MT+ EK+ QM   +  + RLG+P Y WW+EALHGV+  G            
Sbjct: 7   EYAKKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------- 55

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
                AT FP  I   A+F+  L +KIG  VSTE R  +N  +         GLTFW+PN
Sbjct: 56  -----ATVFPQAIGLAAAFDADLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPN 110

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           +N+ RDPRWGR  ET GEDPY+ G+    Y+RGLQ           D   LK +AC KH+
Sbjct: 111 VNIFRDPRWGRGHETYGEDPYLTGKLGCAYIRGLQG---------DDPDHLKSAACAKHF 161

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A +   +     R  FD++ ++ DM +T++  F+ CV +  V +VM +YNRVNG P C  
Sbjct: 162 AVH---SGPEAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGS 218

Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
             LL   +R ++ F G++VSDC +I    E H  + DT E++ A  +  G DL+CG  + 
Sbjct: 219 RTLLKDILRDEFGFEGHVVSDCWAILDFHE-HHHVTDTVEESAAMAVNNGCDLNCGSAFL 277

Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICNPQHIELAAE 373
           +    A  +G +++  I  ++  L  V +RLG     P  Y+++    +   +H+EL+ E
Sbjct: 278 HLK-DAYDKGLVSDEAITAAVERLMEVRIRLGMMKDYPSPYEDISYEVVECKEHVELSVE 336

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
           AAR+ +VLLKN +  LPL+  N+KT+A++GP+AN+  A+IGNY GT  RY +P++G   Y
Sbjct: 337 AARRSLVLLKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQY 396

Query: 434 ----SKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE--- 480
               ++V+ YA GC         + +       A+  A+ +D  V+  GLD ++E E   
Sbjct: 397 LGDDTRVL-YAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGD 455

Query: 481 ------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
                   D++ L+LPG Q EL+  VA   K PV LV+ +  A+D+++A+ +  + +I+ 
Sbjct: 456 AGNEYASGDKLGLMLPGLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAEEH--VDAIID 512

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
             YPG  GG+A+A+ IFG+Y+P G+LP+T+Y+         ++P     +   RTY++ +
Sbjct: 513 SWYPGARGGKAVAEAIFGEYSPSGKLPVTFYQGT------ENLPEFTDYSMAHRTYRYTN 566

Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKS 620
             V+YPFGYGL Y +  Y   S  K+
Sbjct: 567 ENVLYPFGYGLHYGETNYDGLSVDKA 592


>gi|451996250|gb|EMD88717.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
           C5]
          Length = 763

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 269/752 (35%), Positives = 383/752 (50%), Gaps = 68/752 (9%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD   P  ERA  LV  M   EK+  +   + GV RLGLP Y WW EALHGV+ 
Sbjct: 31  LSTNAICDVNAPPHERAAALVAAMEPQEKLDNLVSKSKGVSRLGLPAYNWWGEALHGVA- 89

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
                   PG  F      ATSFP  IL +A+F++ L  KI   +  EARA  N G A +
Sbjct: 90  ------GAPGIKFVEPYKNATSFPMPILMSAAFDDDLIFKIANIIGNEARAFGNGGVAPM 143

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
            +W+P+IN VRD RWGR  E+PGED   +  Y    + GL   EG +  R       KI 
Sbjct: 144 DYWTPDINPVRDIRWGRASESPGEDIRRIKGYTKALLAGL---EGDQAQR-------KII 193

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           A CKHY  YD++ W G DR +F +++T QD+ E ++ PF+ C  +  V S MCSYN VNG
Sbjct: 194 ATCKHYVGYDMEAWGGYDRHNFSAKITMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNG 253

Query: 249 IPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           +PTCAD  +L   +R  WN+   + YI SDC+++  I E+HK++ +T     A     G+
Sbjct: 254 VPTCADTYVLQTILRDHWNWTDSNNYITSDCEAVADISENHKYV-ETLAQGTALAFAKGM 312

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICN 364
           DL C    ++   GA  QG +  + ID +L   Y  L+  GYFDG+   Y NL  N+I  
Sbjct: 313 DLSCEYSGSSDIPGAWSQGLLNLSVIDKALTRQYEGLVHAGYFDGAKATYANLSYNDINT 372

Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
           P+  +L+ +   +G+V+LKND+  LPL       +A++G  AN +  + G Y G P    
Sbjct: 373 PEARQLSLQVTSEGLVMLKNDH-TLPLPLTKGSKVAMIGFWANDSSKLQGIYSGPPPYRH 431

Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIP-----AAIDAAKNADATVIVAGLDLSVEA 479
           SP+  F      ++ A     ++   NS +P      A+DAA+ +D  +   G D +V  
Sbjct: 432 SPV--FAGEQMGLDMAIAWGPMI--QNSSVPDNWTTNALDAAEKSDYILYFGGQDWTVAQ 487

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAG-AVDINFAKNNPKIKSILWVGYP 538
           EG DR  +  P  Q +L+ K+A   K    LV+++ G   D +   +   I SI+W  +P
Sbjct: 488 EGYDRTTISFPQVQIDLLAKLAKLGK---PLVVITLGDMTDHSPLLSMEGINSIIWANWP 544

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVV 598
           G++GG AI +VI G + P GRLPIT Y A+YVK+    M LRP    PGRTY++F+   V
Sbjct: 545 GQDGGPAILNVISGVHAPAGRLPITEYPADYVKLSMLDMNLRPHAESPGRTYRWFN-ESV 603

Query: 599 YPFGYGLSYTQFKYKVASSPK-SVDIKLDKD---QQCRDINYTVGTNKPPCAAVLIDDVK 654
            PFG+GL YT F+   AS    + DI+   D   QQ +D+          C         
Sbjct: 604 QPFGFGLHYTTFEAGFASEEGLTYDIQETLDSCTQQYKDL----------CEVA------ 647

Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH---IKQVIGYERVFIAAGQSAKV 711
                   ++ V N G      V + + K  G  G     +K +I Y R+    G + K 
Sbjct: 648 ------PLEVTVANKGNRTSDFVALAFIK--GEVGPKPYPLKTLITYGRLRDIHGGAKKS 699

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
                    L  VD + N+++  G +T+L+ E
Sbjct: 700 ASLPLTLGELARVDQSGNTVIYPGEYTLLLDE 731


>gi|67523807|ref|XP_659963.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
 gi|74597492|sp|Q5BAS1.1|XYND_EMENI RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|40745314|gb|EAA64470.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
 gi|259487761|tpe|CBF86686.1| TPA: Beta-xylosidase (EC 3.2.1.37)
           [Source:UniProtKB/TrEMBL;Acc:O42810] [Aspergillus
           nidulans FGSC A4]
          Length = 803

 Score =  400 bits (1029), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 236/605 (39%), Positives = 340/605 (56%), Gaps = 27/605 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS  P CD  L   +RA  LV   T  E V   G+   GV RLGLP Y+ W EALHGV  
Sbjct: 55  LSLTPVCDRSLSPKDRATALVSLFTFDELVNNTGNTGLGVSRLGLPNYQVWGEALHGVG- 113

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
              R N     +F      ATSFP  I   A+ N++L  +IG  VST+ RA  N G  G+
Sbjct: 114 ---RANFVESGNFS----WATSFPMPITMMAALNKTLIHQIGTIVSTQLRAFSNAGLGGV 166

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
             +SPNIN  R P WGR  ETPGED ++   Y   Y+  LQ   GV      D   LKI 
Sbjct: 167 DVYSPNINTFRHPVWGRGQETPGEDAFLTSVYGYEYITALQG--GV------DPETLKII 218

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           A  KHYA YD+++W  + R   D ++T+Q++ E +  PF +   +  V SVMCSYN VNG
Sbjct: 219 ATAKHYAGYDIESWNNHSRLGNDMQITQQELSEYYTPPFIVASRDAKVRSVMCSYNAVNG 278

Query: 249 IPTCADPKLLNQTIRGDWNFH--GYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD 306
           +P+CA+   L   +R  + F   GY+  DC ++  +   H + ++ +  A A  + AG D
Sbjct: 279 VPSCANKFFLQTLLRDTFEFSEDGYVSGDCGAVYNVWNPHGYASN-EAAASADSILAGTD 337

Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICNP 365
           +DCG  Y   +  A +   ++ +DI+  +  LY  L++ GYFDG    Y+++  +++ + 
Sbjct: 338 IDCGTSYQWHSEDAFEDSLVSRSDIERGVIRLYSNLVQAGYFDGEDAPYRDITWDDVLST 397

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
               +A EAA +GIVLLKND   LPL+  +IK++A++GP AN T+ + GNY G      S
Sbjct: 398 DAWNIAYEAAVEGIVLLKNDE-TLPLSK-DIKSVAVIGPWANVTEELQGNYFGPAPYLIS 455

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+ GF      ++YA G  ++   + S    A+ AAK ADA +   G+D ++EAE  DR 
Sbjct: 456 PLTGFRDSGLDVHYALGT-NLTSHSTSGFEEALTAAKQADAIIFAGGIDNTIEAEAMDRE 514

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           ++  PG Q +LI+K+++  K P+ ++ M  G VD +  K+N  + +++W GYPG+ GG A
Sbjct: 515 NITWPGNQLDLISKLSELGK-PLVVLQMGGGQVDSSSLKDNDNVNALIWGGYPGQSGGHA 573

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKI-PYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
           +AD+I GK  P GRL  T Y A Y ++ P   M LRP   +  PG+TY ++ G  VY FG
Sbjct: 574 LADIITGKRAPAGRLVTTQYPAEYAEVFPAIDMNLRPNETSGNPGQTYMWYTGTPVYEFG 633

Query: 603 YGLSY 607
           +GL Y
Sbjct: 634 HGLFY 638


>gi|169611757|ref|XP_001799296.1| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
 gi|160702362|gb|EAT83185.2| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
          Length = 755

 Score =  400 bits (1027), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 268/751 (35%), Positives = 378/751 (50%), Gaps = 67/751 (8%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L D   CD      ERA  LVE M   EK+    +L  GV RLGLP Y WW EALHGV+ 
Sbjct: 28  LKDNKICDVTAAPAERAAALVEAMQTNEKLD---NLMRGVTRLGLPKYNWWGEALHGVA- 83

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
                   PG +F      ATSFP  +L +A+F++ L  KI   +  EARA  N G A +
Sbjct: 84  ------GAPGINFTGAYKTATSFPMPLLMSAAFDDDLIFKIANIIGNEARAFGNGGVAPV 137

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
            FW+P+IN  RDPRWGR  ETPGED   +  Y  + + GL   EG +  R       KI 
Sbjct: 138 DFWTPDINPFRDPRWGRGSETPGEDIVRIKGYTKHLLAGL---EGDKPQR-------KII 187

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           A CKHY  YD++ W G DR  F++++  QD+ E ++ PF+ C  +  V S MCSYN VNG
Sbjct: 188 ATCKHYVGYDMEAWGGIDRHSFNAKINMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNG 247

Query: 249 IPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           +PTCAD  +L   +R  WN+   + YI SDC++++ I   HK+   T  +       AG+
Sbjct: 248 VPTCADTYVLQTILRDHWNWTESNNYITSDCEAVKDISLKHKYAK-TNAEGTGLAFTAGM 306

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNICN 364
           D  C    ++   GA  Q  ++   ID +L+  Y  L+R GYFDG +  Y NLG  +I  
Sbjct: 307 DNSCEYTGSSDIPGAFNQSYLSIPTIDRALKRQYEGLVRAGYFDGAAATYANLGVKDINT 366

Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
           P+  +L+ + A +G+VLLKND+  LPL+  N   +A++G  AN T  + G Y G      
Sbjct: 367 PEAQQLSLQVASEGLVLLKNDD-TLPLSLTNGSKVAMLGFWANDTSKLSGIYSGPAPYLR 425

Query: 425 SPMDGFYAYSKV-INYAPGCADIVCQNNS-----MIPAAIDAAKNADATVIVAGLDLSVE 478
           SP+   +A  K+ ++ A     I+ Q+NS         A+ AA+ +D  +   GLD S  
Sbjct: 426 SPV---WAGQKLGLDMAIASGPILQQSNSSTRDNWTTNALAAAEKSDYILYFGGLDPSAA 482

Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP-----KIKSIL 533
           AEG DR  +  P  Q +LI K+A   K  V LV+        +   N+P      + S++
Sbjct: 483 AEGFDRNSIAWPTAQVDLIKKLAAIGKPLVVLVLG-------DLMDNSPLLELDGVNSVI 535

Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
           W  +PG++GG A+  V+ G     GRLPIT Y ANY ++    M +RP ++ PGRTY++F
Sbjct: 536 WANWPGQDGGSAVMQVVTGAVAVAGRLPITQYPANYTELSMLDMNMRPSSSSPGRTYRWF 595

Query: 594 DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
           +G  V PFG GL YT F  K A++                I Y +      C     D  
Sbjct: 596 NG-AVQPFGTGLHYTTFDAKFAAN--------------STIEYDISNITKECTNQYPDTC 640

Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVG 712
                  +  + V N G      + + + K   G A   +K +I Y RV    G   K  
Sbjct: 641 SVP----SIPVAVTNSGNRTSDFIALAFIKGENGPAPYPLKTLISYTRVRDVKGGQTKSA 696

Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVGE 743
                  +L  VD   N++L  G +T+L+ E
Sbjct: 697 EMQLTLGNLARVDQMGNTVLYPGEYTVLLDE 727


>gi|326791674|ref|YP_004309495.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
 gi|326542438|gb|ADZ84297.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
          Length = 696

 Score =  399 bits (1026), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 236/641 (36%), Positives = 347/641 (54%), Gaps = 80/641 (12%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           ++AK LV  MTL E+  Q+   +  + RLG+P Y WW+EALHGV+  G            
Sbjct: 8   KKAKALVAEMTLEERASQLKYDSPAIKRLGVPAYNWWNEALHGVARAGV----------- 56

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
                ATSFP  I   A+F++ L K++ + ++ E RA YN  +         GLTFWSPN
Sbjct: 57  -----ATSFPQAIGMAATFDDELLKRVAEVIAEEGRAKYNAYSQEGDRDIYKGLTFWSPN 111

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           +N+ RDPRWGR  ET GEDPY+  R  + +V+GLQ  EG           LK +AC KH+
Sbjct: 112 VNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGEEG-----------LKTAACAKHF 160

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A +        DR HFD+RV+++D+ ET++  FE  V E +V SVM +YNR NG P C  
Sbjct: 161 AVHSGPE---ADRHHFDARVSQKDLWETYLPAFEALVKEAEVESVMGAYNRTNGEPCCGS 217

Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
           P L+   +R  W F G+ VSDC +I+   E H  +  T +++ A  LK+G DL+CG+ Y 
Sbjct: 218 PTLMKDILREKWGFQGHYVSDCWAIKDFHE-HHMVTSTAQESAALALKSGCDLNCGNTYL 276

Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEA 374
           +  M A Q G + E +I T+   L+     LG FDGS  Y  +    + +  H+ +A EA
Sbjct: 277 HILM-AYQNGLVTEEEITTAAERLFTTRYLLGLFDGST-YDAIPYEVVESKPHLSVADEA 334

Query: 375 ARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA-- 432
             + IVLLKN NG LPLN  +IKT+ ++GP+AN+ KA+IGNY GT  +Y + ++G     
Sbjct: 335 TAKSIVLLKN-NGLLPLNKESIKTIGVIGPNANSRKALIGNYHGTSSQYITILEGLQKEV 393

Query: 433 -------YSKVIN-YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE---- 480
                  YS+  + YA     +  Q + +  A I  AK++D  ++  GLD ++E E    
Sbjct: 394 GDEVRILYSEGSHLYADRVEPLAYQRDRLSEAKI-VAKHSDVVIVCVGLDETLEGEEGDT 452

Query: 481 -----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
                  D+ DL LP  Q EL+  +A   K PV L + +  A+D+ +A  +    ++L  
Sbjct: 453 GNAYASGDKRDLALPEPQQELVEAMAKMGK-PVILCLSAGSAIDLQYA--DAHYDAVLQA 509

Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
            YPG  GG+ IA  + G+  P G+LP+T+Y         + +P     +  GRTY++   
Sbjct: 510 WYPGARGGQVIAKALLGEIVPSGKLPVTFYR------DLSGLPAFEDYSMQGRTYRYMQE 563

Query: 596 PVVYPFGYGLSYTQFKYKVASSPK---------SVDIKLDK 627
             +YPFGYGL+Y + + + AS  +          VD KL++
Sbjct: 564 EALYPFGYGLTYGKCRIEEASYDQGSLRVLVHNEVDFKLEE 604


>gi|451851086|gb|EMD64387.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
          Length = 763

 Score =  399 bits (1025), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 267/752 (35%), Positives = 382/752 (50%), Gaps = 68/752 (9%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD   P  ERA  LV  M   EK+  +   + GV RLGLP Y WW EALHGV+ 
Sbjct: 31  LSTNAICDVNAPPHERAAALVAAMEPQEKLDNLVSKSKGVSRLGLPAYNWWGEALHGVA- 89

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
                   PG  F      ATSFP  IL +A+F++ L  KI   +  EARA  N G A +
Sbjct: 90  ------GAPGIKFVEPYKNATSFPMPILMSAAFDDDLIFKIANIIGNEARAFGNGGVAPV 143

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
            +W+P+IN VRD RWGR  E+PGED   +  Y    + GL   EG +  R       KI 
Sbjct: 144 DYWTPDINPVRDIRWGRASESPGEDIRRIKGYTKALLAGL---EGDQAQR-------KII 193

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           A CKHY  YD++ W G DR +F +++T QD+ E ++ PF+ C  +  V S MCSYN VNG
Sbjct: 194 ATCKHYVGYDMEAWGGYDRHNFSAKITMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNG 253

Query: 249 IPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           IPTCAD  +L   +R  WN+   + YI SDC+++  I E+HK++ +T     A     G+
Sbjct: 254 IPTCADTYVLQTILRDHWNWTDSNNYITSDCEAVADISENHKYV-ETLAQGTALAFAKGM 312

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICN 364
           DL C    ++   GA  QG +  + ID +L   Y  L+  GYFDG+   Y NL   +I  
Sbjct: 313 DLSCEYTGSSDIPGAWAQGLLNISVIDKALTRQYEGLVHAGYFDGAKATYANLSYKDINT 372

Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
           P+  +L+ +   +G+V+LKND+  LPL       +A++G  AN +  + G Y G P    
Sbjct: 373 PEARQLSLQVTSEGLVMLKNDH-TLPLPLTKGSKVAMIGFWANDSSKLQGIYSGPPPYRH 431

Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIP-----AAIDAAKNADATVIVAGLDLSVEA 479
           SP+  F      ++ A     ++   NS +P      A+DAA+ +D  +   G D +V  
Sbjct: 432 SPV--FAGEQMGLDMAIAWGPMI--QNSSVPDNWTTNALDAAEKSDYILYFGGQDWTVAQ 487

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAG-AVDINFAKNNPKIKSILWVGYP 538
           EG DR  +  P  Q +L+ K+A   K    LV+++ G   D +   +   + SI+W  +P
Sbjct: 488 EGYDRTTISFPQVQIDLLTKLAKLGK---PLVVITLGDMTDHSPLLSMEGVNSIIWANWP 544

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVV 598
           G++GG AI +V+ G + P GRLPIT Y A+YVK+    M LRP    PGRTY++F+   V
Sbjct: 545 GQDGGPAILNVVSGAHAPAGRLPITEYPADYVKLSMLDMNLRPHTESPGRTYRWFN-ESV 603

Query: 599 YPFGYGLSYTQFKYKVASSPK-SVDIKLDKD---QQCRDINYTVGTNKPPCAAVLIDDVK 654
            PFG+GL YT F+   AS    + DI+   D   QQ +D+          C         
Sbjct: 604 QPFGFGLHYTTFEASFASEEGLTYDIEEILDGCTQQYKDL----------CEVA------ 647

Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH---IKQVIGYERVFIAAGQSAKV 711
                   ++ V N G      V + + K  G  G     +K +I Y R+    G + K 
Sbjct: 648 ------PLEVTVANKGNRTSDFVALAFIK--GEVGPKPYPLKTLITYGRLRDIHGGAKKS 699

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
                    L  VD + N+++  G +T+L+ E
Sbjct: 700 ASLPLTLGELARVDQSGNTVIYPGEYTLLLDE 731


>gi|310795958|gb|EFQ31419.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Glomerella graminicola M1.001]
          Length = 824

 Score =  399 bits (1024), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 269/763 (35%), Positives = 388/763 (50%), Gaps = 68/763 (8%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD  L   ERA  LV  +T+ EK+  + + A GVPRL +P YEWWSE LHGV+       
Sbjct: 65  CDETLSPKERAAALVAELTIWEKLDNLVNEAPGVPRLAIPPYEWWSEGLHGVA------- 117

Query: 75  SPPGTHF--DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW- 131
           S PGT F        ATSFP  I+  ++F++ L K IG+ VS EARA  N G +GL  + 
Sbjct: 118 SSPGTKFAKSGNFSYATSFPQPIVLGSAFDDDLVKAIGEVVSKEARAFSNRGRSGLDLYV 177

Query: 132 --------------------SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDV 171
                               SPNIN  +DPRWGR  ETPGEDP+ +  Y    + GL   
Sbjct: 178 SSISRHIEPEVRDDMLTEPESPNINAFKDPRWGRGQETPGEDPFHLQNYVAAMLTGL--- 234

Query: 172 EGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCV 231
           EG +  +       K+ A CKHYAA D +N++G DR  FD+ +T QD+ E ++ PF+ C 
Sbjct: 235 EGGDPSK-------KLIATCKHYAANDFENYKGVDRAGFDANITTQDLSEYYLPPFKTCA 287

Query: 232 NEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKF 288
            +  V S MCSYN +NG P CA+P LL   +R  W ++G   Y+ +DCD +  +V  H +
Sbjct: 288 VDKKVGSFMCSYNAINGEPLCANPYLLEDILRQHWGWNGDGQYVSTDCDCVALMVSHHHY 347

Query: 289 LNDTKEDAVARVLKAGLDLDCGDYYTNFTMG-AVQQGKIAEADIDTSLRFLYIVLMRLGY 347
             D    A A  +KAG DL+C  +  +  +  A  Q  I+E ++D SL  +Y  L+ +G 
Sbjct: 348 APDLGH-AAAWAMKAGTDLECNAFPGSEALQLAWNQSLISEKEVDKSLTRMYTALVSVGQ 406

Query: 348 FD---GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTG-NIKTLALVG 403
           FD   G P  ++L  +++   +  +LA +A  +G VLLKND G LPL+     K  AL+G
Sbjct: 407 FDSARGQP-LRSLSWDDVNTKEAQKLAYQAVIEGAVLLKND-GILPLSAAWREKKYALIG 464

Query: 404 PHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKN 463
           P  NAT  M GNY G      S       +     Y+ G    +   +     A+D+A  
Sbjct: 465 PWINATTQMQGNYFGPAPYLISLYQAAKEFGLDFTYSLGSR--INSTDDSFKQALDSAHA 522

Query: 464 ADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFA 523
           A   V   G+D ++EAE +DR  L  P  Q +L+  V+   K PV ++    G VD    
Sbjct: 523 AALIVFAGGVDNTLEAETRDRKTLAWPESQLDLLRAVSALGK-PVIVLQFGGGQVDDTEL 581

Query: 524 KNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLR-- 580
             N  I ++LW GYPG+ GG+A+ D++FG+  P GRL +T Y A+Y + +P T M LR  
Sbjct: 582 LANHSINALLWGGYPGQSGGKAVIDLLFGRAAPAGRLSVTQYPASYNEDVPSTDMNLRPG 641

Query: 581 PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGT 640
           P N+  GRTY +++G  V P+G+GL YT F  K+ +   S  IK ++       +Y  GT
Sbjct: 642 PGNSGLGRTYMWYNGDAVVPYGFGLHYTTFDAKLKARQASALIKTEEVSSLLSNDYVSGT 701

Query: 641 NKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYE 699
                   L+          +  I V N G +    V +++ +   G      K + GY 
Sbjct: 702 --------LVWQQILTKPVVSVLITVSNTGNVASDYVALLFLRSNAGPTPQPTKTLAGYH 753

Query: 700 RVF-IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
           R   I  G  ++   ++   + L  VD   N +L  G++ + V
Sbjct: 754 RFRNIQPGDRSEREVSIT-IERLVRVDELGNRVLHPGSYELFV 795


>gi|380696433|ref|ZP_09861292.1| glycoside hydrolase [Bacteroides faecis MAJ27]
          Length = 739

 Score =  399 bits (1024), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 255/756 (33%), Positives = 381/756 (50%), Gaps = 97/756 (12%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           FP+ D +LP  +R +DLV R+TL EKV+QM +    V RLG+P Y WW+E LHG   IGR
Sbjct: 25  FPFRDPQLPVEQRVEDLVSRLTLEEKVKQMLNSTPPVERLGIPAYNWWNECLHG---IGR 81

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
                            T FP  I   A++N++L K++  +++ E RA+YN         
Sbjct: 82  TKYH------------VTVFPQAIGMAAAWNDALIKEVASSIADEGRAIYNDTQRKEDYS 129

Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LT+W+PNIN+ RDPRWGR  ET GEDPY+  R    +V+GLQ           + R
Sbjct: 130 QYHALTYWTPNINIFRDPRWGRGQETYGEDPYLTARIGEAFVQGLQG---------DNPR 180

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LK SAC KHYA +   +    +R  F+S V+  D+ +T++  F   V +  VS VMC+Y
Sbjct: 181 YLKASACAKHYAVH---SGPEKNRHSFNSDVSTYDLWDTYLPAFRTLVVDAKVSGVMCAY 237

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N   G P C +  L+   +R  WNF GY+ SDC +I  I   HK   D    A   V   
Sbjct: 238 NAFQGQPCCGNDLLMQSILRDKWNFTGYVTSDCGAIDDIFNHHKTHPDAATAAADAVFH- 296

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNN 361
           G DLDCG       + AV+ G I E  +D S++ L+ +  RLG FD      Y  +  + 
Sbjct: 297 GTDLDCGHSAYLALVKAVKDGIITEKQLDVSVKRLFTIRFRLGLFDPVELVDYARIPISI 356

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           +   +H +LA + AR+ +VLLKND   LPL    +K + ++GP+A++ ++++GNY G P 
Sbjct: 357 LECRKHQDLAKQLARESMVLLKNDQ-LLPLQKNKLKKVVVMGPNADSRESLLGNYNGNPS 415

Query: 422 RYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSV 477
           R  +P+         +++V  Y  G   +   +   +   ++ AK ADA + + G+   +
Sbjct: 416 RMLTPLQAIRERLGGWTEV-EYIEGVDHVNTISADDLKQYVNRAKGADAVIFIGGISPRL 474

Query: 478 EAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
           E E          G DR  + LP  QT+++ K   A   P   V+M+  A+ I +   N 
Sbjct: 475 EGEEMPVSKDGFDGGDRTTIALPAVQTQMM-KAWVAEHIPTVFVMMTGSALAIPWEAQN- 532

Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG 587
            + +IL   Y G+ GG AIADV+FG YNP G+LP+T+Y  +      + +P     +  G
Sbjct: 533 -VPAILNAWYGGQYGGEAIADVLFGDYNPSGKLPVTFYAKD------SDLPDFESYDMQG 585

Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
           RTY++F+G  +YPFGYGLSYT F Y     PK           CR               
Sbjct: 586 RTYRYFNGKALYPFGYGLSYTSFAYSSLKLPKV----------CR--------------- 620

Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAG 706
                    D +    + V+N G  +G EVV +Y S P       +  + G++R+ + AG
Sbjct: 621 -------TTDKEIEVTVTVKNTGHTEGEEVVQLYVSHPDKKILVPLTALKGFKRIQLKAG 673

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           ++ +V F++++ + L  VD      + +G   I VG
Sbjct: 674 EAQRVTFSLSS-EDLSCVDENGIRKVWAGTVKIQVG 708


>gi|373852136|ref|ZP_09594936.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
 gi|372474365|gb|EHP34375.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
          Length = 740

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 266/764 (34%), Positives = 384/764 (50%), Gaps = 108/764 (14%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           P+ D  L    R +DLV R+TL EKV QM   A  +PRLG+P Y +W+E LHGV+  GR 
Sbjct: 22  PFRDPDLALDHRVRDLVSRLTLAEKVSQMEHAAAAIPRLGIPAYNYWNECLHGVARNGR- 80

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------ 126
                          AT FP +I   A+++  L  ++   +S EARA ++   A      
Sbjct: 81  ---------------ATVFPQIIGLAATWDTDLVYRVATAISDEARAKHHAALARQGFAQ 125

Query: 127 -----GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
                GLTFW+PNIN+ RDPRWGR  ET GEDP++  R A  +VRGLQ         D+ 
Sbjct: 126 TQQYQGLTFWTPNINLFRDPRWGRGQETWGEDPHLTARLAAAFVRGLQG--------DTP 177

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
              LK++AC KHYA +   +   N+R  F++RVT  D+ ++++  FE  V    V SVM 
Sbjct: 178 DTHLKLAACAKHYAVH---SGPENERHTFNARVTPHDLWDSYLPAFEHLVRHARVESVMG 234

Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
           +YNR    P CA   LL   +R  W F G++VSDC +++ I E+H+   D  E A A  L
Sbjct: 235 AYNRTLDEPCCASQFLLLDILRERWGFEGHVVSDCWALRDIHETHRITTDPVESA-ALAL 293

Query: 302 KAGLDLDCGDYYTNFTM--GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK 359
             G DL CG   T F +   AVQ+G I EADID +L        +LG FD +   +N   
Sbjct: 294 TKGCDLACG---TTFELLGEAVQRGLITEADIDRALSRHLRARFKLGMFDPADDNRNPWS 350

Query: 360 NN------ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
           N       +    H  LA EAA    VLL+N N  LPL   +++++ + GP A    A++
Sbjct: 351 NPPAPEAIVTCAAHTALACEAAVASCVLLQNHNHILPLRP-DVRSIYITGPLAATQDALL 409

Query: 414 GNYEGTPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV 470
           GNY G P R  + +DG  A        +Y PG      + N++  A  D A + D T+  
Sbjct: 410 GNYYGLPPRAITLLDGLAAALPEGIRADYRPGALLSTPKQNALEWAEFDCA-SCDVTIAC 468

Query: 471 AGLDLSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDIN 521
            GL   +E E           DR D+ LP  Q   +  +    +G   +VI+  G+  ++
Sbjct: 469 LGLTALLEGEEGEAIASSLHGDRDDISLPPPQRLFLESLIQ--RGARVIVILFGGSA-LS 525

Query: 522 FAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP 581
                 K+++ILW GYPG+EGGRA+AD++ G+ +P GRLPIT+YE      PY +  +R 
Sbjct: 526 LGPLADKVEAILWAGYPGQEGGRALADILLGRASPSGRLPITFYENINDLPPYANYSMR- 584

Query: 582 VNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTN 641
                GRT+++FDG   +PFG+GL+YT+F Y               D +  D+ Y+ G +
Sbjct: 585 -----GRTHRWFDGTPAWPFGFGLTYTRFTY--------------SDLRVSDV-YSPGND 624

Query: 642 KPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK---PPGIAGTHIKQVIGY 698
            P C +VL+                 N G  + +E+V +Y      PG      + +  +
Sbjct: 625 SPLCGSVLL----------------TNTGDHEAAEIVQIYLTDFDAPGNGPVPRENLADF 668

Query: 699 ERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            RV +A GQS +V F++   + + +VD       A  A T+ VG
Sbjct: 669 HRVTLAPGQSRRVEFSIPP-EHILLVDTNGRRTRAPLAFTVHVG 711


>gi|330836687|ref|YP_004411328.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
 gi|329748590|gb|AEC01946.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
          Length = 709

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 224/612 (36%), Positives = 338/612 (55%), Gaps = 69/612 (11%)

Query: 25  AKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSE 84
           A+ +V RMTL EK+ Q+   A  +PRL +P Y WW+EALHGV+  G              
Sbjct: 14  ARRIVSRMTLDEKISQIDYRASAIPRLDIPEYNWWNEALHGVARAGI------------- 60

Query: 85  VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--------NAGLTFWSPNIN 136
              AT FP  I   A F+  + ++IG  +STE RA YN            GLTFWSPN+N
Sbjct: 61  ---ATVFPQAIGLAAMFDSDMMERIGAVISTEGRAKYNEAVRHGDRDIYKGLTFWSPNVN 117

Query: 137 VVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAA 196
           + RDPRWGR  ET GEDPY+  R A+ ++RG+Q           D + LK +AC KH+A 
Sbjct: 118 IFRDPRWGRGQETYGEDPYLTARLAVAFIRGIQ----------GDGKYLKAAACAKHFAV 167

Query: 197 YDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           +      G +  R  FD+RV+++D+ ET++  F+  V E  V  VM +YNRVNG+P CA 
Sbjct: 168 HS-----GPEALRHEFDARVSQKDLHETYLSAFKAAVKEAQVEIVMGAYNRVNGVPACAS 222

Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
            +LL+  +R +W F G++VSD ++++ I + H ++ D +   +A  LKAG +L C     
Sbjct: 223 HELLSDILRSEWGFEGHVVSDYEALEDIFKHHHYVAD-EAHTMAVALKAGCNL-CAGKIA 280

Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEA 374
                +V +G I+E +I  ++  L+   + +G       Y ++G      P+H +LA EA
Sbjct: 281 RHLRSSVDEGLISEDEITEAVERLFTTRIMMGMMADDCPYDSIGYEENDTPEHHQLAVEA 340

Query: 375 ARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG---FY 431
           A +  VLLKND G LPL    I ++A++GP+AN+ K + GNY GT  RY + ++G     
Sbjct: 341 ASRSFVLLKND-GLLPLEMEKISSIAVIGPNANSRKMLEGNYNGTASRYVTVLEGIQDLV 399

Query: 432 AYSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----- 480
             S  + Y+ GC         +   N  +  A+ AA++AD  V+  GLD ++E E     
Sbjct: 400 GDSVRVWYSEGCHLYKNFHSSLSGRNDRLAEAVSAAQHADVVVLCLGLDATLEGEEGDVE 459

Query: 481 ----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
                 D+ +L LPG Q  L++ +    K PV L++ S  A+ +   +N+  +K+IL + 
Sbjct: 460 VGFGSGDKPNLSLPGRQQLLLDTMLTVGK-PVILLLASGSALTLGGRENDENLKAILQIW 518

Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
           YPG  GG+A+ADV+FG+  P G+LP+T+Y +         +P     +  GRTY++  G 
Sbjct: 519 YPGAMGGKAVADVLFGRRAPAGKLPVTFYASA------DELPAFEDYSMAGRTYRYMKGN 572

Query: 597 VVYPFGYGLSYT 608
            +YPFGYGL+Y+
Sbjct: 573 ALYPFGYGLTYS 584


>gi|2920706|emb|CAA73902.1| beta-xylosidase [Emericella nidulans]
          Length = 802

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 233/605 (38%), Positives = 337/605 (55%), Gaps = 27/605 (4%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS  P CD  L   +RA  LV   T  E V   G+   GV RLGLP Y+ W EALHGV  
Sbjct: 54  LSLTPVCDRSLSPKDRATALVSLFTFDELVNNTGNTGLGVSRLGLPNYQVWGEALHGVG- 112

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
              R N     +F      ATSFP  I   A+ N++L  +IG  VST+ RA  N G  G+
Sbjct: 113 ---RANFVESGNFS----WATSFPMPITMMAALNKTLIHQIGTIVSTQLRAFSNAGLGGV 165

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
             +SPNIN  R P WGR  ETPGED ++   Y   Y+  LQ           D    KI 
Sbjct: 166 DVYSPNINTFRHPVWGRGQETPGEDAFLTSVYGYEYITALQGA--------VDPETSKII 217

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           A  KHYA YD+++W  + R   D ++T+Q++ E +  PF +   +  V SVMCSYN VNG
Sbjct: 218 ATAKHYAGYDIESWNNHSRLGNDMQITQQELSEYYTPPFIVASRDAKVRSVMCSYNAVNG 277

Query: 249 IPTCADPKLLNQTIRGDWNFH--GYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD 306
           +P+CA+   L   +R  + F   GY+  DC ++  +   H + ++ +  A A  + AG D
Sbjct: 278 VPSCANKFFLQTLLRDTFEFSEDGYVSGDCGAVYNVWNPHGYASN-EAAASADSILAGTD 336

Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICNP 365
           +DCG  Y   +  A +   ++ +DI+  +  LY  L++ GYFDG    Y+++  +++ + 
Sbjct: 337 IDCGTSYQWHSEDAFEDSLVSRSDIERGVIRLYSNLVQAGYFDGEDAPYRDITWDDVLST 396

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
               +A EAA +GIVLLKND   LPL+  +IK++A++GP AN T+ + GNY G      S
Sbjct: 397 DAWNIAYEAAVEGIVLLKNDE-TLPLSK-DIKSVAVIGPWANVTEELQGNYFGPAPYLIS 454

Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           P+ GF      ++YA G  ++   + S    A+ AAK ADA +   G+D ++EAE  DR 
Sbjct: 455 PLTGFRDSGLDVHYALGT-NLTSHSTSGFEEALTAAKQADAIIFAGGIDNTIEAEAMDRE 513

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           ++  PG Q +LI+K+++  K P+ ++ M  G VD +  K+N  + +++W GYPG+ GG A
Sbjct: 514 NITWPGNQLDLISKLSELGK-PLVVLQMGGGQVDSSSLKDNDNVNALIWGGYPGQSGGHA 572

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKI-PYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
           +AD+I GK  P GRL  T Y A Y ++ P   M LRP   +  PG+TY ++ G  VY FG
Sbjct: 573 LADIITGKRAPAGRLVTTQYPAEYAEVFPAIDMNLRPNETSGNPGQTYMWYTGTPVYEFG 632

Query: 603 YGLSY 607
           +GL Y
Sbjct: 633 HGLFY 637


>gi|367032987|ref|XP_003665776.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
 gi|347013048|gb|AEO60531.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
          Length = 835

 Score =  397 bits (1021), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 245/640 (38%), Positives = 349/640 (54%), Gaps = 37/640 (5%)

Query: 3   ESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEA 62
           +  K  LSD   CD  LP  ERA  LV  +T  EK+Q +   A G PR+GLP Y WWSEA
Sbjct: 17  DCTKPPLSDIKVCDRTLPEAERAAALVAALTDEEKLQNLVSKAPGAPRIGLPAYNWWSEA 76

Query: 63  LHGVSFIGRRTNSPPGTHFDSEVPG----ATSFPTVILTTASFNESLWKKIGQTVSTEAR 118
           LHGV+         PGT F  + PG    +TSFP  +L  A+F++ L + +G  + TEAR
Sbjct: 77  LHGVAHA-------PGTQF-RDGPGDFNSSTSFPMPLLMAAAFDDELIEAVGDVIGTEAR 128

Query: 119 AMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHR 178
           A  N G +GL +W+PN+N  RDPRWGR  ETPGED   + RYA + +RGL+         
Sbjct: 129 AFGNAGWSGLDYWTPNVNPFRDPRWGRGSETPGEDVVRLKRYAASMIRGLEGRSSSSSSC 188

Query: 179 DSDS--RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDV 236
              S   P ++ + CKHYA  D ++W G  R  FD+ ++ QD+ E ++ PF+ C  +  V
Sbjct: 189 SFGSGGEPPRVISTCKHYAGNDFEDWNGTTRHDFDAVISAQDLAEYYLAPFQQCARDSRV 248

Query: 237 SSVMCSYNRVNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTK 293
            SVMC+YN VNG+P+CA+  L+N  +RG WN+     Y+ SDC+++  +   H +  DT 
Sbjct: 249 GSVMCAYNAVNGVPSCANSYLMNTILRGHWNWTEHDNYVTSDCEAVLDVSAHHHYA-DTN 307

Query: 294 EDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG--S 351
            +      +AG+D  C    ++   GA   G +    +D +L  LY  L+R+GYFDG  S
Sbjct: 308 AEGTGLCFEAGMDTSCEYEGSSDIPGASAGGFLTWPAVDRALTRLYRSLVRVGYFDGPES 367

Query: 352 PQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPL---------NTGNIKTLALV 402
           P + +LG  ++  P+  ELA  AA +GIVLLKNDN  LPL           G  + +A++
Sbjct: 368 P-HASLGWADVNRPEAQELALRAAVEGIVLLKNDNDTLPLPLPDDVVVTADGGRRRVAMI 426

Query: 403 GPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGC---ADIVCQNNSMIPAAID 459
           G  A+A   + G Y G P    SP          +  A G     D   + ++    A++
Sbjct: 427 GFWADAPDKLFGGYSGAPPFARSPASAARQLGWNVTVAGGPVLEGDSDEEEDTWTAPAVE 486

Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
           AA +AD  V   GLD S   E KDR+ +  P  Q  LI+++A   K PV +V M     D
Sbjct: 487 AAADADYIVYFGGLDTSAAGETKDRMTIGWPAAQLALISELARLGK-PVVVVQMGDQLDD 545

Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMP 578
               + +  + ++LW  +PG++GG A+  ++ G  +P GRLP+T Y ANY   +P T M 
Sbjct: 546 TPLFELD-GVGAVLWANWPGQDGGTAVVRLLSGAESPAGRLPVTQYPANYTDAVPLTDMT 604

Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSP 618
           LRP    PGRTY+++  P V PFG+GL YT F+ +    P
Sbjct: 605 LRPSATNPGRTYRWYPTP-VRPFGFGLHYTTFRAEFGPHP 643


>gi|373955483|ref|ZP_09615443.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373892083|gb|EHQ27980.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 738

 Score =  397 bits (1021), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 259/763 (33%), Positives = 382/763 (50%), Gaps = 109/763 (14%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +P+ +  L   ER  DLV RMTL EKV QM + A  + RLG+P Y WW+E LHGV+    
Sbjct: 31  YPFNNPALSMDERVADLVGRMTLEEKVSQMLNSAPAIERLGVPAYNWWNECLHGVA---- 86

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--LGN---- 125
                  T F       T +P  I   A+++++    +G   + E RA+YN  + N    
Sbjct: 87  ------RTPFK-----VTVYPQAIAMAATWDKTSMHVMGDYTAEEGRAVYNESIKNDKHD 135

Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
              GLT+W+PNIN+ RDPRWGR  ET GEDP++ G     +V+GLQ           D R
Sbjct: 136 IYLGLTYWTPNINIFRDPRWGRGQETYGEDPFLTGEMGSAFVKGLQ---------GDDPR 186

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LK + C KHYA +   +   + R  F++ +++ D+ +T++  F   V +  V+ VMC+Y
Sbjct: 187 YLKAAGCAKHYAVH---SGPEDLRHKFNTDISDYDLWDTYLPAFRKLVVDAKVTGVMCAY 243

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARVL 301
           N   G P C    L+N  +   W F GY+ SDC  I       +H+   D  E A A  +
Sbjct: 244 NAFKGQPCCGSDLLMNSILHDKWKFTGYVTSDCGGIDDFYRENTHQTQPDA-ESAAADAV 302

Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGK 359
             G D++CG+      + AV+ GK++E  ID SL+ L+ V  +LG FD   + +Y  +GK
Sbjct: 303 LHGTDVECGNVTYKSLVKAVKDGKLSEKQIDQSLKRLFSVRFKLGMFDPADAVKYNQIGK 362

Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
           + +  P H   A + A Q IVLLKN+   LPL+  N+K +A++GP+A+   +++GNY GT
Sbjct: 363 DALEAPAHGAQALKMAHQSIVLLKNEGNLLPLSK-NLKKIAVLGPNADNAVSVLGNYNGT 421

Query: 420 PCRYTSPMDGF---------YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV 470
           P R  + + G            Y K ++Y    AD   + N    AA    K+ADA + +
Sbjct: 422 PSRIVTALQGIKNKLPAGTEVIYDKAVDY---VADSAARYNYAAMAA--KVKDADAIIYI 476

Query: 471 AGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
            G+   +E E          G DR  +LLPG QTEL+ K   A   PV  V+M+  A+  
Sbjct: 477 GGISPELEGEEMPVSKPGFHGGDRSTILLPGVQTELL-KALKATGKPVVFVMMTGSAIAT 535

Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR 580
            +   N  + +I+   Y G+  G AIADV+FG YNP GRLP+T+Y ++        +P  
Sbjct: 536 PWEAEN--LPAIVNAWYGGQAAGTAIADVLFGDYNPAGRLPVTFYGSD------KDLPSF 587

Query: 581 PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGT 640
              +   RTY++F G  +Y FGYGLSY++F+Y    +P ++                   
Sbjct: 588 TDYSMDNRTYRYFKGKPLYAFGYGLSYSKFEYAPLDAPLTLK------------------ 629

Query: 641 NKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYE 699
                               T  ++V N  KMDG EV  +Y    GI   T I+ + G+E
Sbjct: 630 ---------------AGEALTVHVKVTNKSKMDGEEVTELYLSHIGIKQKTAIRALKGFE 674

Query: 700 RVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           R  I AG++  + F +++   L I D   N + ASG   I VG
Sbjct: 675 RTLIKAGETKDITFKLSSA-DLSITDLNGNLVKASGKIAISVG 716


>gi|218186207|gb|EEC68634.1| hypothetical protein OsI_37026 [Oryza sativa Indica Group]
          Length = 1241

 Score =  397 bits (1019), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 197/326 (60%), Positives = 236/326 (72%), Gaps = 17/326 (5%)

Query: 111  QTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQD 170
            Q VSTEARAMYN+G  GLT+WSPNINVVRDPRWGR LETPGEDPYVVGRYA+N+VRG+QD
Sbjct: 916  QAVSTEARAMYNMGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQD 975

Query: 171  V---EGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPF 227
            +   E V    D ++RPLK SACCKHYAAYDLD+W  + RF FD+RV E+DM ETF  PF
Sbjct: 976  IPGHEAVAAGGDPNTRPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPF 1035

Query: 228  EMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHK 287
            EMCV +GDVSSVMCSYNRVNGIP CAD +LL+QTIR DW  HGYIVSDCD+++ + ++  
Sbjct: 1036 EMCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNAT 1095

Query: 288  FLNDTKEDAVARVLKAGLDLDCG-------------DYYTNFTMGAVQQGKIAEADIDTS 334
            +L  T  +A A  LKAGLDLDCG             D+ T + M AV +GK+ E+DID +
Sbjct: 1096 WLGYTGAEASAAALKAGLDLDCGESWKNETDGHPLMDFLTTYGMEAVNKGKMRESDIDNA 1155

Query: 335  LRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTG 394
            L   Y+ LMRLGYFD   QY +LG+ +IC  QH  LA + ARQGIVLLKNDN  LPL+  
Sbjct: 1156 LTNQYMTLMRLGYFDDIAQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDAN 1215

Query: 395  NIKTLALVGPHANA-TKAMIGNYEGT 419
             +  + + GPH  A  K M G+Y GT
Sbjct: 1216 KVGFVNVRGPHVQAPEKIMDGDYTGT 1241


>gi|373952439|ref|ZP_09612399.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373889039|gb|EHQ24936.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 721

 Score =  397 bits (1019), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 247/742 (33%), Positives = 375/742 (50%), Gaps = 96/742 (12%)

Query: 24  RAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
           R +DL+ R+TL EKV  +G  +  VPRL +P Y WW+E LHGV+  G             
Sbjct: 40  RVQDLISRLTLAEKVSLLGYRSQAVPRLNIPAYNWWNEGLHGVARAGE------------ 87

Query: 84  EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPNI 135
               AT FP  I   A+F+++L K++   VSTEARA YNL  A        GLTFWSPNI
Sbjct: 88  ----ATIFPQAIAMAATFDDNLVKQVANVVSTEARAKYNLSTAMGRHLQYMGLTFWSPNI 143

Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
           N+ RDPRWGR  ET GEDP++  +    YV GLQ          +D   LK SA  KH+ 
Sbjct: 144 NIFRDPRWGRGQETYGEDPFLTSKMGNAYVHGLQ---------GTDPLHLKTSATAKHFV 194

Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
           A+     EG +R +FD+ V E+D+++T++  F+  V +G V S+M +YNRVNG+P   + 
Sbjct: 195 AH--SGPEG-ERDYFDALVDEKDLRDTYLYAFKSLV-DGGVESIMTAYNRVNGVPNSINK 250

Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
            L+N  +  +W F G++V+DC ++  + ++HK L +  E A A  +KAG+DLDC   +  
Sbjct: 251 TLVNDIVIKEWGFKGHVVTDCGALDDVYKTHKVLPNRMEVAAA-AIKAGVDLDCSSIFQT 309

Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG--SPQYKNLGKNNICNPQHIELAAE 373
             + A+    + E  +D +L  +     +LG+FD   S  + + G ++I N  H+ LA +
Sbjct: 310 DIINAINNKLLTEKQVDAALAAVLSTQFKLGFFDAPSSSPFYSFGADSIHNDSHVMLARQ 369

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA- 432
            A++ +VLLKND   LPL   N  ++ +VGP+A +  A++ +Y G   +  + ++G  A 
Sbjct: 370 MAQKSMVLLKNDKQILPLKMQNYSSIMVVGPNAASLDALVASYHGVSSKAVNFVEGITAA 429

Query: 433 --YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE---------G 481
                 + Y  G AD     ++     I  A NAD TV V GL   +E E         G
Sbjct: 430 VDKGTRVEYDLG-ADY---RDTTHFGGIWGAGNADVTVAVIGLTPVLEGEAGDAFLSQTG 485

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
            D+ DL LP      +  +  + K P+  V+ S   VDI  A   P   +++   YPGE+
Sbjct: 486 GDKKDLSLPAGDIAFMKALRKSVKKPIIAVVTSGSDVDI--AAIAPYADAVILAWYPGEQ 543

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
           GG A+AD++FGK +P G LP+T+Y +         +P     +  GRTY++F G V YPF
Sbjct: 544 GGNALADILFGKISPSGHLPLTFYNS------VNDLPAYNNYSMKGRTYRYFAGAVQYPF 597

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           G+GLSYT F Y+    PK+                                   KD    
Sbjct: 598 GFGLSYTTFNYQWQQQPKT-------------------------------SYSAKD-TIQ 625

Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
             + V+N G +   EVV  Y   P +    +K++ G++R+ +  G ++    ++   +  
Sbjct: 626 LSVVVKNTGNISADEVVQAYIGYPTLNRMPLKELKGFKRITLNKGSTSLASISIPVTELQ 685

Query: 722 KIVDNAANSLLASGAHTILVGE 743
           K   +     L  G +T+ +G 
Sbjct: 686 KWNSSKHQFELYPGNYTVYLGS 707


>gi|171695518|ref|XP_001912683.1| hypothetical protein [Podospora anserina S mat+]
 gi|170948001|emb|CAP60165.1| unnamed protein product [Podospora anserina S mat+]
          Length = 805

 Score =  396 bits (1018), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 271/783 (34%), Positives = 391/783 (49%), Gaps = 99/783 (12%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGD--------------------LAYGVPRLGLP 54
           CD     P RA  LV+ + + EK+  + +                    ++ G  R+GLP
Sbjct: 36  CDTTASPPARAAALVQALNITEKLVNLVEYVKSREAPLGISIQLITPHSMSLGAERIGLP 95

Query: 55  LYEWWSEALHGVSFIGRRTNSPPGTHFDS---EVPGATSFPTVILTTASFNESLWKKIGQ 111
            Y WW+EALHGV+       + PG  F+    E   ATSF   I   A+F+  L  ++  
Sbjct: 96  AYAWWNEALHGVA-------ASPGVSFNQAGQEFSHATSFANTITLAAAFDNDLVYEVAD 148

Query: 112 TVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLE------------------TPGED 153
           T+STEARA  N   AGL +W+PNIN  +DPRWGR  E                  TPGED
Sbjct: 149 TISTEARAFSNAELAGLDYWTPNINPYKDPRWGRGHEVCYLSLLFRAVQLLRTQKTPGED 208

Query: 154 PYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSR 213
           P  +  Y    + GL+  + +           K+ A CKH+AAYDL+ W+G  R+ F++ 
Sbjct: 209 PVHIKGYVQALLEGLEGRDKIR----------KVIATCKHFAAYDLERWQGALRYRFNAV 258

Query: 214 VTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNF---HG 270
           VT QD+ E ++ PF+ C  +  V S MCSYN +NG P CA   L++  +R  WN+   + 
Sbjct: 259 VTSQDLSEYYLQPFQQCARDSKVGSFMCSYNALNGTPACASTYLMDDILRKHWNWTEHNN 318

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC---GDYYTNFTMGAVQQGKIA 327
           YI SDC++IQ  + +    + T   A A    AG D  C   G       +GA  Q  ++
Sbjct: 319 YITSDCNAIQDFLPNFHNFSQTPAQAAADAYNAGTDTVCEVPGYPPLTDVIGAYNQSLLS 378

Query: 328 EADIDTSLRFLYIVLMRLGYFD-GSPQ-YKNLGKNNICNPQHIELAAEAARQGIVLLKND 385
           E  ID +LR LY  L+R GY D  SP  Y  +  + +  P+   LA ++A  GIVLLKN 
Sbjct: 379 EEIIDRALRRLYEGLIRAGYLDSASPHPYTKISWSQVNTPKAQALALQSATDGIVLLKN- 437

Query: 386 NGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCAD 445
           NG LPL+  N KT+AL+G  ANAT+ M+G Y G P  Y +P+      +   ++APG  +
Sbjct: 438 NGLLPLDLTN-KTIALIGHWANATRQMLGGYSGIPPYYANPIYAATQLNVTFHHAPGPVN 496

Query: 446 IV--CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADA 503
                 N++    A+ AA  +D  + + G DLS+ AE +DR  +  P  Q  L+  +A  
Sbjct: 497 QSSPSTNDTWTSPALSAASKSDIILYLGGTDLSIAAEDRDRDSIAWPSAQLSLLTSLAQM 556

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K   T+V      VD     +NP I SILWVGYPG+ GG A+ ++I G  +P  RLP+T
Sbjct: 557 GKP--TIVARLGDQVDDTPLLSNPNISSILWVGYPGQSGGTALLNIITGVSSPAARLPVT 614

Query: 564 WYEANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVD 622
            Y   Y   IP T+M LRP +  PGRTY+++  PV+ PFG+GL YT F  K     +S+ 
Sbjct: 615 VYPETYTSLIPLTAMSLRPTSARPGRTYRWYPSPVL-PFGHGLHYTTFTAKFGVF-ESLT 672

Query: 623 IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYS 682
           I + +          + +N   C    +D  +         + V N G++    V +V+ 
Sbjct: 673 INIAE----------LVSN---CNERYLDLCRFPQ----VSVWVSNTGELKSDYVALVFV 715

Query: 683 KPP-GIAGTHIKQVIGYERVF-IAAGQS--AKVGFTMNACKSLKIVDNAANSLLASGAHT 738
           +   G     IK ++GY+R+  I  G +  A VG  +     L  VD   N +L  G + 
Sbjct: 716 RGEYGPEPYPIKTLVGYKRIRDIEPGTTGAAPVGVVVG---DLARVDLGGNRVLFPGKYE 772

Query: 739 ILV 741
            L+
Sbjct: 773 FLL 775


>gi|150019782|ref|YP_001312036.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
           8052]
 gi|149906247|gb|ABR37080.1| glycoside hydrolase, family 3 domain protein [Clostridium
           beijerinckii NCIMB 8052]
          Length = 709

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 249/746 (33%), Positives = 383/746 (51%), Gaps = 104/746 (13%)

Query: 25  AKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSE 84
           AK+LV +MTL EK +Q+   +  +  L +P Y WW+E LHGV+  G              
Sbjct: 16  AKELVSKMTLQEKAEQLTYQSPAIKHLNVPEYNWWNEGLHGVARAGT------------- 62

Query: 85  VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPNIN 136
              AT FP  I   A F++    K+   ++TE RA YN  +         GLT+WSPNIN
Sbjct: 63  ---ATVFPQAIGLAAIFDDEFLGKVANIIATEGRAKYNEYSKKDDRGIYKGLTYWSPNIN 119

Query: 137 VVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAA 196
           + RDPRWGR  ET GEDPY+  R  + +++GLQ           + + LK++AC KH+A 
Sbjct: 120 IFRDPRWGRGHETYGEDPYLTSRLGVAFIKGLQ----------GEGKYLKLAACAKHFAV 169

Query: 197 YDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPK 256
           +     EG  R  F++ V ++D+ ET++  FE CV E +V SVM +YNR NG P C    
Sbjct: 170 HS--GPEGL-RHEFNAVVNKKDLYETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKT 226

Query: 257 LLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNF 316
           LL   +RG W F G++VSDC ++      H  +  T  ++VA  ++ G DL+CG+ Y N 
Sbjct: 227 LLKDILRGKWGFKGHVVSDCWALADF-HLHHMVTSTATESVALAIENGCDLNCGNMYLNL 285

Query: 317 TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAAR 376
            + A ++G + E  I T+   L     +LG FD   +Y  +      + +H E+A  A+R
Sbjct: 286 LL-AYKEGLVTEEQITTAAERLMTTRFKLGMFDEECEYNKIPYEVNDSREHNEVALIASR 344

Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY---AY 433
           + +VLLKN NG LPL+  N+K++A++GP+AN+   + GNY GT  +YT+ ++G +     
Sbjct: 345 KSMVLLKN-NGTLPLDKSNLKSIAVIGPNANSEIMLKGNYSGTASKYTTILEGIHDAVGN 403

Query: 434 SKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE------- 480
              + Y+ GC       + + + +  +  AI  A+ +D  V+  GLD ++E E       
Sbjct: 404 DVRVYYSEGCHLFKDKVEDLARPDDRLSEAISVAERSDVVVLCLGLDSTIEGEQGDAGNS 463

Query: 481 --GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
               D+ +L LPG Q  L+ KV +  K PV +V+ +  A+ +N A+   K  +IL   YP
Sbjct: 464 YGAGDKENLNLPGRQQNLLEKVLEVGK-PVIVVLGAGSALTLNGAEE--KCAAILNAWYP 520

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP-YTSMPLRPVNNFPGRTYKFFDGPV 597
           G  GG A+AD++FGK +P G+LP+T+Y+ +  K+P +T   ++      GRTY++     
Sbjct: 521 GSHGGTAVADILFGKCSPSGKLPVTFYK-DTAKLPDFTDYSMK------GRTYRYLGHES 573

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           +YPFGYGL+Y+  +      P                                  VK   
Sbjct: 574 LYPFGYGLTYSTVELSNLQVP---------------------------------SVKQGF 600

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMN 716
             F   IE++N G+ D  EVV  Y K        +   + G++RV +  G+S  V   +N
Sbjct: 601 GSFDISIEIKNTGEYDIEEVVQCYVKDIESKYAVLNHSLAGFKRVSLKKGESKIVTIKLN 660

Query: 717 ACKSLKIVDNAANSLLASGAHTILVG 742
             KS ++V++    LL S    + VG
Sbjct: 661 K-KSFEVVNDDGERLLDSKKFKLFVG 685


>gi|346225847|ref|ZP_08846989.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
 gi|346227016|ref|ZP_08848158.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
          Length = 718

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 260/766 (33%), Positives = 400/766 (52%), Gaps = 106/766 (13%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           S+K +  D  + +  +   ERA+ +V+++T+ EK+ Q+ + A  V RL +P Y+WW+E L
Sbjct: 8   SLKAQ-EDCSFRNPDISLDERAECIVKQLTVEEKINQLMNAAPAVDRLEIPEYDWWNECL 66

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HGV+  GR                AT FP  I   A+++ +L  ++G  +STEARA YN+
Sbjct: 67  HGVARAGR----------------ATVFPQAIGMAATWDTTLVYRVGDAISTEARAKYNV 110

Query: 124 GNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
            +         GLTFW+PN+N+ RDPRWGR  ET GEDP++  R  +++V+GLQ      
Sbjct: 111 FSKHGYRGQYKGLTFWTPNVNIFRDPRWGRGQETYGEDPFLTSRIGVSFVKGLQG----- 165

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
               +  + LK++A  KHYA +   N     R  FD++V+ +D+ ET++  FE  V E  
Sbjct: 166 ----NHPKYLKVAALAKHYAVH---NGPEALRHEFDAKVSMKDLWETYLPAFEALVKEAG 218

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKED 295
           V  VM +YNR NG P CA P L+ + +R  W F GY VSDC +I      HK + DT E+
Sbjct: 219 VEGVMGAYNRTNGDPCCAHPYLMQEVLREKWGFDGYYVSDCGAIMDFYTGHKIV-DTPEE 277

Query: 296 AVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF--DGSPQ 353
           A A  L AG +L+CGD Y +  + ++++G   E +ID S++ L+   +RLG F  +G+  
Sbjct: 278 AAAMALNAGCNLNCGDTYASL-LKSLEKGLTTEEEIDRSVKQLFKTRLRLGLFAPEGAVP 336

Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
           Y  +  + I + +H +LA EAAR+ +VLLKN+   LP+   ++K + + GP A   +A++
Sbjct: 337 YDTISTDVIRSKEHQKLALEAARKSVVLLKNEANTLPV-ARDVKKVYVTGPTATHVQALL 395

Query: 414 GNYEGTPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV 470
            NY G     T+ ++G          + Y  G        N+M   +  AA +AD TV  
Sbjct: 396 ANYYGVSEDMTTILEGIVGKVSPQTSVQYRQGALLYEANRNTMDWFS-GAAASADVTVAC 454

Query: 471 AGLDLSVEAEG---------KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDIN 521
            G+   +E E           DR    LP  Q + + ++  +AK  + +VI S  A+ + 
Sbjct: 455 LGISQLIEGEEGEAIASEHRGDRERTRLPQNQIDFLKRIRASAK-KLVVVITSGSAISL- 512

Query: 522 FAKNNPKI----KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSM 577
                P+I     ++L+V YPGE+GG+A+ADV+FG   P GRLP+T  ++     PY + 
Sbjct: 513 -----PEIYDMADALLYVWYPGEQGGKAVADVLFGDAVPSGRLPVTVVKSVDDLPPYENY 567

Query: 578 PLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
            ++      GRTY++ +    +PFG+GLSYT F Y                      N T
Sbjct: 568 DMK------GRTYRYMEVSPQFPFGFGLSYTDFTYS---------------------NLT 600

Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQ-VI 696
           + +NK          VK  +       ++ N G+ D  EVV  Y      +    KQ +I
Sbjct: 601 LESNK----------VKSGE-SVRLSFDLTNEGEYDADEVVQFYITDVEASVNVPKQSLI 649

Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           G++RV +AAG+S K+ FT+     +KIVDN    +L SG   I +G
Sbjct: 650 GFKRVGLAAGESTKIEFTVTP-DMMKIVDNNGEKILESGEFKIYIG 694


>gi|345519864|ref|ZP_08799275.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
 gi|254836262|gb|EET16571.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
          Length = 736

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 260/772 (33%), Positives = 383/772 (49%), Gaps = 108/772 (13%)

Query: 6   KVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHG 65
           + ++ + P+ +A LP   R KDLV R+TL EKV  M   +  +PRLG+P Y+WW+EALHG
Sbjct: 18  QAQVENLPFRNADLPLEVRVKDLVARLTLEEKVLLMQHHSPAIPRLGIPAYDWWNEALHG 77

Query: 66  VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--- 122
           V+    RT           +   T FP  I   A+F+    +K+G   STE RA++N   
Sbjct: 78  VA----RT-----------LEKVTVFPQAIGMAATFDTEALQKMGDITSTEGRALFNEDW 122

Query: 123 ------LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEY 176
                     GLT+W+PNIN+ RDPRWGR  ET GEDPY+  +     VRGL+       
Sbjct: 123 KAGKTGTRYRGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAKMGAAIVRGLEG------ 176

Query: 177 HRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDV 236
               D   LK  AC KHYA +    +   +R  FD+R +  D+ +T++  F   V +  V
Sbjct: 177 ---EDPHYLKSVACAKHYAVHSGPEY---NRHSFDARPSVFDLWDTYMPAFRELVTKAKV 230

Query: 237 SSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDA 296
             VMC+YNR+NG P C +  LL   +R  W+F GY+ SDC +++   E HK  +     A
Sbjct: 231 HGVMCAYNRLNGQPCCGNDPLLVDILRNQWHFDGYVTSDCWALKDFAEFHK-THPEHTIA 289

Query: 297 VARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--Y 354
           ++  L AG DL+CG+ Y     G V++G  +E DI+ SL  L+ +L ++G FD + +  Y
Sbjct: 290 MSDALLAGTDLECGNLYHLLAEG-VKKGLHSERDINVSLSRLFTILFKIGMFDPAERVPY 348

Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
            ++G+  +    H + A   A++ IVLL+N N  LPL+   IK++AL+GP+A+  +  + 
Sbjct: 349 SSIGREVLECEAHKQHAERMAKESIVLLENKNHILPLDASKIKSIALIGPNADNGQTQLA 408

Query: 415 NYEGTPCRYTSPMDGFYAY--SKV-INYAPGCA--DIVCQNNSMIPAAIDAAKNADATVI 469
           NY GTP    +P          K+ INY PG    D +    S +  A  AA+ +D  V 
Sbjct: 409 NYFGTPSEIVTPYMSLKRRLGDKIKINYLPGVGIVDKLKDAPSFVQVAHKAAQ-SDVIVF 467

Query: 470 VAGLDLSVE-------------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAG 516
           V+G+    E                 DR  + LP  Q EL+ K+    + P+ +V MS  
Sbjct: 468 VSGISADYEGEAGDAGAAGYGGFASGDRTTMQLPLVQIELLKKLKKTGR-PLIIVNMSGS 526

Query: 517 AVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS 576
            +   +   N    ++L   Y G+  G AI DV+FG  NP GR+P+T Y+++        
Sbjct: 527 VMSFEWESQNA--DALLQAWYGGQAAGDAIVDVLFGHCNPAGRMPLTTYKSD------ND 578

Query: 577 MPLRPVNNFP--GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDI 634
           +P  P  N+   GRTY++F G   YPFGYGLSYT F Y               D QC D 
Sbjct: 579 LP--PFENYSMLGRTYRYFKGEPRYPFGYGLSYTTFAY--------------SDVQCVDE 622

Query: 635 NYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHI 692
            +T  T +                     + V N G  DG EVV +Y   P  G     +
Sbjct: 623 THTGDTAR-------------------VTVTVSNTGDCDGDEVVQLYVVHPQDGRKQIPL 663

Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
             + G++R+ +  G+S  V FT+   + L + +   N +  +G  T+ VG G
Sbjct: 664 CALKGFKRIHLKRGESTSVSFTLTP-EELALTETDGNLVEKNGQVTLFVGGG 714


>gi|30316196|sp|P83344.1|XYNB_PRUPE RecName: Full=Putative beta-D-xylosidase; AltName: Full=PpAz152
 gi|19879972|gb|AAM00218.1|AF362990_1 beta-D-xylosidase, partial [Prunus persica]
          Length = 461

 Score =  394 bits (1012), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 210/468 (44%), Positives = 296/468 (63%), Gaps = 21/468 (4%)

Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QY 354
           A  +KAGLDLDCG +    T  AV++G +++ +I+ +L     V MRLG FDG P   QY
Sbjct: 1   ADAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQY 60

Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
            NLG  ++C P H +LA EAARQGIVLL+N   +LPL+T   +T+A++GP+++ T  MIG
Sbjct: 61  GNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSTRRHRTVAVIGPNSDVTVTMIG 120

Query: 415 NYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
           NY G  C YT+P+ G   Y++ I+ A GC D+ C  N +  AA  AA+ ADATV+V GLD
Sbjct: 121 NYAGVACGYTTPLQGIGRYTRTIHQA-GCTDVHCNGNQLFGAAEAAARQADATVLVMGLD 179

Query: 475 LSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
            S+EAE  DR  LLLPG Q EL+++VA A++GP  LV+MS G +D+ FAKN+P+I +I+W
Sbjct: 180 QSIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIW 239

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYK 591
           VGYPG+ GG AIA+V+FG  NPGG+LP+TWY  NYV  +P T M +R  P   +PGRTY+
Sbjct: 240 VGYPGQAGGTAIANVLFGTANPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYR 299

Query: 592 FFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRD---INYTVGTNKPPCAAV 648
           F+ GPVV+PFG GLSYT F + +A  P  V + L   +   +   ++ TV  + P C A+
Sbjct: 300 FYIGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKTVRVSHPDCNAL 359

Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQS 708
              DV          ++V+N G MDG+  ++V++ PP       KQ++G+ ++ IA G  
Sbjct: 360 SPLDV---------HVDVKNTGSMDGTHTLLVFTSPPDGKWASSKQLMGFHKIHIATGSE 410

Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
            +V   ++ CK L +VD      +  G H + +G+    VS  LQ NL
Sbjct: 411 KRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGDLSHHVS--LQTNL 456


>gi|372208556|ref|ZP_09496358.1| beta-glucosidase [Flavobacteriaceae bacterium S85]
          Length = 729

 Score =  393 bits (1009), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 262/759 (34%), Positives = 385/759 (50%), Gaps = 103/759 (13%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           + D  L + ER   LV+ MTL EK+ Q+   +  V RL +P Y WW+EALHGV+  G+  
Sbjct: 26  WLDTSLTFEERIHHLVKAMTLKEKIAQLDSGSPEVKRLDIPEYNWWNEALHGVARNGK-- 83

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL----GN---- 125
                         +T FP  I   A+F+  L K++   +S EARA +N+    GN    
Sbjct: 84  --------------STVFPQAIGLAATFDPVLAKQVASAISDEARAKFNISQSIGNRGQY 129

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
           AGLTFW+PN+N+ RDPRWGR  ET GEDPY+  +  + +V+GLQ          +  + L
Sbjct: 130 AGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGVAFVKGLQG---------NHPKYL 180

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K +AC KH+A +   +     R HF++  +++D+ ET++  FE  V + +V  VM +YN 
Sbjct: 181 KSAACAKHFAVH---SGPEELRHHFNANPSKKDLYETYLPAFEALVKQANVEGVMSAYNA 237

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           V G+P  +   LL +T+R  W F GYIVSDC ++  I + HK +  T  +A A  LKAG+
Sbjct: 238 VYGVPAGSSEFLLKETLRKSWGFDGYIVSDCGALGDIFKGHKQVK-TMPEAAAVALKAGV 296

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNIC 363
           +L+CG  Y      AVQQG ++E  IDT L+ L     +LG+FD      Y  +  + I 
Sbjct: 297 NLNCGYVYNGALEKAVQQGLVSEELIDTRLKQLLKTRFKLGFFDPKEANPYNAIPTSVIH 356

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
           +  HI LA + A++ IVLLKN N  LPL+  NIK   + GP A+++  ++ NY G     
Sbjct: 357 SDDHIALARKTAQKSIVLLKNKNHTLPLDK-NIKVPYVTGPFASSSDVLLANYYGMTTNL 415

Query: 424 TSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPA--AIDAAKNADATVIVAGLDLSVE 478
            S ++G     +    +NY  G       N ++ P   A + AK ADA + V GL    E
Sbjct: 416 VSVLEGIADKVSLGTSLNYRMGALPF---NKNLNPKNWAPNVAKTADAVIAVVGLSADFE 472

Query: 479 AE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
            E           D+ DL LP  Q + + ++A   KGP+ LV+ S  AV +    +    
Sbjct: 473 GEEVDAIASPNKGDKKDLKLPQNQIDYVKEMAAKKKGPLILVVASGSAVALGELYDLADA 532

Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP--G 587
             ++W  YPGE+GG A+ADV+FG  +P G LP+T+        P +   L P  ++   G
Sbjct: 533 IVLMW--YPGEQGGNAVADVLFGDVSPSGHLPVTF--------PKSVAQLPPFEDYSMQG 582

Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
           RTYK+ +   ++PFG+GLSYT FK+       +V I  +K                    
Sbjct: 583 RTYKYMEEEPLFPFGFGLSYTDFKF------SNVQISEEK-------------------- 616

Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIK-QVIGYERVFIAAG 706
                +K KD  FT    V N GK+DG EVV +Y  P        K Q++ ++R+ I   
Sbjct: 617 -----IKKKD-SFTVSCSVANNGKVDGEEVVQLYLVPLNSNKDLPKYQLLKFKRIEIQKN 670

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
            S  V F + A K L  V+         G + ++V   +
Sbjct: 671 TSKTVSFNLEA-KDLFQVNKEGKKTWIKGKYKLVVANAL 708


>gi|366163035|ref|ZP_09462790.1| glycoside hydrolase family 3 [Acetivibrio cellulolyticus CD2]
          Length = 705

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 247/749 (32%), Positives = 380/749 (50%), Gaps = 103/749 (13%)

Query: 21  YPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTH 80
           Y ++A++LV +MTL EK  Q+   +  + RLG+P Y WW+EALHGV+  G          
Sbjct: 7   YKKKAEELVAQMTLEEKASQLTYNSPAIERLGIPAYNWWNEALHGVARAGT--------- 57

Query: 81  FDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWS 132
                  AT FP  I   A F++    KI   ++ EARA YN  +         GLT WS
Sbjct: 58  -------ATVFPQAIGLAAMFDDEFLMKIANAIAIEARAKYNESSKHGDRDIYKGLTIWS 110

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PNIN+ RDPRWGR  ET GEDP++ G+  + +++GLQ           D   +  +AC K
Sbjct: 111 PNINIFRDPRWGRGHETYGEDPFLSGKLGVAFIKGLQ----------GDKDVMMTAACVK 160

Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           H+AAY   +   + R  F++ VT++D+ ET++  FE CV +  V +VM  YNR NG P C
Sbjct: 161 HFAAY---SGPEDLRHGFNAEVTKKDLWETYLPAFETCVKDAKVEAVMGGYNRTNGEPCC 217

Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
               LL   +R  W F G++VSDC +I+     H  +  T E++VA  + AG DL+CG+ 
Sbjct: 218 GSYTLLRDILREKWGFEGHVVSDCWAIKDFHTDH-MVTKTPEESVALAIDAGCDLNCGNM 276

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAA 372
           Y    + A+Q+G I E  I  +   ++    +LG F+GS ++ N+    +   +H E+A 
Sbjct: 277 YLMLLI-ALQEGLITEEHITRAAVRIFTTRFKLGLFEGS-EFDNIPYEVVECSEHKEMAI 334

Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF-- 430
           EAAR+  VLLKND G LP+N G IKT+ ++GP+AN+  A+ GNY GT  RY + ++G   
Sbjct: 335 EAARKSAVLLKND-GILPINKGAIKTIGVIGPNANSRIALKGNYHGTSSRYITLLEGIQD 393

Query: 431 -YAYSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK- 482
                  + Y+ GC       +++   N  +  A+  A+++D  V+  GLD ++E E   
Sbjct: 394 EVGDEVRVLYSNGCELVKDRTEVLAYANDRLAEAVTVAEHSDLVVLCLGLDETIEGEQSD 453

Query: 483 --------DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
                   D+ DL LP  Q  L+ K+    K P  L +M+  A+++++A  +     IL 
Sbjct: 454 EGNNGGSGDKKDLDLPEVQKSLLEKIVATGK-PTVLCLMAGSAINLSYAHEH--CNGILL 510

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
             YPG  GG+A+AD++FG  +P G+LP+T+Y +     P T   ++       RTY++ +
Sbjct: 511 TWYPGARGGKAVADILFGNASPSGKLPVTFYRSLDNLPPITDYSMK------NRTYRYIE 564

Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
              +YPFGYGL+Y   + K      +V+I+       +DI  TV                
Sbjct: 565 EAPLYPFGYGLTYGDVELKHVEIKGTVEIE-------KDIYITV---------------- 601

Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
                      ++N G +   EVV  Y K    +       +  + RV + A +  +V  
Sbjct: 602 ----------TLQNRGSVAVEEVVQAYIKDEQSMYAVTNTSLCAFMRVGLGANEEKQVSM 651

Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVG 742
            +    SLK+V+     +L S   T+  G
Sbjct: 652 RI-PFDSLKVVNLDGEKVLDSKKFTLFAG 679


>gi|365135698|ref|ZP_09343911.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
           4_3_54A2FAA]
 gi|363612160|gb|EHL63713.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
           4_3_54A2FAA]
          Length = 643

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 241/628 (38%), Positives = 345/628 (54%), Gaps = 68/628 (10%)

Query: 21  YPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTH 80
           + +RA+ LV +MTL EKV QM   A  + RLG+P Y WW+E LHGV   G          
Sbjct: 4   FAQRARALVAQMTLEEKVSQMRYDAPAIERLGIPAYNWWNECLHGVGRSGT--------- 54

Query: 81  FDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGNAG----LTFWS 132
                  AT FP  I   ASF+ESL + + Q +S EARA YN     G  G    LTFWS
Sbjct: 55  -------ATVFPQPIGMAASFDESLLEHVAQAISDEARAKYNQYKTFGETGIYQGLTFWS 107

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PNIN+ RDPRWGR  ET GEDP + GR    ++RGLQ+ E        DS+  K+ A  K
Sbjct: 108 PNINLFRDPRWGRGHETYGEDPLLTGRMGTAFIRGLQEGE--------DSQYRKLDATVK 159

Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           H+AA+         R  F++ V+ +DM ++++  F  C+     ++VM +YNR+NG P C
Sbjct: 160 HFAAHSGPE---AGRHSFNAEVSAEDMADSYLWAFRYCIEHAKPAAVMGAYNRINGEPAC 216

Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
           A    L   +  +W F GY+VSDC +IQ I E+H    + KE A A  +  G  L+CG  
Sbjct: 217 ASSTYLKGVLYEEWKFDGYVVSDCGAIQDINENHHVTKNEKESA-ALAVNNGCQLNCGKA 275

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAA 372
           Y ++   AV+ G I+E  +  ++  L+    RLG FD    Y ++  N I   +H EL  
Sbjct: 276 Y-HWVKAAVEDGLISEDTVTCAVERLFEARFRLGMFDSDCVYDSIPMNVIECRKHRELNR 334

Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA 432
           + A++ IVLLKN NG LPLN    KT+A++GP+A+    ++GNY GTP  +T+ + G   
Sbjct: 335 KMAQESIVLLKN-NGILPLNPE--KTIAVIGPNADDKTVLLGNYNGTPSHWTTLLRGIQD 391

Query: 433 YSK-VINYAPGCADIVCQNNSM------IPAAIDAAKNADATVIVAGLDLSVE------- 478
            ++  + YA G   ++ +  ++      +  AI  AK AD  V+  GL   +E       
Sbjct: 392 QARGEVYYARG--SVLVEKEALPWAEKPLHEAIYTAKAADVVVLCLGLSPLLEGEEGDAY 449

Query: 479 --AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
             A+  DR D+ LP  Q +L+  + D  K PV LV +S G VD+  A  + +  +IL   
Sbjct: 450 NGADSGDRKDISLPDIQQQLLCAILDTEK-PVVLVNVSGGCVDLRQA--DERCAAILQCF 506

Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
           YPG EGG A+AD++FG+ +P GRLP+T+Y       P+T   ++      GRTY+FFDG 
Sbjct: 507 YPGAEGGNALADILFGRVSPSGRLPVTFYRTVEDLPPFTDYSMK------GRTYRFFDGK 560

Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIK 624
            +YPFG+GL+Y   K +  + P +V +K
Sbjct: 561 PLYPFGHGLTYADIKEQW-TDPYTVRVK 587


>gi|280977785|gb|ACZ98610.1| glucosidase [Cellulosilyticum ruminicola]
          Length = 711

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 251/757 (33%), Positives = 387/757 (51%), Gaps = 107/757 (14%)

Query: 21  YPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTH 80
           +   AK+LV +M L EK  Q+   A  + RLG+P Y WW+EALHGV+  G          
Sbjct: 4   FKNEAKELVRQMDLLEKASQLRYDAPAIKRLGIPTYNWWNEALHGVARAGV--------- 54

Query: 81  FDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWS 132
                  AT FP  I   A F+E    +I   ++ E RA YN  +         G+TFW+
Sbjct: 55  -------ATVFPQAIGLAAMFDEEKLGEIADIIAIEGRAKYNQFSQKEDRDIYKGMTFWA 107

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PNIN+ RDPRWGR  ET GEDPY+  R  + +++GLQ  E  +Y        LK +AC K
Sbjct: 108 PNINIFRDPRWGRGHETYGEDPYLTARLGVAFIKGLQGDENEDY--------LKAAACAK 159

Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           H+A +   +    DR HFD+ V+++D+ ET++  FE  V E +V  VM +YNRVNG P C
Sbjct: 160 HFAVH---SGPEEDRHHFDAIVSKKDLYETYLPAFEAAVKEANVIGVMGAYNRVNGEPAC 216

Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
               LL   ++ DW F GYIVSDC +I+     H  +  T  ++ A  +  G +L+CG+ 
Sbjct: 217 GSKTLLVDILKKDWGFDGYIVSDCWAIRDFHTEH-MVTHTAAESAALAINNGCELNCGNT 275

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN-NICNPQHIELA 371
           Y +  + A Q+G + E  I  +   L  + M+LG FD + +Y  +    N C   H E+A
Sbjct: 276 YLHM-LEAHQEGLVKEEIITEAAEKLMRIRMQLGLFDKNCKYNEIPYAVNDCKV-HREVA 333

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            EA+R+ +V+LKND G LPLN   +K++ ++GP AN    + GNY GT  RYT+ ++G  
Sbjct: 334 LEASRRSMVMLKND-GILPLNKDKLKSIGIIGPTANNRTVLEGNYNGTASRYTTFVEGIQ 392

Query: 432 AY---SKVINYAPGC-------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE- 480
            Y      + Y+ GC       +++  +N+    A I  A+ +D  V+  GLD ++E E 
Sbjct: 393 DYVGDDVRVYYSEGCHLFANGMSNLAWENDREAEALI-VAEQSDVVVLCLGLDSTIEGEQ 451

Query: 481 --------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
                   G D++ L L G Q +L+ KV    K PV LV+ +  A+ IN+A  +    +I
Sbjct: 452 GDTGNAFAGGDKLSLNLIGRQQQLLEKVVAVGK-PVILVLSTGSAMAINYA--DEHCNAI 508

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
               YPG +GG+A+A ++FG+Y+P G+LP+T+Y+          +P     +   RTY++
Sbjct: 509 FQTWYPGAQGGKALAQLLFGEYSPSGKLPVTFYKTT------EELPAFEDYSMKDRTYRY 562

Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
                +YPFGYGLSY   K       +SV + LD  +     N++ G             
Sbjct: 563 MPNEALYPFGYGLSYADIKV------QSVKV-LDGAKGEEITNFSAGQT----------- 604

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-------PGIAGTHIKQVIGYERVFIAA 705
                 K+  ++E+EN   +D  +VV +Y K        P  +      +  ++ VF+ A
Sbjct: 605 ------KYKVKVELENKSNVDSYDVVQIYIKDMESQYAVPNFS------LCSFKSVFLKA 652

Query: 706 GQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           G+S +V   +   K+  +++     ++ S    + +G
Sbjct: 653 GESKEVTLNVGE-KAFTVINEEGKRIVDSKKFKLFIG 688


>gi|429850127|gb|ELA25427.1| glycoside hydrolase family 3 protein [Colletotrichum
           gloeosporioides Nara gc5]
          Length = 918

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 259/742 (34%), Positives = 390/742 (52%), Gaps = 48/742 (6%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD  L   +RA  LV  +T+ EK+  + + A G+PRL +P YEWWSE LHGV+       
Sbjct: 170 CDESLSDKQRAAALVAELTIWEKLDNLVNEAPGIPRLRVPPYEWWSEGLHGVA------- 222

Query: 75  SPPGTHFDSE--VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
             PGT F S+     ATSFP  IL  ++F++ L + +G+ VS EARA  N G +GL  +S
Sbjct: 223 RSPGTKFTSKGNFSYATSFPQPILLGSAFDDELVRAVGEVVSREARAFSNAGRSGLDLYS 282

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PNIN  +DPRWGR  ETPGED + + +Y    + GL+           D    K+ A CK
Sbjct: 283 PNINAFKDPRWGRGQETPGEDTFHLQKYVSAMLSGLE----------GDDPDKKLIATCK 332

Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           HYAA D +N++G DR  F++ ++ QD+ E ++ PF+ C  E +V S MCSYN +NG P C
Sbjct: 333 HYAANDFENYKGVDRSGFNAVISTQDLSEYYLPPFKTCAVEKNVGSFMCSYNGINGTPLC 392

Query: 253 ADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           A+  L+   +R  W ++G   Y+ +DCD +  +V  H +  D    A A  ++AG DL+C
Sbjct: 393 ANSYLIEDILRKHWGWNGDGQYVSTDCDCVALMVSYHHYAPDLGH-AAAWSMQAGTDLEC 451

Query: 310 GDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQ 366
             +  +  +  A  Q  I+E D+D +L  +Y  L+ +G FD   +   ++LG + +   +
Sbjct: 452 NAFPGSEALQSAWNQSLISEKDVDKALTRMYTSLVSVGLFDLDRKDPLRSLGWDEVNTKE 511

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
             +LA  AA +G VL+KND G LPL+  + K  AL+GP  +AT  M GNY G      SP
Sbjct: 512 AQDLAYRAAVEGAVLMKND-GILPLSPDSSKKYALIGPWVSATTQMQGNYFGPAPYLISP 570

Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
                       Y  G      +++S    AI AA+ AD  + + G+D ++E E  DR  
Sbjct: 571 RKAAKDLGLDFTYFLGSR--TNKSDSSFAQAIKAAQAADVVIFMGGVDNTLEQETLDRNT 628

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           L  P  Q +L+  +++  K P+ ++    G VD      N  + +ILW GYPG+ GG+AI
Sbjct: 629 LAWPEPQLQLLRALSEVGK-PLVVLQFGGGQVDDTELLANDSVNAILWGGYPGQSGGKAI 687

Query: 547 ADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
            D++FG+  P GRL +T Y A+Y   +P T M LR  P N+  GRTY+++ G    P+G+
Sbjct: 688 LDIVFGRAAPAGRLSVTQYPASYNDAVPATDMNLRPGPGNSGLGRTYRWYTGETPVPYGF 747

Query: 604 GLSYTQFK--YKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           GL YT+F    K AS+  ++DI     Q   + N    +  P     L      +    T
Sbjct: 748 GLHYTKFSVDMKPASNVHNIDIA----QMAAEANDDAASEIPSWQRGL------ERRMVT 797

Query: 662 FQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACK 719
             +  +N G +    V +V+ +   G      K ++GY R+  I  G+  K    +   +
Sbjct: 798 VTVSAKNEGNVISDYVALVFLRSEAGPKPWPQKTLVGYTRLRNIKPGEERKEEIIIKM-E 856

Query: 720 SLKIVDNAANSLLASGAHTILV 741
            L  VD   N +L  G +++ +
Sbjct: 857 QLVRVDEVGNRVLYEGLYSLFL 878


>gi|449299051|gb|EMC95065.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
           10762]
          Length = 849

 Score =  389 bits (1000), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 268/777 (34%), Positives = 390/777 (50%), Gaps = 71/777 (9%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD      +RA  ++  M + EK+  + D++YG  RLGLP YEWWSEALHGV+       
Sbjct: 43  CDTNATPYQRASAIINAMNITEKLANLLDVSYGSARLGLPPYEWWSEALHGVA------- 95

Query: 75  SPPGTHFDSE--VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
             PG +F S      ATSFP  I  +++F++   + I   +STEARA  N    GL +++
Sbjct: 96  GSPGVNFTSSGNYSYATSFPMPITFSSAFDDPSVQNIASVISTEARAYSNAARGGLDYFT 155

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PNIN  +DPRWGR  ETPGEDP  +  Y  N + GL+  +   Y   S S   K+ A CK
Sbjct: 156 PNINPFKDPRWGRGSETPGEDPLRIQGYVKNLLIGLEGTDD-GYFNTSHSGYKKMIATCK 214

Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           H+A YDL++W+G  R+ +D+ +T QD+ E ++ PF+ C  + +V+S+MCSYN VN +P C
Sbjct: 215 HFAGYDLEDWDGYIRYGYDAEITTQDLAEYYLPPFQTCARDQNVASIMCSYNSVNSVPAC 274

Query: 253 ADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           A+  L    +R  W +   + YI SDC++I  I  +H + +     A    L  G+D  C
Sbjct: 275 ANSYLQETILREHWGWTIDNNYITSDCNAISDIYYNHNY-SVNNAAAAGLSLSNGMDTAC 333

Query: 310 GDYYTNFTM---GAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICN 364
               T       G+   G + EA I T+L   Y  L+  GYFD   S  Y+++G +++  
Sbjct: 334 IVANTGVMTDVNGSYYGGYVTEATITTALIRQYEALVIAGYFDPASSNPYRSIGWSSVNT 393

Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
           P    LA +AA +G  LLKN  G LP    +   +A++G  AN T  M G Y G      
Sbjct: 394 PAAQTLARQAATEGTTLLKN-TGLLPYKFTSQTKVAMIGMWANGTSQMQGGYSGPAPYLH 452

Query: 425 SPMDGFYAYSKV---INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
           SP+   YA S++    NYA G  +     ++    A  AA+NAD  +   G+D SVEAE 
Sbjct: 453 SPL---YAASQLGLSYNYANGPINQTTLTSNYSQNATAAAQNADVILFFGGIDWSVEAEA 509

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
            DR  +  PG Q  LI ++  AA G   +V+     +D     +N  I +++WVGYPG++
Sbjct: 510 MDRYQIAWPGAQQALIAQL--AALGKPMIVLQMGSMLDATPILSNNNISALVWVGYPGQD 567

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
           GG A  D++ G   P GRLP+T Y A+YV ++P T+M LRP    PGRTYK+++  V+ P
Sbjct: 568 GGVAAFDILTGAVAPAGRLPVTMYPADYVNQVPMTNMSLRPGPGNPGRTYKWYNNAVL-P 626

Query: 601 FGYGLSYTQFK--------------YKVASSPKSVDIK--------------LDKDQQCR 632
           F YGL YT FK                  ++P S  ++                +  Q  
Sbjct: 627 FAYGLHYTTFKATFNGGPPGPGSPWSPPWNAPWSAKVRRGWGWGNWGPPNWGWTQPSQVA 686

Query: 633 DIN------YTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK-PP 685
             N      Y + +    C A   D         +  I V+N G+     V +V+S    
Sbjct: 687 PGNGGLSSSYNIQSLLSSCTAAHPDLCAFP----SVAISVQNAGQTTSDFVALVFSNTTA 742

Query: 686 GIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
           G A    K +  Y R+  +AAGQ+      M     L   D+  N +L  G + +L+
Sbjct: 743 GPAPYPYKSLASYTRLHSVAAGQTVTASLNMT-LGVLARRDDQGNQILYPGTYNLLL 798


>gi|365120422|ref|ZP_09338009.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363647477|gb|EHL86692.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 735

 Score =  389 bits (999), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 251/755 (33%), Positives = 390/755 (51%), Gaps = 99/755 (13%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           FP+ +  L + +R  DLV R+TL EK+ QM + A  + RLG+P Y+WW+E LHGV   GR
Sbjct: 27  FPFQNPDLSFEKRVDDLVSRLTLEEKISQMLNKAPAIERLGIPAYDWWNECLHGV---GR 83

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
                            T FP  I   A+++++L++++  +++ E RA+Y+   +     
Sbjct: 84  TPYK------------VTVFPQAIGMAATWDDALFQQVASSIADEGRAIYHDAISKGVHE 131

Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
              GLT+W+PNIN+ RDPRWGR  ET GEDPY+ G     +V GLQ           D +
Sbjct: 132 IYHGLTYWTPNINIFRDPRWGRGQETYGEDPYLTGTLGKAFVNGLQ---------GDDPK 182

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LK SAC KHYA +   +     R  F++ V+  D+ +T++  F   V +  VSSVMC+Y
Sbjct: 183 YLKASACAKHYAVH---SGPEISRHFFNTEVSMYDLWDTYLPAFRDLVVDAKVSSVMCAY 239

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N + G P C +  L+   +R  W F GY+ SDC +I   ++ HK   D    +   VL  
Sbjct: 240 NALAGQPCCGNDLLMQDILRKQWKFTGYVTSDCGAIDDFLK-HKTHADAAHASADAVLH- 297

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNN 361
           G DL+CG       + AV+QG I EA ID S++ L++   RLG FD +   +Y +   + 
Sbjct: 298 GTDLECGQNIYVKLVDAVKQGLITEAQIDESVKRLFMTRFRLGLFDPADRVKYADTPLSV 357

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           +   +H  LA + +R+ +VLLKNDN  LPL   N+K +A++GP+A+ +  ++GNY G P 
Sbjct: 358 LECDEHKALALKMSRESVVLLKNDN-VLPLRK-NLKKIAVIGPNADDSTVVLGNYNGFPS 415

Query: 422 RYTSPMDGFYA----YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSV 477
           +  +P++   +     ++VI Y      +   +   + A I+  K  D  + V G+   +
Sbjct: 416 KVITPLEAIRSKVGKRTQVI-YDRAIDCVKPSDEKTLNALIERLKGVDQVIFVGGISPRL 474

Query: 478 EAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
           E E          G DR  + LP  QTEL+ K+ +A   PV  V+M+  A+ I +   N 
Sbjct: 475 EGEELPISVDGFRGGDRTTIALPEVQTELMKKMKEAGL-PVIFVMMTGSALGIEWESQN- 532

Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG 587
            I +IL   Y G+  G+AIADV+FG YNP G+LP+T+Y ++    P+ +  +        
Sbjct: 533 -IPAILNAWYGGQFAGQAIADVLFGDYNPSGKLPVTFYRSDSDLPPFGAFSM------AN 585

Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
           RTY++F G  +YPFG+GLSYT F Y V   P+ V               + G    P   
Sbjct: 586 RTYRYFKGEALYPFGFGLSYTMFDYSV---PQVV---------------SGGKVGEPIKV 627

Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQ 707
                           ++V+N+GK +G EVV +Y    G+    I  + G++RV++ AG+
Sbjct: 628 ---------------SVKVKNIGKKNGDEVVQLYLSHEGVEKAPITALKGFKRVYLKAGE 672

Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
              + F ++  + + + D+     +  G  TI  G
Sbjct: 673 EKTLSFEISP-RDMSLPDDNGIITVFPGKKTIYAG 706


>gi|302669556|ref|YP_003829516.1| beta-xylosidase [Butyrivibrio proteoclasticus B316]
 gi|302394029|gb|ADL32934.1| beta-xylosidase Xyl3A [Butyrivibrio proteoclasticus B316]
          Length = 709

 Score =  389 bits (999), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 251/727 (34%), Positives = 382/727 (52%), Gaps = 108/727 (14%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RAK+LV +MT+ EK  Q+   A  + RLG+P Y WW+EALHGV+  G            
Sbjct: 8   KRAKELVAKMTVEEKASQLRYDAPAIDRLGIPAYNWWNEALHGVARAGT----------- 56

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
                AT FP  I   A+F+E L  ++G+ ++ EARA YN  +         GLTFW+PN
Sbjct: 57  -----ATMFPQAIGLAAAFDEELMSEVGEVIAEEARAKYNEQSKREDRDIYKGLTFWAPN 111

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           +N+ RDPRWGR  ET GEDP++  R A+ +V+ +Q           D   +K +AC KH+
Sbjct: 112 VNIFRDPRWGRGHETYGEDPFLTSRLAVPFVKAMQ----------GDGEYMKAAACAKHF 161

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A +     E   R  FD++ +++D++ET++  FE  V E +V +VM +YNR NG P CA+
Sbjct: 162 AVHSGPEGE---RHFFDAKASKKDLEETYLPAFEALVKEAEVEAVMGAYNRTNGEPCCAN 218

Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
             L+  T+RG W F G+ VSDC +I+   E+HK +  + E++    L+ G DL+CG  Y 
Sbjct: 219 KPLMVDTLRGKWGFQGHFVSDCWAIKDFHENHK-VTSSPEESAKLALEMGCDLNCGCTYQ 277

Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEA 374
           +  M  V+ G I E  I  S   L+     LG FD + ++  +    +   +H+ +A  A
Sbjct: 278 SI-MNGVRAGLIDEKLITESCERLFTTRFLLGMFDKT-EFDEIPYEKVECKEHLAVAKRA 335

Query: 375 ARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYS 434
           AR+ +VLLKND G LPLN  +IKT+ +VGP+AN+  ++IGNY GT  RY + ++G     
Sbjct: 336 ARESVVLLKND-GLLPLNKDSIKTIGVVGPNANSRLSLIGNYHGTSSRYITVLEGI--QD 392

Query: 435 KV-----INYAPGCADIVCQNN----------SMIPAAIDAAKNADATVIVAGLDLSVEA 479
           KV     + Y+ GC   + QNN            +  A   A ++D  V+V GLD ++E 
Sbjct: 393 KVGDDVRVLYSEGCD--IFQNNISNLADPNLPDRLSEAQAVADHSDVVVVVVGLDENLEG 450

Query: 480 E---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK 530
           E           D+++L LP  Q +L+N V D  K P  ++ M+  A+D++ A++  +  
Sbjct: 451 EEGDAGNQFASGDKINLNLPLSQRQLLNAVLDCGK-PTIVIDMAGSAIDLSKAQD--EAN 507

Query: 531 SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTY 590
           ++L   YPG  GG  +AD++FG  +P G+LP+T+Y++         +P     +   RTY
Sbjct: 508 AVLQAFYPGARGGADVADILFGDVSPSGKLPVTFYKSA------DDLPDFKDYSMKNRTY 561

Query: 591 KFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
           K+F G  +YPFGYGL+Y     K                   D ++ V            
Sbjct: 562 KYFTGTPLYPFGYGLTYGDCYVKP------------------DYDFNVK---------YA 594

Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSA 709
           D  K    + T  + V N GK+D  EVV +Y K       T    ++G++RV + AG   
Sbjct: 595 DADKVSGAEIT--VTVVNDGKLDTDEVVQLYIKDMDSYFATTNPSLVGFKRVHVPAGGET 652

Query: 710 KVGFTMN 716
           +V  T++
Sbjct: 653 RVTLTVS 659


>gi|291548352|emb|CBL21460.1| Beta-glucosidase-related glycosidases [Ruminococcus sp. SR1/5]
          Length = 697

 Score =  389 bits (998), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 245/749 (32%), Positives = 378/749 (50%), Gaps = 111/749 (14%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           ++A+ LV RMTL EK  Q+   A  + RLG+P Y WW+E LHGV+  G+           
Sbjct: 8   KKAEALVARMTLEEKASQLRYDAPAIKRLGIPAYNWWNEGLHGVARAGQ----------- 56

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
                AT FP  I   A+F+     ++   V+TE RA YN  +         GLTFWSPN
Sbjct: 57  -----ATVFPQAIGMAAAFDRKSVAEMAGIVATEGRAKYNAYSVNGDRDIYKGLTFWSPN 111

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           +N+ RDPRWGR  ET GEDPY+     +++V+ LQ           +   +K +AC KH+
Sbjct: 112 VNIFRDPRWGRGHETYGEDPYLTKELGVSFVKALQ----------GNGDTMKAAACAKHF 161

Query: 195 AAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           A +      G +  R  FD+  + +DM+ET++  FE  V E  V +VM +YNR NG P C
Sbjct: 162 AVHS-----GPEALRHEFDAEASAKDMEETYLPAFEGLVKEAKVEAVMGAYNRTNGEPCC 216

Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
             P  L + +RG+W F G+ VSDC +I+   E H  + DT  ++ A  +  G DL+CG+ 
Sbjct: 217 GSP-TLQKKLRGEWKFQGHFVSDCWAIRDFHEHH-MVTDTAVESAALAINNGCDLNCGNT 274

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAA 372
           Y +  M A ++G + E  I  +   L+     LG FDGS +Y NL    + +P+H++ A 
Sbjct: 275 YLHI-MKAYEKGLVTEETITRAAVRLFTTRYLLGLFDGS-EYDNLSYMEVESPRHLDAAE 332

Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA 432
           +AA +  VLLKN NG LPL+   +KT+ ++GP+A++ +A+IGNY GT  RY +  +G   
Sbjct: 333 KAAEKSFVLLKN-NGILPLDKEKLKTIGIIGPNADSRQALIGNYHGTASRYITIQEGIQD 391

Query: 433 Y---SKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE--- 480
           Y      I  + GC       + +      I  A   A+N+D  ++  GLD ++E E   
Sbjct: 392 YVGDDVRILTSRGCDLFRDRTEHLAFTRDRIAEAKVVAENSDVVILCMGLDETLEGEEGD 451

Query: 481 ------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
                   D+ D+ LPG Q EL+  +AD  K PV   +++   +D+ +A        +LW
Sbjct: 452 TGNSYVSGDKEDIELPGVQRELMEAIADTGK-PVVFCLLAGSDLDLKYAAEKFDAVMMLW 510

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
             YPG +GG+A A V+FG+ +P G+LP+T+YE+      +T   ++      GRTY++ +
Sbjct: 511 --YPGCQGGKAAAKVLFGEISPSGKLPVTFYESLEELPDFTDYSMK------GRTYRYME 562

Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
               +PFGYGL+Y++             + +DK +                       VK
Sbjct: 563 RKAQFPFGYGLTYSK-------------VAVDKAE-----------------------VK 586

Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGF 713
               K   ++EV+N G  D  +VV +Y K           ++ G++R+F+ AG+  K+  
Sbjct: 587 TCGQKINVEVEVQNNGAYDTEDVVQIYVKNIDSKNAIPNPMLAGFQRIFLKAGECRKIEI 646

Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVG 742
            +   K+  +VD     +       I  G
Sbjct: 647 PIWE-KAFTVVDETGKRMEEGKKFEIYAG 674


>gi|374372635|ref|ZP_09630297.1| Beta-glucosidase [Niabella soli DSM 19437]
 gi|373235166|gb|EHP54957.1| Beta-glucosidase [Niabella soli DSM 19437]
          Length = 734

 Score =  387 bits (995), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 257/785 (32%), Positives = 376/785 (47%), Gaps = 120/785 (15%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S  P+ + KL +  R  DLV R+TL EKV+QM + A  +PRLG+P Y+WWSE LHGV+  
Sbjct: 24  SQLPFWNYKLSFEARVNDLVSRLTLEEKVKQMLNHAPAIPRLGIPAYDWWSEVLHGVART 83

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---- 125
              T               T +P  I   A+++      +    + E RA++N       
Sbjct: 84  PYHT---------------TVYPQAIAMAATWDTVALYTMADQSAREGRAIHNKATEEGK 128

Query: 126 -----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
                 GLT+W+PNIN+ RDPRWGR  ET GEDP++       +VRGLQ           
Sbjct: 129 NGDRYVGLTYWTPNINIFRDPRWGRGQETYGEDPFLTAMLGRAFVRGLQ---------GE 179

Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
           D + LK +AC KHYA   + +     R  FD  V++ D+  T++  F+  V    V+ VM
Sbjct: 180 DPKYLKAAACAKHYA---IHSGPEAVRHSFDVDVSDYDLWNTYLPAFKELVTHAKVAGVM 236

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
           C+YN     P C    L+   +R  W F GY+ SDC +I      HK  +   E A    
Sbjct: 237 CAYNAFRKKPCCGSDLLMTDILRRQWGFTGYVTSDCGAIDDFFNYHK-THPNAEAAAIDA 295

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLG 358
           +  G D++CG+        AV+ G+IAE +ID S++ L+++ MRLG FD      Y    
Sbjct: 296 VTNGTDVECGNRAYLTLTDAVKTGRIAEKEIDRSVKRLFMIRMRLGMFDPVSMVSYAQTS 355

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
              + +  H   A + A++ IVLLKN+N  LPL+  +IK +A+VGP+A+ + A++GNY G
Sbjct: 356 PAVLESAPHKAQALKMAQESIVLLKNENHLLPLSK-SIKKIAVVGPNADNSIAVLGNYNG 414

Query: 419 TPCRYTSPMDGFYA---------YSKVINYAPGCADIVCQNNSMIP-------AAIDAAK 462
           TP +  + +DG  A         Y K +N+           N+M+P       A     K
Sbjct: 415 TPSKIVTALDGIKAKLGTNGSVVYEKAVNF----------TNAMLPEGKTDFAALTSRVK 464

Query: 463 NADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVI 512
           +ADA + V G+   +E E            DR  +LLP  QTE + K   A   PV  V+
Sbjct: 465 DADAIIFVGGISPQLEGEEMKVNEPGFNSGDRTTILLPTVQTEAM-KALKATGKPVVFVM 523

Query: 513 MSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI 572
           M+  A+ I + + N  I +I+   Y G+  G AIADV+FG YNP GRLP+T+Y+++    
Sbjct: 524 MTGSALAIPWEQEN--IPAIVNAWYGGQAAGTAIADVLFGDYNPSGRLPVTFYKSD---- 577

Query: 573 PYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCR 632
               +P         RTY++F G  +YPFGYGLSYT F+Y+    P +V  K+       
Sbjct: 578 --ADLPAFDDYRMENRTYRYFSGQALYPFGYGLSYTTFRYEGLKVPTTVKNKV------- 628

Query: 633 DINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI-AGTH 691
                                     +    I++ N G   G EVV +Y    G      
Sbjct: 629 --------------------------RIPVSIQLTNTGAKGGEEVVQLYISYQGQPIKKP 662

Query: 692 IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFP 751
           +K + G++RV++  GQ+  + F +    +L I       L   G   I VG G   V+ P
Sbjct: 663 LKALKGFQRVWLNRGQTKTIKFLLTP-DALAIAGENGKLLNPKGKLRISVGGGQPDVNTP 721

Query: 752 LQLNL 756
              N+
Sbjct: 722 ATSNV 726


>gi|410723195|ref|ZP_11362440.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
           Maddingley MBC34-26]
 gi|410603399|gb|EKQ57833.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
           Maddingley MBC34-26]
          Length = 709

 Score =  386 bits (992), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 244/746 (32%), Positives = 378/746 (50%), Gaps = 104/746 (13%)

Query: 25  AKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSE 84
           AK+LV +MTL E+ +Q+   +  +  L +P Y WW+E LHGV+  G              
Sbjct: 16  AKELVSKMTLQERAEQLTYQSPAIKHLNVPEYNWWNEGLHGVARAGT------------- 62

Query: 85  VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPNIN 136
              AT FP  I   A F+E    +I   +STE RA YN  +         GLT+WSPN+N
Sbjct: 63  ---ATVFPQAIGLAAIFDEEFLGEIADIISTEGRAKYNEYSKKDDRGIYKGLTYWSPNVN 119

Query: 137 VVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAA 196
           + RDPRWGR  ET GEDPY+  R  + +++GLQ           + + LK++AC KH+A 
Sbjct: 120 IFRDPRWGRGHETYGEDPYLTSRLGVAFIKGLQ----------GEGKYLKLAACAKHFAV 169

Query: 197 YDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPK 256
           +     EG  R  F++ V ++D+ ET++  FE CV E +V SVM +YNR NG P C    
Sbjct: 170 HS--GPEGL-RHEFNAVVEKKDLYETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKT 226

Query: 257 LLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNF 316
           LL   +RG W F G++VSDC ++      H  +  T  ++VA  ++ G DL+CG+ Y N 
Sbjct: 227 LLKDILRGKWGFKGHVVSDCWALADF-HLHHMITSTATESVALAIENGCDLNCGNMYLNL 285

Query: 317 TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLG-KNNICNPQHIELAAEAA 375
            + A ++G + E  I T+   L     +LG FD   +Y  +  + N C  +H E+A  A+
Sbjct: 286 LL-AYKEGLVTEEQITTAAERLMTTRFKLGMFDEDCEYNRIPYEVNDCK-EHNEIALIAS 343

Query: 376 RQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY---A 432
           R+ +VLLKND G LPL+  ++K++A++GP+AN+   + GNY GT  +YT+ ++G +    
Sbjct: 344 RKSMVLLKND-GTLPLDKSSLKSIAVIGPNANSEIMLKGNYSGTASKYTTILEGIHNAVG 402

Query: 433 YSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE------ 480
            +  + Y+ GC       + +   +  +  AI  A+ +D  ++  GLD ++E E      
Sbjct: 403 DNIRVYYSEGCHLFKDKVEDLAGPDDRLSEAISVAERSDVVILCLGLDSTIEGEQGDAGN 462

Query: 481 ---GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGY 537
                D+  L LPG Q  L+ KV +  K PV +V+ +  A+  N A+   K  +IL   Y
Sbjct: 463 SYGAGDKESLNLPGRQQNLLEKVLEVGK-PVIVVLGAGSALTFNGAEE--KCAAILNAWY 519

Query: 538 PGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV 597
           PG  GG A+AD++FGK +P G+LP+T+Y+       +T   ++      GRTY++ +   
Sbjct: 520 PGSHGGTAVADILFGKCSPSGKLPVTFYKDTANLPEFTDYSMK------GRTYRYLEHES 573

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           +YPFGYGL+Y++ +      P                                  VK   
Sbjct: 574 LYPFGYGLTYSKVELSNLQVPF---------------------------------VKADF 600

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMN 716
             F   I++ N G     EVV  Y K        +   + G++RV +  G+S  V   ++
Sbjct: 601 ESFDISIDIRNTGNYGIEEVVQCYVKDLKSKYAVLNHSLAGFKRVSLKKGESKTVTIELS 660

Query: 717 ACKSLKIVDNAANSLLASGAHTILVG 742
             +S + V+N    LL S +  + VG
Sbjct: 661 K-RSFEAVNNDGERLLDSKSFKLFVG 685


>gi|182415033|ref|YP_001820099.1| Beta-glucosidase [Opitutus terrae PB90-1]
 gi|177842247|gb|ACB76499.1| Beta-glucosidase [Opitutus terrae PB90-1]
          Length = 905

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 260/764 (34%), Positives = 384/764 (50%), Gaps = 112/764 (14%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           + D+  P   RA DL+ RM+L EKV Q+ + A G+PRLGLP Y++W+EA HG++  G   
Sbjct: 205 WRDSSKPLRVRADDLIRRMSLAEKVSQLKNAAPGIPRLGLPAYDYWNEAAHGIANNGI-- 262

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN------LGNA- 126
                         AT FP  I   A++N +L  + G  +  E RA +N       G++ 
Sbjct: 263 --------------ATVFPQAIGAAAAWNPALLHQEGTVIGIEGRAKFNDYANRHNGDSK 308

Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
              GLT+W+PNIN+ RDPRWGR  ET GEDP++     I +V+G+Q           D R
Sbjct: 309 WWTGLTYWAPNINLFRDPRWGRGQETYGEDPFLTAEIGIEFVKGVQG---------DDPR 359

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            +   AC KHYA +         R  F++ + E+D+ +T++  FE  V EG V+ VM +Y
Sbjct: 360 YMLAMACAKHYAVHSGPE---RTRHSFNAEIPERDLFDTYLPHFERVVREGKVAGVMSAY 416

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVL 301
           N VNG+P  A+  LL + +R  W F GY+ SDCD+I+ I   + H ++  T E+A A  +
Sbjct: 417 NAVNGVPASANSFLLTELLRKRWGFEGYVPSDCDAIRDIYGEKQHHYVK-TAEEAAALAV 475

Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYK----NL 357
           KAG +L CG  Y N  + AVQQG + E D+D +L        RLG FD + Q       L
Sbjct: 476 KAGCNLCCGGDY-NALVRAVQQGLVTEKDLDGALYHTLWTRFRLGLFDPAEQVPFSGYTL 534

Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
             N++  P H ++A E ARQ IVLLKND G LPL+   +K +A++GP+A +   + GNY 
Sbjct: 535 KDNDL--PAHSQVALELARQAIVLLKND-GTLPLDRTKLKQIAVIGPNAASKSMLEGNYH 591

Query: 418 GTPCRYTSPMDGF-----------YAYSKVINYAPGCADIVCQNNSM-------IPAAID 459
           G+  R  S +D             +A    +   PG A    Q+N+           A+ 
Sbjct: 592 GSASRSISILDDIRNLVGSEIKITHAMGSPVTTKPGTAPWSGQDNTTDRPVAELKAEALK 651

Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
            A  ADA + V G+  + E E  DR  + LP  Q +LI  +    K PV +V  S  A+ 
Sbjct: 652 LAAEADAIIYVGGITPAQEGESFDRESIELPSEQEDLIRALHATGK-PVVMVNCSGSAMA 710

Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
           + +   N  + +I+   YPG+EGGRA+A+V+FG+ NP G LPIT+Y +         +P 
Sbjct: 711 LTWQDEN--LPAIVQAWYPGQEGGRAVAEVLFGETNPSGHLPITFYRST------ADLPD 762

Query: 580 RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
               +   RTY++F G  +Y FG+GLSY+ F+Y                      N  V 
Sbjct: 763 FSDYSMKNRTYRYFTGRPLYAFGHGLSYSTFEYA---------------------NLRVA 801

Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG-THIKQVIGY 698
               P A          +   T  +++ N GK DG +VV +Y+ PP  +    ++ + G+
Sbjct: 802 ----PAA----------NGALTVTLDLTNSGKRDGDDVVQLYATPPASSQPQELRALCGF 847

Query: 699 ERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            R  + AG++  V  T+ A    +      +  + SG  TI  G
Sbjct: 848 RRTHVKAGETRTVTVTVPAVALRRWDIAKKDYAIPSGDWTIAAG 891


>gi|373954937|ref|ZP_09614897.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373891537|gb|EHQ27434.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 723

 Score =  385 bits (989), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 257/754 (34%), Positives = 380/754 (50%), Gaps = 103/754 (13%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D   P   R +DL+ ++TL EKV QM D++  VPRL LP Y WW+EALHGV+  G   
Sbjct: 24  YLDPFNPTDVRVRDLISKLTLEEKVHQMMDVSPSVPRLNLPKYNWWNEALHGVARSGV-- 81

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN-------- 125
                         AT FP  I   A+F++ L K+    +S EARAMYN           
Sbjct: 82  --------------ATIFPQAIALGATFDQDLAKRESTAISDEARAMYNAAMVNGYNEKY 127

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLTFW+PNIN+ RDPRWGR  ET GEDP++  +  + +++GLQ           D   L
Sbjct: 128 GGLTFWTPNINIFRDPRWGRGQETYGEDPFLTSQIGVAFIQGLQ---------GDDPEHL 178

Query: 186 KISACCKHYAAYDLDNWEGNDRFH--FDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
           K++AC KH+A +      G +R    F++  + +D++ET++  F+  VN   V +VMC+Y
Sbjct: 179 KVAACAKHFAVHS-----GPERLRHSFNAIASPKDLRETYLPAFKALVN-ARVEAVMCAY 232

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NR N    C    LL+Q +R +W+F G++VSDC +I      HK +    E AVA  +K 
Sbjct: 233 NRTNSEVCCGSNLLLDQILRDEWHFTGHVVSDCGAIVDFYMGHKVVPGQPE-AVALAVKH 291

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD---GSPQYKNLGKN 360
           G+DL+CGD Y    + AV++G I E +ID +L  L     +LG FD    SP Y N+  +
Sbjct: 292 GVDLNCGDEYPAL-IEAVKRGLITEKEIDKALATLLKTRFKLGLFDPKQNSP-YNNIPVS 349

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            I +  H  LA E A + IVLLKN+   LPL   N+    + GP+A +  A++GNY G  
Sbjct: 350 VINSTDHRALAKEVALKSIVLLKNEK-CLPLKN-NLSKYYITGPNAASVDALMGNYYGVN 407

Query: 421 CRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSV 477
              ++ ++G          + Y PG   +   NN+ I      AK +D T +V G+   +
Sbjct: 408 PHMSTILEGIAGAIQPGSQMQYKPGIL-LDRDNNNPIDWTTGDAKASDVTFVVMGITGLL 466

Query: 478 EAEG---------KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
           E E           DR+D  LP  Q + + K+    K  V  +I   G   +N ++ +  
Sbjct: 467 EGEEGEAIASPNYGDRLDYNLPKNQIDFLRKIRKGNKNKVVAII--TGGSPMNLSEVHEL 524

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGR 588
             ++L   YPGEEGG A+AD++FGK +P GRLP+T+ ++     PY    ++      GR
Sbjct: 525 ADAVLLAWYPGEEGGNAVADILFGKVSPSGRLPVTFPKSFAQLPPYEDYSMK------GR 578

Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
           TY++     +Y FGYGLSY+ + Y          + L + Q  +++     T        
Sbjct: 579 TYRYMTAEPMYTFGYGLSYSTYTYS--------SLTLSEKQIKKNMTIIAET-------- 622

Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQS 708
                            V N GKM+G EVV +Y   P         + G++RV + AG+S
Sbjct: 623 ----------------MVTNTGKMEGEEVVQLYITVPQTEKNPQYSLKGFKRVNLKAGES 666

Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            KV F +     +K VD   + +L SG++ + +G
Sbjct: 667 RKVQFQITP-DLMKSVDANGSEVLLSGSYVVRIG 699


>gi|319641744|ref|ZP_07996426.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
 gi|317386631|gb|EFV67528.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
          Length = 702

 Score =  385 bits (989), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 256/754 (33%), Positives = 373/754 (49%), Gaps = 108/754 (14%)

Query: 24  RAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
           R KDLV R+TL EKV  M   +  +PRLG+P Y+WW+EALHGV+    RT          
Sbjct: 2   RVKDLVARLTLEEKVLLMQHHSPAIPRLGIPAYDWWNEALHGVA----RT---------- 47

Query: 84  EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN---------LGNAGLTFWSPN 134
            +   T FP  I   A+F+    +K+G   STE RA++N             GLT+W+PN
Sbjct: 48  -LEKVTVFPQAIGMAATFDTEALQKMGDITSTEGRALFNEDWKAGKTGTRYRGLTYWTPN 106

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           IN+ RDPRWGR  ET GEDPY+  +     VRGL+           D   LK  AC KHY
Sbjct: 107 INIFRDPRWGRGQETYGEDPYLTAKMGAAIVRGLEG---------EDPHYLKSVACAKHY 157

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A +    +   +R  FD+R +  D+ +T++  F   V +  V  VMC+YNR+NG P C +
Sbjct: 158 AVHSGPEY---NRHSFDARPSVFDLWDTYMPAFRELVTKAKVHGVMCAYNRLNGQPCCGN 214

Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
             LL   +R  W+F GY+ SDC +++   E HK  +     A++  L AG DL+CG+ Y 
Sbjct: 215 DPLLVDILRNQWHFDGYVTSDCWALKDFAEFHK-THPEHTIAMSDALLAGTDLECGNLYH 273

Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELAA 372
               G V++G  +E DI+ SL  L+ +L ++G FD + +  Y ++G+  +    H + A 
Sbjct: 274 LLAEG-VKKGLHSERDINVSLSRLFTILFKIGMFDPAERVPYSSIGREVLECEAHKQHAE 332

Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA 432
             A++ IVLL+N N  LPL+   IK++AL+GP+A+  +  + NY GTP    +P      
Sbjct: 333 RMAKESIVLLENKNHILPLDASKIKSIALIGPNADNGQTQLANYFGTPSEIVTPYMSLKR 392

Query: 433 Y--SKV-INYAPGCA--DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE--------- 478
               K+ INY PG    D +    S +  A  AA+ +D  V V+G+    E         
Sbjct: 393 RLGDKIKINYLPGVGIVDKLKDAPSFVQVAHKAAQ-SDVIVFVSGISADYEGEAGDAGAA 451

Query: 479 ----AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
                   DR  + LP  Q EL+ K+    + P+ +V MS   +   +   N    ++L 
Sbjct: 452 GYGGFASGDRTTMQLPLVQIELLKKLKKTGR-PLIIVNMSGSVMSFEWESQNA--DALLQ 508

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP--GRTYKF 592
             Y G+  G AI DV+FG  NP GR+P+T Y+++        +P  P  N+   GRTY++
Sbjct: 509 AWYGGQAAGDAIVDVLFGHCNPAGRMPLTTYKSD------NDLP--PFENYSMLGRTYRY 560

Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
           F G   YPFGYGLSYT F Y               D QC D  +T  T +          
Sbjct: 561 FKGEPRYPFGYGLSYTTFAY--------------SDVQCVDETHTGDTAR---------- 596

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVFIAAGQSAK 710
                      + V N G  DG EVV +Y   P  G     +  + G++R+ +  G+S  
Sbjct: 597 ---------VTVTVSNTGDCDGDEVVQLYVVHPQDGRKQIPLCALKGFKRIHLKRGESTS 647

Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
           V FT+   + L + +   N +  +G  T+ VG G
Sbjct: 648 VSFTLTP-EELALTETDGNLVEKNGQVTLFVGGG 680


>gi|333381510|ref|ZP_08473192.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332830480|gb|EGK03108.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 738

 Score =  385 bits (989), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 253/759 (33%), Positives = 375/759 (49%), Gaps = 101/759 (13%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +P+ D KL   +R  DLV R+TL EKV QM +    + RL +P Y WW+E LHG   IGR
Sbjct: 24  YPFRDTKLSTDKRVSDLVSRLTLEEKVLQMLNNTPAIERLNIPAYNWWNECLHG---IGR 80

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
                  T +       T FP  I   A+++  L K +   +S E RA+YN  +A     
Sbjct: 81  -------TEYK-----VTVFPQAIGMAAAWDARLLKDVANAISDEGRAIYNDASAKGNYS 128

Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
              GLT+W+PN+N+ RDPRWGR  ET GEDPY+ G    ++V GLQ           DS+
Sbjct: 129 IYHGLTYWTPNVNIFRDPRWGRGQETYGEDPYLTGALGKSFVAGLQG---------DDSQ 179

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LK +AC KHYA +   +   N R  F++ VT  D+ +T++  F   V +  V+ VMC+Y
Sbjct: 180 YLKAAACAKHYAVH---SGPENTRHTFNTFVTTFDLWDTYLPAFRDLVVDAKVAGVMCAY 236

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N  +G P C +  L+ + +R  W F GY+ SDC +I      HK   D K  A A  + +
Sbjct: 237 NAFSGEPCCGNNLLMQEILRDKWGFTGYVTSDCGAIDDFYRHHKTHPDAKY-AAADAVYS 295

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNN 361
           G D+DCG+      + AV+ G I E  ID SL+ L+ +  RLG FD +   ++  +  + 
Sbjct: 296 GTDIDCGNEAYKALVDAVKTGLITEEQIDISLKRLFEIRFRLGMFDPAEDVKFSKIPLSV 355

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           + +  H +LA +  R+ IVLLKN+N  LPL +  +K +A++GP+A+   +++GNY G P 
Sbjct: 356 LESQPHKDLALKITRESIVLLKNENNFLPL-SKKLKKVAVIGPNADNEVSVLGNYNGFPT 414

Query: 422 RYTSPMDGFYAYSK--VINYAPGCADIVCQNNSM--IPAAIDAAKNADATVIVAGLDLSV 477
           +  +P        K   + Y  G   +    NS   I A     K  D  +   G+   +
Sbjct: 415 QIITPYKAIKNKLKNTEVIYEKGIDFVKPSENSKEEIAALAKRLKGMDVVIFAGGISPEL 474

Query: 478 EAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
           E E          G DR  + LP  QTEL+  +  A + P   V+M+  A+   +   N 
Sbjct: 475 EGEEMPVKIEGFTGGDRTSIKLPKIQTELMQAL-KAERIPTVFVMMTGSAIAAEWESQN- 532

Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG 587
            + +IL   Y G++ G AIADV+FG YNP G+LP+T+Y  +      + +P         
Sbjct: 533 -VPAILNAWYGGQDAGTAIADVLFGDYNPSGKLPVTFYTKD------SDLPAFNSYEMKN 585

Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
           RTY++FDG V+YPFGYGLSYT+F+Y     P S+                 G N      
Sbjct: 586 RTYRYFDGQVLYPFGYGLSYTKFEYSPIQMPASIK---------------AGEN------ 624

Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH----IKQVIGYERVFI 703
                           I V+N GK DG EVV +Y       GT+    +  +  +ER+ +
Sbjct: 625 ------------MEVSITVKNTGKTDGEEVVQLYISHDN-NGTNRQLPLYALKSFERISL 671

Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            AG+S  V F ++  + + + D      +  G   + +G
Sbjct: 672 KAGESKSVTFKLSP-REMALADEDGVLKMTKGKSKLYIG 709


>gi|109897152|ref|YP_660407.1| beta-glucosidase [Pseudoalteromonas atlantica T6c]
 gi|109699433|gb|ABG39353.1| Beta-glucosidase [Pseudoalteromonas atlantica T6c]
          Length = 733

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 258/764 (33%), Positives = 384/764 (50%), Gaps = 96/764 (12%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           +D P+ D +LP  ER + L++ MTL EK  Q+ +    + RLGLP Y++W+EALHGV+  
Sbjct: 22  NDHPWFDTQLPTNERIESLIDAMTLKEKASQLVNGNVAIERLGLPEYDFWNEALHGVARN 81

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGN 125
           GR                AT FP  I   A+F++ L  +    +S EARA +N    +GN
Sbjct: 82  GR----------------ATVFPQAIGMAATFDQDLLLQAATVISDEARAKFNVSSEIGN 125

Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
               +GLTFW+PNIN+ RDPRWGR  ET GEDPY+  +     V GLQ            
Sbjct: 126 RSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQG---------DH 176

Query: 182 SRPLKISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
            + LK +A  KH+A +      G +  R  FD+  +E+DM ET+   FE  V E DV +V
Sbjct: 177 PKYLKTAAAAKHFAVH-----SGPEALRHEFDAIASEKDMYETYFPAFEALVTEADVETV 231

Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
           M +YNRVNG P      LLN  +R  W F G+IVSDC  +    E HK   +  E A A 
Sbjct: 232 MAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHIVSDCWGLADFHEYHKVTANAVESA-AL 290

Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNL 357
            +  G DL+CG  YT     AV+ G + E  IDT L  +     +LG+FD      Y ++
Sbjct: 291 AINTGTDLNCGSVYTALP-DAVEAGLVDEKTIDTRLHKVLATKFKLGFFDPKDDNPYNSI 349

Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
             + + +  H ++A E A + IVLL+N+N  LPL+  NI+ + + GP A++++ ++GNY 
Sbjct: 350 SADVVNSDAHADVAYEMAVKSIVLLQNENQVLPLDK-NIRNVYVTGPFASSSEVLLGNYY 408

Query: 418 GTPCRYTSPMDGFYAYSKV---INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
           G   + T+ +DG  A   V   INY  G        N +     +A +  D  + V GL 
Sbjct: 409 GLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLS 468

Query: 475 LSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
            + E E           DR+ L LP  Q E + K+      PV +V+++AG   +N  + 
Sbjct: 469 GAYEGEEGEAIASPHKGDRLSLDLPEHQIEFLRKLRKDNDKPV-IVVLTAG-TPVNVTEI 526

Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
                +I++  YPG+EGG+A+AD++FG+ +P GRLPIT+ ++     PY    ++     
Sbjct: 527 AQLADAIVFAWYPGQEGGKAVADILFGERSPSGRLPITFPKSEAQLPPYDDYSMQ----- 581

Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
            GRTY++     +YPFG+GLSY   K+                      N T+G      
Sbjct: 582 -GRTYRYMTEEPMYPFGFGLSYATVKFD---------------------NITLGN----- 614

Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH--IKQVIGYERVFI 703
            A  +     +       + V N G  +  EVV +Y K P  AG    I+ + G++R+ +
Sbjct: 615 -AEALSSTDGQKGTLDVSVNVTNTGTRELEEVVQLYLKTPN-AGIDQPIQSLKGFQRIKL 672

Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGG 747
           A GQ+ +V FT++  K L  ++     +L  G + ++VG    G
Sbjct: 673 APGQTGQVSFTVSK-KQLYSINAKGKPVLLEGDYHVIVGNASPG 715


>gi|291530120|emb|CBK95705.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum 70/3]
          Length = 689

 Score =  383 bits (984), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 252/753 (33%), Positives = 396/753 (52%), Gaps = 114/753 (15%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D +L   ERA  L + ++  E+ QQ+   A  + + GLP Y WW+E LHGV+  G   
Sbjct: 4   YKDKQLSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT-- 61

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   A+F++ +  ++G+ VSTEARAMYN           
Sbjct: 62  --------------ATVFPQAIALAAAFDKDMMCRVGEVVSTEARAMYNSAAKHGDTDIY 107

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT W+PNIN+ RDPRWGR  ET GEDPY+  R  +N+V+G+Q  E          + L
Sbjct: 108 KGLTLWAPNINIFRDPRWGRGHETYGEDPYLTSRLGVNFVKGIQGEE----------KYL 157

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           + +AC KH+A +   +   + R  FD+RV+E+D++ET++  F+  V EG V  VM +YNR
Sbjct: 158 RAAACAKHFAVH---SGPESLRHEFDARVSEKDLEETYLPAFKALVKEGRVEGVMGAYNR 214

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           VNG P+CA  KL+ + +R +W F GY VSDC +I+    +HK + DT   + A  LKAG 
Sbjct: 215 VNGEPSCASEKLMGK-LR-EWGFDGYFVSDCGAIRDFHTNHK-ITDTAPQSAAMALKAGC 271

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNP 365
           D++CG+ Y +  + A+++G I + DI T+        +RLG  D + ++ +L  + I   
Sbjct: 272 DVNCGNTYLHI-LAALEEGLITKQDIRTACIHALRTRIRLGQLDDN-EFDDLPFDIIACD 329

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            +  L+ EAA + +VLL ND G LPL+   I ++A++GP+A++  A++GNYEGTP R  +
Sbjct: 330 GNKALSLEAAEKSMVLLHND-GILPLDKSRISSIAVIGPNADSRAALLGNYEGTPDRSVT 388

Query: 426 PMDGFY-AYSKVINYAPGCADIVCQNNSM-IPA-----AIDAAKNADATVIVAGLDLSVE 478
            ++G   A+   + YA GC     +   + +P      A+ A + AD TV+  GLD ++E
Sbjct: 389 FLEGIQDAFDGRVYYAEGCQLFRDRTQGLALPGDRYAEAVAACEAADVTVVCVGLDSTLE 448

Query: 479 AE-------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKS 531
            E         D+ DL LP  Q  L+ K+ D  K P+ +V+ +  +V+     N     +
Sbjct: 449 GEEGDTENKSGDKPDLRLPEVQRVLLQKLKDTGK-PLIIVLAAGSSVNTECEGN-----A 502

Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYK 591
           ++   YPG+ GG+A+A+++FG+ +P G+LP+T+Y++  +   +T   ++       RTY+
Sbjct: 503 LINAWYPGQYGGKALAEILFGEVSPSGKLPVTFYKSADMLPDFTDYSMK------NRTYR 556

Query: 592 FFDGP--VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
           F D    V+YPFGYGL+Y+ F                   +C DI+Y             
Sbjct: 557 FCDDESNVLYPFGYGLTYSHF-------------------ECGDISY------------- 584

Query: 650 IDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSA 709
                 KD   T  + V N G     +V+ VY +       H   +  +ERV +  G+S 
Sbjct: 585 ------KDN--TLAVNVTNTGSRSAEDVLQVYIRSENGVKNH--SLCAFERVSLFDGESR 634

Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            +   +    + + VD+     + SG +T+  G
Sbjct: 635 TISINIPE-GAFETVDDNGVRAVRSGRYTLYAG 666


>gi|350295750|gb|EGZ76727.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
          Length = 839

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 275/802 (34%), Positives = 389/802 (48%), Gaps = 108/802 (13%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD     PERA  LV+++T+ EK+  + D A G  R+GLP Y WWSE LHGV+       
Sbjct: 37  CDVTGTAPERAASLVDQLTIDEKLVNLVDQALGASRIGLPKYAWWSEGLHGVA------- 89

Query: 75  SPPGTHFDSE---VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
             PG  F++       ATSF   I   ASF++ L  ++G  +STEARA  N G  GL +W
Sbjct: 90  GSPGVTFNTTGYPFSYATSFANAINLGASFDDDLVYEVGTAISTEARAFANFGFGGLDYW 149

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
           +PN+N  +DPRWGR  ETPGEDP  +  Y    + GL+  E V           K+ A C
Sbjct: 150 TPNVNPYKDPRWGRGAETPGEDPLHIKGYVKAMLAGLEGNETVR----------KVIATC 199

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV----- 246
           KHYAAYDL+ W G  R+ F++ VT QD+ E ++ PF+ C  +  V S+MCSYN +     
Sbjct: 200 KHYAAYDLERWHGLTRYEFEAIVTLQDLSEYYLPPFQQCARDSKVGSIMCSYNALTIRDM 259

Query: 247 -------------NGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLN 290
                           P CA+  L+   +R  WN+   + YI SDC++I   +  +   +
Sbjct: 260 AGGSKPDEIINLTTAQPACANTYLMT-ILRDHWNWTEHNNYITSDCNAILDFLPDNHNFS 318

Query: 291 DTKEDAVARVLKAGLDLDCGDYYTNFT--MGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
            T  +A A   KAG D  C    +  T  +GA  Q  + EA IDT+LR LY  L+R GY 
Sbjct: 319 QTPAEAAAAAYKAGTDTVCEVSGSPLTDVVGAYNQSLLPEAVIDTALRRLYEGLIRAGYL 378

Query: 349 D--------------GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTG 394
           D               SP Y  L  N++  P   ELA  +A +GIVLLKN    LPL+  
Sbjct: 379 DHGRSAVAGGDGGSFSSPAYDALNWNDVNTPSTQELALRSATEGIVLLKNSGSLLPLDFS 438

Query: 395 NIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMI 454
             K +AL+G  ANAT  M G Y G P  Y +P+      +  ++YA G        ++  
Sbjct: 439 G-KKVALIGHWANATGTMRGPYSGIPPFYHNPLYAAQQLNLSLSYANGPVVNASDPDTWT 497

Query: 455 PAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMS 514
             A+ AA+ AD  +   G D +V +E  DR  +  P  Q +L++++A   K PV +VI  
Sbjct: 498 APALAAAEGADVVLYFGGTDTTVASEDLDRESIAWPEAQMKLLSELAGLGK-PV-VVIQL 555

Query: 515 AGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIP 573
              VD +   NN  + SILWVGYPG+ GG A+ DV+ GK  P GRLP+T Y   YV ++P
Sbjct: 556 GDQVDDSSLLNNGNVSSILWVGYPGQSGGTAVFDVLTGKKAPAGRLPVTQYPEGYVDEVP 615

Query: 574 YTSMPLRPVNN-------------------------------FPGRTYKFFDGPVVYPFG 602
            T M LRP N+                                PGRTYK++  PV+ PFG
Sbjct: 616 LTEMALRPFNHSSSNLEEEVSVQGGASLTIQARSTPGNKTLSSPGRTYKWYSTPVL-PFG 674

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK-CKDYKFT 661
           YGL YT F           ++ L         ++++ +   PC A  +D           
Sbjct: 675 YGLHYTTF-----------NVSLSLSSNASSPSFSIPSLLTPCTATHLDLCPFSPSANSA 723

Query: 662 FQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACK 719
             + + N G      V +++ S   G     +K ++ Y+RV  I  G++  V     +  
Sbjct: 724 LSVSITNTGTHTSDYVALLFLSGEFGPEPYPLKTLVSYKRVKDIKPGETVTVKDVPVSLG 783

Query: 720 SLKIVDNAANSLLASGAHTILV 741
           ++  VD   N++L  G +  +V
Sbjct: 784 AISRVDGDGNTVLYPGTYRFVV 805


>gi|266619450|ref|ZP_06112385.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
 gi|288869013|gb|EFD01312.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
          Length = 714

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 252/760 (33%), Positives = 378/760 (49%), Gaps = 109/760 (14%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D      ER +DLV +MTL EKV Q+   A  V RLG+P Y WW+EALHGV+  G   
Sbjct: 5   YLDESRTDEERVRDLVSQMTLEEKVSQLRYDAPAVERLGIPSYNWWNEALHGVARAG--- 61

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL----GNAGL- 128
                         AT FP  I   A F+E+L +KIG   + E RA Y+     G+ GL 
Sbjct: 62  -------------AATVFPQAIGLAAMFDEALLEKIGDVTALEGRAKYHEAVRNGDRGLY 108

Query: 129 ---TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
              TFWSPNIN+ RDPRWGR  ET GEDP + GR    Y++G+Q           + + L
Sbjct: 109 KGITFWSPNINIFRDPRWGRGHETYGEDPCLTGRMGTAYIKGMQ----------GNGKRL 158

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K +AC KH+AA+   +     R  F+S V+++D+ ET+   FE CV E  V  VM  YNR
Sbjct: 159 KAAACVKHFAAH---SGPEKGRHSFNSVVSKKDLTETYFPAFERCVKEAGVEGVMGGYNR 215

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           +NG   C    L+ + +R  W F GY VSDC +I+     H  L DT +++ A  LK+G 
Sbjct: 216 LNGEAACGSHHLITEILREKWGFDGYYVSDCGAIKDF-HMHHGLTDTPQESAALALKSGC 274

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN-NICN 364
           DL+CG  Y +  M A  QG ++  DID ++  L +  MRLG FD   ++  +    N C 
Sbjct: 275 DLNCGAVYLH-VMSAYNQGLVSAEDIDRAVTHLMMTRMRLGMFDQHTEFDEIPYEINDC- 332

Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
            +H  LA +AA + +VLLKND G LPL+   +KT+A++GP+ ++ + + GNY GT     
Sbjct: 333 AEHHGLALKAAEESMVLLKND-GILPLDKTALKTVAVIGPNGDSEEILKGNYNGTATEKY 391

Query: 425 SPMDGFYAY---------SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL 475
           + ++G  A          S+  +      + + + +  +  A+  A  +D   +  GL+ 
Sbjct: 392 TILEGIRAVLGKETRIFCSEGSHLYRDNVENLAEADDRLKEAVSMAVRSDVVFLCLGLNG 451

Query: 476 SVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           ++E E         G D+ DL LP  Q  L+  V      PV L++ +  A+ IN+A  +
Sbjct: 452 TLEGEEGDANNSYAGADKADLNLPESQMRLLKAVCGTGT-PVILLLAAGSAMAINYAAEH 510

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
               +IL + YPG+ GG A A ++ G+  P GRLP+T+Y+       +T   ++      
Sbjct: 511 --CSAILHIWYPGQMGGLAAARLLTGEAVPSGRLPVTFYQTTEELPEFTDYSMK------ 562

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++ +   +YPFGYGLSY  F+Y   S+ K+   +   D Q R              
Sbjct: 563 GRTYRYMEREALYPFGYGLSYGDFEY---SNFKAEQTEAGPDGQVR-------------- 605

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIK----QVIGYERVF 702
                          F +++ N  K +  E+  VY +   IA + +      +  + R+ 
Sbjct: 606 ---------------FSVKITNRSKAECDEIAEVYVR---IADSELAAPGGSLADFRRIH 647

Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           + AG+S  V FT+   K+  +V+     +L      +  G
Sbjct: 648 MKAGESVTVPFTL-PVKAFMVVNEEGEYILDGSTAVVTCG 686


>gi|386347261|ref|YP_006045510.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
           6578]
 gi|339412228|gb|AEJ61793.1| glycoside hydrolase family 3 domain protein [Spirochaeta
           thermophila DSM 6578]
          Length = 693

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 261/750 (34%), Positives = 384/750 (51%), Gaps = 114/750 (15%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           ER   L+ +M++ EK   M   A G+PRLG+P Y WW+EALHGV+  G            
Sbjct: 5   ERMTSLLSKMSIEEKAGLMLHRAKGIPRLGIPHYNWWNEALHGVANSGE----------- 53

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN-LGNA-------GLTFWSPN 134
                AT FP  I   A+F+  L +++ + +STEARA +N +G         GLTFWSPN
Sbjct: 54  -----ATVFPQAIGLAATFDPDLVRRVAEAISTEARAKFNAIGKERAAEYERGLTFWSPN 108

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           IN+ RDPRWGR  ET GEDP++  +  +++V+GLQ      Y+       ++++AC KHY
Sbjct: 109 INIYRDPRWGRGQETYGEDPFLTSKIGVSFVKGLQGDH--PYY-------MRVAACAKHY 159

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A +     EG  R  FD+RV+E+D+ ET++  FE  V  G V +VM +YNRVNG P C  
Sbjct: 160 AVH--SGPEGL-RHVFDARVSEKDLWETYLPAFEALVKAG-VEAVMGAYNRVNGEPACGS 215

Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
            +LL++ +R  W F G++VSDC +I      HK   D  E ++A  L+AG DL+CG+ Y 
Sbjct: 216 KRLLDEILRKRWGFKGHVVSDCWAIADFHLHHKVTKDPIE-SIAMALEAGCDLNCGNTYE 274

Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEA 374
           +  + AV+ G ++E  +D S+  L   L RLG F     Y  L  ++I    H  LA EA
Sbjct: 275 HL-LDAVKAGVVSEELVDRSVARLLSTLDRLGLFTDDHPYARLSLSDIDWEAHRALAREA 333

Query: 375 ARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYS 434
           A + +VLLKN NG LP +   ++ + + GP+A    A++GNY G   R  + ++G   Y+
Sbjct: 334 AEKSVVLLKN-NGILPFDRQKLRYIYVTGPNAANPVALLGNYAGVSSRLVTVLEGITGYA 392

Query: 435 K---VINYAPGCADIVCQNNSMIPA--AIDAAKNADATVIVAGLDLSVEAEGKDRV---- 485
                + Y  GC     Q N + P   A   A+ AD TV V G D +VE E  D +    
Sbjct: 393 GPGITVTYKIGCP---LQGNKINPIDWASGVARYADVTVAVMGRDSTVEGEEGDAIFSDN 449

Query: 486 -----DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK----SILWVG 536
                DL LP  Q E + ++ +  K P+ +V++S   V       +P+++    +I++  
Sbjct: 450 YGDLSDLDLPREQIEYLRRIKEIGK-PLVVVLLSGAPV------CSPELEELADAIVYAW 502

Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
           YPGEEGG AIA V+FG+ +P GRLPIT+        P+T   +       GRTY++    
Sbjct: 503 YPGEEGGNAIARVLFGEISPSGRLPITFPRGVDQLPPFTDYSME------GRTYRYMREE 556

Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
            +YPFG+GLSY  F Y+   S  S   + DK +                      ++ C 
Sbjct: 557 PLYPFGFGLSYATFSYRGLQSSAS---RWDKRETL--------------------ELVC- 592

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSK----PPGIAGTHIKQVIGYERVFIAAGQSAKVG 712
                   EVEN   +   EVV +Y +    P  +    +K   G+ RV + AG+  +V 
Sbjct: 593 --------EVENTSSIPADEVVQLYVRWEDAPFRVPLWSLK---GFTRVSLGAGERKQVR 641

Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVG 742
           F ++  + L  +D     +L  G     VG
Sbjct: 642 FVLSP-EELSFIDEEGRKVLPEGRLHFHVG 670


>gi|451821678|ref|YP_007457879.1| periplasmic beta-glucosidase BglX [Clostridium
           saccharoperbutylacetonicum N1-4(HMT)]
 gi|451787657|gb|AGF58625.1| periplasmic beta-glucosidase BglX [Clostridium
           saccharoperbutylacetonicum N1-4(HMT)]
          Length = 710

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 243/750 (32%), Positives = 383/750 (51%), Gaps = 107/750 (14%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           E+AK+LV +MTL E+ +Q+   A  +  L +  Y WW+E LHGV+  G            
Sbjct: 14  EKAKELVSKMTLQERAEQLTYKAPAIKHLNISRYNWWNEGLHGVARAGT----------- 62

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
                AT FP  I   A F++ L +KI   ++TE RA YN  +         GLTFWSPN
Sbjct: 63  -----ATVFPQAIGLAAIFDDELLEKIAGIIATEGRAKYNENSKKEDKDIYKGLTFWSPN 117

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           +N+ RDPRWGR  ET GEDPY+  R  + +V+GLQ           D + LKI+AC KH+
Sbjct: 118 VNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQ----------GDEKYLKIAACAKHF 167

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A +     EG  R  F++ V+++D+ ET++  FE CV E DV +VM +YNR N  P C  
Sbjct: 168 AVHS--GPEGL-RHEFNAVVSKKDLYETYLPAFEACVKEADVEAVMGAYNRTNDEPCCGS 224

Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
             LL   +RG W F G++VSDC +I      H  +  T  ++ A  +K G DL+CG+ Y 
Sbjct: 225 SLLLKDILRGKWQFKGHVVSDCWAIADFHLYHG-VTSTATESAALAIKNGCDLNCGNVYL 283

Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN-NICNPQHIELAAE 373
              + A ++G + E DI  +   L    +RLG FD   ++  +    N C   H E++  
Sbjct: 284 QMLL-AYKEGLVTEEDITRAAERLMATRIRLGMFDEECEFNKIPYTMNDCKEHH-EVSLM 341

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY-- 431
           A+R+ IV+L+N NG LPL+   +K++ ++GP+A++   + GNY GT  +Y + ++G +  
Sbjct: 342 ASRKSIVMLRN-NGLLPLDKSKLKSIGIIGPNADSELMLKGNYFGTASKYITVLEGIHEA 400

Query: 432 --AYSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE--- 480
             + +  I Y+ GC         + + +  +  A+  A+++D  ++  GLD S+E E   
Sbjct: 401 VDSENIRIFYSEGCHLYKDRVQDLAEPDDRMAEAVTVAEHSDVVILCLGLDSSIEGEQGD 460

Query: 481 ------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
                   D+++L LPG Q EL+ KV    K PV +V+ +  A+ +   + N    +IL 
Sbjct: 461 AGNSDGAGDKLNLNLPGKQQELLEKVIATGK-PVIVVLGAGSALTLQGQEEN--CAAILN 517

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
             YPG  GGRAIAD+IFGK +P G+LP+T+Y+       +T   ++       RTY++  
Sbjct: 518 AWYPGSFGGRAIADLIFGKCSPSGKLPVTFYKTTEELPEFTDYSMK------NRTYRYMK 571

Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
              +YPFG+GL+Y++ +                D    DI+                   
Sbjct: 572 NESLYPFGFGLTYSKVQL--------------SDLSVSDIS------------------- 598

Query: 655 CKDYK-FTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVG 712
            KD++     I++ N+G  D  EV+  Y K            +  ++RV +  G+S  V 
Sbjct: 599 -KDFEGVEVSIKISNVGNFDIEEVLQCYIKDLESKYAVDNHSLSAFKRVALNKGESKVVK 657

Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVG 742
            T+N  ++ ++V++  + +L S    + VG
Sbjct: 658 MTINK-RAFEVVNDEGDRILDSKKFKLFVG 686


>gi|374316077|ref|YP_005062505.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
           str. Grapes]
 gi|359351721|gb|AEV29495.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
           str. Grapes]
          Length = 701

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 234/634 (36%), Positives = 336/634 (52%), Gaps = 70/634 (11%)

Query: 21  YPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTH 80
           + E+AK LV  M+L E   Q+   A  +PRLGLP Y WW+EALHG +  G          
Sbjct: 6   FREQAKQLVAHMSLKEMFSQLLHEAPAIPRLGLPRYNWWNEALHGAARSGT--------- 56

Query: 81  FDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWS 132
                  AT FP  I   A F++   K+I   +STE RA YN  +A        GLT WS
Sbjct: 57  -------ATVFPQAIGLAAMFDDVFLKEIATVISTEQRAKYNTFSALGDRGIYKGLTLWS 109

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PN+N+ RDPRWGR  ET GEDPY+  +  +++++GLQ           D   LK +AC K
Sbjct: 110 PNVNIFRDPRWGRGQETYGEDPYLASQLGVSFIQGLQ----------GDGPYLKTAACVK 159

Query: 193 HYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           H+A +      G +  R  F++ V+ +D+ ET++  FE CV EG+V++VM +Y+ VNG P
Sbjct: 160 HFAVHS-----GPEPLRHDFNAIVSRKDLYETYLPAFEACVKEGEVNAVMGAYSAVNGEP 214

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
            C  P L+   +R DW F G  +SDC +I+    +H  +   + D+VA  L AG DL+CG
Sbjct: 215 CCGSPFLITDILRNDWGFEGMYISDCWAIRDFHLNHA-VTKNQVDSVALALNAGCDLNCG 273

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIEL 370
             Y +    A QQG I    I  +   +      LG F     Y N+G       +H ++
Sbjct: 274 CEYLSLEK-AYQQGLIDRKTITQACIRVMTTRFALGLFSEDCTYSNIGYEQNDTEEHRKV 332

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A +A+   +VLLKND G LPL++ ++  +A++GP+A++ +A+ GNY GT   YT+ ++GF
Sbjct: 333 AFKASCNSLVLLKND-GMLPLDSRSLHAIAIIGPNADSREALWGNYHGTSSTYTTVLEGF 391

Query: 431 ---YAYSKVINYAPGCA------DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE- 480
                 S  + Y+ G A      + + + N  I  AI  A  +D  ++  G D +VE E 
Sbjct: 392 RKTLGESVKVKYSQGSAIQKEKLERLAEPNDRIAEAIAVATVSDTIILCLGYDETVEGEM 451

Query: 481 --------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
                     D+ DL LP  Q  L+  VA   K P+ LV++S GA+D    +  P +K++
Sbjct: 452 HDDGNGGWAGDKQDLRLPPCQRALLKAVASTGK-PIVLVLLSGGAIDPEIER-FPNVKAL 509

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
           L   YPG+EGG AIA  I G  NP G LP+T+Y +       T +P        GRTY++
Sbjct: 510 LQGWYPGQEGGLAIAHTILGLNNPSGHLPVTFYRSE------TVLPDFCDYRMEGRTYRY 563

Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
               V+YPFG+GLSYT F Y   S+ K  D  L+
Sbjct: 564 VQEKVLYPFGFGLSYTTFSYGNLSTGKQADGNLE 597


>gi|167751044|ref|ZP_02423171.1| hypothetical protein EUBSIR_02029 [Eubacterium siraeum DSM 15702]
 gi|167655962|gb|EDS00092.1| glycosyl hydrolase family 3 C-terminal domain protein [Eubacterium
           siraeum DSM 15702]
          Length = 691

 Score =  381 bits (978), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 253/755 (33%), Positives = 396/755 (52%), Gaps = 116/755 (15%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D +L   ERA  L + ++  E+ QQ+   A  + + GLP Y WW+E LHGV+  G   
Sbjct: 4   YKDKQLSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT-- 61

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   A+F++ +  ++G+ +STEARAMYN           
Sbjct: 62  --------------ATVFPQAIALAAAFDKDMMYRVGEVISTEARAMYNSAAKHGDTDIY 107

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT W+PNIN+ RDPRWGR  ET GEDPY+  R  +N+V+G+Q  E  EY        L
Sbjct: 108 KGLTLWAPNINIFRDPRWGRGHETYGEDPYLTSRLGVNFVKGIQGEE--EY--------L 157

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           + +AC KH+A +   +   + R  FD+RV+E+DM+ET++  F+  V EG V  VM +YNR
Sbjct: 158 RAAACAKHFAVH---SGPESLRHEFDARVSEKDMEETYLPAFKALVKEGRVEGVMGAYNR 214

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           VNG P+CA  KL+ + +R +W F GY VSDC +I+    +HK + DT   + A  LKAG 
Sbjct: 215 VNGEPSCASEKLMGK-LR-EWGFDGYFVSDCWAIRDFHTTHK-ITDTAPQSAAMALKAGC 271

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNP 365
           D++CG+ Y +  + A+++G I + +I T+        +RLG  D + ++ +L  + I   
Sbjct: 272 DVNCGNTYLHI-LAALEEGLITKQNIRTACIHALRTRIRLGQLDDN-EFDDLPFDIIACD 329

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            +  L+ EAA + +VLL ND G LPL+   I ++A++GP+A++  A++GNY GTP R  +
Sbjct: 330 GNKALSLEAAEKSMVLLHND-GILPLDKSRISSIAVIGPNADSRAALLGNYNGTPDRSVT 388

Query: 426 PMDGFY-AYSKVINYAPGCADIVCQNNSM-IPA-----AIDAAKNADATVIVAGLDLSVE 478
            ++G   A+   + YA GC     +   + +P      A+ A + AD TV+  GLD ++E
Sbjct: 389 FLEGIQDAFDGRVYYAEGCQLFRDRTQGLALPGDRYAEAVAACEAADVTVVCVGLDATLE 448

Query: 479 AE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
            E           D+ DL LP  Q  L+ K+ D  K P+ +V+ +  +V+     N    
Sbjct: 449 GEEGDTGNEFASGDKPDLRLPEVQRVLLQKLKDTGK-PLIIVLAAGSSVNTECEGN---- 503

Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRT 589
            +++   YPG+ GG+A+A+++FG+ +P G+LP+T+Y++  +   +T   ++       RT
Sbjct: 504 -ALINAWYPGQYGGKALAEILFGEVSPSGKLPVTFYKSADMLPDFTDYSMK------NRT 556

Query: 590 YKFFDGP--VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
           Y+F D    V+YPFGYGL+Y+ F                   +C DI+Y           
Sbjct: 557 YRFCDDESNVLYPFGYGLTYSHF-------------------ECGDISY----------- 586

Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQ 707
                   KD   T  + V N G     +V+ VY K       H   +  +ERV +  G+
Sbjct: 587 --------KDN--TLAVNVTNTGSRSAEDVLQVYIKSENGVKNH--SLCAFERVSLFDGE 634

Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           S  +   +       + DN   +++ SG +T+  G
Sbjct: 635 SRTISINIPEGAFETVDDNGVRAVI-SGRYTLYAG 668


>gi|121700633|ref|XP_001268581.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
 gi|119396724|gb|EAW07155.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
          Length = 743

 Score =  381 bits (978), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 251/753 (33%), Positives = 366/753 (48%), Gaps = 95/753 (12%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD      +RA  L+   TL E V   G+ + GVPRLGLP Y+ W+EALHG+  
Sbjct: 57  LSKTIVCDTLTSPYDRAAALISLFTLEELVNATGNTSPGVPRLGLPPYQVWNEALHGLD- 115

Query: 69  IGRRTNSPPGTHFDSE--VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
                      +F  E     +TSFP  ILT ++ N +L  ++   +ST+ RA  N G  
Sbjct: 116 ---------RAYFTDEGQFSWSTSFPMPILTMSALNRTLINQVASIISTQGRAFSNAGRY 166

Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSRPL 185
           GL  +SPNIN  R P WGR  ETPGED Y +   YA  Y+ G+Q   GV      D + L
Sbjct: 167 GLDVYSPNINSFRHPVWGRGQETPGEDAYCLSSAYAYEYITGIQG--GV------DPKSL 218

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K+ A  KHYA YD++NW+G+ R   D  +T+QD+ E +   F +   +  V SVMCSYN 
Sbjct: 219 KLVATAKHYAGYDIENWDGHSRLGNDMNITQQDLSEYYTPQFLVAARDAKVRSVMCSYNA 278

Query: 246 VNGIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           VNG+P+CA+   L   +R  + F   GYI SDCDS   +   H++  +    A A  ++A
Sbjct: 279 VNGVPSCANSFFLQTLLRDTFGFVEDGYISSDCDSAYNVFNPHEYAANVSS-AAADSIRA 337

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
           G D+DCG  Y  +   AV Q  ++ ADI+  +  LY  LMRLGYFD  P           
Sbjct: 338 GTDIDCGTTYQYYFDEAVDQNLLSRADIERGVIRLYSNLMRLGYFDVGPWM--------- 388

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
                                          N+ T             + GNY G     
Sbjct: 389 -------------------------------NVST------------QLQGNYFGPAPYL 405

Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
            SP+D F      +NYA G  +I   +      A+ AAK +DA +   G+D S+EAE  D
Sbjct: 406 ISPLDAFRDSHLDVNYAFGT-NISSNSTDGFSKALSAAKKSDAIIFAGGIDNSLEAETLD 464

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R+++  PG Q ELI++++   K P+ ++ M  G VD +  K+N  + S++W GYPG+ GG
Sbjct: 465 RMNITWPGKQLELIDQLSQLGK-PLIVLQMGGGQVDSSLLKSNKNVNSLIWGGYPGQSGG 523

Query: 544 RAIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
           +A+ D+I GK  P GRL +T Y A Y  + P T M LRP  N PG+TY ++ G  VY FG
Sbjct: 524 QALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGNNPGQTYMWYTGTPVYEFG 583

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
           +GL YT F+   A +     +K+      +D+       +P    + ++ +        F
Sbjct: 584 HGLFYTTFRVSHARA-----VKIKPTYNIQDL-----LAQPHPGYIHVEQMPF----LNF 629

Query: 663 QIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
            +++ N GK       M+++    G A    K ++G++R+      ++K+        S+
Sbjct: 630 TVDITNTGKASSDYTAMLFANTTAGPAPYPKKWLVGFDRLPTLGPSTSKLMTIPVTINSM 689

Query: 722 KIVDNAANSLLASGAHTILVGEGVGGVSFPLQL 754
              D   N +L  G + + +      V  PL L
Sbjct: 690 ARTDELGNRVLYPGKYELALNNE-RSVVLPLSL 721


>gi|410098444|ref|ZP_11293422.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409222318|gb|EKN15263.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 738

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 254/767 (33%), Positives = 373/767 (48%), Gaps = 108/767 (14%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
           D+P+ +  LP   R +D++ R+TL EKVQ M   A  VPRLG+P Y WW+EALHGV+   
Sbjct: 25  DYPFRNPDLPLDVRVQDIISRLTLEEKVQLMKHAAPAVPRLGIPAYNWWNEALHGVA--- 81

Query: 71  RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGNA 126
            RT               T FP  I   A+F+    +K+G   S+E RA++N     G  
Sbjct: 82  -RTKEK-----------VTVFPQAIGMAATFDTEALQKMGDMTSSEGRALFNEDLKAGKT 129

Query: 127 G-----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
           G     LT+W+PNIN+ RDPRWGR  ET GEDPY+  +     V GL+          ++
Sbjct: 130 GEIYRGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAKMGSAIVHGLEG---------NN 180

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
              LK  AC KHYA +   +   ++R  +D+RV+  D+ +T++  F   V +  V  VMC
Sbjct: 181 PEYLKSVACAKHYAVH---SGPEHNRHSYDARVSMYDLWDTYLPAFRELVTKAKVHGVMC 237

Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHK-FLNDTKEDAVARV 300
           +YNR  G P C   +LL   +R  W F GY+ SDC ++    + HK   NDT  +AVA  
Sbjct: 238 AYNRFEGTPCCGHNELLQDILRNQWKFDGYVTSDCWAVSDFAKYHKTHSNDT--EAVADA 295

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLG 358
           +  G DL+CG+ Y     G V++G I+E DI+ SL  L+ +  +LG +D + +  Y ++G
Sbjct: 296 VLNGTDLECGNLYQKLQQG-VEKGLISEKDINVSLARLFEIQFKLGMYDPADRVPYASIG 354

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
           +  I    H + A E A++ +VLLKN+   LPLN   IK +AL+GP+ +    ++ NY G
Sbjct: 355 REVIECDAHKKHAYEMAQKSMVLLKNNKNILPLNASKIKRIALIGPNMDNGSTLLANYFG 414

Query: 419 TPCRYTSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPA---AIDAAKNADATVIVAG 472
           TP    +P       +  S  I+   G    + Q     P+       AK AD  + V G
Sbjct: 415 TPSEIITPYKSLQKRFGNSIQIDTLTGVG--IVQKLEGAPSFAQVAAQAKKADIIIFVGG 472

Query: 473 LDLSVE-------------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
           +    E                 DR  + LP  QTEL+ ++    + P+ LV MS   + 
Sbjct: 473 ISADYEGEAGDAGAAGYGGFASGDRTTMKLPPVQTELMKELKKTGR-PLILVNMSGSVMS 531

Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
            ++   N    +IL   Y G+  G AI DV+FG YNP GR+P+T Y  +        +P 
Sbjct: 532 FDWESRNA--DAILQAWYGGQAAGDAITDVLFGDYNPAGRMPLTTYMND------EDLPD 583

Query: 580 RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
               +   RTY++F G V YPFGYGLSYT F Y    +  +V     K  +   +  T  
Sbjct: 584 FEDYSMANRTYRYFKGDVRYPFGYGLSYTTFGYAPLQNASTV-----KTGESIQVTTT-- 636

Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI--KQVIG 697
                                     V N GK  G EVV +Y   P    T +  + + G
Sbjct: 637 --------------------------VTNTGKRAGDEVVQLYISHPQNGNTRVPLRALKG 670

Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
           ++R+ +  G+S +V FT++  + L +VD   N +   G   + +G G
Sbjct: 671 FKRIHLDTGESRQVTFTLSP-EELSLVDEKGNQVEKEGTVELYIGGG 716


>gi|255572559|ref|XP_002527213.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
 gi|223533389|gb|EEF35139.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
          Length = 454

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 190/446 (42%), Positives = 275/446 (61%), Gaps = 9/446 (2%)

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG---SPQYKNLGKNN 361
           +D++CG Y       AV +GK+ E DID +L  L+ V +RLG FDG   +  +  LG  +
Sbjct: 1   MDINCGSYAIRNAQSAVDKGKLREEDIDRALLNLFSVQLRLGLFDGDRINGHFSKLGPED 60

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           +C  +H +LA EAARQGIVLLKN+   LPLN   + +LA++GP AN   ++ G+Y G  C
Sbjct: 61  VCTEEHKKLALEAARQGIVLLKNEKKFLPLNKKAVSSLAIIGPLANNGGSLGGDYTGYSC 120

Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
              S  DG  AY K  +YA GC+++ C ++   P AI  AK AD  ++VAG+DLS E E 
Sbjct: 121 NPQSLFDGVQAYIKRTSYAVGCSNVSCDSDDQFPEAIHIAKTADFVIVVAGIDLSQETED 180

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
           +DR+ LLLPG Q  L++ VA A+K PV LV+   G VD++FAK + +I SILW+GYPGE 
Sbjct: 181 RDRISLLLPGKQMALVSYVAAASKKPVILVLTGGGPVDVSFAKRDSRIASILWIGYPGEA 240

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVY 599
           G +A+AD+IFG+YNPGGRLP+TWY  ++  +P   M +R  P   +PGRTY+F+ G  VY
Sbjct: 241 GAKALADIIFGEYNPGGRLPMTWYPESFTNVPMNDMNMRANPNRGYPGRTYRFYTGERVY 300

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV-KCKDY 658
            FG GLSYT + YK  S+P  + +        R     +         + ID++  C   
Sbjct: 301 GFGEGLSYTNYAYKFLSAPSKLSLSGSLTATSR--KRILHQRGDRLDYIFIDEISSCNSL 358

Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTMNA 717
           +FT QI V N+G MDGS VVM++S+ P ++ GT  KQ++G+ER+   + +S +    ++ 
Sbjct: 359 RFTVQISVMNVGDMDGSHVVMLFSRVPQVSEGTPEKQLVGFERINTVSHKSTETSILLDP 418

Query: 718 CKSLKIVDNAANSLLASGAHTILVGE 743
           CK L I +     ++  G+H +L+G+
Sbjct: 419 CKHLSIANGQGKRIMPVGSHVLLLGD 444


>gi|295134875|ref|YP_003585551.1| beta-glucosidase [Zunongwangia profunda SM-A87]
 gi|294982890|gb|ADF53355.1| beta-glucosidase [Zunongwangia profunda SM-A87]
          Length = 735

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 257/766 (33%), Positives = 390/766 (50%), Gaps = 103/766 (13%)

Query: 3   ESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEA 62
           +  K+  S+F + D  L   ER  DL+ R+TL EK QQM + +  + RLG+P Y+WW+EA
Sbjct: 23  QQTKIDKSEFDFYDTDLSMDERIDDLISRLTLEEKAQQMLNASPAIERLGIPAYDWWNEA 82

Query: 63  LHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
           LHG+   G                 AT FP  I   A+F++ L  K+   +S EARA +N
Sbjct: 83  LHGLGRSGV----------------ATVFPQAIGMGATFDDDLILKVSTAISDEARANFN 126

Query: 123 LGNA----------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVE 172
             NA          GLTFW+PN+N+ RDPRWGR  ET GEDPY+  +    +V+GLQ   
Sbjct: 127 --NAVKHGYHRKYGGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSKLGEAFVKGLQG-- 182

Query: 173 GVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVN 232
                 D+D + LK +A  KHYA +   +     R  F++ V+E+D+ ET++  F+  V 
Sbjct: 183 ------DND-KYLKTAAAAKHYAVH---SGPEKLRHEFNADVSEKDLWETYLPAFKTLV- 231

Query: 233 EGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDT 292
           + +V ++MC+YN  NG P CA+ +L+N  +R  W F+G++VSDC ++Q  V  H  + ++
Sbjct: 232 DANVETIMCAYNSTNGEPCCANNRLINDILRDKWGFNGHVVSDCWALQDFVSGHDIV-ES 290

Query: 293 KEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--G 350
            E A A  ++ G++L+CGD Y NF   AV+ G ++E  +D  L  L     +LG FD   
Sbjct: 291 PEAAAALAVEVGIELNCGDTY-NFLAKAVEDGLVSEELVDKRLHKLLETRFKLGLFDPEE 349

Query: 351 SPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATK 410
           S  Y  +G   + + +H  LA E AR+ IVLLKND G LPL   N+    + GP+A   +
Sbjct: 350 SNPYNKIGVEVMNSDEHRALARETARKSIVLLKND-GVLPLKN-NLSKYFITGPNATNIE 407

Query: 411 AMIGNYEGTPCRYTSPMDGFYAYSK---VINYAPGCADIVCQNNSMIPAAIDAAKNADAT 467
            ++GNY G      + ++G     K    + Y  G    +   N    A+ +A  N+DAT
Sbjct: 408 VLLGNYHGVNPDMVTVLEGIAKAIKPESQLQYRMGTRLNLPNENPQDWASPNAG-NSDAT 466

Query: 468 VIVAGLDLSVEAEG---------KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAV 518
            +V G+   +E E           DR+D  LP  Q + + KV++AA+    + I++ G+ 
Sbjct: 467 FVVMGISGLLEGEEGESIASPTFGDRMDYNLPQNQIDYLQKVSEAAEDRPVVAIVTGGS- 525

Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
            +N  + +    ++L V YPGEEGG A+AD+IFGK +P GRLPIT+       +    +P
Sbjct: 526 PMNLTEVHKLADAVLLVWYPGEEGGNAVADIIFGKNSPSGRLPITF------PMTIEDLP 579

Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV 638
                   GRTYK+ D   +YPFGYGLSYT F+Y      K    K +  +         
Sbjct: 580 AYEDYTMEGRTYKYMDVVPMYPFGYGLSYTDFEYSEIKLSKDKIKKKESVEA-------- 631

Query: 639 GTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIK--QVI 696
                                   +I V N G  +  EVV VY K    A + +   +++
Sbjct: 632 ------------------------RISVTNTGDFEADEVVQVYLKDVK-ASSRVPNFELV 666

Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            ++ + +  G+S ++ F +   + L  +D+     L  GA  I +G
Sbjct: 667 AFKNIHLKRGESKELTFEITP-EMLSFIDDNGKEKLEKGAFEIYIG 711


>gi|402308386|ref|ZP_10827395.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
 gi|400375830|gb|EJP28725.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
          Length = 721

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 256/743 (34%), Positives = 379/743 (51%), Gaps = 98/743 (13%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
             AK+++ RMT+ EK+ Q+ + +  +  LG+  Y+WWSE LHGV   GR           
Sbjct: 32  RHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR----------- 80

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--------NAGLTFWSPN 134
                AT FP  I   A+F+E+L ++IG  V+TE RA +N+         NAGLTFWSPN
Sbjct: 81  -----ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVAQKLKNYSRNAGLTFWSPN 135

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           +N+ RDPRWGR +ET GEDP + G     YVRGLQ           D+  LK  AC KHY
Sbjct: 136 VNIFRDPRWGRGMETYGEDPLLSGMLGTAYVRGLQ---------GDDAFYLKTGACAKHY 186

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A +     EG  R   D   + +D+ ET++  F+M V +G V +VM +YNRV G P    
Sbjct: 187 AVH--SGPEGT-RHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGS 243

Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
             LL   +R  W F+G+IVSDCD+I      H+++  T E+A A  +KAGL+++CG  + 
Sbjct: 244 KYLLTDILRKSWGFNGHIVSDCDAINDFYGGHRYVK-TPEEACAAAIKAGLNVECGHTFK 302

Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF--DGSPQYKNLGKNNICNPQHIELAA 372
               GA+ QG +AEAD+D +L  L +  ++LG    D +  Y +  ++ IC+P H  LA 
Sbjct: 303 AM-QGALDQGLLAEADLDRALFPLVMTRLKLGILEPDSACPYNSYDESEICSPAHTALAL 361

Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA 432
            AA + +VLLKN NG LPL+  NI+TL + GP A+    ++GNY G   RY++ + G  +
Sbjct: 362 RAADEAMVLLKN-NGILPLDK-NIRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVS 419

Query: 433 Y---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE--------- 480
                  +N+ P    I  + N M   A++ A  A+  ++V G + ++E E         
Sbjct: 420 RVSSGTSVNFRPAFMQITEELNDM-NWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASAS 478

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
             DRV + LP  Q   + +V  A KG   +V+++ G+  I+  K +    +++   YPG+
Sbjct: 479 RGDRVGIGLPASQLNYLRRV-KARKGGRIVVVLTGGS-PIDLRKISKLADAVVMAWYPGQ 536

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
           EGG A+ D++FG  N  GRLPIT+           S+P     +  GRTYK+  G V+YP
Sbjct: 537 EGGEALGDLLFGDKNFSGRLPITF------PADVDSLPAFDDYSMNGRTYKYMSGNVMYP 590

Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
           FGYGLSY +  Y  A                      VG  K             K    
Sbjct: 591 FGYGLSYGRVTYTDAR--------------------VVGRIK-------------KGEPL 617

Query: 661 TFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
             ++ + N G     EV   Y + P    G+ +  ++G+ RV I    S K  F +   +
Sbjct: 618 AVEVVLTNNGDRTIDEVAQAYIATPTAGKGSPMASLVGFRRVSIPPKSSVKAVFKIVPER 677

Query: 720 SLKIVDNAANSLLASGAHTILVG 742
            + I  + ++ LL  G +T+ +G
Sbjct: 678 LMTIQSDGSSKLL-KGNYTLTIG 699


>gi|307719075|ref|YP_003874607.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
           6192]
 gi|306532800|gb|ADN02334.1| glycoside hydrolase family 3 [Spirochaeta thermophila DSM 6192]
          Length = 693

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 263/750 (35%), Positives = 380/750 (50%), Gaps = 114/750 (15%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           ER   L+ RM++ EK   M   A GVPRLG+P Y WW+EALHGV+  G            
Sbjct: 5   ERMTSLLSRMSIEEKAGLMVHRAKGVPRLGIPNYNWWNEALHGVANSGE----------- 53

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN-LGNA-------GLTFWSPN 134
                AT FP  I   A+F+  L +++   +S EARA +N +G         GLTFWSPN
Sbjct: 54  -----ATVFPQAIGLAATFDPDLVRRVADAISREARAKFNAVGKERAAEYERGLTFWSPN 108

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           IN+ RDPRWGR  ET GEDP++  +  + +V+GLQ      Y+       L+++AC KHY
Sbjct: 109 INIYRDPRWGRGQETYGEDPFLTSKIGVAFVKGLQGDH--PYY-------LRVAACAKHY 159

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A +     EG  R  FD+RV+E+D+ ET++  FE  V  G V +VM +YNRVNG P C  
Sbjct: 160 AVH--SGPEGL-RHVFDARVSEKDLWETYLPAFEALVKAG-VEAVMGAYNRVNGEPACGS 215

Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
            +LL + +R  W F G++VSDC +I      HK   D  E ++A  L+AG DL+CG+ Y 
Sbjct: 216 KRLLEEILRKKWGFKGHVVSDCWAIADFHLHHKVTKDPIE-SIAMALEAGCDLNCGNTYE 274

Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEA 374
           +  + AV+ G ++E  +D S+  L   L RLG F     Y  L   +I    H  LA EA
Sbjct: 275 HL-LDAVKAGAVSEELVDRSVARLLSTLDRLGLFTDDHPYVRLSLADIDWEAHRALAREA 333

Query: 375 ARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYS 434
           A + +VLLKN NG LPL+   ++ + + GP+A    A++GNY G   R  + ++G   Y+
Sbjct: 334 AEKSVVLLKN-NGILPLDRRKLRYIYVTGPNAANPVALLGNYAGVSSRLVTVLEGITGYA 392

Query: 435 K---VINYAPGCADIVCQNNSMIPA--AIDAAKNADATVIVAGLDLSVEAEGKDRV---- 485
                + Y  GC     Q N + P   A   A+ AD TV V G D +VE E  D +    
Sbjct: 393 GPGITVTYKIGCP---LQGNKINPIDWASGVARYADVTVAVMGRDSAVEGEEGDAIFSDN 449

Query: 486 -----DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK----SILWVG 536
                DL L   Q + + ++ +  K P+ +V++S   V       +P+++    +I++  
Sbjct: 450 YGDLSDLNLSREQIDYLRRIKEIGK-PLVVVLLSGAPV------CSPELEELADAIVYAW 502

Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
           YPGEEGG AIA V+FG+ +P GRLPIT+ +      P+T   +       GRTY++    
Sbjct: 503 YPGEEGGNAIARVLFGEVSPSGRLPITFPKGVDQLPPFTDYSME------GRTYRYMKEE 556

Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
            +YPFG+GLSY  F Y+    PKS   + DK +                      +V C 
Sbjct: 557 PLYPFGFGLSYATFSYR---DPKSSASRWDKRETL--------------------EVVC- 592

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSK----PPGIAGTHIKQVIGYERVFIAAGQSAKVG 712
                   EVEN   +   EVV +Y +    P  +    +K   G+ RV +  G+  +V 
Sbjct: 593 --------EVENTSSIPADEVVQLYVRWEDAPFRVPLWSLK---GFTRVSLGTGERIQVR 641

Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVG 742
           F ++  + L  +D     +L  G     VG
Sbjct: 642 FVLSP-EDLSFIDEKGRKVLPEGRLRFHVG 670


>gi|291556907|emb|CBL34024.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum V10Sc8a]
          Length = 691

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 252/755 (33%), Positives = 395/755 (52%), Gaps = 116/755 (15%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D +L   ERA  L + ++  E+ QQ+   A  + + GLP Y WW+E LHGV+  G   
Sbjct: 4   YKDKQLSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT-- 61

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   A+F++ +  ++G+ +STEARAMYN           
Sbjct: 62  --------------ATVFPQAIALAAAFDKDMMYRVGEVISTEARAMYNSAAKHGDTDIY 107

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT W+PNIN+ RDPRWGR  ET GEDPY+  R  +++V+G+Q  E  EY        L
Sbjct: 108 KGLTLWAPNINIFRDPRWGRGHETYGEDPYLTSRLGVSFVKGIQGEE--EY--------L 157

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           + +AC KH+A +   +   + R  FD+RV+E+DM+ET++  F+  V EG V  VM +YNR
Sbjct: 158 RAAACAKHFAVH---SGPESLRHEFDARVSEKDMEETYLPAFKALVKEGRVEGVMGAYNR 214

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           VNG P+CA  KL+ + +R +W F GY VSDC +I+    +HK + DT   + A  LKAG 
Sbjct: 215 VNGEPSCASEKLMGK-LR-EWGFDGYFVSDCWAIRDFHTTHK-ITDTAPQSAAMALKAGC 271

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNP 365
           D++CG+ Y +  + A+++G I + DI T+        +RLG  D + ++ +L  + I   
Sbjct: 272 DVNCGNTYLHI-LAALEEGLITKQDIRTACIHALRTRIRLGQLDDN-EFDDLPFDIIACD 329

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
            +  L+ EAA + +VLL ND G LPL+   I ++A++GP+A++  A++GNY GTP R  +
Sbjct: 330 GNKALSLEAAEKSMVLLHND-GILPLDKSRISSIAVIGPNADSRAALLGNYNGTPDRSVT 388

Query: 426 PMDGFY-AYSKVINYAPGCADIVCQNNSM-IPA-----AIDAAKNADATVIVAGLDLSVE 478
            ++G   A+   + YA GC     +   + +P      A+ A + AD TVI  GLD ++E
Sbjct: 389 FLEGIQDAFDGRVYYAEGCQLFRDRTQGLALPGDRYAEAVAACEAADVTVICVGLDATLE 448

Query: 479 AE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
            E           D+ DL LP  Q  L+  + D  K P+ +V+ +  +V+     N    
Sbjct: 449 GEEGDTGNEFASGDKPDLRLPEVQRVLLQNLKDTGK-PLIIVLAAGSSVNTECEGN---- 503

Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRT 589
            +++   YPG+ GG+A+A+++FG+ +P G+LP+T+Y++  +   +T   ++       RT
Sbjct: 504 -ALINAWYPGQYGGKALAEILFGEVSPSGKLPVTFYKSADMLPDFTDYSMK------NRT 556

Query: 590 YKFFDGP--VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
           Y+F D    V+YPFGYGL+Y+ F                   +C D++Y           
Sbjct: 557 YRFCDDESNVLYPFGYGLTYSHF-------------------ECGDVSY----------- 586

Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQ 707
                   KD   T  + V N G     +V+ VY K       H   +  +ERV +  G+
Sbjct: 587 --------KDN--TLAVNVTNTGSRSAEDVLQVYIKSENGVKNH--SLCAFERVSLFDGE 634

Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           S  +   +    + + VD+     + SG +T+  G
Sbjct: 635 SRTISINIPE-GAFETVDDNGIRAVRSGRYTLYAG 668


>gi|371776901|ref|ZP_09483223.1| beta-glucosidase [Anaerophaga sp. HS1]
          Length = 720

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 254/753 (33%), Positives = 377/753 (50%), Gaps = 98/753 (13%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           + D  L    RAK L+  +TL EK+  +G     V RL +P Y WW+EALHGV+  G   
Sbjct: 29  FRDEALDIETRAKALLSELTLKEKISLLGYNNPPVERLQIPAYNWWNEALHGVARAGE-- 86

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   A+F+ +L  +I   +STEAR+ YN+  +       
Sbjct: 87  --------------ATVFPQAIALAATFDTTLVYRIADAISTEARSKYNINRSKGFQNQY 132

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            G+TFW+PNIN+ RDPRWGR  ET GEDP++       +V+GLQ  E          R L
Sbjct: 133 LGITFWTPNINIFRDPRWGRGQETYGEDPFLTASMGKAFVKGLQGSE--------PERRL 184

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K +A  KH+A +        DR HF++ V E+D++ET++  F+  V  G V+++MC+YNR
Sbjct: 185 KTAAGAKHFAVHSGPE---ADRHHFNAVVDEKDLRETYLPAFKALVENG-VTTIMCAYNR 240

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           VNG P C    LL   +R +W F G +V+DC ++  I   HK +  T+ +  A  +KAG+
Sbjct: 241 VNGEPCCTGKTLLQDILRDEWGFKGQVVTDCWALDDIWLRHKTI-PTRVEVAAAAVKAGV 299

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNIC 363
           +LDC +        A+++  +    +D++L       ++LG++D      Y++ G +++ 
Sbjct: 300 NLDCANILQEDVQDAIEKRLLTLEQVDSALLPTLQTQLKLGFYDDPSHSPYRHYGIDSVN 359

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
           N  HI LA EAA + +VLLKND G LPL    I ++ +VG +A +  A+ GNY G     
Sbjct: 360 NSYHISLAKEAAEKSMVLLKND-GILPLKKDTISSIMVVGENAASISALTGNYHGLSGNM 418

Query: 424 TSPMDGFYAYS---KVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
            + ++G          + Y  GC+     + S     I AA   D T+ V GL   +E E
Sbjct: 419 VTFVEGLVKAGGPGMSVQYDYGCS---FADTSHF-GGIWAAGFTDVTIAVIGLSPLLEGE 474

Query: 481 ---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKS 531
                    G D+ DL +P      + K+ ++   PV  V+    A+DI+  +  P   +
Sbjct: 475 HGDAFLSNWGGDKKDLRMPRSHEIYLKKLRESHNHPVIAVVTGGSALDISAIE--PYADA 532

Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYK 591
           I++  YPGE+GG A+AD+IFG+ +P GRLPIT+Y+      PY         N   RTY+
Sbjct: 533 IIYAWYPGEQGGTALADLIFGEVSPSGRLPITFYKDIKDLPPYHDY------NMTNRTYR 586

Query: 592 FFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID 651
           +F G V+YPFGYGLSYT F Y+  S P +                           V  D
Sbjct: 587 YFQGDVLYPFGYGLSYTSFHYEWLSKPST--------------------------KVSED 620

Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKV 711
           D+       +  I V N G MD  EV+ VY   P I    ++++ G+ R+ I AGQ+   
Sbjct: 621 DI------ISVNIAVTNTGTMDADEVIQVYIVYPDIERMPLRELKGFSRIHIKAGQTQNT 674

Query: 712 GFTMNACKSLKIVDNAANSL-LASGAHTILVGE 743
              +   K+LK  D+  N   L  G + I V +
Sbjct: 675 DIQI-PVKNLKKWDSKNNRWKLYKGKYKIQVSQ 706


>gi|288924872|ref|ZP_06418809.1| beta-glucosidase [Prevotella buccae D17]
 gi|288338659|gb|EFC77008.1| beta-glucosidase [Prevotella buccae D17]
          Length = 721

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 255/743 (34%), Positives = 379/743 (51%), Gaps = 98/743 (13%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
             AK+++ RMT+ EK+ Q+ + +  +  LG+  Y+WWSE LHGV   GR           
Sbjct: 32  RHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR----------- 80

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--------NAGLTFWSPN 134
                AT FP  I   A+F+E+L ++IG  V+TE RA +N+         NAGLTFWSPN
Sbjct: 81  -----ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVARKLKNYSRNAGLTFWSPN 135

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           +N+ RDPRWGR +ET GEDP + G     YVRGLQ           D+  LK  AC KHY
Sbjct: 136 VNIFRDPRWGRGMETYGEDPLLSGMLGTAYVRGLQ---------GDDAFYLKTGACAKHY 186

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A +     EG  R   D   + +D+ ET++  F+M V +G V +VM +YNRV G P    
Sbjct: 187 AVH--SGPEGT-RHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGS 243

Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
             LL   +R  W F+G+IVSDCD+I      H+++  T E+A A  +KAGL+++CG  + 
Sbjct: 244 KYLLTDILRKSWGFNGHIVSDCDAINDFYGGHRYVK-TPEEACAAAIKAGLNVECGHTFK 302

Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF--DGSPQYKNLGKNNICNPQHIELAA 372
               GA+ QG +AEAD+D +L  L +  ++LG    D +  Y +  ++ IC+P H  LA 
Sbjct: 303 AM-QGALDQGLLAEADLDRALFPLVMTRLKLGILEPDSACPYNSYDESEICSPAHTALAL 361

Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA 432
            AA + +VLLKN NG LPL+  NI+TL + GP A+    ++GNY G   RY++ + G  +
Sbjct: 362 RAADEAMVLLKN-NGILPLDK-NIRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVS 419

Query: 433 Y---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE--------- 480
                  +N+ P    I  + N M   A++ A  A+  ++V G + ++E E         
Sbjct: 420 RVSSGTSVNFRPAFMQITEELNDM-NWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASAS 478

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
             DRV + LP  Q   + +V  A KG   +V+++ G+  I+  + +    +++   YPG+
Sbjct: 479 RGDRVGIGLPASQMNYLRRV-KARKGGRIVVVLTGGS-PIDLREISKLADAVVMAWYPGQ 536

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
           EGG A+ D++FG  N  GRLPIT+           S+P     +  GRTYK+  G V+YP
Sbjct: 537 EGGEALGDLLFGDKNFSGRLPITF------PADVDSLPAFDDYSMNGRTYKYMSGNVMYP 590

Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
           FGYGLSY +  Y  A                      VG  K             K    
Sbjct: 591 FGYGLSYGRVTYTDAR--------------------VVGRIK-------------KGEPL 617

Query: 661 TFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
             ++ + N G     EV   Y + P    G+ +  ++G+ RV I    S K  F +   +
Sbjct: 618 AVEVVLTNNGDRTIDEVAQAYIATPTAGKGSPMASLVGFRRVSIPPKSSVKAVFKIVPER 677

Query: 720 SLKIVDNAANSLLASGAHTILVG 742
            + I  + ++ LL  G +T+ +G
Sbjct: 678 LMTIQSDGSSKLL-KGNYTLTIG 699


>gi|402074909|gb|EJT70380.1| hypothetical protein GGTG_11406 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 793

 Score =  377 bits (969), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 259/767 (33%), Positives = 389/767 (50%), Gaps = 79/767 (10%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD  L   ERA  LV  + + EK+  +   A G  R+GLP Y WWSEALHGV++      
Sbjct: 44  CDRSLSPSERAAALVAALNVTEKMANLVSNANGSARIGLPKYNWWSEALHGVAYA----- 98

Query: 75  SPPGTHFDSEVPG----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
             PGT F    PG    +TSFP  +L  ASF++SL +KIG  + TE+RA  N   +GL +
Sbjct: 99  --PGTQF-RRGPGDFNSSTSFPMPLLLAASFDDSLIEKIGDVIGTESRAFGNGRWSGLDY 155

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           W+PN+N  +DPRWGR  ETPGED   + RYA + ++GL+       H + + R   + + 
Sbjct: 156 WTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIKGLEGP-----HPEKERR---VVST 207

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKHYAA D ++W G  R  FD+R++ QD+ E +++PF+ C  +  V S+MC+YN VNG+P
Sbjct: 208 CKHYAANDFEDWNGTSRHDFDARISAQDLAEYYLMPFQQCARDSRVGSIMCAYNAVNGVP 267

Query: 251 TCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL 307
           +CA+  LL+  +R  W + G   Y+ SDC+++  +   HK+   T  +  A   +AG D 
Sbjct: 268 SCANSYLLDTVLRKHWGWTGHNNYVTSDCEAVLDVSAGHKYAR-TNAEGTAMCFEAGTDT 326

Query: 308 DCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNICNPQ 366
            C    ++   GA  QG + E  +D +L  LY  L+R+GYFDG S  + ++   ++  P 
Sbjct: 327 SCEYTPSSDIRGAYAQGLLREETMDRALLRLYEGLVRVGYFDGNSSAFSDISWADVNAPA 386

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKT-------------LALVGPHANATKAMI 413
             +L+ ++A +GIV+LKND G LPL  G   +             LA++G  A+A + + 
Sbjct: 387 AQDLSLQSAVEGIVMLKND-GTLPLPLGAKCSSKSKKRSSSGGPKLAMIGFWADAPEKLR 445

Query: 414 GNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQN------NSMIPAAIDAAKNADAT 467
           G Y GT     +P    YA  ++          V Q       ++    A+ AA+ AD  
Sbjct: 446 GGYSGTAAYLRTPA---YAARQMGLDVVTAGGPVLQGAAAAAADNWTAPALAAAEGADYI 502

Query: 468 VIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
           V   GLD +   E KDR D+  PG Q  L+ ++  AA G   +V+     +D      N 
Sbjct: 503 VYFGGLDETAAGENKDRWDVEWPGAQLALVKRL--AALGKPLVVVQMGDQLDGTPLLANA 560

Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRP--VNN 584
            + ++LW  +PG++GG A+  ++ G  +P GRLP+T Y ANY + +P T M LRP    +
Sbjct: 561 GVGAVLWASWPGQDGGPAVMRLLSGAASPAGRLPVTQYPANYTRLVPMTEMALRPSASGS 620

Query: 585 FPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSP----KSVDIKLDKDQQCRDINYTVGT 640
            PGRTY+++  PV+ PFG+GL YT F   V   P     S        + CRD +     
Sbjct: 621 RPGRTYRWYSTPVL-PFGFGLHYTNFTPAVTVPPALAAASGVTTSSLLEACRDPHPERCA 679

Query: 641 NKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYE 699
             P                   ++ V N G+     V + + S   G     IK +  Y 
Sbjct: 680 LPP------------------LRVAVANTGRRASDYVALAFVSGDYGPRPRPIKTLAAYA 721

Query: 700 RVF-IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
           R+  + AG SA+          +   D   N++L  G + + + E V
Sbjct: 722 RLRGVRAGGSAEADLAWT-LGDIARHDEDGNTVLYPGTYKVQIDEPV 767


>gi|325192664|emb|CCA27085.1| unnamed protein product [Albugo laibachii Nc14]
          Length = 2278

 Score =  377 bits (968), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 255/771 (33%), Positives = 391/771 (50%), Gaps = 95/771 (12%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAY---GVPRLGLPLYEWWSEALHGVSF 68
           FP+C++ L    R +DL++R+ L EKV+ +   A     +PRLG+P Y W +  +HGV  
Sbjct: 34  FPFCNSSLSLDLRVEDLLQRLQLDEKVRMLTARASTHGSIPRLGVPEYNWGANCVHGV-- 91

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG---- 124
                 S  GTH       ATSFP  +   A F+ +   K+ Q +  E RA+   G    
Sbjct: 92  -----QSTCGTH------CATSFPNPVNLGAIFDPNEIYKMAQVIGKELRALRLEGAREN 140

Query: 125 -----NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD 179
                + GL  WSPNIN+ RDPRWGR +ETP EDPYV  +Y + Y +GLQ+ +       
Sbjct: 141 YARGPHIGLDCWSPNININRDPRWGRAMETPSEDPYVNAKYGVAYTKGLQEGQ------- 193

Query: 180 SDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
            DSR L+     KHY AY  +N+ G DR  FD+ V+  D  +T+   FE  V +G    +
Sbjct: 194 -DSRFLQAVVTLKHYLAYSYENYGGTDRTQFDAIVSAYDFADTYFPAFEASVVDGKAKGI 252

Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
           MCSYN +NGIPTCA+ K LNQ +R D  F GYI SD  +IQ I + HK+   T  +A   
Sbjct: 253 MCSYNSLNGIPTCAN-KWLNQLLRDDLEFDGYITSDTGAIQGIFDGHKY-TKTLCEATKI 310

Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK 359
            +++G+D+  G+ Y N  +  +       A ID ++R    +  +LG FD      + G 
Sbjct: 311 AMESGVDICSGNAYWN-CLKQLANSTNFSASIDEAIRRTLKLRFQLGLFDAIGDQPHFGP 369

Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
            ++   + ++L+ + AR+ IVLL+N    LPL  G    +A++GPH+   + ++GNY G 
Sbjct: 370 EDVRTAKSLQLSLDLARKSIVLLQNHGNTLPLRLG--LRIAVIGPHSMTRRGIMGNYYGQ 427

Query: 420 PCR--------YTSPMDGFYAYSKVIN--YAPGCADIVCQNNSMIPAAIDAAKNADATVI 469
            C           SP++   + +   N  +  GC  I   + +    A+ A + AD  V+
Sbjct: 428 LCHGDYDEVRCIQSPLEAIQSVNGRNNTHHVNGCG-INDTSTAEFDDALQAVRTADVAVL 486

Query: 470 VAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
             G+D+S+E E KDR ++ +P  Q EL+  +  A K P  +V+ + G + I   K     
Sbjct: 487 FLGIDISIERESKDRDNIDVPHIQLELLKAIRVAGK-PTVVVLFNGGILGIE--KLILYA 543

Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANY---VKIPYTSMPLRPVNNFP 586
            S+L   YPG  G +AIA+++FG  NP G+LP+T Y +N+   V +   SM L     +P
Sbjct: 544 DSVLEAFYPGFFGAQAIAEILFGSINPSGKLPVTMYRSNFINDVDMKSMSMTL-----YP 598

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GR+Y+++    VY FG+GLSYT F      S +S+D         R +N+ V T +P   
Sbjct: 599 GRSYRYYTEVPVYSFGWGLSYTTF------SIQSID-----SHDTRAMNH-VLTAQPK-- 644

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-----IKQVIGYERV 701
                          ++I + N GK  G EV+  + +P  I  T       +Q+  Y RV
Sbjct: 645 --------------MYRILITNNGKYYGEEVLFAFFRPLDIHATGPVESLQQQLFNYTRV 690

Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGV-GGVSFP 751
            +  G   +V   +   ++L + D   N  +  G + +++  GV   ++FP
Sbjct: 691 RLDPGDMREVPLHVKD-ENLALHDRNGNLCVFEGFYELIISNGVEEQLTFP 740


>gi|373460527|ref|ZP_09552278.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
 gi|371955145|gb|EHO72949.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
          Length = 699

 Score =  377 bits (968), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 247/745 (33%), Positives = 375/745 (50%), Gaps = 103/745 (13%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           ++A+ L+  MTL EK+ QM +   G+PRLG+  Y+WW+E LHGV   GR           
Sbjct: 11  QKARRLINMMTLDEKISQMMNETPGIPRLGIKPYDWWNEGLHGVGRDGR----------- 59

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN--------AGLTFWSPN 134
                AT FP  I   A+FN +L ++IG  ++TE RA YN+           GLTFWSPN
Sbjct: 60  -----ATVFPQPIGMGATFNPALIRQIGDAIATEGRAKYNVAQRNNNYARYTGLTFWSPN 114

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           IN+ RDPRWGR +ET GEDP++ G   I YV+G+Q          +D   LK++AC KHY
Sbjct: 115 INIFRDPRWGRGMETYGEDPFLTGTLGIAYVQGMQ---------GNDPFYLKVAACGKHY 165

Query: 195 AAYDLDNWEGNDRFHFDSRV--TEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           A +      G +    ++ V  T++D+ ET++  F+M V +G V ++M +YNRV G    
Sbjct: 166 AVHS-----GPEATRHEANVSPTKRDLFETYLPAFKMLVQQGHVEAIMGAYNRVYGEACS 220

Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
               LL   +R  W F G+IVSDCD++  I   HK +  T+ +A A  +KAGL+++CG  
Sbjct: 221 GSKYLLTDVLRKQWGFRGHIVSDCDAVADIHAGHKIVK-TEAEACAIAIKAGLNIECGHT 279

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY--FDGSPQYKNLGKNNICNPQHIEL 370
           +      AV Q  + E +ID +L  L +  ++LG   +D    Y  + +  IC+P+HI L
Sbjct: 280 FEAMKQ-AVAQKLLTEQEIDRALLPLMMTRLKLGILEYDAECPYNEVKETEICSPEHIAL 338

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A +AA + +VLLKN NG LPL+  N+ TL + GP A+ +  ++GNY G   RY + + G 
Sbjct: 339 ARKAATESMVLLKN-NGILPLDK-NLHTLFIAGPGASDSFWLMGNYFGISNRYCTYLQGI 396

Query: 431 ---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG------ 481
               +    +N+ P   +     N+ I  A+D A  A+ T++V G + ++E E       
Sbjct: 397 ADKVSSGTAVNFRPAFGESTPTKNT-INWALDEAIAAEKTIVVMGNNGNLEGEEGESIAS 455

Query: 482 ---KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
               DRV + LP  Q + +  +  A K  + +V+     +D+           + W  YP
Sbjct: 456 ETRGDRVSMRLPASQMKFLRDL-KARKNGIVVVLTGGSPIDVREISRLADAVVMAW--YP 512

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVV 598
           G+EGG A+AD++FG  N  GRLP+T+ E+     P+    ++      GRTYK+    + 
Sbjct: 513 GQEGGYALADLLFGDENFSGRLPVTFPESTDALPPFEDYAMK------GRTYKYQTAHIQ 566

Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
           YPFGYGLSYT   Y  A                                  ++ +  K  
Sbjct: 567 YPFGYGLSYTTVTYAHAK---------------------------------VETMPQKGR 593

Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPGIAGT-HIKQVIGYERVFIAAGQSAKVGFTMNA 717
             T    ++N G     EV  VY   PG   T  +  ++ ++R+ +  G+   V F +  
Sbjct: 594 GMTVSAVLKNTGNKAVDEVAQVYLSAPGAGTTAALASLVAFKRIGLQPGEQQLVRFDIPF 653

Query: 718 CKSLKIVDNAANSLLASGAHTILVG 742
            + L + ++    LL  G +TI VG
Sbjct: 654 DRLLTVQEDGTAQLL-KGNYTITVG 677


>gi|332307852|ref|YP_004435703.1| glycoside hydrolase family protein [Glaciecola sp. 4H-3-7+YE-5]
 gi|332175181|gb|AEE24435.1| glycoside hydrolase family 3 domain protein [Glaciecola sp.
           4H-3-7+YE-5]
          Length = 733

 Score =  377 bits (967), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 254/763 (33%), Positives = 380/763 (49%), Gaps = 94/763 (12%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           +D P+ D +LP  ER   L++ MTL EK  Q+ +    + RLGLP Y++W+EALHGV+  
Sbjct: 22  NDQPWFDTQLPTQERIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEALHGVARN 81

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGN 125
           GR                AT FP  I   A+F++ L  K    +S EARA +N    +GN
Sbjct: 82  GR----------------ATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGN 125

Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
               +GLTFW+PNIN+ RDPRWGR  ET GEDPY+  +     V GLQ            
Sbjct: 126 RSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQG---------DH 176

Query: 182 SRPLKISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
            + LK +A  KH+A +      G +  R  FD+  + +DM ET+   FE  + E +V +V
Sbjct: 177 PKYLKTAAAAKHFAVHS-----GPEALRHEFDAIASPKDMYETYFPAFEALITEANVETV 231

Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
           M +YNRVNG P      LLN  +R  W F G++VSDC  +    + HK   +  E A A 
Sbjct: 232 MAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESA-AL 290

Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNL 357
            +  G DL+CG  Y N    AV+ G + E  ID  L  +     +LG+FD      Y N+
Sbjct: 291 AINTGTDLNCGAVY-NALPDAVEAGLVDEKTIDKRLSKVLATKFKLGFFDPKDDNPYNNI 349

Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
             + + +  H ++A E A + IVLL+N N  LPL+  NI+ L + GP A++++ ++GNY 
Sbjct: 350 SADVVNSEAHAQVAYEMAVKSIVLLQNKNNILPLDR-NIRNLYVTGPFASSSEVLLGNYY 408

Query: 418 GTPCRYTSPMDGFYAYSKV---INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
           G   + T+ +DG  A   V   INY  G        N +     +A +  D  + V GL 
Sbjct: 409 GLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLS 468

Query: 475 LSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
            + E E           DR+ L LP  Q   + K+      PV +V+++AG   +N  + 
Sbjct: 469 GAYEGEEGEAIASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPV-IVVLTAG-TPVNLTEI 526

Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
                +I++  YPG+EGG+A+AD++FG+ +P GRLPIT+ ++     PY    ++     
Sbjct: 527 AELADAIVFAWYPGQEGGKAVADILFGERSPSGRLPITFPKSEAQLPPYDDYSMQ----- 581

Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
            GRTY++     +YPFG+GLSY Q K+         +I L   Q           N+P  
Sbjct: 582 -GRTYRYMTQEPMYPFGFGLSYAQVKFD--------NITLGNTQAL------ASKNEP-- 624

Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIA 704
                          T  + V N G+ +  EVV +Y K P    +  +  + G+ R+ +A
Sbjct: 625 -----------QENMTVTVNVTNTGEREFEEVVQLYLKTPDAGVSQPLHSLKGFTRIKLA 673

Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGG 747
           AGQ+ +V F++   K L  ++     +L  G ++++VG    G
Sbjct: 674 AGQTEQVLFSI-PKKHLYSINEQGKPVLLKGQYSVIVGNASPG 715


>gi|320161274|ref|YP_004174498.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
 gi|319995127|dbj|BAJ63898.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
          Length = 712

 Score =  376 bits (966), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 256/775 (33%), Positives = 394/775 (50%), Gaps = 116/775 (14%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y +   P  ER  DL+ RMTL EK+ QM +    +PRLG+P Y++WSEALHGV+  G+  
Sbjct: 8   YLNPDAPLEERVNDLISRMTLEEKISQMCNSCAAIPRLGIPAYDYWSEALHGVARNGK-- 65

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN-----LGNA-- 126
                         AT FP  I   A+++  L +++   +++EARA ++      G    
Sbjct: 66  --------------ATVFPQAIGMAATWDTELIERVADAIASEARAKFHETLRKFGKTDI 111

Query: 127 --GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLT WSPNIN+ RDPRWGR  ET GEDPY+ G     +VRGLQ           D   
Sbjct: 112 YQGLTMWSPNINIFRDPRWGRGQETWGEDPYLTGEMGAAFVRGLQG---------KDPHY 162

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
           LK +AC KHY  +     E   R  F++ VT +++ +T++  F+  V E  V +VM +YN
Sbjct: 163 LKTAACAKHYTVHSGPEKE---RHTFNAIVTRRELFDTYLPAFKKLVTEAKVEAVMGAYN 219

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           R  G P C  P LL + +R  W F G++VSDC +I      H+   D  E A A  +K G
Sbjct: 220 RTLGEPCCGSPYLLKEILRNQWGFKGHVVSDCGAINDFHLHHQVTKDGAESA-ALGIKNG 278

Query: 305 LDLDCGDYYT--NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLG 358
            D+ C   Y+  N T  A+ +G I E DID +LR       +LG FD  PQ    Y ++ 
Sbjct: 279 CDMACICTYSYENLTE-ALNRGLITEEDIDHALRNTLRTRFKLGLFD--PQEKVPYAHIS 335

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
            + +    H +LA E A +  VLLKN N  LP+   ++K++ +VGP+A     ++GNY G
Sbjct: 336 MSVVGCEAHRKLAYETAVKSAVLLKNHNHILPVKP-DVKSILIVGPNAGNVHVLLGNYYG 394

Query: 419 TPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL 475
                T+ M+G          + + PG    +  ++  I      A  A   +++A + L
Sbjct: 395 LSDSMTTFMEGLVGRLPEGVRMEFMPGS---LLTDSKKIKNDWSVASAASFDLVIAFMGL 451

Query: 476 SVEAEGK----------DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
           S   EG+          DR D+ LP  Q E I  +A A    + LV+    A+ +N  ++
Sbjct: 452 SPLLEGEEGEAILSDNGDREDIALPKAQQEYIRDLA-ATGAKIVLVLTGGSAIALNGIED 510

Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
              +++ILWVGYPG+EGGRAIAD+IFG ++P G+LPIT+        P ++  L P   +
Sbjct: 511 --LVEAILWVGYPGQEGGRAIADLIFGDHSPSGKLPITF--------PVSTDQLPPFREY 560

Query: 586 P--GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKP 643
               RTY++     ++PFG+GLSYTQF+YK        +++L+                P
Sbjct: 561 SMKERTYRYMTSSPLFPFGFGLSYTQFEYK--------NLQLE---------------HP 597

Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF 702
             +A        +  + TF  E+ N+G+ +G EVV VY S         ++++I ++RV 
Sbjct: 598 VLSA-------GEALRGTF--ELANVGEYEGEEVVQVYLSDLEASTIVPLQKLISFQRVR 648

Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLN 757
           +  G++ ++ F +   +++ ++D+  N +L  G   + +G        P+Q +L+
Sbjct: 649 LKPGETVQLSFAIQP-EAMMMIDDEGNQVLEPGKFKLTIGGAA-----PIQRSLD 697


>gi|333494646|gb|AEF56854.1| putative glycosyl hydrolase [synthetic construct]
          Length = 743

 Score =  376 bits (965), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 261/760 (34%), Positives = 377/760 (49%), Gaps = 109/760 (14%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D  L + ERA+DLV RMTL EK+ QM   A  + RLG+P Y WW+EALHGV+  G   
Sbjct: 30  YRDENLSFEERARDLVSRMTLEEKIAQMQHEAPSIERLGVPAYNWWNEALHGVARAGV-- 87

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN-------- 125
                         +T FP  I   A+F+  L +K    +STE RA Y+           
Sbjct: 88  --------------STMFPQAIGMAATFDAELIEKTADVISTEGRARYHEFQRKGDRDIY 133

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLTFWSP IN+ RDPRWGR  ET GEDPY+  R A++++RG+Q             R L
Sbjct: 134 KGLTFWSPTINIDRDPRWGRGQETYGEDPYLTSRLAVSFIRGIQ----------GRGRYL 183

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K +AC KH+A +       ++R  F++ V+++D+ ET++  FE  V E  V+ VM +YNR
Sbjct: 184 KAAACAKHFAVHSGPE---SERHQFNAEVSQKDLWETYLPAFEASVKEAKVAGVMGAYNR 240

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           VNG P C    LL   +RG+W F GY+ SDC +I+ I E H  +  T E++ A  +K+G 
Sbjct: 241 VNGEPCCGSGTLLGDVLRGEWEFGGYVTSDCWAIKDINEGHG-VTKTIEESSALAVKSGC 299

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLG-KNNI 362
           DL+CG  Y +  + A + G I E +IDT++  L +  MRLG FD   +  Y ++  + N 
Sbjct: 300 DLNCGCAYASL-VKAYRAGLIGEKEIDTAVHRLMLTRMRLGMFDAPEKVPYSSIPYEKND 358

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
           C  +H   A E A + +VLL+N +G LPL+   I+++A++GP+A++  A+ GNY GT   
Sbjct: 359 C-AEHRAFALEVAEKSLVLLRNRSGFLPLDRSRIRSVAVIGPNADSRVALEGNYNGTASE 417

Query: 423 YTSPMDGFYAY---SKVINYAPGCADI------VCQNNSMIPAAIDAAKNADATVIVAGL 473
           Y + +DG          + YA G          + Q N  +  A  AA+ AD  V+  GL
Sbjct: 418 YVTVLDGIREAVGDRARVYYAEGSHLFRNSMGGLSQKNDRLAEAAAAAERADVAVVCLGL 477

Query: 474 DLSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAK 524
           +  +E E           D+ DL LPG Q EL+  V  A   PV LV++S  A+ +N+A 
Sbjct: 478 NRDIEGEEGDPSNEYPAGDKRDLRLPGLQEELLETV-KATGTPVVLVLLSGSALAVNWAD 536

Query: 525 NNPKIKSILWVGYPGEEG-GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVN 583
            N       W  YPG +  GR  A  +FG   P G  P         +    +   R   
Sbjct: 537 ENADAVVQAW--YPGAQAEGRRGA--LFGIIRPAGGFP--------SRSTVRTRTSRIFG 584

Query: 584 NFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKP 643
                      G  +YPFGYGLSYT+F+Y         D+KL               ++ 
Sbjct: 585 TIHENRLPLLQGDPLYPFGYGLSYTKFQYG--------DLKL-------------AASEI 623

Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIK-QVIGYERVF 702
           P      +D +         + V N G+ D  EVV +Y +    +    K Q+ G+ RV 
Sbjct: 624 PAG----EDAEV-------SVTVRNAGERDSDEVVQLYLQDLESSVPVPKWQLAGFRRVH 672

Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +  G+SA V FT+ A + + ++D     +L  G   +  G
Sbjct: 673 LKPGESAGVRFTV-AARQMALIDEDGRCVLEPGGFRVYAG 711


>gi|317474362|ref|ZP_07933636.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316909043|gb|EFV30723.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 723

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 260/764 (34%), Positives = 377/764 (49%), Gaps = 119/764 (15%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY +  L   ERA DLV R+TL EK+  M + +  V RLG+  YEWW+EALHGV+  G  
Sbjct: 24  PYQNKSLSPTERAADLVSRLTLEEKITLMQNNSSAVKRLGIKPYEWWNEALHGVARNGL- 82

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY----NLGN--- 125
                          AT +P  I   ASFN++L  ++  ++S EAR  Y      GN   
Sbjct: 83  ---------------ATVYPQAIGMGASFNDTLLYQVFTSISDEARVKYRQAREAGNYKR 127

Query: 126 -AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLTFW+PNIN+ RDPRWGR  ET GEDPY+  R  ++ V GLQ  +  +Y+       
Sbjct: 128 YTGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLSVVNGLQGPQNTKYN------- 180

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            K  AC KHYA +    W   +R  F++  +  +D+ ET++  F+  V +G+V  VMC+Y
Sbjct: 181 -KTHACAKHYAVHSGPEW---NRHSFNAENINPRDLWETYLPAFQDLVIQGNVKEVMCAY 236

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV-----ESHKFLNDTKEDAVA 298
           NR  G P C   +LL   +R +WN+ G +VSDC +I         E+HK     K DA A
Sbjct: 237 NRFEGDPCCGSDRLLINILRNEWNYKGLVVSDCGAIDNFYFKGRHETHK----NKADASA 292

Query: 299 RVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLG 358
             + +G DL+CG  YT   + AV++G I E+ ID SL  L      LG  D +  +  L 
Sbjct: 293 AAVLSGTDLECGRSYTGL-ISAVKEGLINESAIDQSLCRLMKARFELGEMDDTTPWDQLP 351

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
            + +    H +LA + AR+ + LL+N    LPL+     T+AL+GP+AN +     NY G
Sbjct: 352 DSLLSCHAHQQLALQMARESMTLLQNHKNILPLDKE--MTVALIGPNANDSVMQWANYNG 409

Query: 419 TPCRYTSPMDGFYAY--SKVINYAPGCADIVCQNNSM-------IPAAIDAAKNADATVI 469
            P    + ++G   Y   + + Y P   +I  Q           I A I+ A  AD  + 
Sbjct: 410 FPVHTITLLEGLTQYLPQERLIYIPQ-KNIEVQKYPWVNYYPNDIQAVINQAAKADVIIY 468

Query: 470 VAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
             G+  S+E E          G DR  + LP  Q +L+ K   A   P+  V  S  A+ 
Sbjct: 469 AGGISASLEGEEMDVDAEGFRGGDRTTIELPNVQRKLV-KALKATGKPIVFVNFSGCAMG 527

Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
           +     +    +IL   YPG+ GG AIA+V+FG YNP GRLPIT+Y+ +        +P 
Sbjct: 528 LQ--PESQICDAILQAWYPGQAGGTAIAEVLFGDYNPAGRLPITFYKKD------NQLPD 579

Query: 580 RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
               N  GRTY++ +   +YPFG+GLSYT F Y   S+P                     
Sbjct: 580 FEDYNMQGRTYRYLNYEPLYPFGHGLSYTTFSY---STP--------------------- 615

Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYE 699
                     I++ K K       ++V N G  +G EV+ +Y K        +K + G++
Sbjct: 616 ---------FIENGKLK-------VKVTNSGNYNGDEVIQLYIKRYDDPDGPLKTLRGFQ 659

Query: 700 RVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
           R+ I AGQ+++V F + +  +    D  +N++    G + ILVG
Sbjct: 660 RIHIPAGQTSEVSFPLTS-DTFTWWDKDSNTVHPLQGRYKILVG 702


>gi|372209074|ref|ZP_09496876.1| glycoside hydrolase family protein [Flavobacteriaceae bacterium
           S85]
          Length = 727

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 256/768 (33%), Positives = 389/768 (50%), Gaps = 108/768 (14%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
           D  + D      ERA+ LV +MTL EK+ Q+ + A  + RL +P Y+WW+EALHGV+  G
Sbjct: 18  DLSFLDTDKSIEERAEILVSQMTLKEKIAQLKNTAPAISRLKVPDYDWWNEALHGVARNG 77

Query: 71  RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY----NLGN- 125
           +                AT FP  I   A+F+  L  ++   +STEARA Y     +GN 
Sbjct: 78  K----------------ATIFPQGIGIGATFDPDLALRVASAISTEARAKYTISQQMGNH 121

Query: 126 ---AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
              AGLTFW+PN+N+ RDPRWGR  ET GEDPY++ +  + +V+GLQ           D 
Sbjct: 122 SRYAGLTFWTPNVNIFRDPRWGRGQETFGEDPYLMTQMGVAFVKGLQG---------DDP 172

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             LK +AC KHYA +   +   + R  F++  T+QD+ ET++  FE  V + +V  VM +
Sbjct: 173 NYLKSAACAKHYAVH---SGPESLRLEFNAVPTQQDLYETYLPAFEALVKDANVEGVMPA 229

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           +N V G P  A+  LL   +R  W F GY+V+DC +I+ I   HK++ D++  A A  LK
Sbjct: 230 HNAVFGAPMAANKFLLTDVLRDRWGFDGYVVTDCGAIKQIKVGHKYV-DSEVAAAAVALK 288

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD---GSPQYKNLGK 359
           AG +L+CG  Y      A+ QG + E  +    + L+    RLG FD       Y  +G 
Sbjct: 289 AGTNLNCGATYKELKK-AIDQGLVTEELVHERTKQLFKTRFRLGMFDKDLSKNPYSKIGP 347

Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
             I + +HIELA EAA++ IV+LKN N  LPL T +IK   + GP AN++  ++G+Y G 
Sbjct: 348 ELIHSKEHIELAREAAQKSIVMLKNKNNLLPLPT-DIKVPYVTGPFANSSDMLMGSYYGV 406

Query: 420 PCRYTSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
                + + G     +    +NY  G      +N +    A + A  +D T+ V GL   
Sbjct: 407 SPGVVTILAGITDAVSLGTSLNYRSGALPF-QKNINPKNWAPNVAGMSDVTICVVGLTAD 465

Query: 477 VEAEG---------KDRVDLLLPGFQTELINKVADAAK-GPVTLVIMSAGAVDINFAKNN 526
            E EG          DR+DL LP  Q   + ++A   K  P+ LVI S   V +   + +
Sbjct: 466 REGEGVDAIASNHKGDRLDLKLPENQINYVKQLAAKKKDKPLVLVIASGSPVSLEGIEEH 525

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
               +IL + YPGE+GG A+ADV+FGK +P G LP+T+ ++         +P     +  
Sbjct: 526 --CDAILQIWYPGEQGGNAVADVLFGKVSPTGHLPMTFPKS------VAQLPDYKDYSMK 577

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTYK+     ++PFG+GL+Y++ ++K                                 
Sbjct: 578 GRTYKYMTEEPMFPFGFGLTYSKTEFK--------------------------------- 604

Query: 647 AVLIDDVKC-KDYKFTFQIEVENMGKMDGSEVVMVYSKPP------GIAGTHIKQVIGYE 699
            ++++D K  K       +EV N+G  D  E+V +Y  P       G+  T +K    ++
Sbjct: 605 NLVVEDAKLRKKESLKVSVEVTNVGDFDIDEIVQLYISPKSQKEGEGLPFTTLK---AFK 661

Query: 700 RVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGG 747
           RV +  G++ KV FT++  +SLK+++     +   GA+ + VG    G
Sbjct: 662 RVALKKGETQKVEFTIHP-ESLKVINVKGQKVWRKGAYKVTVGNSSPG 708


>gi|160881137|ref|YP_001560105.1| glycoside hydrolase family 3 [Clostridium phytofermentans ISDg]
 gi|160429803|gb|ABX43366.1| glycoside hydrolase family 3 domain protein [Clostridium
           phytofermentans ISDg]
          Length = 717

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 220/622 (35%), Positives = 334/622 (53%), Gaps = 67/622 (10%)

Query: 21  YPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTH 80
           + +RA +LV++MTL EKV Q    A  +PRL +  Y +W+EALHGV+  G          
Sbjct: 10  FQQRATELVKKMTLEEKVFQTLHSAPSIPRLDIKAYNYWNEALHGVARAGV--------- 60

Query: 81  FDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWS 132
                  AT FP  I   A+F+E L ++I  T+STE R  +N            GLTFWS
Sbjct: 61  -------ATVFPQAIGLAATFDEDLIEEIADTISTEGRGKFNAQQKYGDHDIYKGLTFWS 113

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PN+N+ RDPRWGR  ET GEDP++ G     +V G+Q           D   LK +AC K
Sbjct: 114 PNVNIFRDPRWGRGHETFGEDPFLSGTLGGRFVDGIQG---------HDETYLKAAACAK 164

Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           H+A +   +   + R  F++ V+EQD++ET++  F+  V E  V +VM +YNR NG P C
Sbjct: 165 HFAVH---SGPEDIRHSFNAEVSEQDLRETYLPAFKKLVKEHKVEAVMGAYNRTNGEPCC 221

Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
               LL   +RG+W F G++ SDC +I+   E H   ++  E +VA  +  G DL+CG+ 
Sbjct: 222 GSKTLLEDILRGEWEFVGHVTSDCWAIKDFHEHHMVTSNAVE-SVALAMNRGCDLNCGNL 280

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG--SPQYKNLGKNNICNPQHIEL 370
           Y N  + AV+ G + E  IDT+L  L+   M+LG FD   S  +  +  + +      EL
Sbjct: 281 YVNL-LQAVRDGLVEEETIDTALIRLFTTRMKLGLFDKEESIPFNTITYDQVDTKSSKEL 339

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
             +A+++ +VLLKN++  LPLN   I ++ ++GP+AN   A++GNYEGT   Y + ++G 
Sbjct: 340 NIKASKKCVVLLKNEDNILPLNPKKITSVGVIGPNANNRNALVGNYEGTASEYITVLEGI 399

Query: 431 YAY---SKVINYAPGCADI------VCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE- 480
                    + ++ GC         + Q N  I       +++D  +   GLD  +E E 
Sbjct: 400 KQVVPEDVRVYFSEGCHLFKNKLSNLSQENDRIAEVRAVCEHSDVVIACLGLDPGLEGEE 459

Query: 481 --------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
                     D+  L LPG Q +++  + +  K PV L+++S  A+ + +A  +  I +I
Sbjct: 460 GDQGNQFASGDKKTLALPGIQEDVLKTIYECGK-PVILILLSGSALAVPWA--DEHIPAI 516

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
           L   YPG +GGRAIA++IFG  NP G+LP+T+Y        +T   ++       RTY++
Sbjct: 517 LQGWYPGAQGGRAIAELIFGDGNPEGKLPVTFYRTTEELPEFTDYAMK------NRTYRY 570

Query: 593 FDGPVVYPFGYGLSYTQFKYKV 614
                +YPFGYGLSYT F++ +
Sbjct: 571 MKNEALYPFGYGLSYTTFEHTL 592


>gi|7671419|emb|CAB89360.1| beta-glucosidase-like protein [Arabidopsis thaliana]
 gi|9758998|dbj|BAB09525.1| unnamed protein product [Arabidopsis thaliana]
          Length = 411

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 192/411 (46%), Positives = 261/411 (63%), Gaps = 21/411 (5%)

Query: 343 MRLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTL 399
           MRLG+FDG+P+   Y  LG  ++C  ++ ELA E ARQGIVLLKN  G+LPL+   IKTL
Sbjct: 1   MRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSAGSLPLSPSAIKTL 60

Query: 400 ALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAID 459
           A++GP+AN TK MIGNYEG  C+YT+P+ G         Y  GC ++ C    +  A   
Sbjct: 61  AVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNVTCTEADLDSAKTL 120

Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
           AA +ADATV+V G D ++E E  DR+DL LPG Q EL+ +VA AA+GPV LVIMS G  D
Sbjct: 121 AA-SADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGPVVLVIMSGGGFD 179

Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMP 578
           I FAKN+ KI SI+WVGYPGE GG AIADVIFG++NP G+LP+TWY  +YV K+P T+M 
Sbjct: 180 ITFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNPSGKLPMTWYPQSYVEKVPMTNMN 239

Query: 579 LRP--VNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINY 636
           +RP   N + GRTY+F+ G  VY FG GLSYT F +++  +PK V + LD+ Q CR    
Sbjct: 240 MRPDKSNGYLGRTYRFYIGETVYAFGDGLSYTNFSHQLIKAPKFVSLNLDESQSCRS--- 296

Query: 637 TVGTNKPPCAAVLIDDVKCKD-----YKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH 691
                 P C ++      C+        F  Q++V N+G  +G+E V +++ PP + G+ 
Sbjct: 297 ------PECQSLDAIGPHCEKAVGERSDFEVQLKVRNVGDREGTETVFLFTTPPEVHGSP 350

Query: 692 IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            KQ++G+E++ +   +   V F ++ CK L +VD      LA G H + VG
Sbjct: 351 RKQLLGFEKIRLGKKEETVVRFKVDVCKDLGVVDEIGKRKLALGHHLLHVG 401


>gi|358380569|gb|EHK18247.1| glycoside hydrolase family 3 protein, partial [Trichoderma virens
           Gv29-8]
          Length = 722

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 256/728 (35%), Positives = 383/728 (52%), Gaps = 62/728 (8%)

Query: 32  MTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI--GRRTNSPPGTHFDSEVPGAT 89
           +TL EK   + + A GV RLGLP YEW +EALHG++ +  G+  NS   T  +     +T
Sbjct: 12  LTLDEKAANLVNNAPGVKRLGLPPYEWRNEALHGLAGVSPGQGINST-FTQGNVAFNSST 70

Query: 90  SFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLET 149
            FP+ I+  A+F++ L   I   VSTEARA  N   AGL +W+PNIN  RDPRWGR  ET
Sbjct: 71  QFPSPIVLGAAFDDHLVHDIATAVSTEARAFSNHLKAGLDYWAPNINPYRDPRWGRGQET 130

Query: 150 PGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFH 209
           PGEDPY V +YA NYV GL+   G            K+ + CKH+A YD+++ +G  R  
Sbjct: 131 PGEDPYHVAQYAYNYVVGLKGGVG--------PAKSKVVSTCKHFAGYDIEDSDGVVRGS 182

Query: 210 FDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFH 269
           +++ ++ QD+ E ++  F  C  +    +VMCSYN VNG P+CA+  +L+  +R  W + 
Sbjct: 183 YNAIISTQDLAEYYLPSFRSCFRDAKTGAVMCSYNAVNGHPSCANSYMLDTVLRDHWGWG 242

Query: 270 G---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKI 326
               ++  DC ++  +   H  +  +    VA  +  G DLDCG  Y +    AVQ    
Sbjct: 243 SSAHWVTGDCGAVDGVFNQHH-VGQSAAQGVAFAINNGTDLDCGTAYASNIASAVQNNYT 301

Query: 327 AEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKN 384
            EA +D +L  LY  L+ LGYFD     +Y+ LG +++  P   +LA  A  +GI +L  
Sbjct: 302 TEAQLDQALSRLYSSLIVLGYFDPPEGQEYRTLGVSDVNTPSTQKLAYTALVEGINILP- 360

Query: 385 DNGALPLNTGNIKTLALVGPHA-NATKAMIGNYEGTPCRYTSPMD--GFYAYSKVINYAP 441
                P+     +T+  VGP A NA+ +M GNY G     T P+      AY+  + Y+ 
Sbjct: 361 ---IRPMG----QTVLFVGPWANNASVSMFGNYNGVAPYKTIPVPTANSSAYNWNVTYSQ 413

Query: 442 GCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVA 501
           G   ++  + S   AA+ AA+ AD  V + G+D  VEAE  DR  +  PG Q  LI ++ 
Sbjct: 414 GLQYVLSNDTSQFAAAVSAAQEADVVVYIGGIDEQVEAEAHDRTSIDWPGAQLNLIKQL- 472

Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
            AA  PV +V +  G VD +    N  +K +LW+GYPG+E G  + D++ G   P GRLP
Sbjct: 473 -AAVKPVVVVQVGGGQVDDSSLLQNKNVKGLLWMGYPGQEFGSGLIDILSGASAPAGRLP 531

Query: 562 ITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQF--KYKVASSP 618
           +T Y ANY+ ++P T   LRP ++ PGRTY++++G V+ PFG G+ YT+F   +K   S 
Sbjct: 532 VTQYPANYITQVPMTDQSLRPSSSNPGRTYRWYNGSVI-PFGTGIHYTKFNISWKTGGSG 590

Query: 619 KSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY-KF-TFQIEVENMGKMDGSE 676
           +      D                       I+    KD  +F  FQI VEN+G      
Sbjct: 591 RGTYDTAD----------------------FINAEDPKDLAEFDVFQINVENVGSTTSDY 628

Query: 677 VVMVY--SKPPGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACKSLKIVDNAANSLLA 733
           V +++  S   G     +K ++ Y R      G++ K+   +N  + +   D++ N +L 
Sbjct: 629 VALLFVKSSDSGPQPYPLKTLVSYARAHGTQPGETTKIDLRVNVGQ-IARNDSSGNLVLY 687

Query: 734 SGAHTILV 741
            GA+T+ +
Sbjct: 688 PGAYTLEI 695


>gi|410648100|ref|ZP_11358515.1| beta-glucosidase [Glaciecola agarilytica NO2]
 gi|410132388|dbj|GAC06914.1| beta-glucosidase [Glaciecola agarilytica NO2]
          Length = 733

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 252/763 (33%), Positives = 380/763 (49%), Gaps = 94/763 (12%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           +D P+ D +LP  +R   L++ MTL EK  Q+ +    + RLGLP Y++W+EALHGV+  
Sbjct: 22  NDQPWFDTQLPTQKRIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEALHGVARN 81

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGN 125
           GR                AT FP  I   A+F++ L  K    +S EARA +N    +GN
Sbjct: 82  GR----------------ATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGN 125

Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
               +GLTFW+PNIN+ RDPRWGR  ET GEDPY+  +     V GLQ            
Sbjct: 126 RSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQG---------DH 176

Query: 182 SRPLKISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
            + LK +A  KH+A +      G +  R  FD+  + +DM ET+   FE  V E +V +V
Sbjct: 177 PKYLKTAAAAKHFAVHS-----GPEALRHEFDAIASPKDMYETYFPAFEALVTEANVETV 231

Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
           M +YNRVNG P      LLN  +R  W F G++VSDC  +    + HK   +  E A A 
Sbjct: 232 MAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESA-AL 290

Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNL 357
            +  G DL+CG  Y N    AV+ G + E  ID  L  +     +LG+FD      Y N+
Sbjct: 291 AINTGTDLNCGAVY-NALPDAVEAGLVDEKTIDKRLSKVLATKFKLGFFDPKDDNPYNNI 349

Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
             + + +  H ++A E A + IVLL+N N  LPL+  NI+ L + GP A++++ ++GNY 
Sbjct: 350 SADVVNSEAHAQVAYEMAVKSIVLLQNKNNILPLDR-NIRNLYVTGPFASSSEVLLGNYY 408

Query: 418 GTPCRYTSPMDGFYAYSKV---INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
           G   + T+ +DG  A   V   INY  G        N +     +A +  D  + V GL 
Sbjct: 409 GLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLS 468

Query: 475 LSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
            + E E           DR+ L LP  Q   + K+      PV +V+++AG   +N  + 
Sbjct: 469 GAYEGEEGEAIASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPV-IVVLTAG-TPVNLTEI 526

Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
                +I++  YPG+EGG+A+AD++FG+ +P GRLPIT+ ++     PY    ++     
Sbjct: 527 AELADAIVFAWYPGQEGGKAVADILFGERSPSGRLPITFPKSEAQLPPYDDYSMQ----- 581

Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
            GRTY++     +YPFG+GLSY Q K+                      N T+G  +   
Sbjct: 582 -GRTYRYMTQEPMYPFGFGLSYAQVKFD---------------------NITLGNTQALA 619

Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIA 704
           +   + +        T  + V N G+ +  EVV +Y K P    +  +  + G+ R+ +A
Sbjct: 620 SKNELQE------NMTVTVNVTNTGEREFEEVVQLYLKTPDAGVSQPLHSLKGFTRIKLA 673

Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGG 747
           AGQ+ +V F +   K L  ++     +L  G ++++VG    G
Sbjct: 674 AGQTEQVLFNI-PKKHLYSINEQGKPVLLKGQYSVIVGNASPG 715


>gi|315607899|ref|ZP_07882892.1| beta-glucosidase [Prevotella buccae ATCC 33574]
 gi|315250368|gb|EFU30364.1| beta-glucosidase [Prevotella buccae ATCC 33574]
          Length = 721

 Score =  374 bits (959), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 255/743 (34%), Positives = 377/743 (50%), Gaps = 98/743 (13%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
             AK+++ RMT+ EK+ Q+ + +  +  LG+  Y+WWSE LHGV   GR           
Sbjct: 32  RHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR----------- 80

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--------NAGLTFWSPN 134
                AT FP  I   A+F+E+L ++IG  V+TE RA +N+         NAGLTFWSPN
Sbjct: 81  -----ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVARKLKNYSRNAGLTFWSPN 135

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           +N+ RD RWGR +ET GEDP + G     YVRGLQ           D+  LK  AC KHY
Sbjct: 136 VNIFRDLRWGRGMETYGEDPLLSGMLGTAYVRGLQG---------DDAFYLKTGACAKHY 186

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A +     EG  R   D   + +D+ ET++  F+M V +G V +VM +YNRV G P    
Sbjct: 187 AVHS--GPEGT-RHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGS 243

Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
             LL   +R  W F+G+IVSDCD+I      H+++  T E+A A  +KAGL+++CG  + 
Sbjct: 244 KYLLTDILRKSWGFNGHIVSDCDAINDFYGGHRYVK-TPEEACAAAIKAGLNVECGHTFK 302

Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF--DGSPQYKNLGKNNICNPQHIELAA 372
               GA+ QG +AEAD+D +L  L +  ++LG    D +  Y +  ++ IC+P H  LA 
Sbjct: 303 AM-QGALDQGLLAEADLDRALFPLVMTRLKLGILEPDSACPYNSYDESEICSPAHTALAL 361

Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA 432
            AA + +VLLKN NG LPL+  NI+TL + GP A+    ++GNY G   RY++ + G  +
Sbjct: 362 RAADEAMVLLKN-NGILPLDK-NIRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVS 419

Query: 433 Y---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE--------- 480
                  +N+ P    I  + N M   A++ A  A+  ++V G + ++E E         
Sbjct: 420 RVSSGTSVNFRPAFMQITEELNDM-NWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASAS 478

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
             DRV + LP  Q   + +V  A KG   +V+++ G+  I+  + +    +++   YPG+
Sbjct: 479 RGDRVGIGLPASQLNYLRRV-KARKGGRIVVVLTGGS-PIDLREISKLADAVVMAWYPGQ 536

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
           EGG A+ D++FG  N  GRLPIT+           S+P     +  GRTYK+  G V+YP
Sbjct: 537 EGGEALGDLLFGDKNFSGRLPITF------PADVDSLPAFDDYSMNGRTYKYMSGNVMYP 590

Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
           FGYGLSY +  Y  A                      VG  K             K    
Sbjct: 591 FGYGLSYGRVTYTDAR--------------------VVGRIK-------------KGEPL 617

Query: 661 TFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
             ++ + N G     EV   Y + P    G+ +  ++G+ RV I    S K  F +   +
Sbjct: 618 AVEVVLTNNGDRTIDEVAQAYIATPTAGKGSPMASLVGFRRVSIPPKSSVKAVFKI-VPE 676

Query: 720 SLKIVDNAANSLLASGAHTILVG 742
            L  V +  +S L  G +T+ +G
Sbjct: 677 RLMTVQSDGSSKLLKGNYTLTIG 699


>gi|255284060|ref|ZP_05348615.1| beta-glucosidase [Bryantella formatexigens DSM 14469]
 gi|255265405|gb|EET58610.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Marvinbryantia formatexigens DSM 14469]
          Length = 700

 Score =  374 bits (959), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 243/736 (33%), Positives = 371/736 (50%), Gaps = 113/736 (15%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA+ LV +MT+ EK  Q+   A  + RLG+P Y WW+EALHGV+  G+           
Sbjct: 8   KRAEALVAQMTVEEKASQLKYDAPAIKRLGIPAYNWWNEALHGVARAGQ----------- 56

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
                AT FP  I   A+F+E+L  +I   ++TE RA YN   A        GLTFWSPN
Sbjct: 57  -----ATVFPQAIGLGATFDEALLGEIADVIATEGRAKYNAYAAKEDRDIYKGLTFWSPN 111

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           +N+ RDPRWGR  ET GEDP +  R  + +V+GLQ           D   +K +AC KH+
Sbjct: 112 VNIFRDPRWGRGHETYGEDPCLTSRLGVAFVKGLQ----------GDGETMKAAACAKHF 161

Query: 195 AAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           A +      G +  R  F++  + +DM+ET++  FE  V E DV +VM +YNR NG   C
Sbjct: 162 AVHS-----GPEAVRHEFNAEASAKDMEETYLPAFEALVKEADVEAVMGAYNRTNGEACC 216

Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
           A P +L + +R DW F G+ VSDC +I+   E H  L  T +++ A  + +G DL+CG+ 
Sbjct: 217 ASP-VLQKILREDWGFEGHFVSDCWAIRDFHE-HHMLTATAKESAAMAINSGCDLNCGNT 274

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAA 372
           Y +  + A + G ++E  I  +   L+     LG FDGS +Y ++    + + +H+ LA 
Sbjct: 275 YLHI-LHAYRDGLVSEETITEAAVRLFTTRFLLGLFDGS-EYDDIPYTVVESKEHLALAE 332

Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA 432
           +AA +  VLLKN NG LPL    ++T+ ++GP+A++  A+ GNY GT  RY +   G   
Sbjct: 333 KAALESAVLLKN-NGILPLKKERLRTVGVIGPNADSRAALAGNYHGTASRYETIQQGLQD 391

Query: 433 Y----SKVINYAPGCA-------DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE- 480
           Y     +V+  + GCA        +    + +  A I  A+N+D  ++  GLD ++E E 
Sbjct: 392 YLGEDVRVLT-SVGCALSEDRTEKLALAGDRLAEAQI-VAENSDVVILCLGLDETLEGEE 449

Query: 481 --------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
                     D+  LLLP  Q +L+  VA   K PV L +MS   +D+++A  +    +I
Sbjct: 450 GDTGNSYASGDKETLLLPEAQRDLMEAVAATGK-PVVLCMMSGSDLDMSYAAEH--FDAI 506

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
           L + YPG +GG A A ++FG+ +P G+LP+T+YE          +P     +  GRTY++
Sbjct: 507 LQLWYPGSQGGSAAAKLLFGEVSPSGKLPVTFYET------LEELPAFEDYSMKGRTYRY 560

Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
              P  YPFG+GL+Y                    D +  D N    +            
Sbjct: 561 MGHPAQYPFGFGLTY-------------------GDVRVTDANIRGAS------------ 589

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKV 711
               +   T  +  EN G     EV+ +Y K    A       +  + R+ + AG+   +
Sbjct: 590 ---AEGDLTLAVTAENAGNAVTDEVLQIYVKCTDSANAVPNPALAAFGRIHLEAGEKKTI 646

Query: 712 GFTMNACKSLKIVDNA 727
             T+ A ++  +VD A
Sbjct: 647 EMTVPA-RAFTVVDEA 661


>gi|219887077|gb|ACL53913.1| unknown [Zea mays]
 gi|224035251|gb|ACN36701.1| unknown [Zea mays]
 gi|413919685|gb|AFW59617.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 405

 Score =  374 bits (959), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 189/407 (46%), Positives = 263/407 (64%), Gaps = 17/407 (4%)

Query: 343 MRLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTL 399
           MRLG+FDG P+   + NLG +++C P + ELA EAARQGIVLLKN  G LPL+  +IK++
Sbjct: 1   MRLGFFDGDPRELPFGNLGPSDVCTPSNQELAREAARQGIVLLKN-TGKLPLSATSIKSM 59

Query: 400 ALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAI 458
           A++GP+ANA+  MIGNYEGTPC+YT+P+ G  A    + Y PGC ++ C  NS+ + AA 
Sbjct: 60  AVIGPNANASFTMIGNYEGTPCKYTTPLQGLGANVATV-YQPGCTNVGCSGNSLQLDAAT 118

Query: 459 DAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAV 518
            AA +AD TV+V G D S+E E  DR  LLLPG Q +L++ VA+A+ GP  LV+MS G  
Sbjct: 119 KAAASADVTVLVVGADQSIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPF 178

Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
           DI+FAK++ KI +ILWVGYPGE GG AIADV+FG +NP GRLP+TWY  ++ K+P T M 
Sbjct: 179 DISFAKSSDKIAAILWVGYPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTKVPMTDMR 238

Query: 579 LR--PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINY 636
           +R  P   +PGRTY+F+ G  VY FG GLSYT F + + S+PK + ++L +   C     
Sbjct: 239 MRPDPSTGYPGRTYRFYTGDTVYAFGDGLSYTSFAHHLVSAPKQLALQLAEGHACLTEQ- 297

Query: 637 TVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI 696
                   C +V  +   C+   F   + V N G+  G   V ++S PP +     K ++
Sbjct: 298 --------CPSVEAEGAHCEGLAFDVHLRVRNAGERSGGHTVFLFSSPPAVHNAPAKHLL 349

Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
           G+E+V +  GQ+  V F ++ CK L +VD   N  +A G+HT+ VG+
Sbjct: 350 GFEKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGD 396


>gi|358061481|ref|ZP_09148135.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
           WAL-18680]
 gi|356700240|gb|EHI61746.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
           WAL-18680]
          Length = 695

 Score =  372 bits (955), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 228/613 (37%), Positives = 336/613 (54%), Gaps = 74/613 (12%)

Query: 24  RAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
           +A  LVE+MTL E+  QM   A  VPRLG+P Y WW E LHGV+  G             
Sbjct: 9   KAVRLVEQMTLEERASQMRYDAPAVPRLGIPAYNWWGEGLHGVARAGT------------ 56

Query: 84  EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL----GN----AGLTFWSPNI 135
               AT FP  I   A F+  L ++I   VSTE RA YN     G+     GLTFWSPN+
Sbjct: 57  ----ATMFPQAIAMAAMFDVELTEEIANVVSTEGRAKYNQFCEEGDRDIYKGLTFWSPNV 112

Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
           N+ RDPRWGR  ET GEDPY+  R    +VRGLQ           D   LKI+AC KH+A
Sbjct: 113 NIFRDPRWGRGHETYGEDPYLTSRLGTAFVRGLQ----------GDGEHLKIAACAKHFA 162

Query: 196 AYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            +      G +  R  F +  +++D+ ET++  FE CV E  V SVM +YN  +G P CA
Sbjct: 163 VHS-----GPEALRHEFWADTSKKDLWETYLPAFEACVKEAHVESVMGAYNSYHGEPCCA 217

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
           +  L+ + +RG W F G+ VSDC +I+    ++  + DT  ++ A  +K G DL+CG+ Y
Sbjct: 218 NTLLMEEILRGQWGFEGHFVSDCWAIRDFHMNY-MVTDTAMESAALAVKKGCDLNCGNTY 276

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAE 373
               + A ++G + +A +  ++  L+     LG  + + +Y ++    +   +H ELA E
Sbjct: 277 LQ-VLKACEEGLLDDACVTEAVVRLFTTRYLLGMGEET-EYDDIPYEVVECKEHRELAVE 334

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF--- 430
           AAR+ +VLLKND G LPL+   + T+A++GP+A+   A+IGNY GT   YT+ ++G    
Sbjct: 335 AARRSMVLLKND-GLLPLHAEKLNTIAVIGPNADNRTALIGNYHGTSSCYTTILEGIQDA 393

Query: 431 YAYSKVINYAPGC-------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE--- 480
                 + YA GC         +    + +  A I  AK++D  V+  GLD ++E E   
Sbjct: 394 VGEDVRVLYAEGCHLFKDRVEHLAVAGDRLSEARI-VAKHSDVVVLCVGLDETLEGEEGD 452

Query: 481 ------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
                   D+ DLLLP  Q  L+ ++ +  K PV +  MS  A+D++ A+   K  +++ 
Sbjct: 453 TGNSHASGDKKDLLLPESQRRLMEEILNLGK-PVVVCNMSGSAIDLSLAQE--KAGAVIQ 509

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
           V YPG EGGRA+AD++FGK +P G+LP+T+Y+         ++P     +  GRTY++  
Sbjct: 510 VWYPGAEGGRALADLLFGKASPSGKLPVTFYK------DLENLPPFEDYSMDGRTYRYLT 563

Query: 595 GPVVYPFGYGLSY 607
              +YPFG+GL+Y
Sbjct: 564 AEPLYPFGFGLTY 576


>gi|410639677|ref|ZP_11350222.1| beta-glucosidase [Glaciecola chathamensis S18K6]
 gi|410140558|dbj|GAC08409.1| beta-glucosidase [Glaciecola chathamensis S18K6]
          Length = 733

 Score =  371 bits (953), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 253/763 (33%), Positives = 378/763 (49%), Gaps = 94/763 (12%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           +D P+ D +LP  +R   L++ MTL EK  Q+ +    + RLGLP Y++W+EALHGV+  
Sbjct: 22  NDQPWFDTQLPTQKRIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEALHGVARN 81

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGN 125
           GR                AT FP  I   A+F++ L  K    +S EARA +N    +GN
Sbjct: 82  GR----------------ATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGN 125

Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
               +GLTFW+PNIN+ RDPRWGR  ET GEDPY+  +     V GLQ            
Sbjct: 126 RSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQG---------DH 176

Query: 182 SRPLKISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
            + LK +A  KH+A +      G +  R  FD+  + +DM ET+   FE  V E +V +V
Sbjct: 177 PKYLKTAAAAKHFAVHS-----GPEALRHEFDAIASPKDMYETYFPAFEALVTEANVETV 231

Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
           M +YNRVNG P      LLN  +R  W F G++VSDC  +    + HK   +  E A A 
Sbjct: 232 MAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESA-AL 290

Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNL 357
            +  G DL+CG  Y N    AV+ G + E  ID  L  +     +LG+FD      Y N+
Sbjct: 291 AINTGTDLNCGAVY-NALPDAVEAGLVDEKTIDKRLSKVLATKFKLGFFDPKDDNPYNNI 349

Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
             + + +  H ++A E A + IVLL+N N  LPL+  NI+ L + GP A++++ ++GNY 
Sbjct: 350 SADVVNSEAHAQVAYEMAVKSIVLLQNKNNILPLDR-NIRNLYVTGPFASSSEVLLGNYY 408

Query: 418 GTPCRYTSPMDGFYAYSKV---INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
           G   + T+ +DG  A   V   INY  G        N +     +A +  D  + V GL 
Sbjct: 409 GLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLS 468

Query: 475 LSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
            + E E           DR+ L LP  Q   + K+      PV +V+++AG   +N  + 
Sbjct: 469 GAYEGEEGEAIASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPV-IVVLTAG-TPVNLTEI 526

Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
                +I++  YPG+EGG+A+AD++FG+ +P GRLPIT+ ++     PY    ++     
Sbjct: 527 AELADAIVFAWYPGQEGGKAVADILFGERSPSGRLPITFPKSEAQLPPYDDYSMQE---- 582

Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
             RTY++     +YPFG+GLSY Q K+         +I L   Q           N+P  
Sbjct: 583 --RTYRYMTQEPMYPFGFGLSYAQVKFD--------NITLGNTQAL------ASKNEP-- 624

Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIA 704
                          T  + V N G+ +  EVV +Y K P    +  +  + G+ R+ +A
Sbjct: 625 -----------QENMTVTVNVTNTGEREFEEVVQLYLKTPDAGVSQPLHSLKGFTRIKLA 673

Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGG 747
           AGQ+ +V F +   K L  ++     +L  G ++++VG    G
Sbjct: 674 AGQTEQVLFNI-PKKHLYSINAQGKPVLLKGQYSVIVGNASPG 715


>gi|339499234|ref|YP_004697269.1| beta-glucosidase [Spirochaeta caldaria DSM 7334]
 gi|338833583|gb|AEJ18761.1| Beta-glucosidase [Spirochaeta caldaria DSM 7334]
          Length = 699

 Score =  371 bits (953), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 257/743 (34%), Positives = 375/743 (50%), Gaps = 111/743 (14%)

Query: 28  LVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPG 87
           L+  M+L EK+  M   A G+PRLG+P Y WW+EALHGV+  G                 
Sbjct: 15  LISNMSLEEKIGLMIHRAKGIPRLGIPDYNWWNEALHGVANNGE---------------- 58

Query: 88  ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN-LG-------NAGLTFWSPNINVVR 139
           AT FP  I   A+F+E L  ++ + +S EARA +N +G       + GLTFW+PNIN+ R
Sbjct: 59  ATVFPQAIALGATFDEDLVHRVAEAISIEARAKFNAVGKEKAEQYHRGLTFWAPNINIFR 118

Query: 140 DPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDL 199
           DPRWGR  ET GEDP +  R    YVRGLQ          SD   L+ +AC KH+A +  
Sbjct: 119 DPRWGRGQETYGEDPVLTSRLGTAYVRGLQ---------GSDPYYLRAAACAKHFAVH-- 167

Query: 200 DNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLN 259
              EG  R  F++ V+++D++ET++  F+  V  G V SVM +YNRVNG P C    LL 
Sbjct: 168 SGPEGL-RHTFNAEVSQKDLEETYLPAFKALVKSG-VESVMGAYNRVNGEPACGSTYLLK 225

Query: 260 QTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMG 319
           Q +R +W F G++VSDC +I    ++HK  ND  E ++A  L++G DL+CGD Y N+   
Sbjct: 226 QKLREEWQFQGHVVSDCWAICDFHKNHKVTNDILE-SIALALRSGCDLNCGDAY-NYLAE 283

Query: 320 AVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGI 379
           AV +G + E DI+ ++  L I L +LG       Y+ +  + I   +H  LA EAA + I
Sbjct: 284 AVLKGYVTEDDINRAVVRLLITLDKLGLIHDDGPYQGITIHQIDWKKHDSLALEAAEKSI 343

Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK---V 436
           VLLKN NG LPL    I  + + GP+A  + A++GNY G   R  + ++     +     
Sbjct: 344 VLLKN-NGVLPLKKDKISYIYVTGPNATNSDALLGNYAGVSSRLLTVLEAIVEEAGPEIT 402

Query: 437 INYAPGC--ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV--------- 485
           + Y  GC  A+     N     A    K AD T+ V G D SVE E  D +         
Sbjct: 403 VTYKKGCPLAERRVNPNDW---ASGVTKYADVTIAVMGRDTSVEGEEGDAILSSTYGDFE 459

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           DL L   Q   ++K+ ++ K P+ +V+M  G   I   + +    +IL   YPG+ GG A
Sbjct: 460 DLNLNDEQLSYLHKLKESGK-PLIVVLM--GGAPICSPELHEIADAILVAWYPGQAGGTA 516

Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP--GRTYKFFDGPVVYPFGY 603
           +++++FGK NP G+LP+T+        P +   L    N+   GRTY++     +YPFG+
Sbjct: 517 VSNIVFGKTNPSGKLPVTF--------PKSVRQLPEFENYSMQGRTYRYMTEEPLYPFGF 568

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           GLSYT+ ++K                      +  G  K P    LI             
Sbjct: 569 GLSYTKMEFK----------------------HVTGRWKSPEKDELI-----------VS 595

Query: 664 IEVENMGKMDGSEVVMVY----SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
            E+ N G +DG EVV +Y      P  +       +I ++RV +AAG S    F +   +
Sbjct: 596 TELYNQGTIDGEEVVQLYYHWKDAPFAVPNW---SLIDFKRVLVAAGASCICEFKI-PLE 651

Query: 720 SLKIVDNAANSLLASGAHTILVG 742
            L+ +D +   ++ +G     VG
Sbjct: 652 KLQCIDPSGKGVIPTGTLQFYVG 674


>gi|282877070|ref|ZP_06285912.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
           buccalis ATCC 35310]
 gi|281300752|gb|EFA93079.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
           buccalis ATCC 35310]
          Length = 721

 Score =  371 bits (952), Expect = e-99,   Method: Compositional matrix adjust.
 Identities = 247/757 (32%), Positives = 372/757 (49%), Gaps = 109/757 (14%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           SI+ +++ +P+ DA+L + +RA DL +R+TL EK   M + +  VPRLG+  ++WW EAL
Sbjct: 17  SIQAQVT-YPFQDARLSFEQRADDLCKRLTLEEKAGLMQNNSKPVPRLGIKQFQWWGEAL 75

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HG +  G                 AT FP  I   ASF++ L  ++    STEARA YN+
Sbjct: 76  HGSARTGL----------------ATVFPQTIGMAASFDDELLLQVFNIASTEARAKYNV 119

Query: 124 G--------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
                    +  ++ W+PN+N+ RDPRWGR  ET GEDPY+  R     V GLQ  +G  
Sbjct: 120 AAKKGYFDTSWSVSLWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGCAVVEGLQGGKGPH 179

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
            +        K  AC KH+A +    W  +     D  V+ +D  ET++  F+  V  G 
Sbjct: 180 KY-------YKAFACAKHFAVHSGPEWNRHS-ISIDD-VSPRDFHETYLPAFKHLVQVGG 230

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKED 295
           V  VMC+YN ++G P C+D +LL Q +R +W F G +VSDC +I  I    K  ++ + D
Sbjct: 231 VKEVMCAYNSIDGEPCCSDQRLLEQLLRDEWGFKGIVVSDCGAIDDIWR--KGFHEVEPD 288

Query: 296 AV---ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF--DG 350
           A    AR +K G D+ CG  Y +    AV+ GK+ E  ID SL+ L +  M+LG F  D 
Sbjct: 289 AAHASARAVKGGTDMSCGQTYGSLPE-AVRLGKVTEERIDKSLKRLIVGRMQLGEFDPDS 347

Query: 351 SPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATK 410
             ++  +   ++  P   E+A + AR+ + LL N   ALPL+   +K + ++GP+AN + 
Sbjct: 348 ITRWNAISMKDVSTPASREVALKMARETMTLLHNPMHALPLSK-QLKQVVVMGPNANDSV 406

Query: 411 AMIGNYEGTPCRYTSPMDGFYAY--SKVINYAPGCADIVCQ---NNSMIPAAIDAAKNAD 465
            M GNY GTP    + +DG      ++ + +  GC  +      N ++    +      +
Sbjct: 407 MMWGNYNGTPHHTVTILDGIRRKIGAQRVKFIEGCGLVEPHRRGNQALTTQQLVEEVGDN 466

Query: 466 ATVI--------VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMS 514
            TVI        + G  L VEA   +G DRV + LP  Q E+I  +  A K    +++++
Sbjct: 467 KTVIFVGGISPQLEGEQLEVEAKGFKGGDRVTIELPQVQREMIAALHAAGK---QVIMVN 523

Query: 515 AGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPY 574
                I          +IL   YPGE GG A+ADV+FG YNP G+LP+T+Y  +      
Sbjct: 524 CSGSAIGLVPEVTHTDAILQAWYPGERGGEAVADVLFGDYNPAGKLPVTFYRDD------ 577

Query: 575 TSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDI 634
           + +P     N   RTY++F G  ++PFG+GLSYT FK                       
Sbjct: 578 SQLPDYLDYNMRNRTYRYFKGKPLFPFGHGLSYTSFK----------------------- 614

Query: 635 NYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQ 694
                          I   K ++ K T  + V+N GK DG EVV +Y          IK 
Sbjct: 615 ---------------IGKAKMRNGKLT--VSVKNTGKRDGEEVVQLYISCLDDPNGPIKS 657

Query: 695 VIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL 731
           + G++R+ + AG+   V   +   KS +  D   N++
Sbjct: 658 LRGFKRMALQAGEQRTVTLNLPR-KSFERFDEQTNTI 693


>gi|402493386|ref|ZP_10840139.1| beta-glucosidase [Aquimarina agarilytica ZC1]
          Length = 734

 Score =  370 bits (949), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 261/760 (34%), Positives = 382/760 (50%), Gaps = 113/760 (14%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
           +F + D    + +RAK LV  +TL EK+  M D +  + RL +P Y WW+E LHGV+  G
Sbjct: 38  NFEWFDTNKSFEKRAKALVASLTLEEKISLMVDQSAPIDRLNIPEYNWWNECLHGVARNG 97

Query: 71  RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGN- 125
           R                AT FP  I   A+F++ L  K+   +STEARA +N    +GN 
Sbjct: 98  R----------------ATVFPQAIGLAATFDQDLIFKVADAISTEARAKFNASIAIGNR 141

Query: 126 ---AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
              AGLTFW+PNIN+ RDPRWGR  ET GEDPY+  +  +N+V+GLQ          +  
Sbjct: 142 GKYAGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSQIGVNFVKGLQG---------NHP 192

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
           + LK +AC KHYA +   +     R  FD+  +++DM ET++  FE  V E  V  VM +
Sbjct: 193 KYLKSAACAKHYAVH---SGPEELRHEFDAIASKKDMAETYLPAFEALVKEAKVEGVMGA 249

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           YNRVNG   CA P LL + ++  W F GYIVSDC ++  + + HK +  T E++ A  L 
Sbjct: 250 YNRVNGEGACASPYLLEKLLKDTWGFKGYIVSDCWALSDLHKFHK-VTQTAEESAAAALN 308

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
            GL+++CG+ Y     GA++QG  +E  +D  L+   +   +LG+FD S    Y  +  +
Sbjct: 309 VGLNVNCGNVYPALD-GAIKQGLTSEKQLDNVLQHQLLTRFKLGFFDPSNNNPYNKITTD 367

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + +  H  +A EAA++ IVLLKN+N  L     ++K++ + GP+A     ++GNY G  
Sbjct: 368 VVDSEAHRAIALEAAQKSIVLLKNNNNLL-PLKKDLKSVYVAGPNAAREDVLLGNYYGVT 426

Query: 421 CRYTSPMDGFYAYSKV-----INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL 475
            +  + +DG    SKV     INY  G      +N + I  +      AD  +IV GL  
Sbjct: 427 SKTQTILDGI--VSKVSAGTSINYKQGLLPF-QKNVNPIDWSTGEISRADVGIIVMGLSG 483

Query: 476 SVEAE---------GKDRVDLLLPGFQTELINKVADAAKG-PVTLVIMSAGAVDINFAKN 525
           + E E           DRVD+ LP  Q + I K+     G P+ LV+   G   I   + 
Sbjct: 484 NYEGEEGEAIASESKGDRVDIRLPQNQIDYIKKIKAKNTGNPLVLVL--TGGSPIAMPEV 541

Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
              + +I++  YPGEEGG+A+AD++FG   P G+LPIT+        P +   L P N++
Sbjct: 542 YDLVDAIVFAWYPGEEGGQAVADILFGDVVPSGKLPITF--------PKSVDDLPPYNDY 593

Query: 586 P--GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKP 643
              GRTYK+      +PFG+GLSYT FKY                               
Sbjct: 594 AMKGRTYKYMTKTPQFPFGFGLSYTSFKY------------------------------- 622

Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF 702
                  D++K    K +F I   N G +D  EV  VY S P    G  +  ++G+ RV 
Sbjct: 623 -------DNLKVYKEKASFSI--TNNGNVDAEEVAQVYVSSPNAGKGDPLNTLVGFTRVS 673

Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           + AG + +V    +  K+    D+    +   G +TI VG
Sbjct: 674 LKAGATKQVSIPFSK-KAFVQFDSDGKEITRKGTYTIHVG 712


>gi|313202830|ref|YP_004041487.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312442146|gb|ADQ78502.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 742

 Score =  369 bits (946), Expect = 4e-99,   Method: Compositional matrix adjust.
 Identities = 254/727 (34%), Positives = 366/727 (50%), Gaps = 99/727 (13%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +P+ D      ER KDLV R+TL EK  QM   A  + RLG+  Y WW+EALHGV+  GR
Sbjct: 38  YPFQDTSKTIDERVKDLVSRLTLDEKAGQMLHNAPAIKRLGILPYSWWNEALHGVARTGR 97

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN------ 125
                           AT FP  +   A+F+E L  +IGQ +S EA A YN+        
Sbjct: 98  ----------------ATVFPENVGLAATFDEDLVYRIGQAISDEAWAKYNIAQRLENYG 141

Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
             +G+TF++PN+N+ RDPRWGR  ET GEDP++  R  + YV+G+Q          +D +
Sbjct: 142 QYSGITFYAPNVNIFRDPRWGRGQETYGEDPFLTSRMGVAYVKGMQG---------NDPK 192

Query: 184 PLKISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
            LK +AC KHY  +      G +  R  +D+    +D  ET++  FE  V EG V SVMC
Sbjct: 193 YLKTAACAKHYVVHS-----GPEALRHSYDAEPPMKDFMETYVPAFETLVKEGKVESVMC 247

Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
           +YNR  G P C    LL+  +R  W F GY+ +DC +IQ     H    D+ E A A  +
Sbjct: 248 AYNRTFGKPCCGSSFLLHDLLREKWGFTGYVTTDCWAIQNFYLHHGAAKDSLE-ACALAI 306

Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLG 358
           K+G++L+CG+ + N+   AV++G + E ++D +L  L     RLG FD SP    Y  + 
Sbjct: 307 KSGVNLNCGNEF-NYLPAAVRKGLVTEKEVDEALSQLLRTRFRLGLFD-SPNENPYAKIK 364

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
           +  I + Q+I+LA EAA + +VLL+N N  LPL   ++K+L +VGP+A     ++GNY G
Sbjct: 365 EEVIGSQQNIDLAYEAAAKSLVLLQNKNNTLPLKK-DMKSLYVVGPYAANQDILLGNYNG 423

Query: 419 TPCRYTSPMD---GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVI--VAGL 473
              R T+ M    G  +    +NY  G        NSM  +  +AA       +  ++G+
Sbjct: 424 VNSRLTTIMQAIVGKVSAGTSVNYRIGVEPSAPNKNSMNYSIGEAADADAVVAVFGISGV 483

Query: 474 DLSVEAEGK------DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
               E E        DR+DL LP  Q + + ++    K P+ LV+   G   I   +   
Sbjct: 484 FEGEEGESTASTSRGDRLDLNLPQNQLDYLRELKKKCKKPIILVL--TGGSPICTPELAD 541

Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG 587
            + +IL+V YPG+EGG A+ADVIFG  NP GRL IT+ ++       + +P     +  G
Sbjct: 542 MVDAILFVWYPGQEGGHAVADVIFGDVNPSGRLCITFPKS------VSQLPAFEDYSMKG 595

Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
           RTY++     +YPFG+GLSYT + Y    S    D    K  Q   +  T          
Sbjct: 596 RTYRYMTEEPLYPFGFGLSYTNYSY----SNIKTDKDKIKKGQSVHVTAT---------- 641

Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAG 706
                             V N GK  G EV  +Y +     A T +  + G +RV +AAG
Sbjct: 642 ------------------VSNTGKTAGEEVAQLYITDVKASAPTPLYALKGTKRVKLAAG 683

Query: 707 QSAKVGF 713
           +S +V F
Sbjct: 684 ESKEVSF 690


>gi|167519969|ref|XP_001744324.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777410|gb|EDQ91027.1| predicted protein [Monosiga brevicollis MX1]
          Length = 721

 Score =  368 bits (944), Expect = 7e-99,   Method: Compositional matrix adjust.
 Identities = 243/728 (33%), Positives = 365/728 (50%), Gaps = 74/728 (10%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAY-------GVPRLGLPLYEWWSEAL 63
           D P+CD  L + +RA DL +R+TL E  QQ+   ++       GVPRLGL  Y + +E L
Sbjct: 41  DLPFCDLSLDFRDRAWDLAQRLTLDELAQQLNTYSFTPQAYAPGVPRLGLRNYSYHAEGL 100

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN- 122
           HG+       N P           AT +P V    A+ N SL  ++   + TE RA+ N 
Sbjct: 101 HGIR-DANVVNYP-----------ATLYPQVTAMAATANASLIHEMSTIMGTELRAVNNR 148

Query: 123 -------LGNAG-LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
                   G  G L+ + P +N++RD RWGR  E+  EDP++ G YA+N+V GL+     
Sbjct: 149 AQELGEIFGRGGALSIYGPTMNIIRDGRWGRSQESVSEDPWLNGLYAVNFVLGLE----- 203

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGN-DRFHFDSRVTEQDMQETFILPFEMCVNE 233
              R+S S+ L+ +  CKH  AY  + +     R  F++ + E D+ +T++  F  CV  
Sbjct: 204 --QRNS-SKYLQAATSCKHLFAYSFEGYNNTLTRHSFNAVIDELDIHDTYLPAFRACVEL 260

Query: 234 GDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTK 293
           G V  +MCSYN VNGIP CA   + N  +R  W F G IVSDCD++  I  +H +   T 
Sbjct: 261 GHVQQIMCSYNSVNGIPACARGDVQNDRVRKAWGFEGLIVSDCDAVADIYNTHNY-TRTP 319

Query: 294 EDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF--DGS 351
           EDAV   L+ G DLDCGD+Y+     AVQQ     A +  S+  +  +   LG F  D S
Sbjct: 320 EDAVTVALQGGCDLDCGDFYSQHLASAVQQNLTTLAALQQSMTRVLEMRFLLGEFDPDTS 379

Query: 352 PQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKA 411
             Y+ LG+  I  P   + +  A+R+ +VLL+N    LP+       +AL+GP+ N T  
Sbjct: 380 VPYRQLGREAIDTPFARDSSLRASRESVVLLENRIKLLPVTLSADIKVALIGPYVNLTTI 439

Query: 412 MI-GNYEGTPCRYTSPMDGFYAYSKV-INYAPGCADIVCQNNSMIPAAIDAAKNADATVI 469
           M+ G  + TP   T+   GF A     +  +PGC +I       +  A+  A  AD  V+
Sbjct: 440 MMGGKLDYTPSFITTYFQGFQAIGITHLTSSPGC-NITAPLPGALDKAVQIATQADLVVL 498

Query: 470 VAGLDLSVEAEGKDRVDLLLPGFQTELINKVADA-AKGPVTLVIMSAGAVDINFAKNN-P 527
             GL   +E EG DR  L LP  Q +L + ++ A     + +V+++ G V ++  K    
Sbjct: 499 TLGLSSDIEHEGGDRETLGLPTPQQDLYDAISAAIPSSKLVVVLVNGGPVSVDRIKYGIA 558

Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNN 584
           +  +I+   Y G+  G A+A+ IFG+ NP G LP T + +N    +P+T M LRP     
Sbjct: 559 RTPTIIEAFYGGQSAGTALAETIFGQNNPSGTLPYTVFFSNITAHVPFTDMHLRPDAATG 618

Query: 585 FPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPP 644
           FPGRT++FFD PV++PFG+GLSY+ F                +D+    I  T G    P
Sbjct: 619 FPGRTHRFFDAPVMWPFGHGLSYSTFSLAW------------QDETVPSI--TTGDFTQP 664

Query: 645 CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFI 703
               L+  +          + V N G + G   + +Y + P       ++ ++G ++ ++
Sbjct: 665 ---TLMHQL--------LSVNVTNHGPLPGRRALHLYVTVPVTNVSVPLRNLVGLQKHWL 713

Query: 704 AAGQSAKV 711
           A  QS  V
Sbjct: 714 AVDQSMTV 721


>gi|268610157|ref|ZP_06143884.1| glycoside hydrolase family 3 protein [Ruminococcus flavefaciens
           FD-1]
          Length = 690

 Score =  366 bits (939), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 242/750 (32%), Positives = 367/750 (48%), Gaps = 118/750 (15%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D  L   ERA+DL  R+TL E+  Q+   A  V RL +P Y WWSE LHGV+  G   
Sbjct: 4   YKDKSLSAQERAEDLTNRLTLEEQASQLKYDAPAVDRLDIPAYNWWSEGLHGVARAGT-- 61

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   A F+E    K+G  +  EARA YN  +A       
Sbjct: 62  --------------ATMFPQAIGLAAMFDEEAMNKVGSIIGDEARAKYNEYSAHGDHDIY 107

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GL  WSPN+N+ RDPRWGR  ET GEDPY+  R  + + +GLQ           +   L
Sbjct: 108 KGLCLWSPNVNIFRDPRWGRGQETYGEDPYLTTRLGVAFAKGLQ----------GEGEVL 157

Query: 186 KISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
           K +AC KH A +      G +  R  FD+  + +DM+ET++  FE  V E  V  VM +Y
Sbjct: 158 KTAACAKHLAVH-----SGPEAIRHEFDAVASPKDMEETYLPAFEALVKEAKVEGVMGAY 212

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NRVNG P CA   L+ +    +W F GY VSDC +I+    +H  +  T  ++ A  LK 
Sbjct: 213 NRVNGEPACASKFLMGKL--DEWGFDGYFVSDCWAIRDFHTNH-MVTKTAPESAAMALKL 269

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
           G DL+CG+ Y +  + A  +G I + DI  +   L    +RLG FD   +Y  L  + + 
Sbjct: 270 GCDLNCGNTYLHL-LHAYNEGLINDEDIKKACTHLMRTRVRLGMFDDETEYDKLDYSIVA 328

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
           N ++   A + + + +V+LKN NG LPL+   IKT+ ++GP+A++  A+ GNY G   RY
Sbjct: 329 NEENKAYARKCSERSMVMLKN-NGILPLDPSKIKTIGVIGPNADSRPALEGNYNGRADRY 387

Query: 424 TSPMDGFY-AYSKVINYAPG-------CADIVCQNNSMIPAAIDAAKNADATVIVAGLDL 475
            + ++G   A+   + Y+ G       C  +   ++ +  A I   +++D  V+  GLD 
Sbjct: 388 ITFLEGIQDAFGGRVLYSEGSHLYKDRCMGLAVADDRLSEAEI-VTEHSDVVVLCVGLDA 446

Query: 476 SVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           ++E E           D+ DL LP  Q +L+  V    K PV +V  +  A+++      
Sbjct: 447 TIEGEEGDTGNEFSSGDKNDLRLPEAQRKLVETVMRKGK-PVIIVTAAGSAINV-----E 500

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP-YTSMPLRPVNNF 585
               +++   YPG+ GG A+AD++FGK +P G+LP+T+Y  +  K+P +T   ++     
Sbjct: 501 ADCDALIHAWYPGQFGGTALADILFGKISPSGKLPVTFY-TDTTKLPEFTDYSMK----- 554

Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
            GRTY++    ++YPFGYGL+Y++ +                                  
Sbjct: 555 -GRTYRYTQDNILYPFGYGLTYSKTE---------------------------------- 579

Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAA 705
               + D+K ++ K +  ++V N G  D  +VV  Y K  G        + G+ RVF+  
Sbjct: 580 ----VSDLKFENGKAS--VKVTNTGDFDTEDVVQFYIKGEGSDYVPFYSLCGFRRVFLKK 633

Query: 706 GQSAKVGFTMNACKSLKIVDNAANSLLASG 735
           G+S  V  T+       + +N   S  A G
Sbjct: 634 GESTVVEVTLGDSAFEAVDENGRRSRSAKG 663


>gi|291240563|ref|XP_002740191.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 747

 Score =  365 bits (938), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 247/744 (33%), Positives = 368/744 (49%), Gaps = 100/744 (13%)

Query: 2   FESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG-------VPRLGLP 54
           F  I   L DFP+ +  LP+ ER  DLV R+TL E V QM     G       + RLG+ 
Sbjct: 15  FSLISTILGDFPFRNTSLPWSERVDDLVGRLTLEEIVLQMSRGGTGSNGPAPPIDRLGIG 74

Query: 55  LYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
            Y W +E LHG                D     ATSFP      A+F+  L ++I    +
Sbjct: 75  PYSWNTECLHG----------------DVAAGPATSFPQAFGLAATFDAVLIEQIANATA 118

Query: 115 TEARAMYNL--------GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVR 166
            E RA YN          + GL+ +SP IN+ R P WGR+ ET GEDPY+ G  A +YV 
Sbjct: 119 YEVRAKYNNYAKHKEYGDHKGLSCFSPVINIARHPLWGRIQETYGEDPYLSGTLAASYVN 178

Query: 167 GLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILP 226
           GLQ          +  R +  +A CKH+ AY       + R  FD++V+++D++ TF+  
Sbjct: 179 GLQG---------NHPRYVTANAGCKHFDAYAGPEDIPSSRSTFDAKVSDRDLRMTFLPA 229

Query: 227 FEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESH 286
           F  C+  G   S+MCSYN +NG+P CA+ KLL   +R +WNF GY++SD  +++ + ++H
Sbjct: 230 FHECIQAG-THSLMCSYNSINGVPACANKKLLTDILRTEWNFTGYVISDQSAVEKVYDAH 288

Query: 287 KFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVL 342
            +  D  + A+A V  +GL+L+      D     T  AV+QG +    +   +  L+   
Sbjct: 289 HYTKDMLDTAIACV-NSGLNLELSSNLEDNVMMQTTKAVKQGNVTMKTVKARVSPLFYTR 347

Query: 343 MRLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTL 399
           MRLG FD  P+   Y  L  + I + +H EL+ +AA +  VLLKN+N  LPL    I  L
Sbjct: 348 MRLGEFD-PPEMNPYSKLDLSIIQSQEHQELSLKAAAKSFVLLKNENRFLPLKE-KIDKL 405

Query: 400 ALVGPHANATKAMIGNYEGTPCRYT-SPMDGFYAYSKVINYAPGCADIVCQ--NNSMIPA 456
           A+VGP A+   A+ G+Y  TP  YT +P +G    +   +YA GC +  C+  ++  + +
Sbjct: 406 AVVGPLADNVDALYGDYSATPNNYTVTPRNGLARLAGNTSYASGCDNPKCRKYDSGQVKS 465

Query: 457 AIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAG 516
           A+     AD  V+  G    +E+EG DR +L LPG Q  L+         PV L++ +AG
Sbjct: 466 AVSG---ADMVVVCVGTGTDIESEGNDRHELALPGKQLSLLQDAVKFGTKPVILLLFNAG 522

Query: 517 AVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFG---KYNPGGRLPITWYEANYVKIP 573
            +D+++A  NP +++I+   +P +  G A+  +      + NP GRLP+TW  +     P
Sbjct: 523 PLDVSWAVENPAVQTIVACFFPAQATGDALYRMFMNTSPESNPAGRLPMTWPRSMEQVPP 582

Query: 574 YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRD 633
            T   ++      GRTY++ D   ++PFG+GLSYT FKY   S+  +V    D       
Sbjct: 583 MTDYTMK------GRTYRYSDADPLFPFGFGLSYTLFKYYNTSASPTVIKSCD------- 629

Query: 634 INYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIK 693
                                      T  + V N+G   G EV+ VY      + T  K
Sbjct: 630 -------------------------TVTIPLTVTNVGDFPGDEVMQVYISWSNASVTVPK 664

Query: 694 -QVIGYERVF-IAAGQSAKVGFTM 715
            Q++G+ RV  I    SA V F +
Sbjct: 665 LQLVGFRRVREIEPSASATVHFAV 688


>gi|6573772|gb|AAF17692.1|AC009243_19 F28K19.27 [Arabidopsis thaliana]
          Length = 696

 Score =  365 bits (937), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 201/490 (41%), Positives = 297/490 (60%), Gaps = 22/490 (4%)

Query: 275 DCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTS 334
           DCD++  I ++  +   + EDAVA VLKAG+D++CG Y    T  A+QQ K++E DID +
Sbjct: 221 DCDAVSIIYDAQGYAK-SPEDAVADVLKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRA 279

Query: 335 LRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPL 391
           L  L+ V +RLG F+G P    Y N+  N +C+P H  LA +AAR GIVLLKN+   LP 
Sbjct: 280 LLNLFSVRIRLGLFNGDPTKLPYGNISPNEVCSPAHQALALDAARNGIVLLKNNLKLLPF 339

Query: 392 NTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNN 451
           +  ++ +LA++GP+A+  K ++GNY G PC+  +P+D   +Y K   Y  GC  + C +N
Sbjct: 340 SKRSVSSLAVIGPNAHVVKTLLGNYAGPPCKTVTPLDALRSYVKNAVYHQGCDSVAC-SN 398

Query: 452 SMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLV 511
           + I  A+  AKNAD  V++ GLD + E E  DRVDL LPG Q ELI  VA+AAK PV LV
Sbjct: 399 AAIDQAVAIAKNADHVVLIMGLDQTQEKEDFDRVDLSLPGKQQELITSVANAAKKPVVLV 458

Query: 512 IMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK 571
           ++  G VDI+FA NN KI SI+W GYPGE GG AI+++IFG +NPGGRLP+TWY  ++V 
Sbjct: 459 LICGGPVDISFAANNNKIGSIIWAGYPGEAGGIAISEIIFGDHNPGGRLPVTWYPQSFVN 518

Query: 572 IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVAS-SPKSVDIKLDKDQQ 630
           I  T M +R    +PGRTYKF+ GP VY FG+GLSY+ + Y+  + +  ++ +   K Q 
Sbjct: 519 IQMTDMRMRSATGYPGRTYKFYKGPKVYEFGHGLSYSAYSYRFKTLAETNLYLNQSKAQT 578

Query: 631 CRD-INYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP--GI 687
             D + YT+ +         +    C   K    +EVEN G+M G   V+++++    G 
Sbjct: 579 NSDSVRYTLVSE--------MGKEGCDVAKTKVTVEVENQGEMAGKHPVLMFARHERGGE 630

Query: 688 AGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVG 746
            G    KQ++G++ + ++ G+ A++ F +  C+ L   +     +L  G + + VG+   
Sbjct: 631 DGKRAEKQLVGFKSIVLSNGEKAEMEFEIGLCEHLSRANEFGVMVLEEGKYFLTVGDS-- 688

Query: 747 GVSFPLQLNL 756
               PL +N+
Sbjct: 689 --ELPLIVNV 696



 Score =  221 bits (564), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 106/198 (53%), Positives = 134/198 (67%), Gaps = 13/198 (6%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           + +C   LP  +RA+DLV R+T+ EK+ Q+ + A G+PRLG+P YEWWSEALHGV++ G 
Sbjct: 36  YQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGIPRLGVPAYEWWSEALHGVAYAG- 94

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
                PG  F+  V  ATSFP VILT ASF+   W +I Q +  EAR +YN G A G+TF
Sbjct: 95  -----PGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEARGVYNAGQANGMTF 149

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ--DVEGVEYHRDSDSRPLKIS 188
           W+PNIN+ RDPRWGR  ETPGEDP + G YA+ YVRGLQ    +G    R + S  L+ S
Sbjct: 150 WAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDSFDG----RKTLSNHLQAS 205

Query: 189 ACCKHYAAYDLDNWEGND 206
           ACCKH+ AYDLD W+  D
Sbjct: 206 ACCKHFTAYDLDRWKDCD 223


>gi|238578959|ref|XP_002388893.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
 gi|215450599|gb|EEB89823.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
          Length = 658

 Score =  364 bits (934), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 249/700 (35%), Positives = 361/700 (51%), Gaps = 71/700 (10%)

Query: 56  YEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVST 115
           Y WWSEAL+                F S    ATSFP  I   A+F++ L   I   +ST
Sbjct: 1   YNWWSEALN----------------FSS----ATSFPAPITMGATFDDGLIHAIATVIST 40

Query: 116 EARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
           EARA  N+   GL F++PNIN  +DPRWGR  ETPGEDP+ + +Y    V GLQ   G  
Sbjct: 41  EARAFNNVNRGGLDFFTPNINPFKDPRWGRGQETPGEDPFHISQYVYQLVTGLQGGVG-- 98

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
                    LKI+A CKH+AAYDL+N  G  RF FD++VT QD+ E +   F+ C+ +  
Sbjct: 99  ------PTNLKIAADCKHWAAYDLENL-GVSRFEFDAKVTMQDLAEFYSPSFQSCIRDAK 151

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTK 293
           V+S+MCSYN VNGIP+CA+  LL    R  W      +I  DC ++  I   H + +D  
Sbjct: 152 VASIMCSYNAVNGIPSCANRYLLQTLARDFWGLGEEQWITGDCGAVGNIFARHHYTDD-P 210

Query: 294 EDAVARVLKAGLDLDC---GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
            +  A  L AG D+DC      Y+     A+ +  ++E  + T++   Y  L+RL + D 
Sbjct: 211 ANGTAVALNAGTDIDCDSGAAAYSQNLGQALNRSLVSEDQLRTAVTRQYNSLVRLSWDD- 269

Query: 351 SPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATK 410
                      +      +LA +AA +GIVLLKND G LPL + ++K +A+VGP ANAT 
Sbjct: 270 -----------VNTEPAQQLAYQAAVEGIVLLKND-GILPLAS-SVKKVAVVGPMANATT 316

Query: 411 AMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV 470
            M  NY G      SP   F      + +A G   +   + S   AAI AA +AD    V
Sbjct: 317 QMQSNYNGIAPFLVSPQQAFRNAGFNVTFANGTG-LNSSDTSGFSAAIAAADDADVVFYV 375

Query: 471 AGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK 530
            G+D ++E E +DR ++   G Q  L+ ++A   K P+ ++ M  G VD +  ++N  + 
Sbjct: 376 GGIDTTIEREDRDRPEISWTGNQLALVQQLASLGK-PLIVLQMGGGQVDSSSLRDNTSVN 434

Query: 531 SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNFPGRT 589
           +++W GYPG+ GG A+ D+I GK  P GRLPIT Y A+YV   P T M LRP ++ PGRT
Sbjct: 435 ALIWGGYPGQSGGTALVDLITGKQAPAGRLPITQYPASYVDGFPMTDMTLRPSSSNPGRT 494

Query: 590 YKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
           YK++ G  ++ FG+GL YT F  + AS   S  ++ D     ++            + V 
Sbjct: 495 YKWYTGAPIFEFGFGLHYTTFDAEWASGGDSFSVQ-DLVSSAKN------------SGVA 541

Query: 650 IDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERV-FIAAGQ 707
             D+   D   TF + V N G +    V +++S+   G +    K+++ Y RV  I  G 
Sbjct: 542 HVDLGVLD---TFNVTVTNSGTVASDYVALLFSRTTAGPSPAPNKELVSYTRVKGIEPGA 598

Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGG 747
           S+     +    ++   D   N +L  G + +L+  G  G
Sbjct: 599 SSAASLKVT-LGAVARTDEQGNRVLYPGEYVLLLDTGAEG 637


>gi|125534110|gb|EAY80658.1| hypothetical protein OsI_35835 [Oryza sativa Indica Group]
          Length = 511

 Score =  364 bits (934), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 197/487 (40%), Positives = 285/487 (58%), Gaps = 20/487 (4%)

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEAD 330
           Y+ SDCD++ TI ++H +   + ED VA  +KAG+D++CG+Y     M AVQ+G + E D
Sbjct: 16  YVASDCDAVATIRDAHHY-TLSPEDTVAVSIKAGMDVNCGNYTQVHAMAAVQKGNLTEKD 74

Query: 331 IDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQHIELAAEAARQGIVLLKNDN 386
           ID +L  L+ V MRLG+FDG P+    Y +LG  ++C+P H  LA EAA+ GIVLLKND 
Sbjct: 75  IDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKNDA 134

Query: 387 GALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY-SKVINYAPGCAD 445
           GALPL    + +LA++GP+A+   A+ GNY G PC  T+P+ G   Y      +  GC  
Sbjct: 135 GALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGCDS 194

Query: 446 IVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAK 505
             C   +    A   A ++D  V+  GL    E EG DR  LLLPG Q  LI  VA+AA+
Sbjct: 195 PACAVAATN-EAAALASSSDHVVLFMGLSQKQEQEGLDRTSLLLPGEQQGLITAVANAAR 253

Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
            PV LV+++ G VD+ FAK+NPKI +IL  GYPG+ GG AIA V+FG +NP GRLP+TWY
Sbjct: 254 RPVILVLLTGGPVDVTFAKDNPKIGAILLAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWY 313

Query: 566 EANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASS---PKS 620
              + K+P T M +R  P   +PGR+Y+F+ G  VY FGYGLSY++F  ++ SS     +
Sbjct: 314 PEEFTKVPMTDMRMRADPATGYPGRSYRFYQGNTVYNFGYGLSYSKFSRRMFSSFSTSNA 373

Query: 621 VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYKFTFQIEVENMGKMDGSEV 677
            ++ L      R      G +    ++ L+ ++   +C    F   +EV+N G MDG   
Sbjct: 374 GNLSLLAGVMAR----RAGDDGGGMSSYLVKEIGVERCSRLVFPAVVEVQNHGPMDGKHS 429

Query: 678 VMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGA 736
           V++Y + P    G   +Q+IG+    +  G+ A V F ++ C+    V      ++  GA
Sbjct: 430 VLMYLRWPTKSGGRPARQLIGFRSQHVKVGEKAMVSFEVSPCEHFSWVGEDGERVIDGGA 489

Query: 737 HTILVGE 743
           H ++VG+
Sbjct: 490 HFLMVGD 496


>gi|333995841|ref|YP_004528454.1| beta-glucosidase [Treponema azotonutricium ZAS-9]
 gi|333737309|gb|AEF83258.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
           (Beta-D-glucoside glucohydrolase) [Treponema
           azotonutricium ZAS-9]
          Length = 706

 Score =  363 bits (933), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 250/755 (33%), Positives = 378/755 (50%), Gaps = 110/755 (14%)

Query: 24  RAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
           R K+++ +MTL EKV Q+   A  V   G+P Y WW+E LHGV+  G             
Sbjct: 6   RIKEMISKMTLEEKVSQLSYDAPAVESAGIPKYNWWNECLHGVARAGL------------ 53

Query: 84  EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGNA----GLTFWSPNI 135
               AT FP  I   A+F+E+  + +   +S E RA YN     GN     GLTFW+PN+
Sbjct: 54  ----ATVFPQAIALAATFDEAFIRSVADAISDEGRAKYNEAVKRGNRSQYYGLTFWTPNV 109

Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
           N+ RDPRWGR  ET GEDPY+ GR  + +++GLQ           D+  LK++AC KHYA
Sbjct: 110 NIFRDPRWGRGQETYGEDPYLTGRIGLAFMKGLQ---------GDDTEHLKVAACAKHYA 160

Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
            +   +     R  FD+ V+++D+ ET++  F++ V  G V +VM +YNR  G P     
Sbjct: 161 VH---SGPEKLRHTFDAVVSKKDLFETYLPAFKLLVENG-VEAVMGAYNRTLGEPCGGST 216

Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
            LL + +RG W F G++ SDC +I+   E+HK +  + E++ A  L AG DL+CG  Y  
Sbjct: 217 YLLKEILRGRWGFKGHVTSDCWAIRDFHENHK-VTKSPEESAAMALNAGCDLNCGCTYPY 275

Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELAAE 373
            T+ + ++G + +  IDT+L  L     +LG FD   Q  Y+NLG + +   +H  LA E
Sbjct: 276 LTV-SHKKGLVTDETIDTALTRLLRTRFKLGLFDPPEQDPYRNLGNDIVGCEKHRNLALE 334

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
           AA++ IVLLKND+  LPL+    K L L+GP A     ++ NY G   R  + ++G    
Sbjct: 335 AAQKSIVLLKNDSNILPLDDSARKIL-LMGPGAANILTLLANYYGMSSRLVTILEGLAEK 393

Query: 434 SKV-----INYAPGCADIVCQNNSMIP---------AAIDAAKNADATVIVAGLDLSVEA 479
            K        Y  G       + S +P         A I      D  + V GLD S+E 
Sbjct: 394 IKTKTAISFEYRQGSLMYEPNHLSNVPFGSTGVDAEAPIYGLDEIDLVIAVYGLDGSMEG 453

Query: 480 E---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK 530
           E           DR  + LP +Q   + ++  A K    +V++  G   I F ++     
Sbjct: 454 EEGDSIASDANGDRDTIELPSWQLNFLRRIRKAGK---KVVLILTGGSPIAFPED--LAD 508

Query: 531 SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTY 590
           ++L+  YPGE+GG A+AD++FG  +P G+LPIT+ ++     PY    L+      GRTY
Sbjct: 509 AVLFAWYPGEQGGNAVADILFGDVSPSGKLPITFPQSTAQLPPYDDYALK------GRTY 562

Query: 591 KFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
           ++     +YPFG+GLSYT F++       SV++   K         + G           
Sbjct: 563 RYMKETPLYPFGFGLSYTSFRF------DSVELSSSK--------ISAG----------- 597

Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSA 709
           + VK K       ++V N GK D  EVV +Y +K           + G+ R+ I AG+SA
Sbjct: 598 NSVKAK-------VQVSNTGKRDAEEVVQLYIAKDNRSEDEPASSLRGFRRLKILAGKSA 650

Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
            V   + A  + + ++    S+L  G++T++  + 
Sbjct: 651 SVEIELPAS-AFETINAEGASVLIPGSYTVIAADA 684


>gi|332638085|ref|ZP_08416948.1| glycoside hydrolase family 3 protein [Weissella cibaria KACC 11862]
          Length = 713

 Score =  363 bits (932), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 239/772 (30%), Positives = 386/772 (50%), Gaps = 117/772 (15%)

Query: 24  RAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
           +AK +V++MT+ EK+ Q+   A  + RL +P Y +W+EALHGV+  G             
Sbjct: 13  QAKVIVDQMTIDEKIGQIKYEAPAIERLNIPEYNYWNEALHGVARAGV------------ 60

Query: 84  EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPNI 135
               AT FP  I   A+F++ L   I   + TE RA YN            GLTFWSPN+
Sbjct: 61  ----ATVFPQAIGLAATFDDQLINDIADVIGTEGRAKYNEFTKHEDRDIYKGLTFWSPNV 116

Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
           N+ RDPRWGR  ET GEDP++  ++ + +++GLQ            ++ LK++A  KH+A
Sbjct: 117 NIFRDPRWGRGHETYGEDPFLTSKFGVAFIKGLQ----------GQAKYLKLAATAKHFA 166

Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
            +     EG  R  FD+ V+++D+ ET++  F+  V E DV S+M +YN V+G+P     
Sbjct: 167 VHS--GPEGL-RHGFDAVVSDKDLYETYLPAFKAAVEEADVESIMTAYNAVDGVPASVSE 223

Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
            LL   +   W+F G++VSD  + + + E+HK+  D  E  +   +KAGL+L  G +   
Sbjct: 224 MLLRDILHDKWSFEGHVVSDYMAPEDVHENHKYTKDAAE-TMGLAIKAGLNLVAG-HIEQ 281

Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAA 375
               A+ +G + E +I  ++  LY   +RLG F    +Y  +         H  L+  AA
Sbjct: 282 SLHEALNRGLVTEEEITNAVISLYATRVRLGMFATDNEYDAIPYEANDTKAHNNLSEIAA 341

Query: 376 RQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY-- 433
            +  VLLKND G LPL    ++ +A+VGP+A++  A++GNY GTP R  + ++G      
Sbjct: 342 EKSFVLLKND-GVLPLRKETMEAIAVVGPNAHSEIALLGNYFGTPSRSYTILEGIQERLG 400

Query: 434 -SKVINYAPG-------CADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----- 480
               ++Y+ G        A+ + + +     AI AA+++D  V V GLD ++E E     
Sbjct: 401 DDVRVHYSIGSGVFQDHAAEPLAKADERESEAIIAAEHSDVIVAVLGLDSTIEGEEGDAG 460

Query: 481 ----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
                 D+ +L LPG Q +L+ ++    K PV +++ S  ++ ++  +N+P +++I+ + 
Sbjct: 461 NSQGAGDKPNLSLPGRQRQLLERLLAVGK-PVVVLLASGSSLQLDGLENHPNLRAIMQIW 519

Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
           YPG  GG A+ADV+FG  +P G+LP+T+Y+         ++P     N  GRTY++    
Sbjct: 520 YPGARGGLAVADVLFGTVSPSGKLPVTFYKNT------DNLPAFEDYNMAGRTYRYMTEE 573

Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
            +YPFGYGL+Y+                                      +V + D++ K
Sbjct: 574 ALYPFGYGLTYS--------------------------------------SVELSDLQVK 595

Query: 657 DYK--FTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
            Y+   T  + ++N G  D  EVV VY K           Q+ G++RVF+  G    + F
Sbjct: 596 SYEETATATVTIQNTGNFDTDEVVQVYVKDLESEFAVPNAQLKGFKRVFLGKGSKQTITF 655

Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVGE--------GVGGVSFPLQLNLN 757
            +   +  ++ D   ++ + S    I VG          + GV  PLQ  LN
Sbjct: 656 DLR-PQDFEVFDEQGHNFIDSNRFEISVGVSQPDARSIALTGVQ-PLQTELN 705


>gi|255590044|ref|XP_002535159.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
 gi|223523880|gb|EEF27223.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
          Length = 449

 Score =  363 bits (932), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 182/450 (40%), Positives = 283/450 (62%), Gaps = 21/450 (4%)

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNN 361
           +D++CG+Y  N+T  AV++ K++E++ID +L  L+ + MRLG F+G+P    Y ++  + 
Sbjct: 1   MDVNCGNYLKNYTKSAVEKKKVSESEIDRALHNLFSIRMRLGLFNGNPTKLPYGDISADQ 60

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           +C+ +H  +A EAAR GIVLLKN N  LPL+     +LA++GP+A+ +  ++GNY G PC
Sbjct: 61  VCSQEHQAVALEAARDGIVLLKNSNQLLPLSKSKTTSLAIIGPNADNSTILVGNYAGPPC 120

Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
           +  +P  G   Y K   Y PGC+ + C +++ I  AI  AK AD  V+V GLD + E E 
Sbjct: 121 KTVTPFQGLQNYIKTTKYHPGCSTVAC-SSAAIDQAIKIAKEADQVVLVMGLDQTQEREE 179

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
            DRVDL+LPG Q ELI  VA AAK PV LV++  G VDI+FAK +  I  ILW GYPGE 
Sbjct: 180 HDRVDLVLPGKQQELIISVARAAKKPVVLVLLCGGPVDISFAKYDRNIGGILWAGYPGEA 239

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVY 599
           GG A+A++IFG +NPGGRLP+TWY  ++ K+P T M +R  P + +PGRTY+F+ G  V+
Sbjct: 240 GGIALAEIIFGNHNPGGRLPVTWYPQDFTKVPMTDMRMRPQPSSGYPGRTYRFYKGKKVF 299

Query: 600 PFGYGLSYTQFKYKVAS-SPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK---C 655
            FGYGLSY+ + Y++ S +   + ++   DQ+          N  P     I +++   C
Sbjct: 300 EFGYGLSYSNYSYELVSVTQNKISLRSSIDQKAE--------NSSPIGYKTISEIEEELC 351

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSK--PPGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
           +  KF+  + V+N G+M G   V+++++   PG +G  IK++I ++ V + AG++A++ +
Sbjct: 352 ERSKFSVTVRVKNQGEMTGKHPVLLFARQDKPG-SGGPIKKLIAFQSVKLNAGENAEIEY 410

Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVGE 743
            +N C+ L   +     ++  G+  +LVG+
Sbjct: 411 KVNPCEHLSRANEDGLMVMEEGSQYLLVGD 440


>gi|326789672|ref|YP_004307493.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
 gi|326540436|gb|ADZ82295.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
          Length = 704

 Score =  362 bits (930), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 220/619 (35%), Positives = 327/619 (52%), Gaps = 67/619 (10%)

Query: 24  RAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
           +A  LV +M L EK   +   +  + RLG+P Y WWSEALHGV+  G             
Sbjct: 8   KAGQLVAQMDLLEKASMLRYDSPAIKRLGVPTYNWWSEALHGVARAGV------------ 55

Query: 84  EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPNI 135
               AT FP  I   A F+E    +I   ++TEARA YN            G+T W+PNI
Sbjct: 56  ----ATVFPQAIGMAAMFDEEYLYEIADIIATEARAKYNEFAKKEDRDIYKGMTLWAPNI 111

Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
           N+ RDPRWGR  ET GEDPY+  R  + ++ GLQ  E   Y         K +AC KH+A
Sbjct: 112 NIFRDPRWGRGHETYGEDPYLTSRLGVAFIHGLQGDENHHY--------WKAAACAKHFA 163

Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
            +     E   R HFD+ V+++D+ ET++  FE  V +G V+ +M +YNRVNG P C   
Sbjct: 164 VHSGPEEE---RHHFDAVVSKKDLYETYLPAFEAAVTKGKVAGMMGAYNRVNGEPACGSK 220

Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
            LL   ++ +W F GY+VSDC +I+     H  +  T  ++ A  +  G  L+CG+ Y +
Sbjct: 221 VLLQDILKEEWGFDGYVVSDCWAIRDFHTEH-MVTHTATESAALAINNGCQLNCGNTYLH 279

Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLG-KNNICNPQHIELAAEA 374
             + A ++G + E  I  S + L  + M+LG FD + +Y  +  + N C   H ++A + 
Sbjct: 280 M-LQAYKEGLVTEETITKSAQKLMAIRMKLGLFDKNCEYNKIPYEVNDCKV-HRDIALDV 337

Query: 375 ARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY- 433
           AR+ +VLLKN NG LPLN    K + ++GP AN+   + GNY GT  RYT+ ++G   Y 
Sbjct: 338 ARRSMVLLKN-NGILPLNLKQTKAIGVIGPTANSRTVLQGNYFGTASRYTTFLEGIQDYV 396

Query: 434 --SKVINYAPGCADI------VCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----- 480
             +  + YA GC         +   N  +  A+  A+ +D  ++  GLD S+E E     
Sbjct: 397 GDAARVYYAEGCHLFKNSISGLSWENDRLSEALIVAEQSDVVILCLGLDASIEGEQGDTG 456

Query: 481 ----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
                 D+ DL L G Q  L+ +V    K P  L++ S  A+ I+ A+     ++IL   
Sbjct: 457 NAFAAGDKSDLNLIGRQQLLLEEVLKIGK-PTILILSSGSAMAIHTAQE--YCEAILETW 513

Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
           YPG+ GG+A+A ++FG+Y+P G+LPIT+Y+          +P     +  GRTY++    
Sbjct: 514 YPGQSGGKALAQLLFGEYSPSGKLPITFYKTT------EELPDFRDYSMAGRTYRYMKNE 567

Query: 597 VVYPFGYGLSYTQFKYKVA 615
            +YPFGYGL+Y + + K A
Sbjct: 568 ALYPFGYGLNYAKVEVKDA 586


>gi|325970053|ref|YP_004246244.1| beta-glucosidase [Sphaerochaeta globus str. Buddy]
 gi|324025291|gb|ADY12050.1| Beta-glucosidase [Sphaerochaeta globus str. Buddy]
          Length = 698

 Score =  362 bits (929), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 238/749 (31%), Positives = 370/749 (49%), Gaps = 107/749 (14%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA++LVERM LP+ + Q+   A  +  LG+P Y WW+E LHG +  G            
Sbjct: 5   QRAQELVERMNLPQMMSQLRHDAPAIESLGIPAYNWWNEGLHGSARSGT----------- 53

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN--------AGLTFWSPN 134
                AT FP  I   + F+      +   VSTE RA YNL           GLT WSPN
Sbjct: 54  -----ATVFPQAIGLASLFDPDFLYAVASVVSTEQRAKYNLFTHENDRDIYKGLTVWSPN 108

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           +N+ RDPRWGR  ET GEDPY+  R A+ ++RGLQ           +   LK ++C KH+
Sbjct: 109 VNIFRDPRWGRGQETFGEDPYLTARLAVAFIRGLQ----------GEGPVLKTASCVKHF 158

Query: 195 AAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           AA+      G +  R  F++ V ++D++ET++  F   V E    +VM +Y+ +N  P C
Sbjct: 159 AAHS-----GPEPLRHGFNAVVGKKDLEETYLPAFASAVKEAKADAVMGAYSALNDEPCC 213

Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
           A   L+ +T+R  W F G  +SDC +I+    +HK +   +E++ A  LK G DL CG  
Sbjct: 214 ASSFLMEETLRLRWGFEGMYISDCWAIRDFHLNHK-VTKNEEESAALALKRGCDLACGCE 272

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAA 372
           Y +    A Q+G I    I  +   +     +LG FD    Y  LG  ++ + +H  LA 
Sbjct: 273 YQSLEK-AFQKGLITREQIKKAAIRVMTTRFKLGQFDQGTAYDTLGLESLDSDEHAALAF 331

Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA 432
           EA+ + +VLLKND   LPL    +  LA++GP+A++ +A+ GNY GT  RY + ++G   
Sbjct: 332 EASCRSLVLLKND-ALLPLKKEAVSCLAVIGPNADSRQALWGNYHGTSSRYVTILEGLRD 390

Query: 433 Y---SKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE--- 480
           Y   S  I Y+ G        + + +++  +  A+  AK +D  V+  GL+ +VE E   
Sbjct: 391 YVGSSTRILYSEGSNLTKNKVERLAKDDDRLSEAVFMAKASDVVVLCLGLNETVEGEMHD 450

Query: 481 ------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
                   D+ DL LP  Q +L+  VA+  K P+ +V++S G++D    +    +K+++ 
Sbjct: 451 DGNGGWAGDKDDLRLPLCQRKLLKAVAETGK-PIIVVLLSGGSLDPEI-EQYANVKALIQ 508

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
             YPG+EGG+AIA +++G   P G+LP+T+Y+A     P+T   L        RTY++ D
Sbjct: 509 AWYPGQEGGKAIAHLLYGALCPSGKLPVTFYKAEAKLPPFTDYSL------IRRTYRYCD 562

Query: 595 GP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
            P V+YPFG+GLSY  F + ++++ +                    T +   AA ++   
Sbjct: 563 DPDVLYPFGFGLSYASFSFCLSAAQE--------------------TEQNGVAATVL--- 599

Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
                       V N   +D   VV +Y    G        + G + V + AG+  ++ F
Sbjct: 600 ------------VRNTSALDARTVVQLYLAMEGKDLPPHPVLCGMKSVHLKAGEETQITF 647

Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVG 742
            +   K    V    N     G +T+  G
Sbjct: 648 ILEE-KQFTAVQEDGNRYAVRGGYTLYAG 675


>gi|164428543|ref|XP_964543.2| hypothetical protein NCU00709 [Neurospora crassa OR74A]
 gi|157072187|gb|EAA35307.2| hypothetical protein NCU00709 [Neurospora crassa OR74A]
          Length = 786

 Score =  362 bits (929), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 265/773 (34%), Positives = 367/773 (47%), Gaps = 104/773 (13%)

Query: 45  AYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSE---VPGATSFPTVILTTASF 101
           A G  RLGLP Y WWSE LHGV+         PG  F++       ATSF   I   ASF
Sbjct: 8   ALGASRLGLPKYAWWSEGLHGVA-------GSPGVKFNTTGYPFSYATSFANAINLGASF 60

Query: 102 NESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYA 161
           ++ L  ++G  +STEARA  N G  GL +W+PN+N  +DPRWGR  ETPGEDP  +  Y 
Sbjct: 61  DDDLVYEVGTAISTEARAFANFGFGGLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYV 120

Query: 162 INYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQE 221
              + GL+  E V           K+ A CKHYAAYDL+ W G  R+ F++ VT QD+ E
Sbjct: 121 KAILAGLEGNETVR----------KVIATCKHYAAYDLERWHGLTRYEFEAIVTLQDLSE 170

Query: 222 TFILPFEMCVNEGDVSSVMCSYNRV-----------------NGIPTCADPKLLNQTIRG 264
            ++ PF+ C  +  V S+MCSYN +                    P CA P L+   +R 
Sbjct: 171 YYLPPFQQCARDSKVGSIMCSYNALTIRDMASGKPDEEINLTTAQPACAKPYLMT-ILRD 229

Query: 265 DWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT--MG 319
            WN+   + YI SDC++I   +  +   + T  +A A   KAG D  C    +  T  +G
Sbjct: 230 HWNWTEHNNYITSDCNAILDFLPDNHNFSQTPAEAAAAAYKAGTDTVCEVSGSPLTDVVG 289

Query: 320 AVQQGKIAEADIDTSLRFLYIVLMRLGYFD---------------GSPQYKNLGKNNICN 364
           A  Q  + EA IDT+LR LY  L+R GY D                SP Y  L   ++  
Sbjct: 290 AYNQSLLPEAVIDTALRRLYEGLIRAGYLDHGRSSAVAGGDGGSFSSPAYDALNWEDVNT 349

Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
           P   ELA  +A +GIVLLKN    LPL+    K +AL+G  ANAT  M G Y G P  Y 
Sbjct: 350 PSTQELALRSATEGIVLLKNAGSLLPLDFSG-KKVALIGHWANATGTMRGPYSGIPPFYH 408

Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           +P+      +   +YA G        ++    A+ AA+ AD  +   G D +V +E  DR
Sbjct: 409 NPLYAAQQLNLSFSYANGPVVNASDPDTWTAPALAAAEGADVVLYFGGTDTTVASEDLDR 468

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
             +  P  Q +L++++A   K P+ +VI     VD +   NN  + SILWVGYPG+ GG 
Sbjct: 469 ESIAWPETQMQLLSELAGLGK-PL-VVIQLGDQVDDSSLLNNGNVSSILWVGYPGQSGGT 526

Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVN-------------------- 583
           A+ DV+ GK  P GRLP+T Y   YV ++P T M LRP N                    
Sbjct: 527 AVFDVLTGKKAPAGRLPVTQYPEGYVDEVPLTEMALRPFNYSSSSNLEQEVSVQGRGSLT 586

Query: 584 ------------NFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQC 631
                       + PGRTYK++  PV+ PFGYGL YT F   ++ S  +           
Sbjct: 587 IQPRSTPGNKTLSSPGRTYKWYSSPVL-PFGYGLHYTTFNVSLSLSSSNASSSSSSPSFS 645

Query: 632 RDINYTVGTNKPPCAAVLIDDVK-CKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAG 689
                T      PC A  +D             + + N G      VV+++ S   G   
Sbjct: 646 IPSLLT------PCTATHLDLCPFSPSANSALSVSITNTGTHTSDYVVLLFLSGEFGPKP 699

Query: 690 THIKQVIGYERVF-IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
             +K ++ Y+RV  I  G++  V     +  ++  VD   N++L  G +  +V
Sbjct: 700 YPLKTLVSYKRVKDIKPGETVTVKDVPVSLGAISRVDGDGNTVLYPGTYRFVV 752


>gi|291544853|emb|CBL17962.1| Beta-glucosidase-related glycosidases [Ruminococcus champanellensis
           18P13]
          Length = 697

 Score =  361 bits (927), Expect = 8e-97,   Method: Compositional matrix adjust.
 Identities = 221/623 (35%), Positives = 338/623 (54%), Gaps = 80/623 (12%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y +  L   ERA+DL +R+T+ E+  Q+   A  +PRLG+P Y WW+E LHGV+  G   
Sbjct: 9   YLNPSLTPDERAEDLADRLTVEEQASQLRYDALPIPRLGIPAYNWWNEGLHGVARAGT-- 66

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   A+F+ +L  +IG+  +TEARA +            
Sbjct: 67  --------------ATMFPQAIGMAATFDTALLHQIGEITATEARAKHMAAREHGDFDIY 112

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT W+PNIN+ RDPRWGR  ET GEDP++  R  + +V+G+Q           + + L
Sbjct: 113 KGLTLWAPNINLFRDPRWGRGHETYGEDPFLTARLGVAFVKGMQ----------GEGKVL 162

Query: 186 KISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
           K +AC KH+A +      G +  R  FD++V+ +D++E+++  F   V E  V  VM +Y
Sbjct: 163 KAAACAKHFAVHS-----GPEALRHSFDAQVSPKDLEESYLPAFHALVAEAKVEGVMGAY 217

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NRVNG P+CA P L+++  +  W F GY VSDC +IQ   + H    +  E A A  L+ 
Sbjct: 218 NRVNGEPSCASPMLMDKLHQ--WGFAGYFVSDCWAIQDFHKHHGVTKNVTESA-ALALRT 274

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
           G DL+CG+ Y  + + A+++G I  ADI  +   +    +RLG FD  P +     + I 
Sbjct: 275 GCDLNCGNTYL-YVLAALEEGLIDAADIRRACIRVLRTRIRLGLFDPEPHFAACTYDTIA 333

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
           +P H  ++   A + +VLLKND G LPL+   +  +A++GP+A++  A+ GNY GT  RY
Sbjct: 334 SPAHKAVSLSCAEKSMVLLKND-GILPLDLSKLHAIAVIGPNADSRAALEGNYCGTADRY 392

Query: 424 TSPMDGFY-AYSKVINYAPGCADIVCQNNSMIPAAID-------AAKNADATVIVAGLDL 475
            + ++G   A+   ++YA GC  +     S +  A D       AA+ +D  ++  GLD 
Sbjct: 393 VTFLEGIQDAFPGRVHYAQGC-HLYKDRTSNLAMADDRYAEALAAAEASDVVILCLGLDA 451

Query: 476 SVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           ++E E           D+ DL LP  Q +L+ K+    K PV LV+ +  A+       N
Sbjct: 452 TLEGEEGDTGNEFSSGDKADLRLPPPQCKLLEKLHAVGK-PVILVLAAGSAL-------N 503

Query: 527 PKIK--SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNN 584
           P+I   ++L   YPG+ GG+A+A ++FGK +P G+LP+T+YE       +T   ++    
Sbjct: 504 PEISCNAVLQAWYPGQCGGQALAHILFGKVSPSGKLPVTFYETAEQLPDFTDYSMQ---- 559

Query: 585 FPGRTYKFFDGPVVYPFGYGLSY 607
              RTY++    V+YPFGYGL+Y
Sbjct: 560 --NRTYRYARNNVLYPFGYGLTY 580


>gi|317057539|ref|YP_004106006.1| glycoside hydrolase family protein [Ruminococcus albus 7]
 gi|315449808|gb|ADU23372.1| glycoside hydrolase family 3 domain protein [Ruminococcus albus 7]
          Length = 691

 Score =  361 bits (926), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 225/623 (36%), Positives = 335/623 (53%), Gaps = 72/623 (11%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D  L   ERA+ L + MT  E+  Q+   A  V RLG+P Y WW+E +HG++  G   
Sbjct: 4   YLDETLSAQERAEALTDEMTTEEQASQLRYDAPAVERLGIPAYNWWNEGIHGLARSGV-- 61

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   A F++ L KK  +  S EARA YN  +        
Sbjct: 62  --------------ATMFPQAIGLAAMFDDELTKKTAEVTSEEARAKYNAYSGEEDRDIY 107

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT W+PNIN+ RDPRWGR  ET GEDPY+  +  +  VRGLQ           D + +
Sbjct: 108 KGLTLWAPNINIFRDPRWGRGHETFGEDPYLTTKNGMAVVRGLQ----------GDGKVI 157

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K +AC KH+A +   +     R  FD++   +DM+ET++  FE  V E  V SVM +YNR
Sbjct: 158 KAAACAKHFAVH---SGPEAIRHSFDAKANAKDMEETYLPAFEALVKEAKVESVMGAYNR 214

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           VNG P CA   L+++    +W F GY VSDC +I+   E+H    +  E + A  LKAG 
Sbjct: 215 VNGEPACASNFLMDKL--KEWEFDGYFVSDCWAIRDFHENHMVTANAIE-STAMALKAGC 271

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNP 365
           D++CG  Y N  + A+++G + + DI T+   L    +RLG FD   +Y ++  + +   
Sbjct: 272 DVNCGCTYQNLLV-ALEKGAVTKEDIRTACVHLMRTRIRLGMFDKKTEYDDIPYDKVACK 330

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H  ++ E A + +V+L+N NG LP++T   KT+A++GP+A++  A+ GNY G   RYT+
Sbjct: 331 EHKAISLECAEKSLVMLEN-NGILPVDTSKYKTIAVIGPNADSRTALEGNYNGLSDRYTT 389

Query: 426 PMDGFY-AYSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
            ++G    +   + +A GC         + Q       A+ AAK AD T++  GLD ++E
Sbjct: 390 FLNGIQDRFDGRVIFAEGCHLYKDRVSNLAQAGDRYAEAVAAAKFADMTILCLGLDATIE 449

Query: 479 AE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
            E           D+  L LP  Q EL+ K+    K PV  V+ +  A++        K 
Sbjct: 450 GEEGDTGNEFSSGDKNGLTLPPPQRELVKKIMAVGK-PVVTVVCAGSAIN-----TESKP 503

Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP-YTSMPLRPVNNFPGR 588
            +++   YPG EGG+A+A+V+FG  +P G+LP+T+YE +  K+P +T   ++      GR
Sbjct: 504 DALIHAFYPGAEGGKALAEVLFGDVSPSGKLPVTFYE-DTDKLPEFTDYSMK------GR 556

Query: 589 TYKFFDGPVVYPFGYGLSYTQFK 611
           TY++    V+YPFGYGL+Y   K
Sbjct: 557 TYRYTTENVLYPFGYGLTYGSVK 579


>gi|157676888|emb|CAP07659.1| beta-xylosidase [uncultured rumen bacterium]
          Length = 761

 Score =  359 bits (922), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 263/801 (32%), Positives = 374/801 (46%), Gaps = 149/801 (18%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
            +  Y D   P   RAK L+ +++L EK   +   +  V RLG+  Y WWSEALHGV+  
Sbjct: 27  QEISYTDKSQPAELRAKALLPKLSLEEKAGLVQYNSPAVERLGIKAYNWWSEALHGVARN 86

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---- 125
           G                 AT FP  I   ASF+    + +   VS EAR    +      
Sbjct: 87  G----------------SATVFPQPIGMAASFDVEKIETVFTAVSDEARVKNRIAAEDGR 130

Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
               AGL+FW+PNIN+ RDPRWGR +ET GEDPY++G+  +  VRGLQ         D D
Sbjct: 131 VYQYAGLSFWTPNINIFRDPRWGRGMETYGEDPYLMGQLGMAVVRGLQG--------DPD 182

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
           +  LK  AC KHYA +     E N R  FD++V+E+D++ET++  F+  V +  V  VM 
Sbjct: 183 ADVLKTHACAKHYAVH--SGLESN-RHRFDAQVSERDLRETYLPAFKDLVTKAGVKEVMT 239

Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVAR 299
           +YNR  G P  A   L+ + +R +W + G +VSDC +I    E   H F+  T E+A A 
Sbjct: 240 AYNRFRGYPCAASEYLVQKILREEWGYKGLVVSDCWAIPDFFEPGRHGFVA-TGEEAAAL 298

Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK 359
            +  GLD++CG  ++     A+ QG + E D+D +L  +     RLG  DG   + +L  
Sbjct: 299 AVANGLDVECGSTFSKIP-AAIDQGLLKEEDLDRNLLRVLTERFRLGEMDGESPWDDLDP 357

Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
             +  P+H  L+ + AR+ +VLL+N NG LPL  G  + +AL+GP+A+  +   GNY   
Sbjct: 358 AIVEGPEHRALSLDIARETMVLLRN-NGVLPLKAG--EKIALIGPNADDAQMQWGNYNPV 414

Query: 420 PC-------------------RYTSPMDGFY-----AYSKVI------------NYAPGC 443
           P                    R    +D  Y     AY+ +I             YA   
Sbjct: 415 PKSTITLLQAMQARVPGLVYDRACGILDAEYAPQGSAYANLIGASEAQLEAAARRYAVSV 474

Query: 444 ADIVC-------QNNSMIPAAIDAA-----KNADATVIVAGLDLSVEAE----------G 481
            DI         Q  S +PA  +AA     +  D  V   G+   +E E          G
Sbjct: 475 NDIKNYIRRDEEQRRSFMPALDEAAVLKKLEGVDVVVFAGGISPRLEGEEMRVQVPGFSG 534

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
            DR D+ LPG Q  L+  + DA K  V LV  S  A  I          +IL   YPG+E
Sbjct: 535 GDRTDIELPGVQRRLLKALHDAGK-KVVLVNFSGCA--IGLVPETESCDAILQAWYPGQE 591

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
           GG AIADV+FG  NP G+LP+T+Y+ N  ++P          N  G TY++F G  +YPF
Sbjct: 592 GGTAIADVLFGDVNPSGKLPVTFYK-NVDQLPDVED-----YNMEGHTYRYFRGEPLYPF 645

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           GYGLSYT F +     PK                                 VK K+    
Sbjct: 646 GYGLSYTSFAF---GEPK---------------------------------VKGKN---- 665

Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
            +I+V N G + G+EVV +Y + P      +K +  + RV + AGQ+ KV   ++    L
Sbjct: 666 LEIDVTNTGSVAGTEVVQLYVRKPDDTAGPVKTLRAFRRVSVPAGQTVKVSIPLDKETFL 725

Query: 722 KIVDNAANSLLASGAHTILVG 742
              +   + +   G + +L G
Sbjct: 726 WWSEKDQDMVPVRGRYELLCG 746


>gi|336463686|gb|EGO51926.1| hypothetical protein NEUTE1DRAFT_125528 [Neurospora tetrasperma
           FGSC 2508]
          Length = 788

 Score =  359 bits (921), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 263/777 (33%), Positives = 371/777 (47%), Gaps = 110/777 (14%)

Query: 45  AYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSE---VPGATSFPTVILTTASF 101
           A G  R+GLP Y WWSE LHGV+         PG  F++       ATSF   I   ASF
Sbjct: 8   ALGASRIGLPKYAWWSEGLHGVA-------GSPGVTFNTTGYPFSYATSFANAINLGASF 60

Query: 102 NESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYA 161
           ++ L  ++G  +STEARA  N G  GL +W+PN+N  +DPRWGR  ETPGEDP  +  Y 
Sbjct: 61  DDDLVYEVGTAISTEARAFANFGFGGLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYV 120

Query: 162 INYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQE 221
              + GL+  E V           K+ A CKHYAAYDL+ W G  R+ F++ VT QD+ E
Sbjct: 121 KAMLAGLEGNETVR----------KVIATCKHYAAYDLERWHGLTRYEFEAIVTLQDLSE 170

Query: 222 TFILPFEMCVNEGDVSSVMCSYNRV-----------------NGIPTCADPKLLNQTIRG 264
            ++ PF+ C  +  V S+MCSYN +                    P CA+  L+   +R 
Sbjct: 171 YYLPPFQQCARDSKVGSIMCSYNALTIRDMAGGNPDEIINLTTAQPACANTYLMT-ILRD 229

Query: 265 DWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT--MG 319
            WN+   + YI SDC++I   +  +   + T  +A A   KAG D  C    +  T  +G
Sbjct: 230 HWNWTEHNNYITSDCNAILDFLPDNHNFSQTPAEAAAAAYKAGTDTVCEVSGSPLTDVVG 289

Query: 320 AVQQGKIAEADIDTSLRFLYIVLMRLGYFD---------------GSPQYKNLGKNNICN 364
           A  Q  + EA IDT+LR LY  L+R GY D                SP Y  L   ++  
Sbjct: 290 AYNQSLLPEAVIDTALRRLYEGLIRAGYLDHGRSSAVAGGDGGSFSSPAYDALNWEDVNT 349

Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
           P   ELA  +A +GIVLLKN    LPL+  + K +AL+G  ANAT  M G Y G P  Y 
Sbjct: 350 PSTQELALRSATEGIVLLKNSGSLLPLDFSSGKKVALIGHWANATGTMRGPYSGIPPFYH 409

Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           +P+      +   +YA G        ++    A+ AA+ AD  +   G D +V +E  DR
Sbjct: 410 NPLYAAQQLNLSFSYANGPVVNASDPDTWTAPALAAAEGADVVLYFGGTDTTVASEDLDR 469

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
             +  P  Q +L++++A   K P+ +VI     VD +F   N  + SILWVGYPG+ GG 
Sbjct: 470 ESIAWPKAQMKLLSELAGLGK-PL-VVIQLGDQVDDSFLLENGNVSSILWVGYPGQSGGT 527

Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNF------------------ 585
           A+ DV+ GK  P GRLP+T Y   YV ++P T M LRP N+                   
Sbjct: 528 AVFDVLTGKKAPAGRLPVTQYPEGYVDEVPLTEMALRPFNHSSSTSSSSNPEEEVSVQGS 587

Query: 586 ------------------PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
                             PGRTYK++  PV+ PFGYGL YT F         +V + L  
Sbjct: 588 GSLTIQPRSTPGNKTLSSPGRTYKWYSNPVL-PFGYGLHYTTF---------NVSLSLSS 637

Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVK-CKDYKFTFQIEVENMGKMDGSEVVMVY-SKPP 685
           +      ++++ +   PC A  +D             I + N G      V +++ S   
Sbjct: 638 NASSPSPSFSIPSLLTPCTATHLDLCPFSPSANSALSISITNTGTHTSDYVALLFLSGEF 697

Query: 686 GIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
           G     +K ++ Y+RV  I  G++  V     +  ++  VD   N++L  G +   V
Sbjct: 698 GPKPYPLKTLVSYKRVKDIKPGETVTVKDVPVSLGAISRVDGDGNTVLYPGTYRFAV 754


>gi|443695317|gb|ELT96258.1| hypothetical protein CAPTEDRAFT_179825 [Capitella teleta]
          Length = 750

 Score =  357 bits (917), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 245/774 (31%), Positives = 387/774 (50%), Gaps = 104/774 (13%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQM----GDLAYGVPRLGLPLY 56
           RF      L  FP+ +  LP   R  DL+ R+T+ + + Q     G    G+ RLG+   
Sbjct: 26  RFAPSSHALDSFPFRNVSLPIETRLNDLISRLTIEDAINQTVARYGKFTPGIERLGIKPI 85

Query: 57  EWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTE 116
           E+ +E L GV    RR N             AT FP  +   ASF+  L +++   VS E
Sbjct: 86  EYITECLRGV----RREN-------------ATGFPQALGLAASFSRDLMQRVATAVSVE 128

Query: 117 ARAMYN-------LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
            RA YN        G  G+T +SP IN++R P WGR  ET GEDPY+ G  A  YV GLQ
Sbjct: 129 VRAFYNHDIQRETYGAHGITCFSPVINILRHPLWGRNQETYGEDPYLSGELASQYVSGLQ 188

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                      D R L++SA CKH+ A+   +     +F FD+++ E+D+Q TF+  F+ 
Sbjct: 189 G---------DDPRYLRVSAGCKHFDAHGGPDTIPVRKFGFDAKIEERDLQMTFLPAFKK 239

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
           C+      +VMCS+N +NG+P+CA+ +LL   +R  W + G++VSD  +++ I   H + 
Sbjct: 240 CI-AAKPYNVMCSFNSINGVPSCANKRLLTDVLRAQWGYEGFVVSDDAAVEYIFTEHHY- 297

Query: 290 NDTKEDAVARVLKAGLDLD-CGDY---YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRL 345
           N + E A    +K+G +++  G +   Y   T  A+ +  I + ++  ++R +++    L
Sbjct: 298 NSSFETAAVEAIKSGCNMELVGKFDPSYWQLT-KALNEHLITKDELMENVRPVFLTRFLL 356

Query: 346 GYFD--GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG 403
           G FD      +  + K+ + + +H  LA EAA +  VLLKND   LPL   ++KT+A+VG
Sbjct: 357 GEFDPPALNPFNQITKDVVLSAEHQRLALEAAVKSFVLLKNDRNFLPLLKNSLKTVAVVG 416

Query: 404 PHANATKAMIGNY--EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQN--NSMIPAAID 459
           P +N T  +IG+Y  +  P    +P+ G    +  + +A GC++  C +   + + AA+D
Sbjct: 417 PMSNYTDGLIGDYSTDTDPSLILTPLHGIKKLAPNVQFASGCSNSTCTDYRATDVAAAVD 476

Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG-PVTLVIMSAGAV 518
            A+      +  G    VEAE  DR D++LPG Q +L+      A G PV L++ + G +
Sbjct: 477 GAQ---VVFVALGTGFIVEAENNDRSDIVLPGAQLQLLKDAVYHANGRPVVLLLFNGGPL 533

Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIF---GKYNPGGRLPITWYEANYVKIP-Y 574
           D+ FA+    I SI+   +P    G AI  ++    G  +P GRLP+TW  A   ++P  
Sbjct: 534 DVTFAQLTSGIVSIVECFFPAMMTGEAIYRMLINNEGISSPAGRLPLTW-PAYLNQVPNI 592

Query: 575 TSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDI 634
           T   ++      GRTY+++    +YPFGYGLSYTQFKY   S  K   +++ K Q+ R  
Sbjct: 593 TDYTMK------GRTYRYYTEDPLYPFGYGLSYTQFKY---SDLKVTPLEVTKGQEIR-- 641

Query: 635 NYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEV-VMVYSKPPGIAGTHIK 693
                                       +++V N+G  D  EV ++V         T I 
Sbjct: 642 ---------------------------VKVKVTNIGLYDADEVRIIVVQAYVSWPKTEIP 674

Query: 694 ----QVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
               Q++ ++R+ IA+G+S  V  T+ A   L++  N      +  G  T+ +G
Sbjct: 675 VPRWQLVAFDRIHIASGKSETVELTIEA-SLLEVWQNPETGFDILEGEMTLYIG 727


>gi|348684866|gb|EGZ24681.1| hypothetical protein PHYSODRAFT_325770 [Phytophthora sojae]
          Length = 805

 Score =  355 bits (912), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 252/768 (32%), Positives = 381/768 (49%), Gaps = 86/768 (11%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPR-----LGLPLYEWWSEALHG 65
           + P+C+  L   +R +DL+ R+ L EK   +   A   PR     +GLP Y W +  +HG
Sbjct: 34  ELPFCNTSLSTADRVEDLLSRLPLQEKATLL--TARASPRGNMSSIGLPEYNWGANCVHG 91

Query: 66  V-SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
           V S  G  TN P            TSFP  +   A F+  +   + Q +  E RA++  G
Sbjct: 92  VQSTCG--TNCP------------TSFPNPVNLGAIFDPQVVFDMAQVIGWELRALWLEG 137

Query: 125 ---------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
                    + GL  WSPNIN+ RDPRWGR  ETP EDP V  +Y + Y RGLQ+     
Sbjct: 138 ATENYKGGPHLGLDCWSPNININRDPRWGRNTETPSEDPLVNSKYGVAYTRGLQE----- 192

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
             +  D R L+     KHYAAY  +N+ G +R  FD+ V+  D  +T+   F   V +G+
Sbjct: 193 -GKRQDPRFLQAVVTLKHYAAYSYENYGGVNRMEFDAIVSPYDFADTYFPAFRSSVVDGN 251

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKED 295
              VMCSYN VNGIP CA+ +L+   +RG   F GY+ SD  +++ I + H +  D++ +
Sbjct: 252 AKGVMCSYNSVNGIPMCANKELVETLLRGTLGFDGYVTSDSGAVEAISDMHHYA-DSQCE 310

Query: 296 AVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQ 353
           A    + AG D++ G  Y       V   ++ E  +D +LR    +   LG FD      
Sbjct: 311 AARLAILAGTDINSGKSYEACLKTLVDDNQLEEKALDDALRHTLKLRFELGLFDPIDDQP 370

Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
           Y N+  + +       L+  A R+ +V+L+N+   LPL  G    LA++GPHA + + ++
Sbjct: 371 YWNVTPSEVNTAAAKALSLNATRKSLVMLQNNASVLPLQKG--VKLAVLGPHAKSKRGLL 428

Query: 414 GNYEGTPCR--------YTSPMDGFYAYSKVIN--YAPGCADIVCQNNSMIPAAIDAAKN 463
           GNY G  C           +P+D   A +   N  +A GC  I   + +    A+ AAK 
Sbjct: 429 GNYLGQMCHGDYDEVGCVQTPLDAIRAANGASNTTFAEGCG-ISGNSTAGFEKAVAAAKE 487

Query: 464 ADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFA 523
           ADA V+  G+D S+E E  DR ++ LP  Q +L+ +V   A G  T+V++  G V I   
Sbjct: 488 ADAVVLFLGIDKSIEGEVGDRNNIDLPNIQMQLLQRVH--AVGRPTVVVLINGGV-IGAE 544

Query: 524 KNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPV 582
           +   +  +++   YPG  G RA+ADV+FG  NP G+LP+T Y ++YV ++   SM +   
Sbjct: 545 EIIERTDALVEAFYPGFFGARAMADVLFGDTNPSGKLPVTMYRSDYVDQVEMKSMDMTA- 603

Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
              PGRTY++F G  V+PFG+GLSYT F   V S                  N +  +N 
Sbjct: 604 --HPGRTYRYFKGEPVFPFGWGLSYTTFSLSVDSG----------------TNSSSHSNN 645

Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-----PGIAGTHIKQVIG 697
              +   + D        T  + V+N G++ G EVV+ + +P      G A    +Q+  
Sbjct: 646 AAFSGGEVSDTA----NVTISVVVKNDGEVAGDEVVLAFFRPVNSNVTGPATLLNEQLFD 701

Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
           Y+RV +    S +V FT+    +L + D   N     G++ ++V  GV
Sbjct: 702 YQRVSLGPLDSTEVSFTIER-STLALPDEEGNLASFPGSYEVIVSNGV 748


>gi|194700280|gb|ACF84224.1| unknown [Zea mays]
          Length = 452

 Score =  355 bits (910), Expect = 7e-95,   Method: Compositional matrix adjust.
 Identities = 188/456 (41%), Positives = 269/456 (58%), Gaps = 16/456 (3%)

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNN 361
           +D++CG Y  +    A+QQGKI E DI+ +L  L+ V MRLG F+G P+   Y ++G + 
Sbjct: 1   MDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGDIGPDQ 60

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGA--LPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
           +C  +H +LA EAA+ GIVLLKND GA  LPL+  N+ +LA++G +AN    + GNY G 
Sbjct: 61  VCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRGNYFGP 120

Query: 420 PCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
           PC   +P+     Y K  ++  GC    C N + IP A+ AA +AD+ V+  GLD   E 
Sbjct: 121 PCVTVTPLQVLQGYVKDTSFVAGCNSAAC-NVTTIPEAVQAASSADSVVLFMGLDQDQER 179

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
           E  DR+DL LPG Q  LI  VA+AAK PV LV++  G VD++FAK NPKI +ILW GYPG
Sbjct: 180 EEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPG 239

Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPV 597
           E GG AIA V+FG++NPGGRLP+TWY  ++ ++P T M +R  P   +PGRTY+F+ GP 
Sbjct: 240 EAGGIAIAQVLFGEHNPGGRLPVTWYPQDFTRVPMTDMRMRADPATGYPGRTYRFYRGPT 299

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           V+ FGYGLSY+++ ++ A+ P             + +  T G          I    C  
Sbjct: 300 VFNFGYGLSYSKYSHRFATKPPPT----SNVAGLKAVEATAG-GMASYDVEAIGSETCDR 354

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPPGI---AGTHIKQVIGYERVFIAAGQSAKVGFT 714
            KF   + V+N G MDG   V+V+ + P     +G    Q+IG++ + + A Q+A V F 
Sbjct: 355 LKFPAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSLHLRATQTAHVEFE 414

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
           ++ CK           ++  G+H ++VGE    +SF
Sbjct: 415 VSPCKHFSRATEDGRKVIDQGSHFVMVGEDEFEMSF 450


>gi|363742357|ref|XP_003642627.1| PREDICTED: probable beta-D-xylosidase 5-like [Gallus gallus]
          Length = 748

 Score =  354 bits (909), Expect = 9e-95,   Method: Compositional matrix adjust.
 Identities = 252/764 (32%), Positives = 379/764 (49%), Gaps = 106/764 (13%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQM---GDLAYG----VPRLGLPLYEWWSEALH 64
           FP+ D  LP+  R +DL+ R+T  E V QM   G L  G    +PRLG+  Y W +E L 
Sbjct: 27  FPFRDPTLPWHRRLEDLLGRLTPAEMVLQMARGGALGNGPAPPIPRLGIAPYNWNTECLR 86

Query: 65  GVSFIGRRTNSPPGTHFDSEVPG-ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           G                D+E PG AT+FP  +   A+F+  L  ++    +TE RA +N 
Sbjct: 87  G----------------DAEAPGWATAFPQALGLAAAFSPELVYRVANATATEVRAKHNS 130

Query: 124 --------GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
                    + GL+ +SP +N++R P WGR  ET GEDPY+    A ++V+GLQ      
Sbjct: 131 FVAAGRYDDHTGLSCFSPVLNIMRHPLWGRNQETYGEDPYLTAELATSFVQGLQG----- 185

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
                  R +K SA CKH++ +         R  FD++V E+D   TF+  F+ CV  G 
Sbjct: 186 ----QHPRYIKASAGCKHFSVHGGPENIPVSRLSFDAKVLERDWHTTFLPQFQACVRAGS 241

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKED 295
            S  MCSYNR+NG+P CA+ KLL   +RG+W F GY+VSD  +++ I+  H++ +   E 
Sbjct: 242 YS-FMCSYNRINGVPACANKKLLTDILRGEWGFEGYVVSDEGAVELILLGHRYTHTFLET 300

Query: 296 AVARVLKAGLDLDCGDYYTNFTM----GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS 351
           A+A V  AGL+L+      N        A+  G I    +   +R L+   +RLG FD  
Sbjct: 301 AIASV-NAGLNLELSYGMRNNVFMHIPKALAMGNITLEMLRDRVRPLFYTRLRLGEFDPP 359

Query: 352 PQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT 409
               Y  L  + + + +H  L+ EAA +  VLLKN    LPL   + K LA+VGP A+  
Sbjct: 360 AMNPYNALELSVVQSSEHRNLSLEAAIKSFVLLKNQRDTLPLRELHGKRLAVVGPFADNP 419

Query: 410 KAMIGNYEGTP-CRYT-SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADAT 467
           + + G+Y   P  +Y  +P  G       +++A GC +  C   S      +A + AD  
Sbjct: 420 RVLFGDYAPVPEPQYIYTPRRGLQTLPANVSFAAGCREPRCWVYSRDEVE-NAVRGADVV 478

Query: 468 VIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG-PVTLVIMSAGAVDINFAKNN 526
           ++  G  + VE E +DR DL LPG Q +L+     AA G PV L++ +AG +D+++A+ +
Sbjct: 479 LVCLGTGIDVEMEARDRKDLSLPGHQLQLLQDAVRAAAGHPVILLLFNAGPLDVSWAQLH 538

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKY--NPGGRLPITWYEANYVKIPYTSMPLRPVNN 584
             + +IL   +P +  G AIA V+ GK   +P GRLP TW  A   ++P       P+ N
Sbjct: 539 DGVGAILACFFPAQATGLAIASVLLGKQGASPAGRLPATW-PAGMHQVP-------PMEN 590

Query: 585 F--PGRTYKFF--DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGT 640
           +   GRTY+++  + P +YPFGYGLSYT F Y+        D+ L               
Sbjct: 591 YTMEGRTYRYYGQEAP-LYPFGYGLSYTTFHYR--------DLVLS-------------- 627

Query: 641 NKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK--PPGIAGTHIKQVIGY 698
             PP   +      C +   +  + +EN G  D  EVV +Y +   P +      Q++ +
Sbjct: 628 --PPVLPI------CAN--LSVSVVLENTGPRDSEEVVQLYLRWEQPSVPVPRW-QLVAF 676

Query: 699 ERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            RV + AG + K+ F + A +    +       L  GA T+  G
Sbjct: 677 RRVAVPAGGATKLSFGVTAAQRAVWMQQWH---LEPGAFTLFAG 717


>gi|340369765|ref|XP_003383418.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 748

 Score =  354 bits (908), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 254/773 (32%), Positives = 369/773 (47%), Gaps = 112/773 (14%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGD-------LAYGVPRLGLPLYEWWSEAL 63
           +FP+ D  LP  ER KD+V++++L + V+QM          A G+P+  +  Y+W +E L
Sbjct: 26  EFPFRDPSLPIEERVKDIVDQLSLDQLVEQMAHGGAGSNGPAPGIPKFNIKPYQWGTECL 85

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
            G                D     ATSFP  I   ASFN  L K++    + E RA    
Sbjct: 86  SG----------------DVNAGDATSFPMSIGMAASFNYDLLKQVSNATAYEVRAKNTA 129

Query: 124 G--------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
                    + GL+ WSP +N++RDPRWGR  ET GEDPY+ G     +V GLQ      
Sbjct: 130 AVLNGSYAFHTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAFVTGLQG----- 184

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
                D   +  +A CKH+  +         R  FD+ VT  D + TF+  F+ CV  G 
Sbjct: 185 ----DDPTYVIANAGCKHFDVHGGPEDTPLPRASFDANVTMIDWRMTFLPQFKACVEAGA 240

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLND---- 291
           +S +MCSYNR+NG+P CA+ KLL   +R +WNF GY+VSD  +++ IV  H +  D    
Sbjct: 241 LS-LMCSYNRINGVPACANKKLLTDILRNEWNFKGYVVSDQGALENIVTQHHYAPDFVTA 299

Query: 292 -TKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF-- 348
                     L+ G     G    +    AV++G ++   +  ++  L+ V  +LG F  
Sbjct: 300 AADAANAGTCLEDGNSEGKGGNVFDNLDDAVEKGLVSVDTLKDAVSRLFYVRTKLGEFDP 359

Query: 349 -DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGA---LPLNTGNIKTLALVGP 404
            D +  Y N+  + I + +HI+L+ +AA + IVL+KNDN     LPL   + K   +VGP
Sbjct: 360 PDNNNPYANIPLSIIQSDEHIKLSIQAAMETIVLMKNDNDGSPFLPLAADDFKKACVVGP 419

Query: 405 HANATKAMIGNYEGTPCR--YTSPMDGFYAY---SKVINYAPGCAD-IVCQNNSMIPAAI 458
                  M G+Y  T       +P+ G       S ++NY  GC D   C+         
Sbjct: 420 FIENADTMFGDYSPTMMTDYIVTPLAGIKTTQIGSDLLNYEDGCTDGPACEIYDGYKVRT 479

Query: 459 DAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAA-KGPVTLVIMSAGA 517
            A +  D  ++ AGL   +E EG D  D+ LPG Q  L+     A+   P+ L++ +A  
Sbjct: 480 -ACEGVDLVIVTAGLSRYLEHEGHDISDIYLPGHQMSLLTDAESASGSAPIILLLFNANP 538

Query: 518 VDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP---- 573
           +DI++AK+NP+  +IL   YPG+E G AIA+V+ G YNP GRLP TW  A+  ++P    
Sbjct: 539 LDISYAKSNPRFAAILEAYYPGQEAGVAIANVLTGSYNPAGRLPNTW-PASLDQVPDMID 597

Query: 574 YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRD 633
           YT            RTY++F    +YPFGYGLS+T F Y                    D
Sbjct: 598 YT---------MKERTYRYFTQEPLYPFGYGLSFTTFNYS-------------------D 629

Query: 634 INY--TVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH 691
           +N   T  TN     AV               + V N G MDG EV   Y K   +A   
Sbjct: 630 LNVASTANTNGEGSIAV--------------SVTVMNTGTMDGDEVTQAYVKWDNVAEAP 675

Query: 692 IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS--LLASGAHTILVG 742
             Q++G  R FI+ GQS  V FT+   + L++  N  +    +  G +++ VG
Sbjct: 676 NIQLVGVSRKFISKGQSITVSFTIKP-EQLQVWINGDDGKWSIPGGTYSLFVG 727


>gi|424661938|ref|ZP_18098975.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
           616]
 gi|404578249|gb|EKA82984.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
           616]
          Length = 722

 Score =  354 bits (908), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 239/742 (32%), Positives = 369/742 (49%), Gaps = 92/742 (12%)

Query: 16  DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
           D   P   R K L+++MTL EK  Q+   +  +PRL LP Y +W+E LHGV+  G     
Sbjct: 53  DLSQPIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108

Query: 76  PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
                        T FP  I   ++++  L K++   +STEAR  Y     GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156

Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
           N+ RDPRWGR  ET GEDPY+  R  + +V+GLQ               LK  A  KH+ 
Sbjct: 157 NMARDPRWGRNEETYGEDPYLTSRLGVAFVKGLQ---------GDHPAYLKTVATIKHFV 207

Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
           A    N E N+RF   S++  + + E +   +E CV E DV SVM +YN  NG+P     
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEADVQSVMTAYNAFNGVPPSGSR 263

Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
            LL + +R +W F G++VSDC +I  +   H+ +N   E+A A  + +G DL+CG  Y  
Sbjct: 264 WLLGEVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL-EEAAALGVNSGCDLECGTTYKE 322

Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
             + AV+QG I+EA ID +L  +     +LG FD      Y +  K  +   +  ELA E
Sbjct: 323 KLVQAVKQGLISEATIDQALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG---F 430
           AA + +VLLKN+N  LPL+    K++A+VGP A+     +G Y G P    + + G    
Sbjct: 383 AAVKSVVLLKNEN-LLPLSKEKTKSVAVVGPFADHN--YLGGYSGQPPYSITLLKGVKDL 439

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
                 +NY  G   I    +S++     A K  D  ++  G D  +  E  D   + LP
Sbjct: 440 MGKRGKVNYLNG---IGASRDSIVA----AVKGVDVVLVALGSDEKMARENHDMTSIYLP 492

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
             Q +L+  +       + LV  S   +   +A  +  I +I+   YPG+E GRA+A+++
Sbjct: 493 EEQEKLLKAIYQ-VNPRIVLVFHSGNPLTSEWADTH--IPAIMQAWYPGQEAGRALANLL 549

Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
           FG  NP G+LP+T Y+          +P +   + + GRTY++  G  +Y FG+GLSYT 
Sbjct: 550 FGNENPSGKLPMTIYKTE------EQLPDILDFDMWKGRTYRYMKGEPLYSFGHGLSYTS 603

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
           F++       +  ++ D   QC                                +E+ N 
Sbjct: 604 FEFDNIQGNDT--LQPDAILQC-------------------------------SVELSNS 630

Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           G++ G EVV VY S+      T+ +K+++ +++V +A+G+  KV FT+ A + L + ++ 
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689

Query: 728 ANSLLASGAHTILVGEGVGGVS 749
              +L SG +T+ +G G  G++
Sbjct: 690 KWRML-SGKYTLFIGSGQPGLA 710


>gi|336275603|ref|XP_003352555.1| hypothetical protein SMAC_01389 [Sordaria macrospora k-hell]
 gi|380094444|emb|CCC07823.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 833

 Score =  353 bits (907), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 274/807 (33%), Positives = 378/807 (46%), Gaps = 136/807 (16%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD+    P+RA  LVE++T+ EK+  + D + G PRLGLP Y WWSE LHGV+       
Sbjct: 37  CDSTASAPDRAASLVEQLTIDEKLVNLVDQSKGAPRLGLPPYAWWSEGLHGVA------- 89

Query: 75  SPPGTHFDSE---VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
             PG  F++       ATSF  VI   A+ ++ L  ++G  +STEARA    G  GL +W
Sbjct: 90  GSPGVVFNTSGYPFSYATSFANVITLGAALDDDLVYEVGTAISTEARAFAKFGFGGLDYW 149

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
           +PNIN  +DPRWGR  ETPGEDP  +  Y    V GL+    V           K+ A C
Sbjct: 150 TPNINPYKDPRWGRGAETPGEDPLRIKGYVKAMVAGLEGNGTVR----------KVIATC 199

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC---------- 241
           KH+AAYDL+ W G  R+ FD+ V+ QD+ E ++ PF+ C  +  V S+MC          
Sbjct: 200 KHFAAYDLERWRGLTRYDFDAVVSLQDLSEYYLPPFQQCARDSRVGSIMCRYVSFFLPPF 259

Query: 242 ----------------------SYNRVNGIPTCADPKLLNQTIRGDWNF---HGYIVSDC 276
                                 SYN +NG P CA   L+   +R  WN+   + YI SDC
Sbjct: 260 PSFPRLVTRQSGNQVDIVDNFRSYNALNGTPACASTYLMTNILRDHWNWTNHNNYITSDC 319

Query: 277 DSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAVQQGKIAEADID 332
           ++IQ  +  +   + T  +A A    AG D  C       YT+  +GA  Q  ++E+ ID
Sbjct: 320 NAIQDFLPDNHNFSQTPAEAAAAAYIAGTDTVCEVSGWPPYTD-VVGAYNQSLLSESVID 378

Query: 333 TSLRFLYIVLMRLGYFD-GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPL 391
           T+LR LY  L+R GY D G P   +  K    +P                       LPL
Sbjct: 379 TALRRLYEGLIRAGYLDHGRPASSSPDKAPFSSPDF---------------------LPL 417

Query: 392 N-TGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQN 450
           + TG  KT+AL+G  ANAT+ + G Y G P  Y +PM           YA G        
Sbjct: 418 DLTG--KTVALIGHWANATRTIRGPYSGLPPFYHNPMYAVRQLKLSFYYANGPVVNSTDA 475

Query: 451 NSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTL 510
           ++   AA+ AA++AD  +   G D +V +E  DR  +  P  Q  LI K+A   K  V  
Sbjct: 476 DTWTAAAMLAAESADVVLYFGGTDTTVASEDLDRESIAWPKTQLTLIEKLAQVGKPMV-- 533

Query: 511 VIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV 570
           VI     VD     NN  I SILWVGYPG+ GG A+ DV+ GK    GRLP+T Y A YV
Sbjct: 534 VIQLGDQVDDTPLLNNKNISSILWVGYPGQSGGTAVFDVLTGKKASAGRLPVTQYPAGYV 593

Query: 571 -KIPYTSMPLRPVNNF----------------------------------PGRTYKFFDG 595
            ++P T M LRP N+                                   PGRTYK++  
Sbjct: 594 DEVPLTEMGLRPFNHSSSTTSSDVSQSGVEEGNGLTIQTRSTRGNKTLSSPGRTYKWYPR 653

Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
           PV+ PFGYGL YT F   ++ S  S +     D     I   + +    C A+ +D    
Sbjct: 654 PVL-PFGYGLHYTPFNISLSLS-TSSNASSTTDNTSISIRSLLTSQT--CTAIHLDLCPF 709

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF-IAAGQSAKVG- 712
                 F + + N G      V +++ S   G     +K ++GY+RV  I  G++  VG 
Sbjct: 710 S----PFSVSITNTGSHTSDYVALLFLSGKFGPKPDPLKTLVGYKRVKDIKPGETRVVGG 765

Query: 713 --FTMNACKSLKIVDNAANSLLASGAH 737
               +N   ++  VD   N++L  G +
Sbjct: 766 EDIPVN-LAAVARVDGNGNTVLYPGTY 791


>gi|265765457|ref|ZP_06093732.1| beta-xylosidase [Bacteroides sp. 2_1_16]
 gi|263254841|gb|EEZ26275.1| beta-xylosidase [Bacteroides sp. 2_1_16]
          Length = 722

 Score =  351 bits (900), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 240/742 (32%), Positives = 369/742 (49%), Gaps = 92/742 (12%)

Query: 16  DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
           D   P   R + L+++MTL EKV Q+   +  +PRL LP Y +W+E LHGV+  G     
Sbjct: 53  DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108

Query: 76  PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
                        T FP  I   ++++  L K++   +STEAR  Y     GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156

Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
           N+ RDPRWGR  ET GEDP++  R  + +V+GLQ               LK  A  KH+ 
Sbjct: 157 NMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ---------GDHPTYLKTVATIKHFV 207

Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
           A    N E N+RF   S++  + + E +   +E CV E +  SVM +YN  NG+P     
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263

Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
            LL+  +R +W F G++VSDC +I  +   H+ +N   E+A A  + +G DL+CG  Y  
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL-EEAAALGVNSGCDLECGTTYKE 322

Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
             + AV+QG I+EA ID +L  +     +LG FD      Y +  K  +   +  ELA E
Sbjct: 323 KLVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
           AA + +VLLKND   LPLN   IK++A+VGP A+     +G Y G P    S + G    
Sbjct: 383 AAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKG---- 435

Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAA---KNADATVIVAGLDLSVEAEGKDRVDLLLP 490
              +    G    V   N M  +A   A   K AD  ++  G D  +  E  D   + LP
Sbjct: 436 ---VKELIGKKGKVTYLNGMGTSADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLP 492

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
             Q +L+ K+       + LV  +   +   +A  +  I +I+   YPG+E GRA+A+++
Sbjct: 493 EEQEKLLKKIYQ-VNPRIVLVFHTGNPLTSEWADTH--IPAIMQAWYPGQEAGRALANLL 549

Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
           FG  NP G+LP+T Y+          +P +   + + GRTY++  G  +Y FG+GLSYT 
Sbjct: 550 FGNENPSGKLPMTIYKTE------EQLPDILDFDMWKGRTYRYMKGEPLYGFGHGLSYTS 603

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
           F++       +  ++ D   QC                                +E+ N 
Sbjct: 604 FEFDNIQGNDT--LQPDAILQC-------------------------------SVELSNS 630

Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           G++ G EVV VY S+      T+ +K+++ +++V +A+G+  KV FT+ A + L + ++ 
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689

Query: 728 ANSLLASGAHTILVGEGVGGVS 749
              +L SG +T+ +G G  G++
Sbjct: 690 KWRML-SGKYTLFIGSGQPGLA 710


>gi|375357164|ref|YP_005109936.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
 gi|301161845|emb|CBW21389.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
          Length = 722

 Score =  351 bits (900), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 240/742 (32%), Positives = 369/742 (49%), Gaps = 92/742 (12%)

Query: 16  DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
           D   P   R + L+++MTL EKV Q+   +  +PRL LP Y +W+E LHGV+  G     
Sbjct: 53  DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108

Query: 76  PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
                        T FP  I   ++++  L K++   +STEAR  Y     GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156

Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
           N+ RDPRWGR  ET GEDP++  R  + +V+GLQ               LK  A  KH+ 
Sbjct: 157 NMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ---------GDHPTYLKTVATIKHFV 207

Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
           A    N E N+RF   S++  + + E +   +E CV E +  SVM +YN  NG+P     
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263

Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
            LL+  +R +W F G++VSDC +I  +   H+ +N   E+A A  + +G DL+CG  Y  
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL-EEAAALGVNSGCDLECGTTYKE 322

Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
             + AV+QG I+EA ID +L  +     +LG FD      Y +  K  +   +  ELA E
Sbjct: 323 KLVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
           AA + +VLLKND   LPLN   IK++A+VGP A+     +G Y G P    S + G    
Sbjct: 383 AAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKG---- 435

Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAA---KNADATVIVAGLDLSVEAEGKDRVDLLLP 490
              +    G    V   N M  +A   A   K AD  ++  G D  +  E  D   + LP
Sbjct: 436 ---VKELIGKKGKVTYLNGMGTSADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLP 492

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
             Q +L+ K+       + LV  +   +   +A  +  I +I+   YPG+E GRA+A+++
Sbjct: 493 EEQEKLLKKIYQ-VNPRIVLVFHTGNPLTSEWADTH--IPAIMQAWYPGQEAGRALANLL 549

Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
           FG  NP G+LP+T Y+          +P +   + + GRTY++  G  +Y FG+GLSYT 
Sbjct: 550 FGNENPSGKLPMTIYKTE------EQLPDILDFDMWKGRTYRYMKGEPLYGFGHGLSYTS 603

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
           F++       +  ++ D   QC                                +E+ N 
Sbjct: 604 FEFDNIQGNDT--LQPDAILQC-------------------------------SVELSNS 630

Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           G++ G EVV VY S+      T+ +K+++ +++V +A+G+  KV FT+ A + L + ++ 
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689

Query: 728 ANSLLASGAHTILVGEGVGGVS 749
              +L SG +T+ +G G  G++
Sbjct: 690 KWRML-SGKYTLFIGSGQPGLA 710


>gi|308208211|gb|ADO20356.1| putative beta-D-xylosidase/alpha-L-arabinosidase [uncultured rumen
           bacterium]
          Length = 780

 Score =  350 bits (897), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 252/809 (31%), Positives = 368/809 (45%), Gaps = 149/809 (18%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           ++ + LS  PY D  LP  ERAKDLV R+TL EK       +  V  LG+  Y WWSEAL
Sbjct: 36  AVTLSLSAQPYKDRSLPPEERAKDLVSRLTLEEKASLSMHPSAPVEALGIKAYNWWSEAL 95

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HGV+  G                 AT FP  I   ASF+E L  ++   VS EAR  Y +
Sbjct: 96  HGVARNG----------------AATVFPQPIGMAASFDEPLLYEVFTAVSDEARVKYKI 139

Query: 124 GN--------AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
                      G+TFW+PNIN+ RDPRWGR +ET GEDPY+ G+  +  VRGLQ      
Sbjct: 140 AKESGHIGQYQGVTFWTPNINIFRDPRWGRGMETYGEDPYLTGQMGMAVVRGLQG----- 194

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
               SDS  LK  AC KHYA +    W   +R  +D+ V+E+D++ET++  F+  V + +
Sbjct: 195 ---PSDSPVLKAHACAKHYAVHSGPEW---NRHSYDAEVSERDLRETYLPAFKDLVTKAN 248

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI-VESHKFLNDTKE 294
           V  VM +YNR  G P  A   L+N  +RG+W + G I SDC +++   V+     +    
Sbjct: 249 VQEVMTAYNRFRGEPCGASDYLINTILRGEWGYKGLITSDCWAVEDFYVQGRHGYSPDVA 308

Query: 295 DAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQY 354
            A A  + AG+D +CG  Y +    AV++G + E D+D +L  L+    +LG  D    +
Sbjct: 309 SAAAAAVHAGVDTECGQAYRHIPE-AVERGLLDEKDLDRNLIRLFTARYQLGEMDDISLW 367

Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
            +L  + +  P+H+ L+ + A++ +VLL+N  G LPL   +++ +ALVGP+ +  +   G
Sbjct: 368 DDLPASILEGPEHLALSRKMAQESMVLLQNKGGILPL-APDVR-VALVGPNGDDREMQWG 425

Query: 415 NYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQ-------NNSM-------------- 453
           NY   P R  +  D        I Y  GC  +  +       NN +              
Sbjct: 426 NYNPVPGRTVTLYDALKERFPGIKYVRGCGIVGAEFAPKPDPNNPLSQALGKSREEMEAI 485

Query: 454 ------------------------------IPAAIDAAKNADATVIVAGLDLSVEAE--- 480
                                         + + +   +  D  +   G+    E E   
Sbjct: 486 ARQYAIGVQDILNYVRRQERMQASFLPELDVQSVLKELEGIDVVIFAGGISPRFEGEEMP 545

Query: 481 -------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
                  G DR D+ LP  Q +L+  + DA K  V LV  S  A  I          +IL
Sbjct: 546 VNLPGFKGGDRTDIQLPQVQRDLMKALHDAGK-KVILVNFSGCA--IGLVPETESCDAIL 602

Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
              YPGEEGG AI DV+FG  NP G+LP+T+Y +         +P     +  G TY++F
Sbjct: 603 QAWYPGEEGGLAITDVLFGDVNPSGKLPVTFYRS------VEDLPDFENYDMKGHTYRYF 656

Query: 594 DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
            G  ++PFGYGLSY+ F+YK                                        
Sbjct: 657 KGKPLFPFGYGLSYSTFRYK---------------------------------------- 676

Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
           + K    +  I V+N GK + +EVV VY +  G     +K +  + RV I AG++ KV  
Sbjct: 677 RAKVRNNSLIIPVKNTGKREATEVVQVYVRRKGDPDGPVKTLRAFRRVTIPAGKTVKVCI 736

Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVG 742
            +     L   + A + +   G + +L G
Sbjct: 737 PLEDETFLWWSEEAQDMVPLPGKYELLYG 765


>gi|336408348|ref|ZP_08588841.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
 gi|423248801|ref|ZP_17229817.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
           CL03T00C08]
 gi|423253750|ref|ZP_17234681.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
           CL03T12C07]
 gi|335937826|gb|EGM99722.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
 gi|392655379|gb|EIY49022.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
           CL03T12C07]
 gi|392657742|gb|EIY51373.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
           CL03T00C08]
          Length = 722

 Score =  350 bits (897), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 242/742 (32%), Positives = 372/742 (50%), Gaps = 92/742 (12%)

Query: 16  DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
           D   P   R + L+++MTL EKV Q+   +  +PRL LP Y +W+E LHGV+  G     
Sbjct: 53  DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108

Query: 76  PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
                        T FP  I   ++++  L K++   +STEAR  Y     GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156

Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
           N+ RDPRWGR  ET GEDP++  R  + +V+GLQ               LK  A  KH+ 
Sbjct: 157 NMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ---------GDHPTYLKTVATIKHFV 207

Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
           A    N E N+RF   S++  + + E +   +E CV E +  SVM +YN  NG+P     
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263

Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
            LL+  +R +W F G++VSDC +I  +   H+ +N   E+A A  + +G DL+CG  Y  
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL-EEAAALGVNSGCDLECGTTYKE 322

Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
             + AV+QG I+EA ID +L  +     +LG FD      Y +  K  +   +  ELA E
Sbjct: 323 KLVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
           AA + +VLLKND   LPLN   IK++A+VGP A+     +G Y G P    S + G    
Sbjct: 383 AAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKG---- 435

Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAA---KNADATVIVAGLDLSVEAEGKDRVDLLLP 490
              +    G    V   N M  +A   A   K AD  ++  G D  +  E  D   + LP
Sbjct: 436 ---VKELIGKKGKVTYLNGMGTSADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLP 492

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
             Q +L+ ++       + LV  +   +   +A  +  I +I+   YPG+E GRA+A+++
Sbjct: 493 EEQEKLLKEIYQ-VNPRIVLVFHTGNPLTSEWADTH--IPAIMQAWYPGQEAGRALANLL 549

Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
           FG  NP G+LP+T Y+          +P +   + + GRTY++  G  +Y FG+GLSYT 
Sbjct: 550 FGNENPSGKLPMTIYKTE------EQLPDILDFDMWKGRTYRYMKGEPLYGFGHGLSYTS 603

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
           F++             D  Q         G +     A+L    +C        +E+ N 
Sbjct: 604 FEF-------------DNIQ---------GNDTLQSDAIL----QC-------SVELSNS 630

Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           G++ G EVV VY S+      T+ +K+++ +++V +A+G+  KV FT+ A + L + ++ 
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689

Query: 728 ANSLLASGAHTILVGEGVGGVS 749
              +L SG +T+ +G G  G++
Sbjct: 690 KWRML-SGKYTLFIGSGQPGLA 710


>gi|443692971|gb|ELT94448.1| hypothetical protein CAPTEDRAFT_221920 [Capitella teleta]
          Length = 757

 Score =  349 bits (896), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 257/761 (33%), Positives = 366/761 (48%), Gaps = 93/761 (12%)

Query: 7   VKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQM-----GDLAYGVPRLGLPLYEWWSE 61
           V+  DFP+ D  L + +RA DLV R+TL E   Q      G     + RLG+  Y W +E
Sbjct: 15  VQSYDFPFQDPSLSWDDRADDLVARLTLEEIAPQTQASYGGQHTPAIERLGIKPYVWITE 74

Query: 62  ALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY 121
            L G      + N+            AT++P  I   ASF+E L   + + +S E RA +
Sbjct: 75  CLAG------QVNT-----------NATAYPQPIGMAASFSEELLFNVSRDISYEVRAHW 117

Query: 122 NLGNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEG 173
           N   A        GL+ +SP IN++R P WGR  ET GEDP + G  A ++VRGLQ    
Sbjct: 118 NANRAVGKYSTKVGLSCFSPVINIMRHPLWGRNQETYGEDPLLSGTLAQSFVRGLQG--- 174

Query: 174 VEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNE 233
                  D R L+ +A CKH+  +         RF FD++V  +D + TF+  F+MCV+ 
Sbjct: 175 ------DDPRYLRANAGCKHFDVHGGPEDIPVSRFSFDAKVNMRDWRMTFLPQFKMCVDA 228

Query: 234 GDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTK 293
           G  S +MCSYNR+NGIP CA+ +LL    R +W FHGYIVSD  +I  I E H + N T 
Sbjct: 229 GSYS-LMCSYNRINGIPACANKQLLTDITRDEWGFHGYIVSDSGAISNIKEQHHYTNSTV 287

Query: 294 EDAVARVLKAGLDLDCG---DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
              VA  +KAG +L+ G   + Y    + A++QG + E +I  ++R L    +RLG FD 
Sbjct: 288 ATVVA-AIKAGTNLELGGGSNMYYPKQLDAMKQGLLTEKEIRDNVRPLLYTRLRLGEFDP 346

Query: 351 SPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
                Y  +G + I +P+H E A +AA  G VLLKN N  LP+     K LA+VGP  NA
Sbjct: 347 EAMVDYNKIGVDVIQSPEHREQAVKAAYMGFVLLKNHNNLLPIKKQYSK-LAIVGPFTNA 405

Query: 409 TKAMIGNYEG-TPCRYTSPM-DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADA 466
           T  + G Y      ++TS + +G          A GC +  C +  +      A   AD 
Sbjct: 406 TSELFGTYSSEVNLKFTSTIFEGLSPLGGSTRSANGCTNSAC-SGYVRDDVETAVAGADL 464

Query: 467 TVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG-PVTLVIMSAGAVDINFAKN 525
            ++  G     E+EG DR  L L G Q +++      + G PV LV+++AG +DI +AK 
Sbjct: 465 VIVALGSGQRFESEGNDRAYLDLHGHQLDILKDAVFFSNGAPVILVLINAGPLDITWAKL 524

Query: 526 NPKIKSILWVGYPGEEGGRAIADVIF---GKYNPGGRLPITWYEANYVKIPYTSMPLRPV 582
           +P + +IL  GYP +  G A+   +     +  P GRL  TW   N  ++P  +      
Sbjct: 525 DPGVTAILSCGYPAQSTGEALRRSLTMSEPQAAPAGRLQATW-PLNLDQVPKITD----- 578

Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
               GRTY+++ G  +YPFG+GLSYT F Y   S   SV               T G N 
Sbjct: 579 YTMQGRTYRYYVGEPLYPFGFGLSYTSFSYTRLSISPSV--------------ITQGDN- 623

Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERV 701
                             T ++ ++N G  D  EVV VY   P       K  +  + R 
Sbjct: 624 -----------------VTVEVCLKNTGSYDSDEVVQVYMSWPQTPFPLPKWTLAAFARP 666

Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           FI+AGQ+  V   + A +    + + A      G  T+  G
Sbjct: 667 FISAGQTICVKSVIRADQMAVWLSDDAGFGFVPGVMTVYAG 707


>gi|340368019|ref|XP_003382550.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 742

 Score =  349 bits (896), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 246/743 (33%), Positives = 371/743 (49%), Gaps = 111/743 (14%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGD-------LAYGVPRLGLPLYEWWSEALH 64
           FP+ +  L   +R KD+V+ +TL E V+QM          A G+PRL +  Y+W +E L 
Sbjct: 24  FPFQNTSLSIEDRVKDIVDNLTLEELVEQMAHGGATLNGPAPGIPRLHINPYQWGTECLS 83

Query: 65  GVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
           G    G                 ATSFP  I   ASFN  L K++    + E RA +   
Sbjct: 84  GNVSAG----------------DATSFPMPIGMAASFNYDLLKRVTNATAYEVRAKHAAA 127

Query: 125 --------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEY 176
                   + GL+ WSP +N++RDPRWGR  ET GEDPY+ G     YV GLQ       
Sbjct: 128 VKDGSYAFHTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAYVNGLQG------ 181

Query: 177 HRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDV 236
              ++SR +  +A CKH+  +         RF FD++V+ +D + TF+  F+ CV  G +
Sbjct: 182 ---NNSRYIIANAGCKHFDVHGGPENIPTSRFSFDAKVSMRDWRMTFLPQFKACVEAGAL 238

Query: 237 SSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDA 296
           S +MCSYNR+NG+P CA+  LL   +R +W+F GY+VSD  +++ IV  H +  D  + A
Sbjct: 239 S-LMCSYNRINGVPACANKALLTDILRNEWDFKGYVVSDQGALEFIVIEHHYAPDFMK-A 296

Query: 297 VARVLKAGLDLDCGDYYTNF------TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD- 349
            A    AG  L+ G+    F       + AV+   ++   +  ++  L+ V M+LG FD 
Sbjct: 297 AADAANAGTCLEDGNIGRKFFNVFEHLVDAVKNNLVSVDTLKNAVSRLFYVRMKLGEFDP 356

Query: 350 -GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGA----LPLNTGNIKTLALVGP 404
             +  Y N+  + I +  HI L+ +AA + IVL+KND+G     LP+ T  +K   +VGP
Sbjct: 357 PDNNPYANIPLSVIQSDAHINLSLQAAMESIVLMKNDDGFRSPFLPI-TNEVKKACMVGP 415

Query: 405 HANATKAMIGNYEGTPCR--YTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAID 459
            ++  + + G+Y  T  R    + + G       +  +NYA GC D     N        
Sbjct: 416 FSDDPEVLFGDYSPTLMRDYVITSLAGLKNANIGTDTLNYAVGCEDGPACRNYDSAKVRS 475

Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAK-GPVTLVIMSAGAV 518
           A    +  ++ AGL   +E+EGKD  D+ LPG Q +L+     A+K   V L++ +A  +
Sbjct: 476 ACDGVELIIVTAGLSKHLESEGKDLSDINLPGHQLDLMQDAEAASKNASVILILFNASPL 535

Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP-YTSM 577
           DI +AK +P+I  IL   YPG+  G+AIA+V+ G+YNP GRLP TW  A+  ++P  T+ 
Sbjct: 536 DIRYAKTDPRIVGILEAYYPGQTAGKAIANVLTGEYNPSGRLPNTW-PASLDQVPGITNY 594

Query: 578 PLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
            ++       RTY++F    +YPFGYGLSYT F Y                      N  
Sbjct: 595 TMKE------RTYRYFTQEPLYPFGYGLSYTTFHYS---------------------NLN 627

Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-----SKPPGIAGTHI 692
           + +      A +I             + V N G MDG+EV  VY     S  P +     
Sbjct: 628 ISSTATASGAGMI----------AVSVLVTNTGSMDGTEVTQVYVWCNISYAPKL----- 672

Query: 693 KQVIGYERVFIAAGQSAKVGFTM 715
            Q++G  + FI+ G++ +V F++
Sbjct: 673 -QLVGVNKDFISKGKTLEVSFSI 694


>gi|53712125|ref|YP_098117.1| beta-xylosidase [Bacteroides fragilis YCH46]
 gi|52214990|dbj|BAD47583.1| beta-xylosidase [Bacteroides fragilis YCH46]
          Length = 722

 Score =  349 bits (896), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 239/742 (32%), Positives = 370/742 (49%), Gaps = 92/742 (12%)

Query: 16  DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
           D   P   R + L+++MTL EKV Q+   +  +PRL LP Y +W+E LHGV+  G     
Sbjct: 53  DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108

Query: 76  PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
                        T FP  I   ++++  L K++   +STEAR  Y     GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156

Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
           N+ RDPRWGR  ET GEDP++  R  + +V+GLQ               LK  A  KH+ 
Sbjct: 157 NMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ---------GDHPTYLKTVATIKHFV 207

Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
           A    N E N+RF   S++  + + E +   +E CV E +  SVM +YN  NG+P     
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263

Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
            LL+  +R +W F G++VSDC +I  +   H+ +N + E+A A  + +G DL+CG  Y  
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVN-SLEEAAALGVNSGCDLECGTTYKE 322

Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
             + AV+QG I+EA ID +L  +     +LG FD      Y +  K  +   +  ELA E
Sbjct: 323 KLVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
           AA + +VLLKND   LPLN   IK++A+VGP A+     +G Y G P    S + G    
Sbjct: 383 AAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKG---- 435

Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAA---KNADATVIVAGLDLSVEAEGKDRVDLLLP 490
              +    G    V   N M  +A   A   K AD  ++  G D  +  E  D   + LP
Sbjct: 436 ---VKELIGKKGKVTYLNGMGTSADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLP 492

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
             Q +L+ ++       + LV  +   +   +A  +  I +I+   YPG+E GRA+A+++
Sbjct: 493 EGQEKLLKEIYQ-VNPRIVLVFHTGNPLTSEWADTH--IPAIMQAWYPGQEAGRALANLL 549

Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
           FG  NP G+LP+T Y+          +P +   + + GRTY++  G  +Y FG+GLSYT 
Sbjct: 550 FGNENPSGKLPMTIYKTE------EQLPDILDFDMWKGRTYRYMKGEPLYGFGHGLSYTS 603

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
           F++       +  ++ D   QC                                +E+ N 
Sbjct: 604 FEFDNIQGNDT--LQPDAILQC-------------------------------SVELSNS 630

Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           G++ G EVV VY S+      T+ +K+++ +++V +A+G+  KV FT+ A + L + ++ 
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689

Query: 728 ANSLLASGAHTILVGEGVGGVS 749
              +L SG +T+ +G G  G++
Sbjct: 690 KWRML-SGKYTLFIGSGQPGLA 710


>gi|423258868|ref|ZP_17239791.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
           CL07T00C01]
 gi|423264161|ref|ZP_17243164.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
           CL07T12C05]
 gi|387776448|gb|EIK38548.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
           CL07T00C01]
 gi|392706427|gb|EIY99550.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
           CL07T12C05]
          Length = 722

 Score =  349 bits (896), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 239/742 (32%), Positives = 369/742 (49%), Gaps = 92/742 (12%)

Query: 16  DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
           D   P   R + L+++MTL EKV Q+   +  +PRL LP Y +W+E LHGV+  G     
Sbjct: 53  DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108

Query: 76  PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
                        T FP  I   ++++  L K++   +STEAR  Y     GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156

Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
           N+ RDPRWGR  ET GEDP++  R  + +V+GLQ               LK  A  KH+ 
Sbjct: 157 NMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ---------GDHPTYLKTVATIKHFV 207

Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
           A    N E N+RF   S++  + + E +   +E CV E +  SVM +YN  NG+P     
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263

Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
            LL+  +R +W F G++VSDC +I  +   H+ +N   E+A A  + +G DL+CG  Y  
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL-EEAAALGVNSGCDLECGTTYKE 322

Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
             + AV+QG I+EA ID +L  +     +LG FD      Y +  K  +   +  ELA E
Sbjct: 323 KLVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
           AA + +VLLKND   LPLN   IK++A+VGP A+     +G Y G P    S + G    
Sbjct: 383 AAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKG---- 435

Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAA---KNADATVIVAGLDLSVEAEGKDRVDLLLP 490
              +    G    V   N M  +A   A   K AD  ++  G D  +  E  D   + LP
Sbjct: 436 ---VKELIGKKGKVTYLNGMGTSADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLP 492

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
             Q +L+ ++       + LV  +   +   +A  +  I +I+   YPG+E GRA+A+++
Sbjct: 493 EEQEKLLKEIYQ-VNPRIVLVFHTGNPLTSEWADTH--IPAIMQAWYPGQEAGRALANLL 549

Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
           FG  NP G+LP+T Y+          +P +   + + GRTY++  G  +Y FG+GLSYT 
Sbjct: 550 FGNENPSGKLPMTIYKTE------EQLPDILDFDMWKGRTYRYMKGEPLYGFGHGLSYTS 603

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
           F++       +  ++ D   QC                                +E+ N 
Sbjct: 604 FEFDNIQGNDT--LQPDAILQC-------------------------------SVELSNS 630

Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           G++ G EVV VY S+      T+ +K+++ +++V +A+G+  KV FT+ A + L + ++ 
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689

Query: 728 ANSLLASGAHTILVGEGVGGVS 749
              +L SG +T+ +G G  G++
Sbjct: 690 KWRML-SGKYTLFIGSGQPGLA 710


>gi|60680313|ref|YP_210457.1| glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
 gi|60491747|emb|CAH06504.1| putative glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
          Length = 722

 Score =  349 bits (895), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 239/742 (32%), Positives = 368/742 (49%), Gaps = 92/742 (12%)

Query: 16  DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
           D   P   R + L+++MTL EKV Q+   +  +PRL LP Y +W+E LHGV+  G     
Sbjct: 53  DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108

Query: 76  PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
                        T FP  I   ++++  L K++   +STEAR  Y     GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156

Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
           N+ RDPRWGR  ET GEDP++  R  + +V+GLQ               LK  A  KH+ 
Sbjct: 157 NMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ---------GDHPTYLKTVATIKHFV 207

Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
           A    N E N+RF   S++  + + E +   +E CV E +  SVM +YN  NG+P     
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263

Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
            LL+  +R +W F G++VSDC +I  +   H+ +N   E+A A  + +G DL+CG  Y  
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL-EEAAALGVNSGCDLECGTTYKE 322

Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
             + AV+QG I+EA ID +L  +     +LG FD      Y +  K  +   +  ELA E
Sbjct: 323 KLVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
           AA + +VLLKND   LPLN   IK++A+VGP A+     +G Y G P    S + G    
Sbjct: 383 AAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKG---- 435

Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAA---KNADATVIVAGLDLSVEAEGKDRVDLLLP 490
              +    G    V   N M  +A   A   K AD  ++  G D  +  E  D   + LP
Sbjct: 436 ---VKELIGKKGKVTYLNGMGTSADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLP 492

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
             Q + + K+       + LV  +   +   +A  +  I +I+   YPG+E GRA+A+++
Sbjct: 493 EEQEKFLKKIYQ-VNPRIVLVFHTGNPLTSEWADTH--ILAIMQAWYPGQEAGRALANLL 549

Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
           FG  NP G+LP+T Y+          +P +   + + GRTY++  G  +Y FG+GLSYT 
Sbjct: 550 FGNENPSGKLPMTIYKTE------EQLPDILDFDMWKGRTYRYMKGEPLYGFGHGLSYTS 603

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
           F++       +  ++ D   QC                                +E+ N 
Sbjct: 604 FEFDNIQGNDT--LQPDAILQC-------------------------------SVELSNS 630

Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           G++ G EVV VY S+      T+ +K+++ +++V +A+G+  KV FT+ A + L + ++ 
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689

Query: 728 ANSLLASGAHTILVGEGVGGVS 749
              +L SG +T+ +G G  G++
Sbjct: 690 KWRML-SGKYTLFIGSGQPGLA 710


>gi|423281966|ref|ZP_17260851.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
           615]
 gi|404582453|gb|EKA87147.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
           615]
          Length = 722

 Score =  349 bits (895), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 239/742 (32%), Positives = 369/742 (49%), Gaps = 92/742 (12%)

Query: 16  DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
           D   P   R + L+++MTL EKV Q+   +  +PRL LP Y +W+E LHGV+  G     
Sbjct: 53  DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108

Query: 76  PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
                        T FP  I   ++++  L K++   +STEAR  Y     GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156

Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
           N+ RDPRWGR  ET GEDP++  R  + +V+GLQ               LK  A  KH+ 
Sbjct: 157 NMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ---------GDHPTYLKTVATIKHFV 207

Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
           A    N E N+RF   S++  + + E +   +E CV E +  SVM +YN  NG+P     
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263

Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
            LL+  +R +W F G++VSDC +I  +   H+ +N   E+A A  + +G DL+CG  Y  
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL-EEAAALGVNSGCDLECGTTYKE 322

Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
             + AV+QG I+EA ID +L  +     +LG FD      Y +  K  +   +  ELA E
Sbjct: 323 KLVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
           AA + +VLLKND   LPLN   IK++A+VGP A+     +G Y G P    S + G    
Sbjct: 383 AAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKG---- 435

Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAA---KNADATVIVAGLDLSVEAEGKDRVDLLLP 490
              +    G    V   N M  +A   A   K AD  ++  G D  +  E  D   + LP
Sbjct: 436 ---VKELIGKKGKVTYLNGMGTSADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLP 492

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
             Q +L+ ++       + LV  +   +   +A  +  I +I+   YPG+E GRA+A+++
Sbjct: 493 EEQEKLLKEIYQ-VNPRIALVFHTGNPLTSEWADTH--IPAIMQAWYPGQEAGRALANLL 549

Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
           FG  NP G+LP+T Y+          +P +   + + GRTY++  G  +Y FG+GLSYT 
Sbjct: 550 FGNENPSGKLPMTIYKTE------EQLPDILDFDMWKGRTYRYMKGEPLYGFGHGLSYTS 603

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
           F++       +  ++ D   QC                                +E+ N 
Sbjct: 604 FEFDNIQGNDT--LQPDAILQC-------------------------------SVELSNS 630

Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           G++ G EVV VY S+      T+ +K+++ +++V +A+G+  KV FT+ A + L + ++ 
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689

Query: 728 ANSLLASGAHTILVGEGVGGVS 749
              +L SG +T+ +G G  G++
Sbjct: 690 KWRML-SGKYTLFIGSGQPGLA 710


>gi|383117083|ref|ZP_09937830.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
 gi|251947612|gb|EES87894.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
          Length = 722

 Score =  349 bits (895), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 239/742 (32%), Positives = 368/742 (49%), Gaps = 92/742 (12%)

Query: 16  DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
           D   P   R + L+++MTL EKV Q+   +  +PRL LP Y +W+E LHGV+  G     
Sbjct: 53  DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108

Query: 76  PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
                        T FP  I   ++++  L K++   +STEAR  Y     GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156

Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
           N+ RDPRWGR  ET GEDP++  R  + +V+GLQ               LK  A  KH+ 
Sbjct: 157 NMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ---------GDHPTYLKTVATIKHFV 207

Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
           A    N E N+RF   S++  + + E +   +E CV E +  SVM +YN  NG+P     
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263

Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
            LL+  +R +W F G++VSDC +I  +   H+ +N   E+A A  + +G DL+CG  Y  
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL-EEAAALGVNSGCDLECGTTYKE 322

Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
             + AV+QG I+E  ID +L  +     +LG FD      Y +  K  +   +  ELA E
Sbjct: 323 KLVQAVEQGLISEVAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
           AA + +VLLKND   LPLN   IK++A+VGP A+     +G Y G P    S + G    
Sbjct: 383 AAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKG---- 435

Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAA---KNADATVIVAGLDLSVEAEGKDRVDLLLP 490
              +    G    V   N M  +A   A   K AD  ++  G D  +  E  D   + LP
Sbjct: 436 ---VKELIGKKGKVTYLNGMGTSADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLP 492

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
             Q +L+ K+       + LV  +   +   +A  +  I +I+   YPG+E GRA+A+++
Sbjct: 493 EEQEKLLKKIYQ-VNPRIVLVFHTGNPLTSEWADTH--IPAIMQAWYPGQEAGRALANLL 549

Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
           FG  NP G+LP+T Y+          +P +   + + GRTY++  G  +Y FG+GLSYT 
Sbjct: 550 FGNENPSGKLPMTIYKTE------EQLPDILDFDMWKGRTYRYMKGEPLYGFGHGLSYTS 603

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
           F++       +  ++ D   QC                                +E+ N 
Sbjct: 604 FEFDNIQGNDT--LQPDAILQC-------------------------------SVELSNS 630

Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           G++ G EVV VY S+      T+ +K+++ +++V +A+G+  KV FT+ A + L + ++ 
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689

Query: 728 ANSLLASGAHTILVGEGVGGVS 749
              +L SG +T+ +G G  G++
Sbjct: 690 KWRML-SGKYTLFIGSGQPGLA 710


>gi|423269271|ref|ZP_17248243.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
           CL05T00C42]
 gi|423273165|ref|ZP_17252112.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
           CL05T12C13]
 gi|392701693|gb|EIY94850.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
           CL05T00C42]
 gi|392708197|gb|EIZ01305.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
           CL05T12C13]
          Length = 722

 Score =  347 bits (891), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 238/742 (32%), Positives = 369/742 (49%), Gaps = 92/742 (12%)

Query: 16  DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
           D   P   R + L+++MTL EKV Q+   +  +PRL LP Y +W+E LHGV+  G     
Sbjct: 53  DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108

Query: 76  PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
                        T FP  I   ++++  L K++   +STEAR  Y     GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156

Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
           N+ RDPRWGR  ET GE+P++  R  + +V+GLQ               LK  A  KH+ 
Sbjct: 157 NMARDPRWGRNEETYGEEPHLTSRLGVAFVKGLQ---------GDHPTYLKTVATIKHFV 207

Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
           A    N E N+RF   S++  + + E +   +E CV E +  SVM +YN  NG+P     
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263

Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
            LL+  +R +W F G++VSDC +I  +   H+ +N   E+A A  + +G DL+CG  Y  
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL-EEAAALGVNSGCDLECGTTYKE 322

Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
             + AV+QG I+EA ID +L  +     +LG FD      Y +  K  +   +  ELA E
Sbjct: 323 KLVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
           AA + +VLLKND   LPLN   IK++A+VGP A+     +G Y G P    S + G    
Sbjct: 383 AAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKG---- 435

Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAA---KNADATVIVAGLDLSVEAEGKDRVDLLLP 490
              +    G    V   N M  +A   A   K AD  ++  G D  +  E  D   + LP
Sbjct: 436 ---VKELIGKKGKVTYLNGMGTSADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLP 492

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
             Q +L+ ++       + LV  +   +   +A  +  I +I+   YPG+E GRA+A+++
Sbjct: 493 EGQEKLLKEIYQ-VNPRIVLVFHTGNPLTSEWADTH--IPAIMQAWYPGQEAGRALANLL 549

Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
           FG  NP G+LP+T Y+          +P +   + + GRTY++  G  +Y FG+GLSYT 
Sbjct: 550 FGNENPSGKLPMTIYKTE------EQLPDILDFDMWKGRTYRYMKGEPLYGFGHGLSYTS 603

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
           F++       +  ++ D   QC                                +E+ N 
Sbjct: 604 FEFDNIQGNDT--LQPDAILQC-------------------------------SVELSNS 630

Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           G++ G EVV VY S+      T+ +K+++ +++V +A+G+  KV FT+ A + L + ++ 
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689

Query: 728 ANSLLASGAHTILVGEGVGGVS 749
              +L SG +T+ +G G  G++
Sbjct: 690 KWRML-SGKYTLFIGSGQPGLA 710


>gi|167537541|ref|XP_001750439.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163771117|gb|EDQ84789.1| predicted protein [Monosiga brevicollis MX1]
          Length = 834

 Score =  347 bits (890), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 245/769 (31%), Positives = 371/769 (48%), Gaps = 91/769 (11%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQM-GDLAYGVPRLGLPLYEWWSEALHGVSF 68
           S +P+CD KL   +R KDLV R++  +   Q+    +  +  +GLP Y W + A+HG+  
Sbjct: 105 SSYPFCDTKLSVDDRLKDLVSRVSTADAATQLRARESAQIDNIGLPAYYWGTNAIHGMQN 164

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG-NAG 127
                        D + P  TSFP     +A+FN SL K +G+ +  E RA YN   + G
Sbjct: 165 TACLA--------DGQCP--TSFPAPNGLSATFNYSLVKDMGRIIGRELRAYYNTKFHNG 214

Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
           L  WSP IN  RDPRWGR +E+PGE P+V G+Y   Y  GLQ+ +  +Y         + 
Sbjct: 215 LDTWSPTINPSRDPRWGRNVESPGESPFVCGQYGAAYTEGLQNGDDKDY--------TQA 266

Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
               KH+ AY +++++   R+ +++ V+E D+ +T+   +E  V       VMCSYN +N
Sbjct: 267 VVTLKHWVAYSVEDYDNVTRYEYNAIVSEYDLMDTYFPGWEYVVKNAKPLGVMCSYNSLN 326

Query: 248 GIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL 307
           G+PTC +P  L   +R DW F GYI SD DSI  I   H + ++    A    L  G D+
Sbjct: 327 GVPTCGNPA-LTAYLREDWGFEGYITSDSDSIHCIWADHHYESNAVL-ATRDGLLGGCDI 384

Query: 308 DCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG--SPQYKNLGKNNICNP 365
           D GD Y +    AV Q  +  + +D +L   Y +   LG FD   +  Y  +  + +   
Sbjct: 385 DSGDTYADNLEAAVNQSLVNRSAVDAALTNSYRMRFNLGLFDPNVTNAYDRISADEVGMS 444

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
              E +  AAR+ + LLKND   LP  TG  K +A++G  +N+ + ++GNY G  C    
Sbjct: 445 SSQETSLLAARKSMTLLKNDGQTLPFATG--KKVAVIGKSSNSAEDILGNYVGPIC---- 498

Query: 426 PMDGF----YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
           P   F      Y  V     G A  +  + + I  AI  A +AD  V+    +     EG
Sbjct: 499 PSGAFDCVQTLYQGVAAANQGGATTLSDDVADINTAIQLAMDADQVVLTIS-NYGQAGEG 557

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
           KDR  + L   Q EL+  V    K P  +V+++ G + +++ K+  + ++IL    PG  
Sbjct: 558 KDRTYIGLDTDQQELVAAVLKVGK-PTAIVMLNGGLISLDWIKD--EAQAILVAFAPGVH 614

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPV-------------NNFPG 587
           GG+A+A+ IFG  NPGG+LP+T Y ++YV  + + +M ++ V             +  PG
Sbjct: 615 GGQAVAETIFGANNPGGKLPVTMYASDYVNDVDFLNMSMQAVAVLHLMNVNGERDDTGPG 674

Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
           R+YK++ G  +YPF YGLSYT F    + +P                             
Sbjct: 675 RSYKYYTGEPLYPFAYGLSYTTFNLSWSPAPP---------------------------- 706

Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP--------PGIAGTHIKQVIGYE 699
             +          T+   V N G + G EVV  + KP        P      IK++ G++
Sbjct: 707 --MTTFTSTLRSTTYTATVTNTGSVGGDEVVFAFYKPKSESLKTLPVGNPVPIKEIFGFQ 764

Query: 700 RVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
           RV +  GQS +V F +NA ++L  V    +  L SG   I +  G G V
Sbjct: 765 RVALGPGQSTQVTFELNA-ETLAQVTLDGHRELHSGEFEIELTRGHGEV 812


>gi|332377068|gb|AEE64772.1| Xyl3A [Ruminococcus albus 8]
          Length = 691

 Score =  347 bits (889), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 247/756 (32%), Positives = 370/756 (48%), Gaps = 118/756 (15%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D  L   ERA+ L + MT  E+  Q+   A  + RLG+P Y WW+E +HG++  G   
Sbjct: 4   YLDESLSAEERAEALTDEMTTEEQASQLRYDAPAIERLGIPAYNWWNEGIHGLARSGV-- 61

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   A F++ L K+  +  S EARA YN           
Sbjct: 62  --------------ATMFPQAIGLAAMFDDELTKRTAEITSEEARAKYNAYTVEGDRDIY 107

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT W+PNIN+ RDPRWGR  ET GEDPY+  +     VRGLQ           D + +
Sbjct: 108 KGLTLWAPNINIFRDPRWGRSHETFGEDPYLTAQNGKAVVRGLQ----------GDGKVM 157

Query: 186 KISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
           K +AC KH+A +      G +  R  FD++   +DM+ET++  FE  V E  V SVM +Y
Sbjct: 158 KAAACAKHFAVHS-----GPEALRHSFDAKADAKDMEETYLPAFEALVKEAKVESVMGAY 212

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NRVNG P CA   L+ +    +W F GY VSDC +I+   E H    +  E A A  LKA
Sbjct: 213 NRVNGEPACASDYLMEKL--KEWEFDGYFVSDCWAIRDFHEHHMVTANAVESA-AMALKA 269

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
           G D++CG  Y N  + A+ +G I +  I T+   L    +RLG FD    + ++  + + 
Sbjct: 270 GCDVNCGCTYQNL-LAALDKGLITKEQIRTACVHLMRTRIRLGMFDKHTDFDDIPYSKVA 328

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
             +H  ++ E A + +VLLKN NG LPL+    KT+A++GP+A++  A+ GNY G   RY
Sbjct: 329 CAEHKAVSLECAEKSLVLLKN-NGILPLDDKKYKTIAVIGPNADSRTALEGNYNGLSDRY 387

Query: 424 TSPMDGFY-AYSKVINYAPGCA------DIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
           T+ ++G    +   + +A GC         + Q       A+ AAKNAD  ++  GLD +
Sbjct: 388 TTFLNGIQDRFEGRVIFAEGCHLYKKSISGLAQAGDRYAEAVAAAKNADLVIMCVGLDAT 447

Query: 477 VEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
           +E E           D+  L LP  Q  L+ K+    K PV  V+ +  A++    ++ P
Sbjct: 448 IEGEEGDTGNEFSSGDKNGLTLPPPQKILVEKIMSVGK-PVVTVVCAGSAIN---TESQP 503

Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP-YTSMPLRPVNNFP 586
              +++   YPG EGG+A+A+V+FG  +P G+LP+T+YE +  K+P +T   ++      
Sbjct: 504 --DALIHAFYPGAEGGKALAEVLFGDVSPSGKLPVTFYE-DTDKLPEFTDYSMK------ 554

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++    +++PFGYGL+Y                                       
Sbjct: 555 GRTYRYTTDNILFPFGYGLTY--------------------------------------G 576

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
            V ++ V+ KD K    + VEN G+    +V+ +Y K           + G++RV +  G
Sbjct: 577 GVKVNAVEYKDGKAV--VSVENSGRAT-EDVIELYLKDYCEQAVPNVSLCGFKRVKLGEG 633

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           + A V   +   K+   VDN     +     T+L G
Sbjct: 634 EKATVEIAIPE-KAFTAVDNNGVRKVFGSKFTLLAG 668


>gi|348684872|gb|EGZ24687.1| family 3 glycoside hydrolase [Phytophthora sojae]
          Length = 805

 Score =  347 bits (889), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 246/773 (31%), Positives = 376/773 (48%), Gaps = 90/773 (11%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAY---GVPRLGLPLYEWWSEALHGVSF 68
           FP+CDA L   ER +DL+ R+ L EKV  +   A     +  +GLP Y W +  +HGV  
Sbjct: 34  FPFCDASLSTSERVEDLLRRLPLDEKVTLLTARASPKGNMSSIGLPEYNWGANCVHGV-- 91

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG---- 124
                 S  GT+       ATSFP  +   A F+      + Q +  E RA++  G    
Sbjct: 92  -----QSTCGTNC------ATSFPNPVNLGAIFDPQAVFDMAQVIGWELRALWLEGAREN 140

Query: 125 -----NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD 179
                + GL  WSPNIN+ RDPRWGR +ETP EDP V  +Y + Y RGLQ+         
Sbjct: 141 YAAGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTRGLQE--------G 192

Query: 180 SDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
            D R L+     KHYAAY  ++++G DR  F+++V+  D  +T++  F   V EG    V
Sbjct: 193 KDKRFLQAVVTLKHYAAYSYEHYDGIDRMAFNAQVSRYDFADTYLPAFHASVVEGKAKGV 252

Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
           MCSYN VNG+P CA+ +L  + +R    F GYI SD  +I+ I     +     E     
Sbjct: 253 MCSYNSVNGMPMCANEQLNTKLLREALGFDGYITSDSGAIEGIYRQRHYTKSLCEAGRLA 312

Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNL 357
           ++ +G D++ G  Y       V  G++ E  +D ++R    +   LG FD      Y ++
Sbjct: 313 IM-SGTDVNSGSVYKKCLADLVTSGQLPEKAVDDAMRRTLKLRFELGLFDPIDDQPYWHV 371

Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
             + +   +  +L+ E  R+ IVLL+N    LPL  G  K LA++GPHA A +A++GNY 
Sbjct: 372 APSEVGKTESKQLSLELTRKSIVLLQNHGNVLPLRKG--KKLAVIGPHAKAKRALLGNYL 429

Query: 418 GTPCR--------YTSPMDGFYAYSKVIN--YAPGCADIVCQNNSMIPAAIDAAKNADAT 467
           G  C           +P++   A +   N  YA G   I   + +   AA  AA+ ADA 
Sbjct: 430 GQMCHGDYLEVGCVQTPLEAITAANGASNTVYAKGSG-INDTSTADFDAAEAAARGADAV 488

Query: 468 VIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
           V+  G+D S+E E  DR ++ +P  Q +L+ +V  A K P  +V+ + G V     +   
Sbjct: 489 VLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFNGGVVGAE--ELIL 545

Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG 587
               +    YPG  G +A++D++FG   P G+LP+T Y +NY+      M    +  +PG
Sbjct: 546 HTDGVAEAFYPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNYIN--SVDMKSMSMTKYPG 603

Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
           R+Y+++    V+PFG+GLSYT+F            + LD +               P   
Sbjct: 604 RSYRYYKEVPVFPFGWGLSYTKFT-----------LALDGEM--------------PDDP 638

Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-----PGIAGTHIKQVIGYERVF 702
           ++I     +D   T  + V N G + G EVV  + +P      G A    +Q+  Y RV 
Sbjct: 639 IVI----TRDLDQTVTVIVSNDGDLVGDEVVFAFFRPLNVNATGDAALLNEQLFDYRRVS 694

Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGG-VSFPLQL 754
           +   Q  K+ F +    +L +VD++ N     G + +++  GV   V+F + L
Sbjct: 695 LRPTQYRKLTFRIQQ-STLAMVDDSGNKASFPGFYEVIITNGVHERVTFAIHL 746


>gi|5690010|emb|CAB51937.1| Family 3 Glycoside Hydrolase [Ruminococcus flavefaciens]
          Length = 690

 Score =  344 bits (883), Expect = 8e-92,   Method: Compositional matrix adjust.
 Identities = 230/727 (31%), Positives = 354/727 (48%), Gaps = 112/727 (15%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D  L   ERA+D+ +R++  EK +Q    A    RLG   Y WWSE LHGV+  G   
Sbjct: 6   YLDEALSDLERAEDITDRLSTEEKAEQQKYDAPAEERLGKDAYNWWSEGLHGVARAGT-- 63

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   A F++    + G+T S EARA YN  +A       
Sbjct: 64  --------------ATMFPQTIGMAAMFDDEAVHRAGETTSREARAKYNEYSAHDDRDIY 109

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT WSPN+N+ RDPRWGR  ET GEDPY+     + Y +GLQ           D + L
Sbjct: 110 KGLTLWSPNVNIFRDPRWGRGQETYGEDPYLTSCLGVAYAKGLQ----------GDGKVL 159

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           + +AC KH+A +   +     R  FD++   +DM ET+I  FE  V +  V SVM +YNR
Sbjct: 160 RTAACAKHFAVH---SGPEATRHEFDAKANMKDMTETYIAAFEALVKDAKVESVMGAYNR 216

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           VNG P CA   ++N+    +W F G+ VSDC +I+    +H  +  T  ++ A  LK G 
Sbjct: 217 VNGEPACASDFVMNKL--EEWGFDGHFVSDCWAIRDFHTNHG-VTKTAPESAALALKKGC 273

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNP 365
           DL+CG+ Y +  + A  +G I E D+  S   L    +RLG FD S +Y  L  + +   
Sbjct: 274 DLNCGNTYLHL-LAAFNEGLINEEDLRRSCIKLMRTRVRLGMFDKSTEYDGLDYDIVACD 332

Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
           +H E +   + + +VLLKN NG LPL+    KT+ ++GP+A++  A+ GNY G    Y +
Sbjct: 333 EHKEFSLRCSERSMVLLKN-NGILPLDGSKYKTIGVIGPNADSVPALEGNYNGKADEYIT 391

Query: 426 PMDGFY-AYSKVINYAPG-------CADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSV 477
            + G   A+   + Y  G       C  +   ++ +  A I   +    +  +  LD ++
Sbjct: 392 FLSGIREAHDGRVLYTEGSHLYKDRCMGLALPDDRLSEAEI-ITRTLRCSGSLCWLDATI 450

Query: 478 EAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
           E E           D+ DL LP  Q +L+  V   AKG   +++ +AG+  IN   +   
Sbjct: 451 EGEEGDTGNEFSSGDKNDLRLPESQRKLVKTV--MAKGKPVIIVTAAGSA-INVEAD--- 504

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGR 588
             +++   YPG+ GGRA+A+++FGK +P G+LP+T+YE        + +P     +   R
Sbjct: 505 CDALIQAWYPGQLGGRALANILFGKVSPSGKLPVTFYE------DASKLPDFSDYSMKNR 558

Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
           TY++ +G +++PFGYGL+Y++                    +C ++++  G         
Sbjct: 559 TYRYSEGNILFPFGYGLTYSE-------------------TECSELSFENGVA------- 592

Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQS 708
                          ++V N G     +VV +Y K           + G++RV + AG+S
Sbjct: 593 --------------TVKVTNTGSRFTEDVVQIYIKGYSENAVPNHSLCGFKRVALDAGES 638

Query: 709 AKVGFTM 715
             V  T+
Sbjct: 639 RIVQITL 645


>gi|409385818|ref|ZP_11238358.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
 gi|399206850|emb|CCK19273.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
          Length = 695

 Score =  344 bits (883), Expect = 8e-92,   Method: Compositional matrix adjust.
 Identities = 226/720 (31%), Positives = 352/720 (48%), Gaps = 106/720 (14%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           E A  +V +MTL EK+ Q+   A  + RL +P Y +W+E LHGV+  G            
Sbjct: 10  EEAIKIVSQMTLAEKISQIDFDASAIERLNIPHYNYWNEGLHGVARAGV----------- 58

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
                AT FP  I   A+F+  L K I + +S E RA YN            GLTFWSPN
Sbjct: 59  -----ATVFPQAIGLAATFDTELVKHIAEVISIEGRAKYNAYTKHGDRDIYKGLTFWSPN 113

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           IN+ RDPRWGR  ET GEDP++  +  + +++GLQ           + + L+++AC KH+
Sbjct: 114 INLFRDPRWGRGQETYGEDPFLTAQIGVAFIKGLQ----------GEGKYLRLAACTKHF 163

Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
           A +   +    DR +FD+ V  +D+ E ++  F+  + E DV S M +YN +NG P C +
Sbjct: 164 AVH---SGPEADRHYFDAVVNPKDLNEFYLPQFKAAIEEADVESFMGAYNAINGQPACVN 220

Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
            +L+ +T+ G W F G++VSD  +++ + E+H +   T  + +A  +K G +L C    +
Sbjct: 221 EELIAKTLLGKWGFEGHVVSDYAALEDVHENHHY-TQTAAETMALAMKIGTNL-CAGKIS 278

Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEA 374
           +    AV +G + E +I  S+  LY   +RLG F     Y  +      + +H  L+ +A
Sbjct: 279 DALFEAVGKGLVTETEITASVVKLYTTHVRLGMFAEDNDYDTIPYEVNASAEHEMLSLKA 338

Query: 375 ARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF---Y 431
           A + +VLLKNDN  LPL+   IK++A++GP A    A+ GNY GT   Y + + G     
Sbjct: 339 AEKSMVLLKNDN-FLPLSQSEIKSVAVIGPTARNIGALEGNYAGTANHYETFVSGIQQAL 397

Query: 432 AYSKVINYAPGC---AD----IVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE---- 480
           +    + YA GC   AD     + + N     AI AA++AD  V+  GLD ++E E    
Sbjct: 398 SNQARVTYALGCHLYADHAESSLSRANERESEAIIAAEHADIAVLCVGLDPTIEGEQGDA 457

Query: 481 -----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
                  D+  L LPG Q  LI KV +  K  V LV+ S  A+ +   + +  +K+I+  
Sbjct: 458 GNVYGSGDKPSLSLPGQQKRLIEKVLETGK-TVILVLTSGSALSLEGLEKHTGVKAIIQA 516

Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
            YPG  GG A+A+++ GK +P G+LP+T+ +          +P     +   RTY+    
Sbjct: 517 WYPGAHGGTALANILLGKVSPSGKLPVTFCKDT------QGLPDFSDYSMAERTYQNTQL 570

Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
            V+YPFGYGL+Y   + K                                  + +DD+  
Sbjct: 571 EVLYPFGYGLTYGHAEIKT---------------------------------LQLDDL-- 595

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
                T  +  EN G  D  EV+ VY K          ++I ++R+ +   ++  V   +
Sbjct: 596 -----TLSVTAENKGDYDIEEVIQVYVKINSEFAPKNHKLIAFKRIALPKNETVTVKIEL 650


>gi|325679939|ref|ZP_08159508.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
           albus 8]
 gi|324108377|gb|EGC02624.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
           albus 8]
          Length = 691

 Score =  343 bits (881), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 246/756 (32%), Positives = 369/756 (48%), Gaps = 118/756 (15%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D  L   ERA+ L + MT  E+  Q+   A  + RLG+P Y WW+E +HG++  G   
Sbjct: 4   YLDESLSAEERAEALTDEMTTEEQASQLRYDAPAIERLGIPAYNWWNEGIHGLARSGV-- 61

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   A F++ L K+  +  S EARA YN           
Sbjct: 62  --------------ATMFPQAIGLAAMFDDELTKRTAEITSEEARAKYNAYTVEGDRDIY 107

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT W+PNIN+ RDPRWGR  ET GEDPY+  +     VRGLQ           D + +
Sbjct: 108 KGLTLWAPNINIFRDPRWGRGHETFGEDPYLTAQNGKAVVRGLQ----------GDGKVM 157

Query: 186 KISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
           K +AC KH+A +      G +  R  FD++   +DM+ET++  FE  V E  V SVM +Y
Sbjct: 158 KAAACAKHFAVHS-----GPEALRHSFDAKADAKDMEETYLPAFEALVKEAKVESVMGAY 212

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NRVNG P CA   L+ +    +W F GY VSDC +I+   E H    +  E A A  LKA
Sbjct: 213 NRVNGEPACASDYLMEKL--KEWEFDGYFVSDCWAIRDFHEHHMVTANAVESA-AMALKA 269

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
           G D++CG  Y N  + A+ +G I +  I T+   L    +RLG FD    + ++  + + 
Sbjct: 270 GCDVNCGCTYQNL-LAALDKGLITKEQIRTACVHLMRTRIRLGMFDKHTDFDDIPYSKVA 328

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
             +H  ++ E A + +VLLKN NG LPL+    KT+A++GP+A++  A+ GNY G   RY
Sbjct: 329 CAEHKAVSLECAEKSLVLLKN-NGILPLDDKKYKTIAVIGPNADSRTALEGNYNGLSDRY 387

Query: 424 TSPMDGFY-AYSKVINYAPGCA------DIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
           T+ ++G    +   + +A GC         + Q       A+ AAKNAD  ++  GLD +
Sbjct: 388 TTFLNGIQDRFEGRVIFAEGCHLYKKSISGLAQAGDRYAEAVAAAKNADLVIMCVGLDAT 447

Query: 477 VEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
           +E E           D+  L LP  Q  L+ K+    K PV  V+ +  A++    ++ P
Sbjct: 448 IEGEEGDTGNEFSSGDKNGLTLPPPQKILVEKIMSVGK-PVVTVVCAGSAIN---TESQP 503

Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP-YTSMPLRPVNNFP 586
              +++   YPG EG +A+A+V+FG  +P G+LP+T+YE +  K+P +T   ++      
Sbjct: 504 --DALIHAFYPGAEGSKALAEVLFGDVSPSGKLPVTFYE-DTDKLPEFTDYSMK------ 554

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++    +++PFGYGL+Y                                       
Sbjct: 555 GRTYRYTTDNILFPFGYGLTY--------------------------------------G 576

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
            V ++ V+ KD K    + VEN G+    +V+ +Y K           + G++RV +  G
Sbjct: 577 GVKVNAVEYKDGKAV--VSVENSGRAT-EDVIELYLKDYCEQAVPNVSLCGFKRVKLGEG 633

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           + A V   +   K+   VDN     +     T+L G
Sbjct: 634 EKATVEIAIPE-KAFTAVDNNGVRKVFGSKFTLLAG 668


>gi|423279990|ref|ZP_17258903.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
           610]
 gi|404584326|gb|EKA88991.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
           610]
          Length = 722

 Score =  343 bits (881), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 238/742 (32%), Positives = 361/742 (48%), Gaps = 92/742 (12%)

Query: 16  DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
           D   P   R K L+++MTL EK  Q+   +  +PRL LP Y +W+E LHGV+  G     
Sbjct: 53  DLSQPIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108

Query: 76  PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
                        T FP  I   ++++  L K++   +STEAR  Y     GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156

Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
           N+ RDPRWGR  ET GEDPY+  R  + +V+GLQ               LK  A  KH+ 
Sbjct: 157 NMARDPRWGRNEETYGEDPYLTSRLGVAFVKGLQ---------GDHPAYLKTVATIKHFV 207

Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
           A    N E N+RF   S++  + + E +   +E CV E  V SVM +YN  NG+P     
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEAGVQSVMTAYNAFNGVPPSGSR 263

Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
            LL + +R +W F G++VSDC +I  +   H+ +N   E+A A  + +G DL+CG  Y  
Sbjct: 264 WLLGEVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL-EEAAALGVNSGCDLECGTTYKE 322

Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
             + AV+QG I+EA ID +L  +     +LG FD      Y +  K  +   +  ELA E
Sbjct: 323 KLVQAVKQGLISEATIDQALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG---F 430
           AA + +VLLKN+N  LPL+    K++A+VGP A+     +G Y G P    + + G    
Sbjct: 383 AAVKSVVLLKNEN-LLPLSKEKTKSVAVVGPFADHN--YLGGYSGQPPYSVTLLKGVKDL 439

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
                 +NY  G   I    +S++     A K  D  ++  G D  +  E  D   + LP
Sbjct: 440 MGKRGKVNYLNG---IGASRDSIVA----AVKGVDVVLVALGSDEKMARENHDMTSIYLP 492

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
             Q +L+  +       + LV  S   +   +A  +  I +I+   YPG+E GRA+AD++
Sbjct: 493 EEQEKLLKAIYQ-VNPRIVLVFHSGNPLTSEWA--DVHIPAIMQAWYPGQEAGRALADLL 549

Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
           FG  NP G+LP+T Y A         +P +   + + GRTY++     +Y FG+GLSYT 
Sbjct: 550 FGNENPSGKLPMTIYRAE------DQLPDILDFDMWKGRTYRYMKEDPLYGFGHGLSYTS 603

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
           F +       S  +K     QC                                +E+ N 
Sbjct: 604 FGFDGIQG--SDTLKSGARLQC-------------------------------SVELSNT 630

Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           GK  G EVV VY S+      T+ +K+++ +++V +A G+  +V F +   + L + +N 
Sbjct: 631 GKWTGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLAPGEKKRVEFNI-PPRELSVWEN- 688

Query: 728 ANSLLASGAHTILVGEGVGGVS 749
            N  + +G +T+ +G G  G++
Sbjct: 689 GNWRMLTGKYTLFIGSGQPGLA 710


>gi|313145345|ref|ZP_07807538.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
 gi|313134112|gb|EFR51472.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
          Length = 722

 Score =  343 bits (880), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 238/742 (32%), Positives = 361/742 (48%), Gaps = 92/742 (12%)

Query: 16  DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
           D   P   R K L+++MTL EK  Q+   +  +PRL LP Y +W+E LHGV+  G     
Sbjct: 53  DLLQPIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108

Query: 76  PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
                        T FP  I   ++++  L K++   +STEAR  Y     GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156

Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
           N+ RDPRWGR  ET GEDPY+  R  + +V+GLQ               LK  A  KH+ 
Sbjct: 157 NMARDPRWGRNEETYGEDPYLTSRLGVAFVKGLQ---------GDHPAYLKTVATIKHFV 207

Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
           A    N E N+RF   S++  + + E +   +E CV E  V SVM +YN  NG+P     
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEAGVQSVMTAYNAFNGVPPSGSR 263

Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
            LL + +R +W F G++VSDC +I  +   H+ +N   E+A A  + +G DL+CG  Y  
Sbjct: 264 WLLGEVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL-EEAAALGVNSGCDLECGTTYKE 322

Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
             + AV+QG I+EA ID +L  +     +LG FD      Y +  K  +   +  ELA E
Sbjct: 323 KLVQAVKQGLISEATIDQALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG---F 430
           AA + +VLLKN+N  LPL+    K++A+VGP A+     +G Y G P    + + G    
Sbjct: 383 AAVKSVVLLKNEN-LLPLSKEKTKSVAVVGPFADHN--YLGGYSGQPPYSVTLLKGVKDL 439

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
                 +NY  G   I    +S++     A K  D  ++  G D  +  E  D   + LP
Sbjct: 440 MGKRGKVNYLNG---IGASRDSIVA----AVKGVDVVLVALGSDEKMARENHDMTSIYLP 492

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
             Q +L+  +       + LV  S   +   +A  +  I +I+   YPG+E GRA+AD++
Sbjct: 493 EEQEKLLKAIYQ-VNPRIVLVFHSGNPLTSEWA--DVHIPAIMQAWYPGQEAGRALADLL 549

Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
           FG  NP G+LP+T Y A         +P +   + + GRTY++     +Y FG+GLSYT 
Sbjct: 550 FGNENPSGKLPMTIYRAE------DQLPDILDFDMWKGRTYRYMKEDPLYGFGHGLSYTS 603

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
           F +       S  +K     QC                                +E+ N 
Sbjct: 604 FGFDGIQG--SDTLKSGTTLQC-------------------------------SVELSNT 630

Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           GK  G EVV VY S+      T+ +K+++ +++V +A G+  +V F +   + L + +N 
Sbjct: 631 GKWTGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLAPGEKKRVEFNI-PPRELSVWEN- 688

Query: 728 ANSLLASGAHTILVGEGVGGVS 749
            N  + +G +T+ +G G  G++
Sbjct: 689 GNWRMLTGKYTLFIGSGQPGLA 710


>gi|390956994|ref|YP_006420751.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
           18391]
 gi|390411912|gb|AFL87416.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
           18391]
          Length = 742

 Score =  343 bits (880), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 248/751 (33%), Positives = 368/751 (49%), Gaps = 96/751 (12%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D   P  +R  DL++R TL EK  Q+     GVPRLGLP++  W++ LHGV       
Sbjct: 38  YRDMSRPIEDRITDLIKRFTLQEKAMQLNHTNRGVPRLGLPMWGGWNQTLHGVW------ 91

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
                    S+ P  T FP      A+++  L   +   +S EARA+YN    G      
Sbjct: 92  ---------SKQP-TTLFPIPTAMGATWDPELVHTVADAMSDEARALYNAHAEGPRTPHG 141

Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
           L + SP IN+ RDPRWGR+ E   EDP + GR  + YVRGLQ           D + LK+
Sbjct: 142 LVYRSPVINISRDPRWGRIQEVFSEDPLLTGRMGVAYVRGLQ---------GDDLQHLKL 192

Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
           +A  KH+A  +++    + R H ++ V E+++ E ++  +   + E    SVM SYN +N
Sbjct: 193 AATVKHFAVNNVE----SGRQHLNADVDERNLFEFWLPHWRAAIMEAHAQSVMSSYNAIN 248

Query: 248 GIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI--------VESHKFLNDTKEDAVAR 299
           G+P   +  LL   +R  W F G++  D  ++  +         E  +  ++    A A 
Sbjct: 249 GMPDAVNHWLLTDVLRKKWGFDGFVTDDLGAVALLSGTRATNTSEPGQHFSEDPVVAAAA 308

Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNL 357
            ++AG D D  ++ TN  + AVQ+G + E D+D +LR +  V  RLG +D   + +Y  +
Sbjct: 309 AIRAGNDSDDVEFETNLPL-AVQRGLLTEKDVDGALRNVLRVGFRLGAYDPPQASKYSRI 367

Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
           G + + +  H +L+   A + + LL N    LPL    +K++A++GP A       GNY 
Sbjct: 368 GMDVVRSQAHRDLSQRVAEESMTLLLNRRQFLPLQRDQVKSVAVIGP-AGGEAYETGNYY 426

Query: 418 GTPCRYTSPMDGFYAY--SKV-INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
           GTP   TS  +G  A   S V + Y  G   +   ++  I  A + A+ +D  V+  G +
Sbjct: 427 GTPAVKTSVTEGLRALLGSGVKVEYEKGAGYVDLADDKEIERAANLARKSDVVVLCLGTN 486

Query: 475 LSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
           L VEAEG+DR DL LPG Q  L+  V  AA   V LV+M+AG + + +A ++  + +IL 
Sbjct: 487 LQVEAEGRDRRDLNLPGAQQRLLEAVY-AANPKVALVLMNAGPLGVTWAHDH--VPAILS 543

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
             YPGE GG AIA  +FG  NPGG LP T Y AN   +P    P    +   G TY++F 
Sbjct: 544 AWYPGELGGAAIARTLFGLNNPGGHLPYTVY-ANLDGVP----PQNEYDVSRGYTYQYFK 598

Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRD-INYTVGTNKPPCAAVLIDDV 653
           G  +YPFG+GLSYT F Y           KL   Q   D  N TV               
Sbjct: 599 GVPLYPFGHGLSYTHFDYS----------KLKVTQTSGDHANVTV--------------- 633

Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVG 712
                 FT      N G+  G+EV  +YS     +    ++ + G+ERV +  G+S  V 
Sbjct: 634 -----SFTLT----NTGQSAGAEVTQLYSHQVKSSEVQPLRTLRGFERVTLQPGESKAVA 684

Query: 713 FTMNACKSLKIVDNAANSL-LASGAHTILVG 742
            ++    +L   D A ++  +  GA   +VG
Sbjct: 685 ISI-PTSALGWYDTAVHNFRVEPGAFNFMVG 714


>gi|429738050|ref|ZP_19271875.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
           F0055]
 gi|429161155|gb|EKY03583.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
           F0055]
          Length = 722

 Score =  343 bits (879), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 234/746 (31%), Positives = 367/746 (49%), Gaps = 109/746 (14%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           ++AK ++ ++TL EK+ Q+   A G+ RLG+  Y W +EALHGV   GR           
Sbjct: 33  QKAKSIISQLTLDEKISQLTQDAKGIDRLGIKPYYWLNEALHGVGRDGR----------- 81

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN--------AGLTFWSPN 134
                AT FP  I   A+F+  +  +IG  ++TE RA + +          AGLTFW+PN
Sbjct: 82  -----ATVFPQPINLGATFDPKIVHQIGDAIATEGRAKFIVAQRQKNYSMYAGLTFWAPN 136

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           +N+ RDPRWGR +ET GEDP++ G     +V+G+Q           D   LK +AC KH+
Sbjct: 137 VNIFRDPRWGRGMETYGEDPFLTGTLGTAFVKGMQ---------GDDPFYLKAAACGKHF 187

Query: 195 AAYDLDNWEGNDRFHFDSRV--TEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           A +      G +R    + V  T++D+ ET++  F+M V +G V S+M +Y R+ G    
Sbjct: 188 AVHS-----GPERTRHTANVEPTKRDLYETYLPAFKMLVQKGKVESIMGAYQRLYGESCS 242

Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
               LL   +R DW F G++VSDC ++  + E HK +    E AVA  +KAGL+L+CG+ 
Sbjct: 243 GSKYLLTDILRKDWGFKGHVVSDCGAVTDMYEGHKLVKSEAE-AVAFAIKAGLNLECGNS 301

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF--DGSPQYKNLGKNNICNPQHIEL 370
                  A+QQ  I E D+D +L  L +  ++LG    D +  Y    ++ I +  + ++
Sbjct: 302 MRTMK-DAIQQKLITEKDLDKALLPLMMTRLKLGILQPDAACPYNEFPESVIGSEANRKI 360

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A +AA + +VLLKN NG LP+   +I+TL + GP A     ++GNY G   RY++ ++G 
Sbjct: 361 AEQAAEESMVLLKN-NGVLPI-AKDIRTLFVTGPGATDAYYLMGNYFGLSNRYSTYLEGI 418

Query: 431 ---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL---------DLSVE 478
               +    +NY  G    V +N + +  ++  ++ A+ ++++ G          D    
Sbjct: 419 VGKVSNGTSVNYKQGFMQ-VFKNLNDVNWSVSESRGAEVSILIMGNSGNTEGEEGDAIAS 477

Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
           AE  DRV+L LP  Q E + +V+      + +V+     +D+           + W  YP
Sbjct: 478 AERGDRVNLRLPDSQMEYLREVSKDRTNKLVVVLTGGSPIDVKEITELADAVVMAW--YP 535

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVV 598
           G+EGG A+A+++FG  N  GRLP+T+ E+         +P     +  GRTYK+    ++
Sbjct: 536 GQEGGVALANLLFGDANFSGRLPVTFPESA------DRLPAFDDYSMKGRTYKYMTDNIL 589

Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
           YPFGYGLSY++  Y  A+  K                                 +  K  
Sbjct: 590 YPFGYGLSYSKVTYSNAAVTK---------------------------------MPTKTT 616

Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNA 717
             T  ++V N G M   EVV VY   PG   T  I+ +IG++RV        K+   +  
Sbjct: 617 PMTVYVDVTNNGDMPVDEVVQVYLSTPGAGNTSPIESLIGFKRV--------KIYPHITV 668

Query: 718 CKSLKIVDNAANSLLASGAHTILVGE 743
            K  +I      ++ A G   +L GE
Sbjct: 669 TKDFQIPMELLETVQADGTSKLLKGE 694


>gi|390630430|ref|ZP_10258413.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
 gi|390484359|emb|CCF30761.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
          Length = 674

 Score =  343 bits (879), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 220/722 (30%), Positives = 361/722 (50%), Gaps = 108/722 (14%)

Query: 51  LGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIG 110
           + +P Y +W+EALHGV+  G                 AT FP  I   A+F++ L  +I 
Sbjct: 1   MNIPEYNYWNEALHGVARAGV----------------ATVFPQAIGLAATFDDHLINEIA 44

Query: 111 QTVSTEARAMYNLGNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAI 162
             + TE RA YN            GLTFWSPN+N+ RDPRWGR  ET GEDP++  ++ +
Sbjct: 45  DVIGTEGRAKYNEFTKHDDRDIYKGLTFWSPNVNIFRDPRWGRGHETYGEDPFLTSKFGV 104

Query: 163 NYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQET 222
            +++GLQ            ++ LK++A  KH+A +     EG  R  FD+ V+++D+ ET
Sbjct: 105 AFIKGLQ----------GQAKYLKLAATAKHFAVHS--GPEGL-RHGFDAVVSDKDLYET 151

Query: 223 FILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI 282
           ++  F+  V E DV S+M +YN V+G+P      LL   +   W+F G++VSD  + + +
Sbjct: 152 YLPAFKAAVEEADVESIMTAYNAVDGVPASVSEMLLKDILHDKWSFEGHVVSDYMAPEDV 211

Query: 283 VESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVL 342
            E+HK+  D  E  +   +KAGL+L  G +       A+ +G + E +I  ++  LY   
Sbjct: 212 HENHKYTKDAAE-TMGLAIKAGLNLVAG-HIEQSLHEALDRGLVTEEEITNAVISLYATR 269

Query: 343 MRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALV 402
           +RLG F    +Y  +         H  L+  AA +  VLLKND G LPL    ++ +A+V
Sbjct: 270 VRLGMFATDNEYDAIPYEANDTKAHNNLSEIAAEKSFVLLKND-GVLPLRKETMEAIAVV 328

Query: 403 GPHANATKAMIGNYEGTPCRYTSPMDGFYAY---SKVINYAPG-------CADIVCQNNS 452
           GP+A++  A++GNY GTP R  + ++G          ++Y+ G        A+ + + + 
Sbjct: 329 GPNAHSEIALLGNYFGTPSRSYTILEGIQERLGDDVRVHYSIGSGLFQDHAAEPLAKADE 388

Query: 453 MIPAAIDAAKNADATVIVAGLDLSVEAE---------GKDRVDLLLPGFQTELINKVADA 503
               A+ AA+++D  V V GLD ++E E           D+ +L LPG Q +L+ ++   
Sbjct: 389 RESEAVIAAEHSDVVVAVLGLDSTIEGEEGDAGNSQGAGDKPNLSLPGRQRQLLERLLAV 448

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K PV +++ S  ++ ++  +N+P +++I+ + YPG  GG A+ADV+FG  +P G+LP+T
Sbjct: 449 GK-PVVVLLASGSSLQLDGLENHPNLRAIMQIWYPGARGGLAVADVLFGAVSPSGKLPVT 507

Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
           +Y+         ++P     N  GRTY++     +YPFGYGL+Y+               
Sbjct: 508 FYK------NVDNLPAFEDYNMAGRTYRYMTDEALYPFGYGLTYS--------------- 546

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI--EVENMGKMDGSEVVMVY 681
                                  +V + D++ K Y+ T  +   ++N G  D  EVV VY
Sbjct: 547 -----------------------SVELSDLQVKSYEDTATVTATIQNTGNFDTDEVVQVY 583

Query: 682 SKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
            K  G        Q+ G++RV++  G    + F +   +  ++ D    + + S    I 
Sbjct: 584 VKDLGSEFAVPNAQLKGFKRVYLGKGAKQTITFDLR-PQDFEVFDAQGRNFIDSDRFEIS 642

Query: 741 VG 742
           VG
Sbjct: 643 VG 644


>gi|291240561|ref|XP_002740190.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 763

 Score =  340 bits (873), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 232/638 (36%), Positives = 326/638 (51%), Gaps = 75/638 (11%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVP--RLGLPLYEWWSEALHGVSFI 69
           +P+ +  L + ER  DLV R+TL E V QM   +   P  RLG+  Y W SE LHGV   
Sbjct: 26  YPFQNTSLSWEERVDDLVSRLTLDEMVLQMARTSPAPPIDRLGIKPYVWNSECLHGVV-- 83

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN------- 122
                 P G         AT+FP  I   ASF+  L   + + +  E RA +N       
Sbjct: 84  -----PPDGL--------ATAFPQSIGLAASFSPDLLSDVAKAIGLEVRAKHNDYVQRGV 130

Query: 123 -LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
              + GL+ +SP IN+ R P WGR  ET GEDP+++G     YVRGLQ            
Sbjct: 131 YQEHTGLSCFSPVINIARHPLWGRNQETYGEDPFLIGELGSAYVRGLQG---------DH 181

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
            R +  +A CKH+  +         RF FD++V E+D Q TF+  F  CV  G V SVMC
Sbjct: 182 PRYVLANAGCKHFDVHGGPEDIPVSRFSFDAKVFERDWQMTFLPAFHECVKAG-VYSVMC 240

Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
           SYNR+N +P CA+ +LL   +R +W F GY+VSD  +++ I+ SH +  D+  D VA  +
Sbjct: 241 SYNRINEVPACANTRLLTDILRKEWGFDGYVVSDEGAVEFIMTSHHY-TDSIVDTVASAV 299

Query: 302 KAGLDLD----CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---Y 354
            AG +LD     GD        AV  GKI E  +   ++ L+   MRLG FD  P+   Y
Sbjct: 300 NAGCNLDLAFPVGDGMYIKIGDAVTAGKIKEKTVVERVKPLFYTRMRLGEFD-PPELNPY 358

Query: 355 KNLGKNNICNPQHIELAAEAARQG-----IVLLKNDNGALPLNTGNIKTLALVGPHANAT 409
            NL  + + + +H ELA +AA Q       VLLK +   LPL+T  +  LA++GP A+  
Sbjct: 359 ANLNLSVVQSEEHRELAVKAALQSFVLLNFVLLKREGRVLPLDT-LVNKLAVIGPFADNP 417

Query: 410 KAMIGNYEGTPCR--YTSPMDGFYAYSKVINYAPGCADIVCQN--NSMIPAAIDAAKNAD 465
             + G+Y   P +    +P  G    ++     PGC    C    + M+ AA+     AD
Sbjct: 418 SYLFGDYSPNPDKEFVVTPCKGLSNAARDTRCTPGCLTAPCTTYFSEMVKAAV---TGAD 474

Query: 466 ATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG-PVTLVIMSAGAVDINFAK 524
             V+  G  + +EAE  DR DL LPG Q +L+  V   A G P+ L++ +AG +DI +A 
Sbjct: 475 LIVVCLGTGVKIEAEFVDRSDLSLPGKQFQLLQDVVKYANGKPIILLLFNAGPLDIVWAV 534

Query: 525 NNPKIKSILWVGYPGEEGGRAIADVIF-------GKYNPGGRLPITWYEANYVKIPYTSM 577
            NP I+ I+   +P +  G A+  +         G  NPGGRLPITW        P +  
Sbjct: 535 ENPAIQVIVACFFPSQATGDALYRMFMNTHGVDTGNGNPGGRLPITW--------PRSMN 586

Query: 578 PLRPVNNFP--GRTYKFFDGPVVYPFGYGLSYTQFKYK 613
            + P+ N+   GRTY++F+G  ++PFGYGLSY  F Y 
Sbjct: 587 QVPPMTNYTMEGRTYRYFNGDPLFPFGYGLSYGSFSYS 624


>gi|301090543|ref|XP_002895482.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
 gi|262098232|gb|EEY56284.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
          Length = 809

 Score =  338 bits (868), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 240/773 (31%), Positives = 373/773 (48%), Gaps = 86/773 (11%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAY---GVPRLGLPLYEWWSEALHGVSF 68
           F +C+A L   ER +DL+ R+ L EKV  +   A     +  +GLP Y W +  +HGV  
Sbjct: 35  FAFCNASLSTAERVEDLLRRLPLDEKVTLLTARASPKGNMSSIGLPEYNWGANCVHGV-- 92

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG---- 124
                 S  GT+       ATSFP  +   A F+      + Q V  E RA++  G    
Sbjct: 93  -----QSTCGTNC------ATSFPNPVNLGAIFDPRAVFDMAQVVGWELRALWLEGAREN 141

Query: 125 -----NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD 179
                + GL  WSPNIN+ RDPRWGR +ETP EDP V  +Y + Y +GLQ+         
Sbjct: 142 YATGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTKGLQE--------G 193

Query: 180 SDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
            D R L+     KHYAAY  ++++G DR  F++ V+  D  +T++  FE  V  G    V
Sbjct: 194 KDKRFLQAVVTLKHYAAYSYEHYDGIDRMAFNAVVSRYDFADTYLPAFEASVVHGKAKGV 253

Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
           MCSYN VNG+P CA+ +L ++ +R    F GYI SD  +I  I     +     E     
Sbjct: 254 MCSYNSVNGMPMCANEQLNSKLLRDALGFDGYITSDSGAIAGIYHQRHYTKTLCEAGRLA 313

Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNL 357
           +L +G D++ G  Y       V  G++ E  +D ++R    +   LG FD      Y ++
Sbjct: 314 IL-SGTDVNSGSVYKQCLAELVTSGQLPEKAVDDAMRRTLKLRFELGLFDPIDDQPYWHV 372

Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
             N +   +  +L+ + +R+ IVLL+N    LPL  G  K LA++GPHA A +A++GNY 
Sbjct: 373 APNEVNTAESKQLSLDLSRKSIVLLQNHGNILPLAKG--KKLAVIGPHAAAKRALLGNYL 430

Query: 418 GTPCR--------YTSPMDGFYAYSKVIN--YAPGCADIVCQNNSMIPAAIDAAKNADAT 467
           G  C           +P++     +   N  YA G + I   + +    A  AA+ A+  
Sbjct: 431 GQMCHGDYLEVGCVQTPLEAITIANGASNTLYAKG-SGINDTSTAGFDEAEAAARKAETV 489

Query: 468 VIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
           V+  G+D S+E E  DR ++ +P  Q +L+ +V  A K P  +V+ + G V     +   
Sbjct: 490 VLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFNGGVVGAE--ELIL 546

Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG 587
               ++   YPG  G +A++D++FG   P G+LP+T Y +NYV      M    +  +PG
Sbjct: 547 HTDGVVEAFYPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNYVT--SVDMKSMSMTKYPG 604

Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
           R+Y+++    V+PFG+GLSYT+F   + SS    D                     P   
Sbjct: 605 RSYRYYKEVPVFPFGWGLSYTRFTMALDSSSGVTD---------------------PSEP 643

Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-----PGIAGTHIKQVIGYERVF 702
           +++     +    T  + + N G + G EVV  + +P      G A    +Q+  Y RV 
Sbjct: 644 IVV----TRQLDQTVTVILSNDGNLVGDEVVFAFFRPLKVNATGNAALLNEQLFDYRRVS 699

Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGG-VSFPLQL 754
           +   Q  K+ F +    +L +VD++ N     G + +++  GV   V+F + L
Sbjct: 700 LRPTQYRKLKFRIQQ-STLAMVDDSGNQASFPGFYEVIITNGVHERVTFAIHL 751


>gi|405968899|gb|EKC33925.1| Putative beta-D-xylosidase 5 [Crassostrea gigas]
          Length = 748

 Score =  338 bits (867), Expect = 6e-90,   Method: Compositional matrix adjust.
 Identities = 243/738 (32%), Positives = 364/738 (49%), Gaps = 115/738 (15%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQM--------GDLAYGVPRLGLPLYEWWSE 61
           S+FP+ +  L + ER  DLV R+TL + VQQ+        G  A  +  LG+  Y+W +E
Sbjct: 22  SNFPFQNVSLSWSERVDDLVGRLTLDQIVQQLARGGAGLNGGPAPAIENLGIGPYQWNTE 81

Query: 62  ALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY 121
            L G                D E   ATSFP  I   A+F++ L   + +  +TE RA +
Sbjct: 82  CLRG----------------DVEAGNATSFPQAIGLAAAFSKDLIFNVSKAAATEVRAKH 125

Query: 122 N--------LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEG 173
           N          + GL+ +SP +N++R P WGR  ET GEDPY+ G YA  +V+GLQ    
Sbjct: 126 NDFVKRGIFTDHTGLSCFSPVVNIMRHPLWGRNQETYGEDPYLSGTYASYFVQGLQG--- 182

Query: 174 VEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNE 233
                D D R ++ +A CKH+ A+         R  FD++V+ +D++ TF+  F+ CV  
Sbjct: 183 -----DHD-RYIQANAGCKHFDAHGGPEDIPESRMGFDAKVSMRDLRLTFLPAFQKCVQA 236

Query: 234 GDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTK 293
           G   S+MCSYN +NG+P C++  L+   +RG+WNF GY+VSD  +I+  +  H + N++ 
Sbjct: 237 G-AYSLMCSYNSINGVPACSNKLLMMDILRGEWNFTGYVVSDEGAIENQISFHHYYNNS- 294

Query: 294 EDAVARVLKAGLDLDCGDYYTN---FTMG-AVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
           EDA A  + AG +L+     T      +G AV+ GK+ E+ +   ++ L+   MRLG FD
Sbjct: 295 EDAAAGSVNAGCNLELSGNLTEPVFMKIGDAVKSGKLEESVVRNRVKPLFYTRMRLGEFD 354

Query: 350 GSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDN--------GALPLNTGNIKT 398
             P+   Y ++  + I + +H  L+  AA + +VLLK  +        G  P      + 
Sbjct: 355 -PPEMNPYSSVNLSVIQSEEHRNLSLTAAAKSLVLLKRPSKFSKRHLIGGFP-----SER 408

Query: 399 LALVGPHANATKAMIGNYEGT--PCRYTSPMDGFYAYSKVINYAPGCAD-IVCQNNSMIP 455
           +A++GP AN T  + G+Y  T  P    +P+ G    +  +NYA GC D   C N S   
Sbjct: 409 MAVIGPMANNTDQIFGDYSPTTDPRFVKTPLKGLTELNFSMNYAAGCVDGTRCLNYSQDD 468

Query: 456 AAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSA 515
               A   AD  V+  G    +E+E  DR D++LPG Q +L+  V       V L++ SA
Sbjct: 469 VKT-ALVGADLVVVCLGTGKDLESENVDRKDMMLPGKQLQLLQDVVSMTNKAVYLLVFSA 527

Query: 516 GAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIF---GKYNPGGRLPITWYEANYVKI 572
           G V+I +A+ + ++  IL   YP +  G AI   +    G++NP GRLP TWY       
Sbjct: 528 GPVNITWAQESERVLIILQCFYPAQSAGDAITQALIMRDGRFNPAGRLPYTWYR------ 581

Query: 573 PYT-SMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKY-KVASSPKSVDIKLDKDQQ 630
            YT  +P     +   +TY++F G  +YPFGYGLSY+ F + K+   PK           
Sbjct: 582 -YTEQIPEMTDYSMARKTYRYFTGVPLYPFGYGLSYSTFVFSKLYFLPK----------- 629

Query: 631 CRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGT 690
                  V    P                   Q+ V N G  DG EV+ VY K       
Sbjct: 630 -------VNAGDPNVV----------------QVRVFNEGPFDGDEVLQVYIKWMSTKER 666

Query: 691 HIK-QVIGYERVFIAAGQ 707
             + Q++ +ERVFI + Q
Sbjct: 667 MPRVQLVAFERVFIRSQQ 684


>gi|301118693|ref|XP_002907074.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
 gi|262105586|gb|EEY63638.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
          Length = 809

 Score =  338 bits (866), Expect = 9e-90,   Method: Compositional matrix adjust.
 Identities = 240/773 (31%), Positives = 372/773 (48%), Gaps = 86/773 (11%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAY---GVPRLGLPLYEWWSEALHGVSF 68
           F +C+A L   ER +DL+ R+ L EKV  +   A     +  +GLP Y W +  +HGV  
Sbjct: 35  FAFCNASLSTAERVEDLLRRLPLDEKVTLLTARASPKGNMSSIGLPEYNWGANCVHGV-- 92

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG---- 124
                 S  GT+       ATSFP  +   A F+      + Q V  E RA++  G    
Sbjct: 93  -----QSTCGTNC------ATSFPNPVNLGAIFDPRAVFDMAQVVGWELRALWLEGAREN 141

Query: 125 -----NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD 179
                + GL  WSPNIN+ RDPRWGR +ETP EDP V  +Y + Y +GLQ+         
Sbjct: 142 YATGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTKGLQE--------G 193

Query: 180 SDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
            D R L+     KHYAAY  ++++G DR  F++ V+  D  +T++  FE  V  G    V
Sbjct: 194 KDKRFLQAVVTLKHYAAYSYEHYDGIDRMAFNAVVSRYDFADTYLPAFEASVVHGKAKGV 253

Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
           MCSYN VNG+P CA+ +L ++ +R    F GYI SD  +I  I     +     E     
Sbjct: 254 MCSYNSVNGMPMCANEQLNSKLLRDALGFDGYITSDSGAIAGIYHQRHYTKTLCEAGRLA 313

Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNL 357
           +L +G D++ G  Y       V  G++ E  +D ++R    +   LG FD      Y ++
Sbjct: 314 IL-SGTDVNSGSVYKQCLAELVTSGQLPEKAVDDAMRRTLKLRFELGLFDPIDDQPYWHV 372

Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
             N +   +  +L+ + +R+ IVLL+N    LPL  G  K LA++GPHA A +A++GNY 
Sbjct: 373 APNEVNTAESKQLSLDLSRKSIVLLQNHGNILPLAKG--KKLAVIGPHAAAKRALLGNYL 430

Query: 418 GTPCR--------YTSPMDGFYAYSKVIN--YAPGCADIVCQNNSMIPAAIDAAKNADAT 467
           G  C           +P++     +   N  YA G + I   +      A  AA+ A+  
Sbjct: 431 GQMCHGDYLEVGCVQTPLEAITIANGASNTLYAKG-SGINDTSTGGFDEAEAAARKAETV 489

Query: 468 VIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
           V+  G+D S+E E  DR ++ +P  Q +L+ +V  A K P  +V+ + G V     +   
Sbjct: 490 VLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFNGGVVGAE--ELIL 546

Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG 587
               ++   YPG  G +A++D++FG   P G+LP+T Y +NYV      M    +  +PG
Sbjct: 547 HTDGVVEAFYPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNYVT--SVDMKSMSMTKYPG 604

Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
           R+Y+++    V+PFG+GLSYT+F   + SS    D                     P   
Sbjct: 605 RSYRYYKEVPVFPFGWGLSYTRFTMALDSSSGVTD---------------------PSEP 643

Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-----PGIAGTHIKQVIGYERVF 702
           +++     +    T  + + N G + G EVV  + +P      G A    +Q+  Y RV 
Sbjct: 644 IVV----TRQLDQTVTVILSNDGNLVGDEVVFAFFRPLKVNATGNAALLNEQLFDYRRVS 699

Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGG-VSFPLQL 754
           +   Q  K+ F +    +L +VD++ N     G + +++  GV   V+F + L
Sbjct: 700 LRPTQYRKLKFRIQQ-STLAMVDDSGNQASFPGFYEVIITNGVHERVTFAIHL 751


>gi|359473580|ref|XP_003631325.1| PREDICTED: protein BRASSINOSTEROID INSENSITIVE 1-like [Vitis
           vinifera]
          Length = 785

 Score =  337 bits (863), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 171/321 (53%), Positives = 224/321 (69%), Gaps = 14/321 (4%)

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEAD 330
           YIVSDC  ++ IV++  +LN++K DAVA+ L+AGLDL+CG YYT+    +V  GK+++ +
Sbjct: 10  YIVSDCYGLEVIVDNQNYLNESKVDAVAKTLQAGLDLECGHYYTDALNESVLTGKVSQYE 69

Query: 331 IDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALP 390
           +D +L+ +Y++LMR+GYFDG P Y++LG  +IC   HIELA EAARQGIVLLKND   LP
Sbjct: 70  LDRALKNIYVLLMRVGYFDGIPAYESLGLKDICAADHIELAREAARQGIVLLKNDYEVLP 129

Query: 391 LNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQN 450
           L  G  K L LVGPHANAT+ MIGNY G P +Y SP++ F A   V  YA GC D  C N
Sbjct: 130 LKPG--KKLVLVGPHANATEVMIGNYAGLPYKYVSPLEAFSAIGNV-TYATGCLDASCSN 186

Query: 451 NSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTL 510
           ++    A +AAK A+ T+I  G DLS+EAE  DRVD LLPG QTELI +VA+ + GPV L
Sbjct: 187 DTYFSEAKEAAKFAEVTIIFVGTDLSIEAEFVDRVDFLLPGNQTELIKQVAEVSSGPVIL 246

Query: 511 VIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGG------RLPITW 564
           V++S   +DI FAKNNP+I +ILWVG+PGE+GG AIADV+FGKYNP        +L  +W
Sbjct: 247 VVLSGSNIDITFAKNNPRISAILWVGFPGEQGGHAIADVVFGKYNPDTIPEWLWKLDFSW 306

Query: 565 YEAN----YVKIPYTSMPLRP 581
            + +    Y K+P  S+   P
Sbjct: 307 LDLSKNQLYGKLP-NSLSFSP 326


>gi|348684865|gb|EGZ24680.1| family 3 glycoside hydrolase [Phytophthora sojae]
          Length = 769

 Score =  336 bits (862), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 224/638 (35%), Positives = 330/638 (51%), Gaps = 60/638 (9%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPR-----LGLPLYEWWSEALHG 65
           + P+C+  L   +R +DL+ R+ L EK   +   A   PR     +GLP Y W +  +HG
Sbjct: 33  ELPFCNTSLSTADRVEDLLSRLPLQEKATLL--TARASPRGNMSSIGLPEYNWGANCVHG 90

Query: 66  V-SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
           V S  G  TN P            TSFP  +   A F+  +   + Q +  E RA++  G
Sbjct: 91  VQSTCG--TNCP------------TSFPNPVNLGAIFDPQVVFDMAQVIGWELRALWLEG 136

Query: 125 ---------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
                    + GL  WSPNIN+ RDPRWGR  ETP EDP V  +Y + Y RGLQ+     
Sbjct: 137 ATENYKGGPHLGLDCWSPNININRDPRWGRNTETPSEDPLVNSKYGVAYTRGLQE----- 191

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
             +  D R L+     KHYAAY  +N+ G +R  FD+ V+  D  +T+   F   V +G+
Sbjct: 192 -GKRQDPRFLQAVVTLKHYAAYSYENYGGVNRMEFDAIVSPYDFADTYFPAFRSSVVDGN 250

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKED 295
              VMCSYN VNGIP CA+ +L+   +RG   F GY+ SD  +++ I + H +  D++ +
Sbjct: 251 AKGVMCSYNSVNGIPMCANKELVETLLRGTLGFDGYVTSDSGAVEAISDMHHYA-DSQCE 309

Query: 296 AVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQ 353
           A    + AG D++ G  Y       V   ++ E  +D +LR    +   LG FD      
Sbjct: 310 AARLAILAGTDINSGKSYEACLKTLVDDNQLEEKALDDALRHTLKLRFELGLFDPIDDQP 369

Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
           Y N+  + +       L+  A R+ +V+L+N+   LPL  G    LA++GPHA + + ++
Sbjct: 370 YWNVTPSEVNTAAAKALSLNATRKSLVMLQNNASVLPLQKG--VKLAVLGPHAKSKRGLL 427

Query: 414 GNYEGTPCR--------YTSPMDGFYAYSKVIN--YAPGCADIVCQNNSMIPAAIDAAKN 463
           GNY G  C           +P+D   A +   N  +A GC  I   + +    A+ AAK 
Sbjct: 428 GNYLGQMCHGDYDEVGCVQTPLDAIRAANGASNTTFAEGCG-ISGNSTAGFEKAVAAAKE 486

Query: 464 ADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFA 523
           ADA V+  G+D S+E E  DR ++ LP  Q +L+ +V   A G  T+V++  G V I   
Sbjct: 487 ADAVVLFLGIDKSIEGEVGDRNNIDLPNIQMQLLQRV--HAVGRPTVVVLINGGV-IGAE 543

Query: 524 KNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPV 582
           +   +  +++   YPG  G RA+ADV+FG  NP G+LP+T Y ++YV ++   SM +   
Sbjct: 544 EIIERTDALVEAFYPGFFGARAMADVLFGDTNPSGKLPVTMYRSDYVDQVEMKSMDMTA- 602

Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS 620
              PGRTY++F G  V+PFG+GLSYT F   V S   S
Sbjct: 603 --HPGRTYRYFKGEPVFPFGWGLSYTTFSLSVDSGTNS 638


>gi|291240559|ref|XP_002740189.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 745

 Score =  336 bits (862), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 238/711 (33%), Positives = 350/711 (49%), Gaps = 102/711 (14%)

Query: 2   FESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQM---GDLAYG----VPRLGLP 54
           F  I   LSDFP+ +  LP+ +R +DLV R+ L E V QM   G  + G    + RL + 
Sbjct: 15  FSLISTILSDFPFRNTSLPWNKRVEDLVGRLKLEEIVLQMSRGGRYSNGPAPPIDRLNIG 74

Query: 55  LYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
            Y W +E L G                D     ATSFP      A+F+  L K+I    +
Sbjct: 75  PYSWNTECLRG----------------DLSAGPATSFPQAFGLAATFDAVLIKQIANATA 118

Query: 115 TEARAMYNL--------GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVR 166
            E RA YN          + GL+ +SP IN+ R P WGR+ ET GEDPY+ G  A ++V 
Sbjct: 119 YEVRAKYNNYTKHKEYGDHKGLSCFSPVINIARHPLWGRIQETYGEDPYLSGTLAASFVT 178

Query: 167 GLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILP 226
           GLQ          +  R +  +A CKH+ AY       + R  FD++V+++D++ TF+  
Sbjct: 179 GLQG---------NHPRYVTANAGCKHFDAYAGPENIPSSRSTFDAKVSDRDLRMTFLPA 229

Query: 227 FEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESH 286
           F  C+  G   S+MCSYN +NG+P CA+ KLL   +R +WNF GY++SD  +++ + ++H
Sbjct: 230 FHECIQAG-TYSLMCSYNSINGVPACANKKLLTDILRTEWNFTGYVISDQSAVEKVYDAH 288

Query: 287 KFLNDTKEDAVARVLKAGLDLDCGDYYTNFTM----GAVQQGKIAEADIDTSLRFLYIVL 342
            +  D  + A+A V  +GL+L+     T+  M     AV+QG +    +   +  L+   
Sbjct: 289 HYTKDMLDTAIACV-NSGLNLELSSNLTDNVMMQTTKAVKQGNVTMKTVKARVSPLFYTR 347

Query: 343 MRLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTL 399
           MRLG FD  P+   Y  L  + I + +H EL+ +AA +  VLLKN+N  LPL    I  L
Sbjct: 348 MRLGEFD-PPEMNPYSKLDLSIIQSQEHQELSLKAAAKSFVLLKNENRFLPLKE-KIDKL 405

Query: 400 ALVGPHANATKAMIGNYEGTPCRYT-SPMDGFYAYSKV-INYAPGCADIVCQ--NNSMIP 455
           A+VGP  +    + G+        T +P  G    +++   +A GC    C   +     
Sbjct: 406 AVVGPFGDNPIEIYGSKSPDVSNLTVTPRYGLSKIARLATTFASGCLSPACTEYDPKSTK 465

Query: 456 AAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELI-NKVADAAKGPVTLVIMS 514
            AID     D  V+  G    VE E  DR +L LPG Q  L+ + V  AA  PV L++ +
Sbjct: 466 QAID---RVDMVVVCLGTGNEVENEAHDRSELTLPGQQLRLLQDAVTFAADKPVILLLFN 522

Query: 515 AGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGK--YNPGGRLPITWYEANYVKI 572
           AG +DI +A +NP I  I+   +P +  G A+  +       NPGGRLPITW        
Sbjct: 523 AGPLDITWAVSNPAIPVIVECFFPAQTTGTALYHLFVNSPGSNPGGRLPITW-------- 574

Query: 573 PYTSMPLRPVNNFP--GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQ 630
           P +   + P+ ++   GRTY++F+G  ++PFGYGLSYT F Y                  
Sbjct: 575 PKSMSQVPPMEDYTMEGRTYRYFNGDPLFPFGYGLSYTTFHYS----------------- 617

Query: 631 CRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY 681
             D+  T  T   PC+++ ID            + +EN G + G EV   Y
Sbjct: 618 --DLLITPSTPIKPCSSINID------------VFLENTGDVTGDEVTQFY 654


>gi|405955586|gb|EKC22647.1| Putative beta-D-xylosidase 2 [Crassostrea gigas]
          Length = 745

 Score =  335 bits (860), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 233/770 (30%), Positives = 379/770 (49%), Gaps = 107/770 (13%)

Query: 7   VKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------VPRLGLPLYEW 58
           + + D+P+ +  LP+  R KDLV+R+T+ E V QM     G        VPRLG+  + W
Sbjct: 21  LHVQDYPFRNTSLPWDARVKDLVDRLTIEEIVVQMSRGGSGPRASPAPAVPRLGVGPFSW 80

Query: 59  WSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEAR 118
            +E L G  + G                 ATSFP  +   A+F+  +   +    S E R
Sbjct: 81  NTECLRGDVYAG----------------NATSFPQALGLAATFSTEVICDVASATSIEVR 124

Query: 119 AMYNL--------GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQD 170
           A +N          + G++ +SP IN++R P WGR  ET GEDP++ G  A  +V+ LQ 
Sbjct: 125 AKFNDYQRRKIYGDHKGISCFSPVINIMRHPLWGRNQETYGEDPFLSGELAAIFVKCLQG 184

Query: 171 VEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMC 230
                     D   ++ +A CKH+  +         RF FD++V+E+D + TF+  F+ C
Sbjct: 185 ---------DDPTYIRANAGCKHFDVHGGPENIPVSRFSFDAKVSERDWRLTFLPAFKRC 235

Query: 231 VNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLN 290
           V  G  S +MCS+NR+NG+P C + +LL   +R +W F GY+VSD ++I+ I+  H + N
Sbjct: 236 VQAGSYS-LMCSFNRINGVPACGNKRLLTDILRTEWGFTGYVVSDQEAIENIMTYHHYTN 294

Query: 291 DTKEDAVARVLKAGLDLDCGDYYTN----FTMGAVQQGKIAEADIDTSLRFLYIVLMRLG 346
           ++  D  A  +KAG +L+           + + A++ GK+ + D+  S+  L+   MRLG
Sbjct: 295 NSV-DTAALCVKAGCNLELSTNEVKPTYFYIIDALKAGKLDKEDLVKSVSPLFYTRMRLG 353

Query: 347 YFDGSPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGP 404
            FD      Y  +  + I + +H  ++  AA +  VLLKN  G LP+ T    T++++GP
Sbjct: 354 EFDPPDHNPYNFIDLSVIQSEEHRAISLNAAMKSFVLLKNKGGFLPI-TKLFDTISVLGP 412

Query: 405 HANATKAMIGNY--EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQ--NNSMIPAAIDA 460
            A+     IG+Y  +  P   T+P+ G    SK + YA GC D  C   N + I  A+++
Sbjct: 413 MADNKYQQIGSYAPDVMPSYTTTPLQGLSKLSKRVQYAAGCNDNACSKYNRTEIQRAVNS 472

Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELI-NKVADAAKG-PVTLVIMSAGAV 518
              +D   +  G    +E E  DR  + LPG Q +L+ + +  +AKG P+ L++ + G V
Sbjct: 473 ---SDIFFVCLGTGPMIENEDHDRASMELPGQQAQLLKDAIMFSAKGVPIVLLLFNGGPV 529

Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIF---GKYNPGGRLPITW--YEANYVKIP 573
           +I +A  + ++ +I+   +P +E G A+  V+       NP GRLP TW  Y+     + 
Sbjct: 530 NITWADRSDRVVAIMECFFPAQETGEAVLRVVTNTGNSSNPAGRLPYTWPKYQDQIPSMV 589

Query: 574 YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRD 633
             SM         GRTY++F G  +YPFGYGLSY+ F +  A                  
Sbjct: 590 NYSM--------EGRTYRYFHGDPLYPFGYGLSYSTFNFTNA------------------ 623

Query: 634 INYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-I 692
                           ++ +  +    T ++EV N G  DG EV+ VY K      T  I
Sbjct: 624 ---------------WMNPIISQGQDLTVRVEVCNEGPTDGDEVIQVYLKWLDTNETMPI 668

Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            Q++G+ERV + A ++     T+ A +++ + + +    +  G + + +G
Sbjct: 669 HQLVGFERVSLRAKETLSWLITVRA-ENMAVWNESRGFYIEPGRYRLYIG 717


>gi|323451996|gb|EGB07871.1| hypothetical protein AURANDRAFT_71699 [Aureococcus anophagefferens]
          Length = 1202

 Score =  335 bits (860), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 267/806 (33%), Positives = 373/806 (46%), Gaps = 126/806 (15%)

Query: 12   FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            +PYCD  LP   R  DL  R T+ E + QMG +A  VPRLGLP   +  EALHGV     
Sbjct: 341  YPYCDRALPIRARVADLAARFTVNETISQMGTMAAAVPRLGLPALNYGGEALHGVWSTCA 400

Query: 72   RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-------- 123
                P            T FP      ASF+  LW+ +G     EARA++          
Sbjct: 401  AGRCP------------TQFPAPHAMGASFDRDLWRAVGAASGLEARALFRWNQRHNASD 448

Query: 124  ------GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYH 177
                  G  GLTF++PN+N+ RDPRWGR+ E P EDP + G Y   +VRG Q   G   +
Sbjct: 449  CARSLEGCLGLTFYAPNVNLARDPRWGRIEEVPSEDPLLNGVYGAEFVRGFQ---GDGAY 505

Query: 178  RDSDSRPLKISACCKHYAAYDLD---------NWEG-------NDRFHFDSRVTEQDMQE 221
            R ++       A  KH+A Y+L+         +W G       NDR  FD+RV+ +D +E
Sbjct: 506  RVAN-------AVVKHFAVYNLEVDVEDTPPADWCGSAACAPPNDRHSFDARVSPRDFEE 558

Query: 222  TFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQT 281
            T++ PF         ++ MCSYN VNG P C D  LL   +RG  NF G + +DC +++ 
Sbjct: 559  TYVGPFVA-PVAAGAAAAMCSYNAVNGEPACTDGALLRGALRGALNFTGVLATDCGALED 617

Query: 282  IVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIV 341
             V  HK      E A A  + AG+D +CG   T+    A+  G +    +   L  L   
Sbjct: 618  AVARHKRYATEAEAAAA-AIAAGVDSNCGKVLTSALPEALAAGLVRPDALRPPLERLLEA 676

Query: 342  LMRLGY---FDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKT 398
             +RLG    +D          + + +P H  LA  AAR+G+VLL+N N  LPL+     T
Sbjct: 677  RLRLGLLDDWDADAPVPRPDVDAVDSPAHRALALRAAREGLVLLQNPNQILPLD--GRGT 734

Query: 399  LALVGPHANATKAMIGNYEGTPC--RYTSPMDGFYAY---SKVINYAPGCADIVCQNNSM 453
            LA++GP+ANA+  ++  Y GTP      SP+    A     KV+ YA GC +      + 
Sbjct: 735  LAVIGPNANASMNLLSGYHGTPPPDLLRSPLQELEARWRGGKVV-YAVGC-NASGAATAA 792

Query: 454  IPAAIDAAKNADATVIVAGL------------DLSV----EAEGKDRVDLLLPGFQTELI 497
            +  A+D AK AD  V+  GL            D +     EAE  DR  L LPG Q  L 
Sbjct: 793  LDEAVDLAKTADVVVLGLGLCGDNYGGGPPKEDATCFSIDEAESVDRTSLKLPGAQEALF 852

Query: 498  NKVADAAKG-PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNP 556
            +K+    K   V + ++SAGAVD +FAK+     ++L  GY GE GG A+AD + G YNP
Sbjct: 853  SKIWALGKPVAVAVFLVSAGAVDASFAKDK---AALLLAGYGGEFGGVAVADALLGAYNP 909

Query: 557  GGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYP---FGYGLSYTQFKYK 613
            GG L  T      +  P+  M +RP    PGRTY+F D   V P   FG+GLSYT F   
Sbjct: 910  GGALTATMLPDAGLP-PFRDMAMRPSAASPGRTYRFLDERRVAPLWRFGFGLSYTAFAVS 968

Query: 614  VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMD 673
            +A                       G  + P           +     F + V N+G + 
Sbjct: 969  LA-----------------------GPTRVP-----------RRAATRFSVVVRNVGAVS 994

Query: 674  GSEVVMVYSKPPGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACKSLKIVDNAANSLL 732
            G  VV  +    G     ++++  + RV  +A   S KV   +   +SL +VD A     
Sbjct: 995  GDVVVACFVAAVGRPDAPLRELFDFARVRDLAPAASTKVSMELRP-RSLSLVDEAGVRST 1053

Query: 733  ASGAHTILVGEGVGGVSFPLQLNLNH 758
             +GA+ +    G    +  ++L   H
Sbjct: 1054 TAGAYDVRCSAGRVADTEDIRLTTAH 1079


>gi|323344407|ref|ZP_08084632.1| beta-glucosidase [Prevotella oralis ATCC 33269]
 gi|323094534|gb|EFZ37110.1| beta-glucosidase [Prevotella oralis ATCC 33269]
          Length = 722

 Score =  335 bits (860), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 229/744 (30%), Positives = 365/744 (49%), Gaps = 102/744 (13%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           ++AK ++ ++TL EK+ Q+   A G+ RLG+  Y W +EALHGV   GR           
Sbjct: 33  QKAKSIISQLTLDEKISQLTQDAKGIDRLGIKPYYWLNEALHGVGRDGR----------- 81

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN--------AGLTFWSPN 134
                AT FP  I   A+F+  + ++IG  ++TE RA + +          AGLTFW+PN
Sbjct: 82  -----ATVFPQPISLGATFDPEIVQQIGDAIATEGRAKFIVAQRQKNYSMYAGLTFWAPN 136

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           +N+ RDPRWGR +ET GEDP++ G     +V+G+Q          +D   LK +AC KH+
Sbjct: 137 VNIFRDPRWGRGMETYGEDPFLTGVLGTAFVKGMQ---------GNDPFYLKAAACGKHF 187

Query: 195 AAYDLDNWEGNDRFHFDSRV--TEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           A +      G +R    + V  T+ D+ ET++  F+M V +G V S+M +Y R+ G    
Sbjct: 188 AVHS-----GPERTRHTANVEPTKHDLYETYLPAFKMLVQQGKVESIMGAYQRLYGESCS 242

Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
               LL   +R DW F G++VSDC ++  + E HK +    E AVA  +KAGL+L+CG+ 
Sbjct: 243 GSKYLLTDILRKDWGFKGHVVSDCGAVTDMYEGHKLVKSEAE-AVAFAIKAGLNLECGNS 301

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF--DGSPQYKNLGKNNICNPQHIEL 370
                  A++Q  I E D+D +L  L +  ++LG    D +  Y    ++ I +  +  +
Sbjct: 302 MRTMK-DALKQKLITEKDLDKALLPLMMTRLKLGILQPDVACPYNEFPESVIGSIDNRNI 360

Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
           A  AA + +VLLKND G LP+   +I+TL + GP A     ++GNY G   RY++ ++G 
Sbjct: 361 AQRAAEESMVLLKND-GVLPI-AKDIRTLFVTGPGATDAYYLMGNYFGLSDRYSTYLEGI 418

Query: 431 ---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL---------DLSVE 478
               +    +NY  G    V +N + +  ++  ++ A+ ++I+ G          D    
Sbjct: 419 VGKVSNGTSVNYKQGFMQ-VFKNLNDVNWSVSESRGAEVSIIIMGNSGNTEGEEGDAIAS 477

Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
           +E  DRVDL LP  Q + + +V+      + +V+     +D+           + W  YP
Sbjct: 478 SERGDRVDLRLPEPQMQYLREVSKDRTNKLVVVLTGGSPIDVKEITELADAVVMAW--YP 535

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVV 598
           G+EGG A+A+++FG  N  GRLP+T+ E          +P     +  GRTYK+    ++
Sbjct: 536 GQEGGVALANLLFGDANFSGRLPVTFPETT------DKLPSFDDYSMKGRTYKYMTDNIL 589

Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
           YPFGYGLSY +  Y  A+  K                                 +  K  
Sbjct: 590 YPFGYGLSYGKVAYGNATVTK---------------------------------LPTKHS 616

Query: 659 KFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNA 717
             T  +++ N G M   EVV VY S P     + I+ ++ ++RV IA   +    F +  
Sbjct: 617 SMTVSVDLSNDGNMPVDEVVQVYLSTPSAGVTSPIESLVAFKRVKIAPHATVTTDFEI-P 675

Query: 718 CKSLKIVDNAANSLLASGAHTILV 741
            + L+ V     S L  G + +++
Sbjct: 676 VERLETVQEDGTSKLLKGEYRVMI 699


>gi|427385932|ref|ZP_18882239.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
           12058]
 gi|425726971|gb|EKU89834.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
           12058]
          Length = 732

 Score =  334 bits (856), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 236/773 (30%), Positives = 376/773 (48%), Gaps = 102/773 (13%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLY-EWWSEALHGVSFIGRR 72
           + + ++    R  DL+ R+TL +K Q +      V   G  +  + W++ LHGV +    
Sbjct: 33  FLNQEMSMEARVADLMSRLTLEQKAQLLNHRGKTVVVDGFSIRADQWNQCLHGVKWTEPT 92

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL--------- 123
           TN                FPT I   A+++  L  ++   +S EARA+YN          
Sbjct: 93  TN----------------FPTSIALGATWDTELIHRVATVISDEARAIYNGWKQDPEFRG 136

Query: 124 GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
            + GL + SP IN+ R+P WGR+ E  GEDPY  GR  + YV+GLQ           DS 
Sbjct: 137 EHKGLIYRSPVINISRNPYWGRINEIFGEDPYHTGRMGVAYVKGLQG---------DDSH 187

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LK+++  KHYA  +++     DR    ++V E+ + E ++  F+ C+ EG   SVM SY
Sbjct: 188 YLKLASTLKHYAVNNVEV----DRMKLSAQVPERMLYEYWLPHFKDCIVEGKAQSVMASY 243

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N +NG+P   +  LL   ++  W   G++VSD   ++T+VE H     + E+AV R + A
Sbjct: 244 NAINGVPNNINKLLLTDILKNQWGHEGFVVSDLGGVKTMVEGHHQRQISCEEAVGRSIMA 303

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
           G D    + Y  +   A+++G + E  ++ +LR + +V  RLG FD   S  Y  +  + 
Sbjct: 304 GCDFSDAE-YEKYIPDALRKGYLTEERLNDALRRVLLVRFRLGEFDDFKSVPYSRISPDV 362

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           I   +H  L+ EAAR+ IVLLKN+   LP++   IK +A++GP+A+      GNY G P 
Sbjct: 363 IGCKEHRNLSLEAARKSIVLLKNEKKLLPIDRSIIKRVAVIGPYADLFNQ--GNYGGVPK 420

Query: 422 RYTSPMDGF---YAYSKVINYAPGC--ADIVCQNNSMIP----------AAIDAAKNADA 466
              +P+ G       +  + Y  G     +  +    IP           A++ A+N+D 
Sbjct: 421 DPVTPLQGIKNAVGNNVEVLYCKGAQITPVKVRKGQPIPPRFDKEAEMKKAVEMARNSDV 480

Query: 467 TVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
             +  G    +E EG+DR  L+LPG Q EL+  V +  K  V +V+MSAG V +   K N
Sbjct: 481 VFLFVGTTADIEVEGRDRKTLVLPGNQNELVKAVYEVNK-KVVVVLMSAGPVAVPEVKKN 539

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
             I ++L   +PG+EGG AIADV+FG YNPGG+LP T Y ++  ++P T       +   
Sbjct: 540 --IPAVLQAWWPGDEGGNAIADVLFGDYNPGGKLPYTMYASDE-QVPSTD----EYDISK 592

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           G TY +     ++ FG+GLSY++F Y         D+++                     
Sbjct: 593 GFTYMYLKKKPLFAFGHGLSYSKFHYS--------DLQIS------------------SP 626

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAA 705
            V ++D        +  ++V+NMGK  G EVV +Y +          K++ G++R+ +  
Sbjct: 627 VVSVNDT------VSVVLKVKNMGKRTGEEVVQLYVRDVKAKVVRPTKELRGFKRIALQP 680

Query: 706 GQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVGEGVGGVSFPLQLNLN 757
            +  ++   M   KSL   D +    L   G+  IL+G     +    +L +N
Sbjct: 681 NEEQEIRL-MLPVKSLAFYDESIGDFLVEPGSFEILLGSASDDIRLQSKLIVN 732


>gi|85813774|emb|CAJ65923.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
          Length = 704

 Score =  333 bits (854), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 179/483 (37%), Positives = 284/483 (58%), Gaps = 30/483 (6%)

Query: 275 DCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTS 334
           DCD++  +    K+   T EDAVA  LK+G+      Y  N+T  AV++ K+  ++ID +
Sbjct: 229 DCDAVNVLHVEQKYAK-TPEDAVADALKSGIS-----YLRNYTKSAVEKKKVTVSEIDRA 282

Query: 335 LRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPL 391
           L  L+   MRLG F+G P    Y ++G + +C+ +H  LA EAA  GIVLLKN +  LPL
Sbjct: 283 LHNLFSTRMRLGLFNGDPTKQLYSDIGPDQVCSQEHQALALEAALDGIVLLKNADRLLPL 342

Query: 392 NTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNN 451
           +   I +LA++GP+A+ +  ++GNY G  C+  + ++G   Y    +Y  GC ++ C  +
Sbjct: 343 SKSGISSLAVIGPNAHNSTNLLGNYFGPACKNVTILEGLRNYVSSASYEKGCNNVSC-TS 401

Query: 452 SMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLV 511
           +     ++ A+  D  ++V GLD S E E  DR+DL+LPG Q  LI  VA AAK P+ LV
Sbjct: 402 AAKKKPVEMAQTEDQVILVMGLDQSQEKERLDRMDLVLPGKQPTLITAVAKAAKRPIVLV 461

Query: 512 IMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNP---GGRLPITWYEAN 568
           ++    +D+ FAKNN KI SILW GYPG+ G  A+A +IFG++NP   GGRLP+TWY  +
Sbjct: 462 LLGGSPMDVTFAKNNRKIGSILWAGYPGQAGATALAQIIFGEHNPGNAGGRLPMTWYPQD 521

Query: 569 YVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVAS-SPKSVDIKL 625
           + K+P T M +R  P    PGRTY+F++G  V+ FGYGLSY+ + Y  AS +   +++K 
Sbjct: 522 FTKVPMTDMRMRPQPSTGNPGRTYRFYEGEKVFEFGYGLSYSDYSYTFASVAQNQLNVKD 581

Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYKFTFQIEVENMGKMDGSEVVMVYS 682
             +QQ          N       L+ D+   +C++ KF   + V+N G+M G   V++++
Sbjct: 582 SSNQQPE--------NSETPGYKLVSDIGEEQCENIKFKVTVSVKNEGQMAGKHPVLLFA 633

Query: 683 K--PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
           +   PG  G  IK+++G++ V + AG+  ++ + ++ C+ L   +     ++  G+  +L
Sbjct: 634 RHAKPG-KGRPIKKLVGFQTVKLGAGEKTEIEYELSPCEHLSSANEDGVMVMEEGSQILL 692

Query: 741 VGE 743
           VG+
Sbjct: 693 VGD 695



 Score =  209 bits (531), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 101/195 (51%), Positives = 128/195 (65%), Gaps = 10/195 (5%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           + +C   LP   RA+DLV R+T  EK  Q+ D +  +PRLG+P YEWWSE LHG+ F+ R
Sbjct: 42  YDFCKTTLPISRRAEDLVSRLTFEEKATQLVDTSPAIPRLGIPAYEWWSEGLHGIGFLTR 101

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN-AGLTF 130
                  + F+  +  ATSFP VILT ASF+  +W +IGQ V  EARA+YN G   GL F
Sbjct: 102 VQQGI--SFFNRTIQHATSFPQVILTAASFDAHIWYRIGQ-VGKEARALYNAGQVTGLGF 158

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ--DVEGVEYHRDSDSRPLKIS 188
           W+PN+N+ RDPRWGR  ETPGEDP VVG+Y  ++VRG+Q    EG     D     L+ S
Sbjct: 159 WAPNVNIFRDPRWGRGQETPGEDPLVVGKYGASFVRGVQGDSFEGESTLGDH----LQAS 214

Query: 189 ACCKHYAAYDLDNWE 203
           ACCKHY A+DLDNW+
Sbjct: 215 ACCKHYTAHDLDNWD 229


>gi|116181370|ref|XP_001220534.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
 gi|88185610|gb|EAQ93078.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
          Length = 549

 Score =  332 bits (851), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 207/520 (39%), Positives = 292/520 (56%), Gaps = 37/520 (7%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+D   CD K   PERA  LV+ + + EK+Q + D++ G  RLGLP Y WWSEALHGV+ 
Sbjct: 33  LADNTVCDPKATPPERAAALVKALNIEEKLQNLVDMSKGAERLGLPAYAWWSEALHGVAA 92

Query: 69  I-GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
             G R N   G  F S    ATSF   I  +A+F++ L  K+  T+STEARA  N G AG
Sbjct: 93  SPGVRFNRTAGGRFSS----ATSFANSITLSAAFDDELVYKVADTISTEARAFANAGLAG 148

Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
           L +W+PNIN  +DPRWGR  ETPGEDP  +  Y    + GL+           D    K+
Sbjct: 149 LDYWTPNINPYKDPRWGRGHETPGEDPVRIKGYVKALLAGLEG---------DDPSIRKV 199

Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
            A CKHYAAYDL+ W+G  R  FD+ V+ QD+ E ++ PF+ C  +  V S MCSYN +N
Sbjct: 200 VATCKHYAAYDLERWQGTTRHRFDAVVSLQDLSEYYLPPFQQCARDSKVGSFMCSYNALN 259

Query: 248 GIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLN----DTKEDAVARV 300
           G P CA   L++  +R  W +   + YI SDC++IQ  +   K+ N     T+ +A A  
Sbjct: 260 GTPACASTYLMDDILRKHWGWTEHNNYITSDCNAIQDFLPGPKWHNFSSTQTEAEAAAVA 319

Query: 301 LKAGLDLDC----GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD---GSPQ 353
            +AG D  C       YT+  +GA  Q  ++E  IDT+L+ LY  L+R+GYFD   GSP 
Sbjct: 320 YQAGTDTVCEVPGWPPYTD-VIGAYNQTLLSEEVIDTALKRLYEGLVRVGYFDPASGSP- 377

Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKA-- 411
           Y+++G  ++  P+  ELA ++   G+VLLKND G LPLN  + KT+AL+G  AN+T    
Sbjct: 378 YRSIGWEDVNTPEAQELALQSGTDGLVLLKND-GTLPLNLED-KTVALIGFWANSTNGGR 435

Query: 412 MIGNYEGTPCRYTSPMDGFYAYSKVINYAPG-CADIVCQN--NSMIPAAIDAAKNADATV 468
           ++G Y G P    SP+D     +   +YA G  A+ + Q   +  +  A++ AK ++  +
Sbjct: 436 ILGGYSGFPPYIHSPVDAAEKLNLTYHYASGPLAENITQAAIDDWVAKALEPAKKSNVIL 495

Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPV 508
              G D S+ AE  DR  +  P  Q  +I  ++   + P 
Sbjct: 496 YFGGTDTSIAAEDLDRDSIAWPEIQLAVIEALSALRQAPA 535


>gi|443717728|gb|ELU08656.1| hypothetical protein CAPTEDRAFT_228276 [Capitella teleta]
          Length = 731

 Score =  331 bits (849), Expect = 7e-88,   Method: Compositional matrix adjust.
 Identities = 243/765 (31%), Positives = 375/765 (49%), Gaps = 106/765 (13%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPE----KVQQMGDLAYGVPRLGLPLYEWWSEALHG 65
           + FP+ D  L + +R  DLV+R+T+ E     V Q G     V RLG+  Y++ +E + G
Sbjct: 18  AKFPFEDVTLSWDKRVDDLVQRLTIEEVVNISVAQYGKSTIPVDRLGVKPYQFINECITG 77

Query: 66  VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-- 123
           V    R  NS             T+FP  I   ASF+  L   + Q ++ E R  YN   
Sbjct: 78  V----RWENS-------------TAFPQAIGLGASFSPDLAFNMSQAIARELRGFYNTEV 120

Query: 124 -----GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHR 178
                G+ G+  ++P IN++R P WGR  ET GEDP++ G+ ++ +V+GLQ         
Sbjct: 121 KSQIYGHRGVNCFTPVINIMRHPLWGRNQETYGEDPWLSGQLSVGFVKGLQG-------- 172

Query: 179 DSDSRPLKISACCKHYAAYDLDNWEGN---DRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
               R ++ S  CKH+   D+ N   N    RF FD++V+E+D + TF+  F+ CV  G 
Sbjct: 173 -DHPRYIQASGGCKHF---DVHNGPENIPVSRFGFDAKVSERDWRMTFLPQFKTCVEAGS 228

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKED 295
           ++ +MCSYNR+NG+P CA+ KLL   +R +W F+GY++SD  +I+ IV  HK+   T  +
Sbjct: 229 IN-IMCSYNRINGVPACANKKLLTDILRKEWGFNGYVISDSGAIENIVYHHKY-TKTLAE 286

Query: 296 AVARVLKAGLDLD------CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
           A A  +KAG +++       G  Y N  + AV+Q  I+E ++  +L+      MR G FD
Sbjct: 287 AAADSVKAGCNVELTGATGSGVAYFNL-LNAVKQNLISEEELRENLKKPMYSRMRQGEFD 345

Query: 350 GSPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
                 +  +  + + + +H +LA +A+    VL+KN N  LPL       LA++GP A+
Sbjct: 346 PVDMNPFTKIDMSVVLSQEHQDLAVKASAMSFVLMKNLNRVLPLKK-RFDRLAIIGPFAD 404

Query: 408 ATKAMIGNY--EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAID-AAKNA 464
             + + G+Y     P   ++P +G  +    + YA GC D  C N    P AI+ A K A
Sbjct: 405 NAETLFGDYIPNWDPKFVSTPYEGLKSLGDDVRYASGCDDPSCTNYD--PKAIEKAVKGA 462

Query: 465 DATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAK-GPVTLVIMSAGAVDINFA 523
               +  G+  ++E EG DR DL LPG+Q +++      ++  P+ LV+ +AG VD+ + 
Sbjct: 463 QFVFVCLGVGSNLEREGHDRADLDLPGYQLQILKDAEFFSREAPLVLVLFNAGPVDLTWP 522

Query: 524 KNNPKIKSILWVGYPGEEGGRAIADVIFGKYN---PGGRLPITWYEANYVKIPYTSMPLR 580
           K +P++  I+   YP    G+A+  V+    +   P  RLP TW  A   ++P  +    
Sbjct: 523 KLSPEVDGIIECFYPAMGTGKALYQVVTATGDDGVPAARLPSTW-PAQLHQVPSITD--- 578

Query: 581 PVNNFPGRTYKFFD-GPVVYPFGYGLSYTQFKYKVAS-SPKSVDIKLDKDQQCRDINYTV 638
              N  G TY++FD G  +YPFGYGLSYT F Y+  S SP SV                 
Sbjct: 579 --YNMTGHTYRYFDGGDPLYPFGYGLSYTSFHYQTVSVSPTSV---------------RA 621

Query: 639 GTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIG 697
           G N                   T  ++V N G  +  EV  VY S            ++G
Sbjct: 622 GGN------------------VTVTVQVLNRGPYNADEVTQVYMSWMEATVPVPRWTLVG 663

Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           ++R      QS+ + F ++A +    VD A    +  G   I  G
Sbjct: 664 FKRHRHTVNQSSSLSFVVSAEQMAVWVDEATGFQVQPGKMLIYAG 708


>gi|365118446|ref|ZP_09337032.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363649697|gb|EHL88801.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 1283

 Score =  331 bits (848), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 239/751 (31%), Positives = 366/751 (48%), Gaps = 110/751 (14%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDL--AYGVPRLGLPLYEWWSEALHGVSFIGR 71
           Y +  +P  ER  DL+ R+TL EKV Q+ D   + G+ RL +P     +E LHG S+   
Sbjct: 72  YLNPNIPIEERIDDLLPRLTLEEKVIQLSDSWGSKGIARLKIPAM-LKTEGLHGQSY--- 127

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
                          G+T FP  I   ++F+  L +++G+  + EA+A  NL       W
Sbjct: 128 -------------ATGSTIFPHGINMGSTFDTELIQEVGKATAIEAKAA-NL----RVSW 169

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
           SP ++V RD RWGRV ET GEDPY+VGR  + +++G Q                 + AC 
Sbjct: 170 SPVLDVARDARWGRVEETYGEDPYLVGRIGVAWIKGFQGEH--------------MFACP 215

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KH+A +         R   D  ++++ M+   + PF   + E +   VM +Y   NG+P 
Sbjct: 216 KHFAGH---GQPVGGRDSHDYGLSDRVMRNIHLAPFRDVIKEANAFGVMAAYGLWNGVPD 272

Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
               +LL + +R +W F G++VSDC   + I      +  T E+A A  ++AG+D++CG 
Sbjct: 273 NGSKELLQKILREEWGFEGFVVSDCSGPENIQRKQSVVG-TMEEAAAMAVRAGVDIECGS 331

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC---NPQHI 368
            Y      AV++G I E+++D +LR ++   MRLG FD  P  +N+  N +     P+H 
Sbjct: 332 AYKKALASAVKKGIIKESELDANLRRVFRAKMRLGLFD-RPSIENMVWNKLPEYDTPEHR 390

Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG--TPCRYTSP 426
            LA + A +  VLLKN+N  LPL+  NIKT+A++GP  NA +   G+Y     P +  S 
Sbjct: 391 ALARKVAVKSTVLLKNENNLLPLDK-NIKTIAVIGP--NADQGQTGDYSAKYAPGQIISV 447

Query: 427 MDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD--------- 474
           ++G   +   S  + YA GC  +   + +    A++ AK ADA ++V G +         
Sbjct: 448 LEGVKNHVSPSTKVLYAQGCTQL-DMDTTGFAEAVNIAKQADAVILVVGDNSNRHENGNK 506

Query: 475 LSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
            S   E  D   L +PG Q +LI K  +A   PV LV+++     + +   N  I+SIL 
Sbjct: 507 KSTTGENVDGATLEIPGVQRQLI-KAVEATGKPVVLVLVNGKPFTLTWEDEN--IESILE 563

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
             YPGEEGG A AD+IFG  NP GRLPI+     + + P   +PL       GR Y ++D
Sbjct: 564 TWYPGEEGGNATADIIFGDENPSGRLPIS-----FPRHP-GQLPLWYNYETSGRNYDYYD 617

Query: 595 GPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
            P   +Y FG+GLSYT F+Y                    ++  T  +  P    V +D 
Sbjct: 618 MPFTPLYRFGHGLSYTTFRYS-------------------NLKATTKSGDPGFVTVSVD- 657

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKV 711
                        +EN GK  G EV  +Y +       T +  + G++RVF+  G+   V
Sbjct: 658 -------------IENTGKRPGEEVAQLYITDLVASVNTAVIDLKGFKRVFLKPGEKKTV 704

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            F +N    L +++     +L +G   + VG
Sbjct: 705 TFELNPY-LLSLLNPDMKRVLEAGKFRMHVG 734


>gi|224068504|ref|XP_002302759.1| predicted protein [Populus trichocarpa]
 gi|222844485|gb|EEE82032.1| predicted protein [Populus trichocarpa]
          Length = 273

 Score =  329 bits (843), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 155/251 (61%), Positives = 186/251 (74%), Gaps = 15/251 (5%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
           +FP+C  KLP   R  DL+ RMTL EKV  + + A  VPRLG+  YEWWSEALHGVS +G
Sbjct: 38  NFPFCQVKLPIQSRVSDLIGRMTLQEKVGLLVNDAAAVPRLGIKGYEWWSEALHGVSNVG 97

Query: 71  RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
                 PGT F    PGATSFP VI T ASFN +LW+ IG+ VS EARAM+N G AGLT+
Sbjct: 98  ------PGTQFGGAFPGATSFPQVITTAASFNATLWEAIGRVVSDEARAMFNGGVAGLTY 151

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           WSPN+N+ RDPRWGR  ETPGEDP V G+YA +YVRGLQ          +D   LK++AC
Sbjct: 152 WSPNVNIFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQ---------GNDGDRLKVAAC 202

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH+ AYDLDNW G DRFHF+++V++QDM++TF +PF MCV EG V+SVMCSYN+VNGIP
Sbjct: 203 CKHFTAYDLDNWNGVDRFHFNAQVSKQDMEDTFDVPFRMCVKEGKVASVMCSYNQVNGIP 262

Query: 251 TCADPKLLNQT 261
           TCADPKLL +T
Sbjct: 263 TCADPKLLKKT 273


>gi|361127339|gb|EHK99311.1| putative exo-1,4-beta-xylosidase bxlB [Glarea lozoyensis 74030]
          Length = 569

 Score =  328 bits (841), Expect = 7e-87,   Method: Compositional matrix adjust.
 Identities = 202/555 (36%), Positives = 283/555 (50%), Gaps = 70/555 (12%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD   P  +RA  LV+ M   EK+Q +   + GV RLGLP Y WWSEALHGV+       
Sbjct: 65  CDTTAPPADRAAALVKAMQSSEKLQNIISKSAGVSRLGLPPYNWWSEALHGVA------- 117

Query: 75  SPPGTHFDSEVPG--ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
             PG  F S  P   ATS P  IL  A+F++ L +K+G  + TEARA  N  ++G+ FW+
Sbjct: 118 GAPGIQFSSSSPWNYATSLPMPILMAAAFDDDLIEKVGTLIGTEARAFGNGNHSGIDFWT 177

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PNIN  +DPRWGR  ETPGED   +  Y    +RGL   EG +  R       +I A CK
Sbjct: 178 PNINPFKDPRWGRGSETPGEDTLRLKGYVAALLRGL---EGNKAQR-------RIIATCK 227

Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           HYAA DL++W G  R  FD++++ QD+ E ++ PF+ C  +  V S MCSYN VNG+P C
Sbjct: 228 HYAANDLESWNGVTRHDFDAKISMQDLAEYYLQPFQQCARDSKVGSFMCSYNSVNGVPAC 287

Query: 253 ADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           A+  LL   +R  WN+   + Y+ SDC+++Q I  +H + + T     A    AG D  C
Sbjct: 288 ANKYLLQTILRDHWNWTSENQYVTSDCEAVQDISLNHHYAS-TNAAGTALAFNAGTDSSC 346

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICNPQHI 368
                                               GYFDGS   Y +LG +++  PQ  
Sbjct: 347 ----------------------------------EAGYFDGSKALYSSLGWSDVNTPQAQ 372

Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
           +LA +A   GIV+LKND G LPL   +   +A++G  A+ +  + G Y G      +P+ 
Sbjct: 373 QLALQATVDGIVMLKND-GTLPLKLDSKSKVAMIGFWASDSSKLQGGYSGKAPYLRTPV- 430

Query: 429 GFYAYSKVINYAPGCADIVCQNNS-----MIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
             YA ++ + + P  A    Q ++         A+ AA  +D  +   GLD S  AEG D
Sbjct: 431 --YA-AQQLGFTPNVATGPVQQSASATDNWTTNALAAASKSDYILYFGGLDTSAAAEGVD 487

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R  L  P  Q  LI K+  +A G   ++I     +D      N  + SILW  +PG++GG
Sbjct: 488 RTSLEWPSAQLALIKKL--SALGKPLIIIQEGDQMDNTPLLTNKGVSSILWASWPGQDGG 545

Query: 544 RAIADVIFGKYNPGG 558
            A+  +I G  +P G
Sbjct: 546 PAVMQIISGAKSPAG 560


>gi|449489074|ref|XP_002195511.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Taeniopygia guttata]
          Length = 685

 Score =  327 bits (838), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 233/698 (33%), Positives = 352/698 (50%), Gaps = 102/698 (14%)

Query: 48  VPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPG-ATSFPTVILTTASFNESLW 106
           +PRLG+  Y W +E L G                D E PG AT+FP  +   A+F+  L 
Sbjct: 9   IPRLGIAPYNWNTECLRG----------------DGEAPGWATAFPQALGLAAAFSPELI 52

Query: 107 KKIGQTVSTEARAMYNLGNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVG 158
            ++    +TE RA +N   A        GL+ +SP +N++R P WGR  ET GEDP++ G
Sbjct: 53  YRVANATATEVRAKHNSFAAAGRYSDHTGLSCFSPVLNIMRHPLWGRNQETYGEDPFLSG 112

Query: 159 RYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR-FHFDSRVTEQ 217
             A ++V+GLQ             R +K SA CKH++ +      G++    +   V E+
Sbjct: 113 ELARSFVQGLQG---------PHPRYVKASAGCKHFSVHG-----GHENILLYLLTVLER 158

Query: 218 DMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCD 277
           D + TF+  F+ CV  G  S  MCSYNR+NG+P CA+ KLL   +RG+W F GY+VSD  
Sbjct: 159 DWRMTFLPQFQACVRAGSYS-FMCSYNRINGVPACANKKLLTDILRGEWGFDGYVVSDEG 217

Query: 278 SIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTM----GAVQQGKIAEADIDT 333
           +++ I+  H +     E AVA V  AG +L+      N        A+  G I    +  
Sbjct: 218 AVELIMLGHHYTRSFLETAVASV-NAGCNLELSYGMRNNVFMRIPEALAMGNITLQMLRD 276

Query: 334 SLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPL 391
            +R L+   MRLG FD      Y +L  + + +P+H  L+ EAA +  VLLKN  G LPL
Sbjct: 277 RVRPLFYTRMRLGEFDPPAMNPYSSLDLSVVQSPEHRNLSLEAAVKSFVLLKNVRGTLPL 336

Query: 392 NTGNIKT--LALVGPHANATKAMIGNYEGTP-CRYT-SPMDGFYAYSKVINYAPGCADIV 447
              ++ +  LA+VGP A+  + + G+Y   P  RY  +P  G       +++A GC++  
Sbjct: 337 KAQDLSSQHLAVVGPFADNPRVLFGDYAPVPEPRYIYTPRRGLEMLGANVSFAAGCSEPR 396

Query: 448 CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG- 506
           CQ  S     +     AD  ++  G  + VE E KDR DL LPG Q EL+     AA G 
Sbjct: 397 CQRYSRA-ELVKVVGAADVVLVCLGTGVDVETEAKDRSDLSLPGHQLELLQDAVQAAAGR 455

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGK--YNPGGRLPITW 564
           PV L++ +AG +D+++A+ +  + +IL   +P +  G AIA V+ G+   +P GRLP TW
Sbjct: 456 PVILLLFNAGPLDVSWAQAHDGVGAILACFFPAQATGLAIARVLLGEAGASPAGRLPATW 515

Query: 565 YEANYVKIPYTSMPLRPVNNF--PGRTYKFF--DGPVVYPFGYGLSYTQFKYKVASSPKS 620
             A   ++P       P+ N+   GRTY+++  + P +YPFGYGLSYT F+Y+       
Sbjct: 516 -PAGMHQVP-------PMENYTMEGRTYRYYGQEAP-LYPFGYGLSYTTFRYR------- 559

Query: 621 VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
            D+ L                 PP   +      C +   +  + +EN G  D  EVV +
Sbjct: 560 -DLVLS----------------PPVLPL------CAN--LSVSVVLENTGLRDSEEVVQL 594

Query: 681 YSKPPGIAGTHIK-QVIGYERVFIAAGQSAKVGFTMNA 717
           Y +    +    + Q++ + RV + AG+ AK+ F + A
Sbjct: 595 YLRWEHSSVPVPRWQLVAFRRVAVPAGREAKLSFQVLA 632


>gi|413925161|gb|AFW65093.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 323

 Score =  320 bits (819), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 150/290 (51%), Positives = 198/290 (68%), Gaps = 16/290 (5%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           +CD  L   +RA DLV R+T  EK+ Q+GD A GVPRLG+P Y+WW+EALHG++  G+  
Sbjct: 46  FCDVTLAPAQRAADLVSRLTAAEKIAQLGDQAPGVPRLGVPGYKWWNEALHGLATSGK-- 103

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTFWS 132
               G HFD+ V  ATSFP V+LT A+F++ LW +IGQ +  EARA++N+G A GLT WS
Sbjct: 104 ----GLHFDAAVRAATSFPQVLLTAAAFDDDLWLRIGQAIGREARALFNVGQAEGLTIWS 159

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PN+N+ RDPRWGR  ETPGEDP V  RYA+ +VRG+Q         +S S  L+ SACCK
Sbjct: 160 PNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQG--------NSSSSLLQTSACCK 211

Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           H  AYDL++W G  R+ F +RVTEQD+++TF  PF  CV E   S VMC+Y  +NG+P C
Sbjct: 212 HATAYDLEDWNGVARYSFVARVTEQDLEDTFNPPFRSCVVEAKASCVMCAYTAINGVPAC 271

Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           A+  LL  T+RGDW   GY+ SDCD++  + ++ ++   T EDAVA  LK
Sbjct: 272 ANSDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYA-PTPEDAVAVSLK 320


>gi|389636381|ref|XP_003715843.1| beta-xylosidase [Magnaporthe oryzae 70-15]
 gi|351648176|gb|EHA56036.1| beta-xylosidase [Magnaporthe oryzae 70-15]
 gi|440480767|gb|ELQ61414.1| beta-xylosidase [Magnaporthe oryzae P131]
          Length = 517

 Score =  319 bits (818), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 187/502 (37%), Positives = 270/502 (53%), Gaps = 29/502 (5%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD  L  PERA  LVE +++ EK+Q +   + G PR+GLP Y WWSEALHGV++
Sbjct: 35  LSTNNVCDRTLSPPERAAALVEALSIEEKLQNLVSKSQGAPRIGLPAYNWWSEALHGVAY 94

Query: 69  IGRRTNSPPGTHF---DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
                   PGT+F   + E   +TS+P  +L  A F+++L +KIG  +  EARA  N G 
Sbjct: 95  A-------PGTYFPQGNVEFNSSTSYPMPLLMAAGFDDNLIEKIGTAIGIEARAWGNSGW 147

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
           AG  +W+PN+N  +DPRWGR  ETPGED   + RYA    RGL      E  R       
Sbjct: 148 AGFDYWTPNVNAFKDPRWGRGSETPGEDVLRIKRYAEYITRGLDGPVPNEQRR------- 200

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
            I + CKHYA  D ++W G  R  F++++T QD+ E ++ PF+ C  +  V S+MC+YN 
Sbjct: 201 -IISTCKHYAGNDFEDWNGTTRHDFNAKITMQDLAEYYLKPFQQCARDSKVGSIMCAYNA 259

Query: 246 VNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           VNG+P+CA+  LL   +R  W +   + Y+ SDC+++  +  +H +   T     A   +
Sbjct: 260 VNGVPSCANKYLLQTILRDHWKWTEHNNYVTSDCEAVLDVSANHHYA-PTNAAGTAICFE 318

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNN 361
           AG+D  C    ++   GA  QG + E  +D +L  LY  L+R GYFDG    Y +L   +
Sbjct: 319 AGMDTSCEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDGEEAMYADLDWQH 378

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           + + +   LA +AA +G+VLLKN NG LPL+      +A++G  A+A + + G Y G   
Sbjct: 379 VNSAEAQSLALQAAVEGMVLLKN-NGTLPLDLDPSHKIAMIGFWADAPEKLQGGYSGRAH 437

Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNS---MIPAAIDAAKNADATVIVAGLDLSVE 478
              SP   F A    ++       ++  NN+       A++AA  AD  +   GLD S  
Sbjct: 438 HLYSP--AFAARQLGLDITVASGPVLQDNNASDNWTTNALEAASGADYILYFGGLDTSAA 495

Query: 479 AEGKDRVDLLLPGFQTELINKV 500
            E  DR DL  P  Q  L+  V
Sbjct: 496 GETLDRTDLDWPEAQLTLVKVV 517


>gi|397642422|gb|EJK75223.1| hypothetical protein THAOC_03061, partial [Thalassiosira oceanica]
          Length = 534

 Score =  319 bits (818), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 195/565 (34%), Positives = 296/565 (52%), Gaps = 93/565 (16%)

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVN--------- 232
           RP +I+A CKH AAY L+     DRF+F +  +   D + T++  F+ CV+         
Sbjct: 7   RP-RIAATCKHLAAYSLET----DRFNFSADGIDRTDWEGTYLPAFDACVHAERFLLEHY 61

Query: 233 ----------EGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI 282
                     +     VMCSYN ++G+P CADP LL   +R DWNF G +VSDC ++  I
Sbjct: 62  NASGGGGGGQDRGALGVMCSYNAIDGVPACADPALLKDMLRRDWNFTGLVVSDCWAVDNI 121

Query: 283 VESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVL 342
             +H+F+  + E+AV   L++G+DLDCG+ + +F   A  +  + E DID +L  L+ VL
Sbjct: 122 HSNHRFVA-SYEEAVGLALRSGVDLDCGNTFQDFGRLAYDESLLDEDDIDEALSRLFRVL 180

Query: 343 MRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKN-----DNGALPLNTGNIK 397
           M LGYFD + +     K++    +H +LA EAA Q IVLLKN     + G LPL+    K
Sbjct: 181 MDLGYFDETDEPD--AKSSDDEMEHDQLALEAALQSIVLLKNGINEDEPGPLPLSLAKHK 238

Query: 398 TLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAA 457
            +AL GP A+    ++GNY G P    +P+ G       + +    +  VC  +      
Sbjct: 239 EIALFGPLADNQTVLLGNYHGLPSTIVTPLMGLAKMGVEVAFRQRAS--VCDFH------ 290

Query: 458 IDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG---PVTLVIMS 514
                   AT++V GLD S+EAE +DR  LLLP  Q +LI  ++  +K    PV LV++S
Sbjct: 291 -----GESATILVVGLDQSLEAEDQDRTTLLLPVEQRDLIKTISRCSKVRDLPVVLVVVS 345

Query: 515 AGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIP 573
            G VD++  KN+  I +++ + YPG+ GG A+A V++G YNP G+L  T Y  +Y+ ++ 
Sbjct: 346 GGMVDLSRYKNSSDIDAMIHMSYPGQNGGSALAQVLYGAYNPSGKLVGTMYPESYLNEVS 405

Query: 574 YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRD 633
              M +RP   FPGRT++++ G V+YPFGYGLSYT F+Y                     
Sbjct: 406 LHDMRMRPDGKFPGRTHRYYRGDVIYPFGYGLSYTSFRYA-------------------- 445

Query: 634 INYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-- 691
           + +  GT K                     + V N G MDGS  V+++   P        
Sbjct: 446 MEFLGGTVK---------------------VTVSNSGSMDGSVAVLLFHSAPQAGNEQEP 484

Query: 692 IKQVIGYERVFIAAGQSAKVGFTMN 716
            + +IG+E+++++ G S  V F ++
Sbjct: 485 FRSLIGFEKIYVSVGDSQLVSFDVS 509


>gi|440476402|gb|ELQ45004.1| beta-xylosidase, partial [Magnaporthe oryzae Y34]
          Length = 515

 Score =  319 bits (817), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 186/499 (37%), Positives = 269/499 (53%), Gaps = 29/499 (5%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           LS    CD  L  PERA  LVE +++ EK+Q +   + G PR+GLP Y WWSEALHGV++
Sbjct: 35  LSTNNVCDRTLSPPERAAALVEALSIEEKLQNLVSKSQGAPRIGLPAYNWWSEALHGVAY 94

Query: 69  IGRRTNSPPGTHF---DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
                   PGT+F   + E   +TS+P  +L  A F+++L +KIG  +  EARA  N G 
Sbjct: 95  A-------PGTYFPQGNVEFNSSTSYPMPLLMAAGFDDNLIEKIGTAIGIEARAWGNSGW 147

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
           AG  +W+PN+N  +DPRWGR  ETPGED   + RYA    RGL      E  R       
Sbjct: 148 AGFDYWTPNVNAFKDPRWGRGSETPGEDVLRIKRYAEYITRGLDGPVPNEQRR------- 200

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
            I + CKHYA  D ++W G  R  F++++T QD+ E ++ PF+ C  +  V S+MC+YN 
Sbjct: 201 -IISTCKHYAGNDFEDWNGTTRHDFNAKITMQDLAEYYLKPFQQCARDSKVGSIMCAYNA 259

Query: 246 VNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           VNG+P+CA+  LL   +R  W +   + Y+ SDC+++  +  +H +   T     A   +
Sbjct: 260 VNGVPSCANKYLLQTILRDHWKWTEHNNYVTSDCEAVLDVSANHHYA-PTNAAGTAICFE 318

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNN 361
           AG+D  C    ++   GA  QG + E  +D +L  LY  L+R GYFDG    Y +L   +
Sbjct: 319 AGMDTSCEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDGEEAMYADLDWQH 378

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           + + +   LA +AA +G+VLLKN NG LPL+      +A++G  A+A + + G Y G   
Sbjct: 379 VNSAEAQSLALQAAVEGMVLLKN-NGTLPLDLDPSHKIAMIGFWADAPEKLQGGYSGRAH 437

Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNS---MIPAAIDAAKNADATVIVAGLDLSVE 478
              SP   F A    ++       ++  NN+       A++AA  AD  +   GLD S  
Sbjct: 438 HLYSP--AFAARQLGLDITVASGPVLQDNNASDNWTTNALEAASGADYILYFGGLDTSAA 495

Query: 479 AEGKDRVDLLLPGFQTELI 497
            E  DR DL  P  Q  L+
Sbjct: 496 GETLDRTDLDWPEAQLTLV 514


>gi|381170979|ref|ZP_09880130.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
 gi|380688543|emb|CCG36617.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
          Length = 901

 Score =  318 bits (816), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 185/448 (41%), Positives = 254/448 (56%), Gaps = 37/448 (8%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY D +  + +RA DLV RMTL EK  QM + A  +PRL +P Y+WW+EALHGV+  G  
Sbjct: 33  PYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 90

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LG 124
                         GAT FP  I   A+F+  L  ++   +S EARA ++          
Sbjct: 91  --------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHAR 136

Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLTFWSPNIN+ RDPRWGR  ET GEDP++  R  + +V+GLQ  EG    +++   P
Sbjct: 137 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGAAAPKNAQGEP 195

Query: 185 L-KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
             K+ A  KH+A +        DR HFD+R +++D+ ET++  FE  V EG V +VM +Y
Sbjct: 196 YRKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKEGKVDAVMGAY 252

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NRV G    A   LL   +R  W F GY+VSDC +I  I + HK +  T+E A A  +K 
Sbjct: 253 NRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKH 311

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
           G +L+CG+ Y      AV+QG I EA IDT+L+ L    MRLG FD  G   +  +  + 
Sbjct: 312 GTELECGEEYATLP-AAVRQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSTIPASV 370

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
             +P H  LA   AR+ +VLLKND G LPL+   +K +A++GP A+ T A++GNY GTP 
Sbjct: 371 NQSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPA 429

Query: 422 RYTSPMDGFYAYS--KVINYAPGCADIV 447
              + + G  A +    + YA G AD+V
Sbjct: 430 APVTVLQGIRAAAPNAQVLYARG-ADLV 456



 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 97/298 (32%), Positives = 151/298 (50%), Gaps = 53/298 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+D A++AD  V V GL   VE E          G DR DL LP  Q +L+  +  A   
Sbjct: 628 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QATGR 686

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV  V+ +  A+ I++A+ +  + +IL   YPG+ GG A+AD +FG  NPGGRLP+T+Y+
Sbjct: 687 PVVAVLTTGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 744

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
            +     +    +R      GRTY++F G  +YPFG+GLSYTQF Y          ++LD
Sbjct: 745 ESETLPAFDDYAMR------GRTYRYFGGTPLYPFGHGLSYTQFAYS--------GLRLD 790

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           +         T+ T                D   T  + V+N G+  G EVV +Y  P  
Sbjct: 791 RT--------TIAT----------------DGSLTATVTVKNTGQRAGDEVVQLYLHPLA 826

Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
                  K++ G++R+ +  G+  ++GFT+NA  +L++ D    +  +  GA+ + +G
Sbjct: 827 PQRERAGKELHGFQRIALQPGEQRELGFTINAKDALRLYDEQRKAYGVDPGAYEVQIG 884


>gi|256393466|ref|YP_003115030.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
 gi|256359692|gb|ACU73189.1| glycoside hydrolase family 3 domain protein [Catenulispora
           acidiphila DSM 44928]
          Length = 1343

 Score =  318 bits (816), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 248/794 (31%), Positives = 367/794 (46%), Gaps = 119/794 (14%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQM-GDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           Y D    + ERA DLV RMTLPEK  Q+  + A  +PRLG+  Y +WSE  HGV+ +G  
Sbjct: 49  YLDTHYSFAERAADLVSRMTLPEKAAQLQTNSAPAIPRLGVQEYTYWSEGQHGVNTLGAD 108

Query: 73  TNSPPGTHFDSEVPG---ATSFPTVILTTASFNESLWKKIGQTVSTEARAMY-------- 121
           +N         +V G   ATSFP     T S++ +L  K    VS E R           
Sbjct: 109 SNR-------GDVTGGVHATSFPVNFAATMSWDPALTYKETTAVSDEVRGFLDKSLWGTG 161

Query: 122 --NLGNAG-----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
             NLG +      LTFW+PN+N+ RDP WGR  E+ GEDPY+    A  +V G Q   G 
Sbjct: 162 QNNLGPSASDYGALTFWAPNVNMDRDPLWGRTNESFGEDPYLTSTMAGAFVDGYQ---GQ 218

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
                  +  LK++A  KHY+   L+N E + R    S  T+ ++++ +   F   V + 
Sbjct: 219 SMTGQQQTPYLKVAATAKHYS---LNNIE-DSRHTGSSDTTDANIRDYYTKQFASLVRDA 274

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI--VESHKFL--- 289
            VS +M SYN VNG P+ AD   +++ ++  + F GY  SDC +I  +    SH +    
Sbjct: 275 HVSGIMTSYNAVNGTPSPADTYTVDELLQATYGFAGYTTSDCGAIGDVYGAASHGWAPPG 334

Query: 290 ---NDTK--EDAVAR-----------VLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADI 331
              N T    +A  R            ++AG  L+C  G+        A+  G ++   +
Sbjct: 335 WTSNGTSWTNNATGRQISAAAGGQAFAIRAGTQLNCAGGEMTAQNISAAIDLGLLSNGVV 394

Query: 332 DTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGA- 388
           D +L  L+ V M  G FD  G   Y  + K+ I +P H  LA + A   IVLL+  NGA 
Sbjct: 395 DATLTRLFTVRMETGEFDPAGKVGYTKITKDQIESPAHQALAEQVAANDIVLLQ--NGAV 452

Query: 389 -------LPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAP 441
                  LP++     ++ +VG  AN  K  +G Y G P    + + G  A  +  N + 
Sbjct: 453 SGTSAKLLPVDPAKTDSVVIVGDLAN--KVTLGGYSGEPTHEVNAVQGITAAVQAANPSA 510

Query: 442 GCADIVCQNNSMI--PAAIDAA-----KNADATVIVAGLDLSVEAEGKDRVDLLLPGFQT 494
                 C   + I  PA+  AA     K+A   ++VAG DLSV  E  DR  L LPG   
Sbjct: 511 TVTFDACGTGTQITTPASCSAATQAAIKSASLVLVVAGSDLSVADEANDRSTLALPGNYD 570

Query: 495 ELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKY 554
            LI++V+        LV+ + G  DI  A+ +    +I++ GY G+  G A+A V+FG+ 
Sbjct: 571 SLISQVSALGNPRTALVMQADGPYDIQDAQKD--FPAIVFSGYNGQSQGTALAQVLFGQQ 628

Query: 555 NPGGRLPITWYEANYVKIPYTSMPLRPVNNFP-GRTYKFFDGPVVYPFGYGLSYTQFKYK 613
           NP G L  TWY  +    P  +  L P      GRTY++F G   YPFGYG SY+ F Y 
Sbjct: 629 NPAGHLDFTWYSGDSQLAPMDNYGLTPSQTGGLGRTYQYFTGTPTYPFGYGQSYSSFAYS 688

Query: 614 -VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKM 672
            V   P++ +                                  D       +V+N G +
Sbjct: 689 HVQVGPQNTN---------------------------------ADGTVHVSFDVKNTGTV 715

Query: 673 DGSEVVMVYSKPPGIAGTH---IKQVIGYERV-FIAAGQSAKVGFTMNACKSLKIVDNAA 728
            G+ V  +Y+ PPG AGT+    +Q+ G+++   +  GQS  +  ++         +++ 
Sbjct: 716 AGTTVAQLYAAPPG-AGTNDTTREQLAGFQKTNTLKPGQSQHISLSVKVSSLSTWDESSL 774

Query: 729 NSLLASGAHTILVG 742
             ++A GA+   VG
Sbjct: 775 KQVVADGAYQFRVG 788


>gi|413925165|gb|AFW65097.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 412

 Score =  318 bits (816), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 157/297 (52%), Positives = 198/297 (66%), Gaps = 13/297 (4%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            P+C+ KLP  +RA DLV RMT  EK  Q+GD+A GVPRLG+P Y+WW+EALHGV+  G+
Sbjct: 96  LPFCNTKLPAAQRAADLVSRMTPAEKASQLGDVANGVPRLGVPSYKWWNEALHGVAISGK 155

Query: 72  RTNSPPGTHFD-SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLT 129
                 G H D   V  ATSFP V+LT ASFN++LW +IGQ    EARA YN+G A GLT
Sbjct: 156 ------GIHMDRGAVRSATSFPQVLLTAASFNDNLWFRIGQATGKEARAFYNIGQAEGLT 209

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
            WSPN+N+ RDPRWGR  ETPGEDP V  RYA  +VRGLQ   G   +  S    L  SA
Sbjct: 210 MWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQ---GSSSNTKSVPPVLLTSA 266

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
           CCKH  AYDL++W+G  R+ F + VT QD+ +TF  PF  CV +G  S VMC+Y  VNG+
Sbjct: 267 CCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMCAYTSVNGV 326

Query: 250 PTCADPKLLNQTIRGDWNFHG-YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           P+CA+  LL +T RG W   G Y+ +DCD++ +I+ + +F   T ED VA  LKAG+
Sbjct: 327 PSCANADLLTKTFRGSWGLDGRYVAADCDAV-SIMRNSQFYRPTAEDTVATTLKAGM 382


>gi|346726970|ref|YP_004853639.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346651717|gb|AEO44341.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 902

 Score =  318 bits (816), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 184/447 (41%), Positives = 255/447 (57%), Gaps = 37/447 (8%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D +  + +RA DLV RMTL EK  QM + A  +PRLG+P Y+WW+EALHGV+  G   
Sbjct: 35  YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 91

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LGN 125
                        GAT FP  I   A+F+  L  ++   +S EARA ++           
Sbjct: 92  -------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARY 138

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLTFWSPNIN+ RDPRWGR  ET GEDP++  R  + +V+GLQ  EG +  +++   P 
Sbjct: 139 QGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGADAPKNAQGEPY 197

Query: 186 -KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
            K+ A  KH+A +        DR HFD+R +++D+ ET++  FE  V +G V +VM +YN
Sbjct: 198 RKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYN 254

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           RV G    A   LL   +R  W F GY+VSDC +I  I + HK +  T+E A A  +K G
Sbjct: 255 RVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHG 313

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNI 362
            +L+CG+ Y+     AV QG I EA IDT+L+ L    MRLG FD  G   +  +  +  
Sbjct: 314 TELECGEEYSTLP-AAVHQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSTIPASVN 372

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
            +P H  LA   AR+ +VLLKND G LPL+   +K +A++GP A+ T A++GNY GTP  
Sbjct: 373 QSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAA 431

Query: 423 YTSPMDGFYAYS--KVINYAPGCADIV 447
             + + G  A +    + YA G AD+V
Sbjct: 432 PVTVLQGIRAAAPNAQVLYARG-ADLV 457



 Score =  141 bits (356), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 94/298 (31%), Positives = 145/298 (48%), Gaps = 53/298 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+D A++AD  V V GL   VE E          G DR DL LP  Q +L+  +    K 
Sbjct: 629 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK- 687

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV  V+ +  A+ I++A+ +  + +IL   YPG+ GG A+AD +FG  NPGGRLP+T+Y+
Sbjct: 688 PVVAVLTTGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 745

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
            +     +    +R      GRTY++F G  +YPFG+GLSYTQF Y          ++LD
Sbjct: 746 ESETLPAFDDYAMR------GRTYRYFGGTPLYPFGHGLSYTQFAYS--------GLRLD 791

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           +                             D   T  + V+N G+  G EVV +Y  P  
Sbjct: 792 R------------------------TTIAADGSLTATVTVKNTGQRAGDEVVQLYLHPLT 827

Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
                  K++ G++R+ +  G+   + FT++A  +L+I D    +     GA+ + +G
Sbjct: 828 PQRERAGKELHGFQRIALQPGEQRALHFTLDAKNALRIYDAQRKAYAVDPGAYEVQIG 885


>gi|285016879|ref|YP_003374590.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
 gi|283472097|emb|CBA14604.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
           PC73]
          Length = 914

 Score =  317 bits (813), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 184/447 (41%), Positives = 257/447 (57%), Gaps = 37/447 (8%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D++  + +RA DLV RMTL EKV QM + A  +PRLG+P Y+WW+E LHGV+  G   
Sbjct: 34  YLDSQRTFAQRADDLVARMTLEEKVAQMQNAAPAIPRLGVPAYDWWNEGLHGVARAG--- 90

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--------N 125
                        GAT FP  I   A+F+  L  ++   +S EARA ++           
Sbjct: 91  -------------GATVFPQAIGLAATFDLPLMHEVSTAISDEARAKHHEALRRGEHGRY 137

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLTFWSPNIN+ RDPRWGR  ET GEDP++  R  + +V+G+Q  EG +  +++     
Sbjct: 138 QGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGMQG-EGADAPKNAQGETY 196

Query: 186 -KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
            K+ A  KH+A +   +   ++R HFD+R +++D+ ET++  FE  V EG V +VM +YN
Sbjct: 197 RKLDATAKHFAVH---SGPESERHHFDARPSQRDLYETYLPAFEALVKEGKVDAVMGAYN 253

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           R+ G    A   LL   +R  W FHGY+VSDC +I  I ++HK +  T+E A A  +K G
Sbjct: 254 RLFGESASASKFLLRDVLRERWGFHGYVVSDCWAIVDIWKNHKIVA-TREQAAALAVKNG 312

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNI 362
             L+CG  Y      AVQQG I E DID +LR L    MRLG FD  G  ++  L  +  
Sbjct: 313 TQLECGQEYATLP-AAVQQGLIGETDIDAALRTLMTARMRLGMFDPPGQLRWAQLPISVN 371

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
            +P+H  LA   AR+ +VLLKND G LPL+    K +A++GP A+ T A++GNY GTP  
Sbjct: 372 QSPEHDALARRTARESLVLLKND-GLLPLSRAKHKRIAVIGPTADDTMALLGNYYGTPAT 430

Query: 423 YTSPMDGFYAYSKVIN--YAPGCADIV 447
             + + G  A +   +  YA G AD+V
Sbjct: 431 PVTILQGIRAAAPDADVLYARG-ADLV 456



 Score =  155 bits (393), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 100/284 (35%), Positives = 147/284 (51%), Gaps = 56/284 (19%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+D A+ AD  V V GL   VE E          G DR DL LP  Q EL+  ++   K 
Sbjct: 628 ALDTARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLQALSATGK- 686

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV  V+ +  A+ I++A+ +  + +IL   YPG+ GG A+ADV+FG  NPGGRLP+T+Y+
Sbjct: 687 PVVAVLTTGSALAIDWAQEH--VPAILLAWYPGQRGGSAVADVLFGDTNPGGRLPVTFYK 744

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
           A+     +    +R      GRTY++F G  +YPFG+GLSYTQF Y         D++LD
Sbjct: 745 ASETLPAFDDYAMR------GRTYRYFAGTPLYPFGHGLSYTQFAYS--------DLRLD 790

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           + +                           D + +  ++V N G   G EVV +Y  P  
Sbjct: 791 RRK------------------------VAADGQLSATLKVTNTGTRAGDEVVQLYLHP-- 824

Query: 687 IAGTH---IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           +A T    IK++ G++R+ +A G+S  V FT++    L+I D A
Sbjct: 825 LAPTRARAIKELRGFQRIALAPGESRDVHFTISPQTDLRIYDEA 868


>gi|167524198|ref|XP_001746435.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775197|gb|EDQ88822.1| predicted protein [Monosiga brevicollis MX1]
          Length = 834

 Score =  317 bits (812), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 229/743 (30%), Positives = 353/743 (47%), Gaps = 110/743 (14%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQM--GDLAYGVP-----RLGLPLYEWWSEAL 63
           ++P+ +  LP+  R  DLV R+TL EK+QQ+  G  A   P     RLG+  + W SE +
Sbjct: 33  EYPFRNPDLPWAARLDDLVGRLTLEEKLQQLQHGGAAQMTPAPAVERLGIGPFVWGSECV 92

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
            G   +G   N P GT          +FP  +   A+F+ +L K+   T++ E RA  N 
Sbjct: 93  TG---LGTDGNDPHGT----------AFPQPLGMAATFDPALLKRAAGTIALELRAQRNF 139

Query: 124 G--------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
                    + GL+ WSP +N+ R P WGR  ET GE P +    A ++V G+Q      
Sbjct: 140 DRENGVVKFHHGLSCWSPVVNINRHPLWGRNDETFGECPVLSSFMARSFVEGIQG----- 194

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNE 233
               + +R    +A CKH     LD + G D  R+ FD+ V++ D+  TF++ FE C   
Sbjct: 195 ----NHTRYYAAAAACKH-----LDVYGGPDNLRYVFDADVSQADLTGTFLMAFEECAAA 245

Query: 234 GDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTK 293
           G V   MCSYN + G+P CA+ + +    R  W F GY+VSD  ++  I ESH +  +  
Sbjct: 246 G-VMGYMCSYNSIRGVPACANYRTMTFFAREQWGFEGYVVSDQGAVFRITESHNYTANQT 304

Query: 294 EDAVARVLKAGLDLDCGD-----YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
             AVA  L AG D++  D      Y N ++ A+       A ID S+  L+ V MRLG F
Sbjct: 305 LGAVA-ALNAGCDMEDSDDAQHVAYYNLSL-ALDLKLTDMATIDASVSRLFYVRMRLGEF 362

Query: 349 DGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLN-TGNIKTLALVGP 404
           D  P+   +++L  + + +P H+E+A + A   IVLLKN N  LPL+      +  L+GP
Sbjct: 363 D-PPENDPWRSLNMSIVSSPAHVEMARDVATASIVLLKNQNETLPLSAAAKNASYCLLGP 421

Query: 405 HANATKAMIGNY--EGTPCRYTSPMDGFYA------YSKVINYAPGCADIVCQNNSMIPA 456
            A+    M+G Y   G+     +   G  A       +    Y  GC    C        
Sbjct: 422 FADNADLMMGKYSPHGSTNVTVTYRAGLAAALQNASQTASFQYLEGCTGPFCDGLDTAAV 481

Query: 457 AIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADA--AKGPVTLVIMS 514
                +  D  ++  G    VE+E  DR ++  PG Q  L+  V +A   K  + L++ +
Sbjct: 482 TTFIQQGCDTVLLAVGTSYHVESESLDRSNMSFPGAQPTLVQTVLEALGTKQRLVLLVST 541

Query: 515 AGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPY 574
           AG VD+   + + ++ +IL + Y G+  G A+AD++ G+ +P GRLP +W        P 
Sbjct: 542 AGPVDLAALEQDTRVAAILDLIYLGQTAGTALADILLGETSPSGRLPFSW--------PN 593

Query: 575 TSMPLRPVNNFP--GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCR 632
               + P++++   GRTY+F    V++PFGYGLSYTQF     ++P  +           
Sbjct: 594 KVSDVPPIDDYTMQGRTYRFAQADVLFPFGYGLSYTQFNLSHLAAPYIL----------- 642

Query: 633 DINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI 692
                     P C A+ +             + V N G++ G+  + VY + P   G  I
Sbjct: 643 ----------PVCQALRLS------------VNVTNTGRLSGAIPLQVYVEWPNAVGGPI 680

Query: 693 KQVIGYERVFIAAGQSAKVGFTM 715
           +Q+    RVF+ A  S  V  ++
Sbjct: 681 RQLATTTRVFVDAASSKTVQLSI 703


>gi|390991557|ref|ZP_10261819.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
 gi|372553724|emb|CCF68794.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
          Length = 901

 Score =  316 bits (810), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 184/448 (41%), Positives = 253/448 (56%), Gaps = 37/448 (8%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY D +  +  RA DLV RMTL EK  QM + A  +PRL +P Y+WW+EALHGV+  G  
Sbjct: 33  PYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 90

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LG 124
                         GAT FP  I   A+F+  L  ++   +S EARA ++          
Sbjct: 91  --------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHAR 136

Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLTFWSPNIN+ RDPRWGR  ET GEDP++  R  + +V+GLQ  EG    +++   P
Sbjct: 137 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGAAAPKNAQGEP 195

Query: 185 L-KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
             K+ A  KH+A +        DR HFD+R +++D+ ET++  FE  V +G V +VM +Y
Sbjct: 196 YRKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAY 252

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NRV G    A   LL   +R  W F GY+VSDC +I  I + HK +  T+E A A  +K 
Sbjct: 253 NRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKH 311

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
           G +L+CG+ Y      AV+QG I EA IDT+L+ L    MRLG FD  G   +  +  + 
Sbjct: 312 GTELECGEEYATLP-AAVRQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSTIPASV 370

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
             +P H  LA   AR+ +VLLKND G LPL+   +K +A++GP A+ T A++GNY GTP 
Sbjct: 371 NQSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPA 429

Query: 422 RYTSPMDGFYAYS--KVINYAPGCADIV 447
              + + G  A +    + YA G AD+V
Sbjct: 430 APVTVLQGIRAAAPKAQVLYARG-ADLV 456



 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 97/298 (32%), Positives = 151/298 (50%), Gaps = 53/298 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+D A++AD  V V GL   VE E          G DR DL LP  Q +L+  +  A   
Sbjct: 628 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QATGR 686

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV  V+ +  A+ I++A+ +  + +IL   YPG+ GG A+AD +FG  NPGGRLP+T+Y+
Sbjct: 687 PVVAVLTTGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 744

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
            +     +    +R      GRTY++F G  +YPFG+GLSYTQF Y          ++LD
Sbjct: 745 ESETLPAFDDYAMR------GRTYRYFGGTPLYPFGHGLSYTQFAYS--------GLRLD 790

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           +         T+ T                D   T  + V+N G+  G EVV +Y  P  
Sbjct: 791 RT--------TIAT----------------DGSLTATVTVKNTGQRAGDEVVQLYLHPLA 826

Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
                  K++ G++R+ +  G+  ++GFT+NA  +L++ D    +  +  GA+ + +G
Sbjct: 827 PQRERAGKELHGFQRIALQPGEQRELGFTINAKDALRLYDEQRKAYGVDPGAYEVQIG 884


>gi|78049893|ref|YP_366068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
 gi|78038323|emb|CAJ26068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
          Length = 902

 Score =  315 bits (808), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 183/447 (40%), Positives = 255/447 (57%), Gaps = 37/447 (8%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D +  + +RA DLV RMTL EK  QM + A  +PRLG+P Y+WW+EALHGV+  G   
Sbjct: 35  YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 91

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LGN 125
                        GAT FP  I   A+F+  L  ++   +S EARA ++           
Sbjct: 92  -------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARY 138

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLTFWSPNIN+ RDPRWGR  ET GEDP++  R  + +V+GL+  EG +  +++   P 
Sbjct: 139 QGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLRG-EGADAPKNAQGEPY 197

Query: 186 -KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
            K+ A  KH+A +        DR HFD+R +++D+ ET++  FE  V +G V +VM +YN
Sbjct: 198 RKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYN 254

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           RV G    A   LL   +R  W F GY+VSDC +I  I + HK +  T+E A A  +K G
Sbjct: 255 RVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHG 313

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNI 362
            +L+CG+ Y+     AV+QG I EA IDT+L  L    MRLG FD  G   +  +  +  
Sbjct: 314 TELECGEEYSTLP-AAVRQGLIDEAQIDTALTTLMTARMRLGMFDPPGQLPWSTIPASVN 372

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
            +P H  LA   AR+ +VLLKND G LPL+   +K +A++GP A+ T A++GNY GTP  
Sbjct: 373 QSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAA 431

Query: 423 YTSPMDGFYAYS--KVINYAPGCADIV 447
             + + G  A +    + YA G AD+V
Sbjct: 432 PVTVLQGIRAAAPNAQVLYARG-ADLV 457



 Score =  140 bits (352), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 94/298 (31%), Positives = 144/298 (48%), Gaps = 53/298 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+D A +AD  V V GL   VE E          G DR DL LP  Q +L+  +    K 
Sbjct: 629 ALDVASSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK- 687

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV  V+ +  A+ I++A+ +  + +IL   YPG+ GG A+AD +FG  NPGGRLP+T+Y+
Sbjct: 688 PVVAVLTTGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 745

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
            +     +    +R      GRTY++F G  +YPFG+GLSYTQF Y          ++LD
Sbjct: 746 ESETLPAFDDYAMR------GRTYRYFGGTPLYPFGHGLSYTQFAYS--------GLRLD 791

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           +                             D   T  + V+N G+  G EVV +Y  P  
Sbjct: 792 R------------------------TTIAADGSLTATVTVKNTGQRAGDEVVQLYLHPLT 827

Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
                  K++ G++R+ + AG+   + F ++A  +L+I D    +     GA+ + +G
Sbjct: 828 PQRERAGKELHGFQRITLQAGEQRALHFILDAKNALRIYDAQRKAYAVDPGAYEVQIG 885


>gi|21244948|ref|NP_644530.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
 gi|21110666|gb|AAM39066.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
          Length = 901

 Score =  315 bits (808), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 185/448 (41%), Positives = 251/448 (56%), Gaps = 37/448 (8%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY D +  +  RA DLV RMTL EK  QM + A  +PRL +P Y+WW+EALHGV+  G  
Sbjct: 33  PYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 90

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LG 124
                         GAT FP  I   A+F+  L  ++   +S EARA ++          
Sbjct: 91  --------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHAR 136

Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLTFWSPNIN+ RDPRWGR  ET GEDP++  R  + +V+GLQ  EG    +++   P
Sbjct: 137 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGAAAPKNAQGEP 195

Query: 185 L-KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
             K+ A  KH A +        DR HFD+R +++D+ ET++  FE  V EG V +VM +Y
Sbjct: 196 YRKLDATAKHLAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKEGKVDAVMGAY 252

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NRV G    A   LL   +R  W F GY+VSDC +I  I + HK +  T+E A A  +K 
Sbjct: 253 NRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKH 311

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
           G +L+CG+ Y      AV+QG I EA IDT+L+ L    MRLG FD  G   +  +  + 
Sbjct: 312 GTELECGEEYATLP-AAVRQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSTIPASV 370

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
             +P H  LA   AR+ +VLLKND G LPL+    K +A++GP A+ T A++GNY GTP 
Sbjct: 371 NQSPAHDALARRTARESLVLLKND-GLLPLSRAKFKRIAVIGPTADDTMALLGNYYGTPA 429

Query: 422 RYTSPMDGFYAYS--KVINYAPGCADIV 447
              + + G  A +    + YA G AD+V
Sbjct: 430 APVTVLQGIRAAAPNAQVLYARG-ADLV 456



 Score =  145 bits (366), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 96/298 (32%), Positives = 150/298 (50%), Gaps = 53/298 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+D A++AD  V V GL   VE E          G DR DL LP  Q +L+  +  A   
Sbjct: 628 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QATGR 686

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV  V+ +  A+ I++A+ +  + +IL   YPG+ GG A+AD +FG  NPGGRLP+T+Y+
Sbjct: 687 PVVAVLTTGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 744

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
            +     +    +R      GRTY++F G  +YPFG+GLSYTQF Y          ++LD
Sbjct: 745 ESETLPAFDDYAMR------GRTYRYFGGTPLYPFGHGLSYTQFAYS--------GLRLD 790

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           +         T+ T                D      + V+N G+  G EVV +Y  P  
Sbjct: 791 RT--------TIAT----------------DGSLAATVTVKNTGQRAGDEVVQLYLHPLA 826

Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
                  K++ G++R+ +  G+  ++GFT+NA  +L++ D    +  +  GA+ + +G
Sbjct: 827 PQRERAGKELHGFQRIALQPGEQRELGFTINAKDALRLYDEQRKAYGVDPGAYEVQIG 884


>gi|325916103|ref|ZP_08178390.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325537647|gb|EGD09356.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 896

 Score =  315 bits (808), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 192/460 (41%), Positives = 256/460 (55%), Gaps = 45/460 (9%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY D +LP+  RA DLV RMTL EK  QM + A  +PRL +P Y+WW+EALHGV+  G  
Sbjct: 39  PYLDTQLPFETRAADLVSRMTLEEKAAQMQNAAPAIPRLRVPAYDWWNEALHGVARAG-- 96

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------ 126
                         GAT FP  I   A+F+  L  ++   +S EARA ++   A      
Sbjct: 97  --------------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARDEHKR 142

Query: 127 --GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLTFWSPNIN+ RDPRWGR  ET GEDP++  R  + +V+GLQ  +G  Y        
Sbjct: 143 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQG-PYR------- 194

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
            K+ A  KHYA +   +    DR HFD   +E+D+ ET++  F+  V EG V++VM +YN
Sbjct: 195 -KLDATAKHYAVH---SGPEADRHHFDVHPSERDLHETYLPAFQALVQEGHVAAVMGAYN 250

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           RVNG    A  + L   +R DW F GYIVSDC +I+ I ++HK +  T E A A  +K G
Sbjct: 251 RVNGESASASTR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVP-TPEAAAALGVKHG 308

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNI 362
            DLDCGD Y      AV+ G I EA IDTSL+ L    MRLG FD   +  +  +  +  
Sbjct: 309 TDLDCGDTYAALPK-AVRAGLIDEATIDTSLKRLMTTRMRLGMFDPPAKVAWAQIPASVN 367

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
            +PQH  LA   AR+ +VLLKND G LPL    +K +A+VGP A+   +++GNY GTP  
Sbjct: 368 QSPQHDALARRTARESLVLLKND-GLLPLKP-TLKRIAVVGPTADDPMSLLGNYYGTPAA 425

Query: 423 YTSPMDGFY--AYSKVINYAPGCADIVCQNNSMIPAAIDA 460
             + + G    A    + YA G   +  + +    A IDA
Sbjct: 426 PVTILQGIRDAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465



 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 98/298 (32%), Positives = 149/298 (50%), Gaps = 53/298 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+DAA+NA+  V V GL   VE E          G DR D  LP  Q EL+  +  A   
Sbjct: 623 AVDAARNAEVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQAL-QATGT 681

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV  V+ +  A+ +++A+ +  + +IL   YPG+ GG A+ DV+FG+ +PGGRLPIT+Y+
Sbjct: 682 PVVAVLTTGSALAVDWAQQH--VPAILLAWYPGQRGGSAVGDVLFGQASPGGRLPITFYK 739

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
                  +    +R      GRTY++F G  +YPFG+GLSYTQF Y         D++LD
Sbjct: 740 EAERLPAFDDYAMR------GRTYRYFTGTALYPFGHGLSYTQFAYS--------DLRLD 785

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           +         T+G                 D      ++V N GK  G EVV +Y  P  
Sbjct: 786 RT--------TLGA----------------DGTLRATLKVRNTGKRAGDEVVQLYLHPLD 821

Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS-LLASGAHTILVG 742
                  K++ G++R+ +  G+  +V FT+ A  +L+I D    +  +  GA+ + +G
Sbjct: 822 PKRERAGKELRGFQRMTLQPGEQREVAFTLKAADALRIYDEQRKTYAVDPGAYEVQIG 879


>gi|289668505|ref|ZP_06489580.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
           NCPPB 4381]
          Length = 902

 Score =  315 bits (807), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 184/447 (41%), Positives = 249/447 (55%), Gaps = 35/447 (7%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY D +  + +RA DLV RMTL EK  QM + A  +PRLG+  Y+WW+EALHGV+  G  
Sbjct: 34  PYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVAAYDWWNEALHGVARAG-- 91

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL--------G 124
                         GAT FP  I   A+F+  L  ++   +S EARA ++          
Sbjct: 92  --------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHER 137

Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLTFWSPNIN+ RDPRWGR  ET GEDP++  R  + +VRGLQ   G           
Sbjct: 138 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESY 197

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
            K+ A  KH+A +        DR HFD+R +++D+ ET++  FE  V +G V +VM +YN
Sbjct: 198 RKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYN 254

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           RV G    A   LL   +R  W F GY+VSDC +I  I + HK +  T+E A A  +K G
Sbjct: 255 RVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHG 313

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNI 362
            +L+CG+ Y+     AV QG I EA IDTSL+ L    MRLG FD  G   +  +  +  
Sbjct: 314 TELECGEEYSTLP-AAVHQGLIEEAQIDTSLQTLMTARMRLGMFDPPGQLPWSKIPASVN 372

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
            +P H  LA   AR+ +VLLKND G LPL+   +K +A++GP A+ T A++GNY GTP  
Sbjct: 373 QSPAHDALARRTARESLVLLKND-GLLPLSRTKLKRIAVIGPTADDTMALLGNYYGTPAA 431

Query: 423 YTSPMDGFYAYS--KVINYAPGCADIV 447
             + + G  A +    + YA G AD+V
Sbjct: 432 PVTVLQGIRAAAPNAQVLYARG-ADLV 457



 Score =  145 bits (367), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 98/298 (32%), Positives = 150/298 (50%), Gaps = 53/298 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+D A++A+  V V GL   VE E          G DR DL LP  Q EL+  +    K 
Sbjct: 629 ALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQATGK- 687

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV  V+ +  A+ I++A+ +  + +IL   YPG+ GG A+AD +FG  NPGGRLP+T+Y+
Sbjct: 688 PVVAVLTAGSALAIDWAQQH--VPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 745

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
            +       ++P        GRTY++F G  +YPFG+GLSYTQF Y         D++LD
Sbjct: 746 ES------EALPAFDDYAMHGRTYRYFGGTPLYPFGHGLSYTQFAYS--------DLRLD 791

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           ++        TV                  D  FT  + V+N G+  G EV  +Y  P  
Sbjct: 792 RN--------TV----------------AADGSFTATVTVKNTGQRAGDEVAQLYLHPLT 827

Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
                  K++ G++RV +  G+  ++ F +NA ++L+I D    +     GA+ + +G
Sbjct: 828 PQRERAGKELRGFQRVALHPGEQRELRFPINAKEALRIYDEQRKTYTVDPGAYEVQIG 885


>gi|289666226|ref|ZP_06487807.1| beta-glucosidase precursor [Xanthomonas campestris pv. vasculorum
           NCPPB 702]
          Length = 902

 Score =  315 bits (807), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 184/447 (41%), Positives = 249/447 (55%), Gaps = 35/447 (7%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY D +  + +RA DLV RMTL EK  QM + A  +PRLG+  Y+WW+EALHGV+  G  
Sbjct: 34  PYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVAAYDWWNEALHGVARAG-- 91

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL--------G 124
                         GAT FP  I   A+F+  L  ++   +S EARA ++          
Sbjct: 92  --------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHER 137

Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLTFWSPNIN+ RDPRWGR  ET GEDP++  R  + +VRGLQ   G           
Sbjct: 138 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESY 197

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
            K+ A  KH+A +        DR HFD+R +++D+ ET++  FE  V +G V +VM +YN
Sbjct: 198 RKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYN 254

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           RV G    A   LL   +R  W F GY+VSDC +I  I + HK +  T+E A A  +K G
Sbjct: 255 RVYGESASASKFLLQDLLRQQWGFKGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHG 313

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNI 362
            +L+CG+ Y+     AV QG I EA IDTSL+ L    MRLG FD  G   +  +  +  
Sbjct: 314 TELECGEEYSTLP-AAVHQGLIEEAQIDTSLQTLMTARMRLGMFDPPGQLPWSKIPASVN 372

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
            +P H  LA   AR+ +VLLKND G LPL+   +K +A++GP A+ T A++GNY GTP  
Sbjct: 373 QSPAHDALARRTARESLVLLKND-GLLPLSRTKLKRIAVIGPTADDTMALLGNYYGTPAA 431

Query: 423 YTSPMDGFYAYS--KVINYAPGCADIV 447
             + + G  A +    + YA G AD+V
Sbjct: 432 PVTVLQGIRAAAPNAQVLYARG-ADLV 457



 Score =  146 bits (369), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 98/298 (32%), Positives = 150/298 (50%), Gaps = 53/298 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+D A++A+  V V GL   VE E          G DR DL LP  Q EL+  +    K 
Sbjct: 629 ALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQATGK- 687

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV  V+ +  A+ I++A+ +  + +IL   YPG+ GG A+AD +FG  NPGGRLP+T+Y+
Sbjct: 688 PVVAVLTAGSALAIDWAQQH--VPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 745

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
            +       ++P        GRTY++F G  +YPFG+GLSYTQF Y         D++LD
Sbjct: 746 ES------EALPAFDDYAMHGRTYRYFGGTPLYPFGHGLSYTQFAYS--------DLRLD 791

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           ++        TV                  D  FT  + V+N G+  G EV  +Y  P  
Sbjct: 792 RN--------TV----------------AADGSFTATVTVKNTGQRAGDEVAQLYLHPLT 827

Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
                  K++ G++RV +  G+  ++ F +NA ++L+I D    +     GA+ + +G
Sbjct: 828 PQRERAGKELRGFQRVALHPGEQRELSFPINAKEALRIYDEQRKTYTVDPGAYEVQIG 885


>gi|418518550|ref|ZP_13084692.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
 gi|418522850|ref|ZP_13088880.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
 gi|410700720|gb|EKQ59264.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
 gi|410703176|gb|EKQ61671.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
          Length = 901

 Score =  315 bits (807), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 184/448 (41%), Positives = 254/448 (56%), Gaps = 37/448 (8%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY D +  +  RA DLV RMTL EK  QM + A  +PRL +P Y+WW+EALHGV+  G  
Sbjct: 33  PYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 90

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LG 124
                         GAT FP  I   A+F+  L  ++   +S EARA ++          
Sbjct: 91  --------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHAR 136

Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD-SR 183
             GLTFWSPNIN+ RDPRWGR  ET GEDP++  R  + +V+GLQ  EG +  +++   R
Sbjct: 137 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGADAPKNAQGER 195

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
             K+ A  KH+A +        DR HFD+R +++D+ ET++  FE  V +G V +VM +Y
Sbjct: 196 YRKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAY 252

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NRV G    A   LL   +R  W F GY+VSDC +I  I + HK +  T+E A A  +K 
Sbjct: 253 NRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKH 311

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
           G +L+CG+ Y      AV+QG I EA IDT+L+ L    MRLG FD  G   +  +  + 
Sbjct: 312 GTELECGEEYATLP-AAVRQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSTIPASV 370

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
             +P H  LA   AR+ +VLLKND G LPL+   +K +A++GP A+ T A++GNY GTP 
Sbjct: 371 NQSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPA 429

Query: 422 RYTSPMDGFYAYS--KVINYAPGCADIV 447
              + + G  A +    + YA G AD+V
Sbjct: 430 APVTVLQGIRAAAPNAQVLYARG-ADLV 456



 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/298 (32%), Positives = 151/298 (50%), Gaps = 53/298 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+D A++AD  V V GL   VE E          G DR DL LP  Q +L+  +    K 
Sbjct: 628 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK- 686

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV  V+ +  A+ I++A+ +  + +IL   YPG+ GG A+AD +FG  NPGGRLP+T+Y+
Sbjct: 687 PVVAVLTAGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 744

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
            +     +    +R      GRTY++F G  +YPFG+GLSYTQF Y          ++LD
Sbjct: 745 ESETLPAFDDYAMR------GRTYRYFGGTPLYPFGHGLSYTQFAYS--------GLRLD 790

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           +         T+ T                D   T  + V+N G+  G EVV +Y  P  
Sbjct: 791 RT--------TIAT----------------DGSLTATVTVKNTGQRAGDEVVQLYLHPLA 826

Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
                  K++ G++R+ +  G+  ++GFT+NA  +L++ D    +  +  GA+ + +G
Sbjct: 827 PQRERAGKELHGFQRIALQPGEQRELGFTINAKDALRLYDEQRKAYGVDPGAYEVQIG 884


>gi|390340546|ref|XP_001186857.2| PREDICTED: probable beta-D-xylosidase 2-like [Strongylocentrotus
           purpuratus]
          Length = 623

 Score =  314 bits (805), Expect = 9e-83,   Method: Compositional matrix adjust.
 Identities = 208/614 (33%), Positives = 319/614 (51%), Gaps = 69/614 (11%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGD-------LAYGVPRLGLPLYEWWSEA 62
           S  P+ +  LP+ +R  DL+ R+ + +   Q+          A  + RL +  Y W +E 
Sbjct: 28  SQLPFWNQSLPWDQRLDDLLSRLKVDDMTYQLARGGADPNGPAPAIGRLQIGKYVWNTEC 87

Query: 63  LHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
           L G                D++   AT+FP  +  +A+F+  L  ++      E RA YN
Sbjct: 88  LRG----------------DAQAGNATAFPQALGLSAAFSRDLLFEVANATGYEVRAKYN 131

Query: 123 L--------GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
                     + GL  +SP IN++R P WGR  ET GEDPY+ G  A ++V GLQ     
Sbjct: 132 YYLQKGDFNNHQGLNCFSPVINIMRHPYWGRNQETYGEDPYLTGELAKSFVWGLQG---- 187

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
                +  R L  +A CKH+AAY       + RF FD++V+++D+Q TF   F+ C+  G
Sbjct: 188 -----NHPRYLLTNAGCKHFAAYSGPENYPSSRFSFDAKVSDKDLQVTFFPAFKECIKAG 242

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
              SVMCSYN VNGIP CA+  LLN  +R +W F GY+VSD  +++    +H +     +
Sbjct: 243 -TYSVMCSYNSVNGIPACANSYLLNDVLRTEWGFKGYVVSDQRALELEELAHNYTTSYLD 301

Query: 295 DAVARVLKAGLDLDCGDY---YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS 351
            A+ + LKAG +LD G       ++   AV+ G +   D+  S+  L+   +RLG FD  
Sbjct: 302 TAI-KSLKAGCNLDLGTTKPAVYDYLAEAVELGMLTAQDLRDSIAPLFYTRLRLGEFD-P 359

Query: 352 PQYKNLGKNN----ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
           P +    K N    + +P+H E+A +AA +  VL+KND   LP+  G I TLA+VGP AN
Sbjct: 360 PDHNPYVKLNVDQVVESPEHQEIALKAALKSFVLVKNDGSTLPIE-GTIHTLAVVGPFAN 418

Query: 408 ATKAMIGNYEGTP-CRY-TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNAD 465
            +K + G+Y   P  R+ T+ ++G    +    +A GC    C         ++A   AD
Sbjct: 419 NSKLLFGDYAPNPDPRFVTTVLEGLSPMATKTRHASGCPSPKCVTYDQ-QGVLNAVTGAD 477

Query: 466 ATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG-PVTLVIMSAGAVDINFAK 524
             V+  G  + +E+EG DR D+LLPG Q +L+   A  A G PV L++ +AG ++I +A 
Sbjct: 478 VVVVCLGTGIELESEGNDRRDMLLPGKQEQLLQDAARYAAGKPVILLLFNAGPLNITWAL 537

Query: 525 NNPKIKSILWVGYPGEEGGRAIADVIFGK---YNPGGRLPITWYEANYVKIPYTSMPLRP 581
           ++P +++I+   +P +  G A+  ++F      NPGGRLP TW        P T   + P
Sbjct: 538 SSPSVQAIVECFFPAQATGVAL-RMMFQNAPGANPGGRLPSTW--------PATVAQIPP 588

Query: 582 VNNFP--GRTYKFF 593
           + N+   GRTY++F
Sbjct: 589 MENYSMDGRTYRYF 602


>gi|198425898|ref|XP_002119549.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 754

 Score =  314 bits (805), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 241/749 (32%), Positives = 360/749 (48%), Gaps = 112/749 (14%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGD-------LAYGVPRLGL 53
            F S KV   +FP+ +  LP  ER +DLV R+T+ E + Q+          A  + RLG+
Sbjct: 14  HFASSKVTSEEFPFRNFSLPIEERLEDLVNRLTIEEVILQLSRGGVRDNGPAPAITRLGI 73

Query: 54  PLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTV 113
             Y+W +E L G +  G                 AT FP  I   A+F++ L  K+ +TV
Sbjct: 74  GPYQWNTECLRGYAMNG----------------DATCFPQPIGLAATFDQGLIYKMAKTV 117

Query: 114 STEARAMYNL----GN----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYV 165
           + EARA +N     GN     GL+ +SP IN++R P WGR  ET GEDP +    A  YV
Sbjct: 118 ALEARAKHNNFTKNGNFGDHTGLSCFSPVINILRHPLWGRNQETYGEDPVLTSLMARAYV 177

Query: 166 RGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFIL 225
            GLQ           D   L  +A CKH+ AY         RF F + V++ D+  TF  
Sbjct: 178 TGLQ----------GDEIYLPATAVCKHFVAYGGPENIPTTRFSFSANVSDHDIGTTFYP 227

Query: 226 PFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES 285
            F  CV+ G    VMCSYN +NG+P+CA+P +L  T+R  ++F GY+VSD ++++ I   
Sbjct: 228 AFRECVHAG-AQGVMCSYNAINGVPSCANP-MLETTLRKKFHFDGYVVSDENALENIDLY 285

Query: 286 HKFLNDTKEDAVARVLKAGLDLDCGDY-YTN---FTMGAVQQGKIAEADIDTSLRFLYIV 341
             F   +K +  A  L AG+DL+   +  TN       AV+QG + EA +  S + L+  
Sbjct: 286 FNF-TKSKLETAAVALNAGVDLELTGFGKTNRYSLLNQAVEQGLVTEAALRRSAKRLFRT 344

Query: 342 LMRLGYFDGSPQYK---NLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKT 398
            M LG FD  P++    N+  + + +  H + A E A +  VLLKND G LPL     K 
Sbjct: 345 RMALGEFD-PPEFNHWLNVPIDVVQSLAHRKQAVEVAAKSFVLLKND-GILPLKQLYDK- 401

Query: 399 LALVGPHANATKAMIGNY--EGTPCRYTSPMDGFYAYSK--VINYAPGCADIVCQNNSMI 454
           +++VGP  N ++A+ G+Y  E     ++SP+    + S   V  +  GC   V  NN  +
Sbjct: 402 VSIVGPFINNSEALTGDYPAEFNLKYFSSPLFAANSLSSSGVARFTTGC---VGTNNQNL 458

Query: 455 PAAI--------DAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG 506
           P           +    +D  ++  G    VEAE  DR D+ LPG Q +LI  V   A G
Sbjct: 459 PICATYNSTNVKEVVTGSDIVLVTLGTGRGVEAESNDRRDINLPGKQLQLIQDVVKYANG 518

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV +V+ +AG +D+++   N    +++   +  +  G A+ +V+ G  NP GRLP TW  
Sbjct: 519 PVIVVLFNAGPLDVSWVMGN--TAAVIACHFSAQMTGEAMLEVLTGVVNPAGRLPNTWPA 576

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKY-KVASSPKSVDIKL 625
           +     P T   +        RTY++     ++PFGYGLSYT+F Y      P ++    
Sbjct: 577 SMEQVPPMTDYSMHE------RTYRYSTSSPLFPFGYGLSYTKFWYLDAVVEPTTI---- 626

Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
              Q+C          + P   VLI                +N G +DG EVV +Y    
Sbjct: 627 ---QRC----------QIPVVRVLI----------------QNTGHLDGEEVVQIYMTSK 657

Query: 686 GIAGTH-IKQVIGYERVFIAAGQSAKVGF 713
                  ++Q++ ++RV I AG+   +  
Sbjct: 658 KKRDRELLRQLVAFQRVPIKAGEEVSISL 686


>gi|118489157|gb|ABK96385.1| unknown [Populus trichocarpa x Populus deltoides]
          Length = 343

 Score =  314 bits (804), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 149/335 (44%), Positives = 219/335 (65%), Gaps = 11/335 (3%)

Query: 412 MIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVA 471
           MIGNY G  C YT+P+ G   Y+K ++ + GC D+ C  N    AA  AA++ADAT++V 
Sbjct: 1   MIGNYAGVACGYTTPLQGIRRYAKTVHLS-GCNDVFCNGNQQFNAAEVAARHADATILVM 59

Query: 472 GLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKS 531
           GLD S+EAE +DR  LLLPG+Q EL+++VA A++GP  LV+MS G +D++FAKN+P+I +
Sbjct: 60  GLDQSIEAEFRDRKGLLLPGYQQELVSRVARASRGPTILVLMSGGPIDVSFAKNDPRIGA 119

Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGR 588
           ILWVGYPG+ GG AIADV+FG  NPGG+LP+TWY  +Y+ K+P T+M +R  P   +PGR
Sbjct: 120 ILWVGYPGQAGGAAIADVLFGTANPGGKLPMTWYPHDYLAKVPMTNMGMRADPSRGYPGR 179

Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
           TY+F+ GPVV+PFG+G+SYT F + +  +P+ V + L      R  N T  +N     A+
Sbjct: 180 TYRFYKGPVVFPFGHGMSYTTFAHSLVQAPREVSVPLASLHVSR--NTTGASN-----AI 232

Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQS 708
            +    C+       I+V+N G MDG+  ++V+S PPG   +  KQ+IG+E+V +  G  
Sbjct: 233 RVSHANCEALALGVHIDVKNTGDMDGTHTLLVFSSPPGGKWSTQKQLIGFEKVHLVTGSQ 292

Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
            +V   ++ CK L +VD      + +G H + +G+
Sbjct: 293 KRVKIDIHVCKHLSVVDRFGIRRIPNGEHYLYIGD 327


>gi|58584046|ref|YP_203062.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|84625823|ref|YP_453195.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|58428640|gb|AAW77677.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|84369763|dbj|BAE70921.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
          Length = 904

 Score =  313 bits (803), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 186/458 (40%), Positives = 258/458 (56%), Gaps = 39/458 (8%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY   +  + +RA DLV RMTL EK  QM + A  +PRLG+P Y+WW+EALHGV+  G  
Sbjct: 36  PYLQTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG-- 93

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LG 124
                         GAT FP  I   A+F+  L  ++   +S EARA ++          
Sbjct: 94  --------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHRFLRQHQHAR 139

Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD-SR 183
             GLTFWSPNIN+ RDPRWGR  ET GEDP++  R  + +V+GLQ  EG +  +++   R
Sbjct: 140 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGSDAPKNAQGER 198

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
             K+ A  KH+A +        DR HFD+R +++D+ ET++  FE  V +G V +VM +Y
Sbjct: 199 YRKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAY 255

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NRV G    A   LL   +R  W F GY+VSDC +I  + + HK +  T+E A A  +  
Sbjct: 256 NRVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVA-TREQAAALAVTH 314

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
           G +L+CG+ Y+     AV QG I EA IDT+L+ L    MRLG FD  G   +  +  + 
Sbjct: 315 GTELECGEEYSTLP-AAVHQGLIDEAQIDTALQTLMTARMRLGMFDPPGQLPWSKIPASV 373

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
             +P H  LA   AR+ +VLLKND G LPL+   +K +A++GP A+ T A++GNY GTP 
Sbjct: 374 NQSPAHDALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTMALLGNYYGTPA 432

Query: 422 RYTSPMDGFYAYS--KVINYAPGCADIVCQNNSMIPAA 457
              + + G  A +    + YA G AD+V   N   PAA
Sbjct: 433 APVTVLQGIRAAAPNAQVLYARG-ADLVEGRND--PAA 467



 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 97/298 (32%), Positives = 147/298 (49%), Gaps = 53/298 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+D A++AD  V V GL   VE E          G DR DL LP  Q EL+  +    K 
Sbjct: 631 ALDVARSADVVVFVGGLTGDVEGEEMKVSYPGFAGGDRTDLRLPKPQRELLEALQATGK- 689

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV  V+ +  A+ +++A+ +  + +IL   YPG+ GG A+AD +FG  NPGGRLP+T+Y+
Sbjct: 690 PVVAVLTAGSALAVDWAQQH--VPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 747

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
            +       ++P        GRTY++F G  +YPFG+GLSYTQF Y         D++LD
Sbjct: 748 ES------ETLPAFDDYAMHGRTYRYFGGTPLYPFGHGLSYTQFAYS--------DLRLD 793

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           +                             D   T  + V+N G+  G EVV +Y  P  
Sbjct: 794 R------------------------STLTADGALTATVAVKNTGQRAGDEVVQLYLHPLK 829

Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
                  K++ G++R+ +  GQ  ++ FT+NA  +L+I D    +     GA+ + +G
Sbjct: 830 PQRERAGKELRGFQRLALQPGQQRELRFTINAKDALRIYDAQRKAYTVDPGAYEVQIG 887


>gi|188574621|ref|YP_001911550.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
 gi|188519073|gb|ACD57018.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
          Length = 904

 Score =  313 bits (803), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 186/458 (40%), Positives = 258/458 (56%), Gaps = 39/458 (8%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY   +  + +RA DLV RMTL EK  QM + A  +PRLG+P Y+WW+EALHGV+  G  
Sbjct: 36  PYLQTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG-- 93

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LG 124
                         GAT FP  I   A+F+  L  ++   +S EARA ++          
Sbjct: 94  --------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHRFLRQHQHAR 139

Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD-SR 183
             GLTFWSPNIN+ RDPRWGR  ET GEDP++  R  + +V+GLQ  EG +  +++   R
Sbjct: 140 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGSDAPKNAQGER 198

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
             K+ A  KH+A +        DR HFD+R +++D+ ET++  FE  V +G V +VM +Y
Sbjct: 199 YRKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAY 255

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NRV G    A   LL   +R  W F GY+VSDC +I  + + HK +  T+E A A  +  
Sbjct: 256 NRVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVA-TREQAAALAVTH 314

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
           G +L+CG+ Y+     AV QG I EA IDT+L+ L    MRLG FD  G   +  +  + 
Sbjct: 315 GTELECGEEYSTLP-AAVHQGLIDEAQIDTALQTLMTARMRLGMFDPPGQLPWSKIPASV 373

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
             +P H  LA   AR+ +VLLKND G LPL+   +K +A++GP A+ T A++GNY GTP 
Sbjct: 374 NQSPAHDALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTMALLGNYYGTPA 432

Query: 422 RYTSPMDGFYAYS--KVINYAPGCADIVCQNNSMIPAA 457
              + + G  A +    + YA G AD+V   N   PAA
Sbjct: 433 APVTVLQGIRAAAPNAQVLYARG-ADLVEGRND--PAA 467



 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 98/298 (32%), Positives = 147/298 (49%), Gaps = 53/298 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+D A++AD  V V GL   VE E          G DR DL LP  Q EL+  +    K 
Sbjct: 631 ALDVARSADVVVFVGGLTGDVEGEEMKVSYPGFAGGDRTDLRLPKPQRELLEALQATGK- 689

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV  V+ +  A+ I++A+ +  + +IL   YPG+ GG A+AD +FG  NPGGRLP+T+Y+
Sbjct: 690 PVVAVLTAGSALAIDWAQQH--VPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 747

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
            +       ++P        GRTY++F G  +YPFG+GLSYTQF Y         D++LD
Sbjct: 748 ES------ETLPAFDDYAMHGRTYRYFGGTPLYPFGHGLSYTQFAYS--------DLRLD 793

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           +                             D   T  + V+N G+  G EVV +Y  P  
Sbjct: 794 R------------------------STLTADGALTATVAVKNTGQRAGDEVVQLYLHPLK 829

Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
                  K++ G++R+ +  GQ  ++ FT+NA  +L+I D    +     GA+ + +G
Sbjct: 830 PQRERAGKELRGFQRLALQPGQQRELRFTINAKDALRIYDAQRKAYTVDPGAYEVQIG 887


>gi|384421334|ref|YP_005630694.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
 gi|353464247|gb|AEQ98526.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
          Length = 904

 Score =  313 bits (802), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 186/458 (40%), Positives = 256/458 (55%), Gaps = 39/458 (8%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY D    + +RA DLV RMTL EK  QM + A  +PRL +P Y+WW+EALHGV+  G  
Sbjct: 36  PYLDTARSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 93

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LG 124
                         GAT FP  I   A+F+  L  ++   +S EARA ++          
Sbjct: 94  --------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHAR 139

Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLTFWSPNIN+ RDPRWGR  ET GEDP++  R  + +V+GLQ  EG    +++   P
Sbjct: 140 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGAAAPKNAQGEP 198

Query: 185 L-KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
             K+ A  KH+A +     E   R HFD+R +++D+ ET++  FE  V +G V +VM +Y
Sbjct: 199 YRKLDATAKHFAVHSGPEAE---RHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAY 255

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NRV G    A   LL   +R  W F GY+VSDC +I  + + HK +  T+E A A  +  
Sbjct: 256 NRVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVA-TREQAAALAVTH 314

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
           G +L+CG+ Y+     AV QG I EA IDT+L+ L    MRLG FD  G   +  +  + 
Sbjct: 315 GTELECGEEYSTLP-AAVHQGLIDEAQIDTALQTLMTARMRLGMFDPPGQLPWSKIPASV 373

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
             +P H  LA   AR+ +VLLKND G LPL+   +K +A++GP A+ T A++GNY GTP 
Sbjct: 374 NQSPAHDALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTMALLGNYYGTPA 432

Query: 422 RYTSPMDGFYAYS--KVINYAPGCADIVCQNNSMIPAA 457
              + + G  A +    + YA G AD+V   N   PAA
Sbjct: 433 APVTVLQGIRAAAPNAQVLYARG-ADLVEGRND--PAA 467



 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 97/298 (32%), Positives = 147/298 (49%), Gaps = 53/298 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+D A++AD  V V GL   VE E          G DR DL LP  Q EL+  +    K 
Sbjct: 631 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQATGK- 689

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV  V+ +  A+ I++A+ +  + +IL   YPG+ GG A+AD +FG  NPGGRLP+T+Y+
Sbjct: 690 PVVAVLTAGSALAIDWAQQH--VPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 747

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
            +       ++P        GRTY++F G  +YPFG+GLSYTQF Y         D++LD
Sbjct: 748 ES------ETLPAFDDYTMHGRTYRYFGGTPLYPFGHGLSYTQFAYS--------DLRLD 793

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           +                             D   T  + V+N G+  G EVV +Y  P  
Sbjct: 794 R------------------------STLTADGALTATVAVKNTGQRAGDEVVQLYLHPLK 829

Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
                  K++ G++R+ +  G+  ++ FT+NA  +L+I D    +     GA+ + +G
Sbjct: 830 PQRERAGKELRGFQRLALQPGEQRELRFTINATDALRIYDAQRKAYTVDPGAYEVQIG 887


>gi|294667502|ref|ZP_06732718.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292602731|gb|EFF46166.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 901

 Score =  313 bits (802), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 182/447 (40%), Positives = 249/447 (55%), Gaps = 35/447 (7%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY D +  +  RA DLV RMTL EK  QM + A  +PRL +P Y+WW+EALHGV+  G  
Sbjct: 33  PYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 90

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL--------G 124
                         GAT FP  I   A+F+  L  ++   +S EARA ++          
Sbjct: 91  --------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHER 136

Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLTFWSPNIN+ RDPRWGR  ET GEDP++  R  + +V+GLQ   G         R 
Sbjct: 137 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGGDAPKNAQGERY 196

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
            K+ A  KH+A +        DR HFD+  +++D+ ET++  FE  V +G V +VM +YN
Sbjct: 197 RKLDATAKHFAVHSGPE---ADRHHFDAHPSQRDLYETYLPAFEALVKDGKVDAVMGAYN 253

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           RV G    A   LL   +R  W F GY+VSDC +I  I + HK +  T+E A A  +K G
Sbjct: 254 RVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHG 312

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNI 362
            +L+CG+ Y+     AV+QG I EA IDT+L+ L    MRLG FD  G   +  +  +  
Sbjct: 313 TELECGEEYSTLP-AAVRQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSQIPASVN 371

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
            +P H  LA   AR+ +VLLKND G LPL+   +K +A++GP A+ T A++GNY GTP  
Sbjct: 372 QSPAHDALARRTARESLVLLKND-GLLPLSRARLKRIAVIGPTADDTMALLGNYYGTPAA 430

Query: 423 YTSPMDGFYAYS--KVINYAPGCADIV 447
             + + G  A +    + YA G AD+V
Sbjct: 431 PVTVLQGIRAAAPNAQVLYARG-ADLV 456



 Score =  146 bits (368), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 96/298 (32%), Positives = 151/298 (50%), Gaps = 53/298 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+D A++A+  V V GL   VE E          G DR DL LP  Q +L+  +    K 
Sbjct: 628 ALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALHATGK- 686

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV  V+ +  A+ I++A+ +  + +IL   YPG+ GG A+AD +FG  NPGGRLP+T+Y+
Sbjct: 687 PVVAVLTTGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 744

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
            +     +    +R      GRTY++F G  +YPFG+GLSYTQF Y          ++LD
Sbjct: 745 ESETLPAFDDYAMR------GRTYRYFGGTPLYPFGHGLSYTQFAYS--------GLRLD 790

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           +         T+ T                D   T  + V+N G+  G EVV +Y  P  
Sbjct: 791 RT--------TIAT----------------DGSLTATVTVKNTGQRAGDEVVQLYLHPLT 826

Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
                  K++ G++R+ +  G+  ++GFT+NA  +L++ D    + +   GA+ + +G
Sbjct: 827 PQRERAGKELHGFQRIALTPGEQRELGFTINAKDALRLYDEQRKAYVVDPGAYEVQIG 884


>gi|116621778|ref|YP_823934.1| glycoside hydrolase family 3 protein [Candidatus Solibacter
           usitatus Ellin6076]
 gi|116224940|gb|ABJ83649.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
           usitatus Ellin6076]
          Length = 850

 Score =  313 bits (801), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 178/462 (38%), Positives = 260/462 (56%), Gaps = 46/462 (9%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S  P+ D  L    RA DLV RMTL EKV QM + A  +PRLG+P Y+WW+EALHGV+  
Sbjct: 22  SQLPFMDPDLSAERRAADLVARMTLDEKVLQMQNSAPAIPRLGIPAYDWWNEALHGVARA 81

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG----- 124
           G                 AT FP  I   A+++ +L  +I +T+STEARA YN       
Sbjct: 82  GL----------------ATVFPQAIGLAATWDATLMHRIAETISTEARAKYNEAIRNDD 125

Query: 125 ---NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
                GLTFWSPNIN+ RDPRWGR  ET GEDP++  R A+ +++G+Q           D
Sbjct: 126 HSRYRGLTFWSPNINIFRDPRWGRGQETYGEDPFLTSRMAVAFIKGMQG---------ED 176

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
               K+ A  KHYA +       + R  FD + + +D+ +T++  F   + E    S+MC
Sbjct: 177 PHYYKVIATAKHYAVHSGPE---SSRHQFDVKPSPRDLADTYLPAFRASIVEARADSLMC 233

Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
           +YNRV+GIP CA   LL + +RG+W F G++VSDC ++  I   H +  D    +    +
Sbjct: 234 AYNRVDGIPACASTDLLEKRLRGEWGFQGFVVSDCGAVSDIFRGHHYQPDAASASAV-AV 292

Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGK 359
           KAG DL CG+ Y    + AV+ G I E +I+ SL  L++   +LG FD   +  + N+  
Sbjct: 293 KAGTDLTCGNEYRAL-VDAVKTGLITEPEINRSLERLFVARFKLGMFDPPERVPFSNIPY 351

Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
           + + +  H ++A EAAR+ IVLLKND G LPL + +IK +A++GP A+  +A++GNY G 
Sbjct: 352 SEVDSAGHRKIALEAARKSIVLLKND-GTLPLKS-SIKKIAVIGPAADDAEALLGNYNGF 409

Query: 420 PCRYTSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPAAI 458
                +P+ G    +A    + YA G A+   Q+ + +PA++
Sbjct: 410 SSLQVTPLAGIEHQWAGKAEVRYALG-ANYTAQSQAPLPASV 450



 Score =  124 bits (310), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 105/347 (30%), Positives = 160/347 (46%), Gaps = 71/347 (20%)

Query: 379 IVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVIN 438
           I L + +  A P  TG  + L L   HA     +IG     P R      G  A ++V+ 
Sbjct: 535 IFLEERELTADPPPTGRGRPLLL---HAQ----LIGG-RAYPIRVEYSASGPAASAQVL- 585

Query: 439 YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLL 488
           +AP  A ++        AAI+A  NAD T+   GL+ S+E E          G DR +L 
Sbjct: 586 WAPPDAPLLA-------AAIEAVSNADVTLAFVGLNPSLEGEEMPVSVPGFQGGDRTNLE 638

Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
           LP  Q +LI + A A   PV +V+ S  AV +NFA  +    ++L   Y GEE G AIAD
Sbjct: 639 LPEPQEKLI-EAAIATGKPVVVVLASGSAVAMNFAAQH--ASALLETWYNGEETGTAIAD 695

Query: 549 VIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYT 608
            + G  NP GRLP+T+Y +     P+    ++      GRTY++F+G  +Y FG+GLSY+
Sbjct: 696 TLAGINNPSGRLPVTFYRSVDQLPPFEEYAMK------GRTYRYFNGDALYSFGFGLSYS 749

Query: 609 QFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVEN 668
           +F+Y                         + T +     ++   V+             N
Sbjct: 750 KFQYSA-----------------------LKTRRAGSGTIVASRVR-------------N 773

Query: 669 MGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
              ++G EVV +Y    G  G  I+ + G++R+ +  G+S +V F +
Sbjct: 774 ASSIEGDEVVQLYVNGSGADGDPIRSLRGFQRIHLRPGESREVHFPL 820


>gi|424796589|ref|ZP_18222299.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
 gi|422794891|gb|EKU23686.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
          Length = 913

 Score =  310 bits (794), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 181/447 (40%), Positives = 258/447 (57%), Gaps = 37/447 (8%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D +  + +RA DLV RMTL EK  QM + A  +PRLG+P Y+WW+EALHGV+  G   
Sbjct: 37  YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--------N 125
                        GAT FP  I   A+F+  L  ++   +S EARA ++           
Sbjct: 94  -------------GATVFPQAIGMAATFDLPLMHEVSTAISDEARAKHHEALRHDQHARY 140

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLTFWSPNIN+ RDPRWGR  ET GEDP++  R  + +V+GLQ  EG +  +++     
Sbjct: 141 QGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGADAPKNAQGDAY 199

Query: 186 -KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
            K+ A  KH+A +   +    DR HFD+  +++D+ ET++  FE  V EG V +VM +YN
Sbjct: 200 RKLDATAKHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYN 256

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           RV G    A   LL   +R  W F GY+VSDC +I  I ++HK +  T+E A A  +  G
Sbjct: 257 RVYGESASASKFLLRDVLRDTWGFDGYVVSDCWAIVDIWKNHKIVA-TREQAAALAVNNG 315

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNI 362
            +L+CG+ Y+     AV++G I+EAD+D +L+ L    MRLG FD   + ++  +  +  
Sbjct: 316 TELECGEEYSTLP-AAVRKGLISEADVDKALQKLMYSRMRLGMFDPPDTLRWAQIPLSAN 374

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
            +P+H  LA   AR+ +VLLKND G LPL+ G IK +A++GP A+ T A++GNY GTP  
Sbjct: 375 QSPEHDALARRTARESLVLLKND-GVLPLSRGKIKRIAVIGPTADDTMALLGNYYGTPAA 433

Query: 423 YTSPMDGFY--AYSKVINYAPGCADIV 447
             + + G    A    + YA G AD+V
Sbjct: 434 PVTVLQGIREAAPDAEVLYARG-ADLV 459



 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 96/298 (32%), Positives = 148/298 (49%), Gaps = 53/298 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+DAA+ AD  V V GL   VE E          G DR DL LP  Q EL+  +    K 
Sbjct: 631 ALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQGTGK- 689

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV  V+ +  A+ I++A+ +  + +IL   YPG+ GG A+ADV+FG  NPGGRLP+T+Y+
Sbjct: 690 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVTFYK 747

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
            +     +    +R      GRTY++F G  +YPFG+GLSYTQF Y         D++LD
Sbjct: 748 ESEKLPAFDDYAMR------GRTYRYFAGTALYPFGHGLSYTQFAYS--------DLRLD 793

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           + +                           D      ++V+N G+  G EVV +Y  P  
Sbjct: 794 RSKL------------------------ATDGSLHATLKVKNTGQRAGDEVVQLYLHPLS 829

Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
                  K++ G++R+ +  G++ +V F ++    L++ D A  + +   G + + VG
Sbjct: 830 PQRERARKELRGFQRIALQPGETREVSFAISPQTDLRLYDEARKAYVVDPGDYELQVG 887


>gi|440731995|ref|ZP_20911965.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
 gi|440370332|gb|ELQ07251.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
          Length = 913

 Score =  310 bits (794), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 182/447 (40%), Positives = 257/447 (57%), Gaps = 37/447 (8%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D +  + +RA DLV RMTL EK  QM + A  +PRLG+P Y+WW+EALHGV+  G   
Sbjct: 37  YLDTQRSFEQRAADLVARMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--------N 125
                        GAT FP  I   A+F+  L  ++   +S EARA ++           
Sbjct: 94  -------------GATVFPQAIGMAATFDVPLMHEVSTAISDEARAKHHEALRHDQHARY 140

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLTFWSPNIN+ RDPRWGR  ET GEDP++  R  + +V+GLQ  EG +  +++     
Sbjct: 141 QGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGADAPKNAQGEAY 199

Query: 186 -KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
            K+ A  KH+A +   +    DR HFD+  +++D+ ET++  FE  V EG V +VM +YN
Sbjct: 200 RKLDATAKHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYN 256

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           RV G    A   LL   +R  W F GY+VSDC +I  I ++HK +  T+E+A A  +K G
Sbjct: 257 RVYGESASASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVA-TREEAAALAVKHG 315

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNI 362
            +L+CG  Y+     AV++G I+EAD+D +L+ L    MRLG FD   +  +  +  +  
Sbjct: 316 TELECGAEYSTLP-SAVRKGLISEADVDKALQKLMYSRMRLGMFDPPEKLAWAQIPLSAN 374

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
            +P+H  LA   AR+ +VLLKND G LPL+   IK +A+VGP A+ T A++GNY GTP  
Sbjct: 375 QSPEHDALARRTARESLVLLKND-GVLPLSRAKIKRIAVVGPTADDTMALLGNYYGTPAA 433

Query: 423 YTSPMDGFY--AYSKVINYAPGCADIV 447
             + + G    A    + YA G AD+V
Sbjct: 434 PVTVLQGIREAAPDAEVLYARG-ADLV 459



 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 92/282 (32%), Positives = 141/282 (50%), Gaps = 52/282 (18%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+DAA+ AD  V V GL   VE E          G DR DL LP  Q  L+  +    K 
Sbjct: 631 ALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEALHGTGK- 689

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV  V+ +  A+ I++A+ +  + +IL   YPG+ GG A+ADV+FG  NPGGRLP+T+Y+
Sbjct: 690 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVTFYK 747

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
            +     +    +R      GRTY++F G  +YPFG+GLSYTQF Y         D++LD
Sbjct: 748 ESETLPAFDDYAMR------GRTYRYFAGTPLYPFGHGLSYTQFAYS--------DLRLD 793

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           + +                           D +    ++V+N G+  G EVV +Y +P  
Sbjct: 794 RSKLA------------------------ADGRLHATLKVKNTGQRAGDEVVQLYLQPLS 829

Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
                  K + G++R+ +  G++ +V F ++    L++ D A
Sbjct: 830 PQRERASKDLRGFQRIALQPGETREVRFAISPQSDLRLYDEA 871


>gi|325919363|ref|ZP_08181395.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
 gi|325550152|gb|EGD20974.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
          Length = 876

 Score =  310 bits (793), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 187/459 (40%), Positives = 254/459 (55%), Gaps = 45/459 (9%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY D + P+  RA DLV RMTL EK  QM + A  +PRL +P Y+WW+EALHGV+  G  
Sbjct: 19  PYLDTQRPFDARAADLVARMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG-- 76

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------ 126
                         GAT FP  I   A+F+  L  ++   +S EARA ++   A      
Sbjct: 77  --------------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARGEYKR 122

Query: 127 --GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLTFWSPNIN+ RDPRWGR  ET GEDP++  R  + +V+GLQ  +G  Y        
Sbjct: 123 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQG-PYR------- 174

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
            K+ A  KH+A +   +    DR HFD   +E+D+ ET++  F+  V EG V++VM +YN
Sbjct: 175 -KLDATAKHFAVH---SGPEADRHHFDVHPSERDLHETYLPAFQALVQEGKVAAVMGAYN 230

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           RVNG    A  + L   +R DW F GYIVSDC +I+ I ++HK +  T E A A  +K G
Sbjct: 231 RVNGESASASTR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVP-TPEAAAALGVKHG 288

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNI 362
            DLDCGD Y      AV+ G I EA IDT+L+ L    MRLG FD   +  +  +  +  
Sbjct: 289 TDLDCGDTYAALP-AAVRAGLIDEATIDTALKRLMTTRMRLGMFDPPAKVPWAQIPASAN 347

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
            +PQH  LA   AR+ +VLLKND G LPL    +K +A++GP A+   +++GNY GTP  
Sbjct: 348 QSPQHDALARRTARESLVLLKND-GVLPLKP-TLKRIAVIGPTADDPMSLLGNYYGTPAA 405

Query: 423 YTSPMDGFY--AYSKVINYAPGCADIVCQNNSMIPAAID 459
             + + G    A    + YA G   +  + +    A ID
Sbjct: 406 PVTILQGIRDAAPQAQVIYARGSDLVEGREDPNAAAPID 444



 Score =  142 bits (357), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 101/335 (30%), Positives = 160/335 (47%), Gaps = 60/335 (17%)

Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAA-------IDAAKNADATVIVAGLDLSVEA 479
           ++G  AY   + Y     D   +    +P A       +DAA++A+  V V GL   VE 
Sbjct: 566 LEGGKAYDLRVEYYEATRDAGVRLAWRMPGAKPPLQEAVDAARDAEVVVFVGGLTGDVEG 625

Query: 480 E----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
           E          G DR D  LP  Q EL+  +  A   PV  V+ +  A+ I++A+ +  +
Sbjct: 626 EEMDVNYPGFAGGDRTDTRLPKPQRELLQAL-QATGTPVVAVLTTGSALAIDWAQQH--V 682

Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRT 589
            +IL   YPG+ GG A+ DV+FG+ +PGGRLP+T+Y+       +    +R      GRT
Sbjct: 683 PAILLAWYPGQRGGSAVGDVLFGQASPGGRLPVTFYKEAERLPAFDDYAMR------GRT 736

Query: 590 YKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
           Y++F G  +YPFG+GLSYTQF Y         D++LD+         TV           
Sbjct: 737 YRYFQGKPLYPFGHGLSYTQFAYS--------DLRLDRT--------TV----------- 769

Query: 650 IDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQS 708
                  D   T  + ++N G+  G EVV +Y  P        +K++ G +R+ +  G+ 
Sbjct: 770 -----AADGTLTATVTLKNTGQRAGDEVVQLYLHPLKPQRERALKELHGLQRITLQPGEQ 824

Query: 709 AKVGFTMNACKSLKIVDNAANS-LLASGAHTILVG 742
            ++ FT+ A  +L+I D    +  +  GA+ + +G
Sbjct: 825 RQLRFTIKAQDALRIYDEQRKAYAVDPGAYEVQIG 859


>gi|389794400|ref|ZP_10197553.1| beta-glucosidase-related glycosidase [Rhodanobacter fulvus Jip2]
 gi|388432423|gb|EIL89432.1| beta-glucosidase-related glycosidase [Rhodanobacter fulvus Jip2]
          Length = 902

 Score =  309 bits (791), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 176/418 (42%), Positives = 239/418 (57%), Gaps = 42/418 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D    + +RA DLV  MTL EK  QM + A  +PRLG+  Y+WW+E LHGV+  G+  
Sbjct: 47  YRDLSRSFHDRAADLVAHMTLEEKAAQMQNTAPAIPRLGVAAYDWWNEGLHGVARAGQ-- 104

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN-------- 125
                         AT FP  I   A+F+  L  ++   +S EARA YN           
Sbjct: 105 --------------ATVFPQAIGLAATFDVPLMHEVATAISDEARAKYNEFQRKGSHGRY 150

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT+WSPNIN+ RDPRWGR  ET GEDPY+  R  + +V GLQ  +   Y         
Sbjct: 151 EGLTYWSPNINIFRDPRWGRGQETYGEDPYLTERMGVAFVTGLQG-DNPTYR-------- 201

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K+ A  KH+A +   +    DR HFD   +E+D+ ET++  F+  V E DV +VM +YNR
Sbjct: 202 KLDATAKHFAVH---SGPEADRHHFDVHPSERDLYETYLPAFQTLVQEADVDAVMSAYNR 258

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           VNG P    P+LL Q +R DW F GY+VSDC +++ I + HK + DT E A A  +K G+
Sbjct: 259 VNGEPATGSPRLLGQILRKDWGFKGYVVSDCGAVEDIYKHHKVV-DTVEAASALAVKNGV 317

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNIC 363
           DLDCG  Y    + AV  G I E++ID +L  L    MRLG FD + +  + ++  +   
Sbjct: 318 DLDCGTEYAAL-VKAVHDGLIKESEIDAALTRLMQARMRLGMFDPASKVPWSDVPYSVNQ 376

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           +PQH  LA  AAR+ +VLLKND G LPL+  +IK +A++GP A+   A++GNY GTP 
Sbjct: 377 SPQHDALARRAARESMVLLKND-GVLPLSK-DIKHIAVIGPTADDVMALVGNYHGTPA 432



 Score =  134 bits (338), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 94/289 (32%), Positives = 140/289 (48%), Gaps = 57/289 (19%)

Query: 468 VIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA 517
           V   GL   VE E          G DR DL LP  Q +L+  +    K PV LV+ S  A
Sbjct: 642 VFAGGLTSDVEGEEMKVNYPGFAGGDRTDLRLPATQRKLLEALQATGK-PVVLVLTSGSA 700

Query: 518 VDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSM 577
           + +++A  N  + ++L   YPG+ GG A+ADV+FGK +P GRLP+T+Y+A+        +
Sbjct: 701 LAVDWA--NQHLPAVLLAWYPGQRGGNAVADVLFGKADPAGRLPVTFYKAS------EKL 752

Query: 578 PLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
           P        GRTY++F G  +YPFGYGLSYT+F Y         D+KLD ++        
Sbjct: 753 PAFDDYRMDGRTYRYFKGEPLYPFGYGLSYTKFTY--------ADLKLDHNK-------- 796

Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI---KQ 694
           +G N                 K    ++V N GK  G EVV +Y +  G+   H    K 
Sbjct: 797 IGKND----------------KLHVTVKVHNAGKRAGDEVVQLYLR--GVGTPHERSNKD 838

Query: 695 VIGYERVFIAAGQSAKVGFTMNACKSLKIVDN-AANSLLASGAHTILVG 742
           + G +R+ +  GQ+  V F ++    L+  D   A   + +G + + +G
Sbjct: 839 LRGIQRITLQPGQTRDVSFDVSPATDLRYYDTKKAAYAVDAGRYEVQIG 887


>gi|433677589|ref|ZP_20509555.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
 gi|430817300|emb|CCP39963.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
          Length = 913

 Score =  309 bits (791), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 182/447 (40%), Positives = 257/447 (57%), Gaps = 37/447 (8%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D +  + +RA DLV RMTL EK  QM + A  +PRLG+P Y+WW+EALHGV+  G   
Sbjct: 37  YLDTQRSFEQRAADLVARMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--------N 125
                        GAT FP  I   A+F+  L  ++   +S EARA ++           
Sbjct: 94  -------------GATVFPQAIGMAATFDLPLMHEVSTAISDEARAKHHEALRHDQHARY 140

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLTFWSPNIN+ RDPRWGR  ET GEDP++  R  + +V+GLQ  E V+  +++     
Sbjct: 141 QGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EDVDVPKNAQGEAY 199

Query: 186 -KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
            K+ A  KH+A +   +    DR HFD+  +++D+ ET++  FE  V EG V +VM +YN
Sbjct: 200 RKLDATAKHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYN 256

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           RV G    A   LL   +R  W F GY+VSDC +I  I ++HK +  T+E+A A  +K G
Sbjct: 257 RVYGESASASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVA-TREEAAALAVKHG 315

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNI 362
            +L+CG  Y+     AV++G I+EAD+D +L+ L    MRLG FD   +  +  +  +  
Sbjct: 316 TELECGAEYSTLPT-AVRKGLISEADVDNALQKLMYSRMRLGMFDPPEKLAWAQIPLSAN 374

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
            +P+H  LA   AR+ +VLLKND G LPL+   IK +A+VGP A+ T A++GNY GTP  
Sbjct: 375 QSPEHDALARRTARESLVLLKND-GVLPLSRAKIKRIAVVGPTADDTMALLGNYYGTPAA 433

Query: 423 YTSPMDGFY--AYSKVINYAPGCADIV 447
             + + G    A    + YA G AD+V
Sbjct: 434 PVTVLQGIREAAPDAEVLYARG-ADLV 459



 Score =  145 bits (366), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 95/298 (31%), Positives = 148/298 (49%), Gaps = 53/298 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+DAA+ AD  V V GL   VE E          G DR DL LP  Q  L+  +    K 
Sbjct: 631 ALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEALHGTGK- 689

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV  V+ +  A+ I++A+ +  + +IL   YPG+ GG A+ADV+FG  NPGGRLP+T+Y+
Sbjct: 690 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVTFYK 747

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
            +     +    +R      GRTY++F G  +YPFG+GLSYTQF Y         D++LD
Sbjct: 748 ESETLPAFDDYAMR------GRTYRYFAGTALYPFGHGLSYTQFAYS--------DLRLD 793

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           + +                           D +    ++V+N G+  G EVV +Y +P  
Sbjct: 794 RSKLA------------------------ADGRLHATLKVKNTGQRAGDEVVQLYLQPLS 829

Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
                  K + G++R+ +  G++ +V F ++    L++ D A  + +   G + + VG
Sbjct: 830 PQRERASKDLRGFQRIALQPGETREVRFAISPQSDLRLYDEARKAYVVDPGDYELQVG 887


>gi|372209036|ref|ZP_09496838.1| glycoside hydrolase [Flavobacteriaceae bacterium S85]
          Length = 859

 Score =  308 bits (788), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 222/741 (29%), Positives = 358/741 (48%), Gaps = 93/741 (12%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +R  DL+  MTL EK+   G     + RLG+P +EW+ EALHG+                
Sbjct: 34  DRVNDLLANMTLEEKISYCGSRIPEIKRLGIPYFEWYGEALHGIISWN------------ 81

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPR 142
                 T FP  I   A++N  L   +   +S EARA+ N G   +  +SP +N+ RDPR
Sbjct: 82  -----CTQFPQNIAMGATWNPDLMFDVATAISNEARALKNAGKKEVMMFSPTVNMARDPR 136

Query: 143 WGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNW 202
           WGR  E   EDP+++   A  YVRG+Q          +D + +K     KHY A +++  
Sbjct: 137 WGRNGECYAEDPHLMSEMARMYVRGMQ---------GNDPKYVKTVTTVKHYVANNVE-- 185

Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
               R    S + ++D+ E +   ++ C+ + + + +M + N +NGIP  A   L+N  +
Sbjct: 186 --TKREWIHSNIGKKDLYEYYFPAYKTCIVDEEATGIMTALNGLNGIPCSAHDWLVNGVL 243

Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTM---- 318
           R +W F GY+++D  ++Q + +  K+ +   + A   + KAG+D +C   + N       
Sbjct: 244 RNEWGFKGYVIADWAAVQGLEKRMKYASSQAQAAAMAI-KAGVDQEC---FRNKVRQAPM 299

Query: 319 -----GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELA 371
                 A+QQG I E ++D +++ L  +    G FD      Y  +  + +    H +LA
Sbjct: 300 VQALPDALQQGLITEKELDVTVKRLLRLRFMTGDFDDPSLNPYSAIPTSVLECDAHKQLA 359

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA Q IVLLKND   LPL   ++K++A++GP A+  +  +G Y G P    SP+DG  
Sbjct: 360 LKAAEQSIVLLKND-AVLPLKK-DLKSIAMIGPFAD--RCWMGIYSGHPKSKVSPLDGIK 415

Query: 432 AYSKV-INYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLL 489
           AY+   +++A GC     +++   I  A+  AK ++  ++V G D +   E  DR  + L
Sbjct: 416 AYTNAKVSFAQGCEVTAKEDDEQKIAEAVALAKKSEQVILVVGNDETTSTENTDRKSIKL 475

Query: 490 PGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADV 549
           PG Q +LI K   A    V LV++ +G   + + + N  I  I+     G+E G A+A V
Sbjct: 476 PGNQHQLI-KAVQAVNKNVILVLVPSGPTAVTWEQKN--IPGIVCAWPNGQEQGTALAKV 532

Query: 550 IFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
           +FG  NPGG+L  TWY+++     +    +   N    RTY +F G  +YPFGYGLSYT 
Sbjct: 533 LFGDVNPGGKLNATWYQSDKDLPNFHDYKMAGGN----RTYMYFKGKPLYPFGYGLSYTN 588

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
           F           D+ ++K                         ++  +Y  T + +V N 
Sbjct: 589 FTIS--------DVSINKKT-----------------------LQANEY-VTVKAKVNNT 616

Query: 670 GKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAA 728
           G + G EVV VY +       T +K + G++R+ +AAG S  V   +   ++    +   
Sbjct: 617 GAVAGDEVVQVYIRDVKSKEKTPLKALKGFQRISVAAGASKWVEIKI-PYEAFSHYNTKK 675

Query: 729 NSLL-ASGAHTILVGEGVGGV 748
            +L+ A G   ILVG     +
Sbjct: 676 EALMVAKGEFEILVGNASDAI 696


>gi|389736853|ref|ZP_10190363.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
 gi|388438821|gb|EIL95541.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
          Length = 868

 Score =  306 bits (784), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 181/442 (40%), Positives = 246/442 (55%), Gaps = 45/442 (10%)

Query: 15  CDAKLP-YPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
            DA+ P    RA  LV +MTLPEKV QM + A  +PRLG+P Y+WWSE LHG++  G   
Sbjct: 22  VDARTPDAHSRAVALVAKMTLPEKVAQMQNDAPAIPRLGVPAYDWWSEGLHGIARNGY-- 79

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
                         AT FP  I   AS++ SL   +G  +STEARA +N   +G      
Sbjct: 80  --------------ATVFPQAIGLAASWDTSLLHAVGTVISTEARAKFNASGSGRAHGLF 125

Query: 128 --LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
             LT WSPNIN+ RDPRWGR  ET GEDPY+ G+ A+ +VRG+Q         D    P 
Sbjct: 126 QGLTLWSPNINIFRDPRWGRGQETYGEDPYLTGQLAVAFVRGIQG--------DDPQHPR 177

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
            I A  KH+ A+      G D F  D  V+  D+++T++  F   V +G   SVMC+YN 
Sbjct: 178 AI-ATPKHFVAHSGPE-AGRDSFDVD--VSPHDLEDTYLPAFRTAVVDGHAGSVMCAYNA 233

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           ++G P CA+  LL+  +R DW F GY+VSDCD++  I   H F  D  + +VA V +AG 
Sbjct: 234 LHGTPACANAGLLDTRLRKDWGFAGYVVSDCDAVGDIASYHYFKPDDVQASVAAV-QAGT 292

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNIC 363
           DLDCG  Y +    AV+QG IAE+ +D SL  L+    RLG     G+  Y  +G + I 
Sbjct: 293 DLDCGHTYASLAQ-AVRQGDIAESALDASLVRLFTARYRLGELGSRGNDPYARIGADQID 351

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
           +P H +LA +AA + +VLLKN +  LPL+ G    LA++GP A+A + +  NY GT    
Sbjct: 352 SPAHRKLALQAALESLVLLKNAHSTLPLHAG--MRLAVIGPDADALETLEANYHGTARHP 409

Query: 424 TSPMDGFYAY--SKVINYAPGC 443
            +P+ G  A   +  + YA G 
Sbjct: 410 VTPLQGLRARFGADHVAYAQGA 431



 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 102/298 (34%), Positives = 146/298 (48%), Gaps = 56/298 (18%)

Query: 462 KNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLV 511
            +ADA V   GL   VE E          G DR D+ LP  Q  L+ + A A+  P+ +V
Sbjct: 596 HDADAVVAFIGLSPDVEGEQLRIDVPGFDGGDRTDIGLPAPQRALLER-ARASGKPLIVV 654

Query: 512 IMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK 571
           ++S  AV +++A+ +    +IL   YPG+ GG AIA V+ G YNPGGRLP+T+Y +    
Sbjct: 655 LLSGSAVALDWAQQH--ADAILAAWYPGQAGGTAIAQVLAGDYNPGGRLPVTFYRSTRDL 712

Query: 572 IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQC 631
            PY S  ++      GRTY++FDG  +YPFGYGLSYT+F Y  A +  +  +K     Q 
Sbjct: 713 PPYVSYAMQ------GRTYRYFDGRPLYPFGYGLSYTRFTY-AAPTLSAATLKAGGTLQV 765

Query: 632 RDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY--SKPPGIAG 689
                                            EV N G+  G EVV VY  + P  +A 
Sbjct: 766 -------------------------------SAEVRNAGQRAGDEVVQVYLDTPPSPLAP 794

Query: 690 THIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGG 747
            H   ++G+ R+ +AAG+   V FT+ A + L  VD A    +  G + + +G G  G
Sbjct: 795 RH--ALVGFRRIHLAAGEQRLVRFTL-APRQLSSVDAAGARAVEPGQYRVFIGAGQPG 849


>gi|90021134|ref|YP_526961.1| Beta-glucosidase [Saccharophagus degradans 2-40]
 gi|89950734|gb|ABD80749.1| b-xylosidase-like protein [Saccharophagus degradans 2-40]
          Length = 893

 Score =  306 bits (783), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 177/452 (39%), Positives = 254/452 (56%), Gaps = 47/452 (10%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +P+ DA L    R  DLV R+T  EK+ QM +    + RLG+P Y WW+E+LHGV+  G+
Sbjct: 43  YPFRDASLSVDARVDDLVSRLTTTEKIAQMFNDTPAIERLGIPAYNWWNESLHGVARAGK 102

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--LGN---- 125
                           AT +P  I   ++F+E L  ++  ++S E RA Y+  L      
Sbjct: 103 ----------------ATVYPQAIGLASTFDEDLMLRVATSISDEGRAKYHDFLSKDVRT 146

Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
              GLTFWSPNIN+ RDPRWGR  ET GEDP++ GR AIN+V+G+Q         + +S 
Sbjct: 147 IYGGLTFWSPNINIFRDPRWGRGQETYGEDPFLTGRMAINFVKGIQG-------ENDNSD 199

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LK  A  KHYA +   +     R   D   T +D+ ET++  F M + E +V S+MC+Y
Sbjct: 200 YLKAVATIKHYAVH---SGPEKTRHSDDYHPTRKDLFETYLPAFRMAIAETNVQSLMCAY 256

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHK-FLNDTKEDAVARVLK 302
           NRV+G P C + +L+ + +RGD  F+GY+VSDC +I    ES    + D+  +A A  +K
Sbjct: 257 NRVDGAPACGNNELMQEILRGDMGFNGYVVSDCGAIADFYESRSHHVVDSPAEAAAWAVK 316

Query: 303 AGLDLDCGD----YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKN 356
           +G DL+CGD     YTN    A+QQG I E  ID +++ L+   ++LG FD   +  Y  
Sbjct: 317 SGTDLNCGDSHGNTYTNLHY-ALQQGLITEDYIDIAVKRLFKARIKLGMFDEQDRVPYSE 375

Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
           +G + + +P+H+ L  EAA + IVLLKN NG LPL  G    +A++GP+A     ++GNY
Sbjct: 376 IGMDVVGSPKHLALTQEAAEKSIVLLKN-NGVLPLKAG--VKVAVIGPNAVDEDVLVGNY 432

Query: 417 EGTPCRYTSPMDGFYAYSKVIN--YAPGCADI 446
            G P +   P++G        N  YAPG A I
Sbjct: 433 HGVPVKPVLPLEGIVNRVGEANVFYAPGSAQI 464



 Score =  110 bits (274), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 86/302 (28%), Positives = 135/302 (44%), Gaps = 54/302 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAEGK----------DRVDLLLPGFQTELINKVADAAKG 506
           A+ AA+ AD  + + G+D  +E E            DR  + LP  QT L+ ++    K 
Sbjct: 620 ALAAARKADVIIFMGGIDAHLEGEEMPLELDGFTHGDRTHINLPKVQTNLLKQLKATGK- 678

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV +V  S  A+ +N+   + K+ +IL   YPGE  G A+A++++G  +P GRLP+T+Y+
Sbjct: 679 PVVMVNFSGSAMALNW--ESEKLDAILQAFYPGEATGTALANILWGDVSPSGRLPVTFYK 736

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
                     +P     +   RTYKF+ G  +Y FG+GL Y  F Y              
Sbjct: 737 G------VDDLPAFNDYHMENRTYKFYRGEPLYAFGHGLGYVDFAYN------------- 777

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPP 685
                   N  V        A+ I             + V N GKM   +V  VY S   
Sbjct: 778 --------NLVVANTAEAGKALPI------------AVSVTNTGKMQAEDVAQVYISLLD 817

Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
             A T I+ +  ++R  +AAG+S ++ F + A + L  +D+   +   +G   + VG G 
Sbjct: 818 APANTPIRDLKAFKRTKLAAGESTELEFNLPA-RVLTYIDDNGKTQTYTGRVEVTVGSGQ 876

Query: 746 GG 747
            G
Sbjct: 877 KG 878


>gi|188993706|ref|YP_001905716.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
 gi|167735466|emb|CAP53681.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
          Length = 896

 Score =  305 bits (780), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 189/460 (41%), Positives = 251/460 (54%), Gaps = 45/460 (9%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY D   P   RA DLV RMTL EK  QM + A  +PRL +P Y+WW+EALHGV+  G  
Sbjct: 39  PYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG-- 96

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG----- 127
                         GAT FP  I   A+F+  L  ++   +S EARA ++   AG     
Sbjct: 97  --------------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLAGGEHKR 142

Query: 128 ---LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
              LTFWSPNIN+ RDPRWGR  ET GEDP++  R  + +V+GLQ  +G  Y        
Sbjct: 143 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQG-PYR------- 194

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
            K+ A  KHYA +   +    DR HFD   +E+D+ ET++  F+  V EG V++VM +YN
Sbjct: 195 -KLDATAKHYAVH---SGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYN 250

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           RVNG    A  + L   +R DW F GYIVSDC +I+ I ++HK +  T E A A  +K G
Sbjct: 251 RVNGESASASTR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVP-TPEAAAALGVKHG 308

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNI 362
            DLDCGD Y      AV+ G I EA ID SL  L    +RLG FD   +  +  +  +  
Sbjct: 309 TDLDCGDTYAALP-AAVRAGLIDEATIDRSLTRLMAARLRLGMFDPPAKVPWAQIPASAN 367

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
            +PQH  LA   AR+ +VLLKND G LPL    +K +A+VGP A+   +++GNY GTP  
Sbjct: 368 QSPQHDALARRTARESLVLLKND-GLLPLKP-TLKRIAVVGPTADDPMSLLGNYYGTPAA 425

Query: 423 YTSPMDGFY--AYSKVINYAPGCADIVCQNNSMIPAAIDA 460
             + + G    A    + YA G   +  + +    A IDA
Sbjct: 426 PVTILQGIRDAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465



 Score =  143 bits (361), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 96/298 (32%), Positives = 149/298 (50%), Gaps = 53/298 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+DAA+NAD  V V GL   VE E          G DR D  LP  Q EL+  +  A   
Sbjct: 623 AVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQAL-QATGT 681

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV  V+ +  A+ I++A+ +  + +IL   YPG+ GG A+ DV+FG+ +PGGRLPIT+Y+
Sbjct: 682 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPITFYK 739

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
            +     +    +R      GRTY++FDG  +YPFG+GL+YTQF Y         +++LD
Sbjct: 740 EDERLPAFDDYAMR------GRTYRYFDGKPLYPFGHGLAYTQFAYS--------NLRLD 785

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           +         TV                  D      + V+N G+  G EVV +Y  P  
Sbjct: 786 RT--------TV----------------AADGTLRATVSVKNTGQRAGDEVVQLYLHPLN 821

Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS-LLASGAHTILVG 742
                  K++ G++R+ +  G+  +V F +   ++L+I D    +  +  GA+ + +G
Sbjct: 822 PQRERARKELRGFQRITLQPGEHREVSFNITPREALRIYDEQRKAYAVDPGAYELQIG 879


>gi|389737578|ref|ZP_10190998.1| beta-glucosidase [Rhodanobacter sp. 115]
 gi|388434298|gb|EIL91245.1| beta-glucosidase [Rhodanobacter sp. 115]
          Length = 898

 Score =  304 bits (779), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 173/428 (40%), Positives = 239/428 (55%), Gaps = 42/428 (9%)

Query: 3   ESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEA 62
           ++I  K +   Y D    + ERA DLV RMTL EKV QM + A  +PRLG+P Y+WW+EA
Sbjct: 32  KTIAAKQTQPLYLDTAHSFQERAADLVSRMTLAEKVAQMQNSAPAIPRLGVPAYDWWNEA 91

Query: 63  LHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
           LHGV+  G                 AT FP  I   A+F+ +L       +S EARA YN
Sbjct: 92  LHGVARAGE----------------ATVFPQAIGLAATFDPALLHHEATAISDEARAKYN 135

Query: 123 LGN--------AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
                       GLTFWSPN N+ RDPRWGR  ET GEDPY+  R  + +VRGL+     
Sbjct: 136 DFQRRGMRGRYEGLTFWSPNTNIFRDPRWGRGQETYGEDPYLTSRMGVAFVRGLEG---- 191

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
                 D    K+ A  KH+A +   +   ++R  FD   +E+D+ ET++  F+  V +G
Sbjct: 192 -----DDPTYQKLDATAKHFAVH---SGPESERHRFDVHPSERDLHETYLPAFQALVQQG 243

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
            V +VM +YNRV+G+P  A  +LL   +R DW F GY+VSDCD++  I + HK +  T E
Sbjct: 244 GVDAVMGAYNRVDGVPATASHRLLQDILRRDWGFKGYVVSDCDAVADIYQFHKVV-PTAE 302

Query: 295 DAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSP 352
            A A  +  G DL+CG  Y    + AV  G + E  IDT++  L +   RLG FD  G  
Sbjct: 303 QAAALAVNNGDDLNCGTTYATL-VKAVHDGLVNEHTIDTAVTRLMLARFRLGMFDPPGRV 361

Query: 353 QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAM 412
            +  L  + + +PQH  LA   A++ +VLLKND G LPL+  N++ +A++GP A+   A+
Sbjct: 362 PWSTLPMSVVQSPQHDALALRTAQESMVLLKND-GLLPLSH-NVRRIAVIGPTADNVTAL 419

Query: 413 IGNYEGTP 420
           +GNY GTP
Sbjct: 420 LGNYHGTP 427



 Score =  145 bits (367), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 96/306 (31%), Positives = 155/306 (50%), Gaps = 57/306 (18%)

Query: 451 NSMIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKV 500
            S   AA+DAA++AD  +   GL   +E E          G DR  L LP  Q +L+  +
Sbjct: 620 KSPFEAALDAARHADVVIFAGGLSSDLEGEEMPVDYPGFAGGDRTTLALPATQRKLLQAL 679

Query: 501 ADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRL 560
               K PV LV+ +  A+ I++AK +  + +IL   YPG++GG A+AD +FG  +P GRL
Sbjct: 680 QVTGK-PVVLVLTTGSALAIDWAKQH--LPAILLAWYPGQDGGHAVADALFGNVDPAGRL 736

Query: 561 PITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS 620
           P+T+Y++     P+    ++      GRTY++F G  ++PFG+GLSYT+F Y        
Sbjct: 737 PVTFYKSARQLPPFDDYAMK------GRTYRYFTGQPLFPFGFGLSYTRFAYS------- 783

Query: 621 VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
            D++LD+D        T+G +                 +    + V+N G+  G EVV +
Sbjct: 784 -DLQLDRD--------TLGPSD----------------RMRISLRVKNTGQRAGDEVVQL 818

Query: 681 YSKPPGIAGTH---IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGA 736
           Y +P  +   H   IK + G++R+ +  G+   V F ++    LK  D A ++  +A G 
Sbjct: 819 YLRP--LRAPHARAIKSLRGFQRISLKPGEERSVSFDISPQTDLKYYDVAHHAYAVAPGR 876

Query: 737 HTILVG 742
           + + VG
Sbjct: 877 YQVQVG 882


>gi|295135996|ref|YP_003586672.1| beta-glucosidase [Zunongwangia profunda SM-A87]
 gi|294984011|gb|ADF54476.1| putative beta-glucosidase [Zunongwangia profunda SM-A87]
          Length = 796

 Score =  304 bits (778), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 225/730 (30%), Positives = 346/730 (47%), Gaps = 118/730 (16%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+PL     EA+HG   +G                  T FPT I   +++N  L KK+
Sbjct: 126 RLGIPLL-LEEEAMHGHMAVG-----------------TTVFPTAIGQASTWNPDLIKKM 167

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
              ++ E RA         T + P I++ R+PRW RV ET GEDPY++     + V G Q
Sbjct: 168 AHVIAKEIRA-----QGSNTAYGPIIDIAREPRWSRVEETFGEDPYLIAEMGKSMVTGFQ 222

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR-FHFDSRVTEQDMQETFILPFE 228
                  H         ++A  KH+AAY +     N    H   R    D+ + ++ P +
Sbjct: 223 G-----SHESDLKSNEHVAATLKHFAAYGVSEGGHNGAAVHIGQR----DLFQNYMYPVK 273

Query: 229 MCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKF 288
             V+ G V SVM +Y+ ++G+P+ A   LL   ++  W F G+++SD  SI+ ++  H  
Sbjct: 274 EAVDNG-VMSVMTAYSSIDGVPSTAHKNLLTNILKEKWGFKGFVISDLASIEGLLGDHHI 332

Query: 289 LNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
           + DT+EDA A  + AG+D+D G + Y +  + AV  GK+AE  ID ++R +  V  +LG 
Sbjct: 333 V-DTEEDAAAMAMNAGVDVDLGGNGYDDALIDAVNAGKVAEERIDEAVRRILTVKFKLGL 391

Query: 348 FDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
           F+     +   +  + N +HIELA E ARQ I +LKN++  LPLN   ++ +A++G +A+
Sbjct: 392 FENPYANEKQAEKIVRNSEHIELAREVARQSITMLKNEDNILPLNK-ELQNIAVIGSNAD 450

Query: 408 ATKAMIGNYEG--TPCRYTSPMDGFYAY--SKVINYAPGCADIVCQNNSMIPAAIDAAKN 463
                +G+Y    +     + ++G      +  I Y  G A +     + IPAA++AAKN
Sbjct: 451 MQYNQLGDYTAPQSEENIITVLEGIQHKMPNANIEYVKGTA-VRDTTQTNIPAAVEAAKN 509

Query: 464 ADATVIVAG----LDLSVE----------------------AEGKDRVDLLLPGFQTELI 497
           A+  ++V G     D   E                       EG DR  L L G Q EL+
Sbjct: 510 AEVAIVVLGGSSARDFKTEYLETGAATISSKEDQVLSDMESGEGYDRSTLNLMGKQLELL 569

Query: 498 NKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG 557
             V  A   P  LV++    + +N+   N  +  IL   YPG+EGG AIADVIFG +NP 
Sbjct: 570 QAVV-ATGTPTVLVLIKGRPLLLNWPAEN--VPVILDAWYPGQEGGSAIADVIFGDFNPA 626

Query: 558 GRLPITWYEANYVKIPYTSMPLRPVNNFPGRT-YKFFDGPVVYPFGYGLSYTQFKY---K 613
           GRLP++      V      +P+     FP R  Y   D   +YPFGYGLSY++FKY   K
Sbjct: 627 GRLPVS------VPKSLGQIPVYYNYWFPNRRDYVETDAKPLYPFGYGLSYSEFKYSDLK 680

Query: 614 VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMD 673
           VA+S K                                    ++ K    +++ N  K+D
Sbjct: 681 VATSGKG-----------------------------------RNTKIEISLKISNTSKVD 705

Query: 674 GSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL 732
           G EV+ +Y +       + +KQ+  +ERV I AG++  V F +   K L + D      +
Sbjct: 706 GDEVIQLYIRDMVSTVLSPVKQLRAFERVSIKAGETKTVQFEL-LPKELSLFDTEMKQKV 764

Query: 733 ASGAHTILVG 742
            +G   +++G
Sbjct: 765 QAGEFKLMIG 774


>gi|348688508|gb|EGZ28322.1| family 3 glycoside hydrolase [Phytophthora sojae]
          Length = 701

 Score =  304 bits (778), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 232/762 (30%), Positives = 353/762 (46%), Gaps = 145/762 (19%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPR-----LGLPLYEWWSEALHGV-S 67
           +C+  LP   R +DL+ R+ L EK   +   A   PR     +GLP Y W +  +HGV S
Sbjct: 34  FCNTSLPVSARVEDLLARLPLDEKAILL--TARASPRGNMSSIGLPEYNWGANCVHGVRS 91

Query: 68  FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
             G  TN P            TSFP  +      N S+ ++                   
Sbjct: 92  TCG--TNCP------------TSFPNPV------NLSIHRR------------------- 112

Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
                      RDPRWGR  ETP EDP V  +Y + Y +GLQ  EG    +  D R L+ 
Sbjct: 113 -----------RDPRWGRNTETPSEDPLVNSKYGVAYTKGLQ--EG----KHEDPRYLQA 155

Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
               KHY AY  +N+ G +R  F++ V+  D  +T+   F   + +G+   VMCSYN VN
Sbjct: 156 VVTLKHYVAYSYENYGGGNRKTFNAIVSPYDFADTYFPAFRSSIVDGNAKGVMCSYNSVN 215

Query: 248 GIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL 307
           G+P CA+ +L N+ +RG   F GYI SD  +I+ I +   ++  T+ +A    + AG D+
Sbjct: 216 GVPACANNELENKLLRGMLGFDGYITSDSGAIEAISDWLHYV-PTRCEAARLAILAGTDV 274

Query: 308 DCGD--YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD---GSPQYKNLGKNNI 362
           + G    Y       V+  ++    +D  LR    +   LG FD     P +K +  N++
Sbjct: 275 NSGRGFGYMACLKELVESNQLDVKVVDDVLRHTLKLRFELGLFDPIEDQPYWK-VTPNDV 333

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
                 +L+ + AR+ IVLL+N+   LPL  G    LA+VGPHA A +A++GNY G  C 
Sbjct: 334 NTDAAKKLSLDLARKSIVLLQNNQPVLPLRRG--VKLAVVGPHAQAKRALLGNYLGQMCH 391

Query: 423 --------YTSPMDGFYAYS--KVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG 472
                     +P +   A +      YA GC ++   + +    A+ A + A+A V+  G
Sbjct: 392 GDYNEVGCIKTPFEAVSASNGDSSTTYALGC-NVTGNSTAGFVEAVKAVQGAEAVVLFLG 450

Query: 473 LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
           +D SVEAE +DR ++ LP  Q +L+ +V    K P  +V+M+ G +         +  ++
Sbjct: 451 IDKSVEAEVRDRNNIDLPAIQVQLLQRVRAVGK-PTVVVLMNGGVLTAEDIIG--QTDAL 507

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
           +   YPG  G +A+ D++FG  NPGG+LP+T Y ++YV      M    V  +PGR+Y++
Sbjct: 508 VEAFYPGFFGAQAMTDILFGDANPGGKLPVTMYRSDYVNT--VDMKSMNVTAYPGRSYRY 565

Query: 593 FDGPVVYPFGYGLSYTQFKYK----VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
           F G  V+PFG+GLSYT F  K     A++ KSV   +         N T+          
Sbjct: 566 FKGEPVFPFGWGLSYTSFSLKADDATATTAKSVSATM---------NTTI---------- 606

Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-----PGIAGTHIKQVIGYERVFI 703
                                       VV  Y +P      G A    KQ+  Y RV +
Sbjct: 607 ---------------------------SVVFAYFRPIKTDASGPATLLNKQLFDYRRVTL 639

Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
              +S ++ F +    +L +VD   N +   G++ I++  GV
Sbjct: 640 KPSESTRLSFEVQR-STLALVDEEGNLVSFPGSYDIIITNGV 680


>gi|21233528|ref|NP_639445.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|66770493|ref|YP_245255.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
 gi|21115383|gb|AAM43327.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|66575825|gb|AAY51235.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
          Length = 896

 Score =  303 bits (777), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 189/460 (41%), Positives = 250/460 (54%), Gaps = 45/460 (9%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY D   P   RA DLV RMTL EK  QM + A  +PRL +P Y+WW+EALHGV+  G  
Sbjct: 39  PYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG-- 96

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG----- 127
                         GAT FP  I   A+F+  L  ++   +S EARA ++   AG     
Sbjct: 97  --------------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLAGGEHKR 142

Query: 128 ---LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
              LTFWSPNIN+ RDPRWGR  ET GEDP++  R  + +V+GLQ  +G  Y        
Sbjct: 143 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQG-PYR------- 194

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
            K+ A  KHYA +   +    DR HFD   +E+D+ ET++  F+  V EG V++VM +YN
Sbjct: 195 -KLDATAKHYAVH---SGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYN 250

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           RVNG    A  + L   +R DW F GYIVSDC +I+ I ++HK +  T E A A  +K G
Sbjct: 251 RVNGESASASTR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVP-TPEAAAALGVKHG 308

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNI 362
            DLDCGD Y      AV+ G I EA ID SL  L    +RLG FD   +  +     +  
Sbjct: 309 TDLDCGDTYAALP-AAVRAGLIDEATIDRSLTRLMAARLRLGMFDPPAKVPWAQTPASAN 367

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
            +PQH  LA   AR+ +VLLKND G LPL    +K +A+VGP A+   +++GNY GTP  
Sbjct: 368 QSPQHDALARRTARESLVLLKND-GLLPLKP-TLKRIAVVGPTADDPMSLLGNYYGTPAA 425

Query: 423 YTSPMDGFY--AYSKVINYAPGCADIVCQNNSMIPAAIDA 460
             + + G    A    + YA G   +  + +    A IDA
Sbjct: 426 PVTILQGIRDAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465



 Score =  143 bits (361), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 96/298 (32%), Positives = 149/298 (50%), Gaps = 53/298 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+DAA+NAD  V V GL   VE E          G DR D  LP  Q EL+  +  A   
Sbjct: 623 AVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQAL-QATGT 681

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV  V+ +  A+ I++A+ +  + +IL   YPG+ GG A+ DV+FG+ +PGGRLPIT+Y+
Sbjct: 682 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPITFYK 739

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
            +     +    +R      GRTY++FDG  +YPFG+GL+YTQF Y         +++LD
Sbjct: 740 EDERLPAFDDYAMR------GRTYRYFDGKPLYPFGHGLAYTQFAYS--------NLRLD 785

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           +         TV                  D      + V+N G+  G EVV +Y  P  
Sbjct: 786 RT--------TV----------------AADGTLRATVSVKNTGQRAGDEVVQLYLHPLN 821

Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS-LLASGAHTILVG 742
                  K++ G++R+ +  G+  +V F +   ++L+I D    +  +  GA+ + +G
Sbjct: 822 PQRERARKELRGFQRITLQPGEHREVSFNITPREALRIYDEQRKAYAVDPGAYELQIG 879


>gi|371777036|ref|ZP_09483358.1| glycoside hydrolase [Anaerophaga sp. HS1]
          Length = 890

 Score =  303 bits (776), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 178/444 (40%), Positives = 248/444 (55%), Gaps = 46/444 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D  LP+ ERA DLV +MTL EKV QM   A  + RLG+P Y WW+E LHGV   G   
Sbjct: 40  YLDPTLPFEERAADLVSKMTLEEKVSQMQHAAPAIERLGIPEYNWWNECLHGVGRAGI-- 97

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGN---- 125
                         AT FP  I   A +++    +I   VS EARA ++     G     
Sbjct: 98  --------------ATVFPQAIGMAAMWDDEEMYRIATAVSDEARAKHHDFARRGKRGIY 143

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLTFW+PNIN+ RDPRWGR +ET GEDP++ G  A++Y++GLQ           D R L
Sbjct: 144 QGLTFWTPNINIFRDPRWGRGMETYGEDPFLTGELAVDYIKGLQ---------GDDDRYL 194

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K+ A  KH+  +        DR HFD+R + +D   T+   F+  + E  V SVMC+YNR
Sbjct: 195 KLVATSKHFLVHSGPE---PDRHHFDARTSARDSLMTYTPHFKKTIQEAGVYSVMCAYNR 251

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES-HKFLNDTKEDAVARVLKAG 304
            NG+P C   K +   +R +W F GYIVSDC ++    +  H  +  T E+A A  +KAG
Sbjct: 252 YNGLPCCGS-KPVENLLRNEWGFKGYIVSDCWAVADFYKKGHHEVVPTVEEAAAMAVKAG 310

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNN 361
            DL+CG+ Y    + AV+QG ++E +ID  ++ L    +RLG FD  P+   Y N+  + 
Sbjct: 311 TDLNCGNSYPAL-VDAVKQGLVSEEEIDVLVKRLMEARLRLGMFD-PPEMVPYTNIPYSV 368

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           + + +H ELA  AAR+ +VLLKNDN  LPL+  N+K +A++GP+AN    ++ NY G P 
Sbjct: 369 VDSKEHRELALIAARKSMVLLKNDNNTLPLDK-NVKNVAVIGPNANNLDVLLANYNGYPS 427

Query: 422 RYTSPMDGFYAY--SKVINYAPGC 443
              +P+DG      +  + YA GC
Sbjct: 428 NPVTPLDGIRQKLPNANVQYALGC 451



 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 92/269 (34%), Positives = 137/269 (50%), Gaps = 52/269 (19%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           AI  A  +D  ++  GL  ++E E          G DRVD+ LP  QT+L+  +    K 
Sbjct: 610 AIQIAAASDVVLMFMGLSPNLEGEEMPVNVPGFSGGDRVDIKLPQIQTDLVKAIMSLGK- 668

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV LV+++  A+ IN+   N  + +IL   YPG+ GG AIADV+FG YNP GRLP+T+Y+
Sbjct: 669 PVVLVLLNGSALAINWEAEN--VPAILEAWYPGQAGGTAIADVLFGDYNPAGRLPVTFYK 726

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
           +       T +P     +  GRTY++F G  ++PFGYGLSYT FKY     P       D
Sbjct: 727 S------VTQLPPFEDYSMDGRTYQYFKGEALFPFGYGLSYTSFKYDNLVVP-------D 773

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           K +  +++                          T  ++V N G  DG EVV +Y   P 
Sbjct: 774 KLEAGKEV--------------------------TVHVDVTNTGNRDGDEVVQLYVSHPD 807

Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
           +    I+ + G++R+ + AG++  V FT+
Sbjct: 808 VESAPIRSLQGFDRIALKAGETKTVSFTL 836


>gi|384430040|ref|YP_005639401.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
           756C]
 gi|341939144|gb|AEL09283.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
           756C]
          Length = 896

 Score =  303 bits (776), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 189/460 (41%), Positives = 249/460 (54%), Gaps = 45/460 (9%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY D   P   RA DLV RMTL EK  QM + A  +PRL +P Y+WW+EALHGV+  G  
Sbjct: 39  PYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG-- 96

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------ 126
                         GAT FP  I   A+F+  L  ++   +S EARA ++   A      
Sbjct: 97  --------------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARGEHKR 142

Query: 127 --GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLTFWSPNIN+ RDPRWGR  ET GEDP++  R  + +V+GLQ  +G  Y        
Sbjct: 143 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQG-PYR------- 194

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
            K+ A  KHYA +        DR HFD   +E+D+ ET++  F+  V EG V++VM +YN
Sbjct: 195 -KLDATAKHYAVHSGPE---ADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYN 250

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           RVNG    A  + L   +R DW F GYIVSDC +I+ I ++HK +  T E A A  +K G
Sbjct: 251 RVNGESASASTR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVP-TPEAAAALGVKHG 308

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNI 362
            DLDCGD Y      AV+ G I EA ID SL  L    +RLG FD   +  +     +  
Sbjct: 309 TDLDCGDTYAALP-AAVRAGLIDEATIDRSLTRLMAARLRLGMFDPPAKVPWAQTPASAN 367

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
            +PQH  LA   AR+ +VLLKND G LPL    +K +A+VGP A+   +++GNY GTP  
Sbjct: 368 QSPQHDALARRTARESLVLLKND-GLLPLKP-TLKRIAVVGPTADDPMSLLGNYYGTPAA 425

Query: 423 YTSPMDGFY--AYSKVINYAPGCADIVCQNNSMIPAAIDA 460
             + + G    A    + YA G   +  + +    A IDA
Sbjct: 426 PVTILQGIRDAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465



 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 97/298 (32%), Positives = 150/298 (50%), Gaps = 53/298 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+DAA+NAD  V V GL   VE E          G DR D  LP  Q EL+  +  A   
Sbjct: 623 AVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQAL-QATGT 681

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV  V+ +  A+ I++A+ +  + +IL   YPG+ GG A+ DV+FG+ +PGGRLPIT+Y+
Sbjct: 682 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPITFYK 739

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
            +     +    +R      GRTY++FDG  +YPFG+GL+YTQF Y         +++LD
Sbjct: 740 EDERLPAFDDYAMR------GRTYRYFDGKPLYPFGHGLAYTQFAYS--------NLRLD 785

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           +         TV                  D      + V+N G+  G EVV +Y  P  
Sbjct: 786 RT--------TV----------------AADGTLRATVWVKNTGQRAGDEVVQLYLHPLN 821

Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS-LLASGAHTILVG 742
                  K++ G++R+ +  G+  +V FT+   ++L+I D    +  +  GA+ + +G
Sbjct: 822 PQRERARKELRGFQRITLQPGEHREVSFTITPREALRIYDEQRKAYAVDPGAYELQIG 879


>gi|325929067|ref|ZP_08190221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
 gi|325540562|gb|EGD12150.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
          Length = 850

 Score =  300 bits (769), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 176/429 (41%), Positives = 244/429 (56%), Gaps = 37/429 (8%)

Query: 32  MTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSF 91
           MTL EK  QM + A  +PRLG+P Y+WW+EALHGV+  G                GAT F
Sbjct: 1   MTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG----------------GATVF 44

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYN--------LGNAGLTFWSPNINVVRDPRW 143
           P  I   A+F+  L  ++   +S EARA ++            GLTFWSPNIN+ RDPRW
Sbjct: 45  PQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFWSPNINIFRDPRW 104

Query: 144 GRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL-KISACCKHYAAYDLDNW 202
           GR  ET GEDP++  R  + +V+GLQ  EG +  +++   P  K+ A  KH+A +     
Sbjct: 105 GRGQETYGEDPFLTARMGVTFVQGLQG-EGADAPKNAQGEPYRKLDATAKHFAVHSGPE- 162

Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
              DR HFD+R +++D+ ET++  FE  V +G V +VM +YNRV G    A   LL   +
Sbjct: 163 --ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESASASKFLLQDVL 220

Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQ 322
           R  W F GY+VSDC +I  I + HK +  T+E A A  +K G +L+CG+ Y+     AV+
Sbjct: 221 RQQWGFKGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHGTELECGEEYSTLP-AAVR 278

Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
           QG I EA IDT+L  L    MRLG FD  G   +  +  +   +P H  LA   AR+ +V
Sbjct: 279 QGLIDEAQIDTALTTLMTARMRLGMFDPPGQLPWSTIPASVNQSPAHDALARRTARESLV 338

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYS--KVIN 438
           LLKND G LPL+   +K +A++GP A+ T A++GNY GTP    + + G  A +    + 
Sbjct: 339 LLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQGIRAAAPNAQVL 397

Query: 439 YAPGCADIV 447
           YA G AD+V
Sbjct: 398 YARG-ADLV 405



 Score =  141 bits (355), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 94/298 (31%), Positives = 145/298 (48%), Gaps = 53/298 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+D A++AD  V V GL   VE E          G DR DL LP  Q +L+  +    K 
Sbjct: 577 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK- 635

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV  V+ +  A+ I++A+ +  + +IL   YPG+ GG A+AD +FG  NPGGRLP+T+Y+
Sbjct: 636 PVVAVLTTGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 693

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
            +     +    +R      GRTY++F G  +YPFG+GLSYTQF Y          ++LD
Sbjct: 694 ESETLPAFDDYAMR------GRTYRYFGGTPLYPFGHGLSYTQFAYS--------GLRLD 739

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           +                             D   T  + V+N G+  G EVV +Y  P  
Sbjct: 740 R------------------------TTIAADGSLTATVTVKNTGQRAGDEVVQLYLHPLT 775

Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
                  K++ G++R+ +  G+   + FT++A  +L+I D    +     GA+ + +G
Sbjct: 776 PQRERAGKELHGFQRIALQPGEQRALHFTLDAKNALRIYDAQRKAYAVDPGAYEVQIG 833


>gi|326427096|gb|EGD72666.1| hypothetical protein PTSG_04397 [Salpingoeca sp. ATCC 50818]
          Length = 614

 Score =  300 bits (767), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 205/631 (32%), Positives = 315/631 (49%), Gaps = 65/631 (10%)

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
           SPNIN+ RDPRWGR  E P EDP + G +   Y  GLQ  E        DSR  K+    
Sbjct: 11  SPNININRDPRWGRNQEVPSEDPLLNGEFGKLYTMGLQQGE--------DSRYTKVVVTL 62

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KH+ AY L++ +G  R +FD++V+   + +T+   F   V EG+   VMCSYN +NG PT
Sbjct: 63  KHWDAYSLEDSDGFTRHNFDAKVSNFALMDTYWPAFRKAVMEGNAKGVMCSYNALNGRPT 122

Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
           C  P LL + +R  W F GY+ SD  +I+ I   H +  +      A +     D+D G 
Sbjct: 123 CTHP-LLTKVLRDIWKFDGYVTSDTGAIEDIYAKHHYTANASAAVAAALRDGRCDMDSGA 181

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICNPQHIE 369
            Y +  + AV  G+ +  D+D +L     +   LG FD      Y  +  ++I      +
Sbjct: 182 VYHDALLDAVNSGECSMDDVDRALYNTLKLRFELGLFDPIEDQPYWRINASSINTTYAQD 241

Query: 370 LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR------Y 423
           L  +   + ++LL+N N ALP   G  + +A++GPH NA +A++GNY G  C        
Sbjct: 242 LNMKITLESMILLQNHNNALPFKKG--RKVAVIGPHINAQEALVGNYLGQLCPDDSFDCI 299

Query: 424 TSPMDGFYAYSKVINY--APGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
           TSP+    A + + N   A G   + C + S I  A++ AK+AD  V++ G++ ++EAE 
Sbjct: 300 TSPLAAIEAINGMSNTVSAMGSGVLACTDAS-IQEAVNVAKDADYVVLLIGINDTIEAES 358

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
            DR  + LP  Q +L   +A   K     V+++ G + I   K   ++ +I+  GYPG  
Sbjct: 359 NDRTSIDLPQCQHKLTAAIAHLNK-TTAAVLINGGMLAIEQEKK--QLPAIIEAGYPGFY 415

Query: 542 GGRAIADVIFGKYNP-GGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
           GG AIA  IFG  N  GG+LP T Y A+Y+ KI  + M +    N PGR+Y+++ G  ++
Sbjct: 416 GGAAIAKTIFGDNNHLGGKLPYTVYPADYIHKINMSDMEM---TNSPGRSYRYYTGQPLW 472

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           PFG+GL+YT F  +      S               +  G+N                  
Sbjct: 473 PFGFGLAYTTFSVQSPGPSAST--------------FATGSNT----------------S 502

Query: 660 FTFQIEVENMGKMDGSEVVMVYSKP---PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
           F+  + V N GK  G  VV VY  P   P  + +  KQ+I +ERV +   Q   V   ++
Sbjct: 503 FSLPVHVVNTGKRTGDTVVQVYMAPVSLPHRSFSLKKQLIAFERVHLTPNQRLGVTIPLS 562

Query: 717 ACKSLKIVDNAANSLLAS-GAHTILVGEGVG 746
           A     +VD    +++++ G++ ++V +GV 
Sbjct: 563 A-DVFNMVDPVTGNVVSTPGSYRLVVSDGVA 592


>gi|397690575|ref|YP_006527829.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
 gi|395812067|gb|AFN74816.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
          Length = 860

 Score =  298 bits (764), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 170/462 (36%), Positives = 259/462 (56%), Gaps = 52/462 (11%)

Query: 2   FESIKVKLSDFP-YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
           F S+ +   + P Y +  LP+ ERA+DL++R++L EK+  M   +  + RLG+P Y WW+
Sbjct: 10  FLSVNLFAQNIPGYLNVNLPFEERAEDLLQRLSLDEKISLMVHQSPAIERLGIPEYNWWN 69

Query: 61  EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
           EALHGV+  GR                AT FP  I   A+++  L  +I   +S EARA 
Sbjct: 70  EALHGVARNGR----------------ATVFPMPIGLAATWDRDLIYRIADVISNEARAK 113

Query: 121 YNLG--------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVE 172
           YN            G++ W+PNIN+ RDPRWGR +ET GEDPY+ G  A+++++GLQ   
Sbjct: 114 YNSALKKNQRGIYQGISLWAPNINIFRDPRWGRGMETYGEDPYLTGELAVSFIKGLQG-- 171

Query: 173 GVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVN 232
                   D + LK  A  KH A +     E   R HF++ V+  D+ ET++  F+  + 
Sbjct: 172 -------QDKKYLKTIATPKHLAVHSGPEPE---RHHFNALVSNYDLNETYLPHFKKSIM 221

Query: 233 EGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDT 292
           +G   SVMC+YNR+ G   C    LL   +R  W F G +VSDC ++  I  SHK + D+
Sbjct: 222 KGKAYSVMCAYNRLRGKACCGHDTLLTDILRNKWGFEGIVVSDCWAVYDIFNSHKIV-DS 280

Query: 293 KEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP 352
            E A A  + +G DL+CG+ + +    A + G I E +ID++LR + +   +LG FD  P
Sbjct: 281 PEKAAALAVSSGTDLECGNTFLSLK-NAYRDGLITEKEIDSALRRVLLARFKLGMFD-PP 338

Query: 353 Q---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT 409
           +   Y  + ++ + N  + E+A EAAR+ IVLLKNDN  LPL++ +I  +A++GP+A+  
Sbjct: 339 EIVSYSQIDESYLDNSYNREIALEAARKSIVLLKNDNKLLPLDS-SINKIAVIGPNADNL 397

Query: 410 KAMIGNYEGTPCRYTSPM--------DGFYAYSKVINYAPGC 443
           ++++GNY G P  Y +P+        +G   Y K  ++APG 
Sbjct: 398 ESLLGNYHGFPSEYITPLQAIRRVLKNGEVFYEKGCDFAPGV 439



 Score =  117 bits (294), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 92/296 (31%), Positives = 140/296 (47%), Gaps = 53/296 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A   A  +DA ++  GL   +E E          G DR+ L LP  Q +LI K+    K 
Sbjct: 591 AYKTALKSDAVIMFMGLCPRMEGEALKIKLDGFKGGDRLKLSLPANQLKLIKKIHSTGK- 649

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV LV+++ G +   +   N  I +IL   YPG+ GGRAI DVI+GKYNP G+LP+T Y+
Sbjct: 650 PVILVLLNGGPISTVWESEN--IPAILEAWYPGQAGGRAITDVIWGKYNPSGKLPVTIYK 707

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
           +     P+ +  +       GRTY++F G V+YPFG+GL+YT             DI + 
Sbjct: 708 SENDLPPFENYDME------GRTYRYFKGEVLYPFGWGLNYT-------------DITIS 748

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
                   N  +  N          ++K  D      ++++N G + G E V +Y+K   
Sbjct: 749 --------NIELSAN----------EIKDND-TIRVVVKLKNNGNLAGEETVQLYTKALK 789

Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
              T IK + G+E++ +  G    V F ++       VD      +  G + I+VG
Sbjct: 790 DNRT-IKTLRGFEKIKLEPGTEGMVEFYLSKSDLAVWVDGLGFETMP-GVYEIIVG 843


>gi|296081549|emb|CBI20072.3| unnamed protein product [Vitis vinifera]
          Length = 333

 Score =  298 bits (762), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 156/334 (46%), Positives = 209/334 (62%), Gaps = 14/334 (4%)

Query: 412 MIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVA 471
           MIGNYEGTP +YT+P+ G  A      Y PGC+++ C   + I  A   A  ADATV++ 
Sbjct: 1   MIGNYEGTPGKYTTPLQGLTALVAT-TYLPGCSNVAC-GTAQIDEAKKIAAAADATVLIV 58

Query: 472 GLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKS 531
           G+D S+EAEG+DRV++ LPG Q  LI +VA A+KG V LV+MS G  DI+FAKN+ KI S
Sbjct: 59  GIDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISFAKNDDKITS 118

Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGR 588
           ILWVGYPGE GG AIADVIFG YNP GRLP TWY  +YV K+P T+M +R  P + +PGR
Sbjct: 119 ILWVGYPGEAGGAAIADVIFGFYNPSGRLPTTWYPQSYVDKVPMTNMNMRPDPASGYPGR 178

Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
           TY+F+ G  +Y FG GLSYTQF + +  +PKSV I +++   C         +   C +V
Sbjct: 179 TYRFYTGETIYTFGDGLSYTQFNHHLIQAPKSVSIPIEEGHSC---------HSSKCKSV 229

Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQS 708
                 C++  F   + V N G + GS  V ++S PP +  +  K ++G+E+VF+ A   
Sbjct: 230 DAVQESCQNLAFDIHLRVNNAGNISGSHTVFLFSSPPSVHNSPQKHLLGFEKVFVTAKAE 289

Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           A V F ++ CK L IVD      +A G H + VG
Sbjct: 290 ALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVG 323


>gi|147826476|emb|CAN72807.1| hypothetical protein VITISV_033721 [Vitis vinifera]
          Length = 236

 Score =  298 bits (762), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 138/215 (64%), Positives = 164/215 (76%), Gaps = 6/215 (2%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
           R+  + + +  F +CD  L Y ERAKDLV RMTL EKV Q    A GV RLGLP Y WWS
Sbjct: 23  RYALLGLDMKSFAFCDKSLSYEERAKDLVSRMTLQEKVMQSVHTASGVRRLGLPEYSWWS 82

Query: 61  EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
           EALHG+S +G      PG  FD  +PGATSFPTVIL+TA+FN++LWK +G+ VSTE RAM
Sbjct: 83  EALHGISNLG------PGVFFDETIPGATSFPTVILSTAAFNQTLWKTLGRVVSTEGRAM 136

Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
           YNLG+AGLTFWSPNINVVRD RWGR  ET GEDP++VG +A+NYVRGLQDVEG E   D 
Sbjct: 137 YNLGHAGLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGTENVTDL 196

Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVT 215
           +SRPLK+S+CCKHYAAYD+D+W   DR  FD+RV+
Sbjct: 197 NSRPLKVSSCCKHYAAYDIDSWLNVDRHTFDARVS 231


>gi|84623339|ref|YP_450711.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|188577358|ref|YP_001914287.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
 gi|84367279|dbj|BAE68437.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|188521810|gb|ACD59755.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
          Length = 889

 Score =  296 bits (757), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 175/446 (39%), Positives = 243/446 (54%), Gaps = 46/446 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA DLV  M+  EKV Q  + A  +PRLG+P YEWWSE LHG++  G            
Sbjct: 39  QRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 87

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N  L +++G  VSTEARA +N            AGLT WSP
Sbjct: 88  -----ATVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGKDHKRYAGLTIWSP 142

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDP++ G+ A+ ++ GLQ         D    P  I A  KH
Sbjct: 143 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIHGLQG--------DDLDHPRTI-ATPKH 193

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +   +     R  FD  V+ +D++ T+   F   + EG   +VMC+YN ++G P CA
Sbjct: 194 LAVH---SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACA 250

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              L+N  +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y
Sbjct: 251 ADWLINGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 309

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
                 A+ +G++ EA +D SL  L+    RLG  +   +  Y  LG  ++ N QH  LA
Sbjct: 310 RELGT-AIARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAQHRALA 368

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA + IVLLKN+   LPLN G    LA++GP+A+A  A+  NY+GT     +P+ G  
Sbjct: 369 LQAAAESIVLLKNNANTLPLNAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 426

Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
               ++ ++YA G A +      MIP
Sbjct: 427 QRFGAQQVSYAQG-APLAAGVPGMIP 451



 Score =  133 bits (334), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 89/280 (31%), Positives = 138/280 (49%), Gaps = 49/280 (17%)

Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           V G +L ++    +G DR D+ LP  Q  L+ + A A+  P+ +V+MS  AV +N+AK +
Sbjct: 631 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 689

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
                  W  YPG+ GG AIA ++ G  NPGGRLP+T+Y +      Y S  ++      
Sbjct: 690 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK------ 741

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++F G  ++PFGYGLSYT+F Y                 Q        G+      
Sbjct: 742 GRTYRYFKGEPLFPFGYGLSYTRFAYDAP--------------QLSSTAVQAGS------ 781

Query: 647 AVLIDDVKCKDYKFTFQI--EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIA 704
                         T Q+   V N G   G EV  VY + P    + ++ ++G++RV +A
Sbjct: 782 --------------TLQVTTTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLA 827

Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
           AG+   + F ++A ++L  VD +    + +G +T+ VG G
Sbjct: 828 AGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGG 866


>gi|284174578|ref|ZP_06388547.1| Beta-xylosidase [Sulfolobus solfataricus 98/2]
 gi|356934752|gb|AET42953.1| beta-xylosidase-like protein [Sulfolobus solfataricus 98/2]
          Length = 754

 Score =  295 bits (756), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 216/691 (31%), Positives = 348/691 (50%), Gaps = 106/691 (15%)

Query: 88  ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
           +T+FP  I   +++N  L   +  T+ ++ R +      G+    SP ++V RDPRWGR 
Sbjct: 101 STAFPQAIGLASTWNPELLTNVASTIRSQGRLI------GVNQCLSPVLDVCRDPRWGRC 154

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN- 205
            ET GEDPY+V    + Y+ GLQ                ++ A  KH+AA+       N 
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQG-------------ETQLVATAKHFAAHGFPEGGRNI 201

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
            + H  +R    +++ETF+ PFE+ V  G V S+M +Y+ ++G+P   +P+LL   +R +
Sbjct: 202 AQVHVGNR----ELRETFLFPFEVAVKIGKVMSIMPAYHEIDGVPCHGNPQLLTNILRQE 257

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQ 323
           W F G +VSD D I+ +   HK  ++  E A+   L++G+D++    D Y    + A+++
Sbjct: 258 WGFDGIVVSDYDGIRQLEAIHKVASNKMEAAIL-ALESGVDIEFPTIDCYGEPLVTAIKE 316

Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLK 383
           G ++EA ID ++  +  +  RLG  D     ++     + + +  ELA +AAR+ IVLLK
Sbjct: 317 GLVSEAIIDRAVERVLRIKERLGLLDNPFVDESAVPERLDDRKSRELALKAARESIVLLK 376

Query: 384 NDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT-------PCRYTSPMDGF---YAY 433
           N+N  LPL+  NI  +A++GP+AN  + M+G+Y  T            + + G       
Sbjct: 377 NENNMLPLSK-NINKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIAKKVGE 435

Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV----AGLDLS------------- 476
            KV+ YA GC DI  ++      AI+ AK AD  + V    +GL LS             
Sbjct: 436 GKVL-YAKGC-DIAGESKEGFSEAIEIAKQADVIIAVMGEKSGLPLSWTDIPSEEEFKKY 493

Query: 477 --VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
             V  EG DR  L L G Q EL+ ++    K P+ LV+++   + ++   N   +K+I+ 
Sbjct: 494 QAVTGEGNDRASLRLLGVQEELLKELYKTGK-PIILVLINGRPLVLSPIINY--VKAIIE 550

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
             +PGEEGG AIAD+IFG YNP GRLPIT+  +   + + Y+  P    ++F  R Y   
Sbjct: 551 AWFPGEEGGNAIADIIFGDYNPSGRLPITFPMDTGQIPLYYSRKP----SSF--RPYVML 604

Query: 594 DGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
               ++ FGYGLSYTQF+Y  +  +PK V                      P + +    
Sbjct: 605 HSSPLFTFGYGLSYTQFEYSNLEVTPKEVG---------------------PLSYI---- 639

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKV 711
                   T  ++V+N+G M+G EVV +Y SK        +K++ G+ +V +  G+  +V
Sbjct: 640 --------TILLDVKNVGNMEGDEVVQLYISKSFSSVARPVKELKGFAKVHLKPGEKRRV 691

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            F +   ++L   DN    ++  G + IL+G
Sbjct: 692 KFAL-PMEALAFYDNFMRLVVEKGEYQILIG 721


>gi|15899739|ref|NP_344344.1| Beta-xylosidase [Sulfolobus solfataricus P2]
 gi|13816430|gb|AAK43134.1| Beta-xylosidase [Sulfolobus solfataricus P2]
          Length = 754

 Score =  295 bits (756), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 216/691 (31%), Positives = 348/691 (50%), Gaps = 106/691 (15%)

Query: 88  ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
           +T+FP  I   +++N  L   +  T+ ++ R +      G+    SP ++V RDPRWGR 
Sbjct: 101 STAFPQAIGLASTWNPELLTNVASTIRSQGRLI------GVNQCLSPVLDVCRDPRWGRC 154

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN- 205
            ET GEDPY+V    + Y+ GLQ                ++ A  KH+AA+       N 
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQG-------------ETQLVATAKHFAAHGFPEGGRNI 201

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
            + H  +R    +++ETF+ PFE+ V  G V S+M +Y+ ++G+P   +P+LL   +R +
Sbjct: 202 AQVHVGNR----ELRETFLFPFEVAVKIGKVMSIMPAYHEIDGVPCHGNPQLLTNILRQE 257

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQ 323
           W F G +VSD D I+ +   HK  ++  E A+   L++G+D++    D Y    + A+++
Sbjct: 258 WGFDGIVVSDYDGIRQLEAIHKVASNKMEAAIL-ALESGVDIEFPTIDCYGEPLVTAIKE 316

Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLK 383
           G ++EA ID ++  +  +  RLG  D     ++     + + +  ELA +AAR+ IVLLK
Sbjct: 317 GLVSEAIIDRAVERVLRIKERLGLLDNPFVDESAVPERLDDRKSRELALKAARESIVLLK 376

Query: 384 NDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT-------PCRYTSPMDGF---YAY 433
           N+N  LPL+  NI  +A++GP+AN  + M+G+Y  T            + + G       
Sbjct: 377 NENNMLPLSK-NINKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIAKKVGE 435

Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV----AGLDLS------------- 476
            KV+ YA GC DI  ++      AI+ AK AD  + V    +GL LS             
Sbjct: 436 GKVL-YAKGC-DIAGESKEGFSEAIEIAKQADVIIAVMGEKSGLPLSWTDIPSEEEFKKY 493

Query: 477 --VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
             V  EG DR  L L G Q EL+ ++    K P+ LV+++   + ++   N   +K+I+ 
Sbjct: 494 QAVTGEGNDRASLRLLGVQEELLKELYKTGK-PIILVLINGRPLVLSPIINY--VKAIIE 550

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
             +PGEEGG AIAD+IFG YNP GRLPIT+  +   + + Y+  P    ++F  R Y   
Sbjct: 551 AWFPGEEGGNAIADIIFGDYNPSGRLPITFPMDTGQIPLYYSRKP----SSF--RPYVML 604

Query: 594 DGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
               ++ FGYGLSYTQF+Y  +  +PK V                      P + +    
Sbjct: 605 HSSPLFTFGYGLSYTQFEYSNLEVTPKEVG---------------------PLSYI---- 639

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKV 711
                   T  ++V+N+G M+G EVV +Y SK        +K++ G+ +V +  G+  +V
Sbjct: 640 --------TILLDVKNVGNMEGDEVVQLYISKSFSSVARPVKELKGFAKVHLKPGEKRRV 691

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            F +   ++L   DN    ++  G + IL+G
Sbjct: 692 KFAL-PMEALAFYDNFMRLVVEKGEYQILIG 721


>gi|58581402|ref|YP_200418.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|58425996|gb|AAW75033.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
          Length = 889

 Score =  295 bits (756), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 176/446 (39%), Positives = 247/446 (55%), Gaps = 46/446 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA DLV  M+  EKV Q  + A  +PRLG+P YEWWSE LHG++  G            
Sbjct: 39  QRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 87

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL----GN-----AGLTFWSP 133
                AT FP  I   AS+N  L +++G  VSTEARA +N     GN     AGLT WSP
Sbjct: 88  -----ATVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGNDHKRYAGLTIWSP 142

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDP++ G+ A+ ++ GLQ  E +++ R          A  KH
Sbjct: 143 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIHGLQG-EDLDHPR--------TIATPKH 193

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +   +     R  FD  V+ +D++ T+   F   + EG   +VMC+YN ++G P CA
Sbjct: 194 LAVH---SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACA 250

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              L+N  +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y
Sbjct: 251 ADWLINGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 309

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
                 A+ +G++ EA +D SL  L+    RLG  +   +  Y  LG  ++ N QH  LA
Sbjct: 310 RELGT-AIARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAQHRALA 368

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA + IVLLKN+   LPLN G    LA++GP+A+A  A+  NY+GT     +P+ G  
Sbjct: 369 LQAAAESIVLLKNNANTLPLNAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 426

Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
               ++ ++YA G A +      MIP
Sbjct: 427 QRFGAQQVSYAQG-APLAAGVPGMIP 451



 Score =  133 bits (334), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 89/280 (31%), Positives = 138/280 (49%), Gaps = 49/280 (17%)

Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           V G +L ++    +G DR D+ LP  Q  L+ + A A+  P+ +V+MS  AV +N+AK +
Sbjct: 631 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 689

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
                  W  YPG+ GG AIA ++ G  NPGGRLP+T+Y +      Y S  ++      
Sbjct: 690 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK------ 741

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++F G  ++PFGYGLSYT+F Y                 Q        G+      
Sbjct: 742 GRTYRYFKGEPLFPFGYGLSYTRFAYDAP--------------QLSSTAVQAGS------ 781

Query: 647 AVLIDDVKCKDYKFTFQI--EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIA 704
                         T Q+   V N G   G EV  VY + P    + ++ ++G++RV +A
Sbjct: 782 --------------TLQVTTTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLA 827

Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
           AG+   + F ++A ++L  VD +    + +G +T+ VG G
Sbjct: 828 AGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGG 866


>gi|431798021|ref|YP_007224925.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
           DSM 17526]
 gi|430788786|gb|AGA78915.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
           DSM 17526]
          Length = 906

 Score =  295 bits (756), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 171/431 (39%), Positives = 242/431 (56%), Gaps = 44/431 (10%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
           DF + D +  + ER   LV++M+L EKV QM + +  +PRL +P Y WW+E LHGV+  G
Sbjct: 49  DFSFLDMEKNFEERVDILVDQMSLEEKVSQMMNASPAIPRLKVPEYNWWNECLHGVARAG 108

Query: 71  RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--LGNA-- 126
                            AT FP  I   ASF+++L K IG  +S EARA ++  + N   
Sbjct: 109 Y----------------ATVFPQSISVAASFDKNLMKDIGSVISDEARAKHHEFIRNGKR 152

Query: 127 ----GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
               GL FWSPNIN+ RDPRWGR  ET GEDPY+ G  A  ++ GLQD         SD 
Sbjct: 153 GIYTGLDFWSPNINIFRDPRWGRGHETYGEDPYLTGELASQFIEGLQD---------SDG 203

Query: 183 RPLKISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
           + LK  A  KH+A +      G +  R  FD  V+++D+ ET++  F   V E  V S+M
Sbjct: 204 KYLKTIATSKHFAVHS-----GPEPLRHTFDVDVSDRDLYETYLPAFRKTVKEAKVYSIM 258

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
            +YNR  G        LLNQ +R  W F GY+VSDC +IQ I   HK  +   E A   V
Sbjct: 259 GAYNRFRGESCSGHDFLLNQLLREQWGFEGYVVSDCGAIQDIHTGHKIASTAAEAAAIGV 318

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLG 358
              G DL+CG+YYT+ T  AV +G I+E +ID +++ L++   RLG FD   +  Y  + 
Sbjct: 319 -SGGCDLNCGNYYTHLTE-AVAEGLISEEEIDIAVKRLFLARFRLGMFDPEEAVSYAQIP 376

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
              +C+  H  LA +AA++ +VLLKN    LPL+   IK +A++GP+A+  ++++GNY G
Sbjct: 377 FGIVCSEAHNTLARQAAQKSMVLLKNQKNLLPLSVDKIKRIAVIGPNADNVESLLGNYHG 436

Query: 419 TPCRYTSPMDG 429
            P +  + +DG
Sbjct: 437 IPKKPVTFLDG 447



 Score =  156 bits (394), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 106/313 (33%), Positives = 156/313 (49%), Gaps = 54/313 (17%)

Query: 452 SMIPAAIDAAKNADATVIVAGLDLSVEAEGKD----------RVDLLLPGFQTELINKVA 501
           S I  A+  AK+AD  V+V GL   +E E  D          R  + LP  Q  L+  V 
Sbjct: 615 SKIDEAVAMAKSADLAVVVLGLSQRLEGESMDVVTPGFDRGDRTAITLPAQQEALLKAVK 674

Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
           +  K PV LV+ +  A+ IN+AK N  + +I+  GYPGEEGG A+ADV+FG YNP GRLP
Sbjct: 675 ETGK-PVILVLNAGSAMAINWAKEN--VDAIISAGYPGEEGGNALADVVFGDYNPAGRLP 731

Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
           IT+Y++     P+    ++      GRTY++F+G  +YPFGYGLSYT+F YK    P  V
Sbjct: 732 ITYYQSVEDLPPFEDYDMK------GRTYRYFEGKPLYPFGYGLSYTRFSYKDLEVPAKV 785

Query: 622 DIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY 681
           +                            D V+         + V N+G   G EVV +Y
Sbjct: 786 NAG--------------------------DPVQI-------SVTVTNIGSRAGDEVVQLY 812

Query: 682 SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
                 +    I+Q+ G++R+ +  G+S  V FT++A + L +++  +  ++  G  +I 
Sbjct: 813 LNDKEASTMRPIRQLEGFQRIHLKPGESKVVNFTLSA-RQLSMINGESKRVIEEGVFSIH 871

Query: 741 VGEGVGGVSFPLQ 753
           VG    G    LQ
Sbjct: 872 VGGEQPGFDGKLQ 884


>gi|294627323|ref|ZP_06705909.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
 gi|292598405|gb|EFF42556.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
          Length = 886

 Score =  295 bits (756), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 176/446 (39%), Positives = 244/446 (54%), Gaps = 46/446 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA  LV +M+  EKV Q  + A  +PRLG+P YEWWSE LHG++  G            
Sbjct: 36  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 84

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N SL +++G  VSTEARA +N            AGLT WSP
Sbjct: 85  -----ATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSP 139

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ  E +++ R          A  KH
Sbjct: 140 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG-EDLDHPR--------TIATPKH 190

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +         R  FD  V+  D++ T+   F   + EG   SVMC+YN ++G P CA
Sbjct: 191 IAVHSGPE---PGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACA 247

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              LLN  +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y
Sbjct: 248 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 306

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
            +    A+++G + EA +D SL  L+    RLG  +   +  Y  LG  ++ N  H  LA
Sbjct: 307 RDLGT-AIERGDVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 365

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA + IVLLKND   LPL  G    LA++GP+A+A  A+  NY+GT     +P+ G  
Sbjct: 366 LQAAAESIVLLKNDANTLPLKAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 423

Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
               ++ ++YA G A +      MIP
Sbjct: 424 QRFGAQQVSYAQG-APLAAGVPGMIP 448



 Score =  132 bits (332), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 90/282 (31%), Positives = 138/282 (48%), Gaps = 45/282 (15%)

Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           V G +L ++    +G DR D+ LP  Q  L+ + A A+  P+ +V+MS  AV +N+AK +
Sbjct: 628 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 686

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
                  W  YPG+ GG AIA ++ G  NPGGRLP+T+Y +      Y S  ++      
Sbjct: 687 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK------ 738

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++F G  ++PFGYGLSYT+F Y             D  Q       T+    P   
Sbjct: 739 GRTYRYFKGEPLFPFGYGLSYTRFAY-------------DAPQLS---TTTLQAGNP--- 779

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
                              V N G   G EV  VY + P    + ++ ++G++RV +AAG
Sbjct: 780 -------------LQVTTTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAAG 826

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
           +   + F ++A ++L  VD +    + +G +T+ VG G  G 
Sbjct: 827 EQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGGQPGT 867


>gi|289670678|ref|ZP_06491753.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
           NCPPB 4381]
          Length = 886

 Score =  295 bits (755), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 174/446 (39%), Positives = 240/446 (53%), Gaps = 46/446 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA  LV +M+  EKV Q  + A  +PRLG+P YEWWSE LHG++  G            
Sbjct: 36  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGH----------- 84

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N +L +++G  VSTEARA +N            AGLT WSP
Sbjct: 85  -----ATVFPQAIGLAASWNTNLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 139

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ           D    +  A  KH
Sbjct: 140 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG---------EDLNHPRTIATPKH 190

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +         R  FD  V+  D++ T+   F   + +G   SVMC+YN ++G P CA
Sbjct: 191 LAVHSGPE---PGRHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCAYNSLHGTPACA 247

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              LLN  +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y
Sbjct: 248 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 306

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
                 A+++G + EA +D SL  L+    RLG  +   +  Y  LG  ++ N  H  LA
Sbjct: 307 RELGT-AIERGDVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 365

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA + IVLLKND   LPLN G    LA++GP+A+A  A+  NY+GT     +P+ G  
Sbjct: 366 LQAAAESIVLLKNDANTLPLNAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 423

Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
               ++ + YA G A +      MIP
Sbjct: 424 QRFGAQQVRYAQG-APLAAGVPGMIP 448



 Score =  136 bits (343), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 95/296 (32%), Positives = 141/296 (47%), Gaps = 52/296 (17%)

Query: 463 NADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVI 512
            +DA V   GL   VE E          G DR D+ LP  Q  L+ + A A+  P+ +V+
Sbjct: 614 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVL 672

Query: 513 MSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI 572
           MS  AV +N+AK +       W  YPG+ GG AIA ++ G  NPGGRLP+T+Y +     
Sbjct: 673 MSGSAVALNWAKTHADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLP 730

Query: 573 PYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCR 632
            Y S  ++      GRTY++F G  ++PFGYGLSYT+F Y             D  Q   
Sbjct: 731 AYVSYDMK------GRTYRYFKGEPLFPFGYGLSYTRFAY-------------DAPQLS- 770

Query: 633 DINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI 692
             + T+    P                      V N G   G EV  VY + P    + +
Sbjct: 771 --STTLQAGNP----------------LQVTTTVRNTGTHAGDEVAQVYLQYPDRPQSPL 812

Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
           + ++G++RV +AAG+   + F ++A ++L  VD +    + +G +T+ VG G  G 
Sbjct: 813 RSLVGFQRVHLAAGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGGQPGT 867


>gi|294665226|ref|ZP_06730524.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292605014|gb|EFF48367.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 886

 Score =  295 bits (755), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 176/446 (39%), Positives = 244/446 (54%), Gaps = 46/446 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA  LV +M+  EKV Q  + A  +PRLG+P YEWWSE LHG++  G            
Sbjct: 36  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 84

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N SL +++G  VSTEARA +N            AGLT WSP
Sbjct: 85  -----ATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSP 139

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ  E +++ R          A  KH
Sbjct: 140 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG-EDLDHPR--------TIATPKH 190

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +         R  FD  V+  D++ T+   F   + EG   SVMC+YN ++G P CA
Sbjct: 191 IAVHSGPE---PGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACA 247

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              LLN  +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y
Sbjct: 248 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 306

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
            +    A+++G + EA +D SL  L+    RLG  +   +  Y  LG  ++ N  H  LA
Sbjct: 307 RDLGT-AIERGDVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 365

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA + IVLLKND   LPL  G    LA++GP+A+A  A+  NY+GT     +P+ G  
Sbjct: 366 LQAAAESIVLLKNDANTLPLKAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 423

Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
               ++ ++YA G A +      MIP
Sbjct: 424 QRFGAQQVSYAQG-APLAAGVPGMIP 448



 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 89/282 (31%), Positives = 138/282 (48%), Gaps = 45/282 (15%)

Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           V G +L ++    +G DR D+ LP  Q  L+ + A A+  P+ +V+MS  AV +N+AK +
Sbjct: 628 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 686

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
                  W  YPG+ GG A+A ++ G  NPGGRLP+T+Y +      Y S  ++      
Sbjct: 687 ADAIVAAW--YPGQSGGTAMARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK------ 738

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++F G  ++PFGYGLSYT+F Y             D  Q       T+    P   
Sbjct: 739 GRTYRYFKGEPLFPFGYGLSYTRFAY-------------DAPQLS---TTTLQAGNP--- 779

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
                              V N G   G EV  VY + P    + ++ ++G++RV +AAG
Sbjct: 780 -------------LQVTTTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAAG 826

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
           +   + F ++A ++L  VD +    + +G +T+ VG G  G 
Sbjct: 827 EQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGGQPGT 867


>gi|227828570|ref|YP_002830350.1| glycoside hydrolase [Sulfolobus islandicus M.14.25]
 gi|229585800|ref|YP_002844302.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.27]
 gi|227460366|gb|ACP39052.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           M.14.25]
 gi|228020850|gb|ACP56257.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           M.16.27]
          Length = 755

 Score =  295 bits (755), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 224/700 (32%), Positives = 351/700 (50%), Gaps = 116/700 (16%)

Query: 85  VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWG 144
           V  AT+FP  I   ++++  L +++  T+  +A+ +    N  L   SP ++V RDPRWG
Sbjct: 98  VKTATAFPQAIGLASTWDPDLIREVSSTIRYQAKLIGT--NQCL---SPVLDVCRDPRWG 152

Query: 145 RVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEG 204
           R  ET GED Y+V    + YV+GLQ                ++ A  KH+AA+     EG
Sbjct: 153 RCEETYGEDQYLVASIGLAYVKGLQGEN-------------ELIATVKHFAAHGFP--EG 197

Query: 205 NDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRG 264
             R      V  ++++E F+ PFE+ +  G   SVM +Y+ ++GIP  ++ +LL + +R 
Sbjct: 198 G-RNIAPVHVGNRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLTKILRQ 256

Query: 265 DWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD-----LDCGDYYTNFTMG 319
           +W F G +VSD D+I+ +   HK   + KE A+   L+AG+D     +DC   +    + 
Sbjct: 257 EWGFEGIVVSDYDAIRQLEAIHKVSLNKKEAAIL-ALEAGVDTEFPNIDC---FGEPLLE 312

Query: 320 AVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGI 379
           AV++G I+E+ ID ++  +  +  +LG F+     +N     + N +  ELA + AR+ I
Sbjct: 313 AVKEGLISESIIDRAVERVLRIKEKLGLFNNHYINENNVPEKLDNSKSRELALDVARKSI 372

Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT-------PCRYTSPMDGFYA 432
           VLLKNDN  LPLN  NI T+A++GP+AN  + ++G+Y  T            + ++G   
Sbjct: 373 VLLKNDN-ILPLNK-NIGTIAVIGPNANEPRNLLGDYTYTGHLNADGGIEVVTVLEGI-- 428

Query: 433 YSKVIN-----YAPGCADIVCQNNSMIPAAIDAAKNADATVIV----AGLDLS------- 476
             KV N     YA GC DI  ++      AI+ AK  D  + V    +GL LS       
Sbjct: 429 MRKVSNNTNVLYAKGC-DIAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLSWTDVPGK 487

Query: 477 --------VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
                   V  EG DR  L LPG Q EL+ ++    K P+ LV+++   + ++   N  +
Sbjct: 488 DEFEKYQAVTGEGNDRTSLRLPGVQEELLKELHKTGK-PIILVLVNGRPLALSSIFN--E 544

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMP--LRPVNNF 585
           + +I+   +PGEEGG AIADVIFG YNP GRLPI++  +   + I Y   P  LRP    
Sbjct: 545 VNAIIDAWFPGEEGGNAIADVIFGDYNPSGRLPISFPIDTGQIPIYYNRKPSSLRP---- 600

Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPP 644
               Y       ++PFGYGLSYT+FKY  +  +PK V+                      
Sbjct: 601 ----YVMMKSKPLFPFGYGLSYTEFKYSNLEVTPKEVN---------------------- 634

Query: 645 CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFI 703
                         K    +EVEN+GK +G E V +Y SK        IK++ G+ +V++
Sbjct: 635 -----------SSGKIKISLEVENVGKREGEETVQLYISKQYSGVSRPIKELKGFAKVYL 683

Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
              +  K+ F++   ++L   D     ++ +G + IL+G+
Sbjct: 684 KPNEKRKITFSL-PLEALAFYDQYMRLIIDTGDYEILIGK 722


>gi|418518029|ref|ZP_13084183.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
 gi|410705279|gb|EKQ63755.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
          Length = 886

 Score =  295 bits (755), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 177/446 (39%), Positives = 241/446 (54%), Gaps = 46/446 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA  LV +M+  EKV Q  + A  +PRLG+P YEWWSE LHG++  G            
Sbjct: 36  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 84

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N SL +++G  VSTEARA +N            AGLT WSP
Sbjct: 85  -----ATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSP 139

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ           D    +  A  KH
Sbjct: 140 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG---------EDLNHPRTIATPKH 190

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +         R  FD  V+  D++ T+   F   + EG   SVMC+YN ++G P CA
Sbjct: 191 IAVHSGPE---PGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCA 247

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              LLN  +RGDW F G++VSDCD+I  + + H F  D    +VA  LKAG DL+CG  Y
Sbjct: 248 ADWLLNGRVRGDWGFKGFVVSDCDAIDDMTQFHYFRPDNAGSSVA-ALKAGHDLNCGHAY 306

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
                 A+ +G++ EA +D SL  L+    RLG  +   +  Y  LG  ++ N  H  LA
Sbjct: 307 RELGT-AIARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNVAHRALA 365

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA + IVLLKND   LPL  G    LA++GP+A+A  A+  NY+GT     +P+ G  
Sbjct: 366 LQAAAESIVLLKNDANTLPLRAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 423

Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
               ++ ++YA G A +      MIP
Sbjct: 424 QRFGAQQVSYAQG-APLAAGVPGMIP 448



 Score =  132 bits (333), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 89/282 (31%), Positives = 139/282 (49%), Gaps = 45/282 (15%)

Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           V G +L ++    +G DR D+ LP  Q  L+ + A A+  P+ +V+MS  AV +N+AK +
Sbjct: 628 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 686

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
                  W  YPG+ GG AIA ++ G  NPGGRLP+T+Y +      Y S  ++      
Sbjct: 687 ADAIMAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK------ 738

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++F G  ++PFGYGLSYT+F Y    +P+     L      + I            
Sbjct: 739 GRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQLSTTTLQAGNPLQVI------------ 783

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
                              V N G   G EV  VY + P    + ++ ++G++RV +AAG
Sbjct: 784 -----------------ATVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAAG 826

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
           +   + F ++A ++L  VD +    + +G +T+ VG G  G 
Sbjct: 827 EQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGGQPGT 867


>gi|238620766|ref|YP_002915592.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.4]
 gi|238381836|gb|ACR42924.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           M.16.4]
          Length = 755

 Score =  295 bits (754), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 224/700 (32%), Positives = 351/700 (50%), Gaps = 116/700 (16%)

Query: 85  VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWG 144
           V  AT+FP  I   ++++  L +++  T+  +A+ +    N  L   SP ++V RDPRWG
Sbjct: 98  VKTATAFPQAIGLASTWDPDLIREVSSTIRYQAKLIGT--NQCL---SPVLDVCRDPRWG 152

Query: 145 RVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEG 204
           R  ET GED Y+V    + YV+GLQ                ++ A  KH+AA+     EG
Sbjct: 153 RCEETYGEDQYLVASIGLAYVKGLQGEN-------------ELIATVKHFAAHGFP--EG 197

Query: 205 NDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRG 264
             R      V  ++++E F+ PFE+ +  G   SVM +Y+ ++GIP  ++ +LL + +R 
Sbjct: 198 G-RNIAPVHVGNRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLTKILRQ 256

Query: 265 DWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD-----LDCGDYYTNFTMG 319
           +W F G +VSD D+I+ +   HK   + KE A+   L+AG+D     +DC   +    + 
Sbjct: 257 EWGFEGIVVSDYDAIRQLEAIHKVSLNKKEAAIL-ALEAGVDTEFPNIDC---FGEPLLE 312

Query: 320 AVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGI 379
           AV++G I+E+ ID ++  +  +  +LG F+     +N     + N +  ELA + AR+ I
Sbjct: 313 AVKEGLISESIIDRAVERVLRIKEKLGLFNDHYINENNVPEKLDNSKSRELALDVARKSI 372

Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT-------PCRYTSPMDGFYA 432
           VLLKNDN  LPLN  NI T+A++GP+AN  + ++G+Y  T            + ++G   
Sbjct: 373 VLLKNDN-ILPLNK-NIGTIAVIGPNANEPRNLLGDYTYTGHLNADVGIEVVTVLEGI-- 428

Query: 433 YSKVIN-----YAPGCADIVCQNNSMIPAAIDAAKNADATVIV----AGLDLS------- 476
             KV N     YA GC DI  ++      AI+ AK  D  + V    +GL LS       
Sbjct: 429 MRKVSNNTNVLYAKGC-DIAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLSWTDVPGK 487

Query: 477 --------VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
                   V  EG DR  L LPG Q EL+ ++    K P+ LV+++   + ++   N  +
Sbjct: 488 DEFEKYQAVTGEGNDRTSLRLPGVQEELLKELHKTGK-PIILVLVNGRPLALSSIFN--E 544

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMP--LRPVNNF 585
           + +I+   +PGEEGG AIADVIFG YNP GRLPI++  +   + I Y   P  LRP    
Sbjct: 545 VNAIIDAWFPGEEGGNAIADVIFGDYNPSGRLPISFPIDTGQIPIYYNRKPSSLRP---- 600

Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPP 644
               Y       ++PFGYGLSYT+FKY  +  +PK V+                      
Sbjct: 601 ----YVMMKSKPLFPFGYGLSYTEFKYSNLEVTPKEVN---------------------- 634

Query: 645 CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFI 703
                         K    +EVEN+GK +G E V +Y SK        IK++ G+ +V++
Sbjct: 635 -----------SSGKIKISLEVENVGKREGEETVQLYISKQYSGVSRPIKELKGFAKVYL 683

Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
              +  K+ F++   ++L   D     ++ +G + IL+G+
Sbjct: 684 KPNEKRKITFSL-PLEALAFYDQYMRLIIDTGDYEILIGK 722


>gi|289664871|ref|ZP_06486452.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. vasculorum
           NCPPB 702]
          Length = 886

 Score =  295 bits (754), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 174/446 (39%), Positives = 239/446 (53%), Gaps = 46/446 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA  LV  M+  EKV Q  + A  +PRLG+P YEWWSE LHG++  G            
Sbjct: 36  QRAAALVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGH----------- 84

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N +L +++G  VSTEARA +N            AGLT WSP
Sbjct: 85  -----ATVFPQAIGLAASWNTNLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 139

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ           D    +  A  KH
Sbjct: 140 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG---------EDLNHPRTIATPKH 190

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +         R  FD  V+  D++ T+   F   + +G   SVMC+YN ++G P CA
Sbjct: 191 LAVHSGPE---PGRHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCAYNSLHGTPACA 247

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              LLN  +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y
Sbjct: 248 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 306

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
                 A+++G + EA +D SL  L+    RLG  +   +  Y  LG  ++ N  H  LA
Sbjct: 307 RELGT-AIERGDVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 365

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA + IVLLKND   LPLN G    LA++GP+A+A  A+  NY+GT     +P+ G  
Sbjct: 366 LQAAAESIVLLKNDANTLPLNAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 423

Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
               ++ + YA G A +      MIP
Sbjct: 424 QRFGAQQVRYAQG-APLAAGVPGMIP 448



 Score =  136 bits (342), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 95/296 (32%), Positives = 142/296 (47%), Gaps = 52/296 (17%)

Query: 463 NADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVI 512
            +DA V   GL   VE E          G DR D+ LP  Q  L+ + A A+  P+ +V+
Sbjct: 614 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVL 672

Query: 513 MSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI 572
           MS  AV +N+AK +       W  YPG+ GG AIA ++ G  NPGGRLP+T+Y +     
Sbjct: 673 MSGSAVALNWAKTHADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLP 730

Query: 573 PYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCR 632
            Y S  ++      GRTY++F G  ++PFGYGLSYT+F Y    +P+     L   Q   
Sbjct: 731 AYVSYDMK------GRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQLSTTAL---QAGN 778

Query: 633 DINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI 692
            +  T                            V N G   G EV  VY + P    + +
Sbjct: 779 PLQVTT--------------------------TVRNTGTRAGDEVAQVYLQYPDRPQSPL 812

Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
           + ++G++RV +AAG+   + F ++A ++L  VD +    + +G +T+ VG G  G 
Sbjct: 813 RSLVGFQRVHLAAGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGGQPGT 867


>gi|319788503|ref|YP_004147978.1| glycoside hydrolase [Pseudoxanthomonas suwonensis 11-1]
 gi|317467015|gb|ADV28747.1| glycoside hydrolase family 3 domain protein [Pseudoxanthomonas
           suwonensis 11-1]
          Length = 916

 Score =  295 bits (754), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 184/482 (38%), Positives = 268/482 (55%), Gaps = 43/482 (8%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           P+ D  L + ERA  LV RMTL EK  QM + +  + RLGLP Y+WW+EALHGV+  G  
Sbjct: 49  PWLDTSLSFEERAAALVSRMTLEEKAAQMQNDSPAIERLGLPAYDWWNEALHGVARAG-- 106

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGN--- 125
                         GAT FP  I   ASF+  L  ++   +S EARA ++     G    
Sbjct: 107 --------------GATVFPQAIGMAASFDVPLMDQVSAAISDEARAKHHDFLRKGEHGR 152

Query: 126 -AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLTFWSPNIN+ RDPRWGR  ET GEDP++  R  +++VRGLQ ++  +  +  D + 
Sbjct: 153 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTTRMGVSFVRGLQGMD-PQTGQPLDPKY 211

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
            K+ A  KH+A +   +    DR  FD   ++QD+ +T++  FE  V E DV +VM +YN
Sbjct: 212 RKLDATAKHFAVH---SGPEADRHTFDVHPSKQDLYDTYLPAFESLVKEADVYAVMGAYN 268

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           RV G        LL  T+R DW F GY++SDC +I  I ++HK + +T E+A A  +K G
Sbjct: 269 RVYGESASGSKFLLLDTLRRDWGFDGYVMSDCWAIVDIWKNHKIV-ETPEEAAALAVKNG 327

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNI 362
            +L+CG  Y +    AV++G I+EA++D +L  L++  M LG FD   Q  +  +  +  
Sbjct: 328 TELNCGSTYADHLPVAVKKGLISEAELDDALTRLFVARMELGMFDPPEQVRWAQVPYSVN 387

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
            + +H  LA + A++ +VLLKND G LPL+  +I+ LA+VGP A+ T A++GNY GTP  
Sbjct: 388 QSAEHDALARKMAQESLVLLKND-GVLPLSK-DIRRLAVVGPTADDTMALLGNYYGTPAD 445

Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
             + + G      +   APG   +  +   ++    D A    AT ++    L  EA   
Sbjct: 446 PVTILRG------IREAAPGVDVVYARGVDLVEGRDDPA----ATPLIEPQYLRPEAGST 495

Query: 483 DR 484
           +R
Sbjct: 496 ER 497



 Score =  146 bits (368), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 102/312 (32%), Positives = 155/312 (49%), Gaps = 55/312 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A++AA +ADA V V GL   VE E          G DR D+ LP  Q +L+  V    K 
Sbjct: 643 ALEAANSADAVVFVGGLTGDVEGEEMKVDYPGFAGGDRTDIRLPATQQKLLEAVHATGK- 701

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV +V+ +  A+ I++A+ N  +  IL   YPG+ GG A+ + +FG YNPGGRLP+T+Y 
Sbjct: 702 PVVMVLTTGSALGIDWARRN--VPGILVAWYPGQRGGTAVGEALFGDYNPGGRLPVTFYS 759

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
           A+    P+    ++       RTY++F G  ++PFG+GLSYT F Y          +KLD
Sbjct: 760 ADEKLPPFDDYAMKE------RTYRYFTGQPLFPFGHGLSYTSFGYS--------GLKLD 805

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           + +                 A   D+V       T  + V+N GK  G EVV +Y  P  
Sbjct: 806 RKR-----------------AGAGDEV-------TVSVTVKNQGKRAGDEVVQLYLAPVK 841

Query: 687 IAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVGEG 744
                 +K++ G++RV +  G+S  V F++   + L++ D AA       G + + VG  
Sbjct: 842 PQRERALKELRGFQRVHLQPGESRTVTFSIVPERDLRVYDEAAGRYTVDPGRYEVQVGAS 901

Query: 745 VGGV--SFPLQL 754
              +  S PL++
Sbjct: 902 SADIRASVPLEV 913


>gi|255572557|ref|XP_002527212.1| beta-glucosidase, putative [Ricinus communis]
 gi|223533388|gb|EEF35138.1| beta-glucosidase, putative [Ricinus communis]
          Length = 349

 Score =  295 bits (754), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 145/313 (46%), Positives = 192/313 (61%), Gaps = 26/313 (8%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
            + +C+  L  P RA  L+  +TL EK++Q+ D A G+PR G+P YEWWSE+LHG++  G
Sbjct: 39  SYTFCNQSLSVPTRAHSLISLLTLEEKIKQLSDNASGIPRFGIPPYEWWSESLHGIAING 98

Query: 71  RRTNSPPGTHFD-SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
                 PG  F    V  AT FP VI++ A+FN +LW  IG  ++ EARAM+N+G +GLT
Sbjct: 99  ------PGVSFTIGPVSAATGFPQVIISAAAFNRTLWFLIGSAIAIEARAMHNVGQSGLT 152

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ-----------------DVE 172
           FW+PN+N+ RDPRWGR  ETPGEDP +   YAI +V+G Q                   E
Sbjct: 153 FWAPNVNIFRDPRWGRGQETPGEDPMLTSAYAIEFVKGFQGGNWKSGVSGSGSGRYGFGE 212

Query: 173 GVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVN 232
                 D     L +SACCKH  AYDL+ W    R+ F++ VTEQD+++T+  PF  C+ 
Sbjct: 213 KRMLRDDDGDDGLMLSACCKHLTAYDLEKWGNFSRYSFNAVVTEQDLEDTYQPPFRSCIE 272

Query: 233 EGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDT 292
           EG  S +MCSYN VNG+P CA   LL Q  R +W F GYIVSDCD++ TI E   + + +
Sbjct: 273 EGKASCLMCSYNEVNGVPACAREDLL-QKAREEWGFEGYIVSDCDAVATIFEYQNY-SKS 330

Query: 293 KEDAVARVLKAGL 305
            EDAVA  LKAG+
Sbjct: 331 AEDAVAIALKAGM 343


>gi|188990656|ref|YP_001902666.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
 gi|167732416|emb|CAP50610.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
          Length = 888

 Score =  294 bits (753), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 185/477 (38%), Positives = 254/477 (53%), Gaps = 50/477 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA  LV +M+  EKV Q  + A  +PRLG+P YEWWSE LHG++  G            
Sbjct: 38  QRAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 86

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N  L +++G  VSTEARA +N            AGLT WSP
Sbjct: 87  -----ATVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 141

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ         D    P  I A  KH
Sbjct: 142 NINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQG--------DDLEHPRTI-ATPKH 192

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +         R  FD  V+ +D++ T+   F   + EG   SVMC+YN ++G P CA
Sbjct: 193 IAVHSGPE---PGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACA 249

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              LLN  +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y
Sbjct: 250 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAS-LKAGHDLNCGTAY 308

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG--SPQYKNLGKNNICNPQHIELA 371
                 A+++G++ EA +D SL  L+    RLG        +Y  LG  +I N  +  LA
Sbjct: 309 RALGT-AIERGEVDEALLDQSLVRLFAARYRLGELQAPRKDRYARLGAKDIDNAGNRALA 367

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA + IVLLKN N  LPL  G    LA++GP+A+A  A+  NY+GT  +  +P+ G  
Sbjct: 368 LQAAAESIVLLKNANATLPLKAGT--RLAVIGPNADALAALEANYQGTSSQPVTPLLGLR 425

Query: 432 AY--SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL-SVEAEGKDRV 485
               ++ + YA G A +      MIP   + A  +D    + G    +VE EG  RV
Sbjct: 426 QRFGAQQVRYAQG-APLAAGVPGMIP---ETALRSDGKPGLRGEYFDNVELEGAPRV 478



 Score =  134 bits (338), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 89/287 (31%), Positives = 142/287 (49%), Gaps = 45/287 (15%)

Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           V G +L ++    +G DR D+ LP  Q  L+ + A A+  P+ +V+MS  AV +N+AK +
Sbjct: 630 VEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 688

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
                  W  YPG+ GG AIA  + G  NPGGRLP+T+Y +     PY S  ++      
Sbjct: 689 ADAIVAAW--YPGQSGGTAIARALAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK------ 740

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++F G  ++PFGYGLSYT+F Y+   +P+   + +   Q    +  T         
Sbjct: 741 GRTYRYFKGEALFPFGYGLSYTRFAYE---TPR---LSVTTLQAGSPLQVTT-------- 786

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
                              V N G+  G EV  VY + P    + ++ ++G++RV +  G
Sbjct: 787 ------------------TVRNTGERAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLQPG 828

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQ 753
           +   + FT++A ++L  VD     ++ +G + + VG G      P Q
Sbjct: 829 EQRTLTFTLDA-RALSDVDRTGTRVVEAGDYRLFVGGGQPDTGAPGQ 874


>gi|21232323|ref|NP_638240.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|21114093|gb|AAM42164.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
          Length = 888

 Score =  294 bits (753), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 185/477 (38%), Positives = 254/477 (53%), Gaps = 50/477 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA  LV +M+  EKV Q  + A  +PRLG+P YEWWSE LHG++  G            
Sbjct: 38  QRAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 86

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N  L +++G  VSTEARA +N            AGLT WSP
Sbjct: 87  -----ATVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 141

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ         D    P  I A  KH
Sbjct: 142 NINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQG--------DDLEHPRTI-ATPKH 192

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +         R  FD  V+ +D++ T+   F   + EG   SVMC+YN ++G P CA
Sbjct: 193 IAVHSGPE---PGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACA 249

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              LLN  +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y
Sbjct: 250 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAS-LKAGHDLNCGTAY 308

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG--SPQYKNLGKNNICNPQHIELA 371
                 A+++G++ EA +D SL  L+    RLG        +Y  LG  +I N  +  LA
Sbjct: 309 RALGT-AIERGEVDEALLDQSLVRLFAARYRLGELQAPRKDRYARLGAKDIDNAGNRALA 367

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA + IVLLKN N  LPL  G    LA++GP+A+A  A+  NY+GT  +  +P+ G  
Sbjct: 368 LQAAAESIVLLKNANATLPLKAGT--RLAVIGPNADALAALEANYQGTSSQPVTPLLGLR 425

Query: 432 AY--SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL-SVEAEGKDRV 485
               ++ + YA G A +      MIP   + A  +D    + G    +VE EG  RV
Sbjct: 426 QRFGAQQVRYAQG-APLAAGVPGMIP---ETALRSDGKPGLRGEYFDNVELEGAPRV 478



 Score =  132 bits (333), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 88/287 (30%), Positives = 133/287 (46%), Gaps = 45/287 (15%)

Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           V G +L ++    +G DR D+ LP  Q  L+ + A A+  P+ +V+MS  AV +N+AK +
Sbjct: 630 VEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 688

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
                  W  YPG+ GG AIA  + G  NPGGRLP+T+Y +     PY S  ++      
Sbjct: 689 ADAIVAAW--YPGQSGGTAIARALAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK------ 740

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++F G  ++PFGYGLSYT F Y                 Q        G+      
Sbjct: 741 GRTYRYFKGEALFPFGYGLSYTSFAYDAP--------------QLSSTTLQAGS------ 780

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
                              V N G   G EV  VY + P    + ++ ++G++RV +  G
Sbjct: 781 ------------PLQVTTTVRNTGTRAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLQPG 828

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQ 753
           +   + FT++A ++L  VD      + +G + + VG G      P Q
Sbjct: 829 EQRTLTFTLDA-RALSDVDRTGTRAVEAGDYRLFVGGGQPDTGAPGQ 874


>gi|418519424|ref|ZP_13085476.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
 gi|410704868|gb|EKQ63347.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
          Length = 886

 Score =  294 bits (752), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 175/446 (39%), Positives = 240/446 (53%), Gaps = 46/446 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA  LV +M+  EKV Q  + A  +PRLG+P YEWWSE LHG++  G            
Sbjct: 36  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 84

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N SL +++G  VSTEARA +N            AGLT WSP
Sbjct: 85  -----ATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSP 139

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ           D    +  A  KH
Sbjct: 140 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG---------EDLNHPRTIATPKH 190

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +         R  FD  V+  D++ T+   F   + EG   SVMC+YN ++G P CA
Sbjct: 191 IAVHSGPE---PGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCA 247

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              LLN  +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y
Sbjct: 248 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 306

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
                 A+ +G++ EA +D SL  L+    RLG  +   +  Y  LG  ++ N  H  LA
Sbjct: 307 RELGT-AIARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 365

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA + IVLLKND   LPL  G    LA++GP+A+A  A+  NY+GT     +P+ G  
Sbjct: 366 LQAAAESIVLLKNDANTLPLRAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 423

Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
               ++ ++YA G A +      MIP
Sbjct: 424 QRFGAQQVSYAQG-APLAAGVPGMIP 448



 Score =  132 bits (331), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 90/282 (31%), Positives = 138/282 (48%), Gaps = 45/282 (15%)

Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           V G +L ++    +G DR D+ LP  Q  L+ + A A+  P+ +V+MS  AV +N+AK +
Sbjct: 628 VEGEELRIDVPGFDGGDRNDIALPAPQQTLLER-AKASGKPLVVVLMSGSAVALNWAKTH 686

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
                  W  YPG+ GG AIA ++ G  NPGGRLP+T+Y +      Y S  ++      
Sbjct: 687 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK------ 738

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++F G  ++PFGYGLSYT+F Y             D  Q       T+    P   
Sbjct: 739 GRTYRYFKGEPLFPFGYGLSYTRFAY-------------DAPQLS---TTTLQAGNP--- 779

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
                              V N G   G EV  VY + P    + ++ ++G++RV +AAG
Sbjct: 780 -------------LQVTATVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAAG 826

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
           +   + F ++A ++L  VD +    + +G +T+ VG G  G 
Sbjct: 827 EQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGGQPGT 867


>gi|21243803|ref|NP_643385.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
 gi|21109396|gb|AAM37921.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
          Length = 886

 Score =  294 bits (752), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 175/446 (39%), Positives = 240/446 (53%), Gaps = 46/446 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA  LV +M+  EKV Q  + A  +PRLG+P YEWWSE LHG++  G            
Sbjct: 36  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 84

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N SL +++G  VSTEARA +N            AGLT WSP
Sbjct: 85  -----ATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSP 139

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ           D    +  A  KH
Sbjct: 140 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG---------EDLNHPRTIATPKH 190

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +         R  FD  V+  D++ T+   F   + EG   SVMC+YN ++G P CA
Sbjct: 191 IAVHSGPE---PGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCA 247

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              LLN  +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y
Sbjct: 248 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 306

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
                 A+ +G++ EA +D SL  L+    RLG  +   +  Y  LG  ++ N  H  LA
Sbjct: 307 RELGT-AIARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 365

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA + IVLLKND   LPL  G    LA++GP+A+A  A+  NY+GT     +P+ G  
Sbjct: 366 LQAAAESIVLLKNDANTLPLRAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 423

Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
               ++ ++YA G A +      MIP
Sbjct: 424 QRFGAQQVSYAQG-APLAAGVPGMIP 448



 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 91/282 (32%), Positives = 139/282 (49%), Gaps = 45/282 (15%)

Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           V G +L ++    +G DR D+ LP  Q  L+ + A A+  P+ +V+MS  AV +N+AK +
Sbjct: 628 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKMH 686

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
                  W  YPG+ GG AIA ++ G  NPGGRLP+T+Y +     PY S  ++      
Sbjct: 687 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK------ 738

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++F G  ++PFGYGLSYT+F Y             D  Q       T+    P   
Sbjct: 739 GRTYRYFKGEPLFPFGYGLSYTRFAY-------------DAPQLS---TTTLQAGNP--- 779

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
                              V N G   G EV  VY + P    + ++ ++G++RV +AAG
Sbjct: 780 -------------LQVTATVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAAG 826

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
           +   + F ++A ++L  VD +    + +G +T+ VG G  G 
Sbjct: 827 EQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGGQPGT 867


>gi|66767544|ref|YP_242306.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
 gi|66572876|gb|AAY48286.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
          Length = 888

 Score =  294 bits (752), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 185/477 (38%), Positives = 254/477 (53%), Gaps = 50/477 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA  LV +M+  EKV Q  + A  +PRLG+P YEWWSE LHG++  G            
Sbjct: 38  QRAAALVAQMSREEKVAQSMNAAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 86

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N  L +++G  VSTEARA +N            AGLT WSP
Sbjct: 87  -----ATVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 141

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ         D    P  I A  KH
Sbjct: 142 NINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQG--------DDLEHPRTI-ATPKH 192

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +         R  FD  V+ +D++ T+   F   + EG   SVMC+YN ++G P CA
Sbjct: 193 IAVHSGPE---PGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACA 249

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              LLN  +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y
Sbjct: 250 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAS-LKAGHDLNCGTAY 308

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG--SPQYKNLGKNNICNPQHIELA 371
                 A+++G++ EA +D SL  L+    RLG        +Y  LG  +I N  +  LA
Sbjct: 309 RALGT-AIERGEVDEALLDQSLVRLFAARYRLGELQAPRKDRYARLGAKDIDNAGNRALA 367

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA + IVLLKN N  LPL  G    LA++GP+A+A  A+  NY+GT  +  +P+ G  
Sbjct: 368 LQAAAESIVLLKNANATLPLKAGT--RLAVIGPNADALAALEANYQGTSSQPVTPLLGLR 425

Query: 432 AY--SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL-SVEAEGKDRV 485
               ++ + YA G A +      MIP   + A  +D    + G    +VE EG  RV
Sbjct: 426 QRFGAQQVRYAQG-APLAAGVPGMIP---ETALRSDGKPGLRGEYFDNVELEGAPRV 478



 Score =  132 bits (333), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 88/287 (30%), Positives = 133/287 (46%), Gaps = 45/287 (15%)

Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           V G +L ++    +G DR D+ LP  Q  L+ + A A+  P+ +V+MS  AV +N+AK +
Sbjct: 630 VEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 688

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
                  W  YPG+ GG AIA  + G  NPGGRLP+T+Y +     PY S  ++      
Sbjct: 689 ADAIVAAW--YPGQSGGTAIARALAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK------ 740

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++F G  ++PFGYGLSYT F Y                 Q        G+      
Sbjct: 741 GRTYRYFKGEALFPFGYGLSYTSFAYDAP--------------QLSSTTLQAGS------ 780

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
                              V N G   G EV  VY + P    + ++ ++G++RV +  G
Sbjct: 781 ------------PLQVTTTVRNTGTRAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLQPG 828

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQ 753
           +   + FT++A ++L  VD      + +G + + VG G      P Q
Sbjct: 829 EQRTLTFTLDA-RALSDVDRTGTRAVEAGDYRLFVGGGQPDTGAPGQ 874


>gi|325925754|ref|ZP_08187127.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
 gi|325543811|gb|EGD15221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
          Length = 874

 Score =  293 bits (751), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 174/446 (39%), Positives = 240/446 (53%), Gaps = 46/446 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA  LV +M+  EKV Q  + A  +PRLG+P YEWWSE LHG++  G            
Sbjct: 24  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 72

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N  L +++G  VSTEARA +N            AGLT WSP
Sbjct: 73  -----ATVFPQAIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 127

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ           D    +  A  KH
Sbjct: 128 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG---------EDLNHPRTIATPKH 178

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +         R  FD  V+ +D++ T+   F   + EG   SVMC+YN ++G P CA
Sbjct: 179 IAVHSGPE---PGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACA 235

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              LLN  +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y
Sbjct: 236 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 294

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
                 A+ +G++ EA +D SL  L+    RLG  +   +  Y  LG  ++ N  H  LA
Sbjct: 295 RELGT-AIARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 353

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA + IVLLKND   LPL  G    LA++GP+A+A  A+  NY+GT     +P+ G  
Sbjct: 354 LQAAAESIVLLKNDANTLPLKAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 411

Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
               ++ ++YA G A +      MIP
Sbjct: 412 QRFGAQQVSYAQG-APLAAGVPGMIP 436



 Score =  130 bits (327), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 87/282 (30%), Positives = 136/282 (48%), Gaps = 45/282 (15%)

Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           V G +L ++    +G DR D+ LP  Q  L+ + A A+  P+ +V+MS  AV +N+AK +
Sbjct: 616 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 674

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
                  W  YPG+ GG AIA ++ G  NPGGRLP+T+Y +      Y S  ++      
Sbjct: 675 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK------ 726

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++F G  ++ FGYGLSYT+F Y                 Q        G++     
Sbjct: 727 GRTYRYFKGEPLFAFGYGLSYTRFAYDAP--------------QLSTTTLQAGSS----- 767

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
                              V N G   G EV  VY + P    + ++ ++G++RV +AAG
Sbjct: 768 -------------LQVTTTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAAG 814

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
           +   + F ++A ++L  VD +    + +G +T+ VG G  G 
Sbjct: 815 EQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGGQPGT 855


>gi|384420163|ref|YP_005629523.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
 gi|353463076|gb|AEQ97355.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
          Length = 889

 Score =  293 bits (751), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 174/446 (39%), Positives = 242/446 (54%), Gaps = 46/446 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA DLV  M+  EKV Q  + A  +PRLG+P YEWWSE LHG++  G            
Sbjct: 39  QRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 87

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N  L +++G  VSTEARA +N            AGLT WSP
Sbjct: 88  -----ATVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGKDHKRYAGLTIWSP 142

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDP++ G+ A+ ++ GLQ         D    P  I A  KH
Sbjct: 143 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIHGLQG--------DDLDHPRTI-ATPKH 193

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +   +     R  FD  V+ +D++ T+   F   + EG   +VMC+YN ++G P CA
Sbjct: 194 LAVH---SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACA 250

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              L+N  +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y
Sbjct: 251 ADWLINGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 309

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
                 A+ +G++ EA +D SL  L+    RLG  +   +  Y  LG  ++ N QH  LA
Sbjct: 310 RELGT-AIARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAQHRALA 368

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA + IVLLKN+   LPL  G    LA++GP+A+A  A+  NY+GT     +P+ G  
Sbjct: 369 LQAAAESIVLLKNNANTLPLKAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 426

Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
               ++ ++YA G A +      MIP
Sbjct: 427 QRFGAQQVSYAQG-APLAAGVPGMIP 451



 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 89/280 (31%), Positives = 137/280 (48%), Gaps = 49/280 (17%)

Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           V G +L ++    +G DR D+ LP  Q  L+ + A A+  P+ +V+MS  AV +N+AK +
Sbjct: 631 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 689

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
                  W  YPG+ GG AIA ++ G  NPGGRLP+T+Y +      Y S  ++      
Sbjct: 690 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK------ 741

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++F G  ++PFGYGLSYT F Y                 Q        G+      
Sbjct: 742 GRTYRYFKGEPLFPFGYGLSYTCFAYDAP--------------QLSSTAVQAGS------ 781

Query: 647 AVLIDDVKCKDYKFTFQI--EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIA 704
                         T Q+   V N G   G EV  VY + P    + ++ ++G++RV +A
Sbjct: 782 --------------TLQVTTTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLA 827

Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
           AG+   + F ++A ++L  VD +    + +G +T+ VG G
Sbjct: 828 AGEQRTLTFNLDA-RALSDVDPSGQRAVEAGNYTLFVGGG 866


>gi|78048767|ref|YP_364942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
 gi|78037197|emb|CAJ24942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
          Length = 889

 Score =  293 bits (751), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 174/446 (39%), Positives = 240/446 (53%), Gaps = 46/446 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA  LV +M+  EKV Q  + A  +PRLG+P YEWWSE LHG++  G            
Sbjct: 39  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 87

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N  L +++G  VSTEARA +N            AGLT WSP
Sbjct: 88  -----ATVFPQSIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 142

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ           D    +  A  KH
Sbjct: 143 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG---------EDLNHPRTIATPKH 193

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +         R  FD  V+ +D++ T+   F   + EG   SVMC+YN ++G P CA
Sbjct: 194 IAVHSGPE---PGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACA 250

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              LLN  +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y
Sbjct: 251 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 309

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
                 A+ +G++ EA +D SL  L+    RLG  +   +  Y  LG  ++ N  H  LA
Sbjct: 310 RELGT-AIARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 368

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA + IVLLKND   LPL  G    LA++GP+A+A  A+  NY+GT     +P+ G  
Sbjct: 369 LQAAAESIVLLKNDANTLPLKAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 426

Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
               ++ ++YA G A +      MIP
Sbjct: 427 QRFGAQQVSYAQG-APLAAGVPGMIP 451



 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 87/282 (30%), Positives = 136/282 (48%), Gaps = 45/282 (15%)

Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           V G +L ++    +G DR D+ LP  Q  L+ + A A+  P+ +V+MS  AV +N+AK +
Sbjct: 631 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 689

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
                  W  YPG+ GG AIA ++ G  NPGGRLP+T+Y +      Y S  ++      
Sbjct: 690 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK------ 741

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++F G  ++ FGYGLSYT+F Y                 Q        G++     
Sbjct: 742 GRTYRYFKGEPLFAFGYGLSYTRFAYDAP--------------QLSTTTLQAGSS----- 782

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
                              V N G   G EV  VY + P    + ++ ++G++RV +AAG
Sbjct: 783 -------------LQVTTTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAAG 829

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
           +   + F ++A ++L  VD +    + +G +T+ VG G  G 
Sbjct: 830 EQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGGQPGT 870


>gi|167038437|ref|YP_001666015.1| glycoside hydrolase family 3 [Thermoanaerobacter pseudethanolicus
           ATCC 33223]
 gi|320116830|ref|YP_004186989.1| glycoside hydrolase family 3 domain-containing protein
           [Thermoanaerobacter brockii subsp. finnii Ako-1]
 gi|166857271|gb|ABY95679.1| glycoside hydrolase, family 3 domain protein [Thermoanaerobacter
           pseudethanolicus ATCC 33223]
 gi|319929921|gb|ADV80606.1| glycoside hydrolase family 3 domain protein [Thermoanaerobacter
           brockii subsp. finnii Ako-1]
          Length = 784

 Score =  293 bits (751), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 230/804 (28%), Positives = 378/804 (47%), Gaps = 133/804 (16%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGD--------------------LAYGV---PR 50
           Y D+     +R +DL+++MT+ EKV Q+                      ++YG+    R
Sbjct: 5   YLDSTQSVEKRVEDLLQQMTIEEKVAQLNSIWVYEILDDMKFSFDKAKRLMSYGIGQITR 64

Query: 51  LG----LPLYEWWSEALHGVSFI--GRRTNSPPGTHFDS----EVPGATSFPTVILTTAS 100
           LG    L   E    A     F+    R   P   H +S       GAT FP  I   ++
Sbjct: 65  LGGASNLSPRETVRIANQIQKFLIENTRLGIPALIHEESCSGYMAKGATIFPQTIGVAST 124

Query: 101 FNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRY 160
           +N  + +K+   +  + +A+           +P +++ RDPRWGR  ET GEDPY+V R 
Sbjct: 125 WNNEIVEKMASVIREQMKAV-----GARQALAPLLDITRDPRWGRTEETFGEDPYLVMRM 179

Query: 161 AINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQ 220
            ++Y+RGLQ          ++S    I A  KH+  Y   N EG   +   + + E++++
Sbjct: 180 GVSYIRGLQ----------TESLKEGIVATGKHFVGYG--NSEGGMNWA-PAHIPERELR 226

Query: 221 ETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQ 280
           E F+ PFE  V E  +SS+M  Y+ ++G+P     KLLN  +R DW F G +VSD  +I 
Sbjct: 227 EVFLYPFEAAVKEAKLSSIMPGYHELDGVPCHKSKKLLNDILRKDWGFEGIVVSDYFAIS 286

Query: 281 TIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQGKIAEADIDTSLRFL 338
            + E H   +D K+ A    L+AG+D++    DYY       ++ G+I    ++ +++ +
Sbjct: 287 QLYEYHHVTSD-KKGAAKLALEAGVDVELPSTDYYGLPLRELIESGEIDIDFVNEAVKRV 345

Query: 339 YIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKT 398
             +   LG F+     +          +  ELA + A++ IVLLKN+N  LPL   ++K+
Sbjct: 346 LKIKFELGLFENPYINEEKAVEIFDTNEQRELAYKIAQESIVLLKNENNLLPLKK-DLKS 404

Query: 399 LALVGPHANATKAMIGNYEGTPCR-------------YTSPM-------DGFYAYSKVIN 438
           +A++GP+A++ + MIG+Y   PC              + +P+       D +     V+ 
Sbjct: 405 IAVIGPNADSIRNMIGDY-AYPCHIESLLEMRETDNVFNTPLPESLEAKDIYVPIVTVLQ 463

Query: 439 -------------YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG-----LDLSVEAE 480
                        YA GC D++  +      A++ AK AD  V+V G      D     E
Sbjct: 464 GIKAKVSSNTEVLYAKGC-DVLNNSKDGFKEAVEIAKQADVAVVVVGDKSGLTDGCTSGE 522

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
            +DR DL LPG Q ELI  + +    PV +V+++   + I++     KI +I+    PGE
Sbjct: 523 SRDRADLNLPGVQEELIKAIYETGT-PVIVVLINGRPMSISWIAE--KIPAIIEAWLPGE 579

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEA-NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
           EGGRA+ADVIFG YNPGG+LPI+  ++   + + Y   P    +++ G   +    P +Y
Sbjct: 580 EGGRAVADVIFGDYNPGGKLPISIPQSVGQLPVYYYHKPSGGRSHWKGDYVELSTKP-LY 638

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           PFGYGLSYT+F Y                      N  +   K          V  +D  
Sbjct: 639 PFGYGLSYTEFSY---------------------TNLNISNRK----------VSLRDRM 667

Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNAC 718
               ++++N G + G EVV +Y     ++ T  +K++ G++R+ + AG+   V F + + 
Sbjct: 668 VEISVDIKNTGTLKGDEVVQLYIHQEALSVTRPVKELKGFKRITLDAGEEKTVIFKL-SI 726

Query: 719 KSLKIVDNAANSLLASGAHTILVG 742
           + L   D     ++  G   +++G
Sbjct: 727 EQLGFYDENMEYVVEPGRVDVMIG 750


>gi|390992294|ref|ZP_10262532.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
 gi|372552957|emb|CCF69507.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
          Length = 886

 Score =  293 bits (751), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 175/446 (39%), Positives = 240/446 (53%), Gaps = 46/446 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA  LV +M+  EKV Q  + A  +PRLG+P YEWWSE LHG++  G            
Sbjct: 36  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 84

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N SL +++G  VSTEARA +N            AGLT WSP
Sbjct: 85  -----ATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSP 139

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ           D    +  A  KH
Sbjct: 140 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG---------EDLNHPRTIATPKH 190

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +         R  FD  V+  D++ T+   F   + EG   SVMC+YN ++G P CA
Sbjct: 191 IAVHSGPE---PGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCA 247

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              LLN  +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y
Sbjct: 248 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 306

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
                 A+ +G++ EA +D SL  L+    RLG  +   +  Y  LG  ++ N  H  LA
Sbjct: 307 RELGT-AIARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 365

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA + IVLLKND   LPL  G    LA++GP+A+A  A+  NY+GT     +P+ G  
Sbjct: 366 LQAAAESIVLLKNDANTLPLRAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 423

Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
               ++ ++YA G A +      MIP
Sbjct: 424 QRFGAQQVSYAQG-APLAAGVPGMIP 448



 Score =  132 bits (333), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 90/282 (31%), Positives = 140/282 (49%), Gaps = 45/282 (15%)

Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           V G +L ++    +G DR D+ LP  Q  L+ + A A+  P+ +V+MS  AV +N+AK +
Sbjct: 628 VEGEELRIDVPGFDGGDRNDIALPAAQQTLLER-AKASGKPLVVVLMSGSAVALNWAKTH 686

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
                  W  YPG+ GG AIA ++ G  NPGGRLP+T+Y +      Y S  ++      
Sbjct: 687 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK------ 738

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++F G  ++PFGYGLSYT+F Y    +P+     L   Q    +  T         
Sbjct: 739 GRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQLSTTTL---QAGNPLQVTA-------- 784

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
                              V N G   G EV  VY + P    + ++ ++G++RV +AAG
Sbjct: 785 ------------------TVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAAG 826

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
           +   + F ++A ++L  VD +    + +G +T+ VG G  G 
Sbjct: 827 EQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGGQPGT 867


>gi|381169747|ref|ZP_09878910.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
 gi|380689765|emb|CCG35397.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
          Length = 874

 Score =  293 bits (750), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 175/446 (39%), Positives = 240/446 (53%), Gaps = 46/446 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA  LV +M+  EKV Q  + A  +PRLG+P YEWWSE LHG++  G            
Sbjct: 24  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 72

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N SL +++G  VSTEARA +N            AGLT WSP
Sbjct: 73  -----ATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSP 127

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ           D    +  A  KH
Sbjct: 128 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG---------EDLNHPRTIATPKH 178

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +         R  FD  V+  D++ T+   F   + EG   SVMC+YN ++G P CA
Sbjct: 179 IAVHSGPE---PGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCA 235

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              LLN  +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y
Sbjct: 236 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 294

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
                 A+ +G++ EA +D SL  L+    RLG  +   +  Y  LG  ++ N  H  LA
Sbjct: 295 RELGT-AIARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 353

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA + IVLLKND   LPL  G    LA++GP+A+A  A+  NY+GT     +P+ G  
Sbjct: 354 LQAAAESIVLLKNDANTLPLRAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 411

Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
               ++ ++YA G A +      MIP
Sbjct: 412 QRFGAQQVSYAQG-APLAAGVPGMIP 436



 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 91/282 (32%), Positives = 139/282 (49%), Gaps = 45/282 (15%)

Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           V G +L ++    +G DR D+ LP  Q  L+ + A A+  P+ +V+MS  AV +N+AK +
Sbjct: 616 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKMH 674

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
                  W  YPG+ GG AIA ++ G  NPGGRLP+T+Y +     PY S  ++      
Sbjct: 675 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK------ 726

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++F G  ++PFGYGLSYT+F Y             D  Q       T+    P   
Sbjct: 727 GRTYRYFKGEPLFPFGYGLSYTRFAY-------------DAPQLS---TTTLQAGNP--- 767

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
                              V N G   G EV  VY + P    + ++ ++G++RV +AAG
Sbjct: 768 -------------LQVTATVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAAG 814

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
           +   + F ++A ++L  VD +    + +G +T+ VG G  G 
Sbjct: 815 EQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGGQPGT 855


>gi|346725879|ref|YP_004852548.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346650626|gb|AEO43250.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 889

 Score =  293 bits (750), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 174/446 (39%), Positives = 241/446 (54%), Gaps = 46/446 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA  LV +M+  EKV Q  + A  +PRLG+P YEWWSE LHG++  G            
Sbjct: 39  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 87

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N  L +++G  VSTEARA +N            AGLT WSP
Sbjct: 88  -----ATVFPQSIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 142

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ           D    +  A  KH
Sbjct: 143 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG---------EDLNHPRTIATPKH 193

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +   +     R  FD  V+ +D++ T+   F   + EG   SVMC+YN ++G P CA
Sbjct: 194 IAVH---SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACA 250

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              LLN  +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y
Sbjct: 251 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 309

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
                 A+ +G++ EA +D SL  L+    RLG  +   +  Y  LG  ++ N  H  LA
Sbjct: 310 RELGT-AIARGEVDEALLDQSLVRLFATRYRLGELEAPRKDPYARLGAKDVDNAAHRALA 368

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA + IVLLKND   LPL  G    LA++GP+A+A  A+  NY+GT     +P+ G  
Sbjct: 369 LQAAAESIVLLKNDANTLPLKAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 426

Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
               ++ ++YA G A +      MIP
Sbjct: 427 QRFGAQQVSYAQG-APLAAGVPGMIP 451



 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 87/282 (30%), Positives = 136/282 (48%), Gaps = 45/282 (15%)

Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           V G +L ++    +G DR D+ LP  Q  L+ + A A+  P+ +V+MS  AV +N+AK +
Sbjct: 631 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 689

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
                  W  YPG+ GG AIA ++ G  NPGGRLP+T+Y +      Y S  ++      
Sbjct: 690 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK------ 741

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++F G  ++ FGYGLSYT+F Y                 Q        G++     
Sbjct: 742 GRTYRYFKGEPLFAFGYGLSYTRFAYDAP--------------QLSTTTLQAGSS----- 782

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
                              V N G   G EV  VY + P    + ++ ++G++RV +AAG
Sbjct: 783 -------------LQVTTTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAAG 829

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
           +   + F ++A ++L  VD +    + +G +T+ VG G  G 
Sbjct: 830 EQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGGQPGT 870


>gi|386718620|ref|YP_006184946.1| glucan 1,4-beta-glucosidase [Stenotrophomonas maltophilia D457]
 gi|384078182|emb|CCH12773.1| Glucan 1,4-beta-glucosidase [Stenotrophomonas maltophilia D457]
          Length = 897

 Score =  292 bits (748), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 178/447 (39%), Positives = 246/447 (55%), Gaps = 45/447 (10%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           P+ D    + +RA  LV +MTL EK  QM + A  + RLG+P Y+WW+E LHGV+  G+ 
Sbjct: 37  PWLDVSASFEQRAASLVAQMTLDEKAAQMQNAAPAIERLGVPAYDWWNEGLHGVARAGQ- 95

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL--------G 124
                          AT FP  I   A+F+  L  ++  T+S EARA ++          
Sbjct: 96  ---------------ATVFPQAIGLAATFDVPLMGQVATTISDEARAKHHQFLRQGAHGR 140

Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLTFWSPN+N+ RDPRWGR  ET GEDPY+  R  + +VRGLQ  + V          
Sbjct: 141 YQGLTFWSPNVNIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQGDDPVYR-------- 192

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
            K+ A  KH A +        DR HFD+R + +D+ +T++  FE  V EGDV +VM +YN
Sbjct: 193 -KLDATAKHLAVHSGPE---ADRHHFDARPSRRDLYDTYLPAFEALVKEGDVDAVMGAYN 248

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           RV G    A   LL   +R DW F GY+VSDC +I  I + H  +  T+E A A  ++ G
Sbjct: 249 RVYGESASASRFLLRDVLRRDWGFKGYVVSDCWAIVDIWKHHHIVT-TREAAAALAVRNG 307

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICN 364
            +L+CG  Y      AV+QG I+EA+ID ++  L+   MRLG FD   + +        N
Sbjct: 308 TELECGQEYATLP-SAVRQGLISEAEIDDAVTRLFTARMRLGMFDPPERVRWARIPASVN 366

Query: 365 --PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
             P H  LA +AA+  +VLLKND G LPL+  +IK +A+VGP A+ T A++GNY GTP  
Sbjct: 367 QAPSHDALALKAAQASLVLLKND-GILPLSR-DIKRIAVVGPTADDTMALLGNYFGTPAA 424

Query: 423 YTSPMDGFYAYSK--VINYAPGCADIV 447
             + + G    +K   + YA G  D+V
Sbjct: 425 PVTILQGIREAAKGVEVRYARGV-DLV 450



 Score =  140 bits (353), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 94/298 (31%), Positives = 149/298 (50%), Gaps = 53/298 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+DAA+ AD  V V GL   VE E          G DR DL LP  Q  L+  +    K 
Sbjct: 622 ALDAAREADVVVFVGGLTGDVEGEEMTVNYPGFAGGDRTDLRLPAPQRTLLEALHATGK- 680

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV +V+    A+ +++A+++  + +IL   YPG+ GG A+   +FG  NP GRLP+T+Y+
Sbjct: 681 PVVMVLTGGSAIAVDWAQSH--LPAILMSWYPGQRGGTAVGQALFGDVNPAGRLPVTFYK 738

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
           A+       ++P        GRTY++F G  +YPFG+GLSYT+F Y          ++LD
Sbjct: 739 AS------EALPAFDDYAMEGRTYRYFRGTPLYPFGHGLSYTRFDYGT--------LRLD 784

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPP 685
                       G+ +              D +    ++V N G   G EVV +Y  +  
Sbjct: 785 -----------AGSLR-------------ADGRLGVAVDVTNAGTRSGDEVVQLYVRREH 820

Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA-ANSLLASGAHTILVG 742
             +G  ++++ G++R+ +A G+   V FT+ A ++L+  D A A   +  GA+ + VG
Sbjct: 821 AGSGDAVQELRGFQRIHLAPGEHRTVTFTLEAAQALRHYDEARAAYEVRPGAYEVRVG 878


>gi|408824590|ref|ZP_11209480.1| Glucan 1,4-beta-glucosidase [Pseudomonas geniculata N1]
          Length = 897

 Score =  292 bits (748), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 177/447 (39%), Positives = 246/447 (55%), Gaps = 45/447 (10%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           P+ D    + +RA  LV +MTL EK  QM + A  + RLG+P Y+WW+E LHGV+  G+ 
Sbjct: 37  PWLDVSASFEQRAAALVAQMTLEEKAAQMQNAAPAIERLGVPAYDWWNEGLHGVARAGQ- 95

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL--------G 124
                          AT FP  I   A+F+  L  ++  T+S EARA ++          
Sbjct: 96  ---------------ATVFPQAIGLAATFDVPLMGQVAATISDEARAKHHQFLREGAHGR 140

Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLTFWSPN+N+ RDPRWGR  ET GEDPY+  R  + +VRGLQ  + V          
Sbjct: 141 YQGLTFWSPNVNIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQGDDPVYR-------- 192

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
            K+ A  KH A +        DR HFD+R + +D+ +T++  FE  V EGDV +VM +YN
Sbjct: 193 -KLDATAKHLAVHSGPE---ADRHHFDARPSRRDLYDTYLPAFEALVKEGDVDAVMGAYN 248

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           RV G    A   LL   +R DW F GY+VSDC +I  I + H+ +  T+E A A  ++ G
Sbjct: 249 RVYGESASASRFLLRDVLRRDWGFKGYVVSDCWAIVDIWKHHRIVT-TREAAAALAVRNG 307

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICN 364
            +L+CG  Y      AV+QG I+EA+ID ++  L+   MRLG FD   + +        N
Sbjct: 308 TELECGQEYATLP-SAVRQGLISEAEIDDAVTRLFTARMRLGMFDPPERVRWARIPASVN 366

Query: 365 --PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
             P H  LA +AA+  +VLLKND G LPL+  N + +A+VGP A+ T A++GNY GTP  
Sbjct: 367 QAPAHDALALKAAQASLVLLKND-GILPLSR-NTRRIAVVGPTADDTMALLGNYFGTPAA 424

Query: 423 YTSPMDGFYAYSK--VINYAPGCADIV 447
             + + G    +K   + YA G  D+V
Sbjct: 425 PVTILQGIREAAKGVEVRYARGV-DLV 450



 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 90/287 (31%), Positives = 137/287 (47%), Gaps = 53/287 (18%)

Query: 468 VIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA 517
           V V GL   VE E          G DR DL LP  Q  L+  +    K PV +V+    A
Sbjct: 633 VFVGGLTGDVEGEEMTVNYPGFAGGDRTDLRLPAPQRTLLEALHGTGK-PVVMVLTGGSA 691

Query: 518 VDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSM 577
           + +++A+ +  + +IL   YPG+ GG A+   +FG  NP GRLP+T+Y+A        +M
Sbjct: 692 IAVDWAQAH--LPAILMSWYPGQRGGTAVGQALFGDVNPSGRLPVTFYKAG------EAM 743

Query: 578 PLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
           P        GRTY++F G  +YPFG+GLSYT+F Y          ++LD D         
Sbjct: 744 PAFDDYAMEGRTYRYFRGTPLYPFGHGLSYTRFDYGT--------LRLDADSL------- 788

Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVI 696
                              D +    ++V N G   G EVV +Y  +    +G  ++++ 
Sbjct: 789 -----------------RADGRLGVAVDVANTGTRSGDEVVQLYVRREHAGSGDAVQELR 831

Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNA-ANSLLASGAHTILVG 742
           G++RV +A G+   V FT+ A ++L+  D A A   +  GA+ + VG
Sbjct: 832 GFQRVQLAPGERRTVTFTLEAAQALRHYDEARAAYAVQPGAYEVRVG 878


>gi|374313710|ref|YP_005060140.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
 gi|358755720|gb|AEU39110.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
          Length = 883

 Score =  291 bits (744), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 166/460 (36%), Positives = 247/460 (53%), Gaps = 51/460 (11%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY D  LP  +RA DLV R+TL EK  Q+   A G+PRLG+P Y++WSE LHG++  G 
Sbjct: 35  LPYQDTTLPAEQRAADLVGRLTLDEKAAQLVTSAPGIPRLGVPAYDFWSEGLHGIARSGY 94

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
                           AT FP  +   A+F+E L  +IG+ +STEARA YN   A     
Sbjct: 95  ----------------ATLFPQAVGMAATFDEPLLHQIGEVISTEARAKYNDAVAHDLRS 138

Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
              GLT WSPNIN+ RDPRWGR  ET GEDP++  R    +V GLQ           D  
Sbjct: 139 IFYGLTIWSPNINIFRDPRWGRGQETYGEDPFLTARLGTAFVEGLQG---------DDPN 189

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
             +     KH+A +   +   ++R  F++  +  D+ +T++  F   + EG   S+MC+Y
Sbjct: 190 YYRAIGTPKHFAVH---SGPESERHRFNADPSPHDLWDTYLPAFRATIVEGKAGSIMCAY 246

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES--HKFLNDTKEDAVARVL 301
           N + G P CA   LL++ +R DW F G++ SDC +I    E   H +  D ++ +V  + 
Sbjct: 247 NAIEGKPACASDLLLDEVLRKDWAFKGFVTSDCGAIDNFFEKDGHHYSKDAEQASVDGI- 305

Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGK 359
           +AG D +CG  Y N    AV++G I E+++D  LR L++   +LG FD   Q  Y ++  
Sbjct: 306 RAGTDTNCGGTYRNLA-SAVRKGMIQESELDVPLRRLFLARFKLGLFDPPSQVKYASMPI 364

Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
               +  H ELA +AAR+ +VLLKN++  LPL+   +KT+A++GP+A++  ++ GNY   
Sbjct: 365 TENMSSSHTELALQAAREAVVLLKNEHHTLPLD-ARVKTIAVIGPNASSLISLEGNYNAI 423

Query: 420 PCRYTSPMDGF--------YAYSKVINYAPGCADIVCQNN 451
           P      +DG           Y++   YA G A ++ +  
Sbjct: 424 PKNPVMQVDGIAREFRDAKVLYAQGSPYAEGVALVIPRTQ 463



 Score =  133 bits (335), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 92/298 (30%), Positives = 143/298 (47%), Gaps = 52/298 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A++A K ADA V   GL   +E E          G DR DL+LP  Q +L+ + A A+  
Sbjct: 606 AMEAVKQADAVVAFVGLSPELEGEEMDVHIPGFSGGDRTDLVLPAAQQQLL-EAAKASGK 664

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           P+ +V+++  A+ +N+A+ +    +IL   YPG+ G +AIA+ + GK NP GRLP+T+Y 
Sbjct: 665 PLVVVLLNGSALAVNWAQEH--ADAILEAWYPGQAGAQAIAETLSGKNNPSGRLPVTFYR 722

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
           +     P+T   +        RTY++F G  +Y FGYGLSY+ F Y  A   K    +LD
Sbjct: 723 SVNDLPPFTDYAM------ANRTYRYFKGKPLYEFGYGLSYSTFSYSNAHLSKE---RLD 773

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
                R                              + +V+N   + G EV  +Y  PP 
Sbjct: 774 AGDTLR-----------------------------VEADVKNTSTLAGDEVAELYLTPPQ 804

Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
                ++ + G+E V +  GQS  V FT++  + L  VD      + +G +++ VG G
Sbjct: 805 NGVYPLRSLEGFEHVHLLPGQSKHVSFTLDP-RQLSEVDEKGIRAVRAGVYSVTVGGG 861


>gi|218262493|ref|ZP_03476939.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
           DSM 18315]
 gi|218223341|gb|EEC95991.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
           DSM 18315]
          Length = 868

 Score =  291 bits (744), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 177/469 (37%), Positives = 246/469 (52%), Gaps = 54/469 (11%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           S   +  D+P+ +  LP  ER  DL++R+T  EKV QM +    + RLG+P Y+WW+EAL
Sbjct: 18  SCSQRQEDYPFRNPDLPIDERIDDLLKRLTAEEKVGQMMNTTPAIERLGIPQYDWWNEAL 77

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HGV+  G+                AT FP  I   A+F++    +    VS EARA Y+ 
Sbjct: 78  HGVARAGK----------------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQ 121

Query: 124 GNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
                      GLTFW+PNIN+ RDPRWGR +ET GEDPY+  R  +  V+GLQ      
Sbjct: 122 YQKDKEYDRYKGLTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQG----- 176

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
                D +  K  AC KHYA +    W   +R  FD  VT +D+ +T++  FE  V EG+
Sbjct: 177 ----DDPKYFKTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKEGN 229

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE------SHKFL 289
           V  VMC+YNR  G P C+  KLL   +R  W +   I+SDC +I    E       H+  
Sbjct: 230 VQEVMCAYNRYQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWERDERTPRHETH 289

Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
            D  E A A  +  G DL+CG+ Y    + A++ GKI+E D+D SLR L      LG FD
Sbjct: 290 PDA-ESASADAVLNGTDLECGNSYRAL-VKALKDGKISENDLDVSLRRLLKGRFELGMFD 347

Query: 350 GSPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
              Q  Y  +  N + +P+H+  A E A + +VLLKN N  LPL +  I+ +A+VGP+A 
Sbjct: 348 PDEQVPYAQIPYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPL-SKTIRKIAVVGPNAA 406

Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAY---SKVINYAPGC---ADIVCQN 450
            +  +  NY G P    + ++G       ++VI Y  GC   AD V Q+
Sbjct: 407 DSTMLWANYNGFPTHTVTILEGIRNKVPDTEVI-YELGCNHAADFVIQD 454



 Score =  132 bits (331), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 102/315 (32%), Positives = 142/315 (45%), Gaps = 55/315 (17%)

Query: 442 GCADIVCQNNSMIPA--AIDAAKNADATVIV---------AGLDLSVEAEG---KDRVDL 487
           G AD+  Q  +  P   A  AAK  DA VIV          G ++ V  EG    DR ++
Sbjct: 579 GSADLNFQIGTRRPVDYAATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNI 638

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
            LP  Q E++ K   A   PV  V+ +  A+ +N+ + N  I +IL   Y G+E G A+A
Sbjct: 639 ELPKVQQEMV-KALKATGKPVVYVLCTGSALALNWEEAN--IDAILNAWYGGQEAGTAVA 695

Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSY 607
           D++FG YNP GRLP+T+Y++         +P     +  GRTY++     +YPFGYGLSY
Sbjct: 696 DILFGDYNPSGRLPVTFYKS------IDQLPDFEDYSMKGRTYRYMTETPLYPFGYGLSY 749

Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
           T F Y+ A   K    K+ KDQ                               T   ++ 
Sbjct: 750 TNFAYRNA---KLSSGKIAKDQSV-----------------------------TLTFDIA 777

Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           N GKMDG EV  +Y K P      IK +  + RV + AG S +V   +         DN 
Sbjct: 778 NTGKMDGDEVAQIYIKNPNDPEGPIKALKAFLRVHVKAGDSQEVNIELAPETFHSFNDNT 837

Query: 728 ANSLLASGAHTILVG 742
               +  G + IL G
Sbjct: 838 QTMEVRPGKYQILYG 852


>gi|423313768|ref|ZP_17291703.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
           CL09T03C04]
 gi|392684303|gb|EIY77631.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
           CL09T03C04]
          Length = 788

 Score =  290 bits (743), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 238/811 (29%), Positives = 373/811 (45%), Gaps = 163/811 (20%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
           Y + K P  +R +DL+ +MTL EK  QM  L YG  R+    LP   W +E    G+  I
Sbjct: 43  YENPKAPLEDRVQDLLSQMTLEEKTCQMATL-YGSGRVLKDALPQNNWKTEVWKDGIGNI 101

Query: 70  GRRTNS----------PPGTHFDSE--------------VP--------------GATSF 91
               N           P   H +++              +P               AT F
Sbjct: 102 DEEHNGLGAFKSEYSFPYAKHVNAKHTIQRWFVEKTRLGIPVDFTNEGIRGLCHDRATYF 161

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
           P      A++N+ L  +IG+  + EA A+      G T  +SP +++ +DPRWGR +ET 
Sbjct: 162 PAQCGQGATWNKKLIARIGEVEAKEAVAL------GYTNIYSPILDIAQDPRWGRCVETY 215

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+VG      +  LQ                 + A  KH+A Y +     + +   
Sbjct: 216 GEDPYLVGELGKQMITSLQK--------------YNLVATPKHFAVYSIPIGGRDGKTRT 261

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
           D  V  ++M+  +I PF M   E     VM SYN  +G P       L + +R +W F G
Sbjct: 262 DPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKG 321

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
           Y+VSD ++++ I   HK + DT ED +A+ + AGL++      T+FT           AV
Sbjct: 322 YVVSDSEAVEFISNKHK-VADTYEDGIAQAVNAGLNIR-----THFTPPADFILPLRKAV 375

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN---ICNPQHIELAAEAARQG 378
             GKI++  +D  +  +  +  RLG FD    Y+  GK     + + +H  ++ EAARQ 
Sbjct: 376 DNGKISQETLDKRVAEILRIKFRLGLFDNP--YRGNGKQAEQIVHSKEHQAVSLEAARQS 433

Query: 379 IVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY---TSPMDGFYA--- 432
           +VLLKN+   LPL+  +I+++A++GP+AN    +I       CRY    +P+   Y    
Sbjct: 434 LVLLKNETNLLPLSK-SIRSIAVIGPNANEQTQLI-------CRYGPANAPIKTVYQGIK 485

Query: 433 ----YSKVINYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGL 473
               +++VI Y  GC DI+                +   ++  AI AAK A+  V+V G 
Sbjct: 486 ELLPHAEVI-YKKGC-DIIDPHFPESEILDFPKTAEEVQLMEEAIRAAKQAEVVVMVLGG 543

Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
           +     E + R  L LPG Q EL+  V    K PV LV++   A  IN+A  +  + +IL
Sbjct: 544 NELTVREDRSRTSLNLPGRQEELLKAVCATGK-PVILVMLDGRASSINYAAAH--VPAIL 600

Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
              +PGE  G+A+A+ +FG YNPGGRL +T +  +  +IP+ + P +P ++    T  + 
Sbjct: 601 HAWFPGEFCGQAVAEALFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDESSSTSVY- 657

Query: 594 DGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
               +YPFG+GLSYT F Y  +  SP    ++ D    C+                    
Sbjct: 658 --GALYPFGHGLSYTTFTYSDLHISPSHQGVQGDIHVSCK-------------------- 695

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKV 711
                        ++N GK+ G EVV +Y +       T+ K + G+ER+ + AG+   V
Sbjct: 696 -------------IKNTGKIKGDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTV 742

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            F +   + L + D   N  +  G+  +++G
Sbjct: 743 HFRLRP-QDLGLWDKNMNFRVEPGSFKVMLG 772


>gi|160901716|ref|YP_001567297.1| glycoside hydrolase family 3 protein [Petrotoga mobilis SJ95]
 gi|160359360|gb|ABX30974.1| glycoside hydrolase family 3 domain protein [Petrotoga mobilis
           SJ95]
          Length = 777

 Score =  290 bits (743), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 237/814 (29%), Positives = 389/814 (47%), Gaps = 161/814 (19%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDL-AYGVPRLGLPLYEWWSEAL-HGVSFIGR 71
           Y +   P  ER +DL+E+MTL EK+ Q+G   +Y +   G   +E     L  G+  I R
Sbjct: 4   YKNPDKPIEERIEDLLEQMTLDEKIAQLGSFWSYELLDNGNFSFEKAQNLLKEGIGQITR 63

Query: 72  ---------------------------RTNSPPGTHFDS----EVPGATSFPTVILTTAS 100
                                      R   P   H +        GAT FP +I   ++
Sbjct: 64  PGGATGFSPKKTAELANKIQKFLLTETRLGIPAFMHEECLSGYMTRGATIFPQMIGAAST 123

Query: 101 FNESLWKKIGQTVSTEARAMYNLG-NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR 159
           +   L +++  ++  + +A   LG + GL   SP ++V RDPRWGR  ET GEDPY++ +
Sbjct: 124 WEPPLIERMTTSIRNQMKA---LGIHQGL---SPVVDVTRDPRWGRTEETFGEDPYLIAK 177

Query: 160 YAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD----NWEGNDRFHFDSRVT 215
             + YV+GLQ          SD     I A  KH+  Y +     NW         + + 
Sbjct: 178 MGVAYVKGLQ----------SDDLKNGIVATLKHFVGYGVSEGGMNWA-------PAHIP 220

Query: 216 EQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSD 275
           E++++ETF+ PFE  + EG V SVM +Y+ ++GIP  A   LL + +R +W F G +VSD
Sbjct: 221 ERELKETFLFPFEAAIKEGKVKSVMNAYHEIDGIPCGASETLLRRILREEWGFDGIVVSD 280

Query: 276 CDSIQTIVESHKF-LNDTKEDAVARVLKAGLDLDCG--DYYTNFTMGAVQQGKIAEADID 332
             +I +++E HK  LN  KE+A  + LKAG+D++    D Y      A++ G+ +EA ID
Sbjct: 281 YFAINSLMEYHKIALN--KEEAAIKALKAGIDVELPSFDCYKEPLKNAIENGEFSEAFID 338

Query: 333 TSLRFLYIVLMRLGYFDGSPQYKNLGK--NNICNPQHIELAAEAARQGIVLLKNDNGALP 390
            S+R +  +   +G F+    Y +L K  +N+  P+  +LA E A++ IVLLKND G +P
Sbjct: 339 KSVRNILRLKFEMGLFENP--YVDLEKVPDNLDTPEDRKLAYEIAKKSIVLLKND-GIVP 395

Query: 391 LNTGN-IKTLALVGPHANATKAMIGNY-------------------EGT-------PCR- 422
           L   + IK +A++GP+AN+ + + G+Y                   EG        P + 
Sbjct: 396 LKKNSKIKKVAVIGPNANSARNLTGDYTYLTHLETLKQGAFGTSAMEGITFSESELPIKT 455

Query: 423 -YTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG------LDL 475
            Y S  +     +   +YA GC +I   N  MI  A++ A+N+D  ++V G      LD 
Sbjct: 456 IYESLKEKLEKLNVETSYAKGC-EINDDNKEMIKEAVELAENSDVALLVLGDKSGLTLDC 514

Query: 476 SVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
           +   E +D   L+LPG Q +L+  V +    PV +V+++     +++   N  + +I   
Sbjct: 515 TT-GESRDSSTLILPGVQLDLLKSVINTGT-PVIVVLVNGRPYSLDWVSKN--VSAIFEA 570

Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
             PGEEGG A+AD+I G  +P G+LPI++      + + Y   P    + + G    + D
Sbjct: 571 WLPGEEGGNALADIILGDESPSGKLPISFPRHVGQIPVYYNHKPSGGRSQWWG---DYTD 627

Query: 595 GPV--VYPFGYGLSYTQFKY---KVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
            P   +YPFG+GLSYTQF+Y   ++ ++ + V I +D                       
Sbjct: 628 SPAKPLYPFGHGLSYTQFEYGNLQIENNDRIVKISMD----------------------- 664

Query: 650 IDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQS 708
                           V+N+G+  G E+V +Y      + T  +K++ G++RV +   + 
Sbjct: 665 ----------------VKNIGEETGDEIVQLYMNDEVASVTRPVKELKGFQRVTLKPSEK 708

Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            ++ F +   ++L + +     L+  G   ++VG
Sbjct: 709 KRIIFNL-PIETLALYNEKMEFLVEKGYFKVMVG 741


>gi|383643328|ref|ZP_09955734.1| glycoside hydrolase family 3 [Sphingomonas elodea ATCC 31461]
          Length = 799

 Score =  290 bits (743), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 229/720 (31%), Positives = 343/720 (47%), Gaps = 106/720 (14%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+P+     EALHG+                   PGATSFP  I   +SF+  L + I
Sbjct: 145 RLGIPML-MHEEALHGLV-----------------APGATSFPQSIALASSFDPKLVENI 186

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
               + EARA      A L   +P ++V RDPRWGR+ ET GEDPY+V +  +  +RG Q
Sbjct: 187 FSMAAKEARAR----GANLVL-APVVDVARDPRWGRIEETYGEDPYLVTQMGLAAIRGFQ 241

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
              G      SD    K+    KH   +       N      + + E+ ++E F  PFE 
Sbjct: 242 ---GTTMPLKSD----KVFITLKHMTGHGQPE---NGTNVGPASLGERTLREDFFPPFEA 291

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            V    V SVM SYN ++GIP+ A+  LL   +RG+W F G +VSD  +I+ ++  H   
Sbjct: 292 AVKTLPVMSVMASYNEIDGIPSHANKWLLTDVLRGEWGFQGAVVSDYFAIRELITRHHLF 351

Query: 290 NDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
            D K DA  R L AG+D++   G+ YT+     V+QG++++ +ID ++R +  +    G 
Sbjct: 352 KDPK-DAAQRALDAGVDVETPDGEAYTHLVQ-LVKQGRVSQGEIDNAVRRVLRMKFEGGL 409

Query: 348 FDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
           F+       L       P+ I L+ +AAR+ IVLLKN  G LPL+   IK +A++G HA 
Sbjct: 410 FENPYPEVKLAAARTNTPEAIALSRQAARESIVLLKNAQGLLPLDARGIKRMAVIGTHAK 469

Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAYSK---VINYAPG---------CADIVCQ-----N 450
            T   IG Y   P    S ++G  A  K    ++YA G           D V Q     N
Sbjct: 470 DTP--IGGYSDLPNHVVSVLEGMQAEGKGKFAVDYAEGIRITNHREWSKDAVAQVPASVN 527

Query: 451 NSMIPAAIDAAKNADATVIVAGLDLSVEAEG------KDRVDLLLPGFQTELINKVADAA 504
           + +   A++ AKNAD  V+V G + +V  E        D   L LPG Q +L  ++    
Sbjct: 528 DQLRAQALETAKNADVVVLVLGGNEAVSREAWADNHLGDSETLDLPGPQDQLAKELIALG 587

Query: 505 KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW 564
           K PV +++++     +N+     K  +++   Y GE+ G AIADV+FG+YNPGG+LP++ 
Sbjct: 588 K-PVVVILLNGRPYAVNYLAE--KAPALIEGWYLGEQTGNAIADVVFGRYNPGGKLPVSV 644

Query: 565 YEA-NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
             +   + I Y   P         R Y F D   +YPFGYGLSYT F     S+P+    
Sbjct: 645 ARSVGQLPIYYNKKP------SARRGYLFGDTSPLYPFGYGLSYTTFDI---SAPR---- 691

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
                         +GT       + I D      K + +++V N GK+ G EVV ++  
Sbjct: 692 --------------LGT-----PTIGIAD------KASVEVDVTNTGKVAGDEVVQLFVH 726

Query: 684 PPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
               + T  + ++  +ERV +  G+   V F +     L + ++    ++  G  TI  G
Sbjct: 727 DDEASVTRPVIELKRFERVTLKPGEKKTVRFELT-PDDLALWNSQMRHVVEPGTFTISSG 785


>gi|150002739|ref|YP_001297483.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
 gi|294776994|ref|ZP_06742455.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
 gi|149931163|gb|ABR37861.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Bacteroides vulgatus ATCC 8482]
 gi|294449242|gb|EFG17781.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
          Length = 788

 Score =  290 bits (743), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 238/811 (29%), Positives = 373/811 (45%), Gaps = 163/811 (20%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
           Y + K P  +R +DL+ +MTL EK  QM  L YG  R+    LP   W +E    G+  I
Sbjct: 43  YENPKAPLEDRVQDLLSQMTLEEKTCQMATL-YGSGRVLKDALPQNNWKTEVWKDGIGNI 101

Query: 70  GRRTNS----------PPGTHFDSE--------------VP--------------GATSF 91
               N           P   H +++              +P               AT F
Sbjct: 102 DEEHNGLGAFKSEYSFPYAKHVNAKHTIQRWFVEKTRLGIPVDFTNEGIRGLCHDRATYF 161

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
           P      A++N+ L  +IG+  + EA A+      G T  +SP +++ +DPRWGR +ET 
Sbjct: 162 PAQCGQGATWNKKLIARIGEVEAKEAVAL------GYTNIYSPILDIAQDPRWGRCVETY 215

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+VG      +  LQ                 + A  KH+A Y +     + +   
Sbjct: 216 GEDPYLVGELGKQMITSLQK--------------YNLVATPKHFAVYSIPIGGRDGKTRT 261

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
           D  V  ++M+  +I PF M   E     VM SYN  +G P       L + +R +W F G
Sbjct: 262 DPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKG 321

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
           Y+VSD ++++ I   HK + DT ED +A+ + AGL++      T+FT           AV
Sbjct: 322 YVVSDSEAVEFISNKHK-VADTYEDGIAQAVNAGLNIR-----THFTPPADFILPLRKAV 375

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN---ICNPQHIELAAEAARQG 378
             GKI++  +D  +  +  +  RLG FD    Y+  GK     + + +H  ++ EAARQ 
Sbjct: 376 DNGKISQETLDKRVAEILRIKFRLGLFDNP--YRGNGKQAEQIVHSKEHQAVSLEAARQS 433

Query: 379 IVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY---TSPMDGFYA--- 432
           +VLLKN+   LPL+  +I+++A++GP+AN    +I       CRY    +P+   Y    
Sbjct: 434 LVLLKNETNLLPLSK-SIRSIAVIGPNANEQTQLI-------CRYGPANAPIKTVYQGIK 485

Query: 433 ----YSKVINYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGL 473
               +++VI Y  GC DI+                +   ++  AI AAK A+  V+V G 
Sbjct: 486 ELLPHTEVI-YKKGC-DIIDPHFPESEILDFPKTAEEVQLMEEAIRAAKQAEVVVMVLGG 543

Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
           +     E + R  L LPG Q EL+  V    K P+ LV++   A  IN+A  +  I +IL
Sbjct: 544 NELTVREDRSRTSLNLPGRQEELLKAVCATGK-PIILVMLDGRASSINYAAAH--IPAIL 600

Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
              +PGE  G+A+A+ +FG YNPGGRL +T +  +  +IP+ + P +P ++    T  + 
Sbjct: 601 HAWFPGEFCGQAVAEALFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDESSSTSVY- 657

Query: 594 DGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
               +YPFG+GLSYT F Y  +  SP    ++ D    C+                    
Sbjct: 658 --GALYPFGHGLSYTTFTYSDLHISPSHQGVQGDIHVSCK-------------------- 695

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKV 711
                        ++N GK+ G EVV +Y +       T+ K + G+ER+ + AG+   V
Sbjct: 696 -------------IKNTGKIKGDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTV 742

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            F +   + L + D   N  +  G+  +++G
Sbjct: 743 HFRLRP-QDLGLWDKNMNFRVELGSFKVMLG 772


>gi|121308314|dbj|BAF43576.1| arabinofuranosidase/xylosidase homolog [Prunus persica]
          Length = 349

 Score =  290 bits (742), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 156/356 (43%), Positives = 219/356 (61%), Gaps = 18/356 (5%)

Query: 407 NATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADA 466
           + T  MIGNY G  C YT+P+ G   Y++ I+ A GC D+ C  N +  AA  AA+ ADA
Sbjct: 1   DVTVTMIGNYAGVACGYTTPLQGIGRYTRTIHQA-GCTDVHCNGNQLFGAAEAAARQADA 59

Query: 467 TVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           TV+V GLD S+EAE  DR  LLLPG Q EL+++VA A++GP  LV+MS G +D+ FAKN+
Sbjct: 60  TVLVMGLDQSIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKND 119

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVN 583
           P+I +I+WVGYPG+ GG AIADV+FG  NPGG+LP+TWY  NYV  +P T M +R  P  
Sbjct: 120 PRISAIIWVGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPAR 179

Query: 584 NFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRD---INYTVGT 640
            +PGRTY+F+ GPVV+PFG GLSYT F + +A  P  V + L   +   +   ++  V  
Sbjct: 180 GYPGRTYRFYRGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKAVRV 239

Query: 641 NKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYER 700
           +   C A+   DV          ++V+N G MDG+  ++V++ PP       KQ++G+ +
Sbjct: 240 SHADCNALSPLDV---------HVDVKNTGSMDGTHTLLVFTSPPDGKWASSKQLMGFHK 290

Query: 701 VFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
           + IAAG   +V   ++ CK L +VD      +  G H + +G+    VS  LQ NL
Sbjct: 291 IHIAAGSEKRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGDLSHHVS--LQTNL 344


>gi|325914134|ref|ZP_08176487.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325539637|gb|EGD11280.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 874

 Score =  290 bits (742), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 174/446 (39%), Positives = 242/446 (54%), Gaps = 46/446 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA  LV +M+  EKV Q  + A  +PRL +P YEWWSE LHG++  G            
Sbjct: 24  QRAAALVAQMSRDEKVAQAMNDAPAIPRLDIPAYEWWSEGLHGIARNGY----------- 72

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N +L +++G  VSTEARA +N            AGLT WSP
Sbjct: 73  -----ATVFPQAIGLAASWNTALMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 127

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ         D  + P  I A  KH
Sbjct: 128 NINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQG--------DDLNHPRTI-ATPKH 178

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +         R  FD  V+ +DM+ T+   F   + +G   SVMC+YN ++G P CA
Sbjct: 179 IAVHSGPE---PGRHGFDVDVSPRDMEATYTPAFRAALVDGQAWSVMCAYNSLHGTPACA 235

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              LLN  +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y
Sbjct: 236 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 294

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
                 A+++G++ EA +D SL  L+    RLG  +   +  Y  LG  ++ N  H  LA
Sbjct: 295 RELGT-AIERGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 353

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA + IVLLKN    LPL  G    LA++GP+A+A  A+  NY+GT     +P+ G  
Sbjct: 354 LQAAAESIVLLKNTATTLPLKAGT--RLAVIGPNADALAALEANYQGTSATPITPLLGLR 411

Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
            +  ++ + YA G A +      MIP
Sbjct: 412 QHFGAQQVRYAQG-APLAAGVPGMIP 436



 Score =  137 bits (345), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 94/299 (31%), Positives = 137/299 (45%), Gaps = 52/299 (17%)

Query: 463 NADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVI 512
            +DA V   GL   VE E          G DR D+ LP  Q  L+ + A A+  P+ +V+
Sbjct: 602 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVL 660

Query: 513 MSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI 572
           MS  AV +N+AK N       W  YPG+ GG AIA  + G  NPGGRLP+T+Y +     
Sbjct: 661 MSGSAVALNWAKANADAIVAAW--YPGQSGGTAIARALAGDDNPGGRLPVTFYRSTKDLP 718

Query: 573 PYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCR 632
            Y S  ++      GRTY++F G  ++PFGYGLSYT F Y                   R
Sbjct: 719 AYVSYDMK------GRTYRYFKGEPLFPFGYGLSYTSFAYDAP----------------R 756

Query: 633 DINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI 692
               T+    P                      V N G   G EV  VY + P    + +
Sbjct: 757 LSTRTLQAGNP----------------LQVTTTVRNTGSRAGDEVAQVYLQYPDRPQSPL 800

Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFP 751
           + ++G++RV +  G+  ++ FT++A ++L  VD +    + +G + + VG G  G   P
Sbjct: 801 RSLVGFQRVHLKPGEQRELTFTLDA-RALSDVDRSGQRAVEAGEYRVFVGGGQPGTGAP 858


>gi|94969405|ref|YP_591453.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
 gi|94551455|gb|ABF41379.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
          Length = 902

 Score =  290 bits (742), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 173/445 (38%), Positives = 241/445 (54%), Gaps = 46/445 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y DA  P  ERA DLV+RMTL EK  Q+ D A  +PRLG+P Y+ WSEALHGV+  G   
Sbjct: 38  YRDATRPANERAHDLVQRMTLDEKAAQLEDWATAIPRLGVPDYQTWSEALHGVARAGH-- 95

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL----GNA--- 126
                         AT FP  I   A+++  + K++G  +STEAR  YN     GN    
Sbjct: 96  --------------ATVFPQAIGMAATWDTEMVKQMGDVISTEARGKYNEAQREGNHRIF 141

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLTFWSPNIN+ RDPRWGR  ET GEDP++ G+  I ++ G+Q           D+   
Sbjct: 142 WGLTFWSPNINIFRDPRWGRGQETYGEDPFLTGKMGIAFIDGVQG---------PDAAHP 192

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K  A  KH+A +       + R  FD +V+ +D++ET++  F   V +G V SVMC+YN 
Sbjct: 193 KAVATSKHFAVHSGPE---SLRHGFDVKVSPRDLEETYLAAFRATVTDGHVKSVMCAYNA 249

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           V+G+  CA+  LL + ++  W F G++VSDC +I  + + HK   D    A A  L AG 
Sbjct: 250 VDGMGACANKMLLEEHLKQAWGFKGFVVSDCGAIMDVTQGHKNAPDIVH-AAAISLAAGT 308

Query: 306 DLDCGDYYTNFTM--GAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
           DL C  +   F     AV++G + E  +  +   LY     LG FD  GS     +  + 
Sbjct: 309 DLSCSIWEPGFNTLADAVRKGLVTEDMVTRAAERLYAARFELGMFDEPGSNPNDKIDMSQ 368

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           + + +H   A +AA + IVLLKND G LPL   N KT+A++GP A    ++ GNY G P 
Sbjct: 369 VASEEHRAEALKAAEESIVLLKND-GLLPLK--NAKTIAVIGPTAELLASLEGNYNGQPV 425

Query: 422 RYTSPMDGFYAY--SKVINYAPGCA 444
           R  +P+DG      ++ + YA G +
Sbjct: 426 RPVTPLDGIVKQFGAENVRYAQGSS 450



 Score =  108 bits (271), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 82/263 (31%), Positives = 131/263 (49%), Gaps = 45/263 (17%)

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
           G DR  + LP  Q +L+  +  A K PV +V +S  AV +N+A  N    +IL   YPG 
Sbjct: 659 GGDRTSIDLPATQEKLLEALGAAGK-PVVVVNLSGSAVALNWA--NQHAGAILQAWYPGV 715

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIP-YTSMPLRPVNNFPGRTYKFFDGPVVY 599
           EGG AIA  + G+ NP GRLP+T+Y A+   +P +T   ++       RTY+++ G  ++
Sbjct: 716 EGGTAIAKTLAGESNPAGRLPVTFY-ASVQDLPAFTEYAMK------NRTYRYYAGKPLW 768

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
            FG+GLSY+ FKY         ++KL                    A+  +D  K     
Sbjct: 769 GFGFGLSYSTFKYG--------EVKL--------------------ASTSVDAGKS---- 796

Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
            T  + V N  ++ G EVV  Y K P   G     ++G++RV +  G+S +V   ++  +
Sbjct: 797 LTATVTVTNTSQVAGDEVVEAYLKTPQKGGPS-HSLVGFQRVPLNPGESREVAIEVSP-R 854

Query: 720 SLKIVDNAANSLLASGAHTILVG 742
           SL  VD++    + +G + + +G
Sbjct: 855 SLSAVDDSGKRSILAGEYRLSIG 877


>gi|325922365|ref|ZP_08184139.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
 gi|325547147|gb|EGD18227.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
          Length = 889

 Score =  290 bits (742), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 177/446 (39%), Positives = 241/446 (54%), Gaps = 46/446 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA  LV +M+  EKV Q  + A  +PRLG+P YEWWSE LHG++  G            
Sbjct: 39  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 87

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N  L +++G  VSTEARA +N            AGLT WSP
Sbjct: 88  -----ATVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 142

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ         D    P  I A  KH
Sbjct: 143 NINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQG--------DDLDHPRTI-ATPKH 193

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +         R  FD  V+ +D++ T+   F   + +G   SVMC+YN ++G P CA
Sbjct: 194 IAVHSGPE---PGRHSFDVDVSPRDVEATYTPAFRAALIDGQAGSVMCAYNSLHGTPACA 250

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              LLN  +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y
Sbjct: 251 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAS-LKAGHDLNCGYAY 309

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
                 A+++G++ EA +D SL  L+    RLG  +   +  Y  LG  +I N  +  LA
Sbjct: 310 RALGT-AIERGEVDEALLDQSLVRLFAARYRLGELEAPHKDPYATLGAKDIDNTANRALA 368

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA Q IVLLKND   LPL  G    LA++GP+A+A  A+  NY+GT     +P+ G  
Sbjct: 369 LKAAAQSIVLLKNDANTLPLKAG--ARLAVIGPNADALAALEANYQGTSSTPVTPLLGLR 426

Query: 432 AYSKV--INYAPGCADIVCQNNSMIP 455
               V  ++YA G A +      MIP
Sbjct: 427 QRFGVHQVSYAQG-APLAAGVPGMIP 451



 Score =  134 bits (336), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 90/284 (31%), Positives = 137/284 (48%), Gaps = 49/284 (17%)

Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           V G +L ++    +G DR D+ LP  Q  L+ + A A+  P+ +V+MS  AV +N+AK +
Sbjct: 631 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 689

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
                  W  YPG+ GG AIA ++ G  NPGGRLP+T+Y +     PY S  ++      
Sbjct: 690 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK------ 741

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++F G  ++PFGYGLSYT F Y                 Q        G+      
Sbjct: 742 GRTYRYFKGEPLFPFGYGLSYTSFAYGAP--------------QLSSTTLQAGS------ 781

Query: 647 AVLIDDVKCKDYKFTFQI--EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIA 704
                         T Q+   V N G   G EV  VY + P    + ++ ++G++RV + 
Sbjct: 782 --------------TLQVTTTVRNTGTRAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLK 827

Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
            G+   + FT++A ++L  VD      + +G +T+ VG G  G 
Sbjct: 828 PGEQRTLTFTLDA-RALSDVDRTGQRAVEAGDYTLFVGGGQRGT 870


>gi|229580225|ref|YP_002838625.1| glycoside hydrolase family protein [Sulfolobus islandicus
           Y.G.57.14]
 gi|229581131|ref|YP_002839530.1| glycoside hydrolase family protein [Sulfolobus islandicus
           Y.N.15.51]
 gi|228010941|gb|ACP46703.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           Y.G.57.14]
 gi|228011847|gb|ACP47608.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           Y.N.15.51]
          Length = 754

 Score =  290 bits (742), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 212/691 (30%), Positives = 348/691 (50%), Gaps = 106/691 (15%)

Query: 88  ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
           +T+FP  I   +++N  L   I   + ++ R +      G+    SP ++V +DPRWGR 
Sbjct: 101 STAFPQAIGLASTWNPELVMDIASVIRSQGRLV------GVNQCLSPVLDVCKDPRWGRC 154

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN- 205
            ET GEDPY+V    + Y+ GLQ           D+   ++ A  KH+AA+       N 
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQ----------GDN---QLVATAKHFAAHGFPEGGRNI 201

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
            + H  +R    +++ETF+ PFE+ V  G V S+M +Y+ ++GIP   +P+LL   +R +
Sbjct: 202 AQVHVGNR----ELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQE 257

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQ 323
           W F G +VSD D I+ +   H+  ++  E A+   L++G+D++    D Y    + A+++
Sbjct: 258 WGFDGIVVSDYDGIRQLETIHRVASNKMEAAIL-ALESGVDIEFPTIDCYGEPLVNALKE 316

Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLK 383
           G + E+ ID ++  +  +  RLG  D     +N     + + +  ELA + AR+ IVLLK
Sbjct: 317 GLVPESLIDRAVERVLRIKDRLGLLDNPFVNENSVPEKLDDHKSRELALKTARESIVLLK 376

Query: 384 NDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT-------PCRYTSPMDGF---YAY 433
           N+N  LPL + N+  +A++GP+AN  + M+G+Y  T            + + G       
Sbjct: 377 NENNILPL-SKNVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIVKKVGE 435

Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV----AGLDLS------------- 476
           SKV+ YA GC DI  ++      AI+ A+ AD  + +    +GL LS             
Sbjct: 436 SKVL-YAKGC-DIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSEEEFKKY 493

Query: 477 --VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
             V  EG DR  L LPG Q EL+ ++    K P+ LV+++   + ++   N   +K+++ 
Sbjct: 494 QAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSSIIN--YVKAVIE 550

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
             +PGEEGG AIADVIFG YNPGGRLPIT+  +   + + Y   P    ++F  R Y   
Sbjct: 551 AWFPGEEGGNAIADVIFGDYNPGGRLPITFPMDTGQIPLYYNRKP----SSF--RPYVML 604

Query: 594 DGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
               ++ FGYGLSYTQF+Y  +  +PK                  +G N           
Sbjct: 605 RSSPLFTFGYGLSYTQFEYSNLEVTPKE-----------------IGPNS---------- 637

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKV 711
                      I+V+N+GKM+G +VV +Y SK        +K++ G+ ++ +  G+  +V
Sbjct: 638 ------NIAISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEKRRV 691

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            F +   ++L   D+    ++  G + +L+G
Sbjct: 692 KFIL-PTEALAFYDSFMRLVVEKGEYQLLIG 721


>gi|385774250|ref|YP_005646817.1| glycoside hydrolase family protein [Sulfolobus islandicus HVE10/4]
 gi|323478365|gb|ADX83603.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           HVE10/4]
          Length = 754

 Score =  290 bits (742), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 213/691 (30%), Positives = 348/691 (50%), Gaps = 106/691 (15%)

Query: 88  ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
           +T+FP  I   +++N  L   I   + ++ R +      G+    SP ++V +DPRWGR 
Sbjct: 101 STAFPQAIGLASTWNLELVMDIASVIRSQGRLV------GVNQCLSPVLDVCKDPRWGRC 154

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN- 205
            ET GEDPY+V    + Y+ GLQ           D+   ++ A  KH+AA+       N 
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQ----------GDN---QLVATAKHFAAHGFPEGGRNI 201

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
            + H  +R    +++ETF+ PFE+ V  G V S+M +Y+ ++GIP   +P+LL   +R +
Sbjct: 202 AQVHVGNR----ELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQE 257

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQ 323
           W F G +VSD D I+ +   H+  ++  E A+   L++G+D++    D Y+   + A+ +
Sbjct: 258 WGFDGIVVSDYDGIRQLETIHRVASNKMEAAIL-ALESGVDIEFPTIDCYSEPLVNALTE 316

Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLK 383
           G + E+ ID ++  +  +  RLG  D     +N     + + +  ELA + AR+ IVLLK
Sbjct: 317 GLVPESLIDRAVERVLRIKDRLGLLDNPFVNENSVPEKLDDHKSRELALKTARESIVLLK 376

Query: 384 NDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT-------PCRYTSPMDGF---YAY 433
           N+N  LPL + N+  +A++GP+AN  + M+G+Y  T            + + G       
Sbjct: 377 NENNILPL-SKNVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGVVKKVGE 435

Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV----AGLDLS------------- 476
           SKV+ YA GC DI  ++      AI+ A+ AD  + V    +GL LS             
Sbjct: 436 SKVL-YAKGC-DIASESKEGFAEAIEIARQADVIIAVMGEKSGLPLSWTDIPSEEEFKKY 493

Query: 477 --VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
             V  EG DR  L LPG Q EL+ ++    K P+ LV+++   + ++   N   +K+++ 
Sbjct: 494 QAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSPIIN--YVKAVIE 550

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
             +PGEEGG AIADVIFG YNPGGRLPIT+  +   + + Y   P    ++F  R Y   
Sbjct: 551 AWFPGEEGGNAIADVIFGDYNPGGRLPITFPMDTGQIPLYYNRKP----SSF--RPYVML 604

Query: 594 DGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
               ++ FGYGLSYTQF+Y  +  +PK                  +G N           
Sbjct: 605 RSSPLFTFGYGLSYTQFEYSNLEVTPKE-----------------IGPNS---------- 637

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKV 711
                      I+V+N+GKM+G +VV +Y SK        +K++ G+ ++ +  G+  +V
Sbjct: 638 ------NIAISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEKRRV 691

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            F +   ++L   D+    ++  G + +L+G
Sbjct: 692 KFIL-PTEALAFYDSFMRLVVEKGEYQLLIG 721


>gi|384428895|ref|YP_005638255.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
 gi|341937998|gb|AEL08137.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
          Length = 888

 Score =  290 bits (741), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 174/446 (39%), Positives = 240/446 (53%), Gaps = 46/446 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA  LV +M+  EKV Q  + A  +PRLG+P YEWW+E LHG++  G            
Sbjct: 38  QRAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWNEGLHGIARNGY----------- 86

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N  L +++G  VSTEARA +N            AGLT WSP
Sbjct: 87  -----ATVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 141

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ         D    P  I A  KH
Sbjct: 142 NINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQG--------DDLEHPRTI-ATPKH 192

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +         R  FD  V+ +D++ T+   F   + EG   SVMC+YN ++G P CA
Sbjct: 193 IAVHSGPE---PGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACA 249

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              LLN  +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y
Sbjct: 250 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGTAY 308

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG--SPQYKNLGKNNICNPQHIELA 371
                 A+++G++ EA +D SL  L+    RLG        +Y  LG  +I N  +  LA
Sbjct: 309 RALGT-AIERGEVDEALLDQSLVRLFAARYRLGELQAPRKDRYARLGAKDIDNAGNRALA 367

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA + IVLLKN N  LPL       LA++GP+A+A  A+  NY+GT  +  +P+ G  
Sbjct: 368 LQAAAESIVLLKNANATLPLKAST--RLAVIGPNADALAALEANYQGTSSQPVTPLLGLR 425

Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
               ++ + YA G A +      MIP
Sbjct: 426 QRFGAQQVRYAQG-APLAAGVPGMIP 450



 Score =  137 bits (346), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 94/301 (31%), Positives = 139/301 (46%), Gaps = 52/301 (17%)

Query: 463 NADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVI 512
            +DA V   GL   VE E          G DR D+ LP  Q  L+ + A A+  P+ +V+
Sbjct: 616 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVL 674

Query: 513 MSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI 572
           MS  AV +N+AK +       W  YPG+ GG AIA  + G  NPGGRLP+T+Y +     
Sbjct: 675 MSGSAVALNWAKTHADAIVAAW--YPGQSGGTAIARALAGDDNPGGRLPVTFYRSTKDLP 732

Query: 573 PYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCR 632
           PY S  ++      GRTY++F G  ++PFGYGLSYT+F Y+                  R
Sbjct: 733 PYVSYDMK------GRTYRYFKGEALFPFGYGLSYTRFAYETP----------------R 770

Query: 633 DINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI 692
               T+    P                      V N G+  G EV  VY + P    + +
Sbjct: 771 LSATTLQAGSP----------------LQVTTTVRNTGERAGDEVAQVYLQYPERPQSPL 814

Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPL 752
           + ++G++RV +  G+   + FT++A ++L  VD      + +G + + VG G      P 
Sbjct: 815 RSLVGFQRVHLQPGEQRTLTFTLDA-RALSDVDRTGTRAVEAGDYRLFVGGGQPDTGAPG 873

Query: 753 Q 753
           Q
Sbjct: 874 Q 874


>gi|380509734|ref|ZP_09853141.1| beta-glucosidase-related glycosidase [Xanthomonas sacchari NCPPB
           4393]
          Length = 883

 Score =  290 bits (741), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 179/447 (40%), Positives = 246/447 (55%), Gaps = 45/447 (10%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           P+ D    +  RA  LV +MTL EK  QM + A  + RLG+P Y+WW+EALHGV+  G+ 
Sbjct: 23  PWQDTSASFEARAAALVAQMTLEEKAAQMQNAAPAIERLGVPAYDWWNEALHGVARAGQ- 81

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL--------G 124
                          AT FP  I   A+F+  L  ++  T+S EARA ++          
Sbjct: 82  ---------------ATVFPQAIGLAATFDVPLMGQVATTISDEARAKHHQFLREGAHGR 126

Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLTFWSPNIN+ RDPRWGR  ET GEDPY+  R  + +V+GLQ  + V          
Sbjct: 127 YQGLTFWSPNINIFRDPRWGRGQETYGEDPYLTARMGVAFVQGLQGDDPVYR-------- 178

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
            K+ A  KH+A +        DR HFD+R +++D+ +T++  FE  V EG V +VM +YN
Sbjct: 179 -KLDATAKHFAVHSGPE---ADRHHFDARPSKRDLYDTYLPAFEALVKEGKVDAVMGAYN 234

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           RV G    A   LL   +R DW F GY+VSDC +I  I + H  L  ++E A A  +K G
Sbjct: 235 RVYGESASASQFLLRDVLRRDWGFTGYVVSDCWAIVDIWK-HHHLAPSREAAAALAVKNG 293

Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICN 364
            +L+CG  Y      AV+QG I EA+ID ++  L+   MRLG FD   + +        N
Sbjct: 294 TELECGQEYATLP-AAVRQGLIGEAEIDDAVTRLFTARMRLGMFDPPERVRWARIPASVN 352

Query: 365 --PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
             P H  LA +AA++ +VLLKND G LPL+   +K +A+VGP A+ T A++GNY GTP  
Sbjct: 353 QVPAHDALALQAAQESLVLLKND-GVLPLSR-TLKRIAVVGPTADDTMALLGNYFGTPAA 410

Query: 423 YTSPMDGFYAYSKVI--NYAPGCADIV 447
             + + G    +K I   YA G  D+V
Sbjct: 411 PVTILQGIRDAAKGIEVRYARGV-DLV 436



 Score =  143 bits (360), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 97/298 (32%), Positives = 150/298 (50%), Gaps = 53/298 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+DAA+NAD  V V GL   VE E          G DR DL LP  Q  L+  +    K 
Sbjct: 608 ALDAARNADVVVFVGGLTGDVEGEEMKVDYPGFAGGDRTDLRLPAPQRALLEALHATGK- 666

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV +V+    A+ +++A+ +  + +IL   YPG+ GG A+   +FG+ NP GRLP+T+Y 
Sbjct: 667 PVVMVLTGGSALAVDWAQAH--LPAILMSWYPGQRGGTAVGQALFGEVNPAGRLPVTFYR 724

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
           A+       ++P        GRTY++F G  +YPFG+GLSYT+F Y           KL 
Sbjct: 725 AD------QALPAFDDYAMEGRTYRYFRGTPLYPFGHGLSYTRFDYG----------KLH 768

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
            D                 A  + DD + K      Q+EV N GK  G EV  +Y +   
Sbjct: 769 LD-----------------APRIADDGRLK-----LQVEVANTGKRAGDEVAQLYVRRLA 806

Query: 687 IAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS-LLASGAHTILVG 742
            A    +Q + G++RV +A G+   + F ++A ++L+  D+A  + ++ +G + + +G
Sbjct: 807 AAPGDAQQTLRGFQRVHLAPGERRTLTFELDAQQALRQYDDARGAYVVPAGRYEVRIG 864


>gi|255013451|ref|ZP_05285577.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. 2_1_7]
 gi|410103695|ref|ZP_11298616.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
 gi|409236424|gb|EKN29231.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
          Length = 868

 Score =  289 bits (740), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 178/469 (37%), Positives = 241/469 (51%), Gaps = 54/469 (11%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           S   K  D+P+ +  LP  ER  DL+ R+T  EK+ QM ++   + RLG+P Y+WW+EAL
Sbjct: 18  SCSEKQQDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEAL 77

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HGV+  GR                AT FP  I   A+F+++   +    VS EARA Y+ 
Sbjct: 78  HGVARAGR----------------ATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQ 121

Query: 124 GNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
                      GLTFW+PNIN+ RDPRWGR +ET GEDPY+  +  +   RGLQ      
Sbjct: 122 YQKDKEYDRYKGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG----- 176

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
                D    K  AC KHYA +    W   +R  FD   T +D+ ET++  FE  V EGD
Sbjct: 177 ----DDPNYYKTHACAKHYAVHSGPEW---NRHEFDVEATPRDLYETYLPAFEALVKEGD 229

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHK-----FLN 290
           V  VMC+YNR  G P C+  KLL   +R  W +   I+SDC +I       K       +
Sbjct: 230 VQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETH 289

Query: 291 DTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
              E A A  +  G DL+CG  Y      A+  GKI+E D+D SLR L      LG FD 
Sbjct: 290 PDAESASADAVLNGTDLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDP 348

Query: 351 SPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
             +  Y  +  + + +P+HI  A + AR+ IVLLKN N  LPL+  NIK +A+VGP+A  
Sbjct: 349 DERVPYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAAD 407

Query: 409 TKAMIGNYEGTPCRYTSPMDGFYAYSKVIN----YAPGC---ADIVCQN 450
           +  +  NY G P +  + ++G    +KV N    Y  GC   AD V  +
Sbjct: 408 STMLWANYNGFPTKTVTIVEGI--RNKVPNAEVIYELGCNHTADFVVTD 454



 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 94/297 (31%), Positives = 138/297 (46%), Gaps = 51/297 (17%)

Query: 456 AAIDAAKNADATVIVAGL-------DLSVEAEG---KDRVDLLLPGFQTELINKVADAAK 505
           A     K+AD  V V G+       ++ V+AEG    DR ++ +P  Q E++  +    K
Sbjct: 597 ATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALVATGK 656

Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
            PV  V+ +  A+ +N+   N  + +IL   Y G+EGG A+ADV+FG YNP GRLPIT+Y
Sbjct: 657 -PVVYVVCTGSALALNW--ENDHVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPITFY 713

Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
           ++         +P     +  GRTY++     +YPFGYGLSYT F YK A        KL
Sbjct: 714 KS------VDQLPDFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYKNA--------KL 759

Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
            KD+        + +N+                  T   ++ N GKMDG EV  +Y K P
Sbjct: 760 SKDK--------IASNE----------------SVTLSFDIANTGKMDGDEVAQIYIKNP 795

Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
                 +K +  ++RV + AG    V   +         DN     +  G + IL G
Sbjct: 796 NDPAGPLKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYG 852


>gi|423331656|ref|ZP_17309440.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
           CL03T12C09]
 gi|409230226|gb|EKN23094.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
           CL03T12C09]
          Length = 868

 Score =  289 bits (740), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 178/469 (37%), Positives = 241/469 (51%), Gaps = 54/469 (11%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           S   K  D+P+ +  LP  ER  DL+ R+T  EK+ QM ++   + RLG+P Y+WW+EAL
Sbjct: 18  SCSEKQQDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEAL 77

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HGV+  GR                AT FP  I   A+F+++   +    VS EARA Y+ 
Sbjct: 78  HGVARAGR----------------ATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQ 121

Query: 124 GNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
                      GLTFW+PNIN+ RDPRWGR +ET GEDPY+  +  +   RGLQ      
Sbjct: 122 YQKDKEYDRYKGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG----- 176

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
                D    K  AC KHYA +    W   +R  FD   T +D+ ET++  FE  V EGD
Sbjct: 177 ----DDPNYYKTHACAKHYAVHSGPEW---NRHEFDVEATPRDLYETYLPAFEALVKEGD 229

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHK-----FLN 290
           V  VMC+YNR  G P C+  KLL   +R  W +   I+SDC +I       K       +
Sbjct: 230 VQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETH 289

Query: 291 DTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
              E A A  +  G DL+CG  Y      A+  GKI+E D+D SLR L      LG FD 
Sbjct: 290 PDAESASADAVLNGTDLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDP 348

Query: 351 SPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
             +  Y  +  + + +P+HI  A + AR+ IVLLKN N  LPL+  NIK +A+VGP+A  
Sbjct: 349 DERVPYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAAD 407

Query: 409 TKAMIGNYEGTPCRYTSPMDGFYAYSKVIN----YAPGC---ADIVCQN 450
           +  +  NY G P +  + ++G    +KV N    Y  GC   AD V  +
Sbjct: 408 STMLWANYNGFPTKTVTIVEGI--RNKVPNAEVIYELGCNHTADFVVTD 454



 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 94/297 (31%), Positives = 139/297 (46%), Gaps = 51/297 (17%)

Query: 456 AAIDAAKNADATVIVAGL-------DLSVEAEG---KDRVDLLLPGFQTELINKVADAAK 505
           A     K+AD  V V G+       ++ V+AEG    DR ++ +P  Q E++  +    K
Sbjct: 597 ATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALVATGK 656

Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
            PV  V+ +  A+ +N+   N  + +IL   Y G+EGG A+ADV+FG YNP GRLPIT+Y
Sbjct: 657 -PVVYVVCTGSALALNW--ENDHVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPITFY 713

Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
           ++         +P     +  GRTY++     +YPFGYGLSYT F YK A        KL
Sbjct: 714 KS------VDQLPDFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYKNA--------KL 759

Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
            KD+        + +N+                  T   ++ N GKMDG EV  +Y K P
Sbjct: 760 SKDK--------IASNE----------------SVTLSFDIANTGKMDGDEVAQIYIKNP 795

Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
                 +K +  ++RV + AG +  V   +         DN     +  G + IL G
Sbjct: 796 NDPAGPLKAMKAFKRVNVKAGSAQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYG 852


>gi|423342048|ref|ZP_17319763.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409219455|gb|EKN12417.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 868

 Score =  289 bits (740), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 176/469 (37%), Positives = 246/469 (52%), Gaps = 54/469 (11%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           S   +  D+P+ +  LP  ER  DL++R+T  EKV QM +    + RLG+P Y+WW+EAL
Sbjct: 18  SCSQRQEDYPFRNPDLPIDERIDDLLKRLTAEEKVGQMMNTTPAIERLGIPQYDWWNEAL 77

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HGV+  G+                AT FP  I   A+F++    +    VS EARA Y+ 
Sbjct: 78  HGVARAGK----------------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQ 121

Query: 124 GNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
                      GLTFW+PNIN+ RDPRWGR +ET GEDPY+  R  +  V+GLQ      
Sbjct: 122 YQKDKEYDRYKGLTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQG----- 176

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
                D +  K  AC KHYA +    W   +R  FD  VT +D+ +T++  FE  V EG+
Sbjct: 177 ----DDPKYFKTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKEGN 229

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE------SHKFL 289
           V  VMC+YNR  G P C+  KLL   +R  W +   I+SDC +I    E       H+  
Sbjct: 230 VQEVMCAYNRYQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWERDERTPRHETH 289

Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
            D  E A A  +  G DL+CG+ Y    + A++ GKI+E D+D SLR L      LG FD
Sbjct: 290 PDA-ESASADAVLNGTDLECGNSYRAL-VKALKDGKISENDLDVSLRRLLKGRFELGMFD 347

Query: 350 GSPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
              +  Y  +  N + +P+H+  A E A + +VLLKN N  LPL +  I+ +A+VGP+A 
Sbjct: 348 PDERVPYAQIPYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPL-SKTIRKIAVVGPNAA 406

Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAY---SKVINYAPGC---ADIVCQN 450
            +  +  NY G P    + ++G       ++VI Y  GC   AD V Q+
Sbjct: 407 DSTMLWANYNGFPTHTVTILEGIRNKVPDTEVI-YELGCNHAADFVIQD 454



 Score =  132 bits (331), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 101/315 (32%), Positives = 142/315 (45%), Gaps = 55/315 (17%)

Query: 442 GCADIVCQNNSMIPA--AIDAAKNADATVIV---------AGLDLSVEAEG---KDRVDL 487
           G AD+  Q  +  P   A  AAK  DA VIV          G ++ V  EG    DR ++
Sbjct: 579 GSADLNFQIGTRRPVDYAATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNI 638

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
            LP  Q E++ K   A   PV  V+ +  A+ +N+ + N  I +IL   Y G+E G A+A
Sbjct: 639 ELPKVQQEMV-KALKATGKPVVYVLCTGSALALNWEEAN--IDAILNAWYGGQEAGTAVA 695

Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSY 607
           D++FG YNP GRLP+T+Y++         +P     +  GRTY++     +YPFGYGLSY
Sbjct: 696 DILFGDYNPSGRLPVTFYKS------IDQLPDFEDYSMKGRTYRYMTETPLYPFGYGLSY 749

Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
           T F Y+ A   K    K+ KDQ                               T   ++ 
Sbjct: 750 TNFAYRNA---KLSSGKIAKDQSV-----------------------------TLTFDIA 777

Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           N GKMDG E+  +Y K P      IK +  + RV + AG S +V   +         DN 
Sbjct: 778 NTGKMDGDEIAQIYIKNPNDPEGPIKALKAFLRVHVKAGDSQEVNIELAPETFHSFNDNT 837

Query: 728 ANSLLASGAHTILVG 742
               +  G + IL G
Sbjct: 838 QTMEVRPGKYQILYG 852


>gi|385776908|ref|YP_005649476.1| glycoside hydrolase family protein [Sulfolobus islandicus REY15A]
 gi|323475656|gb|ADX86262.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           REY15A]
          Length = 754

 Score =  289 bits (739), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 213/691 (30%), Positives = 348/691 (50%), Gaps = 106/691 (15%)

Query: 88  ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
           +T+FP  I   +++N  L   I   + ++AR +      G+    SP ++V +DPRWGR 
Sbjct: 101 STAFPQAIGLASTWNLELVMDIASVIRSQARLV------GVNQCLSPVLDVCKDPRWGRC 154

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN- 205
            ET GEDPY+V    + Y+ GLQ           D+   ++ A  KH+AA+       N 
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQ----------GDN---QLVATAKHFAAHGFPEGGRNI 201

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
            + H  +R    +++ETF+ PFE+ V  G V S+M +Y+ ++GIP   +P+LL   +R +
Sbjct: 202 AQVHVGNR----ELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQE 257

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQ 323
           W F G +VSD D I+ +   H+  ++  E A+   L++G+D++    D Y+   + A+ +
Sbjct: 258 WGFDGIVVSDYDGIRQLETIHRVASNKMEAAIL-ALESGVDIEFPTIDCYSEPLVNALTE 316

Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLK 383
           G + E+ ID ++  +  +  RLG  D     +N     + + +  ELA + AR+ IVLLK
Sbjct: 317 GLVPESLIDRAVERVLRIKDRLGLLDNPFVNENSVPEKLDDHKSRELALKTARESIVLLK 376

Query: 384 NDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT-------PCRYTSPMDGF---YAY 433
           N+N  LPL + N+  +A++GP+AN  + M+G+Y  T            + + G       
Sbjct: 377 NENNILPL-SKNVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGVVKKVGE 435

Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV----AGLDLS------------- 476
           SKV+ YA GC DI  ++      AI+ A+ AD  + V    +GL LS             
Sbjct: 436 SKVL-YAKGC-DIASESKEGFAEAIEIARQADVIIAVMGEKSGLPLSWTDIPSEEEFKKY 493

Query: 477 --VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
             V  EG DR  L LPG Q EL+ ++    K P+ LV+++   + ++   N   +K+++ 
Sbjct: 494 QAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSPIIN--YVKAVIE 550

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
             +PGEEGG AIADVIFG YNP GRLPIT+  +   + + Y   P    ++F  R Y   
Sbjct: 551 AWFPGEEGGNAIADVIFGDYNPSGRLPITFPMDTGQIPLYYNRKP----SSF--RPYVML 604

Query: 594 DGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
               ++ FGYGLSYTQF+Y  +  +PK                  +G N           
Sbjct: 605 RSSPLFTFGYGLSYTQFEYSNLEVTPKE-----------------IGPNS---------- 637

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKV 711
                      I+V+N+GKM+G +VV +Y SK        +K++ G+ ++ +  G+  +V
Sbjct: 638 ------NIAISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEKRRV 691

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            F +   ++L   D+    ++  G + +L+G
Sbjct: 692 KFIL-PTEALAFYDSFMRLVVEKGEYQLLIG 721


>gi|298376791|ref|ZP_06986746.1| beta-glucosidase [Bacteroides sp. 3_1_19]
 gi|298266669|gb|EFI08327.1| beta-glucosidase [Bacteroides sp. 3_1_19]
          Length = 868

 Score =  289 bits (739), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 177/469 (37%), Positives = 243/469 (51%), Gaps = 54/469 (11%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           S   K  D+P+ + +LP  ER  DL+ R+T  EK+ QM ++   + RLG+P Y+WW+EAL
Sbjct: 18  SCSEKQQDYPFRNPELPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEAL 77

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HGV+  GR                AT FP  I   A+F+++   +    VS EARA Y+ 
Sbjct: 78  HGVARAGR----------------ATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQ 121

Query: 124 GNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
                      GLTFW+PNIN+ RDPRWGR +ET GEDPY+  +  +   RGLQ      
Sbjct: 122 YQKDKEYDRYKGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG----- 176

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
                D    K  AC KHYA +    W   +R  F++  T +D+ ET++  FE  V EGD
Sbjct: 177 ----DDPNYYKTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGD 229

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHK-----FLN 290
           V  VMC+YNR  G P C+  KLL   +R  W +   I+SDC +I       K       +
Sbjct: 230 VQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETH 289

Query: 291 DTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
              E A A  +  G DL+CG  Y      A+  GKI+E D+D SLR L      LG FD 
Sbjct: 290 PDAESASADAVLNGTDLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDP 348

Query: 351 SPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
             +  Y  +  + + +P+HI  A + AR+ IVLLKN N  LPL+  NIK +A+VGP+A  
Sbjct: 349 DERVPYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAAD 407

Query: 409 TKAMIGNYEGTPCRYTSPMDGFYAYSKVIN----YAPGC---ADIVCQN 450
           +  +  NY G P +  + ++G    +KV N    Y  GC   AD V  +
Sbjct: 408 STMLWANYNGFPSKTVTIVEGI--RNKVPNAEVIYELGCNHTADFVVTD 454



 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 94/297 (31%), Positives = 138/297 (46%), Gaps = 51/297 (17%)

Query: 456 AAIDAAKNADATVIVAGL-------DLSVEAEG---KDRVDLLLPGFQTELINKVADAAK 505
           A     K+AD  V V G+       ++ V+AEG    DR ++ +P  Q E++  +    K
Sbjct: 597 ATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALVATGK 656

Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
            PV  V+ +  A+ +N+   N  + +IL   Y G+EGG A+ADV+FG YNP GRLPIT+Y
Sbjct: 657 -PVVYVVCTGSALALNW--ENDHVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPITFY 713

Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
           ++         +P     +  GRTY++     +YPFGYGLSYT F YK A        KL
Sbjct: 714 KS------VDQLPDFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYKNA--------KL 759

Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
            KD+        + +N+                  T   ++ N GKMDG EV  +Y K P
Sbjct: 760 SKDK--------IASNE----------------SVTLSFDIANTGKMDGDEVAQIYIKNP 795

Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
                 +K +  ++RV + AG    V   +         DN     +  G + IL G
Sbjct: 796 NDPAGPLKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYG 852


>gi|262381651|ref|ZP_06074789.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           2_1_33B]
 gi|262296828|gb|EEY84758.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           2_1_33B]
          Length = 868

 Score =  289 bits (739), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 177/469 (37%), Positives = 243/469 (51%), Gaps = 54/469 (11%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           S   K  D+P+ + +LP  ER  DL+ R+T  EK+ QM ++   + RLG+P Y+WW+EAL
Sbjct: 18  SCSEKQQDYPFRNPELPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEAL 77

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HGV+  GR                AT FP  I   A+F+++   +    VS EARA Y+ 
Sbjct: 78  HGVARAGR----------------ATVFPQAIAMAATFDDNAVHETFTIVSDEARAKYHQ 121

Query: 124 GNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
                      GLTFW+PNIN+ RDPRWGR +ET GEDPY+  +  +   RGLQ      
Sbjct: 122 YQKDKEYDRYKGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG----- 176

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
                D    K  AC KHYA +    W   +R  F++  T +D+ ET++  FE  V EGD
Sbjct: 177 ----DDPNYYKTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGD 229

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHK-----FLN 290
           V  VMC+YNR  G P C+  KLL   +R  W +   I+SDC +I       K       +
Sbjct: 230 VQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETH 289

Query: 291 DTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
              E A A  +  G DL+CG  Y      A+  GKI+E D+D SLR L      LG FD 
Sbjct: 290 PDAESASADAVLNGTDLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDP 348

Query: 351 SPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
             +  Y  +  + + +P+HI  A + AR+ IVLLKN N  LPL+  NIK +A+VGP+A  
Sbjct: 349 DERVPYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAAD 407

Query: 409 TKAMIGNYEGTPCRYTSPMDGFYAYSKVIN----YAPGC---ADIVCQN 450
           +  +  NY G P +  + ++G    +KV N    Y  GC   AD V  +
Sbjct: 408 STMLWANYNGFPSKTVTIVEGI--RNKVPNAEVIYELGCNHTADFVVTD 454



 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 94/297 (31%), Positives = 138/297 (46%), Gaps = 51/297 (17%)

Query: 456 AAIDAAKNADATVIVAGL-------DLSVEAEG---KDRVDLLLPGFQTELINKVADAAK 505
           A     K+AD  V V G+       ++ V+AEG    DR ++ +P  Q E++  +    K
Sbjct: 597 ATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALVATGK 656

Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
            PV  V+ +  A+ +N+   N  + +IL   Y G+EGG A+ADV+FG YNP GRLPIT+Y
Sbjct: 657 -PVVYVVCTGSALALNW--ENDHVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPITFY 713

Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
           ++         +P     +  GRTY++     +YPFGYGLSYT F YK A        KL
Sbjct: 714 KS------VDQLPDFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYKNA--------KL 759

Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
            KD+        + +N+                  T   ++ N GKMDG EV  +Y K P
Sbjct: 760 SKDK--------IASNE----------------SVTLSFDIANTGKMDGDEVAQIYIKNP 795

Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
                 +K +  ++RV + AG    V   +         DN     +  G + IL G
Sbjct: 796 NDPAGPLKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYG 852


>gi|150007848|ref|YP_001302591.1| glycoside hydrolase family protein [Parabacteroides distasonis ATCC
           8503]
 gi|301310124|ref|ZP_07216063.1| beta-glucosidase [Bacteroides sp. 20_3]
 gi|423336365|ref|ZP_17314112.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
           CL09T03C24]
 gi|149936272|gb|ABR42969.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Parabacteroides distasonis ATCC 8503]
 gi|300831698|gb|EFK62329.1| beta-glucosidase [Bacteroides sp. 20_3]
 gi|409240840|gb|EKN33614.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
           CL09T03C24]
          Length = 868

 Score =  288 bits (738), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 177/469 (37%), Positives = 242/469 (51%), Gaps = 54/469 (11%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           S   K  D+P+ +  LP  ER  DL+ R+T  EK+ QM ++   + RLG+P Y+WW+EAL
Sbjct: 18  SCSEKQQDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEAL 77

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HGV+  GR                AT FP  I   A+F+++   +    VS EARA Y+ 
Sbjct: 78  HGVARAGR----------------ATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQ 121

Query: 124 GNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
                      GLTFW+PNIN+ RDPRWGR +ET GEDPY+  +  +   RGLQ      
Sbjct: 122 YQKDKEYDRYKGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG----- 176

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
                D    K  AC KHYA +    W   +R  F++  T +D+ ET++  FE  V EGD
Sbjct: 177 ----DDPNYYKTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGD 229

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHK-----FLN 290
           V  VMC+YNR  G P C+  KLL   +R  W +   I+SDC +I       K       +
Sbjct: 230 VQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETH 289

Query: 291 DTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
              E A A  +  G DL+CG  Y      A+  GKI+E D+D SLR L      LG FD 
Sbjct: 290 PDAESASADAVLNGTDLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDP 348

Query: 351 SPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
             +  Y  +  + + +P+HI  A + AR+ IVLLKN N  LPL+  NIK +A+VGP+A  
Sbjct: 349 DERVPYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAAD 407

Query: 409 TKAMIGNYEGTPCRYTSPMDGFYAYSKVIN----YAPGC---ADIVCQN 450
           +  +  NY G P +  + ++G    +KV N    Y  GC   AD V  +
Sbjct: 408 STMLWANYNGFPTKTVTIVEGI--RNKVPNAEVIYELGCNHTADFVVTD 454



 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 94/297 (31%), Positives = 138/297 (46%), Gaps = 51/297 (17%)

Query: 456 AAIDAAKNADATVIVAGL-------DLSVEAEG---KDRVDLLLPGFQTELINKVADAAK 505
           A     K+AD  V V G+       ++ V+AEG    DR ++ +P  Q E++  +    K
Sbjct: 597 ATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALVATGK 656

Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
            PV  V+ +  A+ +N+   N  + +IL   Y G+EGG A+ADV+FG YNP GRLPIT+Y
Sbjct: 657 -PVVYVVCTGSALALNW--ENDHVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPITFY 713

Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
           ++         +P     +  GRTY++     +YPFGYGLSYT F YK A        KL
Sbjct: 714 KS------VDQLPDFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYKNA--------KL 759

Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
            KD+        + +N+                  T   ++ N GKMDG EV  +Y K P
Sbjct: 760 SKDK--------IASNE----------------SVTLSFDIANTGKMDGDEVAQIYIKNP 795

Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
                 +K +  ++RV + AG    V   +         DN     +  G + IL G
Sbjct: 796 NDPAGPLKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYG 852


>gi|256840106|ref|ZP_05545615.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
           D13]
 gi|256739036|gb|EEU52361.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
           D13]
          Length = 868

 Score =  288 bits (738), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 177/469 (37%), Positives = 242/469 (51%), Gaps = 54/469 (11%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           S   K  D+P+ +  LP  ER  DL+ R+T  EK+ QM ++   + RLG+P Y+WW+EAL
Sbjct: 18  SCSEKQQDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEAL 77

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HGV+  GR                AT FP  I   A+F+++   +    VS EARA Y+ 
Sbjct: 78  HGVARAGR----------------ATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQ 121

Query: 124 GNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
                      GLTFW+PNIN+ RDPRWGR +ET GEDPY+  +  +   RGLQ      
Sbjct: 122 YQKDKEYDRYKGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG----- 176

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
                D    K  AC KHYA +    W   +R  F++  T +D+ ET++  FE  V EGD
Sbjct: 177 ----DDPNYYKTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGD 229

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHK-----FLN 290
           V  VMC+YNR  G P C+  KLL   +R  W +   I+SDC +I       K       +
Sbjct: 230 VQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETH 289

Query: 291 DTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
              E A A  +  G DL+CG  Y      A+  GKI+E D+D SLR L      LG FD 
Sbjct: 290 PDAESASADAVLNGTDLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDP 348

Query: 351 SPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
             +  Y  +  + + +P+HI  A + AR+ IVLLKN N  LPL+  NIK +A+VGP+A  
Sbjct: 349 DERVPYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAAD 407

Query: 409 TKAMIGNYEGTPCRYTSPMDGFYAYSKVIN----YAPGC---ADIVCQN 450
           +  +  NY G P +  + ++G    +KV N    Y  GC   AD V  +
Sbjct: 408 STMLWANYNGFPSKTVTIVEGI--RNKVPNAEVIYELGCNHTADFVVTD 454



 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 94/297 (31%), Positives = 138/297 (46%), Gaps = 51/297 (17%)

Query: 456 AAIDAAKNADATVIVAGL-------DLSVEAEG---KDRVDLLLPGFQTELINKVADAAK 505
           A     K+AD  V V G+       ++ V+AEG    DR ++ +P  Q E++  +    K
Sbjct: 597 ATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALVATGK 656

Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
            PV  V+ +  A+ +N+   N  + +IL   Y G+EGG A+ADV+FG YNP GRLPIT+Y
Sbjct: 657 -PVVYVVCTGSALALNW--ENDHVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPITFY 713

Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
           ++         +P     +  GRTY++     +YPFGYGLSYT F YK A        KL
Sbjct: 714 KS------VDQLPDFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYKNA--------KL 759

Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
            KD+        + +N+                  T   ++ N GKMDG EV  +Y K P
Sbjct: 760 SKDK--------IASNE----------------SVTLSFDIANTGKMDGDEVAQIYIKNP 795

Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
                 +K +  ++RV + AG    V   +         DN     +  G + IL G
Sbjct: 796 NDPAGPLKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEIRPGKYQILYG 852


>gi|206901280|ref|YP_002250567.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
 gi|206740383|gb|ACI19441.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
          Length = 762

 Score =  288 bits (736), Expect = 9e-75,   Method: Compositional matrix adjust.
 Identities = 217/695 (31%), Positives = 346/695 (49%), Gaps = 110/695 (15%)

Query: 87  GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
           GAT FP  I   ++F   L +++   +    +A  N+ + GL   SP +++ RDPRWGR 
Sbjct: 106 GATVFPQAIGMASTFEPELIRRVSDVIRQHMKAA-NV-HQGL---SPVLDIPRDPRWGRT 160

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
            ET GEDPY+V R A  YV+GLQ   G ++          I A  KH+ AY +   EG  
Sbjct: 161 EETFGEDPYLVSRMATEYVKGLQ---GEDWREG-------IVATVKHFTAYGIS--EGA- 207

Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
           R    ++V E++++E F+ PFE+ + EG   S+M +Y+ ++G+P  +   LL + +R +W
Sbjct: 208 RNLGPAKVGERELREVFLFPFEVAIKEGQAGSLMNAYHEIDGVPCASSKFLLTKILRWEW 267

Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG--DYYTNFTMGAVQQG 324
            F GY+VSD  +++ +   HK   D KE AV   L+AG+D++    D Y    + AV++G
Sbjct: 268 GFKGYVVSDYIAVRMLENFHKVARDAKEAAVL-ALEAGIDIELPSVDCYGEPLIQAVKEG 326

Query: 325 KIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN------ICNPQHIELAAEAARQG 378
            I+E  I+ S+  +      LG FD      NL K+          P+  +L+ E AR+ 
Sbjct: 327 LISEEVINASVERVLRAKFMLGLFD-----DNLEKDPKKVYEVFDKPEFRDLSREVARRS 381

Query: 379 IVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-------------------EGT 419
           IVLLKND G LPL+  N+K +A++GP+A+  + + G+Y                   E  
Sbjct: 382 IVLLKND-GTLPLSK-NLKKVAVIGPNADNPRNLHGDYSYTAHIPSIAEGLEGVKVEEKC 439

Query: 420 PCRYTSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG---- 472
             R  S ++G     +    + YA GC DI+  +      AI+ AK AD  + V G    
Sbjct: 440 VVRTVSILEGIRNKVSPETEVLYAKGC-DIISDSKDGFAEAIEMAKEADVIIAVMGEESG 498

Query: 473 -LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKS 531
                +  EG DR  L L G Q +L+ ++    K P+ LV+++     + +   N  + +
Sbjct: 499 LFHRGISGEGNDRTTLELFGVQRDLLKELHKLGK-PIVLVLINGRPQALKWEHEN--LNA 555

Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYK 591
           IL   YPGEEGG A+ADVIFG YNP G+LPI+ + A   +IP         N  P     
Sbjct: 556 ILEAWYPGEEGGNAVADVIFGDYNPSGKLPIS-FPAVTGQIPVY------YNRKPSAFSD 608

Query: 592 FFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
           + D     +YPFG+GLSYT F+Y         D+K+  ++                    
Sbjct: 609 YIDESAKPLYPFGHGLSYTTFEYS--------DLKISPEK-------------------- 640

Query: 650 IDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH--IKQVIGYERVFIAAGQ 707
           ++ ++  +  FT    ++N G  DG EVV +Y     +A     +K++ G++++++  G+
Sbjct: 641 VNSLEKVEISFT----IKNTGNRDGEEVVQLYIHDQ-VASLERPVKELKGFKKIYLKPGE 695

Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           S +V FT+   + L   D     ++  G   +++G
Sbjct: 696 SKRVTFTLYP-EQLAFYDEFMRFIVEKGVFEVMIG 729


>gi|237712573|ref|ZP_04543054.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
 gi|345512524|ref|ZP_08792050.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
 gi|423239901|ref|ZP_17221016.1| hypothetical protein HMPREF1065_01639 [Bacteroides dorei
           CL03T12C01]
 gi|229435409|gb|EEO45486.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
 gi|229453894|gb|EEO59615.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
 gi|392644890|gb|EIY38624.1| hypothetical protein HMPREF1065_01639 [Bacteroides dorei
           CL03T12C01]
          Length = 788

 Score =  288 bits (736), Expect = 9e-75,   Method: Compositional matrix adjust.
 Identities = 244/812 (30%), Positives = 378/812 (46%), Gaps = 165/812 (20%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
           Y + K P  ER +DL+ +MTL EK  QM  L YG  R+    LP   W +E    G+  I
Sbjct: 43  YENPKAPLEERVQDLLSQMTLEEKSCQMATL-YGSGRVLKDALPQDNWKTEVWKDGIGNI 101

Query: 70  GRRTNS----------PPGTHFDSE--------------VP--------------GATSF 91
               N           P   H D++              +P               AT F
Sbjct: 102 DEEHNGLGTFKSEYSFPYTKHVDAKHAIQRWFVEETRLGIPVDFTNEGIRGLCHDRATYF 161

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
           P      A++N+ L  +IG+  + EA A+      G T  +SP +++ +DPRWGR +ET 
Sbjct: 162 PAQCGQGATWNKELIARIGEVEAKEAVAL------GYTNIYSPILDIAQDPRWGRCVETY 215

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+VG        G Q +  ++ H         + A  KH+A Y +     + +   
Sbjct: 216 GEDPYLVGEL------GKQMITSLQKH--------NLVATPKHFAVYSIPVGGRDGKTRT 261

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
           D  V  ++M+  +I PF M   E     VM SYN  +G P       L + +R +W F G
Sbjct: 262 DPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKG 321

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
           Y+VSD ++++ I   HK  N T ED +A+ + AGL++      T+FT           AV
Sbjct: 322 YVVSDSEAVEFISSKHKVAN-TYEDGIAQAVNAGLNIR-----THFTPPADFILPLRKAV 375

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN---ICNPQHIELAAEAARQG 378
             GKI++  +D  +  +  V   LG FD    Y+  GK     + + +H  ++ EAARQ 
Sbjct: 376 ADGKISQETLDKRVAEILRVKFWLGLFDNP--YRGNGKQAEQIVHSKEHQAVSLEAARQS 433

Query: 379 IVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY---TSPMDGFYA--- 432
           +VLLKN+   LPL+  +++++A++GP+A+    +I       CRY    +P+   Y    
Sbjct: 434 LVLLKNEMNLLPLSK-SLRSIAVIGPNADERTQLI-------CRYGPANAPIKTVYQGIK 485

Query: 433 ----YSKVINYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGL 473
               +++VI Y  GC DI+                +   ++  AI AAK A+  V+V G 
Sbjct: 486 ERLPHTEVI-YRKGC-DIIDPHFPESEVLDFPKTTEEARLMEEAIHAAKQAEVVVMVLGG 543

Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
           +     E + R  L LPG Q EL+  V    K PV LV++   A  IN+A  +  + +IL
Sbjct: 544 NELTVREDRSRTSLNLPGRQEELLKAVCATGK-PVVLVLLDGRASSINYAAAH--VPAIL 600

Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
              +PGE  G+A+A+ +FG YNPGGRL +T +  +  +IP+ + P +P ++    T  + 
Sbjct: 601 HAWFPGEFCGQAVAEALFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDESSSTSVY- 657

Query: 594 DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQC--RDINYTVGTNKPPCAAVLID 651
              V+YPFG+GLSYT F Y         D+K+   +Q    DIN                
Sbjct: 658 --GVLYPFGHGLSYTTFSYG--------DLKISPLRQGVQGDIN---------------- 691

Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAK 710
            + CK         ++N GK+ G EVV +Y +       T+ K + G+ER+ + AG+   
Sbjct: 692 -ISCK---------IKNTGKIKGDEVVQLYLRDEVSSVTTYTKVLRGFERISLEAGEEQM 741

Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           V F +   + L + D   N  +  G   +++G
Sbjct: 742 VHFRLRP-QDLGLWDKNMNFRVEPGKFKVMIG 772


>gi|270296098|ref|ZP_06202298.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
 gi|270273502|gb|EFA19364.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
          Length = 798

 Score =  287 bits (735), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 233/799 (29%), Positives = 370/799 (46%), Gaps = 139/799 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
           Y D+  P   R ++L+ +MTL EK  QM  L YG  R+    LP   W +E    G+  I
Sbjct: 53  YEDSYAPLEARVQNLLSQMTLEEKSCQMATL-YGSGRVLNDALPSDNWKNEVWKDGIGNI 111

Query: 70  GRRTNS----------PPGTHFDSE--------------VP--------------GATSF 91
               N           P   H  ++              +P               AT F
Sbjct: 112 DEEHNGLGSFKSAYSFPYAHHVKTKHAIQRWFVENTRLGIPVDFTNEGIRGLCHDRATYF 171

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPG 151
           P      A++N+ L  +IG+    EAR    LG   +  +SP +++ +DPRWGR +ET G
Sbjct: 172 PAQCGQGATWNKELIAQIGEA---EAREASVLGYTNI--YSPILDIAQDPRWGRCVETYG 226

Query: 152 EDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFD 211
           EDPY  G+     +  LQ                K+ +  KH+A Y +     + +   D
Sbjct: 227 EDPYHAGQMGKQMILSLQKN--------------KLVSTPKHFAVYSIPVGGRDGKTRTD 272

Query: 212 SRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGY 271
             V  ++M+  ++ PF +  +E     VM SYN  +G P       L + +R +W F GY
Sbjct: 273 PHVAPREMRTLYLDPFRVAFHEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGY 332

Query: 272 IVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAVQ 322
           +VSD ++++ I   H+  N   EDAVA+ + AGL++      T+FT           AV+
Sbjct: 333 VVSDSEAVEFISTKHQVANGY-EDAVAQAVNAGLNIR-----THFTPPADFILPLRSAVK 386

Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICN-PQHIELAAEAARQGIVL 381
           +GKI++  ++  +  +  V   LG FD   +        I + P+H +LA EAARQ +VL
Sbjct: 387 KGKISQETLNQRVAEILRVKFWLGLFDNPYRGDEKRAGQIVHSPEHQQLALEAARQSLVL 446

Query: 382 LKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVINY 439
           LKN++  LPL+  +I+++A++GP+A+  + +I  Y       T+  +G         + Y
Sbjct: 447 LKNEHQTLPLSK-SIRSVAVIGPNADERQQLICRYGPANAHITTIYEGIKKMLPQADVVY 505

Query: 440 APGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
             GC DI+                Q   M+  AI+AAK A+ TV+V G +     E + R
Sbjct: 506 KKGC-DIIDPHFPESEVLEFPKAAQEAQMMEEAIEAAKGAEVTVMVLGGNELTVREDRSR 564

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
             L LPG Q EL+ K+    K PV LV++   A  INFA  +  + +I+   +PGE GG+
Sbjct: 565 TSLDLPGRQKELLKKICQLGK-PVVLVMIDGRASSINFAATH--VPAIIHAWFPGEFGGQ 621

Query: 545 AIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYG 604
           AIA+ +FG YNPGGRL +T +  +  +IP+ + P +P ++    T  +     +YPFG+G
Sbjct: 622 AIAEALFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDESSETSVY---GALYPFGHG 676

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
           LSYT F+Y         D+ +   +Q    N ++                          
Sbjct: 677 LSYTTFQYS--------DLAISPSKQGVQGNISISCT----------------------- 705

Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLKI 723
            ++N+G+ +G EVV +Y +    + T   QV+ G+ER+ +    S  V F +   + L I
Sbjct: 706 -IKNIGQREGDEVVQLYLRDEVSSVTTYTQVLRGFERITLKPEASHTVHFELTP-QELGI 763

Query: 724 VDNAANSLLASGAHTILVG 742
            D   N  +  G   +++G
Sbjct: 764 WDKQMNFTVEPGMFKVMIG 782


>gi|227831319|ref|YP_002833099.1| glycoside hydrolase family protein [Sulfolobus islandicus L.S.2.15]
 gi|227457767|gb|ACP36454.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           L.S.2.15]
          Length = 754

 Score =  287 bits (735), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 211/691 (30%), Positives = 347/691 (50%), Gaps = 106/691 (15%)

Query: 88  ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
           +T+FP  I   +++N  L   I   + ++ R +      G+    SP ++V +DPRWGR 
Sbjct: 101 STAFPQAIGLASTWNPELVMDIASVIRSQGRLV------GVNQCLSPVLDVCKDPRWGRC 154

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN- 205
            ET GEDPY+V    + Y+ GLQ           D+   ++ A  KH+AA+       N 
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQ----------GDN---QLVATAKHFAAHGFPEGGRNI 201

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
            + H  +R    +++ETF+ PFE+ V  G V S+M +Y+ ++GIP   +P+LL   +R +
Sbjct: 202 AQVHVGNR----ELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQE 257

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQ 323
           W F G +VSD D I+ +   H+  ++  E A+   L++G+D++    D Y    + A+++
Sbjct: 258 WGFDGIVVSDYDGIRQLETIHRVASNKMEAAIL-ALESGVDIEFPTIDCYGEPLVNALKE 316

Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLK 383
           G + E+ ID ++  +  +  RLG  D     +N     + + +  ELA + AR+ IVLLK
Sbjct: 317 GLVPESLIDRAVERVLRIKDRLGLLDNPFVNENSVPEKLDDHKSRELALKTARESIVLLK 376

Query: 384 NDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT-------PCRYTSPMDGF---YAY 433
           N+N  LPL + N+  +A++GP+AN  + M+G+Y  T            + + G       
Sbjct: 377 NENNILPL-SKNVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIVKKVGE 435

Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV----AGLDLS------------- 476
           SKV+ YA GC DI  ++      AI+ A+ AD  + +    +GL LS             
Sbjct: 436 SKVL-YAKGC-DIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSKEEFKKY 493

Query: 477 --VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
             V  EG DR  L LPG Q EL+ ++    K P+ LV+++   + ++   N   +K+++ 
Sbjct: 494 QAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSSIIN--YVKAVIE 550

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
             +PGEEGG AIADVIFG YNP GRLPIT+  +   + + Y   P    ++F  R Y   
Sbjct: 551 AWFPGEEGGNAIADVIFGDYNPSGRLPITFPMDTGQIPLYYNRKP----SSF--RPYVML 604

Query: 594 DGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
               ++ FGYGLSYTQF+Y  +  +PK                  +G N           
Sbjct: 605 RSSPLFTFGYGLSYTQFEYSNLEVTPKE-----------------IGPNS---------- 637

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKV 711
                      I+V+N+GKM+G +VV +Y SK        +K++ G+ ++ +  G+  +V
Sbjct: 638 ------NIAISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEKRRV 691

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            F +   ++L   D+    ++  G + +L+G
Sbjct: 692 KFIL-PTEALAFYDSFMRLVVEKGEYQLLIG 721


>gi|284998833|ref|YP_003420601.1| glycoside hydrolase family protein [Sulfolobus islandicus L.D.8.5]
 gi|284446729|gb|ADB88231.1| glycoside hydrolase, family 3 domain protein [Sulfolobus islandicus
           L.D.8.5]
          Length = 754

 Score =  287 bits (735), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 211/691 (30%), Positives = 347/691 (50%), Gaps = 106/691 (15%)

Query: 88  ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
           +T+FP  I   +++N  L   I   + ++ R +      G+    SP ++V +DPRWGR 
Sbjct: 101 STAFPQAIGLASTWNPELVMDIASVIRSQGRLV------GVNQCLSPVLDVCKDPRWGRC 154

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN- 205
            ET GEDPY+V    + Y+ GLQ           D+   ++ A  KH+AA+       N 
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQ----------GDN---QLVATAKHFAAHGFPEGGRNI 201

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
            + H  +R    +++ETF+ PFE+ V  G V S+M +Y+ ++GIP   +P+LL   +R +
Sbjct: 202 AQVHVGNR----ELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQE 257

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQ 323
           W F G +VSD D I+ +   H+  ++  E A+   L++G+D++    D Y    + A+++
Sbjct: 258 WGFDGIVVSDYDGIRQLETIHRVASNKMEAAIL-ALESGVDIEFPTIDCYGEPLVNALKE 316

Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLK 383
           G + E+ ID ++  +  +  RLG  D     +N     + + +  ELA + AR+ IVLLK
Sbjct: 317 GLVPESLIDRAVERVLRIKDRLGLLDNPFVNENSVPEKLDDHKSRELALKTARESIVLLK 376

Query: 384 NDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT-------PCRYTSPMDGF---YAY 433
           N+N  LPL + N+  +A++GP+AN  + M+G+Y  T            + + G       
Sbjct: 377 NENNILPL-SKNVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIVKKVGE 435

Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV----AGLDLS------------- 476
           SKV+ YA GC DI  ++      AI+ A+ AD  + +    +GL LS             
Sbjct: 436 SKVL-YAKGC-DIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSEEEFKKY 493

Query: 477 --VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
             V  EG DR  L LPG Q EL+ ++    K P+ LV+++   + ++   N   +K+++ 
Sbjct: 494 QAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSSIIN--YVKAVIE 550

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
             +PGEEGG AIADVIFG YNP GRLPIT+  +   + + Y   P    ++F  R Y   
Sbjct: 551 AWFPGEEGGNAIADVIFGDYNPSGRLPITFPMDTGQIPLYYNRKP----SSF--RPYVML 604

Query: 594 DGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
               ++ FGYGLSYTQF+Y  +  +PK                  +G N           
Sbjct: 605 RSSPLFTFGYGLSYTQFEYSNLEVTPKE-----------------IGPNS---------- 637

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKV 711
                      I+V+N+GKM+G +VV +Y SK        +K++ G+ ++ +  G+  +V
Sbjct: 638 ------NIAISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEKRRV 691

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            F +   ++L   D+    ++  G + +L+G
Sbjct: 692 KFIL-PTEALAFYDSFMRLVVEKGEYQLLIG 721


>gi|423229063|ref|ZP_17215468.1| hypothetical protein HMPREF1063_01288 [Bacteroides dorei
           CL02T00C15]
 gi|423244903|ref|ZP_17225977.1| hypothetical protein HMPREF1064_02183 [Bacteroides dorei
           CL02T12C06]
 gi|392634816|gb|EIY28728.1| hypothetical protein HMPREF1063_01288 [Bacteroides dorei
           CL02T00C15]
 gi|392640944|gb|EIY34735.1| hypothetical protein HMPREF1064_02183 [Bacteroides dorei
           CL02T12C06]
          Length = 788

 Score =  287 bits (735), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 242/811 (29%), Positives = 376/811 (46%), Gaps = 163/811 (20%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
           Y + K P  ER +DL+ +MTL EK  QM  L YG  R+    LP   W +E    G+  I
Sbjct: 43  YENPKAPLEERVQDLLSQMTLEEKSCQMATL-YGSGRVLKDALPQDNWKTEVWKDGIGNI 101

Query: 70  GRRTNS----------PPGTHFDSE--------------VP--------------GATSF 91
               N           P   H D++              +P               AT F
Sbjct: 102 DEEHNGLGTFKSEYSFPYTKHVDAKHAIQRWFVEETRLGIPVDFTNEGIRGLCHDRATYF 161

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPG 151
           P      A++N+ L  +IG+  + EA A+          +SP +++ +DPRWGR +ET G
Sbjct: 162 PAQCGQGATWNKELIARIGEVEAKEAVAL-----EYTNIYSPILDIAQDPRWGRCVETYG 216

Query: 152 EDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFD 211
           EDPY+VG        G Q +  ++ H         + A  KH+A Y +     + +   D
Sbjct: 217 EDPYLVGEL------GKQMITSLQKH--------NLVATPKHFAVYSIPVGGRDGKTRTD 262

Query: 212 SRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGY 271
             V  ++M+  +I PF M   E     VM SYN  +G P       L + +R +W F GY
Sbjct: 263 PHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGY 322

Query: 272 IVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAVQ 322
           +VSD ++++ I   HK  N T ED +A+ + AGL++      T+FT           AV 
Sbjct: 323 VVSDSEAVEFISSKHKVAN-TYEDGIAQAVNAGLNIR-----THFTPPADFILPLRKAVA 376

Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN---ICNPQHIELAAEAARQGI 379
            GKI++  +D  +  +  V   LG FD    Y+  GK     + + +H  ++ EAARQ +
Sbjct: 377 DGKISQETLDKRVAEILRVKFWLGLFDNP--YRGNGKQAEQIVHSKEHQAVSLEAARQSL 434

Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY---TSPMDGFYA---- 432
           VLLKN+   LPL+  +++++A++GP+A+    +I       CRY    +P+   Y     
Sbjct: 435 VLLKNEMNLLPLSK-SLRSIAVIGPNADERTQLI-------CRYGPANAPIKTVYQGIKE 486

Query: 433 ---YSKVINYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLD 474
              +++VI Y  GC DI+                +   ++  AI AAK A+  V+V G +
Sbjct: 487 RLPHTEVI-YRKGC-DIIDPHFPESEVLDFPKTTEEARLMEEAIHAAKQAEVVVMVLGGN 544

Query: 475 LSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
                E + R  L LPG Q EL+  V    K PV LV++   A  IN+A  +  + +IL 
Sbjct: 545 ELTVREDRSRTSLNLPGRQEELLKAVCATGK-PVVLVLLDGRASSINYAAAH--VPAILH 601

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
             +PGE  G+A+A+ +FG YNPGGRL +T +  +  +IP+ + P +P ++    T  +  
Sbjct: 602 AWFPGEFCGQAVAEALFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDESSSTSVY-- 657

Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQC--RDINYTVGTNKPPCAAVLIDD 652
             V+YPFG+GLSYT F Y         D+K+   +Q    DIN                 
Sbjct: 658 -GVLYPFGHGLSYTTFSYG--------DLKISPLRQGVQGDIN----------------- 691

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKV 711
           + CK         ++N GK+ G EVV +Y +       T+ K + G+ER+ + AG+   V
Sbjct: 692 ISCK---------IKNTGKIKGDEVVQLYLRDEVSSVTTYTKVLRGFERISLEAGEEQMV 742

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            F +   + L + D   N  +  G   +++G
Sbjct: 743 HFRLRP-QDLGLWDKNMNFRVEPGKFKVMIG 772


>gi|154493680|ref|ZP_02033000.1| hypothetical protein PARMER_03021 [Parabacteroides merdae ATCC
           43184]
 gi|423723902|ref|ZP_17698051.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
           CL09T00C40]
 gi|154086890|gb|EDN85935.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Parabacteroides merdae ATCC 43184]
 gi|409240709|gb|EKN33484.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
           CL09T00C40]
          Length = 868

 Score =  287 bits (735), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 173/469 (36%), Positives = 247/469 (52%), Gaps = 54/469 (11%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           S   +  D+P+ +  LP  ER  DL++R+T  EK+ QM +    + RLG+P Y+WW+EAL
Sbjct: 18  SCSQRQEDYPFRNPDLPIDERIDDLLKRLTAEEKIGQMMNTTPAIERLGIPEYDWWNEAL 77

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HGV+  G+                AT FP  I   A+F++    +    VS EARA Y+ 
Sbjct: 78  HGVARAGK----------------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQ 121

Query: 124 GNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
                      GLTFW+PNIN+ RDPRWGR +ET GEDPY+  R  +  V+GLQ      
Sbjct: 122 YQKNKEYDRYKGLTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQG----- 176

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
                D +  K  AC KHYA +    W   +R  FD  VT +D+ +T++  FE  V +G+
Sbjct: 177 ----DDPKYFKTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKKGN 229

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE------SHKFL 289
           V  VMC+YNR  G P C+  KLL   +R  W +   I+SDC +I    +       H+  
Sbjct: 230 VQEVMCAYNRYQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWQRDERTPRHETH 289

Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
            D  E A A  +  G DL+CG+ Y    + A+++GKI+E D+D SLR L      LG FD
Sbjct: 290 PDA-ESASADAVLNGTDLECGNSYKAL-IKALKEGKISENDLDVSLRRLLKGRFELGMFD 347

Query: 350 GSPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
              +  Y  +  N + +P+H+  A E A + +VLLKN N  LPL +  I+ +A+VGP+A 
Sbjct: 348 PDERVPYAQIPYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPL-SKTIRKIAVVGPNAA 406

Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAY---SKVINYAPGC---ADIVCQN 450
            +  +  NY G P    + ++G       ++VI Y  GC   AD V Q+
Sbjct: 407 DSTMLWANYNGFPTHTVTILEGIRNKVPDTEVI-YELGCNHAADFVIQD 454



 Score =  131 bits (330), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 101/315 (32%), Positives = 141/315 (44%), Gaps = 55/315 (17%)

Query: 442 GCADIVCQNNSMIPA--AIDAAKNADATVIV---------AGLDLSVEAEG---KDRVDL 487
           G AD+  Q  +  P   A  AAK  DA VIV          G ++ V  EG    DR ++
Sbjct: 579 GSADLNFQIGTRRPVDYAATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNI 638

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
            +P  Q E++ K   A   PV  V+ +  A+ +N+   N  I +IL   Y G+E G A+A
Sbjct: 639 EIPKVQQEMV-KALKATGKPVVYVLCTGSALALNWEDAN--IDAILNAWYGGQEAGTAVA 695

Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSY 607
           D++FG YNP GRLP+T+Y++         +P     +  GRTY++     +YPFGYGLSY
Sbjct: 696 DILFGDYNPSGRLPVTFYKS------IDQLPDFEDYSMKGRTYRYMTETPLYPFGYGLSY 749

Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
           T F Y+ A   K    K+ KDQ                               T   ++ 
Sbjct: 750 TNFAYRNA---KLSSGKITKDQSV-----------------------------TLTFDIA 777

Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           N GKMDG EV  +Y K P      IK +  + RV + AG S +V   +         DN 
Sbjct: 778 NTGKMDGDEVAQIYIKNPNDPEGPIKALKAFLRVHVKAGDSQEVNIELTPEAFHSFNDNT 837

Query: 728 ANSLLASGAHTILVG 742
               +  G + IL G
Sbjct: 838 QTMEVRPGKYQILYG 852


>gi|393786911|ref|ZP_10375043.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
           CL02T12C05]
 gi|392658146|gb|EIY51776.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
           CL02T12C05]
          Length = 863

 Score =  287 bits (735), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 168/430 (39%), Positives = 231/430 (53%), Gaps = 41/430 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            P+ +  LP  ER +DLV R+TL EKV  M D +  VPRLG+  Y WW+EALHGV   G 
Sbjct: 22  LPFNNPDLPVEERVEDLVRRLTLHEKVLLMCDYSSSVPRLGIKQYNWWNEALHGVGRAGL 81

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN------ 125
                           AT FP  I   A+F++   K++ + VS EARA Y+         
Sbjct: 82  ----------------ATVFPQAIGMAATFDDCAVKQVFECVSDEARAKYHHSENKDGSE 125

Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
              GLTFW+PN+N+ RDPRWGR  ET GEDPY+  R  +  VRGLQ          S+S+
Sbjct: 126 RYRGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQG--------PSESK 177

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFD-SRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             K+ AC KHYA +    W   +R  FD   ++ +D+ ET++  F+  V +G V  VMC+
Sbjct: 178 YDKLHACAKHYALHSGPEW---NRHRFDVENISPRDLWETYLPAFKALVQQGGVKEVMCA 234

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI-VESHKFLNDTKEDAVARVL 301
           YNR  G P C   +LL   +R +W F G +VSDC +I    ++ H   + TKE AVA  +
Sbjct: 235 YNRFEGEPCCGSNRLLYNILREEWGFDGLVVSDCGAISDFYLKGHHETHSTKESAVAAAV 294

Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGK 359
           KAG DLDCG  Y +    AV++G I E  ID SL  L      LG  D      + ++  
Sbjct: 295 KAGTDLDCGVDYQSLEK-AVEKGIITEKQIDVSLSRLLKARFELGLMDEEHLVSWSDIPY 353

Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
             + + +H   A E AR+ + LLKN NG LPL+  +   + ++GP+AN +  M GNY G 
Sbjct: 354 TVVDSEKHRAKALEVARKSMTLLKNKNGTLPLSK-HCGKIVVIGPNANDSIMMWGNYNGF 412

Query: 420 PCRYTSPMDG 429
           P    + ++G
Sbjct: 413 PSHTVTILEG 422



 Score =  113 bits (283), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 83/296 (28%), Positives = 133/296 (44%), Gaps = 53/296 (17%)

Query: 463 NADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVI 512
           +A+A V V G+   VE E          G DR  + LP  Q +L+ ++    K P+ L++
Sbjct: 599 DAEAIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELYKTGK-PIILIL 657

Query: 513 MSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI 572
            S  A  I  +       +I+   YPG+ GG A+ADV+FG YNP GRLP+T+Y+      
Sbjct: 658 CSGSA--IGLSAEVDLADAIIQAWYPGQAGGTAVADVLFGDYNPAGRLPVTFYKTT---- 711

Query: 573 PYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCR 632
               +P     N  GRTY++F G  ++PFGYGLSYT F+   A   K            +
Sbjct: 712 --EQLPDFEDYNMQGRTYRYFKGEALFPFGYGLSYTSFEIGKAQLSK------------K 757

Query: 633 DINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI 692
            I+     N                      + ++N G+ DG EV+ VY +        +
Sbjct: 758 RIHANESVN--------------------LDLWIKNTGERDGEEVIQVYIRKLKDKEGPL 797

Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVGEGVGG 747
           K +  ++RV + +G+  ++   +    S +  D   N + + +G + +L G    G
Sbjct: 798 KTLRAFKRVHVKSGEKKQISIHL-PNDSFEFFDPEFNVMRVMAGEYEVLYGTSSEG 852


>gi|1749831|emb|CAA91219.1| beta-xylo-glucosidase [Thermoanaerobacter brockii]
          Length = 730

 Score =  287 bits (734), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 206/698 (29%), Positives = 341/698 (48%), Gaps = 100/698 (14%)

Query: 87  GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
           GAT FP  I   +++N  + +K+   +  + +A+           +P +++ RDPRWGR 
Sbjct: 57  GATIFPQTIGVASTWNNEIVEKMASVIREQMKAV-----GARQALAPLLDITRDPRWGRT 111

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
            ET GEDPY+V R  ++Y+RGLQ          ++S    I A  KH+  Y   N EG  
Sbjct: 112 EETFGEDPYLVMRMGVSYIRGLQ----------TESLKEGIVATGKHFVGYG--NSEGGM 159

Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
            +   + + E++++E F+ PFE  V E  +SS+M  Y+ ++G+P     KLLN  +R DW
Sbjct: 160 NWA-PAHIPERELREVFLYPFEAAVKEAKLSSIMPGYHELDGVPCHKSKKLLNDILRKDW 218

Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQG 324
            F G +VSD  +I  + E H   +D K+ A    L+AG+D++    DYY       ++ G
Sbjct: 219 GFEGIVVSDYFAISQLYEYHHVTSD-KKGAAKLALEAGVDVELPSTDYYGLPLRELIESG 277

Query: 325 KIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKN 384
           +I    ++ +++ +  +   LG F+     +          +  ELA + A++ IVLLKN
Sbjct: 278 EIDIDFVNEAVKRVLKIKFELGLFENPYINEEKAVEIFDTNEQRELAYKIAQESIVLLKN 337

Query: 385 DNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR-------------YTSPM---- 427
           +N  LPL   ++K++A++GP+A++ + MIG+Y   PC              + +P+    
Sbjct: 338 ENNLLPLKK-DLKSIAVIGPNADSIRNMIGDY-AYPCHIESLLEMRETDNVFNTPLPESL 395

Query: 428 ---DGFYAYSKVIN-------------YAPGCADIVCQNNSMIPAAIDAAKNADATVIVA 471
              D +     V+              YA GC D++  +      A++ AK AD  V+V 
Sbjct: 396 EAKDIYVPIVTVLQGIKAKVSSNTEVLYAKGC-DVLNNSKDGFKEAVEIAKQADVAVVVV 454

Query: 472 G-----LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           G      D     E +DR DL LPG Q ELI  + +    PV +V+++   + I++    
Sbjct: 455 GDKSGLTDGCTSGESRDRADLNLPGVQEELIKAIYETGT-PVIVVLINGRPMSISWIAE- 512

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA-NYVKIPYTSMPLRPVNNF 585
            KI +I+    PGEEGGRA+ADVIFG YNPGG+LPI+  ++   + + Y   P    +++
Sbjct: 513 -KIPAIIEAWLPGEEGGRAVADVIFGDYNPGGKLPISIPQSVGQLPVYYYHKPSGGRSHW 571

Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
            G   +    P +YPFGYGLSYT+F Y                      N  +   K   
Sbjct: 572 KGDYVELSTKP-LYPFGYGLSYTEFSY---------------------TNLNISNRK--- 606

Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIA 704
                  V  +D      ++++N G + G EVV +Y     ++ T  +K++ G++R+ + 
Sbjct: 607 -------VSLRDRMVEISVDIKNTGTLKGDEVVQLYIHQEALSVTRPVKELKGFKRITLD 659

Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           AG+   V F + + + L   D     ++  G   +++G
Sbjct: 660 AGEEKTVIFKL-SIEQLGFYDENMEYVVEPGRVDVMIG 696


>gi|383114360|ref|ZP_09935124.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
 gi|313693934|gb|EFS30769.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
          Length = 863

 Score =  286 bits (733), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 171/448 (38%), Positives = 242/448 (54%), Gaps = 46/448 (10%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S +PY D KL   +RA DL++R+TL EKV  M + +  +PRLG+  YEWW+EALHGV+  
Sbjct: 24  SKYPYQDTKLTVEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARA 83

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---- 125
           G                 AT FP  I   ASFN+ L  ++   VS EARA     N    
Sbjct: 84  GL----------------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQ 127

Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
                GLT W+PN+N+ RDPRWGR  ET GEDPY+ GR  +  VRGLQ  E  EY     
Sbjct: 128 YKRYQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYD---- 183

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVM 240
               K+ AC KH+A +    W   +R  F++  +  +D+ ET++  F+  V +  V  VM
Sbjct: 184 ----KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVM 236

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV--- 297
           C+YNR  G P C   +LL Q +R DW F G +V+DC +I    +  K  ++T  DA    
Sbjct: 237 CAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKK--HETHPDAAHAS 294

Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL 357
           A  + +G DL+CG  + + T  AV++G I+E  I+TS++ L      LG  + +  + N+
Sbjct: 295 ADAVLSGTDLECGGNFKSIT-DAVKKGLISEEKINTSVKRLLKARFELGEMNSTHPWSNI 353

Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
             + I  P+H ELA + A + +VLL+N+N  LPLN      +A++GP+AN +    GNY 
Sbjct: 354 PFSVIDCPKHKELALKMAHESLVLLQNNNNILPLNRQ--MKVAVIGPNANDSVMQWGNYN 411

Query: 418 GTPCRYTSPMDGFYAY--SKVINYAPGC 443
           G P    + ++G  A      I Y P C
Sbjct: 412 GFPSHTVTLLEGIRAKLPDAQIIYEPVC 439



 Score =  118 bits (295), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 95/296 (32%), Positives = 138/296 (46%), Gaps = 53/296 (17%)

Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
           ++  ++AD  +   G+   +E E          G DR ++ LP  Q E++  +    K  
Sbjct: 594 LNKLQSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKKT 653

Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
           V  V  S  A+ I     N    +IL   YPG+ GG A+ADV+FG YNP GRLPIT+Y++
Sbjct: 654 V-FVNFSGSAMAIVPETQN--CDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKS 710

Query: 568 NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
                 Y    ++      GRTY+F     +YPFGYGLSYT+F Y  A+  +S   KL K
Sbjct: 711 MQQLPDYEDYSMK------GRTYRFMTETPLYPFGYGLSYTRFSYGKATLNQS---KLTK 761

Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI 687
            ++                A+L              I V N+G+ DG EVV VY   P  
Sbjct: 762 GEK----------------AILT-------------IPVSNVGQRDGEEVVQVYICRPDD 792

Query: 688 AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
                K + G++RV IA G++  V   +    S +  D A N++   +G + IL G
Sbjct: 793 KEGPQKTLRGFQRVSIAKGKTQNVQIEL-PYDSFEWFDAATNTIRPLNGTYKILYG 847


>gi|423344787|ref|ZP_17322476.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
           CL03T12C32]
 gi|409224378|gb|EKN17311.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
           CL03T12C32]
          Length = 866

 Score =  286 bits (733), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 173/469 (36%), Positives = 247/469 (52%), Gaps = 54/469 (11%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           S   +  D+P+ +  LP  ER  DL++R+T  EK+ QM +    + RLG+P Y+WW+EAL
Sbjct: 16  SCSQRQEDYPFRNPDLPIDERIDDLLKRLTAEEKIGQMMNTTPAIERLGIPEYDWWNEAL 75

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HGV+  G+                AT FP  I   A+F++    +    VS EARA Y+ 
Sbjct: 76  HGVARAGK----------------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQ 119

Query: 124 GNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
                      GLTFW+PNIN+ RDPRWGR +ET GEDPY+  R  +  V+GLQ      
Sbjct: 120 YQKNKEYDRYKGLTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGLAVVKGLQG----- 174

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
                D +  K  AC KHYA +    W   +R  FD  VT +D+ +T++  FE  V +G+
Sbjct: 175 ----DDPKYFKTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKKGN 227

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE------SHKFL 289
           V  VMC+YNR  G P C+  KLL   +R  W +   I+SDC +I    +       H+  
Sbjct: 228 VQEVMCAYNRYQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWQRDERTPRHETH 287

Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
            D  E A A  +  G DL+CG+ Y    + A+++GKI+E D+D SLR L      LG FD
Sbjct: 288 PDA-ESASADAVLNGTDLECGNSYKAL-IKALKEGKISENDLDVSLRRLLKGRFELGMFD 345

Query: 350 GSPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
              +  Y  +  N + +P+H+  A E A + +VLLKN N  LPL +  I+ +A+VGP+A 
Sbjct: 346 PDERVPYAQIPYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPL-SKTIRKIAVVGPNAA 404

Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAY---SKVINYAPGC---ADIVCQN 450
            +  +  NY G P    + ++G       ++VI Y  GC   AD V Q+
Sbjct: 405 DSTMLWANYNGFPTHTVTILEGIRNKVPDTEVI-YELGCNHAADFVIQD 452



 Score =  131 bits (330), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 101/315 (32%), Positives = 141/315 (44%), Gaps = 55/315 (17%)

Query: 442 GCADIVCQNNSMIPA--AIDAAKNADATVIV---------AGLDLSVEAEG---KDRVDL 487
           G AD+  Q  +  P   A  AAK  DA VIV          G ++ V  EG    DR ++
Sbjct: 577 GSADLNFQIGTRRPVDYAATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNI 636

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
            +P  Q E++ K   A   PV  V+ +  A+ +N+   N  I +IL   Y G+E G A+A
Sbjct: 637 EIPKVQQEMV-KALKATGKPVVYVLCTGSALALNWEDAN--IDAILNAWYGGQEAGTAVA 693

Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSY 607
           D++FG YNP GRLP+T+Y++         +P     +  GRTY++     +YPFGYGLSY
Sbjct: 694 DILFGDYNPSGRLPVTFYKS------IDQLPDFEDYSMKGRTYRYMTETPLYPFGYGLSY 747

Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
           T F Y+ A   K    K+ KDQ                               T   ++ 
Sbjct: 748 TNFAYRNA---KLSSGKITKDQSV-----------------------------TLTFDIA 775

Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
           N GKMDG EV  +Y K P      IK +  + RV + AG S +V   +         DN 
Sbjct: 776 NTGKMDGDEVAQIYIKNPNDPEGPIKALKAFLRVHVKAGDSQEVNIELTPEAFHSFNDNT 835

Query: 728 ANSLLASGAHTILVG 742
               +  G + IL G
Sbjct: 836 QTMEVRPGKYQILYG 850


>gi|160886913|ref|ZP_02067916.1| hypothetical protein BACOVA_04927 [Bacteroides ovatus ATCC 8483]
 gi|423288977|ref|ZP_17267828.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
           CL02T12C04]
 gi|423294866|ref|ZP_17272993.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
           CL03T12C18]
 gi|156107324|gb|EDO09069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
 gi|392668741|gb|EIY62235.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
           CL02T12C04]
 gi|392676057|gb|EIY69498.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
           CL03T12C18]
          Length = 863

 Score =  286 bits (733), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 171/448 (38%), Positives = 242/448 (54%), Gaps = 46/448 (10%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S +PY D KL   +RA DL++R+TL EKV  M + +  +PRLG+  YEWW+EALHGV+  
Sbjct: 24  SKYPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARA 83

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---- 125
           G                 AT FP  I   ASFN+ L  ++   VS EARA     N    
Sbjct: 84  GL----------------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQ 127

Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
                GLT W+PN+N+ RDPRWGR  ET GEDPY+ GR  +  VRGLQ  E  EY     
Sbjct: 128 YKRYQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYD---- 183

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVM 240
               K+ AC KH+A +    W   +R  F++  +  +D+ ET++  F+  V +  V  VM
Sbjct: 184 ----KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVM 236

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV--- 297
           C+YNR  G P C   +LL Q +R DW F G +V+DC +I    +  K  ++T  DA    
Sbjct: 237 CAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKK--HETHPDAAHAS 294

Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL 357
           A  + +G DL+CG  + + T  AV++G I+E  I+TS++ L      LG  + +  + N+
Sbjct: 295 ADAVLSGTDLECGGNFKSIT-DAVKKGLISEEKINTSVKRLLKARFELGEMNSTHPWSNI 353

Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
             + I  P+H ELA + A + +VLL+N+N  LPLN      +A++GP+AN +    GNY 
Sbjct: 354 PFSVIDCPKHKELALKMAHESLVLLQNNNNILPLNRQ--MKVAVIGPNANDSVMQWGNYN 411

Query: 418 GTPCRYTSPMDGFYAY--SKVINYAPGC 443
           G P    + ++G  A      I Y P C
Sbjct: 412 GFPSHTVTLLEGIRAKLPDAQIIYEPVC 439



 Score =  118 bits (295), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 95/296 (32%), Positives = 138/296 (46%), Gaps = 53/296 (17%)

Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
           ++  ++AD  +   G+   +E E          G DR ++ LP  Q E++  +    K  
Sbjct: 594 LNKLQSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKKT 653

Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
           V  V  S  A+ I     N    +IL   YPG+ GG A+ADV+FG YNP GRLPIT+Y++
Sbjct: 654 V-FVNFSGSAMAIVPETQN--CDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKS 710

Query: 568 NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
                 Y    ++      GRTY+F     +YPFGYGLSYT+F Y  A+  +S   KL K
Sbjct: 711 MQQLPDYEDYSMK------GRTYRFMTETPLYPFGYGLSYTRFSYGKATLNQS---KLTK 761

Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI 687
            ++                A+L              I V N+G+ DG EVV VY   P  
Sbjct: 762 GEK----------------AILT-------------IPVSNVGQRDGEEVVQVYICRPDD 792

Query: 688 AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
                K + G++RV IA G++  V   +    S +  D A N++   +G + IL G
Sbjct: 793 KEGPQKTLRGFQRVSIAKGKTQNVQIEL-PYDSFEWFDAATNTIRPLNGTYKILYG 847


>gi|403253118|ref|ZP_10919422.1| xylosidase [Thermotoga sp. EMP]
 gi|402811565|gb|EJX26050.1| xylosidase [Thermotoga sp. EMP]
          Length = 778

 Score =  286 bits (732), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 241/811 (29%), Positives = 378/811 (46%), Gaps = 152/811 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGD---------------------LAYGVPRLG 52
           Y D   P   R +DL+ RMTL EKV Q+G                      L  G+ ++ 
Sbjct: 4   YRDPSQPIEVRVRDLLSRMTLEEKVAQLGSVWGYELIDERGKFNKEKAKELLKNGIGQIT 63

Query: 53  LP---LYEWWSEALHGVSFIGR------RTNSPPGTHFDSEVP----GATSFPTVILTTA 99
            P         EA   V+ I R      R   P   H +        G T+FP  I   +
Sbjct: 64  RPGGSTNLEPQEAAELVNEIQRFLVEETRLGIPAMIHEECLTGYMGLGGTNFPQAIAMAS 123

Query: 100 SFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR 159
           +++  L +K+   +  + R +    + GL   +P ++V RDPRWGR  ET GE PY+V R
Sbjct: 124 TWDPDLIEKMTTAIREDMRKIG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVAR 178

Query: 160 YAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDM 219
             ++YV+GLQ   G +  +        + A  KH+A Y     EG   +   + + E++ 
Sbjct: 179 MGVSYVKGLQ---GEDIKKG-------VVATVKHFAGYSAS--EGGKNWA-PTNIPEREF 225

Query: 220 QETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSI 279
           +E F+ PFE  V E +V SVM SY+ ++G+P  A+ KLL   +R DW F G +VSD  ++
Sbjct: 226 KEVFLFPFEAAVKEANVLSVMNSYSEIDGVPCAANRKLLTDILRKDWGFEGIVVSDYFAV 285

Query: 280 QTIVESHKFLNDTKEDAVARVLKAGLDLD-----CGDYYTNFTMGAVQQGKIAEADIDTS 334
           + + + H+   + K +A    L+AG+D++     C  Y  +     V++G I+EA ID +
Sbjct: 286 KVLEDYHRIARN-KSEAARLALEAGIDVELPKTECYQYLKDL----VEKGIISEALIDEA 340

Query: 335 LRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTG 394
           +  + ++   LG F+    Y  + K  I N  H ++A E AR+ I+LLKND G LPL   
Sbjct: 341 VARVLMLKFMLGLFENP--YVEVEKAKIEN--HRDIALEIARKSIILLKND-GILPLQKN 395

Query: 395 NIKTLALVGPHANATKAMIGNYE----------------GTPC----------------- 421
             K +AL+GP+A   + ++G+Y                 G P                  
Sbjct: 396 --KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSIEEHM 453

Query: 422 -RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG------LD 474
               S +D F        YA GC ++  ++ S    AI+ AK +D  ++V G      LD
Sbjct: 454 KSIPSVLDAFKEEGIEFEYAKGC-EVTGEDRSGFEEAIEIAKKSDVAIVVVGDKSGLTLD 512

Query: 475 LSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
            +   E +D  +L LPG Q EL+ +VA   K PV LV+++     +    +  K+ +IL 
Sbjct: 513 CTT-GESRDMANLKLPGVQEELVLEVAKTGK-PVVLVLITGRPYSLKNVVD--KVNAILQ 568

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
           V  PGE GGRAI D+I+GK NP G+LPI++   A  + + +   P    +++ G      
Sbjct: 569 VWLPGEAGGRAIVDIIYGKVNPSGKLPISFPRSAGQIPVFHYVKPSGGRSHWHGDYVDES 628

Query: 594 DGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
             P ++PFG+GLSYT+F+Y  +   PK V                     PP   V+I  
Sbjct: 629 TKP-LFPFGHGLSYTKFEYSNLRIEPKEV---------------------PPAGEVVI-- 664

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKV 711
                     +++VEN+G  DG EVV +Y      + T  +K++ G++RV + A +   V
Sbjct: 665 ----------KVDVENIGDRDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKEKKTV 714

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            F ++    L   D     ++  G   ++VG
Sbjct: 715 VFRLH-MDVLAYYDRDMKLVVEPGEFKVMVG 744


>gi|423303655|ref|ZP_17281654.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
           CL03T00C23]
 gi|423307623|ref|ZP_17285613.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
           CL03T12C37]
 gi|392688019|gb|EIY81310.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
           CL03T00C23]
 gi|392689492|gb|EIY82769.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
           CL03T12C37]
          Length = 801

 Score =  286 bits (731), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 233/799 (29%), Positives = 370/799 (46%), Gaps = 139/799 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
           Y D+  P   R ++L+ +MTL EK  QM  L YG  R+    LP   W +E    G+  I
Sbjct: 56  YEDSCAPLEVRVQNLLSQMTLEEKSCQMATL-YGSGRVLNDALPSDNWKNEVWKDGIGNI 114

Query: 70  GRRTNS----------PPGTHFDSE--------------VP--------------GATSF 91
               N           P   H  ++              +P               AT F
Sbjct: 115 DEEHNGLGSFKSAYSFPYAHHVKTKHAIQRWFVENTRLGIPVDFTNEGIRGLCHDRATYF 174

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPG 151
           P      A++N+ L  +IG+    EAR    LG   +  +SP +++ +DPRWGR +ET G
Sbjct: 175 PAQCGQGATWNKELIAQIGEA---EAREASVLGYTNI--YSPILDIAQDPRWGRCVETYG 229

Query: 152 EDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFD 211
           EDPY  G+     +  LQ                K+ +  KH+A Y +     + +   D
Sbjct: 230 EDPYHAGQMGKQMILSLQKN--------------KLVSTPKHFAVYSIPVGGRDGKTRTD 275

Query: 212 SRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGY 271
             V  ++M+  ++ PF +  +E     VM SYN  +G P       L + +R +W F GY
Sbjct: 276 PHVAPREMRTLYLDPFRVAFHEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGY 335

Query: 272 IVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAVQ 322
           +VSD ++++ I   H+  N   EDAVA+ + AGL++      T+FT           AV+
Sbjct: 336 VVSDSEAVEFISTKHQVANGY-EDAVAQAVNAGLNIR-----THFTPPADFILPLRSAVK 389

Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICN-PQHIELAAEAARQGIVL 381
           +GKI++  ++  +  +  V   LG FD   +        I + P+H +LA EAARQ +VL
Sbjct: 390 KGKISQETLNQRVAEILRVKFWLGLFDNPYRGDEKRAGQIVHSPEHQQLALEAARQSLVL 449

Query: 382 LKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVINY 439
           LKN++  LPL+  +I+++A++GP+A+  + +I  Y       T+  +G         + Y
Sbjct: 450 LKNEHQTLPLSK-SIRSVAVIGPNADERQQLICRYGPANAHITTIYEGIKKMLPQADVVY 508

Query: 440 APGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
             GC DI+                Q   M+  AI+AAK A+ TV+V G +     E + R
Sbjct: 509 KKGC-DIIDPHFPESEVLEFPKAAQEAQMMEEAIEAAKGAEVTVMVLGGNELTVREDRSR 567

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
             L LPG Q EL+ K+    K PV LV++   A  INFA  +  + +I+   +PGE GG+
Sbjct: 568 TSLDLPGRQEELLKKICQLGK-PVVLVMIDGRASSINFAATH--VPAIIHAWFPGEFGGQ 624

Query: 545 AIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYG 604
           AIA+ +FG YNPGGRL +T +  +  +IP+ + P +P ++    T  +     +YPFG+G
Sbjct: 625 AIAEALFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDESSETSVY---GALYPFGHG 679

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
           LSYT F+Y         D+ +   +Q    N ++                          
Sbjct: 680 LSYTTFQYS--------DLVISPSKQGVQGNISISCT----------------------- 708

Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLKI 723
            ++N+G+ +G EVV +Y +    + T   QV+ G+ER+ +    S  V F +   + L I
Sbjct: 709 -IKNIGQREGDEVVQLYLRDEVSSVTTYTQVLRGFERITLKPEASHTVHFELTP-QELGI 766

Query: 724 VDNAANSLLASGAHTILVG 742
            D   N  +  G   +++G
Sbjct: 767 WDKQMNFTVEPGMFKVMIG 785


>gi|15837447|ref|NP_298135.1| family 3 glycoside hydrolase [Xylella fastidiosa 9a5c]
 gi|9105751|gb|AAF83655.1|AE003924_1 family 3 glycoside hydrolase [Xylella fastidiosa 9a5c]
          Length = 882

 Score =  285 bits (730), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 170/435 (39%), Positives = 234/435 (53%), Gaps = 45/435 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           + A  LV +MTL EK+ Q  + A  +PRLG+P Y+WWSE LHG++  G            
Sbjct: 32  QHAAALVAKMTLQEKITQTMNAAPAIPRLGIPAYDWWSEGLHGIARNGY----------- 80

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N  L + +G   STEARA +NL           AGLT WSP
Sbjct: 81  -----ATVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLAGGPGKDHPRYAGLTLWSP 135

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDPY+ G+ A++++RGLQ         +    P  I A  KH
Sbjct: 136 NINIFRDPRWGRGMETYGEDPYLTGQLAVSFIRGLQG--------NIPDHPRTI-ATPKH 186

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           +A +   +     R  FD  V+  D++ T+   F   + +G   SVMC+YN ++G P CA
Sbjct: 187 FAVH---SGPEPGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACA 243

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              LLN  +R DW F+G++VSDCD+I  +   H F  D    A A  LK+G DL+CG+ Y
Sbjct: 244 SDWLLNTRLRNDWGFNGFVVSDCDAIDDMTRFHFFRQDNAS-ASAAALKSGNDLNCGNTY 302

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
            +    A+ +G I EA +D +L  L+    RLG         Y  +G  +I  P H  LA
Sbjct: 303 RDLNQ-AIARGDIDEALLDQALIRLFAARQRLGTLQPREHDPYATIGIKHIDTPAHRALA 361

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA Q +VLLKN    LPL  G   TLA++GP A++  A+  NY+GT     +P+ G  
Sbjct: 362 LQAAVQSLVLLKNSGNTLPLTPGT--TLAVLGPDADSLTALEANYQGTSSTPVTPLTGLR 419

Query: 432 AY--SKVINYAPGCA 444
               +  I+YA G +
Sbjct: 420 TRFGAAKIHYAQGAS 434



 Score =  142 bits (357), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 95/295 (32%), Positives = 137/295 (46%), Gaps = 52/295 (17%)

Query: 460 AAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVT 509
           A  +ADA V   GL   VE E          G DR  + LP  Q  L+  V    K P+ 
Sbjct: 607 AVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK-PLI 665

Query: 510 LVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANY 569
           +V+MS  AV +N+A+++    +IL   YPG+ GG AIA  + G  NPGGRLP+T+Y +  
Sbjct: 666 VVLMSGSAVALNWAQHH--ANAILAAWYPGQSGGTAIAQALAGDVNPGGRLPVTFYRSTQ 723

Query: 570 VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQ 629
              PY S       +  GRTY++F G  +YPFGYGLSYTQF Y+      +         
Sbjct: 724 DLPPYISY------DMTGRTYRYFKGQPLYPFGYGLSYTQFTYEAPQLSTAT-------- 769

Query: 630 QCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG 689
                                  +K  D   T    V N G   G EVV +Y +PP    
Sbjct: 770 -----------------------LKAGD-TLTVTAHVRNTGTRAGDEVVQLYLEPPHSPQ 805

Query: 690 THIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
             ++ ++G++RV +  G+S  + FT++  + L  V       + +G + + VG G
Sbjct: 806 APLRNLVGFKRVTLRPGESRLLTFTLD-TRQLSSVQQTGQRSVEAGHYHLFVGGG 859


>gi|298482082|ref|ZP_07000270.1| beta-glucosidase [Bacteroides sp. D22]
 gi|298271639|gb|EFI13212.1| beta-glucosidase [Bacteroides sp. D22]
          Length = 863

 Score =  285 bits (730), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 172/449 (38%), Positives = 241/449 (53%), Gaps = 46/449 (10%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S +PY D KL   +RA DL++R+TL EKV  M + +  +PRLG+  YEWW+EALHGV+  
Sbjct: 24  SKYPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARA 83

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---- 125
           G                 AT FP  I   ASFN+ L  ++   VS EARA     N    
Sbjct: 84  GL----------------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQ 127

Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
                GLT W+PN+N+ RDPRWGR  ET GEDPY+ GR  +  VRGLQ  E  EY     
Sbjct: 128 YKRYQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAVVRGLQGPEDAEYD---- 183

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVM 240
               K+ AC KH+A +    W   +R  F++  +  +D+ ET++  F+  V +  V  VM
Sbjct: 184 ----KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVM 236

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV--- 297
           C+YNR  G P C   +LL Q +R DW F G +V+DC +I    +  K  ++T  DAV   
Sbjct: 237 CAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKK--HETHPDAVHAS 294

Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL 357
           A  +  G DL+CG  + + T  AV++G I+E  I+TS++ L      LG  + +  + N+
Sbjct: 295 ADAVLNGTDLECGGNFKSIT-DAVKKGLISEEKINTSVKRLLKARFELGEMNPTHPWSNI 353

Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
             + I  P+H ELA + A + +VLL+N N  LPLN      +A++GP+AN +    GNY 
Sbjct: 354 PYSVIDCPKHKELALKMAHESLVLLQNKNNILPLNRQ--MKVAVIGPNANDSVMQWGNYN 411

Query: 418 GTPCRYTSPMDGFYAY--SKVINYAPGCA 444
           G P    + ++G  A      I Y P C 
Sbjct: 412 GFPSHTVTLLEGIRAKLPDAQIIYEPVCG 440



 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 96/296 (32%), Positives = 135/296 (45%), Gaps = 53/296 (17%)

Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
           ++  KNAD  +   G+   +E E          G DR ++ LP  Q E++  +    K  
Sbjct: 594 LNKLKNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKKT 653

Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
           V  V  S  A+ I          +IL   YPG+ GG A+ADV+FG YNP GRLPIT+Y++
Sbjct: 654 V-FVNFSGSAMAI--VPETQSCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKS 710

Query: 568 NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
                    +P     +  GRTY+F     +YPFGYGLSYT+F Y  A+  +S   KL+K
Sbjct: 711 ------IQQLPDYEDYSMKGRTYRFMTETPLYPFGYGLSYTRFSYGKATLNQS---KLNK 761

Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI 687
            +                             K    I V N+G+ DG EVV VY   P  
Sbjct: 762 GE-----------------------------KAILTIPVSNVGQRDGEEVVQVYICRPDD 792

Query: 688 AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
                K + G++RV IA G++  V   +    S +  D A N++   SG + IL G
Sbjct: 793 KEGPQKTLRGFQRVNIAKGKTQNVSIEL-PYDSFEWFDTATNTIRPLSGTYKILYG 847


>gi|336412679|ref|ZP_08593032.1| hypothetical protein HMPREF1017_00140 [Bacteroides ovatus
           3_8_47FAA]
 gi|335942725|gb|EGN04567.1| hypothetical protein HMPREF1017_00140 [Bacteroides ovatus
           3_8_47FAA]
          Length = 735

 Score =  285 bits (730), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 222/755 (29%), Positives = 351/755 (46%), Gaps = 87/755 (11%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLY-- 56
           Y DAK P  +R  DL+ RMTL EKV Q+     G              VP  +G  +Y  
Sbjct: 30  YKDAKAPIEKRIDDLISRMTLEEKVLQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFD 89

Query: 57  --EWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
                  ++   +    R   P    +D+     T +P  +    S+N  L ++     +
Sbjct: 90  INPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSA 149

Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
            EAR    +     TF SP I+V RDPRWGRV E  GEDPY  G +A   VRG       
Sbjct: 150 QEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRG------- 197

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
            Y  D  S   +++AC KHY  Y      G D  +  + ++ Q + +T++LP+EM V  G
Sbjct: 198 -YQGDDMSAENRMAACLKHYVGYGASE-AGRDYVY--TEISAQTLWDTYLLPYEMGVKAG 253

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
             +++M S+N ++G+P  A+P ++ + ++  W   G+IVSD  +++ +   ++ L  TK+
Sbjct: 254 -AATLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQGLAATKK 310

Query: 295 DAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
           DA      AGL++D   + Y       V++GK+  A +D S+R +  V  RLG F+    
Sbjct: 311 DAARYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFRLGLFERPYT 370

Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
                K+    PQ + +AA+ A + +VLLKNDN  LPL   N K +A+VGP A     ++
Sbjct: 371 PVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLT--NKKRIAVVGPMAKNGWDLL 428

Query: 414 GNY--EGTPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
           G++   G         DG  A       + YA GC      + S    A+D  + +D  +
Sbjct: 429 GSWCGHGKDTDVEMLYDGLTAEFGGEAELRYAMGCKP-QGNDRSGFAGALDVVRWSDVVI 487

Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
           +  G  L+   E   R  + LP  Q EL+ ++ +A K P+ LV+ +   +++N  +  P 
Sbjct: 488 VCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PIILVLSNGRPLELN--RMEPL 544

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPG 587
             +IL +  PG  G R++A ++ G+ NP G+L IT+ Y    + I Y     R    +  
Sbjct: 545 CDAILEIWQPGINGARSMAGILSGRINPSGKLAITFPYSTGQIPIYYNR---RKSGRWHQ 601

Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
             YK       Y FGYGLSYT+F+Y V  +P S  +K                       
Sbjct: 602 GFYKDITSDPFYSFGYGLSYTEFQYGVV-TPSSTTVK----------------------- 637

Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAG 706
                   +  K + ++ V N+GK DG+E V  +   P  + T  +K++  +E+ FI  G
Sbjct: 638 --------RGEKLSVEVTVTNVGKRDGAETVHWFISDPYCSITRPVKELKHFEKQFIKVG 689

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
           ++    F ++  + L  VD      L +G + I V
Sbjct: 690 ETRTFRFDVDLERDLGFVDGNGKRFLEAGEYNIWV 724


>gi|197106390|ref|YP_002131767.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
 gi|196479810|gb|ACG79338.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
          Length = 888

 Score =  285 bits (730), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 179/484 (36%), Positives = 251/484 (51%), Gaps = 54/484 (11%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D +LP   RA DLV RMTL EK +Q+G  A  +PRLG+P Y WW+E LHGV+  G   
Sbjct: 38  YRDTRLPAERRAADLVARMTLEEKSRQIGHTAPAIPRLGVPAYNWWNEGLHGVARAGI-- 95

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-----GNA-- 126
                         AT FP  I   A+++    +     + TE RA Y       G+   
Sbjct: 96  --------------ATVFPQAIGMAATWDVDRMRGTADVIGTEFRAKYAERVHPDGSTDW 141

Query: 127 --GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLT WSPNIN+ RDPRWGR  ET GEDPY+ GR  + ++RGLQ           D   
Sbjct: 142 YRGLTVWSPNINIFRDPRWGRGQETYGEDPYLTGRMGVAFIRGLQG---------QDPNF 192

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
            K  A  KHYA +       ++R   D   +  D+++T++  F   V EG V +VMC+YN
Sbjct: 193 FKTIATAKHYAVHSGPE---SNRHREDVHPSAYDLEDTYLPAFRAAVTEGKVQAVMCAYN 249

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLN-DTKEDAVARVLKA 303
            V+G+P CA   L++Q +R DW F G++VSDC +   I          T E+ + R L A
Sbjct: 250 AVDGVPACASEDLMDQRLRRDWGFSGHVVSDCGAAANIYREDSLAYVKTPEEGITRALNA 309

Query: 304 GLDLDCGDYYTNF------TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YK 355
           G+DL CGDY  ++      T+ AV++G + E  +D +L  L+   +RLG FD   +  + 
Sbjct: 310 GMDLVCGDYRADWNTEAEATVSAVRKGMLDETVLDGALVRLFADRIRLGLFDPPAEVPFS 369

Query: 356 NLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGN 415
            +       P+H  ++ E A+  + LLKND G LPL  G  + +A+VGP+A++  A+IGN
Sbjct: 370 KITAAQNDTPEHRAMSLEMAKASMTLLKND-GVLPLK-GEPRRIAVVGPNADSVDALIGN 427

Query: 416 YEGTPCRYTSPMDGFYA-YSKV-INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL 473
           Y GTP    + + G  A + K  + YA G   +V   +  +P   DA   ADA     GL
Sbjct: 428 YYGTPSNPVTVLAGIRARFPKAEVVYAEGTG-LVGPASLPVP---DAVLCADAACRTKGL 483

Query: 474 DLSV 477
              V
Sbjct: 484 KQEV 487



 Score =  149 bits (375), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 98/291 (33%), Positives = 141/291 (48%), Gaps = 54/291 (18%)

Query: 465 DATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMS 514
           D  V V GL   VE E          G DR  L LP  Q +L+ ++    K PV LV+M+
Sbjct: 613 DLVVFVGGLTARVEGEEMKLQVPGFAGGDRTSLDLPAPQQDLLRRLHATGK-PVVLVLMN 671

Query: 515 AGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPY 574
             A+ +N+A  N  + +I+   YPG EGG A+A ++ G Y+P GRLP+T+Y +     P+
Sbjct: 672 GSALSVNWADAN--LPAIVEAWYPGGEGGHAVAQLLAGDYSPAGRLPVTFYRSAGDLPPF 729

Query: 575 TSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVAS-SPKSVDIKLDKDQQCRD 633
               ++      GRTY++F G V+YPFGYGLSYT+F Y     S +SV            
Sbjct: 730 ADYAMK------GRTYRYFGGEVLYPFGYGLSYTRFSYGAPQLSARSV------------ 771

Query: 634 INYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIK 693
                                  D + T   +V N G MDG EVV +Y   PG  GT I+
Sbjct: 772 ---------------------SADGEITVTTQVTNTGGMDGEEVVQLYVSHPGRDGTPIR 810

Query: 694 QVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
            + G++R+ +  G++  V FT+   + L +VD   N  +  G   + VG G
Sbjct: 811 ALQGFQRIGLKRGETRPVSFTLKD-RQLSVVDAEGNRRVEPGRVEVWVGGG 860


>gi|224536087|ref|ZP_03676626.1| hypothetical protein BACCELL_00952 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224522306|gb|EEF91411.1| hypothetical protein BACCELL_00952 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 791

 Score =  285 bits (730), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 229/821 (27%), Positives = 378/821 (46%), Gaps = 153/821 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
           Y D   P  ER  DL+ +MTL EK+ QM  L YG  R+    LP   W    W + +   
Sbjct: 47  YEDPSAPMEERVNDLLSQMTLEEKICQMATL-YGSGRVLEDALPEEHWKQALWKDGIGNI 105

Query: 64  ----HGVSFIGRRTNSPPGTHFDSE--------------VP--------------GATSF 91
               +G+   G   + P   H  ++              +P               AT F
Sbjct: 106 DEEHNGLGTFGSEYSFPYNKHVKAKHEIQRWFVEETRLGIPVDFTNEGIRGLCHDRATFF 165

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
           P+     +++N+ L  +IG+  + EA A+      G T  +SP +++ +DPRWGR +E  
Sbjct: 166 PSQSGQGSTWNKELIARIGEVEAKEAIAL------GYTNIYSPILDICQDPRWGRSVECY 219

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+VG+       G Q ++ ++ HR        + +  KH+A Y +     + +   
Sbjct: 220 GEDPYLVGQL------GKQMIQSLQKHR--------LVSTVKHFAVYSIPVGGRDGKTRT 265

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
           D  V+ ++M+  ++ PF     E     VM SYN  +G P  +    L + +R ++ F G
Sbjct: 266 DPHVSPREMRTLYLEPFRRAFCEAGALGVMSSYNDYDGEPITSSHHFLTEILRQEYGFKG 325

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
           Y+VSD ++++ I   H  +++  E  VA+ + AGL++      T+FT           A+
Sbjct: 326 YVVSDSEAVEFITTKHHVVSNEVE-GVAQAVNAGLNIR-----THFTKPEDFVLPLRQAI 379

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICN-PQHIELAAEAARQGIV 380
           ++GK++   I++ +  +  +   LG FD   +     +  I +  +H ++A EAARQ +V
Sbjct: 380 KEGKVSPETINSRVADILRIKFWLGLFDNPYRGDEKQEEKIVHCKEHQQVALEAARQSLV 439

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY---TSPMDGFYAYSK-- 435
           LLKN+N  LPL    +K++A++GP+AN    +I       CRY    +P+   Y   K  
Sbjct: 440 LLKNENQLLPLKK-TVKSVAVIGPNANEQTQLI-------CRYGPANAPIKTVYQGIKEL 491

Query: 436 ----VINYAPGCADI--------------VCQNNSMIPAAIDAAKNADATVIVAGLDLSV 477
                + Y  GC  I                +   M+  A+ AA+NA+  V+V G     
Sbjct: 492 LPETEVVYRKGCEIIDSHFPESEILPFEKTTEEQQMLDEAVAAARNAEVVVLVLGGSELT 551

Query: 478 EAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGY 537
             E + R  L LPG Q EL+  +    K P  LV++   A  IN+A  N  I +IL   +
Sbjct: 552 VREDRSRTSLDLPGHQQELMQAIHATGK-PTVLVLLDGRAATINYA--NQYIPAILHAWF 608

Query: 538 PGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV 597
           PGE  G A+A+ +FG YNPGGRL +T +  +  +IP+ + P +P ++ P  T  +     
Sbjct: 609 PGEFAGTAVAEALFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDEPCETAVY---GA 663

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           +YPFGYGLSYT+F YK        ++++  ++Q      TV                   
Sbjct: 664 LYPFGYGLSYTKFSYK--------NLQITPEEQGPQGEITVSC----------------- 698

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
                  EV N+G   G EVV +Y +       T++K + G+ER+ +  G++ KV F + 
Sbjct: 699 -------EVTNIGDRTGDEVVQLYLRDEVSSVTTYMKVLRGFERITLNPGETKKVTFILT 751

Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLN 757
             + L + D     ++  G   +++G     +    + N+ 
Sbjct: 752 P-QDLGLWDKNNKFVVEPGMFKVMIGAASTDIRLEGKFNIK 791


>gi|423290405|ref|ZP_17269254.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
           CL02T12C04]
 gi|392665792|gb|EIY59315.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
           CL02T12C04]
          Length = 861

 Score =  285 bits (730), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 172/445 (38%), Positives = 240/445 (53%), Gaps = 44/445 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY D  L   +RA+DL+ R+TL EKV  M + +  +PRLG+  YEWW+EALHGV   G 
Sbjct: 24  LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
                           AT FP  I   ASFN+SL  ++    S EAR    + G +G   
Sbjct: 84  ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLK 127

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFW+PN+N+ RDPRWGR  ET GEDPY+ G+  +  VRGLQ  E        D+R
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DAR 179

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             K+ AC KH+A +    W   +R  FD+  +  +D+ ET++  F+  V +  V  VMC+
Sbjct: 180 YDKLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCA 236

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
           YNR  G P C   +LL Q +R +W + G +VSDC +I       +H+   D KE A A  
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPD-KEHASADA 295

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           ++AG DL+CG  Y +    AV+ G I E +ID SL+ L      LG  D  P +  +  +
Sbjct: 296 VRAGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWAEIPTS 354

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + + +H  LA   AR+ +VLL+N N  LPLNT ++K +A++GP+AN +    GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-HLK-VAVMGPNANDSVMQWGNYNGIP 412

Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
               + ++   A      I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437



 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 84/297 (28%), Positives = 131/297 (44%), Gaps = 53/297 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+    +AD  +   G+  S+E E          G DR D+ LP  Q +L+  +  A K 
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGK- 649

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
              +V ++     I         ++IL   YPG+ GG AI D ++G+YNPGGRLP+T+Y+
Sbjct: 650 --KVVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
                     +P     +  GRTY++     ++PFG+GLSYT F Y  A        KL 
Sbjct: 708 ------DVNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTTFTYGEA--------KLS 753

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           K+   +  N                            I V N+G+ DG EVV VY + PG
Sbjct: 754 KNTIAKGEN------------------------VVLTIPVSNVGQRDGEEVVQVYLRRPG 789

Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
                   +  ++RV I AG++  V   +   ++ +  D  +N++    G + +L G
Sbjct: 790 DKEGPRYTLRAFKRVHIPAGKTESVAIPLTG-ENFEWFDAESNTMRPLEGTYELLYG 845


>gi|170288668|ref|YP_001738906.1| glycoside hydrolase family 3 protein [Thermotoga sp. RQ2]
 gi|170176171|gb|ACB09223.1| glycoside hydrolase family 3 domain protein [Thermotoga sp. RQ2]
          Length = 778

 Score =  285 bits (729), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 244/817 (29%), Positives = 378/817 (46%), Gaps = 162/817 (19%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGD---------------------LAYGVPRL 51
           PY D   P   R +DL+ RMTL EKV Q+G                      L  G+ ++
Sbjct: 3   PYRDPSQPIEVRVRDLLSRMTLEEKVAQLGSVWGYELIDERGKFSREKAKELLKNGIGQV 62

Query: 52  GLP---LYEWWSEALHGVSFIGR------RTNSPPGTHFDSEVP----GATSFPTVILTT 98
             P         EA   V+ I R      R   P   H +        G T+FP  I   
Sbjct: 63  TRPGGSTNLEPQEAAELVNEIQRFLVEETRLGIPAMIHEECLTGYMGLGGTNFPQAIAMA 122

Query: 99  ASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVG 158
           ++++  L +K+   +  + R +    + GL   +P ++V RDPRWGR  ET GE PY+V 
Sbjct: 123 STWDPDLIEKMTTAIREDMRKIG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVA 177

Query: 159 RYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD----NWEGNDRFHFDSRV 214
           R  ++YV+GLQ   G +  +        + A  KH+A Y       NW   +       +
Sbjct: 178 RMGVSYVKGLQ---GEDIKKG-------VVATVKHFAGYSASEGGKNWAPTN-------I 220

Query: 215 TEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVS 274
            E++ +E F+ PFE  V E +V SVM SY+ ++G+P  A+ KLL   +R DW F G +VS
Sbjct: 221 PEREFKEVFLFPFEAAVKEANVLSVMNSYSEIDGVPCAANRKLLTDILRKDWGFEGIVVS 280

Query: 275 DCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-----CGDYYTNFTMGAVQQGKIAEA 329
           D  +++ + + H+   D K +A    L+AG+D++     C  Y  +     V++G I+EA
Sbjct: 281 DYFAVKVLEDYHRIARD-KSEAARLALEAGIDVELPKTECYQYLKDL----VEKGIISEA 335

Query: 330 DIDTSL-RFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGA 388
            ID ++ R L +  M LG F+    Y  + K  I    H ++A E AR+ I+LLKND G 
Sbjct: 336 LIDEAVARVLRLKFM-LGLFENP--YVEVEKAKI--ESHRDIALEIARKSIILLKND-GI 389

Query: 389 LPLNTGNIKTLALVGPHANATKAMIGNYE----------------GTPC----------- 421
           LPL+    K +AL+GP+A   + ++G+Y                 G P            
Sbjct: 390 LPLSKE--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKK 447

Query: 422 -------RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG-- 472
                     S +D F        YA GC ++  ++ S    AI+ AK +D  ++V G  
Sbjct: 448 SIEEHMKSIPSVLDAFKEEGIEFEYAKGC-EVTGEDRSGFEEAIEIAKKSDVAIVVVGDK 506

Query: 473 ----LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
               LD +   E +D  +L LPG Q EL+ +VA   K PV LV+++     +    +  K
Sbjct: 507 SGLTLDCTT-GESRDMANLKLPGVQEELVLEVAKTGK-PVVLVLITGRPYSLKNVVD--K 562

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPG 587
           + +IL V  PGE GGR+I D+I+GK NP G+LPI++   A  + + +   P    +++ G
Sbjct: 563 VNAILQVWLPGEAGGRSIVDIIYGKVNPSGKLPISFPRSAGQIPVFHYVKPSGGRSHWHG 622

Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
                   P ++PFG+GLSYT+F+Y  +   PK V                     PP  
Sbjct: 623 DYVDESTKP-LFPFGHGLSYTKFEYSNLRIEPKEV---------------------PPAG 660

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAA 705
            V+I            +++VEN G  DG EVV +Y      + T  +K++ G++RV + A
Sbjct: 661 EVVI------------KVDVENTGDRDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKA 708

Query: 706 GQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            +   V F ++    L   D     ++  G   ++VG
Sbjct: 709 KEKKTVVFRLH-MDVLAYYDRDMKLVVEPGEFRVMVG 744


>gi|160884133|ref|ZP_02065136.1| hypothetical protein BACOVA_02110 [Bacteroides ovatus ATCC 8483]
 gi|423291392|ref|ZP_17270240.1| hypothetical protein HMPREF1069_05283 [Bacteroides ovatus
           CL02T12C04]
 gi|156110475|gb|EDO12220.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
 gi|392663392|gb|EIY56942.1| hypothetical protein HMPREF1069_05283 [Bacteroides ovatus
           CL02T12C04]
          Length = 735

 Score =  285 bits (728), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 222/760 (29%), Positives = 357/760 (46%), Gaps = 97/760 (12%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLY-- 56
           Y DAK P  +R  DL+ RMTL EK+ Q+     G              VP  +G  +Y  
Sbjct: 30  YKDAKAPIEKRIDDLISRMTLEEKILQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFD 89

Query: 57  --EWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
                  ++   +    R   P    +D+     T +P  +    S+N  L ++     +
Sbjct: 90  INPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSA 149

Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
            EAR    +     TF SP I+V RDPRWGRV E  GEDPY  G +A   VRG       
Sbjct: 150 QEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRG------- 197

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
            Y  D  S   +++AC KHY  Y      G D  +  + ++ Q + +T++LP+EM V  G
Sbjct: 198 -YQGDDMSAENRMAACLKHYVGYGASE-AGRDYVY--TEISAQTLWDTYLLPYEMGVKAG 253

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
             +++M S+N ++G+P  A+P ++ + ++  W   G+IVSD  +++ +   ++ L  TK+
Sbjct: 254 -AATLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQGLAATKK 310

Query: 295 DAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
           DA      AGL++D   + Y  +    V++GK+  A +D S+R +  V  RLG F+    
Sbjct: 311 DAARYAFNAGLEMDMMSHAYDRYLKELVEEGKVTMAQVDESVRRVLRVKFRLGLFERPYT 370

Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
                K+    PQ + +AA+ A + +VLLKNDN  LPL   N K +A+VGP A     ++
Sbjct: 371 PVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLT--NKKKIAVVGPMAKNGWDLL 428

Query: 414 GNY--EGTPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
           G++   G         DG  A       + YA GC      + S    A+D A+ +D  +
Sbjct: 429 GSWCGHGKDTDVEMLYDGLTAEFGGDAELRYAMGCKP-QGNDRSGFAGALDVARWSDVVI 487

Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
           +  G  L+   E   R  + LP  Q EL+ ++ +A K P+ LV+ +   +++N  +  P 
Sbjct: 488 VCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PIILVLSNGRPLELN--RMEPL 544

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS--MPLRPVNNFP 586
             +IL +  PG  G R++A ++ G+ NP G+L +T+        PY++  +P+       
Sbjct: 545 CDAILEIWQPGINGARSMAGILSGRINPSGKLAMTF--------PYSTGQIPIYYNRRKS 596

Query: 587 GRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
           GR ++ F   +    +YPFG+GLSYT+FKY                          GT  
Sbjct: 597 GRGHQGFYKDITSDPLYPFGHGLSYTEFKY--------------------------GTVT 630

Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERV 701
           P    V   D      K + ++ V N G  DG+E V  +   P  + T  +K++  +E+ 
Sbjct: 631 PSATKVKRGD------KLSAEVTVTNTGARDGAETVHWFISDPYCSITRPVKELKHFEKQ 684

Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
           FI AG++    F ++  +    V+      L +G + ILV
Sbjct: 685 FIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724


>gi|336404627|ref|ZP_08585320.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
 gi|335941531|gb|EGN03384.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
          Length = 861

 Score =  285 bits (728), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 171/445 (38%), Positives = 237/445 (53%), Gaps = 44/445 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY D  L   +RA+DL+ R+TL EKV  M + +  +PRLG+  YEWW+EALHGV   G 
Sbjct: 24  LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
                           AT FP  I   ASFN+SL  ++    S EAR    + G +G   
Sbjct: 84  ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLK 127

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFW+PN+N+ RDPRWGR  ET GEDPY+ G+  +  VRGLQ  E   Y       
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDAGYD------ 181

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             K+ AC KH+A +    W   +R  FD+  +  +D+ ET++  F+  V +  V  VMC+
Sbjct: 182 --KLHACAKHFAVHSGPEW---NRHSFDAENIAPRDLWETYLPAFKDLVQKAHVKEVMCA 236

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
           YNR  G P C   +LL Q +R +W + G +VSDC +I       +H+   D KE A A  
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPD-KEHASAAA 295

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           ++ G DL+CG  Y +    AV+ G I E +ID SL+ L      LG  D  P +  +  +
Sbjct: 296 VRTGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWAEIPTS 354

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + + +H  LA   AR+ +VLL+N N  LPLNT N+K +A++GP+AN +    GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-NLK-IAVMGPNANDSVMQWGNYNGIP 412

Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
               + ++   A      I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437



 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 84/297 (28%), Positives = 131/297 (44%), Gaps = 53/297 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+    +AD  +   G+  S+E E          G DR D+ LP  Q +L+  +  A K 
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGK- 649

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
              +V ++     I         ++IL   YPG+ GG AI D ++G+YNPGGRLP+T+Y+
Sbjct: 650 --KVVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
                     +P     +  GRTY++     ++PFG+GLSYT F Y  A        KL 
Sbjct: 708 ------DVNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEA--------KLS 753

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           K+   +  N                            I V N+G+ DG EVV VY + PG
Sbjct: 754 KNTIAKGEN------------------------VVLTIPVSNVGQRDGEEVVQVYLRRPG 789

Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
                   +  ++RV I AG++  V   +   ++ +  D  +N++    G + +L G
Sbjct: 790 DKEGPRYTLRAFKRVHIPAGKTESVAIPLTG-ENFEWFDAESNTMRPLEGTYELLYG 845


>gi|380692929|ref|ZP_09857788.1| glycoside hydrolase family protein [Bacteroides faecis MAJ27]
          Length = 777

 Score =  285 bits (728), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 240/808 (29%), Positives = 374/808 (46%), Gaps = 157/808 (19%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSE--------- 61
           Y D   P  ER KDL+ +M + EK  QM  L YG  R+    LP  +W SE         
Sbjct: 32  YEDPSAPIEERVKDLLSQMNMDEKTCQMATL-YGSGRVLADALPTEKWKSEIWKDGIGNI 90

Query: 62  -----------------------ALHGVS--FIGRRTNSPPGTHFDSEVPG-----ATSF 91
                                  A+H +   F+       P    +  + G     AT F
Sbjct: 91  DEEHNGLGKFGSEYAFPYAKHVKAIHDIQRWFVEETRLGIPVDFTNEGIRGVCHEKATFF 150

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
           P      +++N+ L  +IG+  + EA A+      G T  +SP +++ +DPRWGR +E  
Sbjct: 151 PAQCGQGSTWNKELIARIGEVEAKEAVAL------GYTNIYSPILDIAQDPRWGRAVECY 204

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+VG+       G Q ++ ++ H        K+ A  KH+A Y +     +     
Sbjct: 205 GEDPYLVGQL------GKQMIQSLQKH--------KLVATPKHFAVYSIPVGGRDGGTRT 250

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
           D  V  ++M+  ++ PF +   E     VM SYN  +G P     + L Q +R +W F G
Sbjct: 251 DPHVAPREMRTLYLEPFRVAFQEAGALGVMSSYNDYDGEPITGSYRFLTQILRQEWGFKG 310

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
           Y+VSD D+++ I   HK + D  E+AV + + AGL++      TNF+           A+
Sbjct: 311 YVVSDSDAVEFISSKHK-VADNNEEAVVQSVNAGLNV-----RTNFSSPAGFIKPLRSAI 364

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK--NNICN-PQHIELAAEAARQG 378
            +GK+++A ID  +  +  V   LG FD    Y+  GK  + I +  +H  +A EAARQ 
Sbjct: 365 AKGKVSQATIDQRVSEILYVKFWLGLFDNP--YRGDGKLADKIVHCKEHQAVALEAARQS 422

Query: 379 IVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY---TSPMDGFYAYSK 435
           IVLLKN +  LPL    +K++A++GP+A+  K +I       CRY    +P+   Y   K
Sbjct: 423 IVLLKNQDNLLPLQK-TLKSVAVIGPNADEQKELI-------CRYGPSNAPIKTVYKGIK 474

Query: 436 ------VINYAPGCA--------------DIVCQNNSMIPAAIDAAKNADATVIVAGLDL 475
                  + Y  GC               DI  +   ++  AI+AAK+A+  ++V G   
Sbjct: 475 EALPGAKVVYKKGCEIVDPHFPESEVLPFDITPKEQQIMDEAIEAAKSAEVVIMVLGGSE 534

Query: 476 SVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
               E + R  L LPG Q EL+  V    K P  LV++   A  IN+AK    + +IL  
Sbjct: 535 VTVREERSRTSLDLPGRQEELLKAVCKLGK-PTILVMIDGRASSINYAKKY--VPAILHA 591

Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
            +PGE  G+A+A+ IFG  NPGG+L +T +  +  +IP+ + P +P ++    T     G
Sbjct: 592 WFPGEFCGQAVAETIFGDNNPGGKLAVT-FPKSVGQIPF-AFPFKPGSDSGCGTS--VTG 647

Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
             ++PFG+GLSYT F+Y         ++K+  +QQ       +G  K  C          
Sbjct: 648 -ALFPFGHGLSYTTFEYN--------NLKISPEQQG-----VLGEVKVSCT--------- 684

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
                     V+N GK  G EVV +Y +       T++K + G+ER+ +   +  KV FT
Sbjct: 685 ----------VKNTGKRPGDEVVQLYLRDEISSVTTYVKILRGFERITLQPNEEKKVTFT 734

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
           ++  + L I D      +  G   +++G
Sbjct: 735 LSP-QDLAIWDKNMKFQVEPGTFKVMIG 761


>gi|254445290|ref|ZP_05058766.1| Glycosyl hydrolase family 3 C terminal domain protein
           [Verrucomicrobiae bacterium DG1235]
 gi|198259598|gb|EDY83906.1| Glycosyl hydrolase family 3 C terminal domain protein
           [Verrucomicrobiae bacterium DG1235]
          Length = 730

 Score =  285 bits (728), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 232/749 (30%), Positives = 340/749 (45%), Gaps = 127/749 (16%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGV---- 66
           D+P+ D  LP  ER  DL+  MTL EKV  MG    G+PRL +  Y   SE  HGV    
Sbjct: 26  DYPFQDPDLPNEERIDDLITCMTLEEKVDLMG-FVPGIPRLDVK-YTRISEGYHGVAQGG 83

Query: 67  -SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--- 122
            S  G+R  +P            T FP      A+++ +L  ++    +TE R +Y    
Sbjct: 84  PSNWGKRNPTP-----------TTQFPQAYGLAATWDPALISRVSANQATEVRYLYQSPK 132

Query: 123 LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
              +GL   +PN ++ RDPRWGR  E  GEDP++ G  A  +  GL              
Sbjct: 133 YQRSGLVVMAPNADLARDPRWGRTEEVYGEDPFLTGTLAAAFASGLA---------GDHP 183

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
           R LK ++  KH+    L N   +DRF   S   E+  +E +  PFEM + +G   S+M +
Sbjct: 184 RYLKATSLLKHF----LANSNEDDRFFSSSDFDERLWREYYAKPFEMAIRDGGARSMMAA 239

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           YN +NG P    P +L   + G+W   G I +D   +  +V  HK   D    A A  +K
Sbjct: 240 YNAINGTPAHVHP-MLRDIVMGEWGLDGTICTDGGGLAHLVNQHKTYPDLPT-ATAACIK 297

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGK 359
           AG++L   D +T   + AV+Q  + EA+ID  +R    + + LG  D  P+   Y N+G 
Sbjct: 298 AGINLFL-DNHTQAALDAVEQSLVTEAEIDDVIRGRIRLFLDLGLLD-PPELVPYSNIGH 355

Query: 360 NNICNPQHI----ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGN 415
                P  +        E  R+ IVLLKN+N  LPL+   I ++A+VGP AN T  ++  
Sbjct: 356 EPGLEPWELPETHAFVREVTRKSIVLLKNENNILPLDPSKINSVAIVGPLANTT--LLDW 413

Query: 416 YEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNN---SMIPAAIDAAKNADATVIVAG 472
           Y GTP     P DG   Y+   N  P  +     +N    M   A++ A + D  ++V G
Sbjct: 414 YSGTPPYAIPPRDGIEGYA---NSGPFPSPAKFGSNWVADMSDTALEVAASRDVAIVVVG 470

Query: 473 LD---------LSVEAEGK---DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
                      ++  +EGK   DR +++L   Q E I KV   A  P T+V++ +     
Sbjct: 471 NHPESNAGWGVVTSPSEGKEAVDRQEIILQPDQEEFIQKV--YAANPNTIVVLVS----- 523

Query: 521 NF-------AKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP 573
           NF       A+N P   +I+ + +  +E G A+ADV+FG YNPGG+   TW        P
Sbjct: 524 NFPYAMPWAAENAP---AIVHITHASQEQGNALADVLFGDYNPGGKTVQTW--------P 572

Query: 574 YTSMPLRPVNNFP---GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQ 630
            +   L P+ ++    GRTY +      YPFGYGLSYT F+                   
Sbjct: 573 KSLDQLPPMMDYDIRRGRTYMYSQHEPQYPFGYGLSYTTFE------------------- 613

Query: 631 CRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAG 689
                          + +        D   T ++ V N G+ DG EVV +Y + P     
Sbjct: 614 --------------LSKLKAPKKLKADATATIKVRVANTGERDGDEVVQLYVRYPNSKVE 659

Query: 690 THIKQVIGYERVFIAAGQSAKVGFTMNAC 718
              KQ+ G++RV + AG+S      + A 
Sbjct: 660 RPSKQLKGFQRVTVPAGKSVTGEIPLKAA 688


>gi|423215029|ref|ZP_17201557.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692292|gb|EIY85530.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 861

 Score =  285 bits (728), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 172/445 (38%), Positives = 239/445 (53%), Gaps = 44/445 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY D  L   +RA+DL+ R+TL EKV  M + +  +PRLG+  YEWW+EALHGV   G 
Sbjct: 24  LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
                           AT FP  I   ASFN+SL  ++    S EAR    + G +G   
Sbjct: 84  ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLK 127

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFW+PN+N+ RDPRWGR  ET GEDPY+ G+  +  VRGLQ  E        D+R
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DAR 179

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             K+ AC KH+A +    W   +R  FD+  +  +D+ ET++  F+  V +  V  VMC+
Sbjct: 180 YDKLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCA 236

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
           YNR  G P C   +LL Q +R +W + G +VSDC +I       +H    D KE A A  
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASADA 295

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           ++AG DL+CG  Y +    AV+ G I E +ID SL+ L      LG  D  P +  +  +
Sbjct: 296 VRAGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWSEIPTS 354

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + + +H  LA   AR+ +VLL+N N  LPLNT ++K +A++GP+AN +    GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-HLK-VAVMGPNANDSVMQWGNYNGIP 412

Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
               + ++   A      I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437



 Score =  103 bits (258), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 84/297 (28%), Positives = 130/297 (43%), Gaps = 53/297 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+    +AD  +   G+  S+E E          G DR D+ LP  Q +L+  +  A K 
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGK- 649

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
              +V ++     I         ++IL   YPG+ GG AI D ++G+YNPGGRLP+T+Y+
Sbjct: 650 --KVVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
                     +P     +  GRTY++     ++PFG+GLSYT F Y  A        KL 
Sbjct: 708 ------DVNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEA--------KLS 753

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           K+   +  N                            I V N+G+ DG EVV VY + PG
Sbjct: 754 KNTIAKGEN------------------------VVLTIPVSNVGQRDGEEVVQVYLRRPG 789

Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
                   +  ++RV I AG++  V   +    + +  D  +N++    G + +L G
Sbjct: 790 DKEGPRYTLRAFKRVHIPAGKTESVAIPLTGV-NFEWFDVESNTMRPLEGTYELLYG 845


>gi|298481648|ref|ZP_06999839.1| beta-glucosidase [Bacteroides sp. D22]
 gi|298272189|gb|EFI13759.1| beta-glucosidase [Bacteroides sp. D22]
          Length = 861

 Score =  284 bits (727), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 172/445 (38%), Positives = 239/445 (53%), Gaps = 44/445 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY D  L   +RA+DL+ R+TL EKV  M + +  +PRLG+  YEWW+EALHGV   G 
Sbjct: 24  LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
                           AT FP  I   ASFN+SL  ++    S EAR    + G +G   
Sbjct: 84  ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLK 127

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFW+PN+N+ RDPRWGR  ET GEDPY+ G+  +  VRGLQ  E        D+R
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DAR 179

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             K+ AC KH+A +    W   +R  FD+  +  +D+ ET++  F+  V +  V  VMC+
Sbjct: 180 YDKLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCA 236

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
           YNR  G P C   +LL Q +R +W + G +VSDC +I       +H    D KE A A  
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASADA 295

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           ++AG DL+CG  Y +    AV+ G I E +ID SL+ L      LG  D  P +  +  +
Sbjct: 296 VRAGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWSEIPTS 354

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + + +H  LA   AR+ +VLL+N N  LPLNT ++K +A++GP+AN +    GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-HLK-VAVMGPNANDSVMQWGNYNGIP 412

Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
               + ++   A      I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437



 Score =  105 bits (263), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 83/297 (27%), Positives = 128/297 (43%), Gaps = 53/297 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+    +AD  +   G+  S+E E          G DR D+ LP  Q    N +    K 
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---NLLKALKKA 647

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
              +V ++     I         ++IL   YPG+ GG AI D ++G+YNPGGRLP+T+Y+
Sbjct: 648 GKKVVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
                     +P     +  GRTY++     ++PFG+GLSYT F Y  A        KL 
Sbjct: 708 ------DVNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEA--------KLS 753

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           K+   +  N                            I V N+G+ DG EVV VY + PG
Sbjct: 754 KNTIAKGEN------------------------VVLTIPVSNVGQCDGEEVVQVYLRRPG 789

Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
                   +  ++RV I AG++  V   +   ++ +  D  +N++    G + +L G
Sbjct: 790 DKEGPRYTLRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMRPLEGTYELLYG 845


>gi|299147288|ref|ZP_07040353.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298514566|gb|EFI38450.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 861

 Score =  284 bits (727), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 172/445 (38%), Positives = 239/445 (53%), Gaps = 44/445 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY D  L   +RA+DL+ R+TL EKV  M + +  +PRLG+  YEWW+EALHGV   G 
Sbjct: 24  LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
                           AT FP  I   ASFN+SL  ++    S EAR    + G +G   
Sbjct: 84  ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLK 127

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFW+PN+N+ RDPRWGR  ET GEDPY+ G+  +  VRGLQ  E        D+R
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DAR 179

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             K+ AC KH+A +    W   +R  FD+  +  +D+ ET++  F+  V +  V  VMC+
Sbjct: 180 YDKLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCA 236

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
           YNR  G P C   +LL Q +R +W + G +VSDC +I       +H    D KE A A  
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASADA 295

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           ++AG DL+CG  Y +    AV+ G I E +ID SL+ L      LG  D  P +  +  +
Sbjct: 296 VRAGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWSEIPTS 354

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + + +H  LA   AR+ +VLL+N N  LPLNT ++K +A++GP+AN +    GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-HLK-VAVMGPNANDSVMQWGNYNGIP 412

Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
               + ++   A      I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437



 Score =  105 bits (263), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 83/297 (27%), Positives = 127/297 (42%), Gaps = 53/297 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+    +AD  +   G+  S+E E          G DR D+ LP  Q    N +    K 
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---NLLKALKKA 647

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
              +V ++     I         ++IL   YPG+ GG AI D ++G+YNPGGRLP+T+Y+
Sbjct: 648 GKKVVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
                     +P     +  GRTY++     ++PFG+GLSYT F Y  A        KL 
Sbjct: 708 N------VNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTTFTYGEA--------KLS 753

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           K+   +  N                            I V N+G+ DG EVV VY + PG
Sbjct: 754 KNTIAKGEN------------------------VVLTIPVSNVGQRDGEEVVQVYLRRPG 789

Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
                   +  ++RV I AG++  V   +    + +  D  +N++    G + +L G
Sbjct: 790 DKEGPRYTLRAFKRVHIPAGKTESVAIPLTGV-NFEWFDAESNTMRPLEGTYELLYG 845


>gi|217967241|ref|YP_002352747.1| glycoside hydrolase family 3 [Dictyoglomus turgidum DSM 6724]
 gi|217336340|gb|ACK42133.1| glycoside hydrolase family 3 domain protein [Dictyoglomus turgidum
           DSM 6724]
          Length = 762

 Score =  284 bits (727), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 214/690 (31%), Positives = 347/690 (50%), Gaps = 100/690 (14%)

Query: 87  GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
           GAT FP  I   ++F   L +++   +    RA  N+ + GL   SP +++ RDPRWGR 
Sbjct: 106 GATVFPQAIGMASTFEPELIRRVSDVIRQHMRAA-NV-HQGL---SPVLDIPRDPRWGRT 160

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
            ET GEDPY+V R A  YV+GLQ   G ++          I A  KH+ AY +       
Sbjct: 161 EETFGEDPYLVSRMAAEYVKGLQ---GEDWREG-------IIATVKHFTAYGISE---GA 207

Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
           R    ++V E++++E F+ PFE+ + EG   S+M +Y+ ++G+P  +   LL + +R +W
Sbjct: 208 RNLGPAKVGERELREVFLFPFEVAIKEGQAGSLMNAYHEIDGVPCASSKFLLTKILRWEW 267

Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG--DYYTNFTMGAVQQG 324
            F GY+VSD  +I+ +   H+   D KE AV   L+AG+D++    D Y    + AV++G
Sbjct: 268 GFKGYVVSDYIAIRMLENFHRVAKDAKEAAVL-ALEAGIDIELPSVDCYGEPLIQAVKEG 326

Query: 325 KIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICN-PQHIELAAEAARQGIVLLK 383
            I+E  I+ S+  +      LG FDG  +       +I + P+  EL+ E AR+ IVLLK
Sbjct: 327 LISEEVINASVERVLRAKFMLGLFDGDLEKDPKKVYDIFDKPEFRELSREVARRSIVLLK 386

Query: 384 NDNGALPLNTGNIKTLALVGPHANATKAMIGNY-------------------EGTPCRYT 424
           ND G LPL+  NI+T+A++GP+A+  + + G+Y                   E    R  
Sbjct: 387 ND-GILPLSK-NIRTVAVIGPNADNPRNLHGDYSYTAHIPSVSETLEGVKIPEECAVRTV 444

Query: 425 SPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG-----LDL 475
           S ++G      A ++V+ YA GC +I+  +      AI+ AK AD  + V G        
Sbjct: 445 SILEGIKNKVSAETQVL-YAKGC-EILSDSKEGFDEAIEIAKRADVIIAVMGEESGLFHR 502

Query: 476 SVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
            +  EG DR  L L G Q +L+ ++    K P+ LV+++     + +   N  + +IL  
Sbjct: 503 GISGEGNDRTTLELFGIQRDLLRELHKLGK-PIVLVLVNGRPQALKWEHEN--LNAILEA 559

Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN-YVKIPYTSMPLRPVNNFPGRTYKFFD 594
            YPGEEGG A+ADVIFG YNP G+LPI++      V + Y   P     ++   + K   
Sbjct: 560 WYPGEEGGDAVADVIFGDYNPSGKLPISFPAVTGQVPVYYNRKP-SAFTDYVEESAK--- 615

Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
              +YPFG+GLSYT F+Y         ++K+  ++                    ++ ++
Sbjct: 616 --PLYPFGHGLSYTTFEYS--------NLKIHPEK--------------------VNALE 645

Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH--IKQVIGYERVFIAAGQSAKVG 712
             +  FT    ++N G  +G EVV +Y     +A     +K++ G++++ +  G+S +V 
Sbjct: 646 KVEISFT----IKNTGVREGEEVVQLYVHDQ-VASLERPVKELKGFKKIHLKPGESKRVT 700

Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVG 742
           F +   + L   D     ++  G   I++G
Sbjct: 701 FILYP-EQLAFYDEFMRFVVEKGIFEIMIG 729


>gi|319643197|ref|ZP_07997825.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
 gi|345520511|ref|ZP_08799899.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|254835034|gb|EET15343.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|317385101|gb|EFV66052.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
          Length = 788

 Score =  284 bits (727), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 236/811 (29%), Positives = 371/811 (45%), Gaps = 163/811 (20%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
           Y + K P  +R +DL+ +MTL EK  QM  L YG  R+    LP   W +E    G+  I
Sbjct: 43  YENPKAPLEDRVQDLLSQMTLEEKTCQMATL-YGSGRVLKDALPQDNWKTEVWKDGIGNI 101

Query: 70  GRRTNS----------PPGTHFDSE--------------VP--------------GATSF 91
               N           P   H D++              +P               AT F
Sbjct: 102 DEEHNGLGAFKSEYSFPYAKHVDAKHTIQRWFVEKTRLGIPVDFTNEGIRGLCHDRATYF 161

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
           P      A++N+ L  +IG+  + EA A+      G T  +SP +++ +DPRWGR +ET 
Sbjct: 162 PAQCGQGATWNKKLIARIGEVEAKEAVAL------GYTNIYSPILDIAQDPRWGRCVETY 215

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+VG      +  LQ                 + A  KH+A Y +     + +   
Sbjct: 216 GEDPYLVGELGKQMITSLQK--------------YNLVATPKHFAVYSIPIGGRDGKTRT 261

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
           D  V  ++M+  +I PF M   E     VM SYN  +G P       L + +R +W F G
Sbjct: 262 DPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKG 321

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
           Y+VSD ++++ I   HK + DT ED +A+ + AGL++      T+FT           AV
Sbjct: 322 YVVSDSEAVEFISNKHK-VADTYEDGIAQAVNAGLNIR-----THFTPPADFILPLRKAV 375

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN---ICNPQHIELAAEAARQG 378
             GKI++  +D  +  +  +   LG FD    Y+  GK     + + +H  ++ EAARQ 
Sbjct: 376 DDGKISQETLDKRVAEILRIKFWLGLFDNP--YRGNGKQAEQIVHSKEHQAVSLEAARQS 433

Query: 379 IVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY---TSPMDGFYA--- 432
           +VLLKN+   LPL+  +I+++A++GP+A+    +I       CRY    +P+   Y    
Sbjct: 434 LVLLKNETHLLPLSK-SIRSIAVIGPNADEQTQLI-------CRYGPANAPIKTVYQGIK 485

Query: 433 ----YSKVINYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGL 473
               +++VI Y  GC DI+                +   ++   I AAK A+  V+V G 
Sbjct: 486 ELLPHAEVI-YKKGC-DIIDPHFPESEILDFPKTAEEVRLMQEVIRAAKQAEVVVMVLGG 543

Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
           +     E + R  L LPG Q EL+  V    K PV LV++   A  IN+A  +  + +IL
Sbjct: 544 NELTVREDRSRTSLNLPGRQEELLKAVCATGK-PVILVMLDGRASSINYAAAH--VPAIL 600

Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
              +PGE  G+A+A+ +FG YNPGGRL +T +  +  +IP+ + P +P ++    T  + 
Sbjct: 601 HAWFPGEFCGQAVAEALFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDESSSTSVY- 657

Query: 594 DGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
               +YPFG+GLSYT F Y  +  SP    ++ D    C+                    
Sbjct: 658 --GALYPFGHGLSYTTFTYSDLHISPSHQGVQGDIHVSCK-------------------- 695

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKV 711
                        ++N GK+ G EVV +Y +       T+ K + G+ER+ + AG+   V
Sbjct: 696 -------------IKNTGKIKGDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTV 742

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            F +   + L + D   N  +  G+  +++G
Sbjct: 743 HFRLRP-QDLGLWDKNMNFRVEPGSFKVMLG 772


>gi|299148437|ref|ZP_07041499.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298513198|gb|EFI37085.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 863

 Score =  284 bits (726), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 171/449 (38%), Positives = 240/449 (53%), Gaps = 46/449 (10%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S +PY D KL   +RA DL++R+TL EKV  M + +  +PRLG+  YEWW+EALHGV+  
Sbjct: 24  SKYPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARA 83

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---- 125
           G                 AT FP  I   ASFN+ L  ++   VS EARA     N    
Sbjct: 84  GL----------------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQ 127

Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
                GLT W+PN+N+ RDPRWGR  ET GEDPY+ GR  +  VRGLQ  E  EY     
Sbjct: 128 YKRYQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYD---- 183

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVM 240
               K+ AC KH+A +    W   +R  F++  +  +D+ ET++  F+  V +  V  VM
Sbjct: 184 ----KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVM 236

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV--- 297
           C+YNR  G P C   +LL Q +R DW F G +V+DC +I    +  K  ++T  DA    
Sbjct: 237 CAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKK--HETHPDAAHAS 294

Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL 357
           A  +  G DL+CG  + + T  AV++G I+E  I+TS++ L      LG  + +  + N+
Sbjct: 295 ADAVLNGTDLECGGNFKSIT-DAVKKGLISEEKINTSVKRLLKARFELGEMNPTHPWSNI 353

Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
             + I  P+H ELA + A + +VLL+N N  LPLN      +A++GP+AN +    GNY 
Sbjct: 354 PYSVINCPKHKELALKMAHESLVLLQNKNNILPLNRQ--MKVAVIGPNANDSVMQWGNYN 411

Query: 418 GTPCRYTSPMDGFYAY--SKVINYAPGCA 444
           G P    + ++G  A      I Y P C 
Sbjct: 412 GFPSHTVTLLEGIRAKLPDAQIIYEPVCG 440



 Score =  125 bits (314), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 97/296 (32%), Positives = 135/296 (45%), Gaps = 53/296 (17%)

Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
           ++  KNAD  +   G+   +E E          G DR ++ LP  Q E++  +    K  
Sbjct: 594 LNKLKNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKKT 653

Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
           V  V  S  A+ I          +IL   YPG+ GG A+ADV+FG YNP GRLPIT+Y++
Sbjct: 654 V-FVNFSGSAMAI--VPETQSCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKS 710

Query: 568 NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
                    +P     +  GRTY+F     +YPFGYGLSYT+F Y  A+  +S   KL K
Sbjct: 711 ------IQQLPDYEDYSMKGRTYRFMTETPLYPFGYGLSYTRFSYGKATLNQS---KLAK 761

Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI 687
            +                             K    I V N+G+ DG EVV VY   P  
Sbjct: 762 GE-----------------------------KAILTIPVSNVGQRDGEEVVQVYICRPDD 792

Query: 688 AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
            G   K + G++RV IA G++  V   +    S +  D A N++   SG + IL G
Sbjct: 793 KGGPQKTLRGFQRVNIAKGKTQNVNIEL-PYDSFEWFDTATNTIRPLSGTYKILYG 847


>gi|336415490|ref|ZP_08595829.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
           3_8_47FAA]
 gi|335940369|gb|EGN02236.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
           3_8_47FAA]
          Length = 863

 Score =  284 bits (726), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 171/449 (38%), Positives = 240/449 (53%), Gaps = 46/449 (10%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S +PY D KL   +RA DL++R+TL EKV  M + +  +PRLG+  YEWW+EALHGV+  
Sbjct: 24  SKYPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARA 83

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---- 125
           G                 AT FP  I   ASFN+ L  ++   VS EARA     N    
Sbjct: 84  GL----------------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQ 127

Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
                GLT W+PN+N+ RDPRWGR  ET GEDPY+ GR  +  VRGLQ  E  EY     
Sbjct: 128 YKRYQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYD---- 183

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVM 240
               K+ AC KH+A +    W   +R  F++  +  +D+ ET++  F+  V +  V  VM
Sbjct: 184 ----KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVM 236

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV--- 297
           C+YNR  G P C   +LL Q +R DW F G +V+DC +I    +  K  ++T  DA    
Sbjct: 237 CAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKK--HETHPDAAHAS 294

Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL 357
           A  +  G DL+CG  + + T  AV++G I+E  I+TS++ L      LG  + +  + N+
Sbjct: 295 ADAVLNGTDLECGGNFKSIT-DAVKKGLISEEKINTSVKRLLKARFELGEMNPTHPWSNI 353

Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
             + I  P+H ELA + A + +VLL+N N  LPLN      +A++GP+AN +    GNY 
Sbjct: 354 PYSVINCPKHKELALKMAHESLVLLQNKNNILPLNRQ--MKVAVIGPNANDSVMQWGNYN 411

Query: 418 GTPCRYTSPMDGFYAY--SKVINYAPGCA 444
           G P    + ++G  A      I Y P C 
Sbjct: 412 GFPSHTVTLLEGIRAKLPDAQIIYEPVCG 440



 Score =  125 bits (315), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 97/296 (32%), Positives = 135/296 (45%), Gaps = 53/296 (17%)

Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
           ++  KNAD  +   G+   +E E          G DR ++ LP  Q E++  +    K  
Sbjct: 594 LNKLKNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKKT 653

Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
           V  V  S  A+ I          +IL   YPG+ GG A+ADV+FG YNP GRLPIT+Y++
Sbjct: 654 V-FVNFSGSAMAI--VPETQSCDAILQAWYPGQAGGTAVADVLFGNYNPAGRLPITFYKS 710

Query: 568 NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
                    +P     +  GRTY+F     +YPFGYGLSYT+F Y  A+  +S   KL K
Sbjct: 711 ------IQQLPDYEDYSMKGRTYRFMTETPLYPFGYGLSYTRFSYGKATLNQS---KLAK 761

Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI 687
            +                             K    I V N+G+ DG EVV VY   P  
Sbjct: 762 GE-----------------------------KAILTIPVSNVGQRDGEEVVQVYICRPDD 792

Query: 688 AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
            G   K + G++RV IA G++  V   +    S +  D A N++   SG + IL G
Sbjct: 793 KGGPQKTLRGFQRVNIAKGKTQNVNIEL-PYDSFEWFDTATNTIRPLSGTYKILYG 847


>gi|15642851|ref|NP_227892.1| xylosidase [Thermotoga maritima MSB8]
 gi|418046013|ref|ZP_12684107.1| Beta-glucosidase [Thermotoga maritima MSB8]
 gi|4980564|gb|AAD35170.1|AE001694_6 xylosidase [Thermotoga maritima MSB8]
 gi|351675566|gb|EHA58726.1| Beta-glucosidase [Thermotoga maritima MSB8]
          Length = 778

 Score =  284 bits (726), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 237/789 (30%), Positives = 366/789 (46%), Gaps = 159/789 (20%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGD---------------------LAYGVPRLG 52
           Y D   P   R +DL+ RMTL EKV Q+G                      L  G+ ++ 
Sbjct: 4   YRDPSQPIEVRVRDLLSRMTLEEKVAQLGSVWGYELIDERGKFSREKAKELLKNGIGQIT 63

Query: 53  LP---LYEWWSEALHGVSFIGR------RTNSPPGTHFDSEVP----GATSFPTVILTTA 99
            P         EA   V+ I R      R   P   H +        G T+FP  I   +
Sbjct: 64  RPGGSTNLEPQEAAELVNEIQRFLVEETRLGIPAMIHEECLTGYMGLGGTNFPQAIAMAS 123

Query: 100 SFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR 159
           +++  L +K+   V  + R +    + GL   +P ++V RDPRWGR  ET GE PY+V R
Sbjct: 124 TWDPDLIEKMTTAVREDMRKIG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVAR 178

Query: 160 YAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD----NWEGNDRFHFDSRVT 215
             ++YV+GLQ   G +  +        + A  KH+A Y       NW   +       + 
Sbjct: 179 MGVSYVKGLQ---GEDIKKG-------VVATVKHFAGYSASEGGKNWAPTN-------IP 221

Query: 216 EQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSD 275
           E++ +E F+ PFE  V E +V SVM SY+ ++G+P  A+ KLL   +R DW F G +VSD
Sbjct: 222 EREFKEVFLFPFEAAVKEANVLSVMNSYSEIDGVPCAANRKLLTDILRKDWGFEGIVVSD 281

Query: 276 CDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-----CGDYYTNFTMGAVQQGKIAEAD 330
             +++ + + H+   D K +A    L+AG+D++     C  Y  +     V++G I+EA 
Sbjct: 282 YFAVKVLEDYHRIARD-KSEAARLALEAGIDVELPKTECYQYLKDL----VEKGIISEAL 336

Query: 331 IDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALP 390
           ID ++  +  +   LG F+    Y  + K  I    H ++A E AR+ I+LLKND G LP
Sbjct: 337 IDEAVTRVLRLKFMLGLFENP--YVEVEKAKI--ESHRDIALEIARKSIILLKND-GILP 391

Query: 391 LNTGNIKTLALVGPHANATKAMIGNYE----------------GTPC------------- 421
           L     K +AL+GP+A   + ++G+Y                 G P              
Sbjct: 392 LQKN--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSI 449

Query: 422 -----RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG---- 472
                   S +D F        YA GC ++  ++ S    AI+ AK +D  ++V G    
Sbjct: 450 EEHMKSIPSVLDAFKEEGIEFEYAKGC-EVTGEDRSGFEEAIEIAKKSDVAIVVVGDKSG 508

Query: 473 --LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK 530
             LD +   E +D  +L LPG Q EL+ +VA   K PV LV+++     +    +  K+ 
Sbjct: 509 LTLDCTT-GESRDMANLKLPGVQEELVLEVAKTGK-PVVLVLITGRPYSLKNVVD--KVN 564

Query: 531 SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRT 589
           +IL V  PGE GGRAI D+I+GK NP G+LPI++   A  + + +   P    +++ G  
Sbjct: 565 AILQVWLPGEAGGRAIVDIIYGKVNPSGKLPISFPRSAGQIPVFHYVKPSGGRSHWHGDY 624

Query: 590 YKFFDGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
                 P ++PFG+GLSYT+F+Y  +   PK V                     PP   V
Sbjct: 625 VDESTKP-LFPFGHGLSYTKFEYSNLRIEPKEV---------------------PPAGEV 662

Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQ 707
           +I            +++VEN+G  DG EVV +Y      + T  +K++ G++RV + A +
Sbjct: 663 VI------------KVDVENIGDRDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKE 710

Query: 708 SAKVGFTMN 716
              V F ++
Sbjct: 711 KKTVVFRLH 719


>gi|333381842|ref|ZP_08473521.1| hypothetical protein HMPREF9455_01687 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332829771|gb|EGK02417.1| hypothetical protein HMPREF9455_01687 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 861

 Score =  283 bits (724), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 176/449 (39%), Positives = 242/449 (53%), Gaps = 43/449 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY DA L   ERA+DL+ R+TL EKV  MGD +  V RLG+  + WWSEALHGV+  G 
Sbjct: 21  MPYKDANLTPEERAQDLLSRLTLKEKVGLMGDNSIEVTRLGVKKFAWWSEALHGVANQG- 79

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-------- 123
                          G T FP  I   ASFN+ L   +   +S EARA ++         
Sbjct: 80  ---------------GVTVFPEPIGMAASFNDELLYHVFDAISDEARARFHFREKKGDER 124

Query: 124 -GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
             + GL+ W+PN+N+ RDPRWGR  ET GEDPY+  R  I+ V GLQ  +  +Y      
Sbjct: 125 RQDNGLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGISVVNGLQGPKDAKYK----- 179

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
              K+ AC KHYA +    W  N      + +  + + ET++  F++ V + DVS VMC+
Sbjct: 180 ---KLLACAKHYAVHSGPEW--NRHVLNLNNLDNRHLWETYMPAFQVLVQKADVSQVMCA 234

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           Y+R +  P C +  LL + +R +W F   +VSDC +I     SHK  +D    AV  VL 
Sbjct: 235 YHRQDDDPCCGNNHLLKRILRDEWGFKRMVVSDCGAIADFYTSHKVSSDALHSAVKGVL- 293

Query: 303 AGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGK 359
           AG D++CG  YT   +  AV +G I EADID S+  L     RLG FD +    + N+  
Sbjct: 294 AGTDVECGFGYTYHELVDAVSRGLIYEADIDKSVLRLLTERFRLGDFDDNSIVPWANIPD 353

Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
             I   +H  LA E ARQ + LL+N N  LPL++   K +A++GP+A+  K M GNY G 
Sbjct: 354 TIINCKKHQALALEMARQSMTLLQNKNNILPLSSK--KKIAVIGPNADDAKLMWGNYNGI 411

Query: 420 PCRYTSPMDGFYAYS-KVINYAPGCADIV 447
           P +  + ++G  + + K I Y  GC DIV
Sbjct: 412 PVKTVTILEGIKSIAGKDIFYEKGC-DIV 439



 Score =  102 bits (253), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 77/268 (28%), Positives = 118/268 (44%), Gaps = 52/268 (19%)

Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
           +D  K+ D  V   G+   +E E          G DR D+ LP  Q   I  +  A K  
Sbjct: 593 VDRLKDIDVVVFAGGISGELEGEEMPIEMPGFKGGDRTDIELPASQRNCIKALKKAGK-- 650

Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
             +++++     I     +   ++IL   Y G+ GG+AIA+V+FGKYNP G+LPIT+Y+ 
Sbjct: 651 -RVIMVNCSGSAIGLMPESESCEAILQAWYGGQSGGQAIAEVLFGKYNPSGKLPITFYKN 709

Query: 568 NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
                    +P     +  GRTY++ +   ++PFGYGLSYT F    A++  S+  K  +
Sbjct: 710 ------IDQLPDFEEYDMKGRTYRYLEDKPLFPFGYGLSYTTFDIGRATA-SSISAKAGE 762

Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI 687
                                          K    I V+N GK  GSE V VY K    
Sbjct: 763 -------------------------------KIKLVIPVKNTGKRTGSETVQVYVKKVD- 790

Query: 688 AGTHIKQVIGYERVFIAAGQSAKVGFTM 715
           +G  IK +  ++R+ +    S  + F +
Sbjct: 791 SGGPIKTLRSFKRIELPPNVSQDLTFEL 818


>gi|380512525|ref|ZP_09855932.1| beta-glucosidase [Xanthomonas sacchari NCPPB 4393]
          Length = 885

 Score =  283 bits (724), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 171/441 (38%), Positives = 233/441 (52%), Gaps = 46/441 (10%)

Query: 28  LVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPG 87
           LV +MT  EK+ Q  + A  +PRLG+P YEWWSE LHG++  G                 
Sbjct: 40  LVAKMTRAEKIAQAMNAAPAIPRLGVPAYEWWSEGLHGIARNGE---------------- 83

Query: 88  ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSPNINVV 138
           AT FP  I   A++N  L   +G   STEARA +NL           AGLT WSPNIN+ 
Sbjct: 84  ATVFPQAIGLAATWNPELLHDVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIF 143

Query: 139 RDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYD 198
           RDPRWGR +ET GEDPY+ GR A+ ++ GLQ         D  + P  I A  KH A + 
Sbjct: 144 RDPRWGRGMETYGEDPYLTGRLAVGFIHGLQG--------DDPAHPRTI-ATPKHLAVH- 193

Query: 199 LDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLL 258
             +     R  FD  V+  D + T+   F   + +G   SVMC+YN ++G P CA   L+
Sbjct: 194 --SGPEPGRHGFDVDVSPHDFEATYSPAFRAAIVDGQAGSVMCAYNSLHGTPACAADWLI 251

Query: 259 NQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTM 318
           +  +RGDW F G++VSDCD+I  + + H +  D    + A  LKAG DL+CG  Y    +
Sbjct: 252 DGRVRGDWGFKGFVVSDCDAIDDMTQFHYYRPDNAGSSAA-ALKAGHDLNCGTAYRELGI 310

Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICNPQHIELAAEAAR 376
            A  +G+  EA +D SL  L+    RLG      +  Y  LG  +I +  H  LA +AA+
Sbjct: 311 -AFDRGEADEALLDRSLVRLFAARYRLGELQPRRNDPYARLGARDIDSAAHRALALQAAQ 369

Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--S 434
           Q +VLLKN N  LPL  G    LA++GP+A+A  A+  NY+GT  +  +P+ G      +
Sbjct: 370 QSLVLLKNANATLPLRPG--LRLAVLGPNADALAALEANYQGTSVQPVTPLQGLRTRFGA 427

Query: 435 KVINYAPGCADIVCQNNSMIP 455
             + YA G A +      MIP
Sbjct: 428 AQVAYAQG-APLAAGVPGMIP 447



 Score =  141 bits (356), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 92/287 (32%), Positives = 139/287 (48%), Gaps = 45/287 (15%)

Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           V G +L ++    +G DR DL LP  Q  L+ + A A+  P+ +V+MS  AV +N+A+ +
Sbjct: 627 VEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAEQH 685

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
                  W  YPG+ GG AIA  + G  NPGGRLP+T+Y +     PY S  ++      
Sbjct: 686 ADAIIAAW--YPGQSGGTAIAQALAGDINPGGRLPVTFYRSTKDLPPYVSYDMK------ 737

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++F G  ++PFGYGLSYTQF Y    +P+     L   Q                 
Sbjct: 738 GRTYRYFKGEPLFPFGYGLSYTQFAY---DAPQLSTTTLQAGQ----------------- 777

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
                              V N G   G EVV VY + P  A + ++ ++G++RV +  G
Sbjct: 778 ------------PLQVSTTVRNTGARAGDEVVQVYLQYPQRAQSPLRSLVGFQRVHLQPG 825

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQ 753
           ++  + F ++A + L  VD +    + +G + + VG G  G   P Q
Sbjct: 826 EARTLSFALDA-RQLSDVDRSGQRAVEAGDYRLFVGGGQPGTGAPGQ 871


>gi|383110854|ref|ZP_09931672.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
 gi|313694427|gb|EFS31262.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
          Length = 861

 Score =  283 bits (724), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 169/445 (37%), Positives = 239/445 (53%), Gaps = 44/445 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY D  L   +RA+DL+ R+TL EKV  M + +  +PRLG+  Y+WW+EALHGV   G 
Sbjct: 24  LPYQDTSLTAEQRAEDLLPRLTLEEKVALMQNASPAIPRLGIKEYDWWNEALHGVGRAGL 83

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN------ 125
                           AT FP  I   ASFN+SL  ++   VS EAR    + +      
Sbjct: 84  ----------------ATVFPQSIGMGASFNDSLLYEVFDAVSDEARVKSRIFSENGVLK 127

Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
              GLTFW+PN+N+ RDPRWGR  ET GEDPY+ G+  +  VRGLQ  E  +Y       
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQLGMAVVRGLQGPENGKYD------ 181

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             K+ AC KH+A +    W   +R  FD+  +T +D+ ET++  F+  V + DV  VMC+
Sbjct: 182 --KLHACAKHFAVHSGPEW---NRHSFDAENITPRDLWETYLPAFKDLVQKADVKEVMCA 236

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
           YNR  G P C   +LL Q +R +W + G +VSDC +I       +H    D KE A A  
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASAGA 295

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           + +G DL+CG  Y +    AV+ G I E  ID SL+ L      LG  D  P +  +  +
Sbjct: 296 VLSGTDLECGGEYGSLA-DAVKAGLIDEKQIDVSLKRLLTARFELGEMDEQPAWAEIPAS 354

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + + +H +LA   AR+ +VLL+N N  LPLNT ++K +A++GP+AN +    GNY G P
Sbjct: 355 TLNSKEHQDLALRMARESLVLLQNKNDILPLNT-DLK-VAVMGPNANDSVMQWGNYNGIP 412

Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
               + ++   +      + Y PGC
Sbjct: 413 GHTVTLLEAVRSKLPEGQVMYEPGC 437



 Score =  102 bits (255), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 86/297 (28%), Positives = 131/297 (44%), Gaps = 53/297 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A++  K+AD  +   G+  S+E E          G DR D+ LP  Q +L+  +  A K 
Sbjct: 591 AVEKVKDADVVLFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPAVQRDLLKALKKAGK- 649

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
              +V ++     I     +   ++IL   YPG+ GG AI DV+FG YNP GRLP+T+Y+
Sbjct: 650 --KVVFINYSGSAIGLVPESNTCEAILQGWYPGQAGGTAIVDVLFGDYNPAGRLPVTFYK 707

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
                     +P     +  GRTY++     ++PFG+GLSYT F Y  A   K+      
Sbjct: 708 DA------GQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTTFTYGEADLSKN------ 755

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
                     T+G                     T  I V N G+ DG EVV VY +   
Sbjct: 756 ----------TIGDGG----------------TVTLTIPVSNAGQRDGDEVVQVYLRCMA 789

Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
                   +  ++RV I AG++ +V   +   +S +  D A N++    G + +L G
Sbjct: 790 DKEGPHYTLRAFKRVHIPAGETKQVTIPLT-YESFEWFDTATNTVHPLKGTYELLYG 845


>gi|433679952|ref|ZP_20511614.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
 gi|430814928|emb|CCP42243.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
          Length = 909

 Score =  283 bits (723), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 172/440 (39%), Positives = 235/440 (53%), Gaps = 50/440 (11%)

Query: 31  RMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATS 90
           +MT  EKV Q  + A  +PRLG+P YEWW+E LHG++  G                 AT 
Sbjct: 67  KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNGY----------------ATV 110

Query: 91  FPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSPNINVVRDP 141
           FP  I   A++N +L +++G   STEARA +NL           AGLT WSPNIN+ RDP
Sbjct: 111 FPQAIGLAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDP 170

Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
           RWGR +ET GEDPY+ G+ A+ ++RGLQ         D  + P  I A  KH A +    
Sbjct: 171 RWGRGMETYGEDPYLTGQLAVGFIRGLQG--------DDLTHPRTI-ATPKHLAVHSGPE 221

Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
                R  FD  V+  D++ T+   F   + +G   +VMC+YN ++G P CA   LLN  
Sbjct: 222 ---PGRHGFDVDVSPHDLEATYTPAFRAAIVDGRAGAVMCAYNSLHGTPACAADWLLNGR 278

Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAV 321
           +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y +    A+
Sbjct: 279 LRGDWGFTGFVVSDCDAVDDMTQFHYFRADNAGSSAA-ALKAGHDLNCGYAYRDLGK-AI 336

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQHIELAAEAARQ 377
            +G   EA +D SL  L+    RLG     PQ    Y  LG  ++ +  H  LA +AA+Q
Sbjct: 337 ARGDADEAVLDQSLVRLFAARYRLGEL--QPQRKDPYARLGAKDVDSAAHRALALQAAQQ 394

Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVI 437
            IVLL+N N  LPL  G    LA++GP+A+A  A+  NY+GT     +P+ G        
Sbjct: 395 SIVLLQNRNATLPLRPG--LRLAVIGPNADALAALEANYQGTSAAPVTPLLGLRERFGAA 452

Query: 438 N--YAPGCADIVCQNNSMIP 455
           N  YA G A +    + MIP
Sbjct: 453 NLRYAQG-APLAAGVSGMIP 471



 Score =  140 bits (352), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 93/285 (32%), Positives = 139/285 (48%), Gaps = 45/285 (15%)

Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           V G +L ++    +G DR DL LP  Q  L+ + A A+  P+ +V+MS  AV +N+AK +
Sbjct: 651 VEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKQH 709

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
                  W  YPG+ GG AIA V+ G  NPGGRLP+T+Y +      Y S  ++      
Sbjct: 710 ADAIVAAW--YPGQSGGTAIAQVLAGDVNPGGRLPVTFYRSTKDLPAYVSYDMK------ 761

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++F G  ++ FG GLSYT+F Y   ++P           Q        G N     
Sbjct: 762 GRTYRYFKGEPLFAFGSGLSYTRFTY---AAP-----------QLSATTLQAGAN----- 802

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
                           + +V N G   G EVV VY +PP  A + ++ ++G++RV +  G
Sbjct: 803 -------------LQVRTQVRNSGTRAGDEVVQVYLQPPQGAQSPLRTLVGFQRVTLQPG 849

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFP 751
           ++ +VGF +   + L  VD A    +  G + + VG G  G   P
Sbjct: 850 EAREVGFELTP-RQLSDVDRAGQRAVQPGDYRVFVGGGQPGTGAP 893


>gi|182413194|ref|YP_001818260.1| glycoside hydrolase family 3 [Opitutus terrae PB90-1]
 gi|177840408|gb|ACB74660.1| glycoside hydrolase family 3 domain protein [Opitutus terrae
           PB90-1]
          Length = 859

 Score =  283 bits (723), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 231/817 (28%), Positives = 379/817 (46%), Gaps = 139/817 (17%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----------- 58
           PY D+  P   R +DL+ RM+L EK  Q+  L YG PR+     P   W           
Sbjct: 60  PYEDSSRPIDARIEDLLARMSLEEKTAQLTTL-YGFPRVLKDERPTSAWREAMWKDGIGN 118

Query: 59  -----------------------WS---EALHGVS--FIGRRTNSPPGTHFDSEVPG--- 87
                                  WS    AL+ V   FI +     P    +  + G   
Sbjct: 119 IDEHLNGNTGWTNNLADPVHDLPWSLHARALNEVQRWFIEQTRLGIPVDFTNEGIRGLLH 178

Query: 88  --ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWG 144
             ATSFP  +   ++++ +L ++IG+    EARA+      G T  +SP +++ RDPRWG
Sbjct: 179 SKATSFPAELAVASTWDPALVREIGRITGREARAL------GYTNIYSPVLDLARDPRWG 232

Query: 145 RVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEG 204
           R +ET GEDP++VG   +  VRGLQ     E+          + +  KH+A Y +     
Sbjct: 233 RTIETYGEDPFLVGTLGVEQVRGLQ----AEH----------VVSTLKHFAVYSIPKGGR 278

Query: 205 NDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRG 264
           +     D + T +++Q  F+ PF   + E     VM SYN  +G+P       L++ +RG
Sbjct: 279 DGEARTDPQATWREVQTIFLEPFRRAIREAGALGVMASYNDYDGVPVEGSALFLSEILRG 338

Query: 265 DWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGA---- 320
            W F GY+VSD  +++ I   H+ +  T  DA+ + ++AGL++      TNFT  A    
Sbjct: 339 QWGFRGYVVSDSAAVEFIHSKHR-VAPTPADAIRQAVEAGLNI-----RTNFTPPAAYAE 392

Query: 321 -----VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICN-PQHIELAAEA 374
                V+ GK+A A ID  +R +  V  +LG FD          + +   P+H+ +A  A
Sbjct: 393 PLRQLVRDGKLAMATIDARVRDVLRVKFQLGLFDRPYVADPAAADRVVRAPEHLVVAQRA 452

Query: 375 ARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA-- 432
            R+ IVLLKN+   LPL+   ++ + + GP A+   A    Y      + +P+ G  A  
Sbjct: 453 GREAIVLLKNEPALLPLDRAKLQRVLVAGPLADDAHAWWSRYGAQRLDFVTPLPGLRAKL 512

Query: 433 -YSKVINYAPG---------CADIV-----CQNNSMIPAAIDAAKNADATVIVAGLDLSV 477
             +  + YA G          +D++      +  + I AA+ AA+N D  + V G    +
Sbjct: 513 GAAVEVRYAKGVEAKDAAWPASDVLKDPPSAEVRAGIEAAVAAAQNVDVIIAVLGETDEL 572

Query: 478 EAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGY 537
             E   R+ L LPG+Q EL+  +    K P+ LV+ +   + + +A  +  + +I+ + +
Sbjct: 573 CRESSSRISLALPGYQQELLEALHATGK-PLVLVLSNGRPLSVVWAARH--VPAIVELWF 629

Query: 538 PGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV 597
           PGE+GG A+A V+ G  NP GRLPIT +  +  ++PY + P  P +    R +   +G  
Sbjct: 630 PGEDGGAALAAVLLGDANPSGRLPIT-FPQSVGQLPY-NFPAHPGSQ--ARDFGQVEG-S 684

Query: 598 VYPFGYGLSYTQFKYK-VASSPKSVDIKLD----------KDQQCRDINYTVGTNKPPCA 646
           ++PFG+GLSYT F+Y  +  +P+ + +             +    R   Y+V T      
Sbjct: 685 LFPFGHGLSYTTFRYSDLRITPERIPVDGFGAAGGGDPGLRGSASRATPYSVSTVP---- 740

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAA 705
                       +FT   +V N G   G EVV +Y +       T+   + G+ RV +A 
Sbjct: 741 ------------EFTITCDVTNTGTRAGDEVVQLYLRDDYSSVTTYDIALRGFARVTLAP 788

Query: 706 GQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           G++  V FT++    L++ +   + ++  G  T+++G
Sbjct: 789 GETKPVTFTLHRAH-LELYNRDGDWVVEPGRFTVMLG 824


>gi|237719778|ref|ZP_04550259.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
 gi|229451047|gb|EEO56838.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
          Length = 861

 Score =  283 bits (723), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 171/445 (38%), Positives = 239/445 (53%), Gaps = 44/445 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY D  L   +RA+DL+ R+TL EKV  M + +  +PRLG+  YEWW+EALHGV   G 
Sbjct: 24  LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
                           AT FP  I   ASFN+SL  ++    S EAR    + G +G   
Sbjct: 84  ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLK 127

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFW+PN+N+ RDPRWGR  ET GEDPY+ G+  +  VRGLQ  E        D+R
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DAR 179

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             K+ AC KH+A +    W   +R  FD+  +  +D+ ET++  F+  V +  V  VMC+
Sbjct: 180 YDKLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCA 236

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
           YNR  G P C   +LL Q +R +W + G +VSDC +I       +H+   D KE A A  
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETYPD-KEHASAGA 295

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           ++AG DL+CG  Y +    AV+ G I E +ID SL+ L      LG  D    +  +  +
Sbjct: 296 VRAGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTS 354

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + + +H  LA   AR+ +VLL+N N  LPLNT ++K +A++GP+AN +    GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-HLK-VAVMGPNANDSVMQWGNYNGIP 412

Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
               + ++   A      I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437



 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 84/297 (28%), Positives = 131/297 (44%), Gaps = 53/297 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+    +AD  +   G+  S+E E          G DR D+ LP  Q +L+  +  A K 
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGK- 649

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
              +V ++     I         ++IL   YPG+ GG AI D ++G+YNPGGRLP+T+Y+
Sbjct: 650 --KVVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
                     +P     +  GRTY++     ++PFG+GLSYT F Y  A        KL 
Sbjct: 708 ------DVNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEA--------KLS 753

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           K+   +  N                            I V N+G+ DG EVV VY + PG
Sbjct: 754 KNTIAKGEN------------------------VVLTIPVSNVGQRDGEEVVQVYLRRPG 789

Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
                   +  ++RV I AG++  V   +   ++ +  D  +N++    G + +L G
Sbjct: 790 DKEGPRYTLRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMCPLEGTYELLYG 845


>gi|423294294|ref|ZP_17272421.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
           CL03T12C18]
 gi|392675485|gb|EIY68926.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
           CL03T12C18]
          Length = 861

 Score =  282 bits (722), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 170/445 (38%), Positives = 237/445 (53%), Gaps = 44/445 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY D  L   +RA+DL+ R+TL EKV  M + +  +PRLG+  YEWW+EALHGV   G 
Sbjct: 24  LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
                           AT FP  I   ASFN+SL  ++    S EAR    + G +G   
Sbjct: 84  ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGALK 127

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFW+PN+N+ RDPRWGR  ET GEDPY+ G+  +  VRGLQ  E  +Y       
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDTKYD------ 181

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             K+ AC KH+A +    W   +R  FD+  +  +D+ ET++  F+  V +  V  VMC+
Sbjct: 182 --KLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCA 236

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
           YNR  G P C   +LL Q +R +W + G +VSDC +I       +H    D KE A A  
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASAAA 295

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           ++ G DL+CG  Y +    AV+ G I E +ID SL+ L      LG  D  P +  +  +
Sbjct: 296 VRTGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWSEIPAS 354

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + + +H  LA   AR+ +VLL+N N  LPLNT ++K +A++GP+AN +    GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-HLK-VAVMGPNANDSVMQWGNYNGIP 412

Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
               + ++   A      I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437



 Score =  105 bits (262), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 83/297 (27%), Positives = 128/297 (43%), Gaps = 53/297 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+    +AD  +   G+  S+E E          G DR D+ LP  Q    N +    K 
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---NLLKALKKA 647

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
              +V ++     I         ++IL   YPG+ GG AI D ++G+YNPGGRLP+T+Y+
Sbjct: 648 GKKVVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
                     +P     +  GRTY++     ++PFG+GLSYT F Y  A        KL 
Sbjct: 708 ------DVNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTTFTYGEA--------KLS 753

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           K+   +  N                            I V N+G+ DG EVV VY + PG
Sbjct: 754 KNTIAKGEN------------------------VVLTIPVSNVGQRDGEEVVQVYLRRPG 789

Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
                   +  ++RV I AG++  V   +   ++ +  D  +N++    G + +L G
Sbjct: 790 DKEGPRYTLRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMRPLEGTYELLYG 845


>gi|402304900|ref|ZP_10823963.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
 gi|400380686|gb|EJP33499.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
          Length = 866

 Score =  282 bits (722), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 165/452 (36%), Positives = 241/452 (53%), Gaps = 41/452 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +PY + +L   ERA+DL  R+TL EK + M + +  +PRLG+P +EWWSEALHG++  G 
Sbjct: 23  YPYQNPRLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWWSEALHGIARNG- 81

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
                           AT FP      AS+++ L  ++    S EA A  NL        
Sbjct: 82  ---------------FATVFPQTTAMAASWDDELLYRVFCAASDEAVAKNNLARKSGDIK 126

Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD---- 179
              G++ W+PNIN+ RDPRWGR  ET GEDPY+  R  +  V GLQ   G  + RD    
Sbjct: 127 RYQGVSIWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQ---GQPFRRDMRPF 183

Query: 180 -SDSRPLKISACCKHYAAYDLDNWEGNDRFHFD-SRVTEQDMQETFILPFEMCVNEGDVS 237
               R  K  AC KHYA +    W   +R  FD  R+ E+D+ ET++  F+  V EG+V 
Sbjct: 184 TERPRYYKTLACAKHYAVHSGPEW---NRHVFDVERLPERDLWETYLPAFKSLVQEGNVR 240

Query: 238 SVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV-ESHKFLNDTKEDA 296
            VMC+Y R++G P C + + L+Q +RG+W ++G +VSDC +I     E H  + +T  +A
Sbjct: 241 EVMCAYQRIDGSPCCGNTRYLHQILRGEWGYNGLVVSDCGAISDFYREGHHHVVETPAEA 300

Query: 297 VARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QY 354
            A  ++AG D++CG  Y      AV+QG I+   IDTS+  L      +G FD      +
Sbjct: 301 SAMGVRAGTDVECGAVYATLPR-AVEQGLISREAIDTSVVRLLKARFEVGDFDSEKLVPW 359

Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
           K  G   I +  H  LA + AR+ + LL+N N  LPL+   ++ +A++GP+AN +  + G
Sbjct: 360 KLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKNGLR-IAVMGPNANDSVMLWG 418

Query: 415 NYEGTPCRYTSPMDGFYAYSKVINYAPGCADI 446
           NY G P   T+ + G  +      +  GC  I
Sbjct: 419 NYTGYPISTTTILKGIRSKVPAARFVEGCGYI 450



 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 103/328 (31%), Positives = 150/328 (45%), Gaps = 67/328 (20%)

Query: 432 AYSKVINYAPGCADIVCQ----NNSMIPAAIDAAKNADATVIV---------AGLDLSVE 478
           AY   + Y    A  VCQ      S I A+  AA+  DA V+V          G ++ V+
Sbjct: 573 AYRVRVEYVQNKAMAVCQFDIARKSPITASEIAAQAGDADVVVFVGGISPRLEGEEMKVD 632

Query: 479 A---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
           A   +G DR  + LP  Q E+I  +  A K  V  V  S GAV +          ++L  
Sbjct: 633 APGFKGGDRTSIELPEAQREVIRLLRQAGK-LVVFVNCSGGAVAL--VPEAEACDAVLQA 689

Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
            Y GE GG+A+ADV+FG YNP G+LP+T+Y+++        +P        GRTY++F G
Sbjct: 690 WYAGEAGGQAVADVLFGDYNPSGKLPVTFYKSD------ADLPDFLDYRMTGRTYRYFRG 743

Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
             ++PFG+GLSYT F +     P+                Y  G                
Sbjct: 744 TPLFPFGFGLSYTSFAF---GKPR----------------YENG---------------- 768

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
                   +EV N GK DG+EVV VY K P  A   +K + G+ R+ + AG+  +V   M
Sbjct: 769 -----MLYVEVTNTGKRDGAEVVQVYVKNPADADGPVKTLRGFARIDLKAGERRRVEIAM 823

Query: 716 NACKSLKIVDNAANSL-LASGAHTILVG 742
              +  +  D  AN++ +  G H ++VG
Sbjct: 824 PR-ERFEGWDATANTMRVKPGNHLLMVG 850


>gi|297736784|emb|CBI25985.3| unnamed protein product [Vitis vinifera]
          Length = 241

 Score =  282 bits (722), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 133/212 (62%), Positives = 162/212 (76%)

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDP+ V  YA++YVRGLQDVEG E   D +SRPLK+S+  KH+AAYDLDNW   DR HF
Sbjct: 9   GEDPFTVSVYAVSYVRGLQDVEGTENTTDLNSRPLKVSSSGKHFAAYDLDNWLNVDRNHF 68

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
           ++RV+EQDM ETF+ PFE CV EGDVS VMCS+N +NGIP CADP+L   TIR +WN HG
Sbjct: 69  NARVSEQDMAETFLRPFEACVREGDVSGVMCSFNNINGIPPCADPRLFKGTIRDEWNLHG 128

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEAD 330
           YIVSDC SI+TIVE  KFL+ T E+AVA  LKAGLDL+CG YY +    AV  G++ + D
Sbjct: 129 YIVSDCWSIETIVEDQKFLDVTGEEAVALNLKAGLDLECGHYYNDSPASAVMAGRVGQHD 188

Query: 331 IDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNI 362
           +D SL  LY+VLMRLG+FDG P   +LGK++I
Sbjct: 189 LDQSLSNLYVVLMRLGFFDGIPALASLGKDDI 220


>gi|198274480|ref|ZP_03207012.1| hypothetical protein BACPLE_00628 [Bacteroides plebeius DSM 17135]
 gi|198272682|gb|EDY96951.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           plebeius DSM 17135]
          Length = 912

 Score =  282 bits (722), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 231/809 (28%), Positives = 367/809 (45%), Gaps = 142/809 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
           Y D K P  ER +DL+ +MT+ EK  QM  L YG  R+    LP  +W ++    G+  I
Sbjct: 18  YEDPKAPLNERIEDLLSQMTVEEKTCQMVTL-YGYQRVLKDSLPTPDWKNQLWKDGIGAI 76

Query: 70  GRRTNS---------------PPGTH----------FDSE----VPG------------- 87
               N+               P   H          F  E    +P              
Sbjct: 77  DEHLNAFRGWGVPPMQNELVWPASNHAWALNEVQRFFVEETRLGIPADFTNEGIRGVENY 136

Query: 88  -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
            AT+FPT +    ++N  L ++IG     EAR +      G T  ++P ++V RD RWGR
Sbjct: 137 IATNFPTQLALGHTWNRELIRQIGYITGREARLL------GYTNVYAPILDVGRDQRWGR 190

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
             E  GE PY+V    I   +GLQ               +++++  KH+ AY  +     
Sbjct: 191 YEEVYGESPYLVAELGIAMGKGLQT-------------DMQVASTAKHFIAYSNNKGARE 237

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
                D +++ ++++     PF   + E  +  VM SYN  +G P  +    L Q +RG 
Sbjct: 238 GFARVDPQMSWREVENIHAYPFTRVIQEAGILGVMSSYNDYDGFPIQSSYYWLTQRLRGT 297

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
             F GY+VSD D+++ +   HK   D KE AV + ++AGL++ C     + Y       +
Sbjct: 298 MGFRGYVVSDSDAVEYLYSKHKTAKDMKE-AVRQSVEAGLNVRCTFRSPESYVLPLRELI 356

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYK-NLGKNNICNPQHIELAAEAARQGIV 380
           Q+G ++   ID  +R +  V    G FD   Q    L    + +  H ++A +A+R+G+V
Sbjct: 357 QEGGLSMETIDNRVRDILRVKFLTGLFDTPYQTDLALADKEVNSEAHQQVALQASREGLV 416

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK---VI 437
           LLKN N  LPL+   IK +A+ GP+A+     + +Y       T+ ++G     K    +
Sbjct: 417 LLKNANNLLPLDKSQIKRIAVCGPNADEASFALTHYGPVAVEVTTVLEGIKQQVKEGTKV 476

Query: 438 NYAPGC---------ADIV-----CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
            Y  GC         ++I+      +  + I  A+D  K +D  V+V G  +    E K 
Sbjct: 477 TYTKGCDLVDANWPESEIISYPLTAEEKTEIQKAVDNVKESDVAVVVLGGGIRTCGENKS 536

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R  L LPG Q +L+  +    K PV LV+++   + IN+A  +  + +IL   YPG +GG
Sbjct: 537 RTSLDLPGHQQQLLEAIVATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSQGG 593

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG-------RTYKFFDGP 596
            AIA+ +FG YNPGG+L +T +     +IP+ + P +P +   G             +GP
Sbjct: 594 TAIAEALFGDYNPGGKLTVT-FPKTVGQIPF-NFPAKPASQVDGGQTPGMKGNQSRINGP 651

Query: 597 VVYPFGYGLSYTQFKYK--VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
            +YPFGYGLSYT F+Y     SSP                   V T+K P        V 
Sbjct: 652 -LYPFGYGLSYTTFEYSNLQLSSP-------------------VITDKEPVT------VT 685

Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGF 713
           CK         ++N G   G EVV +Y++       T+ K + G+ERV +  G++ KV F
Sbjct: 686 CK---------IKNTGTRSGDEVVQLYTRDVISSVTTYEKNLRGFERVHLEPGETKKVSF 736

Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVG 742
            +   +  ++++   + ++  G   I++G
Sbjct: 737 QL-LPRDFQLLNKDNHWVVEPGMFQIMIG 764


>gi|189463167|ref|ZP_03011952.1| hypothetical protein BACCOP_03878 [Bacteroides coprocola DSM 17136]
 gi|189430146|gb|EDU99130.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           coprocola DSM 17136]
          Length = 865

 Score =  282 bits (722), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 170/425 (40%), Positives = 233/425 (54%), Gaps = 42/425 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           FPY +  L   +RA DL+ER+TL EKV  M + +  +PRLG+  Y+WW+EALHGV   G 
Sbjct: 25  FPYQNTSLTPEQRASDLLERLTLEEKVSLMQNASPAIPRLGIKAYDWWNEALHGVGRAGI 84

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGN-- 125
                           AT FP  I   ASF++ L  K+   VS EARA Y      GN  
Sbjct: 85  ----------------ATVFPQTIGMAASFDDELIYKVFTAVSDEARAKYTEFSKSGNLK 128

Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
              GLTFW+PNIN+ RDPRWGR  ET GEDPY+  R  +  VRGLQ  + ++Y       
Sbjct: 129 RYQGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVRGLQGPDNMKYD------ 182

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             K+ AC KHYA +    W   +R  F++  +  +D+ ET++  F+  V E DV  VMC+
Sbjct: 183 --KLHACAKHYAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKALVQEADVKEVMCA 237

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES--HKFLNDTKEDAVARV 300
           YNR  G P C   +LL Q +R +W + G IVSDC +I        H+   D KE A A  
Sbjct: 238 YNRFEGEPCCGSNRLLMQILRDEWKYKGIIVSDCGAISDFWRKGDHETHPD-KETASAGA 296

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           + +G DL+CG+ Y +    AVQ+G I E  ID S++ L      LG  D    + ++  +
Sbjct: 297 VLSGTDLECGNNYKSLPE-AVQKGLIDEKQIDISVKRLLTARFELGEMDEHVCWDSIPYS 355

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + +  H +LA E AR+ IVLL+N N  LPL   ++K +AL+GP+AN +    GNY G P
Sbjct: 356 VVDSKAHKDLALEIARKSIVLLQNRNNILPLKE-DMK-IALIGPNANDSVMQWGNYNGFP 413

Query: 421 CRYTS 425
              ++
Sbjct: 414 SHTST 418



 Score =  123 bits (309), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 94/303 (31%), Positives = 138/303 (45%), Gaps = 59/303 (19%)

Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
           + A+ID  K AD  V   G+  S+E E          G DR  + LP  Q  LI+++   
Sbjct: 591 LQASIDKVKAADVIVFAGGISPSLEGEEMPVNAEGFKGGDRTTIELPAIQRRLISELKKL 650

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIK---SILWVGYPGEEGGRAIADVIFGKYNPGGRL 560
            K P+  V  S  AV +      P+ K   +IL   YPG+ GG A+ADV+FG YNP G+L
Sbjct: 651 GK-PIIFVNYSGSAVGLE-----PESKICDAILQAWYPGQAGGTAVADVLFGDYNPSGKL 704

Query: 561 PITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS 620
           P+T+Y+          +P     +  GRTY++     +Y FG+GLSYT F Y  A+  + 
Sbjct: 705 PVTFYKHT------DQLPDFQDYSMKGRTYRYMTESPLYSFGHGLSYTNFTYGPATLSQQ 758

Query: 621 VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
                           T+   K                + T  I V+N G  DG EVV V
Sbjct: 759 ----------------TISQGK----------------EVTLTIPVQNTGNYDGEEVVQV 786

Query: 681 YSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTI 739
           Y    G        +  ++RV IA GQ A V FT+++ ++ +  D   N++ +  G + +
Sbjct: 787 YLSCSGDKEGPSHTLRAFKRVHIAKGQRANVSFTLDS-ETFQWFDTNTNTMRMVEGNYEL 845

Query: 740 LVG 742
           L G
Sbjct: 846 LYG 848


>gi|293370402|ref|ZP_06616956.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292634550|gb|EFF53085.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 863

 Score =  282 bits (722), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 169/448 (37%), Positives = 241/448 (53%), Gaps = 46/448 (10%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S +PY D KL   +RA DL++R+TL EKV  M + +  +PRLG+  YEWW+EALHGV+  
Sbjct: 24  SKYPYQDTKLTVEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARA 83

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---- 125
           G                 AT FP  I   ASFN+ L  ++   VS EARA     N    
Sbjct: 84  GL----------------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQ 127

Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
                GLT W+PN+N+ RDPRWGR  ET GEDPY+ GR  +  VRGLQ  E  EY     
Sbjct: 128 YKRYQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYD---- 183

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVM 240
               K+ AC KH+A +    W   +R  F++  +  +D+ ET++  F+  V +  V  VM
Sbjct: 184 ----KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVM 236

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV--- 297
           C+YNR  G P C   +LL Q +R DW F G +V+DC +I    +  K  ++T  DA    
Sbjct: 237 CAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKK--HETHPDAAHAS 294

Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL 357
           A  + +G DL+CG  + + T  AV++  I+E  I+TS++ +      LG  + +  + N+
Sbjct: 295 ADAVLSGTDLECGGNFKSIT-DAVKKDLISEEKINTSVKRVLKARFELGEMNSTHPWSNI 353

Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
             + I  P+H ELA + A + +VLL+N+N  LPLN      +A++GP+AN +    GNY 
Sbjct: 354 PFSVIDCPKHKELALKMAHESLVLLQNNNNILPLNRQ--MKVAVIGPNANDSVMQWGNYN 411

Query: 418 GTPCRYTSPMDGFYAY--SKVINYAPGC 443
           G P    + ++G  A      I Y P C
Sbjct: 412 GFPSHTVTLLEGIRAKLPDAQIIYEPVC 439



 Score =  118 bits (295), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 94/296 (31%), Positives = 135/296 (45%), Gaps = 53/296 (17%)

Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
           ++  ++AD  +   G+   +E E          G DR ++ LP  Q E++  +    K  
Sbjct: 594 LNKLQSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKKT 653

Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
           V  V  S  A+ I     N    +IL   YPG+ GG A+ADV+FG YNP GRLPIT+Y++
Sbjct: 654 V-FVNFSGSAMAIVPETQN--CDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKS 710

Query: 568 NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
                 Y    ++      GRTY+F     +YPFGYGLSYT+F Y  A+  +S   KL K
Sbjct: 711 MQQLPDYEDYSMK------GRTYRFMTETPLYPFGYGLSYTRFSYGKATLNQS---KLTK 761

Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI 687
            +                             K    I V N+G+ DG EVV VY   P  
Sbjct: 762 GE-----------------------------KAILTIPVSNVGQRDGEEVVQVYICRPDD 792

Query: 688 AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
                K + G++RV IA G++  V   +    S +  D A N++   +G + IL G
Sbjct: 793 KEGPQKTLRGFQRVSIAKGKTQNVQIEL-PYDSFEWFDAATNTIRPLNGTYKILYG 847


>gi|315607027|ref|ZP_07882031.1| beta-glucosidase [Prevotella buccae ATCC 33574]
 gi|315251081|gb|EFU31066.1| beta-glucosidase [Prevotella buccae ATCC 33574]
          Length = 866

 Score =  282 bits (722), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 165/452 (36%), Positives = 241/452 (53%), Gaps = 41/452 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +PY + +L   ERA+DL  R+TL EK + M + +  +PRLG+P +EWWSEALHG++  G 
Sbjct: 23  YPYQNLQLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWWSEALHGIARNG- 81

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN------ 125
                           AT FP      AS+++ L  ++    S EA A  NL        
Sbjct: 82  ---------------FATVFPQTTAMAASWDDELLYRVFCAASDEAVAKNNLARKSGDIK 126

Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD---- 179
              G++ W+PNIN+ RDPRWGR  ET GEDPY+  R  +  V GLQ   G  + RD    
Sbjct: 127 RYQGVSIWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQ---GQPFRRDMRPF 183

Query: 180 -SDSRPLKISACCKHYAAYDLDNWEGNDRFHFD-SRVTEQDMQETFILPFEMCVNEGDVS 237
               R  K  AC KHYA +    W   +R  FD  R+ E+D+ ET++  F+  V EG+V 
Sbjct: 184 TERPRYYKTLACAKHYAVHSGPEW---NRHVFDVERLPERDLWETYLPAFKSLVQEGNVR 240

Query: 238 SVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV-ESHKFLNDTKEDA 296
            VMC+Y R++G P C + + L+Q +RG+W ++G +VSDC +I     E H  + +T  +A
Sbjct: 241 EVMCAYQRIDGSPCCGNTRYLHQILRGEWGYNGLVVSDCGAISDFYREGHHHVVETPAEA 300

Query: 297 VARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QY 354
            A  ++AG D++CG  Y      AV+QG I+   IDTS+  L      +G FD      +
Sbjct: 301 SAMGVRAGTDVECGAVYATLPR-AVEQGLISREAIDTSVVRLLKARFEVGDFDSEKLVPW 359

Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
           K  G   I +  H  LA + AR+ + LL+N N  LPL+   ++ +A++GP+AN +  + G
Sbjct: 360 KLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKNGLR-IAVMGPNANDSVMLWG 418

Query: 415 NYEGTPCRYTSPMDGFYAYSKVINYAPGCADI 446
           NY G P   T+ + G  +      +  GC  I
Sbjct: 419 NYTGYPISTTTILKGIRSKVPAARFVEGCGYI 450



 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 103/328 (31%), Positives = 149/328 (45%), Gaps = 67/328 (20%)

Query: 432 AYSKVINYAPGCADIVCQ----NNSMIPAAIDAAK--NADATVIVAGLDLSVEAE----- 480
           AY   + Y    A  VCQ      S I A+  AA+  +AD  V V G+   +E E     
Sbjct: 573 AYRVRVEYVQNKAMAVCQFDIARKSPITASEIAAQAGDADVVVFVGGISPRLEGEEMKVD 632

Query: 481 -----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
                G DR  + LP  Q E+I  +  A K  V  V  S GAV +          ++L  
Sbjct: 633 APGFNGGDRTSIELPEAQREVIRLLRQAGK-LVVFVNCSGGAVAL--VPEAEACDAVLQA 689

Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
            Y GE GG+A+ADV+FG YNP G+LP+T+Y+++        +P        GRTY++F G
Sbjct: 690 WYAGEAGGQAVADVLFGDYNPSGKLPVTFYKSD------ADLPDFLDYRMTGRTYRYFRG 743

Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
             ++PFG+GLSYT F   V  +P+  + KL                              
Sbjct: 744 TPLFPFGFGLSYTSF---VFGTPRYENGKL------------------------------ 770

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
                   +EV N GK DG+EVV VY K P  A   +K + G+ R+ + AG+  +V   M
Sbjct: 771 -------YVEVTNTGKRDGAEVVQVYVKNPADADGPVKTLRGFARIDLKAGERRRVEIAM 823

Query: 716 NACKSLKIVDNAANSL-LASGAHTILVG 742
              +  +  D   N++ +  G H ++VG
Sbjct: 824 PR-ERFEGWDATTNTMRVKPGNHLLMVG 850


>gi|295086418|emb|CBK67941.1| Beta-glucosidase-related glycosidases [Bacteroides xylanisolvens
           XB1A]
          Length = 861

 Score =  282 bits (721), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 171/445 (38%), Positives = 239/445 (53%), Gaps = 44/445 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY D  L   +RA+DL+ R+TL EKV  M + +  +PRLG+  YEWW+EALHGV   G 
Sbjct: 24  LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
                           AT FP  I   ASFN+SL  ++    S EAR    + G +G   
Sbjct: 84  ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLK 127

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFW+PN+N+ RDPRWGR  ET GEDPY+ G+  +  VRGLQ  E        D+R
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DAR 179

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             K+ AC KH+A +    W   +R  FD+  +  +D+ ET++  F+  V +  V  VMC+
Sbjct: 180 YDKLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCA 236

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
           YNR  G P C   +LL Q +R +W + G +VSDC +I       +H+   D KE A A  
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPD-KEHASAGA 295

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           ++AG DL+CG  Y +    AV+ G I E +ID SL+ L      LG  D    +  +  +
Sbjct: 296 VRAGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTS 354

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + + +H  LA   AR+ +VLL+N N  LPLNT ++K +A++GP+AN +    GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-HLK-VAVMGPNANDSVMQWGNYNGIP 412

Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
               + ++   A      I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437



 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 84/297 (28%), Positives = 131/297 (44%), Gaps = 53/297 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+    +AD  +   G+  S+E E          G DR D+ LP  Q +L+  +  A K 
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGK- 649

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
              +V ++     I         ++IL   YPG+ GG AI D ++G+YNPGGRLP+T+Y+
Sbjct: 650 --KVVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
                     +P     +  GRTY++     ++PFG+GLSYT F Y  A        KL 
Sbjct: 708 ------DVNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEA--------KLS 753

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           K+   +  N                            I V N+G+ DG EVV VY + PG
Sbjct: 754 KNTIAKGEN------------------------VVLTIPVSNVGQRDGEEVVQVYLRRPG 789

Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
                   +  ++RV I AG++  V   +   ++ +  D  +N++    G + +L G
Sbjct: 790 DKEGPRYTLRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMCPLEGTYELLYG 845


>gi|392537607|ref|ZP_10284744.1| Beta-glucosidase [Pseudoalteromonas marina mano4]
          Length = 870

 Score =  282 bits (721), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 166/428 (38%), Positives = 239/428 (55%), Gaps = 47/428 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y +      ER  DLV R+TL EKV Q+ D +  + RL +P Y WW+EALHGV+  G+  
Sbjct: 34  YLNESASIDERVNDLVTRLTLEEKVAQLFDKSPAIERLNIPEYNWWNEALHGVARAGK-- 91

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   A+F+E L  ++G  +S E RA ++   A       
Sbjct: 92  --------------ATVFPQAIGLAATFDEDLMLRVGTAISDEGRAKHHAFLAENNRSMY 137

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT+WSPNIN+ RDPRWGR  ET GEDPY+  R A+N++ GLQ  +  EY        L
Sbjct: 138 TGLTYWSPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQG-DNTEY--------L 188

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K  A  KHYA +         R   D   +++D+ ET++  F+  + +  V+SVMC+YN 
Sbjct: 189 KSVATLKHYAVHSGPEVS---RHSDDYTASKKDLAETYLPAFKDVIAQTKVASVMCAYNS 245

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI--VESHKFLNDTKEDAVARVLKA 303
           VNG P C + +L+   +R ++NF GYIVSDC +I     V+SH  +N T+  A A  LK 
Sbjct: 246 VNGTPACGNDELIQNKLRDEFNFDGYIVSDCGAIADFYDVKSHNIVN-TEAKAAAMALKT 304

Query: 304 GLDLDCGDYYTN---FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLG 358
           G DL+CGD++ N   +   AV++G + E D+D +L+ L     +LG FD      Y +  
Sbjct: 305 GTDLNCGDHHGNTYSYLSQAVKEGLVEEKDVDKALKRLMYARFKLGMFDNPENVPYSDTS 364

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
            + + + +H+ L  EAA++ +VLLKN+   LPL  GN K +AL+GP+A+    ++GNY G
Sbjct: 365 IDIVGSNKHLALTQEAAKKSLVLLKNEQ-VLPL-KGNEK-VALIGPNADNEAILLGNYNG 421

Query: 419 TPCRYTSP 426
            P    +P
Sbjct: 422 MPIVPITP 429



 Score =  118 bits (295), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 93/328 (28%), Positives = 148/328 (45%), Gaps = 58/328 (17%)

Query: 431 YAYSKVINYAPGCADIVCQN-NSMIPAAIDAAKNADATVIVAGLDLSVEAEGK------- 482
           + +S VIN  P  +    +N  S+   A++ A  AD  V V G+  ++E E         
Sbjct: 574 FWHSNVIN--PTASLTWLKNPQSLTQQALNNANEADVIVFVGGISANLEGEEMPLQIDGF 631

Query: 483 ---DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
              DR ++ LP  Q  L+ K+    K P+ LV MS  A+ +N+   N  I +I+   YPG
Sbjct: 632 SHGDRTNINLPKSQLNLLKKLKQTGK-PIVLVNMSGSAMALNWENEN--IDAIIQGFYPG 688

Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
           E  G A+  +++G+Y+P G+LPIT+Y++       + +P     +   RTYK+++G V+Y
Sbjct: 689 EAAGSALVSLLYGEYSPSGKLPITFYKS------VSDLPDFKDYSMKNRTYKYYEGEVLY 742

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           PFG+GLSY  FKYK        + +   D    D+N T                      
Sbjct: 743 PFGFGLSYADFKYK--------NTRHSIDAGSGDLNLTT--------------------- 773

Query: 660 FTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
                 + N       +VV VY S P     T  KQ++G++ + +       + FT+   
Sbjct: 774 -----TITNQSSFSADDVVQVYVSMPDAPIKTPNKQLVGFKHITLKNESKNDIKFTIPKN 828

Query: 719 KSLKIVDNAANSLLASGAHTILVGEGVG 746
           K L  ++    ++   G   I VG G G
Sbjct: 829 K-LSYINEQGIAVAYKGRLIITVGSGQG 855


>gi|299149391|ref|ZP_07042448.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298512578|gb|EFI36470.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 853

 Score =  282 bits (721), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 161/421 (38%), Positives = 237/421 (56%), Gaps = 45/421 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y +   P  ER  DL+ R+T+ EK+  +   + G+PRLG+  Y   +EALHGV   GR  
Sbjct: 30  YKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-- 87

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
                          T FP  I   A++N  L K++   +S EARA +N  + G      
Sbjct: 88  --------------FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQ 133

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFWSP +N+ RDPRWGR  ET GEDP++ G     +V+GLQ           D R
Sbjct: 134 FSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQG---------DDPR 184

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LKI +  KH+AA    N E ++RF  + +++E+ ++E +   FEMCV EG  +S+M +Y
Sbjct: 185 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAY 240

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N +N +P   +  LL + +R DW F GY+VSDC     +V +HK++  TKE A    ++A
Sbjct: 241 NALNDVPCTLNAWLLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSIQA 299

Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           GLDL+CG D Y  + + A +Q  +++ADID++   +    M+LG FDG+ +  Y  +  +
Sbjct: 300 GLDLECGDDVYDEYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDGTERNPYTRISPS 359

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            I + +H ++A +AAR+ IVLLKN N  LPLN   +K++A+VG   NA K   G+Y G P
Sbjct: 360 VIGSKEHQQIALDAARECIVLLKNKNNMLPLNVNKVKSIAVVG--INAGKCEFGDYSGAP 417

Query: 421 C 421
            
Sbjct: 418 V 418



 Score =  136 bits (342), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 95/303 (31%), Positives = 147/303 (48%), Gaps = 55/303 (18%)

Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-V 518
           A +  +  V V G++ S+E EG+DR D+ LP  Q E + ++      P  +V++ AG+ +
Sbjct: 600 AVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAGSSL 657

Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
            +N+   +  I +I+   YPGE+GG A+ADV+FG YNP GRLP+T+Y++         +P
Sbjct: 658 AVNWMDEH--IPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS------LDELP 709

Query: 579 -LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
                +   GRTYK+F G V+YPFGYGLSY+ FKY                         
Sbjct: 710 AFDDYDITKGRTYKYFKGDVLYPFGYGLSYSSFKYS------------------------ 745

Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQI--EVENMGKMDGSEVVMVYSKPPGIAG-THIKQ 694
                         D+K KD   T  +   ++N GK  G EV  VY + P   G   IK+
Sbjct: 746 --------------DLKVKDGANTISVSFRLKNTGKRKGDEVAQVYVRIPETGGVVPIKE 791

Query: 695 VIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVGEGVGGVSFPLQ 753
           + G+ R+ + +G+S  V   ++  + L+  D      +   GA  I+VG     +     
Sbjct: 792 LKGFRRIPLKSGESRVVDIELDK-EQLRYWDAGLGQFIVPQGAFDIMVGASSKDIRLQTV 850

Query: 754 LNL 756
           +NL
Sbjct: 851 INL 853


>gi|393782428|ref|ZP_10370612.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
           CL02T12C01]
 gi|392673256|gb|EIY66719.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
           CL02T12C01]
          Length = 596

 Score =  282 bits (721), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 204/635 (32%), Positives = 308/635 (48%), Gaps = 84/635 (13%)

Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
           +T+WSPN+N+ RDPRWGR  ET GEDPY+       YVRGLQ          +D   LK 
Sbjct: 1   MTYWSPNVNIFRDPRWGRGQETYGEDPYLTAEIGKAYVRGLQG---------NDPFFLKA 51

Query: 188 SACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           +AC KHYA +      G +  R  F++  +++D+ ET++  FE  V E  V +VM +YNR
Sbjct: 52  AACAKHYAVH-----SGPEALRHEFNASPSKRDLFETYLPAFEALVKEAKVEAVMGAYNR 106

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           V G        LL   +R  W F G++VSDC ++  I   HK   D  E A A  LK+GL
Sbjct: 107 VYGESASGSFFLLTDILRKKWGFKGHVVSDCGAVDDIYGGHKIAKDVAE-ASAIALKSGL 165

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF---DGSPQYKNLGKNNI 362
           +L+CG  +      A+++  I E D+D +L  L +  ++LG     D SP YKN+  + I
Sbjct: 166 NLNCGGSFHALKE-ALERKLITEVDLDNALMPLMMTRLKLGNLTDDDESP-YKNISDSVI 223

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
            +  H  +A E A++ +VLLKN+N  LPL   ++KT+ + GP+A  T  M+GNY G   R
Sbjct: 224 ASYTHAMVAREVAQKSMVLLKNNNHTLPLKK-DVKTIFVTGPYAADTYVMMGNYYGVSPR 282

Query: 423 YTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPA--AIDAAKNADATVIVAGL---- 473
             + + G  A       INY  G   I+    +M PA   +   + A+  ++V GL    
Sbjct: 283 SNTFLQGIAAKVSGGTSINYKIG---ILPTTPNMNPADWTVGEVRAAEVAIVVIGLSGID 339

Query: 474 -----DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
                D    +   D+ +L LP  Q + +  ++      +  VI     +D+        
Sbjct: 340 EGEEGDAIASSHRGDKQNLKLPEHQLKFLRDISRNRWNKLVTVITGGSPIDLEEVSELSD 399

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGR 588
              + W  YPG+EGG A+ D++FG  +  GR+P+T+       I    +P     N  GR
Sbjct: 400 AVIMAW--YPGQEGGMALGDLLFGDVSFSGRMPVTF------PINSDWLPAFEDYNMQGR 451

Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
           TYK+    ++YPFGYGL+Y    Y   S  K ++ K D  Q+                  
Sbjct: 452 TYKYMTDNIMYPFGYGLTYGDVSY---SDVKILNPKYDGKQEIH---------------- 492

Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG--THIKQVIGYERVFIAAG 706
                         Q  + N G  +  EVV +Y   PG AG  T I  +IG++RV + + 
Sbjct: 493 -------------VQATLRNNGNNEVEEVVQLYLSAPG-AGVITPISSLIGFKRVTLESH 538

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
            S  V F +   +   ++++ + +LL  G +TI+V
Sbjct: 539 LSQTVEFIIKPDQLKMVMEDGSKNLL-KGKYTIIV 572


>gi|336417083|ref|ZP_08597412.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
           3_8_47FAA]
 gi|335936708|gb|EGM98626.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
           3_8_47FAA]
          Length = 850

 Score =  281 bits (720), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 161/421 (38%), Positives = 237/421 (56%), Gaps = 45/421 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y +   P  ER  DL+ R+T+ EK+  +   + G+PRLG+  Y   +EALHGV   GR  
Sbjct: 27  YKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-- 84

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
                          T FP  I   A++N  L K++   +S EARA +N  + G      
Sbjct: 85  --------------FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQ 130

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFWSP +N+ RDPRWGR  ET GEDP++ G     +V+GLQ           D R
Sbjct: 131 FSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQG---------DDPR 181

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LKI +  KH+AA    N E ++RF  + +++E+ ++E +   FEMCV EG  +S+M +Y
Sbjct: 182 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAY 237

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N +N +P   +  LL + +R DW F GY+VSDC     +V +HK++  TKE A    ++A
Sbjct: 238 NALNDVPCTLNAWLLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSIQA 296

Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           GLDL+CG D Y  + + A +Q  +++ADID++   +    M+LG FDG+ +  Y  +  +
Sbjct: 297 GLDLECGDDVYDEYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDGTERNPYTRISPS 356

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            I + +H ++A +AAR+ IVLLKN N  LPLN   +K++A+VG   NA K   G+Y G P
Sbjct: 357 VIGSKEHQQIALDAARECIVLLKNKNNMLPLNVNKVKSIAVVG--INAGKCEFGDYSGAP 414

Query: 421 C 421
            
Sbjct: 415 V 415



 Score =  135 bits (340), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 95/303 (31%), Positives = 147/303 (48%), Gaps = 55/303 (18%)

Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-V 518
           A +  +  V V G++ S+E EG+DR D+ LP  Q E + ++      P  +V++ AG+ +
Sbjct: 597 AVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAGSSL 654

Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
            +N+   +  I +I+   YPGE+GG A+ADV+FG YNP GRLP+T+Y++         +P
Sbjct: 655 AVNWMDEH--IPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS------LDELP 706

Query: 579 -LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
                +   GRTYK+F G V+YPFGYGLSY+ FKY                         
Sbjct: 707 AFDDYDITKGRTYKYFKGDVLYPFGYGLSYSSFKY------------------------- 741

Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQI--EVENMGKMDGSEVVMVYSKPPGIAG-THIKQ 694
                         D+K KD   T  +   ++N GK  G EV  VY + P   G   IK+
Sbjct: 742 -------------SDLKVKDGANTVSVSFRLKNTGKRKGDEVAQVYVRIPETGGVVPIKE 788

Query: 695 VIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVGEGVGGVSFPLQ 753
           + G+ R+ + +G+S  V   ++  + L+  D      +   GA  I+VG     +     
Sbjct: 789 LKGFRRIPLKSGESRVVEIELDK-EQLRYWDAGLGRFIVPQGAFDIMVGASSKDIRLQTV 847

Query: 754 LNL 756
           +NL
Sbjct: 848 INL 850


>gi|359450637|ref|ZP_09240068.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
 gi|358043611|dbj|GAA76317.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
          Length = 468

 Score =  281 bits (720), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 166/428 (38%), Positives = 238/428 (55%), Gaps = 47/428 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y +      ER  DLV R+TL EKV Q+ D +  + RL +P Y WW+EALHGV+  G+  
Sbjct: 34  YLNKSASIDERVNDLVTRLTLEEKVAQLFDKSPAIERLNMPEYNWWNEALHGVARAGK-- 91

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL--------GN 125
                         AT FP  I   A+F+E L  ++G  +S E RA ++           
Sbjct: 92  --------------ATVFPQAIGLAATFDEDLMLRVGTAISDEGRAKHHAFLEENNRSMY 137

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT+WSPNIN+ RDPRWGR  ET GEDPY+  R A+N++ GLQ  +  EY        L
Sbjct: 138 TGLTYWSPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQG-DNAEY--------L 188

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K  A  KHYA +   +     R   D   +E+D+ ET++  F+  + +  V+SVMC+YN 
Sbjct: 189 KSVATLKHYAVH---SGPEVSRHSDDYTASEKDLAETYLPAFKDVIAQTKVASVMCAYNS 245

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI--VESHKFLNDTKEDAVARVLKA 303
           VNG P C + +L+   +R ++NF GYIVSDC +I     V+SH  +N T   A A  LK 
Sbjct: 246 VNGTPACGNDELIQNKLRDEFNFDGYIVSDCGAIADFYDVKSHNIVN-TGAKAAAMALKT 304

Query: 304 GLDLDCGDYYTN---FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLG 358
           G DL+CGD++ N   +   AV++G + E D+D +L+ L     +LG FD      Y +  
Sbjct: 305 GTDLNCGDHHGNTYSYLTQAVKEGLVEEKDVDKALKRLMYARFKLGMFDNPENVPYSDTS 364

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
            + + + +H+ L  EAA++ +VLLKN+   LPL  GN K +AL+GP+A+    ++GNY G
Sbjct: 365 IDVVGSNKHLALTQEAAQKSLVLLKNEQ-VLPLK-GNEK-IALIGPNADNEAILLGNYNG 421

Query: 419 TPCRYTSP 426
            P    +P
Sbjct: 422 MPIVPITP 429


>gi|262405256|ref|ZP_06081806.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294644754|ref|ZP_06722499.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294810589|ref|ZP_06769241.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
 gi|345508031|ref|ZP_08787672.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
 gi|229444722|gb|EEO50513.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
 gi|262356131|gb|EEZ05221.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|292639876|gb|EFF58149.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294442250|gb|EFG11065.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
          Length = 861

 Score =  281 bits (720), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 171/445 (38%), Positives = 238/445 (53%), Gaps = 44/445 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY D  L   +RA+DL+ R+TL EKV  M + +  +PRLG+  YEWW+EALHGV   G 
Sbjct: 24  LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
                           AT FP  I   ASFN+SL  ++    S EAR    + G +G   
Sbjct: 84  ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLK 127

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFW+PN+N+ RDPRWGR  ET GEDPY+ G+  +  VRGLQ  E        D+R
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DAR 179

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             K+ AC KH+A +    W   +R  FD+  +  +D+ ET++  F+  V +  V  VMC+
Sbjct: 180 YDKLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCA 236

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
           YNR  G P C   +LL Q +R +W + G +VSDC +I       +H    D KE A A  
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASAGA 295

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           ++AG DL+CG  Y +    AV+ G I E +ID SL+ L      LG  D    +  +  +
Sbjct: 296 VRAGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTS 354

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + + +H  LA   AR+ +VLL+N N  LPLNT ++K +A++GP+AN +    GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-HLK-VAVMGPNANDSVMQWGNYNGIP 412

Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
               + ++   A      I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437



 Score =  108 bits (270), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 83/285 (29%), Positives = 127/285 (44%), Gaps = 52/285 (18%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+    +AD  +   G+  S+E E          G DR D+ LP  Q +L+  +  A K 
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGK- 649

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
              +V ++     I         ++IL   YPG+ GG AI D ++G+YNPGGRLP+T+Y+
Sbjct: 650 --KVVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
                     +P     +  GRTY++     ++PFG+GLSYT F Y  A        KL 
Sbjct: 708 ------DVNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTTFTYGEA--------KLS 753

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           K+   +  N                            I V N+G+ DG EVV VY + PG
Sbjct: 754 KNTIAKGEN------------------------VVLTIPVSNVGQRDGEEVVQVYLRRPG 789

Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL 731
                   +  ++RV I AG++  V  ++   +S +  D A N++
Sbjct: 790 DKEGPRYTLRAFKRVHIPAGKTESVAISLTH-ESFEWFDEATNTM 833


>gi|336404202|ref|ZP_08584900.1| hypothetical protein HMPREF0127_02213 [Bacteroides sp. 1_1_30]
 gi|335943530|gb|EGN05369.1| hypothetical protein HMPREF0127_02213 [Bacteroides sp. 1_1_30]
          Length = 735

 Score =  281 bits (720), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 222/760 (29%), Positives = 355/760 (46%), Gaps = 97/760 (12%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLY-- 56
           Y DAK P  +R  DL+ RMTL EK+ Q+     G              VP  +G  +Y  
Sbjct: 30  YKDAKAPIEKRIDDLISRMTLEEKILQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFD 89

Query: 57  --EWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
                  ++   +    R   P    +D+     T +P  +    S+N  L ++     +
Sbjct: 90  INPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSA 149

Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
            EAR    +     TF SP I+V RDPRWGRV E  GEDPY  G +A   VRG       
Sbjct: 150 QEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRG------- 197

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
            Y  D  S   +++AC KHY  Y      G D  +  + ++ Q + +T++LP+EM V  G
Sbjct: 198 -YQGDDMSAENRMAACLKHYVGYGASE-AGRDYVY--TEISAQTLWDTYLLPYEMGVKAG 253

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
             +++M S+N ++G+P  A+P ++ + ++  W   G+IVSD  +++ +   ++ L  TK+
Sbjct: 254 -AATLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQGLAATKK 310

Query: 295 DAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
           DA      AGL++D   + Y       V++GK+  A +D S+R +  V  RLG F+    
Sbjct: 311 DAAQYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFRLGLFERPYT 370

Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
                K+    PQ + +AA+ A + +VLLKNDN  LPL   N K +A+VGP A     ++
Sbjct: 371 PVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLT--NKKKIAVVGPMAKNGWDLL 428

Query: 414 GNY--EGTPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
           G++   G         DG  A       + YA GC      + S    A+D A+ +D  +
Sbjct: 429 GSWCGHGKDTDVEMLYDGLTAEFGGDAELRYAMGCKP-QGNDRSGFAGALDVARWSDVVI 487

Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
           +  G  L+   E   R  + LP  Q EL+ ++ +A K PV LV+ +   +++N  +  P 
Sbjct: 488 VCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PVILVLSNGRPLELN--RMEPL 544

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS--MPLRPVNNFP 586
             +IL +  PG  G R++A ++ G+ NP G+L +T+        PY++  +P+       
Sbjct: 545 CDAILEIWQPGINGARSMAGILSGRINPSGKLAMTF--------PYSTGQIPIYYNRRKS 596

Query: 587 GRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
           GR ++ F   +    +YPFG+GLSYT+FKY                          GT  
Sbjct: 597 GRGHQGFYKDITSDPLYPFGHGLSYTEFKY--------------------------GTVT 630

Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERV 701
           P    V   D      K + ++ V N G  DG+E V  +   P  + T  +K++  +E+ 
Sbjct: 631 PSATKVKRGD------KLSAEVTVTNTGARDGAETVHWFISDPYCSITRPVKELKHFEKQ 684

Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
            I AG++    F ++  +    V+      L +G + ILV
Sbjct: 685 LIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724


>gi|288927072|ref|ZP_06420962.1| beta-glucosidase [Prevotella buccae D17]
 gi|288336152|gb|EFC74543.1| beta-glucosidase [Prevotella buccae D17]
          Length = 866

 Score =  281 bits (720), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 165/452 (36%), Positives = 240/452 (53%), Gaps = 41/452 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +PY + +L   ERA+DL  R+TL EK + M + +  +PRLG+P +EWWSEALHG++  G 
Sbjct: 23  YPYQNPRLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWWSEALHGIARNG- 81

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
                           AT FP      AS+++ L   +    S EA A  NL        
Sbjct: 82  ---------------FATVFPQTTAMAASWDDELLYHVFCAASDEAVAKNNLARKSGDIK 126

Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD---- 179
              G++ W+PNIN+ RDPRWGR  ET GEDPY+  R  +  V GLQ   G  + RD    
Sbjct: 127 RYQGVSIWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQ---GQPFRRDMRPF 183

Query: 180 -SDSRPLKISACCKHYAAYDLDNWEGNDRFHFD-SRVTEQDMQETFILPFEMCVNEGDVS 237
               R  K  AC KHYA +    W   +R  FD  R+ E+D+ ET++  F+  V EG+V 
Sbjct: 184 TERPRYYKTLACAKHYAVHSGPEW---NRHVFDVERLPERDLWETYLPAFKSLVQEGNVR 240

Query: 238 SVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV-ESHKFLNDTKEDA 296
            VMC+Y R++G P C + + L+Q +RG+W ++G +VSDC +I     E H  + +T  +A
Sbjct: 241 EVMCAYQRIDGSPCCGNTRYLHQILRGEWEYNGLVVSDCGAISDFYREGHHHVVETPAEA 300

Query: 297 VARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QY 354
            A  ++AG D++CG  Y      AV+QG I+   IDTS+  L      +G FD      +
Sbjct: 301 SAMGVRAGTDVECGAVYATLPR-AVEQGLISREAIDTSVVRLLKARFEVGDFDSEKLVPW 359

Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
           K  G   I +  H  LA + AR+ + LL+N N  LPL+   ++ +A++GP+AN +  + G
Sbjct: 360 KLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKNGLR-IAVMGPNANDSVMLWG 418

Query: 415 NYEGTPCRYTSPMDGFYAYSKVINYAPGCADI 446
           NY G P   T+ + G  +      +  GC  I
Sbjct: 419 NYTGYPISTTTILKGIRSKVPAARFVEGCGYI 450



 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 102/328 (31%), Positives = 150/328 (45%), Gaps = 67/328 (20%)

Query: 432 AYSKVINYAPGCADIVCQ----NNSMIPAAIDAAKNADATVIV---------AGLDLSVE 478
           AY   + Y    A  VCQ      S I A+  AA+  DA V+V          G ++ V+
Sbjct: 573 AYRVRVEYVQNKAMAVCQFDIARKSPITASEIAAQAGDADVVVFVGGISPRLEGEEMKVD 632

Query: 479 A---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
           A   +G DR  + LP  Q E+I  +  A K  V  V  S GAV +          ++L  
Sbjct: 633 APGFKGGDRTSIELPEAQREVIRLLRQAGK-LVVFVNCSGGAVAL--VPETEACDAVLQA 689

Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
            Y GE GG+A+ADV+FG YNP G+LP+T+Y+++        +P        GRTY++F G
Sbjct: 690 WYAGEAGGQAVADVLFGDYNPSGKLPVTFYKSD------ADLPDFLDYRMTGRTYRYFRG 743

Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
             ++PFG+GLSYT F +     P+  + KL                              
Sbjct: 744 IPLFPFGFGLSYTSFAF---GKPRYENGKL------------------------------ 770

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
                   +EV N GK DG+EVV VY K P  A   +K + G+ R+ + AG+  +V   M
Sbjct: 771 -------YVEVTNTGKRDGAEVVQVYVKNPADADGPVKTLRGFARIDLKAGERRRVEIAM 823

Query: 716 NACKSLKIVDNAANSL-LASGAHTILVG 742
              +  +  D   N++ +  G H ++VG
Sbjct: 824 PR-ERFEGWDATTNTMRVKPGNHLLMVG 850


>gi|383113364|ref|ZP_09934136.1| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
 gi|382948729|gb|EFS32368.2| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
          Length = 850

 Score =  281 bits (720), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 161/421 (38%), Positives = 237/421 (56%), Gaps = 45/421 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y +   P  ER  DL+ R+T+ EK+  +   + G+PRLG+  Y   +EALHGV   GR  
Sbjct: 27  YKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-- 84

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
                          T FP  I   A++N  L K++   +S EARA +N  + G      
Sbjct: 85  --------------FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQ 130

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFWSP +N+ RDPRWGR  ET GEDP++ G     +V+GLQ           D R
Sbjct: 131 FSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQG---------DDPR 181

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LKI +  KH+AA    N E ++RF  + +++E+ ++E +   FEMCV EG  +S+M +Y
Sbjct: 182 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAY 237

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N +N +P   +  LL + +R DW F GY+VSDC     +V +HK++  TKE A    ++A
Sbjct: 238 NALNDVPCTLNAWLLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSIQA 296

Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           GLDL+CG D Y  + + A +Q  +++ADID++   +    M+LG FDG+ +  Y  +  +
Sbjct: 297 GLDLECGDDVYDEYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDGTERNPYTRISPS 356

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            I + +H ++A +AAR+ IVLLKN N  LPLN   +K++A+VG   NA K   G+Y G P
Sbjct: 357 VIGSKEHQQIALDAARECIVLLKNKNNMLPLNVNKVKSIAVVG--INAGKCEFGDYSGAP 414

Query: 421 C 421
            
Sbjct: 415 V 415



 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 94/303 (31%), Positives = 147/303 (48%), Gaps = 55/303 (18%)

Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-V 518
           A +  +  V V G++ S+E EG+DR D+ LP  Q E + ++      P  +V++ AG+ +
Sbjct: 597 AVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAGSSL 654

Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
            +N+   +  I +I+   YPGE+GG A+ADV+FG YNP GRLP+T+Y++         +P
Sbjct: 655 AVNWMDEH--IPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS------LDELP 706

Query: 579 -LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
                +   GRTYK+F G V+YPFGYGLSY+ FKY                         
Sbjct: 707 AFDDYDITQGRTYKYFKGDVLYPFGYGLSYSSFKYS------------------------ 742

Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQI--EVENMGKMDGSEVVMVYSKPPGIAG-THIKQ 694
                         D+K KD   T  +   ++N GK  G EV  VY + P   G   IK+
Sbjct: 743 --------------DLKVKDGANTVSVSFRLKNTGKRKGDEVAQVYVRIPETGGVVPIKE 788

Query: 695 VIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVGEGVGGVSFPLQ 753
           + G+ R+ + +G+S  V   ++  + L+  D      +   GA  I++G     +     
Sbjct: 789 LKGFRRIPLKSGESRVVEIELDK-EQLRYWDAGLGQFIVPQGAFDIMIGASSKDIRLQTV 847

Query: 754 LNL 756
           +NL
Sbjct: 848 INL 850


>gi|336415363|ref|ZP_08595703.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
           3_8_47FAA]
 gi|335940959|gb|EGN02821.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
           3_8_47FAA]
          Length = 861

 Score =  281 bits (720), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 171/445 (38%), Positives = 238/445 (53%), Gaps = 44/445 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY D  L   +RA+DL+ R+TL EKV  M + +  +PRLG+  YEWW+EALHGV   G 
Sbjct: 24  LPYQDTSLAAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
                           AT FP  I   ASFN+SL  ++    S EAR    + G +G   
Sbjct: 84  ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLK 127

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFW+PN+N+ RDPRWGR  ET GEDPY+ G+  +  VRGLQ  E        D+R
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DAR 179

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             K+ AC KH+A +    W   +R  FD+  +  +D+ ET++  F+  V +  V  VMC+
Sbjct: 180 YDKLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCA 236

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
           YNR  G P C   +LL Q +R +W + G +VSDC +I       +H    D KE A A  
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASAGA 295

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           ++AG DL+CG  Y +    AV+ G I E +ID SL+ L      LG  D    +  +  +
Sbjct: 296 VRAGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTS 354

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + + +H  LA   AR+ +VLL+N N  LPLNT ++K +A++GP+AN +    GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-HLK-VAVMGPNANDSVMQWGNYNGIP 412

Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
               + ++   A      I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437



 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 82/291 (28%), Positives = 129/291 (44%), Gaps = 53/291 (18%)

Query: 463 NADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVI 512
           +AD  +   G+  S+E E          G DR D+ LP  Q +L+  +    K    +V 
Sbjct: 597 DADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKVGK---KVVF 653

Query: 513 MSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI 572
           ++     I         ++IL   YPG+ GG AI D ++G+YNPGGRLP+T+Y+      
Sbjct: 654 INYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK------ 707

Query: 573 PYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCR 632
               +P     +  GRTY++     ++PFG+GLSYT F Y  A        KL K+   +
Sbjct: 708 DVNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEA--------KLSKNTIAK 759

Query: 633 DINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI 692
             N                            I V N+G+ DG EVV VY + PG      
Sbjct: 760 GEN------------------------VVLTIPVSNVGQRDGEEVVQVYLRRPGDKEGPR 795

Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
             +  ++RV I AG++  V  ++   ++ +  D  +N++    G + +L G
Sbjct: 796 YTLRAFKRVHIPAGKTESVAISLTG-ENFEWFDVESNTMRPLEGTYELLYG 845


>gi|225873995|ref|YP_002755454.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
 gi|225792796|gb|ACO32886.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
          Length = 896

 Score =  281 bits (720), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 168/432 (38%), Positives = 234/432 (54%), Gaps = 45/432 (10%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           P+ +   P  +R  +LV +MTL E+  QM + A  +PRLG+P Y WWSE LHG++  G  
Sbjct: 38  PWDNPNQPIQKRVHELVSQMTLQEEAAQMMNTAPAIPRLGVPAYNWWSEGLHGIARSGY- 96

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------ 126
                          AT FP  I  +A+F+ +   ++G TVSTEARA YN          
Sbjct: 97  ---------------ATVFPQAIGMSATFDPAAIHQMGTTVSTEARAKYNWAIRHDIHSI 141

Query: 127 --GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLT W+PNIN+VRDPRWGR  ET GEDP++ G  A  YV GLQ          ++ + 
Sbjct: 142 YFGLTLWAPNINIVRDPRWGRGQETYGEDPFLTGTMAAEYVSGLQ---------GNNPKY 192

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
           LK  A  KH++ Y   N   + R   ++  +  DMQ+T++  F M + +G   S+MCSYN
Sbjct: 193 LKTVATPKHFSVY---NGPESMRHKINANPSAHDMQDTYLAAFRMAITKGHADSMMCSYN 249

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARVLK 302
            V G+P+CA+ KLL   +RG W F GYI SDC +I       +H +  D    A + VL 
Sbjct: 250 AVYGVPSCAN-KLLADVVRGKWGFDGYITSDCGAISDFYRPGAHGYSPDAVHAAASAVL- 307

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           AG D DCG  Y      +VQQG I++A ID ++  L+    RLG FD      Y ++  +
Sbjct: 308 AGTDTDCGTGYKVLPQ-SVQQGLISKAAIDRAVERLFTARFRLGMFDPKADVPYNSIPYS 366

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + +  H   A E A + +VLLKN+ G LPL   N +T+A+VGP+A    ++ GNY   P
Sbjct: 367 VVDSAAHRAQALEDASKSMVLLKNEGGILPLR--NARTIAVVGPNAANLNSIEGNYNAIP 424

Query: 421 CRYTSPMDGFYA 432
              + P+DG  A
Sbjct: 425 SHPSLPVDGIEA 436



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 86/263 (32%), Positives = 135/263 (51%), Gaps = 42/263 (15%)

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
           +G DR  L LP  Q +L++ +    K PV LV+++  A+ I++AK +  ++ IL   YPG
Sbjct: 652 DGGDRTRLSLPQTQQDLLHALVATGK-PVVLVLLNGSALSIDWAKQH--VQGILEAWYPG 708

Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
           E GG AI + + G+ +PGG+LPIT+Y +     P+T   ++      GRTY+++ G  ++
Sbjct: 709 EAGGEAIGETLSGQNDPGGKLPITFYTSVKDLPPFTDYSMK------GRTYRYYTGKPLF 762

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           PFGYGLSYT F+Y                   R     +   +P                
Sbjct: 763 PFGYGLSYTTFEYS----------------HVRLSTSNLKAGEP---------------- 790

Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
            T + EV+N G + G  V  VY  PP      +K++ G++RV +A GQS ++ FT+N  +
Sbjct: 791 LTVEAEVKNTGHVAGDAVTEVYVTPPQNGVNPLKELKGFDRVHLAPGQSRQLTFTLNP-R 849

Query: 720 SLKIVDNAANSLLASGAHTILVG 742
            L +VD A    +  G ++I VG
Sbjct: 850 DLSLVDEAGKRSVQPGVYSIFVG 872


>gi|440733337|ref|ZP_20913088.1| beta-glucosidase [Xanthomonas translucens DAR61454]
 gi|440362904|gb|ELQ00083.1| beta-glucosidase [Xanthomonas translucens DAR61454]
          Length = 895

 Score =  281 bits (720), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 175/452 (38%), Positives = 240/452 (53%), Gaps = 53/452 (11%)

Query: 31  RMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATS 90
           +MT  EKV Q  + A  +PRLG+P YEWW+E LHG++  G                 AT 
Sbjct: 53  KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNGY----------------ATV 96

Query: 91  FPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSPNINVVRDP 141
           FP  I   A++N +L +++G   STEARA +NL           AGLT WSPNIN+ RDP
Sbjct: 97  FPQAIGLAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDP 156

Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
           RWGR +ET GEDPY+ G+ A+ ++ GLQ         D  + P  I A  KH A +    
Sbjct: 157 RWGRGMETYGEDPYLTGQLAVGFIHGLQG--------DDLTHPRTI-ATPKHLAVHSGPE 207

Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
                R  FD  V+  D++ T+   F   + +G   SVMC+YN ++G P CA   LLN  
Sbjct: 208 ---PGRHGFDVDVSPHDLEATYTPAFRAAIVDGRAGSVMCAYNALHGTPACAADWLLNGR 264

Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAV 321
           +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y +    A+
Sbjct: 265 LRGDWGFTGFVVSDCDAVDDMTQFHYFRADNAGSSAA-ALKAGHDLNCGYAYRDLGK-AI 322

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQHIELAAEAARQ 377
            +G   EA +D SL  L+    RLG     PQ    Y  LG  ++ +  H  LA +AA+Q
Sbjct: 323 ARGDADEALLDQSLVRLFAARYRLGEL--QPQRKDPYAQLGAKDVDSAAHRALALQAAQQ 380

Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVI 437
            IVLL+N N  LPL  G    LA++GP+A+A  A+  NY+GT     +P+ G        
Sbjct: 381 SIVLLQNRNATLPLRPG--LRLAVIGPNADALAALEANYQGTSAAPVTPLLGLRERFGAA 438

Query: 438 N--YAPGCADIVCQNNSMIPAAIDAAKNADAT 467
           N  YA G A +    + MIP   + A ++D T
Sbjct: 439 NVRYAQG-APLAAGVSGMIP---ETALHSDGT 466



 Score =  139 bits (351), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 93/285 (32%), Positives = 139/285 (48%), Gaps = 45/285 (15%)

Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           V G +L ++    +G DR DL LP  Q  L+ + A A+  P+ +V+MS  AV +N+AK +
Sbjct: 637 VEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKQH 695

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
                  W  YPG+ GG AIA V+ G  NPGGRLP+T+Y +      Y S  ++      
Sbjct: 696 ADAIVAAW--YPGQSGGTAIAQVLAGDVNPGGRLPVTFYRSTKDLPAYVSYDMK------ 747

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++F G  ++ FG GLSYT+F Y   ++P           Q        G N     
Sbjct: 748 GRTYRYFKGEPLFAFGSGLSYTRFTY---AAP-----------QLSATTLQAGAN----- 788

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
                           + +V N G   G EVV VY +PP  A + ++ ++G++RV +  G
Sbjct: 789 -------------LQVRTQVSNSGTRAGDEVVQVYLQPPQGAQSPLRTLVGFQRVTLQPG 835

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFP 751
           ++ +VGF +   + L  VD A    +  G + + VG G  G   P
Sbjct: 836 EAREVGFELTP-RQLSDVDRAGQRAVQPGDYRVFVGGGQPGTGAP 879


>gi|347736643|ref|ZP_08869226.1| xylosidase/arabinosidase [Azospirillum amazonense Y2]
 gi|346919803|gb|EGY01181.1| xylosidase/arabinosidase [Azospirillum amazonense Y2]
          Length = 775

 Score =  281 bits (720), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 232/729 (31%), Positives = 351/729 (48%), Gaps = 123/729 (16%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+P+  +  E LHG   IG                  TSFP  I   +S++  L +++
Sbjct: 121 RLGIPVL-FHEEGLHGYPAIG-----------------PTSFPQAIAQASSWDPDLIREV 162

Query: 110 GQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGL 168
              V+ E R        G++   SP ++V RDPRWGR+ ET GEDPY+ G   +  V+GL
Sbjct: 163 DSVVAREIRVR------GVSLVLSPVVDVARDPRWGRIEETFGEDPYLAGEMGVAAVQGL 216

Query: 169 QDVEGVEYHRDSDSRPL---KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFIL 225
           Q           DS PL   K+ A  KH   +       N      + V E+ ++E F  
Sbjct: 217 Q----------GDSLPLADGKVFATLKHLTGHGQPESGTN---VGPASVGERTLREMFFP 263

Query: 226 PFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES 285
           PFE  ++  +V +VM SYN ++G+P+  +  LL+  +RG+W + G I+SD  +I  +V  
Sbjct: 264 PFEQVIHRTNVRAVMASYNEIDGVPSHVNTWLLHDILRGEWGYKGSIISDYSAIDQLVSI 323

Query: 286 HKFLNDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLM 343
           H  + D    A+ R ++AG+D D   G+ Y +    +V+ GKI E  ID ++R +  +  
Sbjct: 324 HHVVPDLPSAAI-RAIQAGVDADLPDGESYASLA-DSVRAGKIKEEVIDRAVRRILELKF 381

Query: 344 RLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG 403
           + G F+      +  +    N +   +A +AA++ +VLLKND G LPL+   +KTLA++G
Sbjct: 382 QAGLFEHPYADADKAEALTANGEARAVALKAAQKSVVLLKND-GVLPLDMAKVKTLAVIG 440

Query: 404 PHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKV-INYAPGC---------ADIV---- 447
           P  NA KA +G Y G P +  S +DG  A   ++V + YA G           D V    
Sbjct: 441 P--NAAKAHLGGYSGEPKQTVSILDGIKAKVGARVKVTYAEGVRITKDDDWYGDTVELAD 498

Query: 448 -CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG------KDRVDLLLPGFQTELINKV 500
             +N  +I  A+  AK AD  V+V G +     EG       DR  L L G Q +L   +
Sbjct: 499 PAENARLIQQAVAVAKTADHIVLVIGDNEQTSREGWANNHLGDRDSLDLVGQQNDLAKAL 558

Query: 501 ADAAKGPVTLVIMSA---GAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG 557
               K PV +V+ +      VD+  A+ N  ++   W  Y G+EGG A+ADV+FG  NPG
Sbjct: 559 FALGK-PVVVVLQNGRPLSVVDVA-ARANALVEG--W--YLGQEGGTAMADVLFGDVNPG 612

Query: 558 GRLPITWYEANYVKIPYTSMPLRPVNNFPG--RTYKFFDGPVVYPFGYGLSYTQFKYKVA 615
           G+LP+T      V      +P+   N  P   R Y F     ++PFGYGLSYT F     
Sbjct: 613 GKLPVT------VARSVGQLPMF-YNKKPSARRGYLFDTTDPLFPFGYGLSYTTFD---V 662

Query: 616 SSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGS 675
            SP+                     + P  A         KD   T  ++V N GK  G 
Sbjct: 663 GSPR--------------------LSTPTIA---------KDGAITVAVDVRNTGKRAGD 693

Query: 676 EVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS 734
           EVV +Y      + T  +K++ G++R+ +A G+S  V FT++  K+L + +     ++  
Sbjct: 694 EVVQLYLHQQVASVTRPVKELKGFQRITLAPGESRTVTFTVDG-KALALWNQDMKRVVEP 752

Query: 735 GAHTILVGE 743
           GA  I+VG+
Sbjct: 753 GAFDIMVGD 761


>gi|320105647|ref|YP_004181237.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
 gi|319924168|gb|ADV81243.1| glycoside hydrolase family 3 domain protein [Terriglobus saanensis
           SP1PR4]
          Length = 885

 Score =  281 bits (719), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 179/445 (40%), Positives = 237/445 (53%), Gaps = 48/445 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D  L  P RA+DLV RMTL EK  QM + A  + RLG+P Y++WSE LHGV+  G   
Sbjct: 30  YLDPTLSPPARARDLVHRMTLEEKTAQMINTAPAIDRLGVPAYDFWSEGLHGVARSGY-- 87

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   A+++E L  +IG  VSTEARA YN           
Sbjct: 88  --------------ATLFPQAIGMAATWDEPLMHEIGTVVSTEARAKYNDAVQHGVHSIY 133

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT WSPNIN+ RDPRWGR  ET GEDP++  R    +VRG+Q           D    
Sbjct: 134 FGLTIWSPNINIFRDPRWGRGQETYGEDPFLTARMGTAFVRGIQG---------DDPNYF 184

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           +  A  KH+A +       + R  F+  V++ D+ +T++  F   + EG   S+MC+YNR
Sbjct: 185 RTIATPKHFAVHSGPE---STRHTFNVDVSQHDLWDTYLPAFRSTIIEGKADSIMCAYNR 241

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES--HKFLNDTKEDAVARVLKA 303
           ++G P CA   LL Q +RGDW F G++ SDC +I        H F  + KEDA A  +KA
Sbjct: 242 IDGQPACASDLLLKQILRGDWGFRGFVTSDCGAIDDFYTKIGHHFSKE-KEDASAAGVKA 300

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNN 361
           G D  CG  Y   T  AV+ G I E ++D SL  L+   +RLG FD   +  Y  L    
Sbjct: 301 GTDTACGKTYLGLT-SAVKSGLITEHEMDISLERLFEARIRLGLFDDPARMPYARLTMAE 359

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           + +P H  LA  AAR+ IVLLKN N  LPL+   +K +A++GP+A +  A+ GNY     
Sbjct: 360 VNSPAHRALALRAARESIVLLKNANNLLPLH--GVKNIAVIGPNAASLDALEGNYNAIAR 417

Query: 422 RYTSPMDGFYAY---SKVINYAPGC 443
               P+DG  A    +KV+ YA G 
Sbjct: 418 DPAMPVDGIAAAFPGAKVV-YAQGA 441



 Score =  116 bits (291), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 91/288 (31%), Positives = 132/288 (45%), Gaps = 58/288 (20%)

Query: 468 VIVAGLDLSVEAEGK------------DRVDLLLPGFQTELINKVADAAKGPVTLVIMSA 515
           V+VA + LS E EG+            DR D+ LP  Q EL+  V    K P+ +V+M+ 
Sbjct: 621 VVVAFVGLSPELEGEEMPIKVKGFAGGDRTDIELPQTQLELLRAVKATGK-PLIVVLMNG 679

Query: 516 GAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYT 575
            A+    A  + +  ++L   YPGE G +AIA+ + GK NP GRLP+T+Y          
Sbjct: 680 SAI----ALKDSETDALLEAWYPGEAGAQAIAETLAGKNNPSGRLPLTFYSN------ID 729

Query: 576 SMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKY-KVASSPKSVDIKLDKDQQCRDI 634
            +P     +   RTY++F G  +Y FG GLSYT F+Y KV+ S   +    D        
Sbjct: 730 QLPAFDDYSMANRTYRYFKGQPLYAFGGGLSYTTFRYGKVSLSATHLHAGED-------- 781

Query: 635 NYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQ 694
                                     T + EV N GK+ G EV  VY  PP  +      
Sbjct: 782 -------------------------LTVEAEVTNTGKVAGDEVAQVYLTPPQTSIAPRFA 816

Query: 695 VIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           ++GY+RV +  GQS  + FT++  + L  VD       ++G + I VG
Sbjct: 817 LVGYQRVHLLPGQSKPMRFTLHP-RELSQVDAQGVRAASAGHYEIKVG 863


>gi|424792251|ref|ZP_18218496.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
 gi|422797157|gb|EKU25539.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
          Length = 909

 Score =  281 bits (718), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 172/440 (39%), Positives = 235/440 (53%), Gaps = 50/440 (11%)

Query: 31  RMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATS 90
           +MT  EKV Q  + A  +PRLG+P YEWW+E LHG++  G                 AT 
Sbjct: 67  KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNGY----------------ATV 110

Query: 91  FPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSPNINVVRDP 141
           FP  I   A++N +L +++G   STEARA +NL           AGLT WSPNIN+ RDP
Sbjct: 111 FPQAIGLAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDP 170

Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
           RWGR +ET GEDPY+ G+ A+ ++ GLQ         D  + P  I A  KH A +   +
Sbjct: 171 RWGRGMETYGEDPYLTGQLAVGFIHGLQG--------DDLTHPRTI-ATPKHLAVH---S 218

Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
                R  FD  V+  D++ T+   F   + +G   SVMC+YN ++G P CA   LLN  
Sbjct: 219 GPEPGRHGFDVDVSPHDLEATYTPAFRAAIVDGRAGSVMCAYNALHGTPACAADWLLNGR 278

Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAV 321
           +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y +    A+
Sbjct: 279 LRGDWGFTGFVVSDCDAVDDMTQFHYFRADNAGSSAA-ALKAGHDLNCGYAYRDLGK-AI 336

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQHIELAAEAARQ 377
            +G   EA +D SL  L+    RLG     PQ    Y  LG  ++ +  H  LA +AA+Q
Sbjct: 337 ARGDADEALLDKSLVRLFAARYRLGEL--QPQRKDPYARLGAKDVDSAAHRALALQAAQQ 394

Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVI 437
            IVLL+N N  LPL  G    LA++GP+A+A  A+  NY+GT     +P+ G        
Sbjct: 395 SIVLLQNRNATLPLRPG--LRLAVIGPNADALAALEANYQGTSAAPVTPLLGLRERFGAA 452

Query: 438 N--YAPGCADIVCQNNSMIP 455
           N  YA G A +    + MIP
Sbjct: 453 NVRYAQG-APLAAGVSGMIP 471



 Score =  132 bits (331), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 91/285 (31%), Positives = 136/285 (47%), Gaps = 45/285 (15%)

Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           V G +L ++    +G DR DL LP  Q  L+ + A A+  P+ +V+MS  AV +N+AK +
Sbjct: 651 VEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKQH 709

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
                  W  YPG+ GG AIA V+ G  NPGGRLP+T+Y +      Y S  ++      
Sbjct: 710 ADAIVAAW--YPGQSGGTAIAQVLAGDVNPGGRLPVTFYRSTKDLPAYVSYDMK------ 761

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++F G  ++ FG GLSYT+F Y  A    +  ++     Q R              
Sbjct: 762 GRTYRYFKGEPLFAFGSGLSYTRFTY-AAPQLSATTLQAGAHLQVR-------------- 806

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
                             +V N G   G EVV VY + P  A + ++ ++G++RV +  G
Sbjct: 807 -----------------TQVRNSGTRAGDEVVQVYLEFPQRAQSPLRTLVGFQRVTLQPG 849

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFP 751
           ++  V F + A + L  VD A    +  G + + VG G  G   P
Sbjct: 850 EARDVSFEL-APRQLSDVDRAGQRAVQPGDYRVFVGGGQPGTGAP 893


>gi|262405981|ref|ZP_06082531.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|345510488|ref|ZP_08790055.1| beta-glucosidase [Bacteroides sp. D1]
 gi|262356856|gb|EEZ05946.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|345454434|gb|EEO48987.2| beta-glucosidase [Bacteroides sp. D1]
          Length = 735

 Score =  281 bits (718), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 223/760 (29%), Positives = 354/760 (46%), Gaps = 97/760 (12%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLY-- 56
           Y DAK P  +R  DL+ RMTL EKV Q+     G              VP  +G  +Y  
Sbjct: 30  YKDAKAPIEKRIDDLISRMTLEEKVLQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFD 89

Query: 57  --EWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
                  ++   +    R   P    +D+     T +P  +    S+N  L ++     +
Sbjct: 90  INPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSA 149

Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
            EAR    +     TF SP I+V RDPRWGRV E  GEDPY  G +A   VRG       
Sbjct: 150 QEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRG------- 197

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
            Y  D  S   +++AC KHY  Y      G D  +  + ++ Q + +T++LP+EM V  G
Sbjct: 198 -YQGDDMSAENRMAACLKHYVGYGASE-AGRDYVY--TEISAQTLWDTYLLPYEMGVKAG 253

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
              ++M S+N ++G+P  A+P ++ + ++  W   G+IVSD  +++ +   ++ L  TK+
Sbjct: 254 -APTLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQGLAATKK 310

Query: 295 DAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
           DA      AGL++D   + Y       V++GK+  A +D S+R +  V  RLG F+    
Sbjct: 311 DAARYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFRLGLFERPYT 370

Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
                K+    PQ + +AA+ A + +VLLKNDN  LPL   N K +A+VGP A     ++
Sbjct: 371 PVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLT--NKKKIAVVGPMAKNGWDLL 428

Query: 414 GNY--EGTPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
           G++   G         DG  A       + YA GC      + S    A+D A+ +D  +
Sbjct: 429 GSWCGHGKDTDVEMLYDGLTAEFGGDAELRYAMGCKP-QGNDRSGFAGALDVARWSDVVI 487

Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
           +  G  L+   E   R  + LP  Q EL+ ++ +A K PV LV+ +   +++N  +  P 
Sbjct: 488 VCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PVILVLSNGRPLELN--RMEPL 544

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS--MPLRPVNNFP 586
             +IL +  PG  G R++A ++ G+ NP G+L +T+        PY++  +P+       
Sbjct: 545 CDAILEIWQPGINGARSMAGILSGRINPSGKLAMTF--------PYSTGQIPIYYNRRKS 596

Query: 587 GRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
           GR ++ F   +    +YPFG+GLSYT+FKY                          GT  
Sbjct: 597 GRGHQGFYKDITSDPLYPFGHGLSYTEFKY--------------------------GTVT 630

Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERV 701
           P    V   D      K + ++ V N G  DG+E V  +   P  + T  +K++  +E+ 
Sbjct: 631 PSATKVKRGD------KLSAEVTVTNTGSRDGAETVHWFISDPYCSITRPVKELKHFEKQ 684

Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
            I AG++    F ++  +    V+      L +G + ILV
Sbjct: 685 LIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724


>gi|222099590|ref|YP_002534158.1| Beta-mannanase [Thermotoga neapolitana DSM 4359]
 gi|2429092|gb|AAB70867.1| beta-xylosidase [Thermotoga neapolitana]
 gi|221571980|gb|ACM22792.1| Beta-mannanase [Thermotoga neapolitana DSM 4359]
          Length = 778

 Score =  281 bits (718), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 238/815 (29%), Positives = 371/815 (45%), Gaps = 160/815 (19%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGD---------------------LAYGVPRLG 52
           Y D   P   R KDL+ RMTL EK+ Q+G                      L  G+ ++ 
Sbjct: 4   YRDPSQPVEVRVKDLLSRMTLEEKIAQLGSVWGYELIDERGKFKREKAKDLLKNGIGQIT 63

Query: 53  LP---LYEWWSEALHGVSFIGR------RTNSPPGTHFDSEVP----GATSFPTVILTTA 99
            P         EA   V+ I R      R   P   H +        G T+FP  I   +
Sbjct: 64  RPGGSTNLEPQEAAELVNEIQRFLVEETRLGIPAMIHEECLTGYMGLGGTNFPQAIAMAS 123

Query: 100 SFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR 159
           +++  L +K+   +  + R +    + GL   +P ++V RDPRWGR  ET GE PY+V R
Sbjct: 124 TWDPDLIEKMTAAIREDMRKLG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVAR 178

Query: 160 YAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD----NWEGNDRFHFDSRVT 215
             ++YV+GLQ           ++    + A  KH+A Y       NW   +       + 
Sbjct: 179 MGVSYVKGLQ----------GENIKEGVVATVKHFAGYSASEGGKNWAPTN-------IP 221

Query: 216 EQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSD 275
           E++ +E F+ PFE  V E  V SVM SY+ ++G+P  A+ +LL   +R DW F G +VSD
Sbjct: 222 EREFREVFLFPFEAAVKEARVLSVMNSYSEIDGVPCAANRRLLTDILRKDWGFEGIVVSD 281

Query: 276 CDSIQTIVESHKFLNDTKEDAVARVLKAGLDL-----DCGDYYTNFTMGAVQQGKIAEAD 330
             ++  + E H+   D  E A    L+AG+D+     DC  +  +     V++G + E+ 
Sbjct: 282 YFAVNMLGEYHRIAKDKSESA-RLALEAGIDVELPKTDCYQHLKDL----VEKGIVPESL 336

Query: 331 IDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALP 390
           ID ++  +  +   LG F+    Y ++ K  I    H +LA E AR+ I+LLKND G LP
Sbjct: 337 IDEAVSRVLKLKFMLGLFENP--YVDVEKAKI--ESHRDLALEIARKSIILLKND-GTLP 391

Query: 391 LNTGNIKTLALVGPHANATKAMIGNYE----------------GTPC------------- 421
           L     K +AL+GP+A   + ++G+Y                 G P              
Sbjct: 392 LQKN--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSI 449

Query: 422 -----RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG---- 472
                   S +D F        YA GC ++  ++ S    AI+ AK +D  ++V G    
Sbjct: 450 EEHMKSIPSVLDAFKEEGIDFEYAKGC-EVTGEDRSGFKEAIEVAKRSDVAIVVVGDRSG 508

Query: 473 --LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK 530
             LD +   E +D  +L LPG Q EL+ ++A   K PV LV+++     +    +  ++ 
Sbjct: 509 LTLDCTT-GESRDMANLKLPGVQEELVLEIAKTGK-PVVLVLITGRPYSLKNLVD--RVN 564

Query: 531 SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRT 589
           +IL V  PGE GGRAI DVI+GK NP G+LPI++   A  + + +   P    +++ G  
Sbjct: 565 AILQVWLPGEAGGRAIVDVIYGKVNPSGKLPISFPRSAGQIPVFHYVKPSGGRSHWHGDY 624

Query: 590 YKFFDGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
                 P ++PFG+GLSYT+F+Y  +   PK V                     P    V
Sbjct: 625 VDESTKP-LFPFGHGLSYTRFEYSNLRIEPKEV---------------------PSAGEV 662

Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQ 707
           +I            +++VEN+G MDG EVV +Y      + T  +K++ G++RV + A +
Sbjct: 663 VI------------KVDVENVGDMDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKE 710

Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
              V F ++    L   D     ++  G   ++VG
Sbjct: 711 KKTVVFRLH-TDVLAYYDRDMKLVVEPGEFRVMVG 744


>gi|334365132|ref|ZP_08514098.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
           sp. HGB5]
 gi|313158675|gb|EFR58064.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
           sp. HGB5]
          Length = 771

 Score =  281 bits (718), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 216/727 (29%), Positives = 334/727 (45%), Gaps = 124/727 (17%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+PL+    EA HG   IG                 AT+FPT     +++N  L +++
Sbjct: 120 RLGIPLF-LAEEAPHGHMAIG-----------------ATTFPTAPGQASTWNPELIERM 161

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           G+ ++ E R        G   + P +++VRDPRW R  E+ GED Y+  R    YVRG  
Sbjct: 162 GKVIAAEIRL-----QGGHICYGPVLDIVRDPRWSRTEESYGEDCYLTARIGEAYVRGTG 216

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
             +       S SR     +  KH+ AY       N   +    + E++++ET++ PFE 
Sbjct: 217 SGD------LSQSR--HALSTLKHFIAYGASEGGQNGGSNL---LGERELRETYLPPFEA 265

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            V  G   SVM +YN V+GIP  A+ ++L   +RG+W F G++VSD  SI+ + E+H   
Sbjct: 266 AVKAG-ARSVMTAYNSVDGIPCTANRRMLTDILRGEWGFDGFVVSDLLSIEGLHETHGVA 324

Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
              +E AV + L+AG+D D           A + G +AEA+ID ++  +  +   +G F+
Sbjct: 325 GSVREAAV-QALRAGVDADLKGGAFASLREAAEAGDVAEAEIDRAVERVLALKFEMGLFE 383

Query: 350 GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT 409
            +P         +    H ELA EAARQ + LL+N +G LPL+   ++ +A++GP+A+  
Sbjct: 384 -NPYIDEAAAAEVGCAAHSELALEAARQSVTLLENRSGTLPLDPRRLRRVAVIGPNADNI 442

Query: 410 KAMIGNYEGTPCRYTSPMDG---FYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADA 466
              +G+Y        +  DG        +V+ Y+ GC  +   + S I AA+ AA+  DA
Sbjct: 443 YNQLGDYTAQQTAANTVRDGLEKLLGRDRVV-YSRGCT-VRGGDRSEIAAAVSAARGTDA 500

Query: 467 TVIVAG----LDLSVE-------------------AEGKDRVDLLLPGFQTELINKVADA 503
            V+V G     D   E                    EG DR  L L G Q EL+ ++  A
Sbjct: 501 AVVVIGGSSARDFDTEFLQTGAAKAAHDEVRDMECGEGFDRATLALLGEQEELLRRI-KA 559

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
              P+ +V ++   +D+  A        + W  YPG  GG A+A+ I G+ NP GRLPIT
Sbjct: 560 TGTPLIVVCIAGRPLDLRRASEQADALLMAW--YPGARGGDAVAETILGRNNPAGRLPIT 617

Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
              A   +IP      RP N+     Y       +YPFGYGLSY+ F+Y    + +S D 
Sbjct: 618 IPRAEG-QIPVYYNKKRPANH----DYTDLTAAPLYPFGYGLSYSTFEYGSLEARQSGDN 672

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-- 681
            L                          +V C+         + N    +G EVV +Y  
Sbjct: 673 VL--------------------------EVSCR---------IRNTSDREGDEVVQLYIS 697

Query: 682 ------SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASG 735
                  +PP       +Q+ G+ R+ +A G+  +V FT+   ++L ++D     ++  G
Sbjct: 698 DMVASTVRPP-------RQLGGFRRIRLAPGEQRQVSFTLGD-EALALIDPQGRRVVEKG 749

Query: 736 AHTILVG 742
              I VG
Sbjct: 750 DFVIAVG 756


>gi|148269983|ref|YP_001244443.1| glycoside hydrolase family 3 protein [Thermotoga petrophila RKU-1]
 gi|147735527|gb|ABQ46867.1| glycoside hydrolase, family 3 domain protein [Thermotoga petrophila
           RKU-1]
          Length = 778

 Score =  280 bits (717), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 244/812 (30%), Positives = 375/812 (46%), Gaps = 154/812 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGD---------------------LAYG---VP 49
           Y D   P   R +DL+ RMTL EK  Q+G                      L  G   V 
Sbjct: 4   YRDPSQPIEVRVRDLLSRMTLEEKAAQLGSVWGYELIDERGKFSREKAKELLKNGIGQVT 63

Query: 50  RLGLPLYEWWSEALHGVSFIGR------RTNSPPGTHFDSEVP----GATSFPTVILTTA 99
           R G        EA   V+ I R      R   P   H +        G T+FP  I   +
Sbjct: 64  RPGGSTNLEPQEAAELVNEIQRFLVEETRLGIPAMIHEECLTGYMGLGGTNFPQAIAMAS 123

Query: 100 SFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR 159
           +++  L +K+   +  + R +    + GL   +P ++V RDPRWGR  ET GE PY+V R
Sbjct: 124 TWDPDLIEKMTTAIREDMRKIG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVAR 178

Query: 160 YAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDM 219
             ++YV+GLQ   G +  +        + A  KH+A Y     EG   +   + + E++ 
Sbjct: 179 MGVSYVKGLQ---GEDIKKG-------VVATVKHFAGYSAS--EGGKNWA-PTNIPEREF 225

Query: 220 QETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSI 279
           +E F+ PFE  V E +V SVM SY+ ++G+P  A+ KLL   +R DW F G +VSD  ++
Sbjct: 226 KEVFLFPFEAAVKEANVLSVMNSYSEIDGVPCAANRKLLTDILRKDWGFKGIVVSDYFAV 285

Query: 280 QTIVESHKFLNDTKEDAVARVLKAGLDLD-----CGDYYTNFTMGAVQQGKIAEADIDTS 334
           + + + H+   D K +A    L+AG+D++     C  Y  +     V++G I+EA ID +
Sbjct: 286 KVLEDYHRIARD-KSEAARLALEAGIDVELPKTECYQYLKDL----VEKGIISEALIDEA 340

Query: 335 L-RFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNT 393
           + R L +  M LG F+    Y  + K  I    H ++A + AR+ I+LLKND G LPL  
Sbjct: 341 VARVLRLKFM-LGLFENP--YVEVEKAKI--ESHKDIALDIARKSIILLKND-GILPLQK 394

Query: 394 GNIKTLALVGPHANATKAMIGNYE----------------GTPC---------------- 421
              K +AL+GP+A   + ++G+Y                 G P                 
Sbjct: 395 N--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSIEEH 452

Query: 422 --RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG------L 473
                S +D F        YA GC ++  ++ S    AI+ AK +D  ++V G      L
Sbjct: 453 MKSIPSVLDAFKEEGIEFEYAKGC-EVTGEDRSGFEEAIEIAKKSDVAIVVVGDKSGLTL 511

Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
           D +   E +D  +L LPG Q EL+ +VA   K PV LV+++     +    +  K+ +IL
Sbjct: 512 DCTT-GESRDMANLKLPGVQEELVLEVAKTGK-PVVLVLITGRPYSLKNVVD--KVNAIL 567

Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKF 592
            V  PGE GGRAI D+I+GK NP G+LPI++   A  + + +   P    +++ G     
Sbjct: 568 QVWLPGEAGGRAIVDIIYGKVNPSGKLPISFPRSAGQIPVFHYVKPSGGRSHWHGDYVDE 627

Query: 593 FDGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID 651
              P ++PFG+GLSYT+F+Y  +   PK V                     PP   V+I 
Sbjct: 628 STKP-LFPFGHGLSYTKFEYSNLRIEPKEV---------------------PPAGEVVI- 664

Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAK 710
                      +++VEN G  DG EVV +Y      + T  +K++ G++RV + A +   
Sbjct: 665 -----------KVDVENTGDRDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKEKKT 713

Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           V F ++    L   D     ++  G   ++VG
Sbjct: 714 VVFRLH-MDVLAYYDRDMKLVVEPGEFKVMVG 744


>gi|312794525|ref|YP_004027448.1| glycoside hydrolase family 3 domain-containing protein
           [Caldicellulosiruptor kristjanssonii 177R1B]
 gi|312181665|gb|ADQ41835.1| glycoside hydrolase family 3 domain protein [Caldicellulosiruptor
           kristjanssonii 177R1B]
          Length = 770

 Score =  280 bits (717), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 212/710 (29%), Positives = 350/710 (49%), Gaps = 113/710 (15%)

Query: 87  GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
           GAT FP  I    +F+  + +++ + + T+ +A+           +P I+V RD RWGRV
Sbjct: 102 GATVFPQSIGVACTFDNEIVEELAKVIRTQMKAV-----GAHQALAPLIDVARDARWGRV 156

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD----NW 202
            ET GEDPY+V   A++YV+GLQ           D     I A  KH+  Y +     NW
Sbjct: 157 EETFGEDPYLVANMAVSYVKGLQ----------GDDIKDGIVATGKHFVGYAMSEGGMNW 206

Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
                      + E++++E ++ PFE+ V    + S+M +Y+ ++GIP  A+ KLL    
Sbjct: 207 A-------PVHIPERELREVYLYPFEVAVKVAGLKSIMPAYHEIDGIPCHANRKLLTDIA 259

Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG--DYYTNFTMGA 320
           RG+W F G  VSD   ++ +++ HK +  T E+A A  L AGLD++    + +T   + A
Sbjct: 260 RGEWGFDGIYVSDYSGVKNLLDYHKSVK-TYEEAAALSLWAGLDIELPKIECFTEEFIKA 318

Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC-NPQHIELAAEAARQGI 379
           +++GK   A +D +++ +  +  RLG FD +P  K  G   +  N +  +L+ + A++ +
Sbjct: 319 LKEGKFDMALVDAAVKRVLEMKFRLGLFD-NPYIKTEGVVELFDNKEQRQLSRKVAQESM 377

Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA------- 432
           VLLKND+  LPL+  ++K +A++GP+AN+ + ++G+Y   P  + + ++ F+        
Sbjct: 378 VLLKNDS-FLPLSK-DLKKIAVIGPNANSVRNLLGDY-SYPA-HIATLEMFFIKEDRGVG 433

Query: 433 -----YSKVIN-------------------YAPGCADIVCQNNSMIPAAIDAAKNADATV 468
                   VIN                   YA GC D+  Q+ S    A  AA+ ADA +
Sbjct: 434 NEEEFVKNVINMKSIFEAIKDKVSSNTEVVYAKGC-DVNSQDKSGFEEAKKAAEGADAVI 492

Query: 469 IV----AGLDLS-VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-VDINF 522
           +V    AGL L     E +DR  L LPG Q +L+ ++      P T+V++  G  V +++
Sbjct: 493 LVVGDKAGLRLDCTSGESRDRASLRLPGVQEDLVKEIVSV--NPNTVVVLVNGRPVALDW 550

Query: 523 AKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRP 581
              N  +K++L   +PGEEG  A+AD++FG YNPGG+L I++  +   V + Y   P   
Sbjct: 551 IMEN--VKAVLEAWFPGEEGADAVADILFGDYNPGGKLAISFPRDVGQVPVYYGHKPSGG 608

Query: 582 VNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTN 641
            + + G   +    P++ PFGYGLSYT F+YK                     N+ +   
Sbjct: 609 KSCWHGDYVEMSTKPLL-PFGYGLSYTTFEYK---------------------NFAIEKE 646

Query: 642 KPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYER 700
           K              D      +EVEN GK +G E+V +Y++      T  +K++ GY+R
Sbjct: 647 KIGM-----------DESIKVSVEVENTGKYEGDEIVQLYTRKEEYLVTRPVKELKGYKR 695

Query: 701 VFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
           V +  G+  KV F +         D   N ++  G   +++G     + F
Sbjct: 696 VHLKPGEKKKVVFELYP-DLFAFYDYDMNRVVTPGVVEVMIGASSEDIKF 744


>gi|390945417|ref|YP_006409177.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
           17242]
 gi|390421986|gb|AFL76492.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
           17242]
          Length = 771

 Score =  280 bits (717), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 225/760 (29%), Positives = 345/760 (45%), Gaps = 131/760 (17%)

Query: 22  PERAKDLVERMTLPEKV-----QQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSP 76
           P   KDL  R  LP ++      +M   A    RLG+PL+    EA HG   IG      
Sbjct: 89  PWTQKDL--RTGLPPQLAARLANRMQRYAVQHSRLGIPLF-LAEEAPHGHMAIG------ 139

Query: 77  PGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNIN 136
                      AT+FPT     +++N  L +++G+ ++ E R        G   + P ++
Sbjct: 140 -----------ATTFPTAPGQASTWNPELIERMGKVIAAEIRL-----QGGHICYGPVLD 183

Query: 137 VVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAA 196
           +VRDPRW R  E+ GED Y+  R    YVRG    +       S SR     +  KH+ A
Sbjct: 184 IVRDPRWSRTEESYGEDCYLTARIGEAYVRGTGSGD------LSQSR--HALSTLKHFIA 235

Query: 197 YDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPK 256
           Y       N   +    + E++++ET++ PFE  V  G   SVM +YN V+GIP  A+ +
Sbjct: 236 YGASEGGQNGGSNL---LGERELRETYLPPFEAAVKAG-ARSVMTAYNSVDGIPCTANRR 291

Query: 257 LLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNF 316
           +L   +RG+W F G++VSD  SI+ + E+H      +E AV + L+AG+D D        
Sbjct: 292 MLTDILRGEWGFDGFVVSDLLSIEGLHETHGVAGSVREAAV-QALRAGVDADLKGGAFAS 350

Query: 317 TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAAR 376
              A + G +AEA+ID ++  +  +   +G F+ +P         +    H ELA EAAR
Sbjct: 351 LREAAEAGDVAEAEIDRAVERVLALKFEMGLFE-NPYIDEAAAAEVGCAAHSELALEAAR 409

Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG---FYAY 433
           Q + LL+N +G LPL+   ++ +A++GP+A+     +G+Y        +  DG       
Sbjct: 410 QSVTLLENRSGTLPLDPRRLRRVAVIGPNADNIYNQLGDYTAQQTAANTVRDGLEKLLGR 469

Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE----------- 478
            +V+ Y+ GC  +   + S I AA+ AA+  DA V+V G     D   E           
Sbjct: 470 DRVV-YSRGCT-VRGGDRSEIAAAVSAARGTDAAVVVIGGSSARDFDTEFLQTGAAKAAH 527

Query: 479 --------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK 530
                    EG DR  L L G Q EL+ ++  A   P+ +V ++   +D+  A       
Sbjct: 528 DEVRDMECGEGFDRATLALLGEQEELLRRI-KATGTPLIVVCIAGRPLDLRRASEQADAL 586

Query: 531 SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTY 590
            + W  YPG  GG A+A+ I G  NP GRLPIT   A   +IP      RP N+     Y
Sbjct: 587 LMAW--YPGARGGDAVAETILGHNNPAGRLPITIPRAEG-QIPVYYNKKRPANH----DY 639

Query: 591 KFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
                  +YPFGYGLSY+ F+Y    + +S D  L                         
Sbjct: 640 TDLTAAPLYPFGYGLSYSTFEYGSLEARQSGDNVL------------------------- 674

Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVY--------SKPPGIAGTHIKQVIGYERVF 702
            +V C+         + N    +G EVV +Y         +PP       +Q+ G+ R+ 
Sbjct: 675 -EVSCR---------IRNTSDREGDEVVQLYISDMVASTVRPP-------RQLGGFRRIR 717

Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +A G+  +V FT+   ++L ++D     ++  G   I VG
Sbjct: 718 LAPGEQRQVSFTLGD-EALSLIDPQGRRVVEKGDFVIAVG 756


>gi|427383551|ref|ZP_18880271.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
           12058]
 gi|425728735|gb|EKU91590.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
           12058]
          Length = 939

 Score =  280 bits (717), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 233/808 (28%), Positives = 369/808 (45%), Gaps = 140/808 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW------------ 58
           Y D       R +DL+ +MTL EK  QM  L YG  R+    LP  EW            
Sbjct: 50  YEDPNATLDARIEDLLSQMTLDEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKDGIGAI 108

Query: 59  --------------------WSEALHGVS-------FIGRRTNSPPGTHFDSEVPG---- 87
                               W  + H  +       FI       P    +  + G    
Sbjct: 109 DEHLNGFQQWGLPPSDNPNVWPASRHAWALNEVQRFFIEETRLGIPVDFTNEGIRGVESY 168

Query: 88  -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
            AT+FPT +    ++N  L  ++G     EAR +      G T  ++P ++V RD RWGR
Sbjct: 169 RATNFPTQLGLGHTWNRKLIHQVGLITGREARML------GYTNVYAPILDVGRDQRWGR 222

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
             E  GE PY+V    I  VRG+Q       H        +++A  KH+ AY  +     
Sbjct: 223 YEEVYGESPYLVAELGIEMVRGMQ-------HNH------QVAATGKHFVAYSNNKGARE 269

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
                D +++ ++++   + PF+  + E  +  VM SYN  +GIP       L + +RG+
Sbjct: 270 GMARVDPQMSPREVEMIHVYPFKRVIKEAGMLGVMSSYNDYDGIPIQGSYYWLTKRLRGE 329

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
             F GY+VSD D+++ +   H    D KE AV + ++AGL++ C     D Y       V
Sbjct: 330 MGFRGYVVSDSDAVEYLYTKHSTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 388

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN-NICNPQHIELAAEAARQGIV 380
           ++G ++E  I+  +R +  V   +G FD   Q    G +  +   ++  +A +A+R+ ++
Sbjct: 389 KEGGLSEDIINDRVRDILRVKFLIGLFDAPYQTDLAGADKEVEKAENEAVALQASRESLI 448

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK---VI 437
           LLKN+N  LPL+  NIKT+A+ GP+AN     + +Y        + ++G    ++    +
Sbjct: 449 LLKNENNVLPLDINNIKTIAVCGPNANEEGYALTHYGPLAVEVITVLEGIRQKAEGKAEV 508

Query: 438 NYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
            YA GC D+V  N                + I  A++ A+ AD  V+V G       E K
Sbjct: 509 LYAKGC-DLVDANWPESELIEYPMTNEEQAEINKAVENARKADVAVVVLGGGQRTCGENK 567

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
            R  L LPG Q +L+  V    K PV LV+++   + IN+A  +  + +IL   YPG +G
Sbjct: 568 SRSSLDLPGRQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPAILETWYPGSKG 624

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV----- 597
           G A+ADV+FG YNPGG+L +T +  +  +IP+ + P +P +   G      DG +     
Sbjct: 625 GTAVADVLFGDYNPGGKLTVT-FPKSVGQIPF-NFPCKPSSQIDGGKNPGPDGNMSRVNG 682

Query: 598 -VYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
            +YPFGYGLSYT F+Y  +  SPK                                 +  
Sbjct: 683 SLYPFGYGLSYTTFEYSNIEISPK---------------------------------MMT 709

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
            + K T + +V N GK  G EVV +Y +       T+ K + G+ERV +  G++ +V F 
Sbjct: 710 ANQKATVRCKVTNTGKRAGDEVVQLYIRDMLSSVTTYEKNLAGFERVHLQPGETKEVTFI 769

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
           ++  K L+++D     ++  G  +I+VG
Sbjct: 770 LDR-KHLELLDKHMEWVVEPGDFSIMVG 796


>gi|160885419|ref|ZP_02066422.1| hypothetical protein BACOVA_03419 [Bacteroides ovatus ATCC 8483]
 gi|156109041|gb|EDO10786.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
          Length = 861

 Score =  280 bits (717), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 170/445 (38%), Positives = 238/445 (53%), Gaps = 44/445 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY D  L   +R +DL+ R+TL EKV  M + +  +PRLG+  YEWW+EALHGV   G 
Sbjct: 24  LPYQDTSLAAEQRTEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
                           AT FP  I   ASFN+SL  ++    S EAR    + G++G   
Sbjct: 84  ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGDSGVLK 127

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFW+PN+N+ RDPRWGR  ET GEDPY+ G+  +  VRGLQ  E        D+R
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DAR 179

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             K+ AC KH+A +    W   +R  FD+  +  +D+ ET++  F+  V +  V  VMC+
Sbjct: 180 YDKLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCA 236

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
           YNR  G P C   +LL Q +R +W + G +VSDC +I       +H    D KE A A  
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYEGIVVSDCGAISDFYRPGTHGTHPD-KEHASAGA 295

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           ++AG DL+CG  Y +    AV+ G I E +ID SL+ L      LG  D    +  +  +
Sbjct: 296 VRAGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTS 354

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + + +H  LA   AR+ +VLL+N N  LPLNT ++K +A++GP+AN +    GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-HLK-VAVMGPNANDSVMQWGNYNGIP 412

Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
               + ++   A      I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437



 Score =  103 bits (256), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 83/291 (28%), Positives = 129/291 (44%), Gaps = 53/291 (18%)

Query: 463 NADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVI 512
           +AD  +   G+  S+E E          G DR D+ LP  Q +L+  +  A K    +V 
Sbjct: 597 DADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGK---KVVF 653

Query: 513 MSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI 572
           ++     I         ++IL   YPG+ GG AI D ++G+YNPGGRLP+T+Y+      
Sbjct: 654 INYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK------ 707

Query: 573 PYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCR 632
               +P     +  GRTY++     ++PFG+GLSYT F Y  A        KL K+   +
Sbjct: 708 DVNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEA--------KLSKNTIAK 759

Query: 633 DINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI 692
             N                            I V N+G+ DG EVV VY + PG      
Sbjct: 760 GEN------------------------VVLTIPVSNVGQRDGEEVVQVYLRRPGDKEGPR 795

Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
             +  ++RV I AG++  V   +   ++ +  D  +N++    G + +L G
Sbjct: 796 YTLRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMCPLEGTYELLYG 845


>gi|182415162|ref|YP_001820228.1| glycoside hydrolase family 3 [Opitutus terrae PB90-1]
 gi|177842376|gb|ACB76628.1| glycoside hydrolase family 3 domain protein [Opitutus terrae
           PB90-1]
          Length = 747

 Score =  280 bits (716), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 223/744 (29%), Positives = 340/744 (45%), Gaps = 100/744 (13%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGV--- 66
           +  P+ D +LP  +R  DL+ RMTL EK+  M  +   VPRLG+       E  HGV   
Sbjct: 30  TGLPFQDPELPAEQRIDDLIGRMTLEEKIDCMA-MRAAVPRLGVKGSRH-IEGYHGVAQG 87

Query: 67  --SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN-- 122
             S  GRR  +             T FP      A+++  L +++    + EAR ++   
Sbjct: 88  GPSNWGRRNPT-----------ATTQFPQAYGLGATWDPELIRQVAAQEAEEARYLFQSP 136

Query: 123 -LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
               AGL   +PN ++ RDPRWGR  E  GEDP+  G  A  +VRGLQ           D
Sbjct: 137 RYDRAGLIVRAPNADLARDPRWGRTEEVYGEDPFHAGTLATAFVRGLQ---------GDD 187

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
            R  K  +  KH+    L N   + R    S  +E+  +E +  PFEM + +G   ++M 
Sbjct: 188 PRYFKAVSLVKHF----LANSNEDGRESSSSNFSERQWREYYAKPFEMAIVDGGAPALMA 243

Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
           +YN VNG P    P +L   +  +W  +G + +D   ++ +VE H    D    A A  +
Sbjct: 244 AYNAVNGTPAHVHP-MLRDIVMAEWKLNGILCTDGGGLRLLVEKHHAFPDLP-SAAAACV 301

Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGK 359
           KAG++    D + +    AV +G I E D+D +LR L+ V ++LG  D   +  Y  +G+
Sbjct: 302 KAGIN-HFLDRHKDAVTEAVARGSITERDLDAALRGLFRVSLKLGLLDPDERVPYAAIGR 360

Query: 360 NN----ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGN 415
           N        P    L  +  ++ IVLLKN    LPL+   +KT+ALVGP  N    +   
Sbjct: 361 NGEAEPWLRPDTQALVRKVTQRSIVLLKNSGALLPLDRTKVKTVALVGPLVNTV--LPDW 418

Query: 416 YEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD- 474
           Y GTP     P  G       +    G    V     M  AA++ A+ ++  ++  G D 
Sbjct: 419 YGGTPPYTVPPSIG-------VEKVAGEGVKVGWLADMGDAAVELARTSEIAIVCVGNDP 471

Query: 475 --------LSVEAEGK---DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFA 523
                   +   +EGK   DR DL LP  Q + I +V   A  P T+V++ +     NF 
Sbjct: 472 ISAGGWELVRTPSEGKEAVDRKDLALPRDQEKFIRRV--LAANPRTIVVLIS-----NFP 524

Query: 524 KNNP----KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
              P     + +I+ + +  +E G A+ DV++G+ NP G+L  TW        P +   L
Sbjct: 525 YAMPWVVKHVPAIVHLTHASQELGHALGDVLWGEVNPDGKLAQTW--------PKSLKQL 576

Query: 580 RPVNNFP---GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINY 636
            P+ ++    GRTY++F G   +PFG+GLSYT F              L   +   D+  
Sbjct: 577 PPMMDYDLTHGRTYQYFKGEPQFPFGFGLSYTTF-------------NLSNLRVGLDVAR 623

Query: 637 TVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQV 695
            VG      A          +   +  +EV N G   G EVV VY++ P       +KQ+
Sbjct: 624 HVGAGAETPAESPAPRTFAPNAILSIAVEVTNTGTRAGDEVVQVYARYPHSKVSRPLKQL 683

Query: 696 IGYERVFIAAGQSAKVGFTMNACK 719
            G++R+ +AAG++A V   + A +
Sbjct: 684 CGFQRISVAAGETAHVRLQLPASR 707


>gi|189467715|ref|ZP_03016500.1| hypothetical protein BACINT_04107 [Bacteroides intestinalis DSM
           17393]
 gi|189435979|gb|EDV04964.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 943

 Score =  280 bits (716), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 231/808 (28%), Positives = 368/808 (45%), Gaps = 140/808 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW------------ 58
           Y D       R +DL+ +MTL EK  QM  L YG  R+    LP  EW            
Sbjct: 53  YEDPNASLDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKDGIGAI 111

Query: 59  --------------------WSEALHGVS-------FIGRRTNSPPGTHFDSEVPG---- 87
                               W  + H  +       FI       P    +  + G    
Sbjct: 112 DEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPVDFTNEGIRGIESY 171

Query: 88  -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
            AT+FPT +    ++N  L +++G     EAR +      G T  ++P ++V RD RWGR
Sbjct: 172 RATNFPTQLGLGHTWNRELIRQVGLITGREARIL------GYTNVYAPILDVGRDQRWGR 225

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
             E  GE PY+V    I  VRG+Q       H        +++A  KH+ AY  +     
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGMQ-------HNH------QVAATGKHFVAYSNNKGARE 272

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
                D +++ ++++   + PF+  + E  +  VM SYN  +G+P       L   +RG+
Sbjct: 273 GMARVDPQMSPREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSYYWLTTRLRGE 332

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
             F GY+VSD D+++ +   H    D KE AV + ++AGL++ C     D Y       V
Sbjct: 333 MGFRGYVVSDSDAVEYLYTKHSTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 391

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN-NICNPQHIELAAEAARQGIV 380
           ++G ++E  I+  +R +  V   +G FD   Q    G +  +   ++  LA +A+R+ +V
Sbjct: 392 KEGGLSEEVINDRVRDILRVKFLIGLFDAPYQTDLAGADREVEKAENESLALQASRESLV 451

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK---VI 437
           LLKN+N  LPL+  N+K +A+ GP+A+     + +Y       T+ ++G    S+    +
Sbjct: 452 LLKNENNVLPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKSEGKAEV 511

Query: 438 NYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
            Y  GC D+V  N                + I  A++ A+ AD  V+V G       E K
Sbjct: 512 LYTKGC-DLVDANWPESELIDYPMTDNEQAEIDKAVENARQADVAVVVLGGGQRTCGENK 570

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
            R  L LPG Q +L+  V    K PV LV+++   + IN+A  +  + +IL   YPG +G
Sbjct: 571 SRSSLDLPGRQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKG 627

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV----- 597
           G A+ADV+FG YNPGG++ +T +  +  +IP+ + P +P +   G      DG +     
Sbjct: 628 GTAVADVLFGDYNPGGKMTVT-FPKSVGQIPF-NFPCKPSSQIDGGKNPGLDGNMSRVNG 685

Query: 598 -VYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
            +Y FGYGLSYT F+Y  +  SPK                                 V  
Sbjct: 686 ALYSFGYGLSYTTFEYSGIEISPK---------------------------------VIT 712

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
            + K T + +V N GK  G EVV +Y +       T+ K + G+ER+ +  G++ +V FT
Sbjct: 713 PNQKATVRCKVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVVFT 772

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
           ++  K L+++D     ++  G  +I+VG
Sbjct: 773 LDR-KQLELLDKHMEWVVEPGDFSIMVG 799


>gi|423293434|ref|ZP_17271561.1| hypothetical protein HMPREF1070_00226 [Bacteroides ovatus
           CL03T12C18]
 gi|392678377|gb|EIY71785.1| hypothetical protein HMPREF1070_00226 [Bacteroides ovatus
           CL03T12C18]
          Length = 735

 Score =  280 bits (715), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 220/760 (28%), Positives = 355/760 (46%), Gaps = 97/760 (12%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLY-- 56
           Y DAK P  +R  DL+ RMTL EK+ Q+     G              VP  +G  +Y  
Sbjct: 30  YKDAKAPIEKRIDDLISRMTLEEKILQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFD 89

Query: 57  --EWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
                  ++   +    R   P    +D+     T +P  +    S+N  L ++     +
Sbjct: 90  INPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSA 149

Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
            EAR    +     TF SP I+V RDPRWGRV E  GEDPY  G +A   VRG       
Sbjct: 150 QEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRG------- 197

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
            Y  D  S   +++AC KHY  Y      G D  +  + ++ Q + +T++LP+EM V  G
Sbjct: 198 -YQGDDMSAENRMAACLKHYVGYGASE-AGRDYVY--TEISAQTLWDTYLLPYEMGVKAG 253

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
             +++M S+N ++G+P  A+P ++ + ++  W   G+IVSD  +++ +   ++ L  TK+
Sbjct: 254 -AATLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQGLAATKK 310

Query: 295 DAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
           DA      AGL++D   + Y       V++GK+  A +D S+R +  V  RLG F+    
Sbjct: 311 DAARYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFRLGLFERPYT 370

Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
                K+    PQ + +AA+ A + +VLLKN+N  LPL   N K +A+VGP A     ++
Sbjct: 371 PVTNEKDRFFRPQSMAVAAQLAAESMVLLKNNNQILPLT--NKKKIAVVGPMAKNGWDLL 428

Query: 414 GNY--EGTPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
           G++   G         DG  A       + YA GC      + S    A+D A+ +D  +
Sbjct: 429 GSWCGHGKDTDVEMLYDGLTAEFGGDAELRYAMGCKP-QGNDRSGFAGALDVARWSDVVI 487

Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
           +  G  L+   E   R  + LP  Q EL+ ++ +A K P+ LV+ +   +++N  +  P 
Sbjct: 488 VCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PIILVLSNGRPLELN--RMEPL 544

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS--MPLRPVNNFP 586
             +IL +  PG  G R++A ++ G+ NP G+L +T+        PY++  +P+       
Sbjct: 545 CDAILEIWQPGINGARSMAGILSGRINPSGKLAMTF--------PYSTGQIPIYYNRRKS 596

Query: 587 GRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
           GR ++ F   +    +YPFG+GLSYT+FKY                          GT  
Sbjct: 597 GRGHQGFYKDITSDPLYPFGHGLSYTEFKY--------------------------GTVT 630

Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERV 701
           P    V   D      K + ++ V N G  DG+E V  +   P  + T  +K++  +E+ 
Sbjct: 631 PSATKVKRGD------KLSAEVTVTNTGARDGAETVHWFISDPYCSITRPVKELKHFEKQ 684

Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
           FI  G++    F ++  +    V+      L +G + ILV
Sbjct: 685 FIKVGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724


>gi|333380553|ref|ZP_08472244.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332826548|gb|EGJ99377.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 957

 Score =  279 bits (714), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 230/766 (30%), Positives = 365/766 (47%), Gaps = 113/766 (14%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQM--GDLAYGVPRLGLPLYEW 58
           + E   V L +  Y +  LP   R +DL+  MT+ +K++ +  G    G+P LG+P    
Sbjct: 157 KAEIANVPLKERAYMNPNLPLESRVEDLLSVMTVEDKMELLREGWGIPGIPHLGVPAIHK 216

Query: 59  WSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEAR 118
             EA+HG S+         G+       GAT FP  I   A++N+ L +     +  E  
Sbjct: 217 -VEAIHGFSY---------GS-------GATIFPQSIGMGATWNKRLIEAAAMAIGDETV 259

Query: 119 AMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHR 178
           +     NA +  WSP ++V +D RWGR  ET GEDP +V      +++G Q         
Sbjct: 260 S----ANA-VQAWSPVLDVAQDARWGRCEETYGEDPVLVTEIGGAWIKGYQ--------- 305

Query: 179 DSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSS 238
              S+ L  +   KH+AA+         R   D  ++E++M+E  ++PF     +    S
Sbjct: 306 ---SKGLMTTP--KHFAAH---GAPLGGRDSHDIGLSEREMREIHLVPFRDIYKKYKYQS 357

Query: 239 VMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVA 298
           +M SY+   G+P     +LL   +R +W F G+IVSDC +I  +     +    K +A  
Sbjct: 358 IMMSYSDFLGVPVAKSKELLKGILRDEWGFDGFIVSDCGAIGNLTARKHYTAVDKVEAAR 417

Query: 299 RVLKAGLDLDCGDYYTN-FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL 357
           + L AG+  +CGD Y +   + A ++G++   D+D + + L   L R G F+ +P  K L
Sbjct: 418 QALAAGIATNCGDTYNDPDVIAAAKRGELNMDDLDFTCKTLLRTLFRNGLFENNP-CKPL 476

Query: 358 GKNNIC----NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
             N I     +P+H  LA + A++ IVLL+N    LPL+  ++KT+A++GP A+  +   
Sbjct: 477 DWNKIYPGWNSPEHQALARKTAQESIVLLENKGNILPLSK-SLKTIAVIGPGADNLQPGD 535

Query: 414 GNYEGTPCRYTSPMDGFYA---YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV 470
              +  P +  S + G  A    S  + Y  GC  I  +    I  A+ AA+NAD  V+V
Sbjct: 536 YTSKPQPGQLKSVLTGIKAAVNSSTKVLYEEGCRFIGTEGTD-IAKAVKAAENADVAVLV 594

Query: 471 AGLDLSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDIN 521
            G   + EA         E  D   L+LPG Q +L+  V    K PV L++ +    +++
Sbjct: 595 LGDCSTSEALKGITNTSGENHDLATLILPGEQQKLLEAVCKTGK-PVVLILQAGRPYNLS 653

Query: 522 FAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP 581
           +A  N +   + W+  PG+EGG A ADV+FG YNP GRLP+T+        P  +  L  
Sbjct: 654 YAAENCQAVLVNWL--PGQEGGYATADVLFGDYNPAGRLPMTF--------PRDAAQLPL 703

Query: 582 VNNF--PGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
             NF   GR Y + D P   +Y FGYGLSYT F Y        ++I L+K+     +N T
Sbjct: 704 YYNFKTSGRVYDYVDMPYYPLYQFGYGLSYTSFNY------SDLNISLEKNGNV-SVNAT 756

Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVI 696
                                       V N GK+ G EVV +Y +       T + ++ 
Sbjct: 757 ----------------------------VTNTGKVAGDEVVQLYITDMYASVKTRVMELK 788

Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            ++RV++  G+S KV F +   + L ++++  + ++  G   I+VG
Sbjct: 789 DFDRVYLNPGESKKVSFVLTPYQ-LSLLNDEMDRVVEKGLFKIMVG 833


>gi|281412136|ref|YP_003346215.1| glycoside hydrolase family 3 domain protein [Thermotoga
           naphthophila RKU-10]
 gi|281373239|gb|ADA66801.1| glycoside hydrolase family 3 domain protein [Thermotoga
           naphthophila RKU-10]
          Length = 778

 Score =  279 bits (714), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 243/816 (29%), Positives = 374/816 (45%), Gaps = 162/816 (19%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGD---------------------LAYG---VP 49
           Y D   P   R +DL+ RMTL EK  Q+G                      L  G   V 
Sbjct: 4   YRDPSQPIEVRVRDLLSRMTLEEKAAQLGSVWGYELIDERGKFSREKAKELLKNGIGQVT 63

Query: 50  RLGLPLYEWWSEALHGVSFIGR------RTNSPPGTHFDSEVP----GATSFPTVILTTA 99
           R G        EA   V+ I R      R   P   H +        G T+FP  I   +
Sbjct: 64  RPGGSTNLEPQEAAELVNEIQRFLVEETRLGIPAMIHEECLTGYMGLGGTNFPQAIAMAS 123

Query: 100 SFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR 159
           +++  L +K+   +  + R +    + GL   +P ++V RDPRWGR  ET GE PY+V R
Sbjct: 124 TWDPDLIEKMTTAIREDMRKIG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVAR 178

Query: 160 YAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD----NWEGNDRFHFDSRVT 215
             ++YV+GLQ   G +  +        + A  KH+A Y       NW   +       + 
Sbjct: 179 MGVSYVKGLQ---GEDIKKG-------VVATVKHFAGYSASEGGKNWAPTN-------IP 221

Query: 216 EQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSD 275
           E++ +E F+ PFE  V E +V SVM SY+ ++G+P  A+ KLL   +R DW F G +VSD
Sbjct: 222 EREFKEVFLFPFEAAVKEANVLSVMNSYSEIDGVPCAANRKLLTDILRKDWGFKGIVVSD 281

Query: 276 CDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-----CGDYYTNFTMGAVQQGKIAEAD 330
             +++ + + H+   D K +A    L+AG+D++     C  Y  +     V++G I+EA 
Sbjct: 282 YFAVKVLEDYHRIARD-KSEAARLALEAGIDVELPKTECYQYLKDL----VEKGIISEAL 336

Query: 331 IDTSL-RFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGAL 389
           ID ++ R L +  M LG F+    Y  + K  I    H ++A + AR+ I+LLKND G L
Sbjct: 337 IDEAVARVLRLKFM-LGLFENP--YVEVEKAKI--ESHKDIALDIARKSIILLKND-GIL 390

Query: 390 PLNTGNIKTLALVGPHANATKAMIGNYE----------------GTPC------------ 421
           PL     K +AL+GP+A   + ++G+Y                 G P             
Sbjct: 391 PLQKN--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKS 448

Query: 422 ------RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG--- 472
                    S +D F        YA GC ++  ++ S    AI+ AK +D  ++V G   
Sbjct: 449 IEEHMKSIPSVLDAFKEEGIEFEYAKGC-EVTGEDRSGFEEAIEIAKKSDVAIVVVGDKS 507

Query: 473 ---LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
              LD +   E +D  +L LPG Q EL+ +VA   K PV LV+++     +    +  K+
Sbjct: 508 GLTLDCTT-GESRDMANLKLPGVQEELVLEVAKTGK-PVVLVLITGRPYSLKNVVD--KV 563

Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGR 588
            +IL V  PGE GGR+I D+I+GK NP G+LPI++   A  + + +   P    +++ G 
Sbjct: 564 NAILQVWLPGEAGGRSIVDIIYGKVNPSGKLPISFPRSAGQIPVFHYVKPSGGRSHWHGD 623

Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
                  P ++PFG+GLSYT+F+Y  +   PK V                     PP   
Sbjct: 624 YVDESTKP-LFPFGHGLSYTKFEYSNLRIEPKEV---------------------PPAGE 661

Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAG 706
           V+I            +++VEN G  DG EVV +Y      + T  +K++ G++RV + A 
Sbjct: 662 VVI------------KVDVENTGDRDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAK 709

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +   V F ++    L   D     ++  G   ++VG
Sbjct: 710 EKKTVVFRLH-MDVLAYYDRDMKLVVEPGEFKVMVG 744


>gi|224538725|ref|ZP_03679264.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224519667|gb|EEF88772.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 942

 Score =  279 bits (714), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 231/808 (28%), Positives = 368/808 (45%), Gaps = 140/808 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW------------ 58
           Y D       R +DL+ +MTL EK  QM  L YG  R+    LP  EW            
Sbjct: 53  YEDPNASLDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKDGIGAI 111

Query: 59  --------------------WSEALHGVS-------FIGRRTNSPPGTHFDSEVPG---- 87
                               W  + H  +       FI       P    +  + G    
Sbjct: 112 DEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPVDFTNEGIRGVESY 171

Query: 88  -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
            AT+FPT +    ++N  L ++IG     EAR +      G T  ++P ++V RD RWGR
Sbjct: 172 RATNFPTQLGLGHTWNRELIRQIGLITGREARML------GYTNVYAPILDVGRDQRWGR 225

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
             E  GE PY+V    I  VRG+Q       H        +++A  KH+ AY  +     
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGMQ-------HNH------QVAATGKHFVAYSNNKGARE 272

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
                D +++ ++++   + PF+  + E  +  VM SYN  +G+P       L   +RG+
Sbjct: 273 GMARVDPQMSPREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSYYWLTTRLRGE 332

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
             F GY+VSD D+++ +   H    D KE AV + ++AGL++ C     D Y       V
Sbjct: 333 MGFRGYVVSDSDAVEYLYTKHSTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 391

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN-NICNPQHIELAAEAARQGIV 380
           ++G ++E  I+  +R +  V   +G FD   Q    G +  +   ++  LA +A+R+ +V
Sbjct: 392 KEGGLSEEVINDRVRDILRVKFLVGLFDTPYQTDLAGADKEVEKAENESLALQASRESLV 451

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK---VI 437
           LLKN+N  LPL+  N+K +A+ GP+A+     + +Y       T+ ++G    ++    +
Sbjct: 452 LLKNENNVLPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKAEGKAEV 511

Query: 438 NYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
            Y  GC D+V  N                + I  A++ A+ AD  V+V G       E K
Sbjct: 512 LYTKGC-DLVDANWPESELIDYPMTDSEQAEIDKAVENARQADVAVVVLGGGQRTCGENK 570

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
            R  L LPG Q +L+  V    K PV LV+++   + IN+A  +  + +IL   YPG +G
Sbjct: 571 SRSSLELPGRQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKG 627

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV----- 597
           G A+ADV+FG YNPGG+L +T +  +  +IP+ + P +P +   G      DG +     
Sbjct: 628 GTAVADVLFGDYNPGGKLTVT-FPKSVGQIPF-NFPCKPSSQIDGGKNPGLDGNMSRVNG 685

Query: 598 -VYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
            +Y FGYGLSYT F+Y  +  SPK                                 V  
Sbjct: 686 ALYSFGYGLSYTTFEYSDIEISPK---------------------------------VIT 712

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
            + K T + +V N GK  G EVV +Y +       T+ K + G+ER+ +  G++ +V FT
Sbjct: 713 PNQKATVRCKVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVVFT 772

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
           ++  K L+++D     ++  G  +I++G
Sbjct: 773 LDR-KQLELLDKHMEWVVEPGDFSIMIG 799


>gi|329850151|ref|ZP_08264997.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
 gi|328842062|gb|EGF91632.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
          Length = 877

 Score =  279 bits (713), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 173/465 (37%), Positives = 245/465 (52%), Gaps = 50/465 (10%)

Query: 2   FESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSE 61
            + +   ++   Y D  L    RA DLV RMTL EK  Q+G  A  +PRLG+P Y WW+E
Sbjct: 11  LDPVPADVAAMAYRDTALDPKARAADLVSRMTLEEKAAQLGHTAPAIPRLGVPKYNWWNE 70

Query: 62  ALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY 121
            LHGV+  G                 AT FP  I   A+++E +   +G  VSTE RA Y
Sbjct: 71  GLHGVARAGV----------------ATVFPQAIGMAATWDEPMMTTVGDVVSTEFRAKY 114

Query: 122 ------NLGN---AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVE 172
                 + G     GLT WSPNIN+ RDPRWGR  ET GEDPY+  R  I Y+ GLQ   
Sbjct: 115 VERVHPDGGTDWYRGLTVWSPNINIFRDPRWGRGQETYGEDPYLTSRIGIGYIHGLQ--- 171

Query: 173 GVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVN 232
                  +D +  K  A  KH+A +       ++R   D   ++ D+++T++  F   V 
Sbjct: 172 ------GNDPKFFKTVATSKHFAVHSGPE---SNRHKEDVYPSKFDLEDTYLPAFRATVT 222

Query: 233 EGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV-ESHKFLND 291
           EG   SVMC YN V G+P CA   L+ + +R +W F G++VSDC +   I  E       
Sbjct: 223 EGKAYSVMCVYNAVYGVPGCASDFLMEEKLRQNWGFPGFVVSDCGAAANIFREDALHYTK 282

Query: 292 TKEDAVARVLKAGLDLDCGDYYTNFT------MGAVQQGKIAEADIDTSLRFLYIVLMRL 345
           T E+ VA  LKAG+DL CGDY    +      + AV+ G++  A +D +L  L+   +RL
Sbjct: 283 TAEEGVAVGLKAGMDLICGDYRNKMSTEVQPIINAVKAGQLPIAVVDQALVRLFEGRIRL 342

Query: 346 GYFD--GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG 403
           G FD   S  + ++  ++   P H  +A + A++ +VLLKND G LPL     KT+A++G
Sbjct: 343 GMFDPPASLPFAHITADDSDTPAHHAVALDMAKKSMVLLKND-GLLPLKA-EPKTIAVIG 400

Query: 404 PHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVINYAPGCADI 446
           P+A++  A++GNY G P +  + +DG  A   +  I YA G   I
Sbjct: 401 PNADSLDALVGNYYGKPSKPVTVLDGIRARFPTAKIVYAEGTGLI 445



 Score =  132 bits (333), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 101/310 (32%), Positives = 146/310 (47%), Gaps = 70/310 (22%)

Query: 453 MIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVAD 502
           M   A+D AK AD  V V GL   VE E          G DR  + LP  Q +L+ KV  
Sbjct: 587 MAGQAVDVAKTADFVVFVGGLSARVEGEEMKVEAEGFAGGDRTSIDLPKPQQQLLEKVIG 646

Query: 503 AAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPI 562
             K P  LV+MS  A+ +N+A  +  + +I+   YPG EGG A+A +I G Y+P GRLP+
Sbjct: 647 TGK-PTVLVLMSGSALGVNWADKH--VPAIIEAWYPGGEGGHAVAQLIAGDYSPAGRLPV 703

Query: 563 TWYEANYVKIPYTSMPLRPVNNFPG--------RTYKFFDGPVVYPFGYGLSYTQFKYKV 614
           T+Y              R V+  PG        RTY++F+G V+YPFG+GLSYT F Y  
Sbjct: 704 TFY--------------RSVDALPGFSDYTMKNRTYRYFNGEVLYPFGHGLSYTTFAY-- 747

Query: 615 ASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDG 674
            ++PK     +                                   T  ++V N G MD 
Sbjct: 748 -ANPKVSAASVAAGSSV-----------------------------TVSVDVSNSGAMDS 777

Query: 675 SEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS 734
            EVV +Y   PG  GT I+ + G++RV +  G++  V F ++  ++L +VD      + +
Sbjct: 778 DEVVQLYVSHPG--GTAIRSLQGFQRVSLKKGETKTVQFKLDD-RALSVVDEHGGRKVQA 834

Query: 735 GAHTILVGEG 744
           G   + +G G
Sbjct: 835 GQVDLWIGGG 844


>gi|383115356|ref|ZP_09936112.1| hypothetical protein BSGG_2769 [Bacteroides sp. D2]
 gi|313695234|gb|EFS32069.1| hypothetical protein BSGG_2769 [Bacteroides sp. D2]
          Length = 735

 Score =  279 bits (713), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 222/755 (29%), Positives = 347/755 (45%), Gaps = 87/755 (11%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLY-- 56
           Y DAK P  +R  DL+ RMTL EKV Q+     G              VP  +G  +Y  
Sbjct: 30  YKDAKAPIEKRIDDLISRMTLEEKVLQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFD 89

Query: 57  --EWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
                  ++   +    R   P    +D+     T +P  +    S+N  L ++     +
Sbjct: 90  INPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSA 149

Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
            EAR    +     TF SP I+V RDPRWGRV E  GEDPY  G +A   VRG       
Sbjct: 150 QEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRG------- 197

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
            Y  D  S   +I+AC KHY  Y      G D  +  + ++ Q + +T++LP+EM V  G
Sbjct: 198 -YQGDDMSAENRIAACLKHYIGYGASE-AGRDYVY--TEISAQTLWDTYLLPYEMGVKAG 253

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
             +++M S+N ++G+P  A+   +   ++  W   G+IVSD  +++ +   ++ L  TK+
Sbjct: 254 -AATLMSSFNDISGVPGSANHYTMTAILKERWKHDGFIVSDWGAVEQL--KNQGLAATKK 310

Query: 295 DAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
           DA      AGL++D   + Y       V++GK+  A +D S+R +  V  RLG F+    
Sbjct: 311 DAAWYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFRLGLFERPYT 370

Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
                K+    PQ + +AA+ A + +VLLKNDN  LPL   N K +A+VGP A     ++
Sbjct: 371 PVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLT--NKKRIAVVGPMAKNGWDLL 428

Query: 414 GNY--EGTPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
           G++   G         DG  A       + YA GC      + S    A+D  + +D  +
Sbjct: 429 GSWCGHGKDTDVEMLYDGLTAEFGGEAELRYAMGCKP-QGNDRSGFAGALDVVRWSDVVI 487

Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
           +  G  L+   E   R  + LP  Q EL+ ++ +A K P+ LV+ +   +++N  +  P 
Sbjct: 488 VCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PIILVLSNGRPLELN--RMEPL 544

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPG 587
             +IL +  PG  G R++A ++ G+ NP G+L IT+ Y    + I Y     R    +  
Sbjct: 545 CDAILEIWQPGINGARSMAGILSGRINPSGKLAITFPYSTGQIPIYYNR---RKSGRWHQ 601

Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
             YK       Y FGYGLSYT+F+Y V  +P S  +K                       
Sbjct: 602 GFYKDITSDPFYSFGYGLSYTEFQYGVV-TPSSTTVK----------------------- 637

Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAG 706
                   +  K + ++ V N GK DG+E V  +   P  + T  +K++  +E+ FI  G
Sbjct: 638 --------RGEKLSVEVTVTNAGKRDGAETVHWFISDPYCSITRPVKELKHFEKQFIKVG 689

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
           ++    F ++  + L  VD      L +G + I V
Sbjct: 690 ETRTFRFDVDLERDLGFVDGNGKRFLEAGEYNIWV 724


>gi|6006601|emb|CAB56857.1| beta-mannanase [Thermotoga neapolitana]
          Length = 821

 Score =  279 bits (713), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 237/813 (29%), Positives = 370/813 (45%), Gaps = 160/813 (19%)

Query: 16  DAKLPYPERAKDLVERMTLPEKVQQMGD---------------------LAYGVPRLGLP 54
           D   P   R KDL+ RMTL EK+ Q+G                      L  G+ ++  P
Sbjct: 49  DPSQPVEVRVKDLLSRMTLEEKIAQLGSVWGYELIDERGKFKREKAKDLLKNGIGQITRP 108

Query: 55  ---LYEWWSEALHGVSFIGR------RTNSPPGTHFDSEVP----GATSFPTVILTTASF 101
                    EA   V+ I R      R   P   H +        G T+FP  I   +++
Sbjct: 109 GGSTNLEPQEAAELVNEIQRFLVEETRLGIPAMIHEECLTGYMGLGGTNFPQAIAMASTW 168

Query: 102 NESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYA 161
           +  L +K+   +  + R +    + GL   +P ++V RDPRWGR  ET GE PY+V R  
Sbjct: 169 DPDLIEKMTAAIREDMRKLG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVARMG 223

Query: 162 INYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD----NWEGNDRFHFDSRVTEQ 217
           ++YV+GLQ           ++    + A  KH+A Y       NW   +       + E+
Sbjct: 224 VSYVKGLQ----------GENIKEGVVATVKHFAGYSASEGGKNWAPTN-------IPER 266

Query: 218 DMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCD 277
           + +E F+ PFE  V E  V SVM SY+ ++G+P  A+ +LL   +R DW F G +VSD  
Sbjct: 267 EFREVFLFPFEAAVKEARVLSVMNSYSEIDGVPCAANRRLLTDILRKDWGFEGIVVSDYF 326

Query: 278 SIQTIVESHKFLNDTKEDAVARVLKAGLDL-----DCGDYYTNFTMGAVQQGKIAEADID 332
           ++  + E H+   D  E A    L+AG+D+     DC  +  +     V++G + E+ ID
Sbjct: 327 AVNMLGEYHRIAKDKSESA-RLALEAGIDVELPKTDCYQHLKDL----VEKGIVPESLID 381

Query: 333 TSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLN 392
            ++  +  +   LG F+    Y ++ K  I    H +LA E AR+ I+LLKND G LPL 
Sbjct: 382 EAVSRVLKLKFMLGLFENP--YVDVEKAKI--ESHRDLALEIARKSIILLKND-GTLPLQ 436

Query: 393 TGNIKTLALVGPHANATKAMIGNYE----------------GTPC--------------- 421
               K +AL+GP+A   + ++G+Y                 G P                
Sbjct: 437 KN--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSIEE 494

Query: 422 ---RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG------ 472
                 S +D F        YA GC ++  ++ S    AI+ AK +D  ++V G      
Sbjct: 495 HMKSIPSVLDAFKEEGIDFEYAKGC-EVTGEDRSGFKEAIEVAKRSDVAIVVVGDRSGLT 553

Query: 473 LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
           LD +   E +D  +L LPG Q EL+ ++A   K PV LV+++     +    +  ++ +I
Sbjct: 554 LDCTT-GESRDMANLKLPGVQEELVLEIAKTGK-PVVLVLITGRPYSLKNLVD--RVNAI 609

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYK 591
           L V  PGE GGRAI DVI+GK NP G+LPI++   A  + + +   P    +++ G    
Sbjct: 610 LQVWLPGEAGGRAIVDVIYGKVNPSGKLPISFPRSAGQIPVFHYVKPSGGRSHWHGDYVD 669

Query: 592 FFDGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
               P ++PFG+GLSYT+F+Y  +   PK V                     P    V+I
Sbjct: 670 ESTKP-LFPFGHGLSYTRFEYSNLRIEPKEV---------------------PSAGEVVI 707

Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSA 709
                       +++VEN+G MDG EVV +Y      + T  +K++ G++RV + A +  
Sbjct: 708 ------------KVDVENVGDMDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKEKK 755

Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            V F ++    L   D     ++  G   ++VG
Sbjct: 756 TVVFRLH-TDVLAYYDRDMKLVVEPGEFRVMVG 787


>gi|393781363|ref|ZP_10369562.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
           CL02T12C01]
 gi|392676856|gb|EIY70278.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
           CL02T12C01]
          Length = 863

 Score =  279 bits (713), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 173/447 (38%), Positives = 243/447 (54%), Gaps = 45/447 (10%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            P+ ++ LP  ERA+DL++R+TL EKV  M D +  +PRLG+  Y WW+EALHGV   G 
Sbjct: 22  LPFNNSDLPVEERAQDLLQRLTLQEKVLLMCDYSSPIPRLGIKRYNWWNEALHGVGRAGL 81

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN------ 125
                           AT FP  I   A+F++   ++  + VS EARA Y+         
Sbjct: 82  ----------------ATVFPQAIGMAATFDDCAVRQAFECVSDEARAKYHHSENKEGSE 125

Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
              GLTFW+PN+N+ RDPRWGR  ET GEDPY+  +  +  VRGLQ          S+S+
Sbjct: 126 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGLAVVRGLQG--------PSESK 177

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFD-SRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             K+ AC KHYA +    W   +R  FD   ++ +D+ ET++  F+  V +G V  VMC+
Sbjct: 178 YDKLHACAKHYALHSGPEW---NRHSFDVDSISPRDLWETYLPAFKALVQQGGVKEVMCA 234

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI-VESHKFLNDTKEDAVARVL 301
           YNR  G P C   +LL   +R +W F G +VSDC +I    ++ H   + TKE AVA  +
Sbjct: 235 YNRFEGEPCCGSNRLLYNILREEWGFDGLVVSDCGAISDFYLKGHHETHPTKEAAVAAAV 294

Query: 302 KAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLG 358
           KAG DLDCG DYY      AV++G I E  ID SL  L      LG  D      + ++ 
Sbjct: 295 KAGTDLDCGVDYYA--LQKAVEEGIITEKQIDVSLFRLLKARFELGLMDEEHLVSWSDIP 352

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
              + + +H E A E AR+ + LLKND+G LPL+  +   +A++GP+AN +  M GNY G
Sbjct: 353 YTVVDSEKHREKALEMARKSMTLLKNDHGTLPLSK-HCGKIAVIGPNANDSVMMWGNYNG 411

Query: 419 TPCRYTSPMDGFYAY--SKVINYAPGC 443
            P    + ++G      ++ I Y  GC
Sbjct: 412 FPSHTVTILEGITHKLGAEQIIYDKGC 438



 Score =  119 bits (298), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 90/296 (30%), Positives = 139/296 (46%), Gaps = 55/296 (18%)

Query: 460 AAKNADATVIV--AGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
           AA+  DA VIV   G+   VE E          G DR  + LP  Q +L+ ++    K P
Sbjct: 594 AARVGDAEVIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELHKTGK-P 652

Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
           V L++ S  A  I  +       +I+   Y G+ GG A+ADV+FG YNP GRLP+T+Y+A
Sbjct: 653 VILILCSGSA--IGLSAEVDLADAIIQAWYLGQAGGTAVADVLFGDYNPAGRLPVTFYKA 710

Query: 568 NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
                    +P     +  GRTY++F+G  ++PFGYGLSYT F+   A   K        
Sbjct: 711 T------EQLPDFEDYSMQGRTYRYFEGEALFPFGYGLSYTSFEIGKARLSK-------- 756

Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI 687
            ++ R+ N +V                      + ++ VEN GK+DG EV+ +Y +    
Sbjct: 757 -KRIRE-NESV----------------------SLKLTVENTGKLDGDEVIQIYIRKLQD 792

Query: 688 AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
               +K +  ++R  + AG+   V F +         D  +N++ +  G + IL G
Sbjct: 793 KEGPLKTLRAFKRFHLRAGEKKDVTFHLQN-DHFNFFDTESNTMRVMPGEYEILYG 847


>gi|293370605|ref|ZP_06617157.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292634339|gb|EFF52876.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 861

 Score =  278 bits (712), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 169/445 (37%), Positives = 235/445 (52%), Gaps = 44/445 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY D  L   +RA+DL+ R+TL EKV  M + +  +PRLG+  YEWW+EALHGV   G 
Sbjct: 24  LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
                           AT FP  I   ASFN+SL  ++    S EAR    + G +G   
Sbjct: 84  ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLK 127

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFW+PN+N+ RDPRWGR  ET GEDPY+ G+  +  VRGLQ  E   Y       
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDAGYD------ 181

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             K+ AC KH+A +    W   +R  FD+  +  +D+ ET++  F+  V +  V  VMC+
Sbjct: 182 --KLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCA 236

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
           YNR  G P C   +LL Q +R +W + G +VSDC +I       +H    D KE A A  
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASAAA 295

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           ++ G DL+CG  Y +    AV+ G I E +ID SL+ L      LG  D    +  +  +
Sbjct: 296 VRTGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTS 354

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + + +H  LA   AR+ +VLL+N N  LPLNT ++K +A++GP+AN +    GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-HLK-VAVMGPNANDSVMQWGNYNGIP 412

Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
               + ++   A      I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437



 Score =  104 bits (260), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 84/297 (28%), Positives = 131/297 (44%), Gaps = 53/297 (17%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           A+    +AD  +   G+  S+E E          G DR D+ LP  Q +L+  +  A K 
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGK- 649

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
              +V ++     I         ++IL   YPG+ GG AI D ++G+YNPGGRLP+T+Y+
Sbjct: 650 --KVVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
                     +P     +  GRTY++     ++PFG+GLSYT F Y  A        KL 
Sbjct: 708 ------DVNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEA--------KLS 753

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           K+   +  N                            I V N+G+ DG EVV VY + PG
Sbjct: 754 KNTIAKGEN------------------------VVLTIPVSNVGQRDGEEVVQVYLRRPG 789

Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
                   +  ++RV I AG++  V   +   ++ +  D  +N++    G + +L G
Sbjct: 790 DKEGPRYTLRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMRPLEGTYELLYG 845


>gi|423222018|ref|ZP_17208488.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392644204|gb|EIY37946.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 942

 Score =  278 bits (712), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 230/808 (28%), Positives = 366/808 (45%), Gaps = 140/808 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW------------ 58
           Y D       R +DL+ +MTL EK  QM  L YG  R+    LP  EW            
Sbjct: 53  YEDPNASLDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKDGIGAI 111

Query: 59  --------------------WSEALHGVS-------FIGRRTNSPPGTHFDSEVPG---- 87
                               W  + H  +       FI       P    +  + G    
Sbjct: 112 DEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPVDFTNEGIRGVESY 171

Query: 88  -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
            AT+FPT +    ++N  L +++G     EAR +      G T  ++P ++V RD RWGR
Sbjct: 172 RATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQRWGR 225

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
             E  GE PY+V    I  VRG+Q                +++A  KH+ AY  +     
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGMQHSH-------------QVAATGKHFVAYSNNKGARE 272

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
                D +++ ++++   + PF+  + E  +  VM SYN  +G+P       L   +RG+
Sbjct: 273 GMARVDPQMSPREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSYYWLTTRLRGE 332

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
             F GY+VSD D+++ +   H    D KE AV + ++AGL++ C     D Y       V
Sbjct: 333 MGFRGYVVSDSDAVEYLYTKHSTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 391

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN-NICNPQHIELAAEAARQGIV 380
           ++G ++E  I+  +R +  V   +G FD   Q    G +  +   ++  LA +A+R+ +V
Sbjct: 392 KEGGLSEEVINDRVRDILRVKFLVGLFDTPYQTDLAGADKEVEKAENESLALQASRESLV 451

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK---VI 437
           LLKN+N  LPL+  N+K +A+ GP+A+     + +Y       T+ ++G    ++    +
Sbjct: 452 LLKNENNVLPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKAEGKAEV 511

Query: 438 NYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
            Y  GC D+V  N                + I  A++ A+ AD  V+V G       E K
Sbjct: 512 LYTKGC-DLVDANWPESELIDYPMTDSEQAEIDKAVENARQADVAVVVLGGGQRTCGENK 570

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
            R  L LPG Q +L+  V    K PV LV+++   + IN+A  +  +  IL   YPG +G
Sbjct: 571 SRSSLDLPGRQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPVILEAWYPGSKG 627

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV----- 597
           G A+ADV+FG YNPGG+L +T +  +  +IP+ + P +P +   G      DG +     
Sbjct: 628 GTAVADVLFGDYNPGGKLTVT-FPKSVGQIPF-NFPCKPSSQIDGGKNPGLDGNMSRVNG 685

Query: 598 -VYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
            +Y FGYGLSYT F+Y  +  SPK                                 V  
Sbjct: 686 ALYSFGYGLSYTTFEYSDIEISPK---------------------------------VIT 712

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
            + K T + +V N GK  G EVV +Y +       T+ K + G+ER+ +  G++ +V FT
Sbjct: 713 PNQKATVRCKVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVVFT 772

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
           ++  K L+++D     ++  G  +I+VG
Sbjct: 773 LDR-KQLELLDKHMEWVVEPGDFSIMVG 799


>gi|380696428|ref|ZP_09861287.1| beta-glucosidase [Bacteroides faecis MAJ27]
          Length = 851

 Score =  278 bits (711), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 162/421 (38%), Positives = 233/421 (55%), Gaps = 45/421 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y +   P  ER  DL+ R+T+ EK+  +   + G+PRLG+  Y   +EALHGV   GR  
Sbjct: 28  YKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-- 85

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
                          T FP  I   A++N  L +++   +S EARA +N  + G      
Sbjct: 86  --------------FTVFPQAIGLAATWNPELQRRVATVISDEARARWNELDQGRAQKEQ 131

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFWSP +N+ RDPRWGR  ET GEDP++ G     +V+GLQ           D  
Sbjct: 132 FSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQG---------DDPH 182

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LKI +  KH+AA    N E ++RF  + +++E+ ++E +   FEMCV EG  +S+M +Y
Sbjct: 183 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAY 238

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N +N +P   +  LL + +R DW F GY+VSDC     +V +HK+L  TKE A    LKA
Sbjct: 239 NALNDVPCTLNAWLLQKVLRKDWGFQGYVVSDCGGPALLVNAHKYLK-TKEAAATLSLKA 297

Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           GLDL+CG D Y    + A +Q  +++ADID++   +    M+LG FDG  +  Y  +  +
Sbjct: 298 GLDLECGDDVYDGPLLNAYKQYMVSDADIDSAAYHVLTARMKLGLFDGVERNPYTKISPS 357

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            I + +H ++A +AARQ IVLLKN    LPLN   +K++A+VG   NA K   G+Y G P
Sbjct: 358 VIGSKEHQQIALDAARQCIVLLKNQKNMLPLNASKLKSIAVVG--INAGKCEFGDYSGAP 415

Query: 421 C 421
            
Sbjct: 416 V 416



 Score =  134 bits (336), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 87/301 (28%), Positives = 148/301 (49%), Gaps = 51/301 (16%)

Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
           A +  +  + V G++ S+E EG+DR D+ LP  Q E + ++       + +++++  ++ 
Sbjct: 598 AVRECETVIAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNSN-MIVILVAGSSLA 656

Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
           IN+   +  + +I+   YPGE+GG A+A+V+FG YNP GRLP+T+Y++   ++P    P 
Sbjct: 657 INWMDEH--VPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS-LDELP----PF 709

Query: 580 RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
              +   GRTYK+F G V+YPFGYGLSY+ FKY                           
Sbjct: 710 DDYDITKGRTYKYFKGEVLYPFGYGLSYSSFKY--------------------------- 742

Query: 640 TNKPPCAAVLIDDVKCKDY--KFTFQIEVENMGKMDGSEVVMVYSKPPGIAG-THIKQVI 696
                       D++ KD   +      ++N GK +G EV  VY + P   G   +K++ 
Sbjct: 743 -----------SDLRVKDEADEVAVSFRLKNTGKRNGDEVTQVYVRIPETGGIVPVKELK 791

Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVGEGVGGVSFPLQLN 755
           G+ RV + +G+S +V   +N  + L+  D      +   G   I+VG     +     ++
Sbjct: 792 GFRRVPLKSGESRRVEIRLNK-EQLRYWDVGKGQFVVPKGTFDIMVGASSKDIRLQTVID 850

Query: 756 L 756
           L
Sbjct: 851 L 851


>gi|298479985|ref|ZP_06998184.1| periplasmic beta-glucosidase [Bacteroides sp. D22]
 gi|298273794|gb|EFI15356.1| periplasmic beta-glucosidase [Bacteroides sp. D22]
          Length = 735

 Score =  278 bits (711), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 222/760 (29%), Positives = 354/760 (46%), Gaps = 97/760 (12%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLY-- 56
           Y DAK P  +R  DL+ RMTL EKV Q+     G              VP  +G  +Y  
Sbjct: 30  YKDAKAPIEKRIDDLISRMTLEEKVLQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFD 89

Query: 57  --EWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
                  ++   +    R   P    +D+     T +P  +    S+N  L ++     +
Sbjct: 90  INPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSA 149

Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
            EAR    +     TF SP I+V RDPRWGRV E  GEDPY  G +A   VRG       
Sbjct: 150 QEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRG------- 197

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
            Y  D  S   +++AC KHY  Y      G D  +  + ++ Q + +T++LP+EM V  G
Sbjct: 198 -YQGDDMSAENRMAACLKHYVGYGASE-AGRDYVY--TEISAQTLWDTYLLPYEMGVKAG 253

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
             +++M S+N ++G+P  A+P ++ + ++  W   G+IVSD  +++ +   ++ L  TK+
Sbjct: 254 -AATLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQGLAATKK 310

Query: 295 DAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
           DA      AGL++D   + Y       V++GK+  A +D S+R +  V   LG F+    
Sbjct: 311 DAAQYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFCLGLFERPYT 370

Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
                K+    PQ + +AA+ A + +VLLKNDN  LPL   N K +A+VGP A     ++
Sbjct: 371 PVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLT--NKKKIAVVGPMAKNGWDLL 428

Query: 414 GNY--EGTPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
           G++   G         DG  A       + YA GC      + S    A+D A+ +D  +
Sbjct: 429 GSWCGHGKDTDVEMLYDGLTAEFGGDAELRYAMGCKP-QGNDRSGFAGALDVARWSDVVI 487

Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
           +  G  L+   E   R  + LP  Q EL+ ++ +A K PV LV+ +   +++N  +  P 
Sbjct: 488 VCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PVILVLSNGRPLELN--RMEPL 544

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS--MPLRPVNNFP 586
             +IL +  PG  G R++A ++ G+ NP G+L +T+        PY++  +P+       
Sbjct: 545 CDAILEIWQPGINGARSMAGILSGRINPSGKLAMTF--------PYSTGQIPIYYNRRKS 596

Query: 587 GRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
           GR ++ F   +    +YPFG+GLSYT+FKY                          GT  
Sbjct: 597 GRGHQGFYKDITSDPLYPFGHGLSYTEFKY--------------------------GTVT 630

Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERV 701
           P    V   D      K + ++ V N G  DG+E V  +   P  + T  +K++  +E+ 
Sbjct: 631 PSATKVKRGD------KLSAEVTVTNTGSRDGAETVHWFISDPYCSITRPVKELRHFEKQ 684

Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
            I AG++    F ++  +    V+      L +G + ILV
Sbjct: 685 LIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724


>gi|255690202|ref|ZP_05413877.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
 gi|260624221|gb|EEX47092.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 853

 Score =  278 bits (711), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 163/420 (38%), Positives = 232/420 (55%), Gaps = 45/420 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y +A  P  ER  DL+ R+T+ EK+  +   + G+PRLG+  Y   +EALHGV   GR  
Sbjct: 29  YKNANAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-- 86

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
                          T FP  I   A++N  L K+I   +S EARA +N  + G      
Sbjct: 87  --------------FTVFPQAIGLAATWNPELQKRIATVISDEARARWNELDQGRNQKEQ 132

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFWSP +N+ RDPRWGR  ET GEDP++ G     +V+GLQ           D  
Sbjct: 133 FSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQG---------DDPH 183

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LKI +  KH+AA    N E ++RF  + +++E+ ++E +   FEMCV EG  +S+M +Y
Sbjct: 184 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAY 239

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N +N +P   +  LL + +R DW F GY+VSDC     +V +HK++  TKE A    +KA
Sbjct: 240 NALNNVPCTLNSWLLQKVLRRDWGFQGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSIKA 298

Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           GLDL+CG D Y  + + A +Q   +EADID++   +    M+LG FDG  +  Y  +  +
Sbjct: 299 GLDLECGDDVYDEYLLNAYKQYMASEADIDSAAYHVLTARMKLGLFDGVERNPYAKISPS 358

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            I + +H  +A  AAR+ IVLLKN    LPLN   +K++A+VG   NA K   G+Y G P
Sbjct: 359 VIGSKEHQTVALNAARECIVLLKNQKNMLPLNVKKLKSIAVVG--INAGKCEFGDYSGAP 416



 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 94/299 (31%), Positives = 155/299 (51%), Gaps = 46/299 (15%)

Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
           A +  +  V V G++ S+E EG+DR D+ LP  Q E + ++       + LV+++  ++ 
Sbjct: 599 AVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IILVLVAGSSLA 657

Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
           +N+   N  + +I+   YPGE+GG A+A+V+FG YNP GRLP+T+Y++   ++P      
Sbjct: 658 VNW--ENEHLPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS-LEQLP----AF 710

Query: 580 RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
              +   GRTY++F   V+YPFGYGLSYT FKY         ++K+D   +  ++++T  
Sbjct: 711 DDYDITKGRTYQYFKKDVLYPFGYGLSYTTFKYS--------NLKVDDAGKTVNVSFT-- 760

Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGT--HIKQVIG 697
                                     ++N GK  G EV  VY + P IAG+   I+Q+ G
Sbjct: 761 --------------------------LKNTGKRAGDEVAQVYVRLPEIAGSTQAIRQLKG 794

Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
           + RV + AG+S KV  T++  +     +  A  ++  G+ T +VG   G +      NL
Sbjct: 795 FRRVALKAGESRKVEITLDKEQLRYWDEKQACFVVPQGSFTFMVGASSGDIRLENTTNL 853


>gi|340616359|ref|YP_004734812.1| xylosidase/arabinosidase [Zobellia galactanivorans]
 gi|339731156|emb|CAZ94420.1| Xylosidase/arabinosidase, family GH3 [Zobellia galactanivorans]
          Length = 801

 Score =  278 bits (711), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 232/810 (28%), Positives = 365/810 (45%), Gaps = 147/810 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
           Y D   P   R +DL+ +MTL EK  QM  L YG  R+    LP  +W ++    G+  I
Sbjct: 44  YEDPTRPVDLRIEDLLSQMTLEEKSCQMATL-YGFGRVLKDELPTPDWKNQIWKDGIGNI 102

Query: 70  GRRTNS-------------PPGTH----------------------FDSE-VPG-----A 88
             + N+             PP  H                      F +E + G     A
Sbjct: 103 DEQLNNLAYHPSAVTDKAWPPSNHIKALNTIQEFFVEDTRLGIPVDFTNEGIRGLCHEKA 162

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVL 147
           TSFP+ +   A++N++L  KIG     EAR +      G T  +SP +++ RDPRWGRV+
Sbjct: 163 TSFPSQLGVGATWNKNLVGKIGHITGKEARLL------GYTNVYSPILDIARDPRWGRVV 216

Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
           E  GEDPY+VG      V+G+Q                K+ +  KH+A Y       +  
Sbjct: 217 ECYGEDPYLVGELGYQMVKGIQQE--------------KVVSTPKHFAIYSAPKGGRDGD 262

Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
              D+ +TE+++   ++ PF+  + +     VM SYN  NG+P  +    LN  +R DW 
Sbjct: 263 ARTDAHITERELFSLYLHPFKRAIKDAGAMGVMSSYNDYNGVPVSSSKYFLNDILREDWG 322

Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTM--------- 318
           F GY+VSD  +++ I + H    D K DAV + + AGL++      T+FTM         
Sbjct: 323 FKGYVVSDSRAVEFIADKHHVAKDRK-DAVRQAVLAGLNV-----RTDFTMPEDFILPVR 376

Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK--NNICNPQHIELAAEAAR 376
             V++G +  A ID  +R +  V    G FD +P  K + +    +  P++ E+A +A+ 
Sbjct: 377 ELVKEGGLDMATIDDRVRDILRVKFWQGLFD-APYGKQMKEADKTVGKPEYQEVAYQASL 435

Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF---YAY 433
           + IVLLKN+   LPL+    K++ + GP+A A    +  Y  +     S  DG    +  
Sbjct: 436 ESIVLLKNEENILPLDFSKYKSVLVTGPNAKAINHSVSRYGPSHIDVVSVFDGIKEKFPK 495

Query: 434 SKVINYAPGCA-------DIVCQN-------NSMIPAAIDAAKNADATVIVAGLDLSVEA 479
              I Y  GC        D    N        S I  A+  AK     ++V G D     
Sbjct: 496 DVEIKYTKGCVFFDENWPDSELMNTPPTEAEQSEIDKAVAMAKTVGLAIVVLGDDEETVG 555

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
           E + R  L LPG Q +L+ ++      PV +V+++   + IN+   +  +  I+   + G
Sbjct: 556 ESRSRTSLDLPGNQQKLVEEIYKTGT-PVIVVLINGRPMTINWV--DKYVPGIVEGWFQG 612

Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF------PGRTYKFF 593
           + GG AIADV+ G YNPGG+LP++ +     ++P  + P +P          P  + K  
Sbjct: 613 KFGGSAIADVLVGSYNPGGKLPVS-FPKTVGQLP-MNFPSKPGAQADQPAKGPNGSGKTR 670

Query: 594 DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
            G  +YPFGYGLSYT F+Y                      N  + +N           +
Sbjct: 671 VGGFLYPFGYGLSYTTFEY---------------------TNLKIRSN-----------I 698

Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVG 712
           K         +++ N GK  G E+V +Y S        + KQ+ G+ER+ + AG++  V 
Sbjct: 699 KNGLGDVVVSVDITNSGKRKGDEIVQLYFSDETSSVTVYEKQLRGFERISLEAGETKTVN 758

Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVG 742
           FT+ + + L + +     +L  G+ TI++G
Sbjct: 759 FTL-SPEDLSLYNRQMEFVLEPGSFTIMIG 787


>gi|94970273|ref|YP_592321.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
 gi|94552323|gb|ABF42247.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
          Length = 881

 Score =  278 bits (710), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 168/464 (36%), Positives = 244/464 (52%), Gaps = 56/464 (12%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y +  L   +RA DLV RMT+ EKV Q+ + +  VPRL +P Y+WWSEALHGV+      
Sbjct: 30  YLNPSLAPEKRAADLVHRMTVEEKVSQLTNDSRAVPRLNVPDYDWWSEALHGVA------ 83

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN-------- 125
                       PG T +P  +   A+F+    +++ + +  E R  +  G         
Sbjct: 84  -----------QPGVTEYPQPVALAATFDNDKVQRMARFIGIEGRIKHEEGMKDGHSDIF 132

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GL FW+PNIN+ RDPRWGR  ET GEDP++  R  + YV+GLQ  +   Y        L
Sbjct: 133 QGLDFWAPNINIFRDPRWGRGQETYGEDPFLTARMGVAYVKGLQGDDPKYY--------L 184

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
            IS   KHYA +   +     R   D +V++ D  +T++  F   V E    SVMC+YN 
Sbjct: 185 AIS-TPKHYAVH---SGPETTRHFADVKVSKHDELDTYLPAFRATVTEAKAGSVMCAYNS 240

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           +NG P C +  LL   +RG WNF GY+VSDC++I  I   HKF   T+ +A A  ++ G+
Sbjct: 241 INGQPACVNEFLLQDQLRGKWNFQGYVVSDCEAIINIYRDHKF-TKTQAEASALAVQRGM 299

Query: 306 DLDCGDY--------YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---Y 354
           D +C D+        Y  +   A +QG + E++IDT+L  L+   M+LG FD  P+   Y
Sbjct: 300 DNECVDFGKQKDDHDYRPY-FDAYKQGILKESEIDTALVRLFTARMKLGMFD-PPEMVPY 357

Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
             +    + + +H ELA   A + +VLLKND G LPL    +K +A++GP A  T+ ++G
Sbjct: 358 SKIDPKELESAEHRELARTLANESMVLLKND-GTLPLKKSGLK-IAVIGPLAEQTRYLLG 415

Query: 415 NYEGTPCRYTSPMDGFYAY--SKVINYAPGCADIVCQNNSMIPA 456
           NY GTP    S ++G  A      I +  G    + QN   +P+
Sbjct: 416 NYNGTPSHTVSVLEGLRAEFPDAQITFERGT-QFLDQNGEAVPS 458



 Score =  161 bits (408), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 101/300 (33%), Positives = 154/300 (51%), Gaps = 52/300 (17%)

Query: 455 PAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAA 504
           PAA+ AAKNAD  + V G+   +E E          G DR  L LP  + +L+  ++ A 
Sbjct: 601 PAAVTAAKNADVVIAVLGITSDLEGEEMPVSEEGFNGGDRTSLDLPKPEQQLLESISAAG 660

Query: 505 KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW 564
           K PV LV+ +  A+ +N+A+ +    +IL   YPGEEGG AIA  + GK NP GRLP+T+
Sbjct: 661 K-PVVLVLSNGSALSVNWAQQH--ANAILEGWYPGEEGGTAIAQTLSGKNNPAGRLPVTF 717

Query: 565 YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIK 624
           Y       P+    ++      GRTY++F+G  +YPFGYGLSYT F Y+  + PK+    
Sbjct: 718 YTGTEQLPPFEDYAMK------GRTYRYFEGKPLYPFGYGLSYTTFSYRDLALPKA---- 767

Query: 625 LDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP 684
                        +    P                 T Q+ V N GK++G EV  +Y   
Sbjct: 768 ------------PLNAGDP----------------VTAQVTVTNTGKVEGDEVAQLYLSF 799

Query: 685 PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
           P IAG  ++ + G+ R+ + AG+S  + F +   + L +V+ A + ++A G +++ VG G
Sbjct: 800 PNIAGAPLRALRGFRRIHLKAGESQTIKFELKD-RDLSMVNEAGDPIIAEGEYSVSVGGG 858


>gi|261408260|ref|YP_003244501.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
 gi|261284723|gb|ACX66694.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
           Y412MC10]
          Length = 763

 Score =  278 bits (710), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 216/691 (31%), Positives = 335/691 (48%), Gaps = 100/691 (14%)

Query: 87  GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
           GAT FP  +   +++N  L++ I + V+ E RA       G   +SP ++VVRDPRWGR 
Sbjct: 123 GATVFPVPLTIGSTWNTELFRSISRAVAAETRA-----QGGSATYSPVLDVVRDPRWGRT 177

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
            ET GEDP++V  +A+  V+GLQ  E ++ H         + A  KH+A Y       N 
Sbjct: 178 EETFGEDPHLVTEFAVAAVQGLQG-ERLDSH-------TSLLATLKHFAGYGASEGGRNG 229

Query: 207 R-FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
              H   R    ++ E  +LPF   V  G + SVM +YN ++G+P  +   LL   +R  
Sbjct: 230 APVHMGLR----ELHEVDLLPFRKAVEAGAL-SVMTAYNEIDGVPCTSSGYLLQDVLREA 284

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQG 324
           W F G++++DC +I  +   H       E A A+ LKAG+D++  G  +      A++QG
Sbjct: 285 WGFDGFVITDCGAIHMLACGHNTAGSGVE-AAAQSLKAGVDMEMSGTMFRAHLHQALEQG 343

Query: 325 KIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKN 384
            I E D++ +   +  +  RLG FD         +  I   +HI LA +AA +GIVLLKN
Sbjct: 344 LITEEDLNRAAGRVLELKFRLGLFDRPYVDPAWAEQVIGCKEHIALAYQAAAEGIVLLKN 403

Query: 385 DNGALPLNTGNIKTLALVGPHANATKAMIGNYEG--TPCRYTSPMDGF---YAYSKVINY 439
           +   LPL++ +  T+A++GP+A+A    +G+Y     P +  + +DG       S+V+ Y
Sbjct: 404 EGNLLPLDSSS-GTIAVIGPNAHAPYHQLGDYTSPQPPGQIVTVLDGIRRRLGDSRVL-Y 461

Query: 440 APGCADIVCQNNSMIPAAIDAAKNADATVIVAG-----------LDLSVEA--------- 479
           APGC  I   +    P A+  A+ AD  V+V G           +DL   A         
Sbjct: 462 APGC-RIQGDSREGFPRALACAEQADVIVMVLGGSSARDFGEGTIDLRTGASVVTGHAES 520

Query: 480 -----EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
                EG DR  L L G Q EL+ ++    K PV +V ++   +   +   +  I SI+ 
Sbjct: 521 DMECGEGIDRSTLTLMGVQLELLQELHKLGK-PVIVVYINGRPITEPWIDEH--IPSIVE 577

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
             YPG+EGG AIAD++FG  NP GRLP++   E   +   Y +   R      G+ Y   
Sbjct: 578 AWYPGQEGGSAIADMLFGDINPSGRLPLSIPKEVGQLPNSYNARRTR------GKRYLET 631

Query: 594 DGPVVYPFGYGLSYTQFKY-KVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
           D    YPFG+GLSYT+F+Y ++   P  V I  +                          
Sbjct: 632 DLAPRYPFGFGLSYTEFRYGRLTVEPAVVPIGGEA------------------------- 666

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKV 711
                   T +I+V N G  DG+EVV +Y      + T  ++ + G+ +VF+ AG++ +V
Sbjct: 667 --------TVRIDVTNAGARDGAEVVQLYVSDLAASVTRPEKALKGFRKVFLKAGETQEV 718

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            FT+ + + L+++      ++  G   I VG
Sbjct: 719 TFTIGS-EQLELIGLDLKPVVEPGEFRIQVG 748


>gi|386819249|ref|ZP_10106465.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
           19592]
 gi|386424355|gb|EIJ38185.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
           19592]
          Length = 878

 Score =  277 bits (708), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 161/430 (37%), Positives = 242/430 (56%), Gaps = 46/430 (10%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +P+ + +LP  ER  DL+ R+T+ EK+ Q+   +  + RLG+P Y WW+E+LHGV+  G 
Sbjct: 24  YPFQNTELPEDERVNDLINRLTVDEKIAQLLYQSPAIERLGIPAYNWWNESLHGVARAGY 83

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGN-- 125
                           AT FP  I   AS+++ L  ++   +S EARA ++     G   
Sbjct: 84  ----------------ATVFPQSITIAASWDDELVAEVANVISDEARAKHHEYLRRGQHD 127

Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
              GLTFWSPNIN+ RDPRWGR  ET GEDPY+ G     YV+GLQ          ++++
Sbjct: 128 IYQGLTFWSPNINIFRDPRWGRGHETYGEDPYLTGVLGTEYVKGLQ---------GNNAK 178

Query: 184 PLKISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
            LK+ A  KH+A +      G +  R  FD   +++D+ ET++  F   V +G+V S+M 
Sbjct: 179 YLKVVATAKHFAVHS-----GPEPLRHEFDVAPSQRDLWETYLPAFRTLVKDGNVYSIMT 233

Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
           +YNR+ G    A   L +  +R  W F+GY+VSDC +I  + ++H    D  E A A  +
Sbjct: 234 AYNRIYGEAASASNSLYS-ILRDKWGFNGYVVSDCGAIADMWKTHHVAKDAAE-ASAMAV 291

Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGK 359
           K G DL+CG+ Y   T  A+Q G I EAD+D +L  L     +LG FD   +  Y  +  
Sbjct: 292 KEGCDLNCGNSYEKLT-DALQDGLITEADLDVALHRLMRARFKLGMFDSDEKVPYAKIPF 350

Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
           +   NP+H  LA +AA++ IVLLKN+N  LPL + N+K +A++GP+A+  +++ GNY G 
Sbjct: 351 SVNNNPKHKVLALKAAQKSIVLLKNENAILPL-SKNLKNIAVIGPNADNIQSLWGNYNGM 409

Query: 420 PCRYTSPMDG 429
           P    + ++G
Sbjct: 410 PKNPVTVLEG 419



 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 102/302 (33%), Positives = 150/302 (49%), Gaps = 54/302 (17%)

Query: 452 SMIPAAIDAAKNADATVIVAGL-------DLSVEAEG---KDRVDLLLPGFQTELINKVA 501
           + +  A+ AA  +D  V+  GL       ++ VE EG    DR  L LP  Q EL+ +V 
Sbjct: 587 NQLEKAVLAANKSDVVVLALGLNERLEGEEMKVEVEGFADGDRTSLNLPKKQVELMKEVV 646

Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
              K PV LV+++  A+ IN+A  N  I +I+  GYPG+EGG AIA+V+FG YNP GRLP
Sbjct: 647 ATGK-PVVLVLLNGSALSINWASEN--IPAIISAGYPGQEGGNAIANVLFGDYNPAGRLP 703

Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
           +T+Y++         +P     N  GRTYK+F    +YPFGYGLSYT+FKY     P  +
Sbjct: 704 VTYYKS------VDDLPPFEDYNMDGRTYKYFKKEPLYPFGYGLSYTKFKYSNLEIPLEI 757

Query: 622 DIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY 681
            I                 N+P                    ++V N G  DG EVV +Y
Sbjct: 758 KI-----------------NEP----------------IKVSVQVANEGDFDGDEVVQLY 784

Query: 682 SK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
            +   G     I +++G++R+ +  G   KV FT+   + L +++     ++  G  +I 
Sbjct: 785 VRDEEGSTPRPICELVGFKRIHLKKGARQKVEFTIQP-RELAMINKDDKFVIEPGWFSIS 843

Query: 741 VG 742
           VG
Sbjct: 844 VG 845


>gi|380694609|ref|ZP_09859468.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
           [Bacteroides faecis MAJ27]
          Length = 804

 Score =  277 bits (708), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 223/726 (30%), Positives = 344/726 (47%), Gaps = 120/726 (16%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+P++    EA HG   IG                  T FPT I  +A+++ +L +++
Sbjct: 142 RLGIPVF-LAEEAPHGHMAIG-----------------TTVFPTGIGMSATWSPTLIEEV 183

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           G+ ++ E R+           + P +++ RDPRW RV ET GEDP + GR     V GL 
Sbjct: 184 GKAIAKEIRS-----QGAHISYGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMVTGLG 238

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                       SR     A  KH+ AY +   EG    ++ S V  +D+ E F+ PF  
Sbjct: 239 S--------GDLSREHATIATLKHFLAYAVP--EGGQNGNYAS-VGARDLHENFLPPFRE 287

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            +  G +S VM SYN ++GIP  A+  LL Q +R +W F G++VSD  SI+ I ESH F+
Sbjct: 288 AIEAGALS-VMTSYNSIDGIPCTANHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FV 345

Query: 290 NDTKEDAVARVLKAGLDLDC-GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
             T E+A  + L AG+D+D  GD + N  + AV+ GK+ E  I+ ++  +  +   +G F
Sbjct: 346 ASTMEEAAVQALSAGVDIDLGGDAFMNL-LQAVRSGKLDETQINAAVDRILRMKFEMGLF 404

Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
           +            + N +H++LA + A+  +VLL+N N  LPL+   IK +A+VGP+A+ 
Sbjct: 405 EHPYVNPKTTTKMVRNKEHVKLARKVAQSSVVLLENKNSILPLSK-KIKRVAVVGPNADN 463

Query: 409 TKAMIGNY----EGTPCRYTSPMDGFYAYSKV----INYAPGCADIVCQNNSMIPAAIDA 460
              M+G+Y    E    R  + +DG    SK+    + Y  GCA I     + I  A++A
Sbjct: 464 RYNMLGDYTAPQEDKDIR--TVLDG--VISKLSPSRVEYVRGCA-IRDTTVNEIAEAVEA 518

Query: 461 AKNADATVIVAGLDLSVE-----------------------AEGKDRVDLLLPGFQTELI 497
           A  ++  + V G   + +                        EG DR  L L G Q +L+
Sbjct: 519 AHRSEVIIAVVGGSSARDFKTSYQETGAAIADEKSISDMECGEGFDRATLTLLGKQQDLL 578

Query: 498 NKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG 557
           N +    K P+ +V +    +D  +A       ++L   YPG+ GG AIADV+FG YNP 
Sbjct: 579 NALKTTGK-PLIVVYIEGRPLDKVWASECA--DALLTASYPGQAGGDAIADVLFGDYNPA 635

Query: 558 GRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASS 617
           GRLP++    +  +IP       P N+     Y       +Y FGYGLSYT F+Y     
Sbjct: 636 GRLPVS-VPRSVGQIPVYYNKKAPRNH----DYVEMAASPLYGFGYGLSYTTFEYS---- 686

Query: 618 PKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEV 677
               D+++              T K PC              F    +V+N G  DG EV
Sbjct: 687 ----DLQI--------------TQKSPC-------------HFEVSFKVKNTGNYDGEEV 715

Query: 678 VMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGA 736
             +Y K    +    +KQ+  +ER F+  G+  ++ FT+   K L I+D +   ++ +G 
Sbjct: 716 AQLYLKDEYASVVQPLKQLKHFERFFLRKGEEKEILFTLTE-KDLSIIDRSMKRVVETGD 774

Query: 737 HTILVG 742
             I++G
Sbjct: 775 FRIMIG 780


>gi|333380551|ref|ZP_08472242.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332826546|gb|EGJ99375.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 854

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 164/421 (38%), Positives = 230/421 (54%), Gaps = 45/421 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D K P  +R  DL+ R+T+ EK+  +   + G+PRL +P Y   +E+LHGV   GR  
Sbjct: 30  YLDEKAPTHDRIMDLLSRLTIEEKISLLRATSPGIPRLQIPKYYHGNESLHGVVRPGR-- 87

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
                          T FP  I   + +N  L  KI   +S EAR  +N    G      
Sbjct: 88  --------------FTVFPQAIGLASMWNPELHHKIATAISDEARGRWNELEQGKLQTQR 133

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFWSP +N+ RDPRWGR  ET GEDPY+ G     +VRGLQ           D R
Sbjct: 134 FTDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGILGTAFVRGLQG---------DDPR 184

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LKI +  KH+AA    N E ++RF  + +++E+ ++E +   FEMCV +G  +S+M +Y
Sbjct: 185 YLKIVSTPKHFAA----NNEEHNRFVCNPQISERQLREYYFPAFEMCVKDGKSASIMSAY 240

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N +N +P  A+P LL + +R DW F+GY+VSDC     +V + K++  TKE A    +KA
Sbjct: 241 NAINDVPCTANPWLLTKVLRHDWGFNGYVVSDCGGPSLLVSAMKYVK-TKEAAATLSIKA 299

Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKN 360
           GLDL+CG D Y    + A  Q  ++ ADIDT+   +    M LG FD      Y  +  +
Sbjct: 300 GLDLECGDDVYMQPLLNAYNQYMVSRADIDTAAYRVLRARMHLGLFDDPDLNPYNKISPS 359

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + + +H +LA EAARQ IVLLKN+N  LPLN   +K++A+VG   NA  +  G+Y G P
Sbjct: 360 VVGSAEHKQLALEAARQSIVLLKNNNRTLPLNPKKVKSIAVVG--INAGNSEFGDYSGIP 417

Query: 421 C 421
            
Sbjct: 418 A 418



 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 97/313 (30%), Positives = 152/313 (48%), Gaps = 51/313 (16%)

Query: 449 QNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPV 508
           Q   M   A  A +  +  + V G++ ++E EG+DR D+ LP  Q E I ++      P 
Sbjct: 589 QRLDMYGEAGKAVRECEQVIAVLGINKTIEREGQDRYDIHLPADQEEFIREIYKV--NPN 646

Query: 509 TLVIMSAGA-VDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
            +V++ AG+ + IN+   +  + +I+   YPGE+GG A+A+V+FG+YNPGGRLP+T+Y +
Sbjct: 647 IVVVLVAGSSLAINWMDEH--VPAIVNAWYPGEQGGTAVAEVLFGEYNPGGRLPVTYYNS 704

Query: 568 NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
              +IP         +   GRTY++F G  +YPFGYGLSYT F YK              
Sbjct: 705 -LEEIP----SFDDYDITKGRTYQYFKGKPLYPFGYGLSYTTFAYK-------------- 745

Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-- 685
                  N  +  N               + K +F  E++N G+MDG EV  VY K P  
Sbjct: 746 -------NLQINDN-------------GNNIKVSF--ELKNTGRMDGDEVSQVYVKIPSS 783

Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVGEG 744
           GI    IK++ G++R  +  G +  V   +     L+  D+A  + +   G +  ++G  
Sbjct: 784 GIF-MPIKELKGFQRSTLKKGATKNVEINIRK-DLLRYWDDATETFITPKGEYEFMIGTS 841

Query: 745 VGGVSFPLQLNLN 757
              +       LN
Sbjct: 842 SQDIQLTKSFTLN 854


>gi|395802372|ref|ZP_10481625.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
 gi|395435613|gb|EJG01554.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
          Length = 745

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 236/781 (30%), Positives = 355/781 (45%), Gaps = 141/781 (18%)

Query: 28  LVERMTLPEKVQQM-GDLAY---GVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
           L+ +MTL EKV  + G+  +   GV RLG+P  +     L     I R   +P G   D 
Sbjct: 53  LISQMTLEEKVGMLHGNSMFANAGVKRLGIPELKMADGPLGVREEISRDNWAPAGWTNDF 112

Query: 84  EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRW 143
               AT +P      A++N  +    G ++  E RA            SP IN+VR P  
Sbjct: 113 ----ATYYPAGGALAATWNAEMAHTFGTSLGEELRA-----RDKDMLLSPAINMVRTPLG 163

Query: 144 GRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWE 203
           GR  E   EDP++  + A+  + GLQ+ +              + AC KHYAA   +N E
Sbjct: 164 GRTYEYMSEDPFLNKKIAVPLIVGLQEKD--------------VMACVKHYAA---NNQE 206

Query: 204 GNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIR 263
            N  F  D ++ E+ ++E ++  FE  V E    S+M +YN+  G   C +  +LN+ +R
Sbjct: 207 TNRDF-VDVQIDERTLREIYLPAFEASVKEAKAYSIMGAYNKFRGEYLCENDYMLNKILR 265

Query: 264 GDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD-------YYTNF 316
            +W F G +VSD  ++ +                A+ LK GLD++ G        +  + 
Sbjct: 266 DEWGFKGVVVSDWAAVHS---------------TAKSLKNGLDIEMGTPKPFNEFFLADK 310

Query: 317 TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAAR 376
            + AV+ G+++E +ID  ++ +  VL ++    G  +     K +I    H + A + A 
Sbjct: 311 LIVAVKSGEVSEKEIDLHVKRILRVLFQVKAMGGGER----AKGSIATEAHYQDAYKIAA 366

Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC-RYTSPMDGF---YA 432
           + IVLLKN+N ALPL    +K++A++G +A    A+ G   G    R  +P++G      
Sbjct: 367 EAIVLLKNENNALPLQLDGVKSIAVIGNNATKKNALGGFGAGVKTKREVTPLEGLKNRLP 426

Query: 433 YSKVINYAPGCADIVCQNN--------------------SMIPAAIDAAKNADATVIVAG 472
            S  INYA G  +   + N                    + +  A+DAAKN+D  +I AG
Sbjct: 427 SSVKINYAEGYLERYDKKNRGNLGNITANGPVTIDELDPAKVQEAVDAAKNSDVAIIFAG 486

Query: 473 LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-VDINFAKNNPKIKS 531
            +   E E  DR DL LP  Q ELI KV   A  P T+V+M AGA  DIN  + + K  +
Sbjct: 487 SNRDYETEASDRRDLHLPFGQEELIKKV--LAVNPKTIVVMIAGAPFDIN--EVSKKSSA 542

Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRT-- 589
           ++W  + G EGG A+ADVI GK NP G+LP T      + I     P    N+FPG    
Sbjct: 543 LVWSWFNGSEGGNALADVILGKVNPSGKLPWT------MPIALKDSPAHATNSFPGDKAV 596

Query: 590 ---------YKFFDGPVV---YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
                    Y++FD   V   YPFGYGLSYT F    A        K DK    +     
Sbjct: 597 NYAEGLLIGYRWFDTKNVAPLYPFGYGLSYTSFALDNA--------KTDKTSYAQ----- 643

Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI- 696
                        +DV          ++V+N GK+DG EVV +Y+       T   Q + 
Sbjct: 644 -------------NDV------IEVTVDVKNTGKVDGKEVVQLYTSKSDSKITRAAQELK 684

Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS-LLASGAHTILVGEGVGGVSFPLQLN 755
           G+++  + AG S KV   +   K L   D A+    +  G +TI +G     +   +Q+ 
Sbjct: 685 GFKKAEVKAGSSTKVTIKV-PVKELAYYDVASKKWTVEPGKYTIKLGTSSRDIKKEIQVT 743

Query: 756 L 756
           +
Sbjct: 744 V 744


>gi|261880245|ref|ZP_06006672.1| beta-glucosidase [Prevotella bergensis DSM 17361]
 gi|270333079|gb|EFA43865.1| beta-glucosidase [Prevotella bergensis DSM 17361]
          Length = 854

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 160/452 (35%), Positives = 241/452 (53%), Gaps = 42/452 (9%)

Query: 5   IKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALH 64
           + +K   FPY +  L   ERA DL  R+TL EK + M + +  +PRLG+P +EWWSEALH
Sbjct: 16  LPMKAQQFPYQNTDLSPKERAADLCSRLTLEEKSKIMQNGSPAIPRLGIPQFEWWSEALH 75

Query: 65  GVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
           G+   G                 AT FP  +   +S++++L +K+   VS E R      
Sbjct: 76  GIGRNGF----------------ATVFPITMGMASSWDDALLQKVFDAVSDEGRVKAQQA 119

Query: 125 N--------AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEY 176
                     GL+FW+PNIN+ RDPRWGR  ET GEDPY+  R  +  VRGLQ       
Sbjct: 120 KRSGTIKRYQGLSFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQG------ 173

Query: 177 HRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGD 235
              SDS+  K+ AC KH+A +    W   +R  F+   + E+D+ ET++  F+  V +GD
Sbjct: 174 --PSDSKYRKLLACAKHFAVHSGPEW---NRHTFNVEDLPERDLWETYLPAFKALVQQGD 228

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES-HKFLNDTKE 294
           V+ VMC+Y R++G P C + + L   +R +WN+ G +VSDC ++    +  H  ++    
Sbjct: 229 VAEVMCAYQRIDGQPCCGNNRFLKSILRNEWNYQGMVVSDCWAVPDFWKKGHHEVSPDAT 288

Query: 295 DAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP-- 352
            A A+ + +G D++CG  Y+N    AV+ G I EAD+D S+R L      LG FD     
Sbjct: 289 HASAKAVLSGTDVECGSDYSNLPE-AVRAGIIKEADVDVSVRRLLEARFALGDFDPDELV 347

Query: 353 QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAM 412
            +  + ++ + +  H +LA + AR+ +VLL+N N  LPL     K + +VG +A  +  M
Sbjct: 348 PWTKISESVVASKAHKQLALDMARKSMVLLQN-NDILPLKRSGQK-IVVVGANAIDSTMM 405

Query: 413 IGNYEGTPCRYTSPMDGFYAYSKVINYAPGCA 444
            GNY G P +  + + G    S  + + PGC 
Sbjct: 406 WGNYSGYPTQTVTILQGLQTKSDQVTFIPGCG 437



 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 93/321 (28%), Positives = 146/321 (45%), Gaps = 61/321 (19%)

Query: 433 YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----------GK 482
           Y +V + A    DI     S     +     AD  + V G+   +E E          G 
Sbjct: 568 YVQVTSLAMIKFDITHTGLSTPQDIVRKTAGADVVIFVGGISPRLEGEEMEVSDPGFKGG 627

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
           DR  + LP  Q E+I  +++A +    +V ++     I     + ++ +IL   YPGE+G
Sbjct: 628 DRTTIELPQAQREVIKALSEAGR---RIVFVNCSGSAIALTPESQRVDAILQAWYPGEQG 684

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
           G A+ADV+FG YNP G+LP+T+Y+ +        +P        GRTY++F    ++PFG
Sbjct: 685 GTAVADVLFGDYNPSGKLPVTFYKND------AQLPDFLDYRMAGRTYRYFKETPLFPFG 738

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
           YGLSYTQF                   Q R IN  V                        
Sbjct: 739 YGLSYTQFTIG----------------QPRYINNQV------------------------ 758

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
           Q+ V N GK DG EVV VY +    A   IK + G++RV +  G++ +V  ++   +S +
Sbjct: 759 QVSVSNTGKRDGDEVVQVYIRRTDDAAGPIKTLRGFQRVSLKVGETKQVSVSL-PRESFE 817

Query: 723 IVDNAANSL-LASGAHTILVG 742
             D ++N++ +  G + ++VG
Sbjct: 818 WWDASSNTMRVIPGNYEVMVG 838


>gi|333379224|ref|ZP_08470948.1| hypothetical protein HMPREF9456_02543 [Dysgonomonas mossii DSM
           22836]
 gi|332885492|gb|EGK05741.1| hypothetical protein HMPREF9456_02543 [Dysgonomonas mossii DSM
           22836]
          Length = 745

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 223/757 (29%), Positives = 354/757 (46%), Gaps = 103/757 (13%)

Query: 27  DLVERMTLPEKVQQMGDLAYGVPRLGLPLYEW---------------------WSEALHG 65
           DL+ RMTL EK+ Q      G   +  P  +                      ++ +L  
Sbjct: 37  DLLRRMTLEEKIGQTVLYTSGYDVITGPTVDPNYKEYLKKGMVGGIFNAVGADYTRSLQK 96

Query: 66  VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
           ++    R   P    +D      T FP  +  + S++    ++  +  ++EA A      
Sbjct: 97  IAVEETRLGIPLIFGYDVIHGQRTIFPIPLAESCSWDLEAMERSARIAASEATA------ 150

Query: 126 AGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
            G+ + ++P +++ RDPRWGRV E  GED Y+    A   V+G Q         D+ S  
Sbjct: 151 EGINWIYAPMVDISRDPRWGRVAEGAGEDVYLGSLIAAARVKGFQG--------DNLSAV 202

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
             + AC KHYAAY      G D    D  + E  +  T++ PF+  ++ G   ++M S+N
Sbjct: 203 NTVVACVKHYAAYGA-TMAGRDYNTVDMSLNE--LWNTYLPPFKAALDAG-CGTIMTSFN 258

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
            +NGIP   +  LL   +R  WNF+G++V+D  SI  ++  H + ND K  A    + AG
Sbjct: 259 DLNGIPATGNKYLLKDILRDKWNFNGFVVTDYTSINEMI-PHGYANDEKHSA-EIAMNAG 316

Query: 305 LDLDC-GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQY--KNLGKNN 361
           +D+D  G  Y N     +++GK++E D+  + R +  +  +LG F+   +Y   N  K +
Sbjct: 317 VDMDMQGGVYMNHLKTLIEEGKVSEKDVTEAARAILKIKYKLGLFEDPYRYCDANREKTD 376

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           I  P + E A + AR+ +VLLKND   LPL     K +AL+GP       ++G +     
Sbjct: 377 ILTPANKEAARDMARKSMVLLKNDKQTLPLKEN--KRVALIGPLVKDKYEILGCWSAMGN 434

Query: 422 RYTSPM---DGFYAY--SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
           R T P+   DG         I+YA GC DI  ++      A+  A  +D  V+V G   +
Sbjct: 435 RDTIPVSVYDGLVEAIGKDKISYAKGC-DIQSEDTKGFAEAVRVASASDVVVMVMGEFHN 493

Query: 477 VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
           +  E   R +L LPG Q +L+  +    K PV LV+M+   + IN+ K+N  + +IL   
Sbjct: 494 MSGENNSRTNLSLPGVQVDLLKAIKKTGK-PVVLVLMNGRPLTINWEKDN--LDAILEAW 550

Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPY------TSMPLRPVNNFPGRTY 590
           +PG  GG AIADV+ GKYNP G+L +T +  N  +IP       T  P  P  N P   Y
Sbjct: 551 FPGTMGGAAIADVLTGKYNPSGKLTMT-FPQNVGQIPLFYNHKNTGRPYDP--NVPQFAY 607

Query: 591 --KFFD--GPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
             +++D     +YPFGYGLSYT F Y         D+ L   +                 
Sbjct: 608 GSRYWDVSNEPLYPFGYGLSYTTFTYS--------DLTLSSKEI---------------- 643

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAA 705
                    K+      +++ N G+ DG EVV +Y++   G     +K++ G+++VF+ A
Sbjct: 644 --------TKENPLKVSVKLTNSGEYDGEEVVQLYTRDLVGSVTRPVKELKGFKKVFLKA 695

Query: 706 GQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           G+S  + FT+ +   L+  ++    +   G   + VG
Sbjct: 696 GESKVIDFTL-SVNDLRFYNSQLEYVYEPGDFHLFVG 731


>gi|325103214|ref|YP_004272868.1| glycoside hydrolase family protein [Pedobacter saltans DSM 12145]
 gi|324972062|gb|ADY51046.1| glycoside hydrolase family 3 domain protein [Pedobacter saltans DSM
           12145]
          Length = 866

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 161/434 (37%), Positives = 231/434 (53%), Gaps = 43/434 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +P+ D +LP+ +R  DL++R+T+ EKV  M D++  + RLG+  Y WW+EALHGV+  G 
Sbjct: 24  YPFQDNRLPFDKRVDDLLQRLTVEEKVLLMQDVSRPIERLGIKQYNWWNEALHGVARAGL 83

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
                           AT FP  I   ASF+      +   VS EARA +N   +     
Sbjct: 84  ----------------ATVFPQPIGMAASFDRDALFNVFNAVSDEARAKHNYHLSQGSYG 127

Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
              GLT W+P IN+ RDPRWGR +ET GEDPY+     +  V+GLQ     +Y       
Sbjct: 128 RYEGLTMWTPTINIFRDPRWGRGIETYGEDPYLTAVMGVQAVKGLQGPSNGKYD------ 181

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDS-RVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             K+ AC KH+A +    W   +R  FD+  + ++D+ ET++  FE  V E  V  VMC+
Sbjct: 182 --KLHACAKHFAVHSGPEW---NRHSFDAANIKQRDLYETYLPAFEALVKEAKVQEVMCA 236

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
           YNR  G P C   +LL Q +R  W F G +V+DC +I    +  +HK   D    + A V
Sbjct: 237 YNRFEGDPCCGSDRLLQQILRKKWGFEGIVVADCGAIADFFKENAHKTHPDAASASAAAV 296

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLG 358
             +G DLDCG  Y   T  AV++G I E DID S+R L +   RLG  D      +  + 
Sbjct: 297 Y-SGTDLDCGSSYKALTE-AVKKGLIEEKDIDVSVRRLLMARFRLGEMDDQSLVPWSKIS 354

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
            N + +  H ++A + AR+ I LL+N N  LPL +G +K +A++GP+A  +    GNY G
Sbjct: 355 YNVVASKAHNQIALDMARKSITLLQNKNNILPLKSGGLK-IAVMGPNAQDSVMQWGNYNG 413

Query: 419 TPCRYTSPMDGFYA 432
           TP    + ++G  A
Sbjct: 414 TPANTITILEGIKA 427



 Score =  129 bits (325), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 93/310 (30%), Positives = 142/310 (45%), Gaps = 53/310 (17%)

Query: 445 DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQT 494
           DI  +  + I  +I     AD  V V G+  S+E E          G DR D+ LP  Q 
Sbjct: 584 DIGYKEEANINKSIKNIAGADLVVFVGGISPSLEGEEMGVKLPGFRGGDRTDIQLPTIQR 643

Query: 495 ELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKY 554
           + +  + +A K    ++ ++     I  A      ++I+   YPG+ GG+A+ADV+FGKY
Sbjct: 644 QFVKALKEAGK---RVIFINCSGSPIGLADEMANSEAIVQAWYPGQAGGQAVADVLFGKY 700

Query: 555 NPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKV 614
           NP GRLPIT+Y         T +P     +  GRTY++     ++PFGYGLSYTQF+Y  
Sbjct: 701 NPSGRLPITFYRDT------TQLPDFENYDMAGRTYRYMQDKPLFPFGYGLSYTQFQY-- 752

Query: 615 ASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDG 674
             +P                             +L   V          + V N GK  G
Sbjct: 753 -GNP-----------------------------ILNQQVITNGQTIQLTVPVTNTGKRSG 782

Query: 675 SEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LA 733
            EVV VY +  G A   +K +  + R+   AGQ+ +V F +   K L+  +  + ++ + 
Sbjct: 783 DEVVQVYLRKKGDATGPVKTLRDFRRLSFNAGQTQQVVFKITP-KQLEWWNEQSKAMQVQ 841

Query: 734 SGAHTILVGE 743
           SG + +LVG+
Sbjct: 842 SGDYELLVGK 851


>gi|304406707|ref|ZP_07388362.1| glycoside hydrolase family 3 domain protein [Paenibacillus
           curdlanolyticus YK9]
 gi|304344240|gb|EFM10079.1| glycoside hydrolase family 3 domain protein [Paenibacillus
           curdlanolyticus YK9]
          Length = 733

 Score =  276 bits (707), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 222/766 (28%), Positives = 366/766 (47%), Gaps = 108/766 (14%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGV----PRLGLPLYEWWSEALH----GVSFI---GR 71
           E+A+ L+ +MTL +KV QM    +G     P  G   ++   E +     G  F      
Sbjct: 22  EQAEQLLSKMTLEDKVGQMTQFDWGYNPINPETGESEHDLIIELIRQGKVGSIFNLSGAA 81

Query: 72  RTNSPPG---THFDSEVPGA----------TSFPTVILTTASFNESLWKKIGQTVSTEAR 118
             N   G    H + ++P            T FP  +   A++N  + ++     STEA 
Sbjct: 82  EANELQGLIEQHTELKIPMVIGRDVIHGYRTVFPIPLAMAAAWNPEVARQTSAAASTEA- 140

Query: 119 AMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHR 178
               L +     ++P I+V RDPRWGR+ E+ GEDPY+   Y   +V G Q         
Sbjct: 141 ----LTDGVTWVFAPMIDVSRDPRWGRIAESIGEDPYLTAAYGRAWVEGSQ--------- 187

Query: 179 DSDSRPLKISACC-KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVS 237
             D+ P + +A C KH+A Y +    G D    D  ++++++++  + PF+  V  G +S
Sbjct: 188 -IDNGPGRATASCPKHFAGYGMAE-AGRDYNTVD--LSDRELRDIILPPFQDAVEAGALS 243

Query: 238 SVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV 297
            +M S+N +NGIP CA+  LL   +R +W F G + SD +++  ++      N+  E+A 
Sbjct: 244 -IMASFNEINGIPACANEYLLKTILRDEWGFEGVVASDYNALVELIVHGVAANE--EEAC 300

Query: 298 ARVLKAGLDLDC-GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKN 356
              + AG D+D     +T      V+ G++ E+ +D S+R +  + ++LG  + S    +
Sbjct: 301 EMTVLAGCDMDMHSGIFTRQLPKLVRAGRVPESVVDDSVRRILAMKIKLGLLEQSK--SD 358

Query: 357 LGKNNICNP---QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
           + ++    P   +++ELA EAARQ IVLL+N    LPL+     ++A++GP A+     +
Sbjct: 359 VSQSAATQPLKSEYVELAREAARQSIVLLQNKEQVLPLSKAG-ASIAVIGPLADNATDPL 417

Query: 414 GNY--EGTPCRYTSPMDGFY---AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
           G +  +G      + ++G     A    I YA GC DI   +     AA++AA+++D  V
Sbjct: 418 GCWALDGRSDEVVTALEGIRQAAAEGTSIRYAQGC-DIDSDSEEGFEAALEAARSSDVVV 476

Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
           ++ G   ++  E + R  L LPG Q  L+  VA   K P+  VI+S     + FA    +
Sbjct: 477 MLLGESATMSGESRSRAALDLPGKQRALVEAVAKLGK-PIVAVILSGRP--LTFAWLPEQ 533

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP---YTSMPLRPVNNF 585
             +I+   + G + G AIADV+FG +NP GRLP+T +  N  +IP   Y     RP    
Sbjct: 534 ASAIVQAWHLGVQSGNAIADVLFGDFNPSGRLPVT-FPQNVGQIPIYHYRKKTGRP---- 588

Query: 586 PGRTYK--FFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTN 641
           P   Y   + D     +YPFGYGL+YT+F+Y    + KS                ++G  
Sbjct: 589 PAGAYSSYYIDSTTEPLYPFGYGLTYTEFEYGAIQTSKS----------------SIGA- 631

Query: 642 KPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYER 700
                          D +    + + N+G + G EVV  Y +    + T  +K+++ + +
Sbjct: 632 ---------------DEQLDVTVSIRNVGNLAGEEVVQCYVRDEVASVTQPLKRLVAFRK 676

Query: 701 VFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVG 746
           V +AAG+S  V FT+ A + L I+D      +  G  T+ +G   G
Sbjct: 677 VKVAAGESVDVTFTIGAAE-LAILDKHMKRTVEPGDFTLWIGPSAG 721


>gi|225873993|ref|YP_002755452.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
 gi|225791521|gb|ACO31611.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
          Length = 894

 Score =  276 bits (706), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 163/450 (36%), Positives = 236/450 (52%), Gaps = 52/450 (11%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y +  LP   RA+DLV RMTL EK  Q+ + A  +PRL +P Y WWSEALHGV+      
Sbjct: 39  YLNPSLPPVVRARDLVSRMTLKEKASQLVNAARAIPRLKVPAYNWWSEALHGVA------ 92

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                      V G T FP  I   A+F+     ++   + TE R +Y            
Sbjct: 93  -----------VNGTTEFPEPIGLGATFDVPAIHEMAVDIGTEGRVVYEENEKDGSSKIF 141

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GL FW+PN+N+ RDPRWGR  ET GEDP++ G+  + +V G+Q  +  +Y+R       
Sbjct: 142 HGLDFWAPNLNIFRDPRWGRGQETYGEDPFLTGKMGVAFVSGMQG-DNPKYYR------- 193

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
            + A  KH+   D+ +     R   D  V+  D  +T+   F   + +G   SVMCSYN 
Sbjct: 194 -VIATPKHF---DVHSGPEPTRHFADVDVSLHDQLDTYEPAFRAAIMQGHADSVMCSYNA 249

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           +NG P CA+   L   +RG W F GY+VSDCD++  I   HK+   T   A A  ++ G+
Sbjct: 250 INGQPACANQFTLQHQLRGAWGFKGYVVSDCDAVHDIYSGHKY-RPTLAQAAAISMERGM 308

Query: 306 DLDCGDY--------YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYK 355
           D DC D+        Y  + + AVQQG +++  +DT+L  L+   ++LG FD  G   Y 
Sbjct: 309 DNDCADFAQPKGDDDYKAY-IDAVQQGYLSQQAMDTALVRLFTARIKLGLFDPKGMDPYA 367

Query: 356 NLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGN 415
           +   + + +P H   A + A + +VLLKND G LPL  G++ ++A+VGP A+ T  ++GN
Sbjct: 368 DTPHSELNSPAHRAYARKLADESMVLLKND-GTLPLKPGSVHSIAVVGPLADQTAVLLGN 426

Query: 416 YEGTPCRYTSPMDGFYAY--SKVINYAPGC 443
           Y G P    S ++G  A   +  I Y PG 
Sbjct: 427 YNGVPTHTVSFLEGLRAEYPNTKITYVPGT 456



 Score =  125 bits (314), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 98/305 (32%), Positives = 149/305 (48%), Gaps = 52/305 (17%)

Query: 450 NNSMIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINK 499
           +N+  PAA+ AAK AD  + V G+   +E E          G DR +L +P  +  L+  
Sbjct: 609 DNTPSPAAVAAAKKADVVIAVVGITSKLEGEEMPVDQPGFLGGDRTNLQMPEPEEALVEA 668

Query: 500 VADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGR 559
           VA   K PV +V+M+  A+ +N+   +    ++L   Y GEEGG AIAD + GK +P GR
Sbjct: 669 VAKTGK-PVVVVLMNGSALAVNWISQH--ANAVLEAWYSGEEGGAAIADTLSGKNDPAGR 725

Query: 560 LPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPK 619
           LP+T+Y++         +P     +   RTY++F G  +YPFGYGLSYT F+Y   S P 
Sbjct: 726 LPVTFYKS------VNQLPNFEDYSMENRTYRYFKGKPLYPFGYGLSYTTFRYSDLSIPH 779

Query: 620 SVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVM 679
           +                TV   +P  A+                  V N GK+ G EVV 
Sbjct: 780 A----------------TVDAGQPVEASA----------------TVTNTGKVAGDEVVQ 807

Query: 680 VYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
           +Y K P + G     + G++R+ +  GQS +V F +   + L +V      ++A G +T+
Sbjct: 808 LYLKFPKVDGAPDIALRGFQRIHLEPGQSQQVHFELKK-RDLSMVTALGQIIVAQGDYTL 866

Query: 740 LVGEG 744
            +G G
Sbjct: 867 SIGGG 871


>gi|285018984|ref|YP_003376695.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
 gi|283474202|emb|CBA16703.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
           PC73]
          Length = 904

 Score =  276 bits (706), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 166/421 (39%), Positives = 228/421 (54%), Gaps = 43/421 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA  LV +MT  EK+ Q  + A  +PRLG+P YEWWSE LHG++  G            
Sbjct: 54  DRATALVAKMTRAEKIAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGE----------- 102

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA---------GLTFWSP 133
                AT FP  I   AS+N  L   +G   STEARA +NL            GLT WSP
Sbjct: 103 -----ATVFPQAIGLAASWNTDLLHAVGTVTSTEARAKFNLAGGPGKNHARYGGLTIWSP 157

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDPY+ G+ A+ ++ GLQ         D  + P  I A  KH
Sbjct: 158 NINIFRDPRWGRGMETYGEDPYLTGQLAVGFIHGLQG--------DDPTHPRTI-ATPKH 208

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +   +   + R  FD  V+  D + T+   F   + EG   SVMC+YN ++GIP CA
Sbjct: 209 LAVH---SGPESGRHGFDVDVSPHDFEATYSPAFRAAIVEGHAGSVMCAYNALHGIPACA 265

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              L++  +RG+W F G++VSDCD+I  + + H +       + A  LKAG DL+CG  Y
Sbjct: 266 ADWLIDGRVRGNWGFKGFVVSDCDAIDDMTQFH-YYRADNAGSAAAALKAGHDLNCGYAY 324

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
            +    A+ +G+  EA +D SL  L+    RLG      +  Y  LG  +I +P H  LA
Sbjct: 325 RDLGT-ALDRGEAEEAMLDRSLVRLFAARYRLGELQPRSKDPYARLGAKDIDSPTHRALA 383

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA+Q +VLL+N N  LPL  G    LA++GP+A+A  A+  NY+GT     +P+ G  
Sbjct: 384 LQAAQQSLVLLQNRNDTLPLRPG--LRLAVIGPNADALAALEANYQGTSVAPVTPLQGLR 441

Query: 432 A 432
           A
Sbjct: 442 A 442



 Score =  137 bits (344), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 91/278 (32%), Positives = 139/278 (50%), Gaps = 45/278 (16%)

Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           V G +L ++    +G DR DL LP  Q  L+ + A A+  P+ +V+MS  AV +N+AK +
Sbjct: 646 VEGEELRIDVPGFDGGDRNDLSLPAAQQALLER-AKASGKPLIVVLMSGSAVALNWAKQH 704

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
               +IL   YPG+ GG AIA  + G  NPGGRLP+T+Y +     PY S  ++      
Sbjct: 705 --ADAILAAWYPGQSGGTAIAQALAGDINPGGRLPVTFYRSTKDLPPYVSYDMK------ 756

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY++F G  ++PFGYGLSYT F Y   ++P+     L    Q  D  +   T      
Sbjct: 757 GRTYRYFKGEALFPFGYGLSYTHFAY---TAPQLSSTTL----QAGDTLHVTTT------ 803

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
                              V N G   G EVV VY + P  A + ++ ++G++RV +  G
Sbjct: 804 -------------------VRNTGARAGDEVVQVYLQYPPRAQSPLRALVGFQRVSLQPG 844

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
           ++  + F +   + L  VD +    + +G + + VG G
Sbjct: 845 EARTLSFALEP-RQLSDVDRSGQRAVEAGDYRLFVGGG 881


>gi|383125190|ref|ZP_09945844.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
 gi|251838523|gb|EES66609.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
          Length = 853

 Score =  276 bits (706), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 161/421 (38%), Positives = 232/421 (55%), Gaps = 45/421 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y +   P  ER  DL+ R+T+ EK+  +   + G+PRLG+  Y   +EALHGV   GR  
Sbjct: 30  YKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-- 87

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
                          T FP  I   A++N  L K++   +S EARA +N  + G      
Sbjct: 88  --------------FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQ 133

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFWSP +N+ RDPRWGR  ET GEDP++ G     +V GLQ           D  
Sbjct: 134 FSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQG---------DDPH 184

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LKI +  KH+AA    N E ++RF  + +++E+ ++E +   FEMCV EG  +S+M +Y
Sbjct: 185 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAY 240

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N +N +P   +P LL + +R DW F GY+VSDC     +V +HK++  TKE A    +KA
Sbjct: 241 NALNDVPCTLNPWLLQKVLRQDWGFQGYVVSDCGGPALLVNAHKYVK-TKEAAATLSIKA 299

Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           GLDL+CG D Y    + A +Q  +++ADID++   +    M+LG FD   +  Y  +  +
Sbjct: 300 GLDLECGDDVYDGPLLNAYKQYMVSDADIDSAAYHVLTARMKLGLFDSGERNPYTKISPS 359

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            I + +H ++A +AARQ IVLLKN    LPLN   +K++A+VG   NA K   G+Y G P
Sbjct: 360 VIGSKEHQQIALDAARQCIVLLKNQKNRLPLNADKLKSIAVVG--INAGKCEFGDYSGAP 417

Query: 421 C 421
            
Sbjct: 418 V 418



 Score =  136 bits (343), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 95/300 (31%), Positives = 152/300 (50%), Gaps = 49/300 (16%)

Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-V 518
           A +  +  V V G++ S+E EG+DR D+ LP  Q E + ++      P  +V++ AG+ +
Sbjct: 600 AVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAGSSL 657

Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
            IN+   +  I +I+   YPGE+GG A+A+V+FG YNP GRLP+T+Y++   ++P    P
Sbjct: 658 AINWMDEH--IPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS-LDELP----P 710

Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV 638
               +   GRTYK+F G V+YPFGYGLSY+ F Y               D Q +D    V
Sbjct: 711 FDDYDITKGRTYKYFKGDVLYPFGYGLSYSSFTY--------------SDLQVKD---GV 753

Query: 639 GTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG-THIKQVIG 697
           G                   + T    ++N GK +G EV  VY + P   G   +K++ G
Sbjct: 754 G-------------------EVTVSFRLKNTGKRNGDEVAQVYVRIPETGGIVPLKELKG 794

Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVD-NAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
           + RV + +G+S +V   +N  + L+  D      ++  GA  ++VG     +     ++L
Sbjct: 795 FRRVPLKSGESRRVEIKLNK-EQLRYWDVEKGQFVVPKGAFDVMVGASSKDIRLQTVIDL 853


>gi|365121914|ref|ZP_09338824.1| hypothetical protein HMPREF1033_02170 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363643627|gb|EHL82934.1| hypothetical protein HMPREF1033_02170 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 1073

 Score =  276 bits (705), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 162/434 (37%), Positives = 236/434 (54%), Gaps = 45/434 (10%)

Query: 1   RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
           +  S  V   ++P+ D  L + ER KDL+ R+ + EK+  +   +  +PRLG+  Y   +
Sbjct: 16  QISSFAVAQINYPFRDTTLSHHERIKDLLSRLNVSEKISLLRATSPAIPRLGIDKYYHGN 75

Query: 61  EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
           EALHGV   G+                 T FP  I   + +N    +++   +S EAR  
Sbjct: 76  EALHGVVRPGK----------------FTVFPQAIGLASMWNPDFLQEVSTAISDEARGR 119

Query: 121 YNLGNAG----------LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQD 170
           +N  N G          LTFWSP IN+ RDPRWGR  ET GEDP++ G     +VRGLQ 
Sbjct: 120 WNELNQGKDQTAGASDLLTFWSPTINMARDPRWGRTPETYGEDPFLTGTLGTAFVRGLQG 179

Query: 171 VEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMC 230
                    +D + +K+ +  KH+AA    N E ++R   ++ ++E+D++E +   FE C
Sbjct: 180 ---------NDPKYIKVVSTPKHFAA----NNEEHNRASGNAVISERDLREYYFPAFEKC 226

Query: 231 VNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLN 290
           + EG   SVM +YN VNGIP   +  LL   +R DW F GY+VSDC + + IV  H ++ 
Sbjct: 227 IKEGQAQSVMSAYNAVNGIPCTLNKWLLTDVLRDDWGFDGYVVSDCSAPEYIVSQHHYV- 285

Query: 291 DTKEDAVARVLKAGLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
           DT E+A +  +KAGLDL+CGD  Y    + A  +G +  ++ID++   +    MRLG FD
Sbjct: 286 DTYEEAASLCIKAGLDLECGDNVYITPLLNAYNRGMVTMSEIDSAAYRVLRGRMRLGLFD 345

Query: 350 GSPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
              +  Y  +  + +   +H ELA EAARQ +VLLKND   LP+ T NIK++A+VG   N
Sbjct: 346 DPNENPYNKISPSIVGCEKHRELALEAARQSLVLLKNDKDMLPIQTDNIKSIAVVG--IN 403

Query: 408 ATKAMIGNYEGTPC 421
           A     G+Y GTP 
Sbjct: 404 AANCEFGDYSGTPV 417



 Score =  130 bits (327), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 96/298 (32%), Positives = 147/298 (49%), Gaps = 50/298 (16%)

Query: 450 NNSMIPAAIDAA---KNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG 506
           + S++ A  DA    + +D T+ V G+D ++E EG+DR  + LP  Q   I +   A   
Sbjct: 726 SESLLDAYGDAGEIIRGSDLTIAVLGIDRTIEREGQDRSTIELPEDQQIFIEEAYKA--N 783

Query: 507 PVTLVIMSAGA-VDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
           P T+V++ AG+ + IN+   N  I ++L   YPGE+GG A+A+ +FG YNPGGRLP+T+Y
Sbjct: 784 PNTVVVLVAGSSLAINWIDQN--IPAVLDAWYPGEQGGTAVAEALFGDYNPGGRLPLTFY 841

Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
            +      +    +R  NN   RTY +F+G  +YPFGYGLSYT F Y      + +D+  
Sbjct: 842 NSLSDLPAFDDYNVR--NN---RTYMYFEGKPLYPFGYGLSYTDFAY------RGLDVTQ 890

Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
           D++                                T +  V N G  DG EV  VY + P
Sbjct: 891 DEEN------------------------------VTVKFFVSNTGNYDGDEVAQVYIQFP 920

Query: 686 GIAGT-HIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
               T  +KQ+ G++RV I+ GQ  ++   +   +     +N +      G +  LVG
Sbjct: 921 DQGTTLPLKQLKGFKRVHISKGQETEITVRIPKKELRLWSENNSEFYTPEGNYIFLVG 978


>gi|399030621|ref|ZP_10730998.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
 gi|398071229|gb|EJL62496.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
          Length = 876

 Score =  276 bits (705), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 162/454 (35%), Positives = 240/454 (52%), Gaps = 53/454 (11%)

Query: 8   KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
           K  +F + +  L + +R  DLV R+TL EKV QM + +  +PRL +P Y+WW+E LHGV+
Sbjct: 25  KQKEFLFQNPDLSFEKRVDDLVNRLTLEEKVSQMLNSSPAIPRLDIPAYDWWNETLHGVA 84

Query: 68  FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--- 124
                      T F       T +P  I   A+F+++   K+    + E RA+YN     
Sbjct: 85  ----------RTPFK-----VTVYPQAIAMAATFDKNSLYKMADFSALEGRAIYNKAVES 129

Query: 125 ------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHR 178
                   GLT+W+PNIN+ RDPRWGR  ET GEDPY+ G    ++V+GLQ         
Sbjct: 130 GRTNERYLGLTYWTPNINIFRDPRWGRGQETYGEDPYLTGVLGDSFVKGLQG-------- 181

Query: 179 DSDSRPLKISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDV 236
             D + LK +AC KHYA +      G +  R  FD  VT  ++ +T++  F+  V E  V
Sbjct: 182 -DDPKYLKAAACAKHYAVH-----SGPEPLRHTFDVDVTPYELWDTYLPAFQKLVTESKV 235

Query: 237 SSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDA 296
           + VMC+YN     P CA   L+   +R  W F GY+ SDC +I    ++HK   D  E A
Sbjct: 236 AGVMCAYNAFRTQPCCASDILMTDILRNQWKFEGYVTSDCWAIDDFFKNHKTHPDA-ESA 294

Query: 297 VARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QY 354
            A  +  G D+DCG       + AV+ GKI+E  ID S++ L+++  RLG FD     +Y
Sbjct: 295 SADAVFHGTDIDCGTDAYKALVQAVKDGKISEKQIDISVKRLFMIRFRLGMFDPVEMVKY 354

Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
                + + N +H   A + ARQ IVLL+N+N  LPL +  +K + ++GP+ +   A++G
Sbjct: 355 AQTPTSVLENDEHKAHALKMARQSIVLLRNENKTLPL-SKKLKKIVVLGPNVDNAIAILG 413

Query: 415 NYEGTPCRYTSPMDGF---------YAYSKVINY 439
           NY GTP + T+ ++G            Y K +N+
Sbjct: 414 NYNGTPSKLTTVLEGIKEKVGSNTEVVYEKAVNF 447



 Score =  125 bits (315), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 92/296 (31%), Positives = 139/296 (46%), Gaps = 54/296 (18%)

Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
           ++  K+ADA V V G+   +E E          G DR  +LLP  QT+L+  +    K P
Sbjct: 602 VNRVKDADAFVFVGGISPQLEGEEMKVNFPGFKGGDRTSILLPKIQTDLMKALKTTGK-P 660

Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
           +  V+M+  A+ I +   N  I +I    Y G+  G A+ADV+FG YNP GRLP+T+Y++
Sbjct: 661 IVFVMMTGSAIAIPWEAEN--IPAIANAWYGGQAAGTAVADVLFGNYNPAGRLPVTFYKS 718

Query: 568 NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
           +    P+    +        RTY++F G  +Y FGYGLSYT FKY       SV      
Sbjct: 719 DADLSPFVDYKM------DNRTYRYFKGKPLYGFGYGLSYTTFKYDNLKIAPSV------ 766

Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI 687
                                    +K K+   T  ++V N GK+ G EVV +Y      
Sbjct: 767 -------------------------IKGKNVPIT--VKVTNTGKVSGEEVVQLYVINQNT 799

Query: 688 A-GTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           A    +K + G+ER+ + AG+S  + FT+ + + L  +    N    +G   I +G
Sbjct: 800 AIKAPLKTLKGFERISLKAGKSKTITFTL-SPEDLSYITAEGNHQQYNGKIKIAIG 854


>gi|333377782|ref|ZP_08469515.1| hypothetical protein HMPREF9456_01110 [Dysgonomonas mossii DSM
           22836]
 gi|332883802|gb|EGK04082.1| hypothetical protein HMPREF9456_01110 [Dysgonomonas mossii DSM
           22836]
          Length = 727

 Score =  275 bits (704), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 223/781 (28%), Positives = 361/781 (46%), Gaps = 114/781 (14%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +P+ +  L   +R  +L+  MT+ EK+  +     GVPRLG+      SE LHG++  G 
Sbjct: 24  YPFQNTSLSDEKRLDNLLSIMTIDEKINALS-TNLGVPRLGI-RNTGHSEGLHGMALGG- 80

Query: 72  RTNSPPGT---------HFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEAR---- 118
                PG              +V   T+FP       +++  L KK+    +TE R    
Sbjct: 81  -----PGNWGGFKMVNYQRVPDVYPTTTFPQAYGLGETWDTELIKKVADIEATEIRYYTQ 135

Query: 119 -AMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYH 177
              Y  G  GL   +PN ++ RDPRWGR  E+ GEDP++V   A+ +++GLQ        
Sbjct: 136 NERYTKG--GLVMRAPNADLARDPRWGRTEESFGEDPFLVSEMAVAFIKGLQ-------- 185

Query: 178 RDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVS 237
              + R  K ++  KH+ A   ++   +   +FD+R+      E +  PF   + +G   
Sbjct: 186 -GENPRYWKSASLMKHFLANSNEDGRDSTSSNFDNRL----FHEYYSYPFRKGIEKGGSQ 240

Query: 238 SVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV 297
           + M +YN  N IP    P L  + IR DWNF G I +D  ++  ++++HK    T  +  
Sbjct: 241 AFMAAYNSWNEIPMTIHPIL--KKIRKDWNFKGIICTDGGALDLLIKAHKTF-PTHTEGS 297

Query: 298 ARVLKAGLDLDCGDYYTNF---TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ- 353
           A ++KAG+    G +  NF      A+++G + EA+ID ++R  + + ++LG  DG    
Sbjct: 298 AAIVKAGV----GQFLDNFRPYIYQALEKGMLTEAEIDKAIRGNFYIALKLGLLDGDQTK 353

Query: 354 --YKNLGKNNIC----NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
             Y ++G  +      N +  +       + +VLLKN+   LPLN GNIK +A++GP AN
Sbjct: 354 LPYAHIGVTDTVSVWRNKEIQDFVRLVTAKSVVLLKNEKKLLPLNKGNIKRIAVIGPRAN 413

Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADAT 467
             + ++  Y GTP    S + G      + N      +++ ++++ I  A  AA+ AD  
Sbjct: 414 --EVLLDWYSGTPPYTVSILQG------IKNAVGNNVEVIYESSNEIDKAYLAAQKADIA 465

Query: 468 VIVAGLDL----------SVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA 517
           ++  G  +           V ++G++ VD      + E + K+   A     +V++S+  
Sbjct: 466 IVCVGNHVYGTDPKWKYSPVPSDGREAVDRKALSLEQEDLVKIVHKANPNTVMVLVSSFP 525

Query: 518 VDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSM 577
             IN+++ N  I +IL +    +E G  +ADVIFG YNP GR   TW ++    +P    
Sbjct: 526 FAINWSQEN--IPAILHITNNSQELGNGLADVIFGNYNPAGRTNQTWVKS-IADLP---- 578

Query: 578 PLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
           P+   +   GRTY +     +YPFGYGLSYT F Y         D+ L      +  N  
Sbjct: 579 PMMDYDIRNGRTYMYAKEKPLYPFGYGLSYTNFTYS--------DMALSSSALSKGKNLK 630

Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVI 696
           V  N                        V+N G MDG EV  +Y S P       IKQ+ 
Sbjct: 631 VSVN------------------------VKNTGDMDGEEVAQLYVSFPQSKVVRPIKQLK 666

Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS-LLASGAHTILVGEGVGGVSFPLQLN 755
           G++R+ I  G+S    FT++A   L   DN  +S ++      IL+G     +    ++ 
Sbjct: 667 GFDRISIKKGESKTFEFTLSA-DDLAYWDNDKDSFVIEPETVNILIGSSSEDIRLTKEIQ 725

Query: 756 L 756
           L
Sbjct: 726 L 726


>gi|29347188|ref|NP_810691.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
 gi|29339087|gb|AAO76885.1| beta-glucosidase (gentiobiase) [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 853

 Score =  275 bits (704), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 160/421 (38%), Positives = 232/421 (55%), Gaps = 45/421 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y +   P  ER  DL+ R+T+ EK+  +   + G+PRLG+  Y   +EALHGV   GR  
Sbjct: 30  YKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-- 87

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
                          T FP  I   A++N  L K++   +S EARA +N  + G      
Sbjct: 88  --------------FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQ 133

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFWSP +N+ RDPRWGR  ET GEDP++ G     +V GLQ           D  
Sbjct: 134 FSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQG---------DDPH 184

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LKI +  KH+AA    N E ++RF  + +++E+ ++E +   FEMCV EG  +S+M +Y
Sbjct: 185 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAY 240

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N +N +P   +P LL + +R DW F GY+VSDC     +V +HK++  TKE A    +KA
Sbjct: 241 NALNDVPCTLNPWLLQKVLRQDWGFQGYVVSDCGGPALLVNAHKYVK-TKEAAATLSIKA 299

Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           GLDL+CG D Y    + A +Q  +++ADID++   +    M+LG FD   +  Y  +  +
Sbjct: 300 GLDLECGDDVYDGPLLNAYKQYMVSDADIDSAAYHVLTARMKLGLFDSGERNPYTKISPS 359

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            I + +H ++A +AARQ +VLLKN    LPLN   +K++A+VG   NA K   G+Y G P
Sbjct: 360 VIGSKEHQQIALDAARQCVVLLKNQKNRLPLNADKLKSIAVVG--INAGKCEFGDYSGAP 417

Query: 421 C 421
            
Sbjct: 418 V 418



 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 93/300 (31%), Positives = 150/300 (50%), Gaps = 49/300 (16%)

Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-V 518
           A +  +  V V G++ S+E EG+DR D+ LP  Q E + ++      P  +V++ AG+ +
Sbjct: 600 AVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAGSSL 657

Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
            IN+   +  I +I+   YPGE+GG A+A+V+FG YNP GRLP+T+Y++   ++P    P
Sbjct: 658 AINWMDEH--IPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS-LDELP----P 710

Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV 638
               +   GRTYK+F G V+YPFGYGLSY+ F Y               D Q +D    V
Sbjct: 711 FDDYDITKGRTYKYFKGDVLYPFGYGLSYSSFTY--------------SDLQVKDGGGEV 756

Query: 639 GTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG-THIKQVIG 697
                                 T    ++N GK +G EV  VY + P   G   +K++ G
Sbjct: 757 ----------------------TVSFRLKNTGKRNGDEVAQVYVRIPETGGIVPLKELKG 794

Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVD-NAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
           + RV + +G+S +V   ++  + L+  D      ++  GA  ++VG     +     ++L
Sbjct: 795 FRRVPLKSGESRRVEIKLDK-EQLRYWDVEKGQFVVPKGAFDVMVGASSKDIRLQTVIDL 853


>gi|383123909|ref|ZP_09944579.1| hypothetical protein BSIG_4072 [Bacteroides sp. 1_1_6]
 gi|382983834|gb|EES66944.2| hypothetical protein BSIG_4072 [Bacteroides sp. 1_1_6]
          Length = 815

 Score =  275 bits (703), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 227/723 (31%), Positives = 343/723 (47%), Gaps = 114/723 (15%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+PL+    EA HG   IG                  T FPT I   A+++  L +++
Sbjct: 160 RLGIPLF-LAEEAPHGHMAIG-----------------TTVFPTGIGMAATWSPVLIEEV 201

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           G  ++ E R+           + P +++ RDPRW RV ET GEDP + GR     V GL 
Sbjct: 202 GNVIAKEIRS-----QGAHISYGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMVIGLG 256

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                       SR     A  KH+ AY +   EG    ++ S V  +D+ E F+ PF+ 
Sbjct: 257 S--------GDLSREYATIATLKHFLAYAVP--EGGQNGNYAS-VGTRDLHENFLPPFQE 305

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            ++ G +S VM SYN ++GIP  A+  LL Q +R +W F G++VSD  SI+ + ESH F+
Sbjct: 306 AIDAGALS-VMTSYNSIDGIPCTANYYLLTQLLRNEWRFRGFVVSDLYSIEGVHESH-FV 363

Query: 290 NDTKEDAVARVLKAGLDLDC-GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
             T E+A  +V+ AG+D+D  G+ + N T  AVQ GKI+EA IDT++  +  +   +G F
Sbjct: 364 APTIEEAAMQVVSAGVDIDLGGNAFMNLTH-AVQSGKISEAVIDTAVCRVLRMKFEMGLF 422

Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
           +            + + +HI LA + A+  IVLLKN N  LPLN   IK +A+VGP+A+ 
Sbjct: 423 EHPYVNPKSATKVVRSEEHIRLAHKVAQSSIVLLKNKNSILPLNK-KIKKVAVVGPNADN 481

Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKN 463
              M+G+Y          + +DG  +    SKV  Y  GCA I     + I   ++AA  
Sbjct: 482 RYNMLGDYTAPQEDENIKTVLDGVISKLSPSKV-EYVRGCA-IRDTTVNEIAEVVEAASR 539

Query: 464 ADATVIVAGLDLSVE-----------------------AEGKDRVDLLLPGFQTELINKV 500
           ++  + V G   + +                        EG DR  L L G Q +L+N +
Sbjct: 540 SEVIIAVVGGSSARDFKTSYQETGAAIADEKSISDMECGEGFDRATLTLLGKQQDLLNAL 599

Query: 501 ADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRL 560
               K P+ +V +    +D  +A       ++L   YPG+EGG AIADV+FG YNP GRL
Sbjct: 600 KATGK-PLIVVYIEGRPLDKVWASEYA--DALLTASYPGQEGGYAIADVLFGDYNPAGRL 656

Query: 561 PITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS 620
           P++    +  +IP       P N+     Y       +Y FGYGLSYT F+Y        
Sbjct: 657 PVS-VPRSVGQIPVYYNKKAPCNH----DYVEQAASPLYTFGYGLSYTTFEYS------- 704

Query: 621 VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
            D+++ +              K PC              F    +V+N G  DG EV  +
Sbjct: 705 -DLQVIR--------------KSPCY-------------FEVSFKVKNTGSYDGEEVAQL 736

Query: 681 YSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
           Y +    +    ++Q+  +ER F+  G+  ++ FT+   K L I+D     ++ +G   I
Sbjct: 737 YLRDEYASVVQPLRQLKCFERFFLKRGEEKEIFFTLTE-KDLSIIDRNMARVVETGDFRI 795

Query: 740 LVG 742
           ++G
Sbjct: 796 MIG 798


>gi|323451833|gb|EGB07709.1| hypothetical protein AURANDRAFT_64764 [Aureococcus anophagefferens]
          Length = 819

 Score =  275 bits (703), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 234/758 (30%), Positives = 337/758 (44%), Gaps = 124/758 (16%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
           D  Y DA LP  +R   L + + L + + Q+ + A  V  + LP Y W ++  HGV    
Sbjct: 68  DGTYLDASLPEADRLAWLADNVPLEDMIGQLVNAAPAVDAVDLPAYNWLNDNEHGVKGTA 127

Query: 71  RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN-----LGN 125
             T  P G                    AS++  L  ++G  +  E+RA +N      GN
Sbjct: 128 HATVYPMGASLG----------------ASWSVDLAWRVGAAIGNESRATHNGLADKSGN 171

Query: 126 A--------------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDV 171
           A              G+T ++PN+N+VRDPRWGR  E  GEDP++    A+  V GLQ  
Sbjct: 172 ACGSTSTGEVVANGCGITLYAPNVNLVRDPRWGRAEEVYGEDPHLTAELAVGMVTGLQG- 230

Query: 172 EGVEYHRDSDSRPLKISACCKHYAAY-------DLDNWEGNDRFHFDSRVTEQDMQETFI 224
                       PL   ACCKH+AA+       DL      DR   D+ V+ +D+ ET++
Sbjct: 231 NAEGSTSGPGGGPLVTGACCKHFAAHFAVYQNEDLPA----DRMVLDANVSSRDLWETYL 286

Query: 225 LPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE 284
              + CV       V      VNG PTCA P+LLN  +R  W F G++VSD D+   +V 
Sbjct: 287 PVMKACV-------VRAKATHVNGKPTCAHPELLNDVLRESWGFDGFVVSDYDAWSNLVT 339

Query: 285 SHKFLNDTKEDAVARVLKAGLDLDC--GDYY-TNFTMGAVQQGKIAEADIDTSLRFLYIV 341
           +HK+++ T E+A A  + AG+D +   GDY   +    AV+ G +A A +  S   L  V
Sbjct: 340 THKYVS-TWEEAAAAGINAGMDQEGGFGDYSPVDALPDAVRNGTVAAATVRRSFERLMRV 398

Query: 342 LMRLGYFDGSPQYKNLGKNNICNPQ-----HIELAAEAARQGIVLLKNDNGALPLNTGNI 396
            +RLG FD        G+   C+ Q      + LA EAAR+GIVL KN  GALPL  G  
Sbjct: 399 RLRLGMFDPPASTAVYGEAYQCDYQCETAAKLALAREAAREGIVLFKNAGGALPLAKG-- 456

Query: 397 KTLALVGPHANATKAMIG--NY---EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNN 451
             +ALVGP  +  + ++G  NY   +G      +   G  A + V + A GC  + C   
Sbjct: 457 ARIALVGPQVDDWRVLLGAVNYAFEDGPDVAPVTIQKGLEAVANV-SVAAGCDSVACAAL 515

Query: 452 SMIPAAI--------------DAAKNADATVIVAGL-DLSVEAEGKDRVDLLLPGFQTEL 496
             +  A               D+    D   +  G  D   E+E  DR  + LPG Q  L
Sbjct: 516 VDVDGAKRLAAAADATVVVLGDSFGATDGWPLCRGTRDDGCESESHDRATIELPGEQVAL 575

Query: 497 INKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNP 556
           +  +  A+   V +++          A +   +   LWV  PG+ GG A+ADV+FG Y+P
Sbjct: 576 VAALRAASSRLVCVLVHGGAVALGAAADDCDAVLD-LWV--PGQMGGAALADVLFGDYSP 632

Query: 557 GGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV-VYPFGYGLSYTQFKYKVA 615
            GR PIT Y A     P          +  G TY+++ GP   Y FG GLSY  F Y  A
Sbjct: 633 AGRSPITMYAATSDLPPMGVFDEYAGESSNGTTYRYYAGPAPTYAFGDGLSYASFSYAWA 692

Query: 616 SSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGS 675
           ++P                     T    C A+ +            ++ V N G +   
Sbjct: 693 AAPP--------------------TTVDACGAIRL------------RVAVTNTGSVASD 720

Query: 676 EVVMVYSK-PPGIAGTHIKQVIGYERV-FIAAGQSAKV 711
           EVV VY++ P         +++ ++RV  IA G +A V
Sbjct: 721 EVVQVYARVPDATVPAPAIRLVAFDRVRAIAPGATATV 758


>gi|46127231|ref|XP_388169.1| hypothetical protein FG07993.1 [Gibberella zeae PH-1]
          Length = 712

 Score =  275 bits (703), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 222/753 (29%), Positives = 342/753 (45%), Gaps = 133/753 (17%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD      ERA  LV  +T  EKV  +   A G PR+GLP Y WW+EALHGV+       
Sbjct: 42  CDTTASPAERAAALVSALTPREKVNNLVSNATGAPRIGLPRYNWWNEALHGVA------- 94

Query: 75  SPPGTHFDSEVP--GATSFPTVILTTASFNESLWKKIGQTVSTEARA-MYNLGNAGLTFW 131
             PG  ++ + P   ATSFP  +L  ++F++ L   IG+ + TEARA        G+ +W
Sbjct: 95  GAPGNDYNDKPPYDSATSFPMPLLMGSTFDDDLIHDIGEVIGTEARAWNNGGWGGGVDYW 154

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK----- 186
           +PN+N  +DPRWGR  ETPGED   V RYA            +E  RD+    +      
Sbjct: 155 TPNVNPFKDPRWGRGSETPGEDALHVSRYA----------RAMECTRDAKVGSIMCSYNA 204

Query: 187 ---ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
              I AC   Y                        +QET +                   
Sbjct: 205 VNGIPACANSY------------------------LQETLLR------------------ 222

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
                       K  N T   +W     I SDC ++Q I + H +   T  +A     + 
Sbjct: 223 ------------KHWNWTHTNNW-----ITSDCGAMQDIWQHHNY-TKTGAEAAKAAFEN 264

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNI 362
           G D  C    T     + +QG + E  +D +L+ L+  L+  G+FDG   ++ +L  +++
Sbjct: 265 GQDSSCEYTTTKDISDSYEQGLLTEKVMDRALKRLFEGLVHTGFFDGDKSEWSSLDFDDV 324

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
                 +LA ++A +G VLLKNDN  LPLN    +++AL+G  A+    + G Y G    
Sbjct: 325 NTRHAQDLALQSAVRGAVLLKNDN-TLPLNIKKKESVALIGFWADDKTKLQGGYSGPAPH 383

Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSMIP-----AAIDAAKNADATVIVAGLDLSV 477
             +P    YA +K++      A      NS +P      A++AAK +D  V + GLD + 
Sbjct: 384 VRTPA---YA-AKMLGLNTNVAWGPTLQNSSVPDNWTTNALEAAKKSDYIVYLGGLDATA 439

Query: 478 EAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGY 537
             E +DR DL  P  Q  L+ K+++  K P+ +V +     D    KN   + SILWV Y
Sbjct: 440 AGEERDRTDLDWPSTQLTLLKKLSNLGK-PLVVVQLGDQVDDTPLLKNK-GVNSILWVNY 497

Query: 538 PGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGP 596
           PG+EGG A+ ++I G+  P GRLP+T Y + Y  ++    M LRP  + PGRTY+++   
Sbjct: 498 PGQEGGTAVMELITGRKGPAGRLPLTQYPSKYTEQVGMLEMELRPTKSSPGRTYRWYSDS 557

Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
           V+ PFG+G  YT FK    S  + +++ + K  +  D  Y      PP            
Sbjct: 558 VL-PFGFGKHYTTFKAMFKS--QKIEMNIQKILKGCDATYVDTCPLPP------------ 602

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH---IKQVIGYER---VFIAAGQSAK 710
                  + V+N G+     V +V+ +  G  G     +K +  Y R   +   A +  +
Sbjct: 603 -----IHLSVKNTGRTTSDFVSLVFIQ--GKVGPKPYPLKTLAAYSRSHDIKPRATKDVE 655

Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
           + +TM+   ++   +   + ++  G +T+L+ E
Sbjct: 656 LQWTMD---NIARREKNGDLVVYPGTYTLLLDE 685


>gi|206901921|ref|YP_002251428.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
 gi|206741024|gb|ACI20082.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
          Length = 756

 Score =  275 bits (703), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 205/670 (30%), Positives = 335/670 (50%), Gaps = 89/670 (13%)

Query: 87  GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
           G+T FP  I   +++N  L  ++   +  E R+            SP IN+ RDPR GR 
Sbjct: 147 GSTIFPQAIGMASTWNPELIYQVATAIGKETRS-----RGIHQVLSPTINIARDPRCGRT 201

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
            ET GEDPY+  R A+ Y++G+Q+ +GV              A  KH+ A  + +  G D
Sbjct: 202 EETYGEDPYLASRMAVAYIKGVQE-QGV-------------IATPKHFVANFVGDG-GRD 246

Query: 207 RF--HFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRG 264
            +  HF  R+    ++E +   F   + E    S+M +YN ++GIP  ++  LL + +R 
Sbjct: 247 SYPIHFSERL----LREIYFPAFRASIEEAGALSLMAAYNSLDGIPCSSNKWLLTRILRK 302

Query: 265 DWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTM-GAVQQ 323
           +W F GY+VSD  S+  ++  HK + ++K +A    L+AGLD++  D      + G +++
Sbjct: 303 EWGFKGYVVSDYFSVLHLMTKHK-VAESKAEAAKLSLEAGLDMELPDSDCFEEIPGLIRE 361

Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDG---SPQYKNLGKNNICNPQHIELAAEAARQGIV 380
            K+++  +D ++R +  V   +G FD     P Y    + N C+ +H ELA   AR+ IV
Sbjct: 362 SKLSQDTLDEAVRRVLRVKFWIGLFDNPFVDPDYAE--RINDCS-EHRELALRVARESIV 418

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKV-I 437
           LLKN+ G LPLN  +I+++A++GP  NA    +G Y G   +  +P++G       KV +
Sbjct: 419 LLKNE-GILPLNK-DIRSIAVIGP--NAAVPRLGGYSGYGVKVVTPLEGIKNKLGDKVKV 474

Query: 438 NYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL-SVEAEGKDRVDLLLPGFQTEL 496
            +A GC  +   + S    AI  A+ +D  ++  G  +   E E +DR +L LPG Q +L
Sbjct: 475 YFAEGCG-LNDTSKSGFDEAIKIAQKSDVAILFMGNSVPETEGEQRDRHNLNLPGVQEDL 533

Query: 497 INKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNP 556
           I ++ +    PV +V+++  A  I       K+++++   YPGEEGG AIADV+FG YNP
Sbjct: 534 IKEICNT-NTPVIVVLINGSA--ITMMNWIDKVQAVIEAWYPGEEGGNAIADVLFGDYNP 590

Query: 557 GGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD---GPVVYPFGYGLSYTQFKYK 613
           GG+LPI++ + +      + +PL   +   GR   + D      ++PFGYGLSYT FKY 
Sbjct: 591 GGKLPISFPKYS------SQLPLYYNHKPSGRVDDYVDLRGNQYLFPFGYGLSYTDFKYS 644

Query: 614 VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMD 673
                                N  +   + P           +D +     ++EN+GK  
Sbjct: 645 ---------------------NLRITPEEIP-----------RDGEVVITFDIENIGKYK 672

Query: 674 GSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL 732
           G EVV +Y           IK++  +ERV +  G+   V F +N  + L+ +      ++
Sbjct: 673 GDEVVQLYLHDEFASVARPIKELKRFERVTLDVGERKTVSFKLNR-RDLEFLSMDMELVV 731

Query: 733 ASGAHTILVG 742
             G   +L+G
Sbjct: 732 EPGRFEVLIG 741


>gi|395803818|ref|ZP_10483061.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
 gi|395434089|gb|EJG00040.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
          Length = 875

 Score =  275 bits (703), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 160/449 (35%), Positives = 237/449 (52%), Gaps = 49/449 (10%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           FP+ +  L + ER ++LV ++TL EKV QM + A  +PRLG+P Y+WW+E LHGV+    
Sbjct: 27  FPFQNTDLTFEERVENLVSQLTLEEKVAQMLNAAPAIPRLGIPAYDWWNETLHGVARTPF 86

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG------- 124
           +T               T FP  I   A+F+++   K+    + E RA+YN         
Sbjct: 87  KT---------------TVFPQAIAMAATFDKNSLFKMADYSALEGRAIYNKAVELNRTK 131

Query: 125 --NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
               GLT+W+PNIN+ RDPRWGR  ET GEDPY+       +V+GLQ           D 
Sbjct: 132 ERYLGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQG---------DDP 182

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
           + LK +AC KHYA +       + R  FD  VT  ++ +T++  F+  V    V+ VMC+
Sbjct: 183 KYLKAAACAKHYAVHSGPE---SLRHTFDVDVTPYELWDTYLPAFKKLVTNSKVAGVMCA 239

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           YN     P CA   L+N  +R  W F GY+ SDC +I    ++HK   D    +   VL 
Sbjct: 240 YNAFRTQPCCASDILMNDILRNQWKFTGYVTSDCWAIDDFFKNHKTHPDAASASADAVLH 299

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
            G D+DCG       + AV+ G+I E  ID S++ L+++  RLG FD     +Y     +
Sbjct: 300 -GTDIDCGTDAYKSLVQAVKNGQITEKQIDVSVKRLFMIRFRLGMFDPVSMVKYAQTPSS 358

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + + +H E A + ARQ IVLLKN+   LPL +  +K + ++GP+A+ + +++GNY GTP
Sbjct: 359 VLESEEHKEHALKMARQSIVLLKNEKNTLPL-SKKLKKIVVLGPNADNSISILGNYNGTP 417

Query: 421 CRYTSPMDGF---------YAYSKVINYA 440
            + T+ + G            Y K IN+ 
Sbjct: 418 SKLTTVLQGIKEKISPETEVVYEKAINFT 446



 Score =  133 bits (334), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 102/327 (31%), Positives = 149/327 (45%), Gaps = 60/327 (18%)

Query: 433 YSKVINY--APGCADIVCQNNSMIPA----AIDAAKNADATVIVAGLDLSVEAE------ 480
           Y  V+ Y    G A++  Q  + I       I+  KNADA +   G+   +E E      
Sbjct: 569 YKLVLEYWQGEGKAEVALQTGNFIKTDFANLIERHKNADAFIFAGGISPQLEGEEMPVDA 628

Query: 481 ----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
               G DR  +LLP  QT L+  +  + K PV  +IM+  A+ + +   N  I +IL + 
Sbjct: 629 PGFNGGDRTSILLPEVQTRLLKALQSSGK-PVVFLIMTGSAIAVPWEAEN--IPAILNIW 685

Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
           Y G+  G A ADVIFG YNP GRLP+T+Y+ +     +    +        +TY++F G 
Sbjct: 686 YGGQSAGTASADVIFGDYNPAGRLPVTFYKGDSDLSSFVDYKM------DNKTYRYFKGI 739

Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
            +Y FGYGLSYT+FKY    +P     K+ K Q                           
Sbjct: 740 PLYGFGYGLSYTEFKYSGLKTPD----KIKKGQPV------------------------- 770

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTM 715
               T  ++V N GKM+G EV  +Y   P  +  + +K + G+ER  +  GQS  V FT+
Sbjct: 771 ----TISVKVTNTGKMEGEEVAQLYLINPNTSIKSPLKSLKGFERFNLKPGQSTVVNFTL 826

Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
           +  + L  V  + N     G   I VG
Sbjct: 827 SP-EDLSYVTESGNLKPYEGKIQIAVG 852


>gi|399029098|ref|ZP_10730151.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
 gi|398073120|gb|EJL64304.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
          Length = 744

 Score =  275 bits (702), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 221/734 (30%), Positives = 338/734 (46%), Gaps = 143/734 (19%)

Query: 28  LVERMTLPEKVQQM-GDLAY---GVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
           L+ +MTL EK+  + G+  +   GV RLG+P  +     L     I R   +P G   D 
Sbjct: 52  LISQMTLEEKIGMLHGNSMFSNGGVKRLGIPELKMADGPLGVREEISRDNWAPAGLTNDF 111

Query: 84  EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRW 143
               AT +P      A++N  +    G ++  E RA            SP IN+VR P  
Sbjct: 112 ----ATYYPAGGGLAATWNAEMAHTFGNSLGEELRA-----RDKDMLLSPAINMVRSPLG 162

Query: 144 GRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWE 203
           GR  E   EDP++  + A+  + GLQ+ +              + AC KHYAA   +N E
Sbjct: 163 GRTYEYMSEDPFLNKKIAVPLIVGLQEKD--------------VMACVKHYAA---NNQE 205

Query: 204 GNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIR 263
            N  F  D ++ E+ ++E ++  FE  V E    S+M +YN+  G   C +  +LN+ +R
Sbjct: 206 TNRDF-VDVQIDERTLREIYLPAFEASVKEAKAYSIMGAYNKFRGEYLCENDYMLNKILR 264

Query: 264 GDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD-------YYTNF 316
            +W F G +VSD  ++ +                A+ LK GLD++ G        +  + 
Sbjct: 265 DEWGFKGVVVSDWAAVHS---------------TAKTLKNGLDIEMGTPKPFNEFFLADK 309

Query: 317 TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAAR 376
            + AV+ G+++EA+ID  ++ +  VL ++    G  +     K +I    H + A + A 
Sbjct: 310 LIAAVKSGEVSEAEIDLHVKRILRVLFQVKAMGGGER----AKGSIATEAHYQDAYKIAS 365

Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC-RYTSPMDGF---YA 432
           + +VLLKNDN ALPL    +K++A++G +A    A+ G   G    R  +P++G      
Sbjct: 366 EAVVLLKNDNNALPLKLDGVKSIAVIGNNATKKNALAGFGAGVKTKREITPLEGLKNRLP 425

Query: 433 YSKVINYAPGCADIVCQNN--------------------SMIPAAIDAAKNADATVIVAG 472
            S  INYA G  +   + N                    + +  A++AAKN+D  +I AG
Sbjct: 426 SSIKINYAEGYLERYEEKNKGNLGNITSSGPVTIDQLDPAKLQEAVEAAKNSDVAIIFAG 485

Query: 473 LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-VDINFAKNNPKIKS 531
            +   E E  DR DL LP  Q ELI KV   A  P T+V+M AGA  DIN  + + K  +
Sbjct: 486 SNRDYETEASDRRDLHLPFGQEELIKKV--LAVNPKTIVVMIAGAPFDIN--EVSKKTSA 541

Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSM--PLRPVNNFPGRT 589
           ++W  + G EGG A+ADV+ GK NP G+LP T        +P   M  P    N+FPG  
Sbjct: 542 LVWSWFNGSEGGNALADVLLGKVNPSGKLPWT--------MPKNLMDSPAHATNSFPGGK 593

Query: 590 -----------YKFFDGPVV---YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDIN 635
                      Y++FD   +   YPFG+GLSYT F +  A + K+              +
Sbjct: 594 EVNYAEGILIGYRWFDTKKIAPLYPFGFGLSYTTFAFDNAKTDKT--------------S 639

Query: 636 YTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQV 695
           Y V                      T  ++V+N GK+DG EVV +Y+       T   Q 
Sbjct: 640 YAVTET------------------ITVSVDVKNTGKVDGKEVVQLYASKSDSKITRAAQE 681

Query: 696 I-GYERVFIAAGQS 708
           + G+++  + AG S
Sbjct: 682 LKGFQKTDVKAGGS 695


>gi|423302093|ref|ZP_17280116.1| hypothetical protein HMPREF1057_03257 [Bacteroides finegoldii
           CL09T03C10]
 gi|408471184|gb|EKJ89716.1| hypothetical protein HMPREF1057_03257 [Bacteroides finegoldii
           CL09T03C10]
          Length = 1039

 Score =  275 bits (702), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 233/810 (28%), Positives = 369/810 (45%), Gaps = 144/810 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
           Y D   P   R +DL+ +MTL EK  QM  L YG  R+    LP  EW ++    G+  I
Sbjct: 145 YEDPSAPIDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKNQLWKDGIGAI 203

Query: 70  GRRTNS------PPG-------------------------------THFDSE-VPG---- 87
               N       PP                                T F +E + G    
Sbjct: 204 DEHLNGFQQWGLPPSDNEYVWPASKHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 263

Query: 88  -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
            AT+FPT +    ++N  L ++IG     EAR +      G T  ++P ++V RD RWGR
Sbjct: 264 KATNFPTQLGLGHTWNRQLLRQIGLITGREARML------GYTNVYAPILDVGRDQRWGR 317

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
             E  GE PY+V    I  V+G+Q       H        +++A  KH+ AY  +     
Sbjct: 318 YEEVYGESPYLVAELGIEMVKGMQ-------HNH------QVAATGKHFIAYSNNKGARE 364

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
                D +++ ++++   + PF+  + E  +  VM SYN  +G P  +    L   +RGD
Sbjct: 365 GMARVDPQMSPREVEMIHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGD 424

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
             F GY+VSD D+++ +   H    D KE AV + ++AGL++ C     D Y       V
Sbjct: 425 MGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNIRCTFRSPDSYVLPLRELV 483

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN-NICNPQHIELAAEAARQGIV 380
           ++G+++E  I+  +R +  V   +G FD   Q    G +  +    + E+A +A+R+ IV
Sbjct: 484 KEGELSEEIINDRVRDILRVKFLVGLFDHPYQTDLKGADEEVEKASNEEIALQASRESIV 543

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY---SKVI 437
           LLKND   LPLN   IK +A+ GP+A+     + +Y       TS + G          +
Sbjct: 544 LLKNDKNVLPLNASTIKKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKLGGKAEV 603

Query: 438 NYAPGCADI-------------VCQN-NSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
            Y  GC  +             + +N    I  A+   K AD  V+V G       E K 
Sbjct: 604 LYTKGCELVDANWPESELMEYPLSENEQEEIEKAVSQTKQADVAVVVLGGGQRTCGENKS 663

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R  L LPG Q +L+  V    K PV LV+++   + IN+A  +  + +IL   YPG +GG
Sbjct: 664 RSSLALPGRQLDLLKAVVATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGG 720

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV------ 597
           +A+ADV+FG YNPGG+L +T +     +IP+ + P +P +   G      +G +      
Sbjct: 721 KAVADVLFGDYNPGGKLTVT-FPKTVGQIPF-NFPCKPSSQIDGGKNPGLNGNMSRVNGA 778

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD----V 653
           +YPFG+GLSYT F+Y         D+K+                     A++  +    V
Sbjct: 779 LYPFGFGLSYTTFEYS--------DLKI-------------------SPAIITPNQKTYV 811

Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVG 712
            CK         V N GK  G EVV +Y +       T+ K + G+ERV +  G++ ++ 
Sbjct: 812 TCK---------VTNTGKRAGDEVVQLYVRDVLSSVTTYEKNLAGFERVHLKPGETKEIT 862

Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVG 742
           F ++  K+L++++   + ++  G  T+++G
Sbjct: 863 FPIDR-KALELLNADMHWVVEPGEFTLMIG 891


>gi|380694149|ref|ZP_09859008.1| glycoside hydrolase 3 [Bacteroides faecis MAJ27]
          Length = 946

 Score =  275 bits (702), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 239/807 (29%), Positives = 369/807 (45%), Gaps = 138/807 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
           Y D       R +DL+ +MTL EK  QM  L YG  R+    LP  EW ++    G+  I
Sbjct: 53  YEDPTATIDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKNQLWKDGIGAI 111

Query: 70  GRRTNS------PPG-------------------------------THFDSE-VPG---- 87
               N       PP                                T F +E + G    
Sbjct: 112 DEHLNGFQQWGLPPSDNENIWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 171

Query: 88  -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
            AT+FPT +    ++N  L  ++G     EAR +      G T  ++P ++V RD RWGR
Sbjct: 172 KATNFPTQLGLGHTWNRRLIHQVGLITGREARML------GYTNVYAPILDVGRDQRWGR 225

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
             E  GE PY+V    I  VRG+Q       H        +I+A  KH+ AY  +     
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGMQ-------HNH------QIAATGKHFIAYSNNKGARE 272

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
                D +++ ++++ T + PF+  + E  +  VM SYN  +G P  +    L   +RG+
Sbjct: 273 GMARVDPQMSPREVEMTHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGE 332

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
             F GY+VSD D+++ +   H    D KE AV + ++AGL++ C     D Y       V
Sbjct: 333 MGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 391

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN-NICNPQHIELAAEAARQGIV 380
           ++G ++E  I+  +R +  V   +G FD   Q    G +  +    + E+A +A+R+ IV
Sbjct: 392 KEGGLSEEVINDRVRDILRVKFLVGLFDHPYQIDLKGADEEVEKAANEEIALQASRESIV 451

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK---VI 437
           LLKND   LPL+   I+ +A+ GP+A+     + +Y       TS + G     K    +
Sbjct: 452 LLKNDKNILPLDASGIQKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKGKAEV 511

Query: 438 NYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
            Y  GC D+V  N                  I  A+D  K AD  V+V G       E K
Sbjct: 512 LYTKGC-DLVDANWPESELIDYPLTDEEQKEIEKAVDQTKQADVAVVVLGGGQRTCGENK 570

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
            R  L LPG Q +L+  VA   K PV LV+++   + IN+A  +  + +I+   YPG +G
Sbjct: 571 SRSSLDLPGRQLDLLKAVAATGK-PVVLVLINGRPLSINWA--DKFVPAIVEAWYPGSKG 627

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV----- 597
           G+A+ADV+FG+YNPGG+L +T +     +IP+ + P +P +   G      +G +     
Sbjct: 628 GKAVADVLFGEYNPGGKLTVT-FPKTVGQIPF-NFPCKPSSQIDGGKNPGMEGNMSRANG 685

Query: 598 -VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
            +YPFGYGLSYT F+Y   S  K     +  +QQ                      V CK
Sbjct: 686 ALYPFGYGLSYTTFEY---SDLKISPAIITPNQQTF--------------------VTCK 722

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
                    V N GK  G EVV +Y +       T+ K + G+ERV +  G++ +V F +
Sbjct: 723 ---------VTNTGKRAGDEVVQLYVRDVLSSVTTYEKNLAGFERVHLQPGETKEVTFPI 773

Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
           +  K+L++++   + ++  G  T++VG
Sbjct: 774 DR-KALELLNADMHWVVEPGDFTLMVG 799


>gi|146301622|ref|YP_001196213.1| glycoside hydrolase family 3 protein [Flavobacterium johnsoniae
           UW101]
 gi|146156040|gb|ABQ06894.1| Candidate beta-xylosidase; Glycoside hydrolase family 3
           [Flavobacterium johnsoniae UW101]
          Length = 875

 Score =  275 bits (702), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 157/433 (36%), Positives = 234/433 (54%), Gaps = 40/433 (9%)

Query: 8   KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
           K  DF + +  L + +R  DLV R+TL EKV QM + +  + RLG+P Y+WW+E LHGV+
Sbjct: 23  KKYDFQFQNPSLSFEQRVDDLVSRLTLEEKVSQMLNSSPEIARLGIPAYDWWNETLHGVA 82

Query: 68  FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--- 124
               +T               T +P  I   A+F+++    +    + E RA+YN     
Sbjct: 83  RTPFKT---------------TVYPQAIGMAATFDKNSLFTMADYSALEGRAIYNKAVEL 127

Query: 125 ------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHR 178
                   GLT+W+PNIN+ RDPRWGR  ET GEDPY+       +V+GLQ         
Sbjct: 128 KRTNERYLGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQG-------- 179

Query: 179 DSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSS 238
             D + LK +AC KHYA +   +   + R  FD  VT  ++ +T++  F   + E +V+ 
Sbjct: 180 -DDPKYLKAAACAKHYAVH---SGPESLRHTFDVDVTPYELWDTYLPAFRKLITESNVAG 235

Query: 239 VMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVA 298
           VMC+YN     P CA   L+N  +R +W F GY+ SDC +I    ++HK   D  E A A
Sbjct: 236 VMCAYNAFRTQPCCASDILMNDILRKEWKFDGYVTSDCWAIDDFFKNHKTHPDA-ESAAA 294

Query: 299 RVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKN 356
             +  G D+DCG       + AV+ GKI+E  ID S++ L+++  RLG FD     +Y  
Sbjct: 295 DAVFHGTDIDCGTDAYKALVQAVKNGKISEKQIDISVKRLFMIRFRLGMFDPVSMVKYAQ 354

Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
              + + + +H   A + ARQ IVLLKN+   LPLN  N+K + ++GP+A+   +++GNY
Sbjct: 355 TPSSVLESKEHQLHALKMARQSIVLLKNEKNILPLNK-NLKKIVVLGPNADNAISILGNY 413

Query: 417 EGTPCRYTSPMDG 429
            GTP + T+ + G
Sbjct: 414 NGTPSKLTTVLQG 426



 Score =  121 bits (304), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 89/301 (29%), Positives = 137/301 (45%), Gaps = 59/301 (19%)

Query: 433 YSKVINY--APGCADIVCQNNSMIPA----AIDAAKNADATVIVAGLDLSVEAE------ 480
           Y  V+ Y    G A++  Q  + +       I+  KNADA +   G+   +E E      
Sbjct: 569 YKIVLEYWQGEGKAEVSLQTGNFVKTNFADLIEHHKNADAFIFAGGISPQLEGEEMPVDF 628

Query: 481 ----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
               G DR  +L P  QT+L+  +  + K PV   +M+  A+ I +   N  I +IL + 
Sbjct: 629 PGFKGGDRTSILFPEVQTKLLKALQSSGK-PVVFAMMTGSAIAIPWEAEN--IPAILNIW 685

Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
           Y G+  G A ADVIFG YNP GRLP+T+Y+ +      + +P         +TY++F G 
Sbjct: 686 YGGQSAGTAAADVIFGDYNPAGRLPVTFYKND------SDLPSFVDYKMDNKTYRYFKGT 739

Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
            +Y FGYGLSYT FKY    +P    +K+ K Q                           
Sbjct: 740 PLYGFGYGLSYTSFKYSDLKTP----VKIKKGQSV------------------------- 770

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTM 715
               +  ++V N GK +G EV  +Y      A  T +K + G+ER  +  G++  + F +
Sbjct: 771 ----SILVKVANTGKTEGEEVAQLYLINQDTAIKTPLKSLKGFERFNLKPGENKTITFNL 826

Query: 716 N 716
           +
Sbjct: 827 S 827


>gi|404487205|ref|ZP_11022392.1| hypothetical protein HMPREF9448_02853 [Barnesiella intestinihominis
           YIT 11860]
 gi|404335701|gb|EJZ62170.1| hypothetical protein HMPREF9448_02853 [Barnesiella intestinihominis
           YIT 11860]
          Length = 860

 Score =  275 bits (702), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 232/815 (28%), Positives = 363/815 (44%), Gaps = 158/815 (19%)

Query: 8   KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQM-----------------------GDL 44
           K    PY +  LP  ER +DL+ RMT+ EK+ Q+                       G+ 
Sbjct: 23  KAQSLPYKNKNLPIEERVEDLLNRMTVDEKIAQIRHIHSSKIFNGQELDMKKLTDWAGNT 82

Query: 45  AYGV-------------------------PRLGLPLYEWWSEALHGVSFIGRRTNSPPGT 79
           ++G                           RLG+P++   +E+LHG              
Sbjct: 83  SWGFVEGFPLTGDNCAKSMYLIQKYMVEKTRLGIPIFTV-AESLHG------------AV 129

Query: 80  HFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVR 139
           H      GAT +P  I   ++FN  L +K  Q +S +  +M           SP I+VVR
Sbjct: 130 H-----DGATIYPQNIALGSTFNPELARKKTQMISDDLHSM-----GFRQVLSPCIDVVR 179

Query: 140 DPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDL 199
           D RWGRV E+ GEDPY+ G + I  V G        Y  +       IS   KHY  +  
Sbjct: 180 DLRWGRVEESYGEDPYLCGLFGIEEVSG--------YLENG------ISPMLKHYGPH-- 223

Query: 200 DNWEGNDRFHFDSRVTE---QDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPK 256
               GN     +    E   +D+ E ++ PFEM V    + +VM +YN  N IP  A   
Sbjct: 224 ----GNPLSGLNLASVECGLRDLHEIYLKPFEMVVKNTGILAVMSTYNSWNHIPNSASHY 279

Query: 257 LLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNF 316
           LL   +R +W F GY+ SD  +I+ +   H F      +A  + + AGLD +       F
Sbjct: 280 LLTDILRDEWGFKGYVYSDWGAIEMLKTLH-FTARNSSEAAIQAISAGLDAEASSKCYPF 338

Query: 317 TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAAR 376
             G +++G+  E  +DT++R +      +G F+  P  K        +P+ ++LA   A 
Sbjct: 339 LKGLIEKGQFDEKILDTAVRRVLFAKFAMGLFE-DPYGKTFKNRKRHSPESVKLAKTIAD 397

Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY--TSPMDGF---Y 431
           +  VLLKN+N  LPL+  ++K++A++GP  NA +   G+Y  +       +P+ G     
Sbjct: 398 ESTVLLKNENQLLPLDAKSLKSIAIIGP--NADQVQFGDYTWSRNNKDGVTPLQGIKNRV 455

Query: 432 AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG---------LDLSVEAEGK 482
             +  I+YA GC+ +   + S I  A++AAKN++  VI  G            S   EG 
Sbjct: 456 NKNTAIHYAKGCS-LTSLDTSGIAEAVEAAKNSEVAVIFGGSASAALARDYKSSTCGEGF 514

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
           D  DL L G Q++LI +V      PV LV+++     I + KNN  + +IL   Y GE+ 
Sbjct: 515 DLNDLNLTGAQSQLIREVYRTGT-PVILVLVTGKPFVIEWEKNN--LPAILVQWYAGEQA 571

Query: 543 GRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMP------LRPVN-NFPGRTYKFFD 594
           G +IAD++FG+  P GRL  ++     ++ + Y  +P        P + + PGR Y F  
Sbjct: 572 GNSIADILFGEVVPSGRLTFSFPRSTGHLPVYYNYLPSDRGFYKNPGSYDSPGRDYVFSA 631

Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
              +Y FGYGLSYT F YK        ++  DKD+   ++N T+                
Sbjct: 632 PSALYSFGYGLSYTSFVYK--------NLSTDKDKY--ELNDTIHAT------------- 668

Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
                    +EV+N GK  G EVV +Y +       T +KQ+  ++++ +A G++  V  
Sbjct: 669 ---------VEVKNTGKYTGKEVVQLYVRDKASTYVTPVKQLRDFKKIELAPGETRTVQL 719

Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
            +     L +VD      + +G   + VG+    +
Sbjct: 720 QV-PISDLYLVDEKNQRFVEAGEFILEVGQASNNI 753


>gi|371776218|ref|ZP_09482540.1| beta-glucosidase [Anaerophaga sp. HS1]
          Length = 774

 Score =  274 bits (701), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 198/658 (30%), Positives = 332/658 (50%), Gaps = 82/658 (12%)

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVL 147
           T+FP  +    S++  L +K  +  + EA A      +G+ + ++P +++ RDPRWGR++
Sbjct: 128 TTFPIPLAEACSWDLQLMEKSARIAAEEATA------SGVAWNFAPMVDISRDPRWGRIM 181

Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
           E  GEDP++    A   VRG Q   G++ ++D  S+P  + AC KH+  Y      G D 
Sbjct: 182 EGAGEDPFLGSLIARARVRGFQ---GIDSYKDF-SKPNTMMACAKHFVGYGAAQ-AGRDY 236

Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
              D  ++E+ + ET++ PF+  V+EG V+S M ++N +NG+P   +  +    +R  WN
Sbjct: 237 HTVD--ISERTLFETYLPPFKAAVDEG-VASFMTAFNELNGVPCTGNKYIFQDILRHQWN 293

Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQGKI 326
           F+G +V+D  +IQ +V +H F  D K+ A    + AG+D+D   + +  +    V++G++
Sbjct: 294 FNGMVVTDYTAIQEMV-AHGFAKDLKQ-ASKLAIDAGIDMDMISEGFVTYLKELVEEGQV 351

Query: 327 AEADIDTSLRFLYIVLMRLGYFDGSPQY--KNLGKNNICNPQHIELAAEAARQGIVLLKN 384
           +E  ID ++  +  +   LG FD   +Y      K  + NPQH++ A E A++ IVLLKN
Sbjct: 352 SEKQIDVAVARILEMKFLLGLFDDPYKYCDAEREKEVLMNPQHLQAAREVAQRSIVLLKN 411

Query: 385 DNGALPLNTGNIKTLALVGPHANATKAMIGNY--EGTPCRYTSPMDGF---YAYSKV-IN 438
           +N  LPL     K +AL+GP     +++ G +  +G   +  +  +G    YA + V  N
Sbjct: 412 ENNVLPLRKDIPKRVALIGPFVKERESLNGEWAIKGDRSKSVTLWEGLQEKYADTPVRFN 471

Query: 439 YAPGCA----DIVCQNNSM--------IPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
           YA G +    D   ++ S+           A+  AK +D  ++  G       E   R D
Sbjct: 472 YAKGTSLPLIDGATRHVSLEQGFDKSGFAEALRVAKTSDLILVAMGEHYHWSGEAASRTD 531

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           + LPG Q EL+ ++    K P+ LV+ +   +D+++   N  + +I+   YPG   G A+
Sbjct: 532 ITLPGNQRELLKELKKTGK-PIVLVLFNGRPLDLSWEAEN--VDAIVEAWYPGIMAGHAV 588

Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIP-YTSMPL--RPVNNFPGRTYK--FFDGP--VVY 599
           ADV+ G YNP  RL +T +  N  +IP + +M    RP +      YK  + D P   ++
Sbjct: 589 ADVLSGDYNPSARLVVT-FPRNVGQIPIFYNMKNTGRPFDENHPADYKSSYIDSPNSPLF 647

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           PFG+GLSYT F+Y                      N T+ + K      LI         
Sbjct: 648 PFGFGLSYTSFQYD---------------------NATISSQKLTKGGSLI--------- 677

Query: 660 FTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
               ++V N G +DG EVV +Y     G     +K++ G++++F+  G++  V FT+N
Sbjct: 678 --VSVDVTNTGNVDGEEVVQLYIHDKVGSVTRPVKELKGFKKIFLKKGETKTVEFTIN 733


>gi|225872720|ref|YP_002754177.1| xylan 1,4-beta-xylosidase [Acidobacterium capsulatum ATCC 51196]
 gi|225793233|gb|ACO33323.1| xylann 1,4-beta-xylosidase [Acidobacterium capsulatum ATCC 51196]
          Length = 721

 Score =  274 bits (701), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 222/733 (30%), Positives = 340/733 (46%), Gaps = 100/733 (13%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP--LYEWWSEALHGVSFI 69
           +P+ +  L   +R  DL+ RMTL EK+Q +GD   GVPRLG+P  L E   E LHG +  
Sbjct: 24  YPFQNPALSPDQRIDDLLSRMTLQEKIQALGDDP-GVPRLGIPGALTE---EGLHGAAIG 79

Query: 70  GRRTNSPPGTHFDSE---VPGATSFPTVILTTASFNESLWKKIGQTVSTEAR-AMYNLGN 125
           G         H++     V   T FP       +++ +L +K     + E R A+    +
Sbjct: 80  GP-------AHWEGRGRAVVPTTQFPQNHGLGQTWDPALLQKAANVEAYETRWAVNKYHD 132

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GL   +PN N+ RDPRWGR  E+ GEDPY+VG  A+ +++GLQ          ++ R  
Sbjct: 133 GGLIVRAPNANLSRDPRWGRTEESYGEDPYLVGTLAVAWIKGLQ---------GNNPRYW 183

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           + +A  KH+ AY  +        +F  R+      E + +PF M + +G   + M SYN 
Sbjct: 184 ETAALMKHFDAYSNEANRDGSSSNFGKRL----FYEYYSVPFRMGIEQGHSDAFMTSYNA 239

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
            NGIP  A+P +L   +   W F+G I +D  ++  +V +H     T  +A A  + AG+
Sbjct: 240 WNGIPMTANP-VLKSVVMKKWGFNGIICTDAGALSNMV-THFHYYKTMPEAAAGAVHAGI 297

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNIC 363
           +    D Y      A+QQ  + E  ID  L+ +Y V++RLG  D S    Y  +G  N  
Sbjct: 298 N-QFLDRYQQPVEEALQQKLLTEQQIDQDLKGVYRVVLRLGLMDPSSMSPYSMIGLTNDN 356

Query: 364 N--------PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGN 415
                    P HI L  +   + IVLLKN N ALPL+   + ++A++GP AN     +  
Sbjct: 357 PAKGDPWDWPSHIALDRKVTDESIVLLKNQNHALPLDAKKLHSIAVIGPWANIVA--LDW 414

Query: 416 YEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL 475
           Y GTP    +P++G           P    +   + S + AA   AK +D  +++ G   
Sbjct: 415 YSGTPPFGVTPVEGIRQ-----RVGPDV-KVTFNDGSNLQAAAALAKQSDEAIVIIGNHP 468

Query: 476 SVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           + +A         EGK+  D        E I K   AA     +V+ ++     ++ + +
Sbjct: 469 TCDAGWGKCALPSEGKEAFDRTALNLPDESIAKAVYAANPHTVVVLQTSFPYTTDWTQAH 528

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
             I +IL + +  EE G A+ADV+FG Y+P GRL  TW  A+  ++P    P+   N   
Sbjct: 529 --IPAILEMAHNSEEQGTALADVLFGDYDPAGRLAQTWV-ASIGQLP----PMMDYNIRD 581

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           GRTY +     +YPFG+GLSYT FKY                      N  + ++  P  
Sbjct: 582 GRTYMYLKSKPLYPFGFGLSYTTFKYS---------------------NLRLSSHTLPAG 620

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAA 705
                       + T  ++V N GK +G EVV +Y K         ++ + G++RV I  
Sbjct: 621 G-----------QLTVSVDVTNTGKYNGDEVVQMYVKHLDSKVSRPLEALKGFDRVSIPV 669

Query: 706 GQSAKVGFTMNAC 718
           GQ+  V   + A 
Sbjct: 670 GQTRTVTLPLKAS 682


>gi|71731103|gb|EAO33170.1| Beta-glucosidase [Xylella fastidiosa subsp. sandyi Ann-1]
          Length = 882

 Score =  274 bits (701), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 171/451 (37%), Positives = 238/451 (52%), Gaps = 52/451 (11%)

Query: 20  PYPER-AKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPG 78
           P PE+ A  LV +MT  EK+ Q  + A  +PRLG+P Y+WWSE LHG++  G        
Sbjct: 28  PSPEQHAAALVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNGY------- 80

Query: 79  THFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLT 129
                    AT FP  I   AS+N  L + +G   STEARA +NL           AGLT
Sbjct: 81  ---------ATVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLT 131

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
            WSPNIN+ RDPRWGR +ET GEDPY+ G+ A++++RGLQ         D+   P  I A
Sbjct: 132 LWSPNINIFRDPRWGRGMETYGEDPYLTGQLAVSFIRGLQG--------DTPDHPRTI-A 182

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
             KH+A +         R  FD  V+  D++ T+   F   + +G   SVMC+YN ++G 
Sbjct: 183 TPKHFAVHSGPE---QGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGT 239

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           P CA   LLN  +R DW F+G++VSDCD+I+ +   H F  D    A A  LK+G DL+C
Sbjct: 240 PACASDWLLNTRLRNDWGFNGFVVSDCDAIEDMTRFHFFRQDNAS-ASAAALKSGDDLNC 298

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQH 367
           G+ Y +    A+ +G I E+ +D +L  L+    RLG         Y  +G  +I  P H
Sbjct: 299 GNTYRDLNQ-AIARGDIDESTLDQALIRLFTARQRLGTLQPREHDPYAAIGIKHIDTPAH 357

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
             LA +AA Q +VLLKN    LPL      TLA++GP A++  A+  NY+GT     +P+
Sbjct: 358 RALALQAAAQSLVLLKNSGNTLPLPPET--TLAVLGPDADSLTALEANYQGTSSTPVTPL 415

Query: 428 DGFYA--------YSKVINYAPGCADIVCQN 450
            G           Y++  + APG    + + 
Sbjct: 416 TGLRTRFGTAKVHYAQGASLAPGVPSTIPET 446



 Score =  141 bits (356), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 96/295 (32%), Positives = 136/295 (46%), Gaps = 52/295 (17%)

Query: 460 AAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVT 509
           A  +ADA V   GL   VE E          G DR  + LP  Q  L+  V    K P+ 
Sbjct: 607 AVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK-PLI 665

Query: 510 LVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANY 569
           +V+MS  AV +N+A+++    +IL   YPG+ GG AIA  + G  NPGGRLP+T+Y +  
Sbjct: 666 VVLMSGSAVALNWAQHH--ADAILAAWYPGQSGGTAIAQALAGDVNPGGRLPVTFYRSTQ 723

Query: 570 VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQ 629
              PY S       +  GRTY++F G  +YPFGYGLSYTQF Y+                
Sbjct: 724 DLPPYISY------DMTGRTYRYFKGQPLYPFGYGLSYTQFAYEAP-------------- 763

Query: 630 QCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG 689
           Q        G                     T    V N G   G EVV +Y +PP    
Sbjct: 764 QLSTATLKAGNT------------------LTVTAHVRNTGTRAGDEVVQLYLEPPYSPQ 805

Query: 690 THIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
             ++ ++G++RV +  G+S  + FT++A + L  V       + +G + + VG G
Sbjct: 806 APLRSLVGFKRVTLRPGESRLLTFTLDA-RQLSGVQQTGQRSVEAGHYHLFVGGG 859


>gi|289577460|ref|YP_003476087.1| glycoside hydrolase [Thermoanaerobacter italicus Ab9]
 gi|289527173|gb|ADD01525.1| glycoside hydrolase family 3 domain protein [Thermoanaerobacter
           italicus Ab9]
          Length = 787

 Score =  274 bits (700), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 227/816 (27%), Positives = 376/816 (46%), Gaps = 156/816 (19%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL---------- 63
           Y D K P  ++ ++L+ +MT+ EK+ Q+          G+ +YE   + +          
Sbjct: 6   YLDPKQPVEKKVENLLAQMTIEEKIAQLS---------GIWVYEILDDMMKFSYEKANRL 56

Query: 64  --HGVSFIGR---------------------------RTNSPPGTHFDS----EVPGATS 90
             HG+  I R                           R   P   H +S       GAT 
Sbjct: 57  MTHGIGQITRLGGASNLSPQETVKIANQIQKYLVENTRLGIPALIHEESCSGYMAKGATI 116

Query: 91  FPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETP 150
           FP  I   +++N  L +K+   +  + +A+           +P ++V RDPRWGR  ET 
Sbjct: 117 FPQTIGVASTWNPKLVEKMASVIREQMKAV-----GARQALAPLLDVTRDPRWGRTEETF 171

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+V    ++Y+RGLQ          +++    + A  KH+  Y   N EG   +  
Sbjct: 172 GEDPYLVMHMGVSYIRGLQ----------TENLKEGVIATGKHFVGYG--NSEGGMNWA- 218

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
            + +  +++ E F+ PFE  V E  + S+M  Y+ ++GIP     +LL   +R +W F G
Sbjct: 219 PAHIPMRELYEIFLYPFEAAVKEAKLGSIMPGYHELDGIPCHKSKQLLTDILRKNWGFDG 278

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQGKIAE 328
            +VSD  +I  + E H+  ++ KE A    L+AG+D++    D Y       ++QG I  
Sbjct: 279 IVVSDYFAINQLYEYHRLASNKKE-AAKLALEAGVDVELPSTDCYGLPIKELIEQGDIDI 337

Query: 329 ADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQ-HIELAAEAARQGIVLLKNDNG 387
             ++ ++R +      LG F+ +P         I + Q   +LA + A++ IVLLKN++ 
Sbjct: 338 DFVNDAVRRILKAKFLLGLFE-NPYVDEKRVVEIFDTQEQRQLAYKIAQESIVLLKNESN 396

Query: 388 ALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR-------------YTSPM-DGFYA- 432
            LPL   +++++A++GP+A+  + MIG+Y   PC              + +P+ +G  A 
Sbjct: 397 LLPLKK-DLQSIAVIGPNADNIRNMIGDY-AYPCHIESLLEMREKDNVFNTPLPEGLEAK 454

Query: 433 -------------------YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG- 472
                               +KVI YA GC D++  + +    A++ AK AD  ++V G 
Sbjct: 455 DIYVPIVSVLQGIKEKVSPKTKVI-YAKGC-DVISDDTAGFNKAVEVAKQADVAIVVVGD 512

Query: 473 ----LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
                D     E +DR DL LPG Q ELI  V +    PV +V+++   + I++     K
Sbjct: 513 RAGLTDGCTSGESRDRADLNLPGVQEELIKAVYETGT-PVIVVLINGRPMSISWIAE--K 569

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPG 587
           I +I+    PGEEGGRAIADVIFG YNPGG+LPI+       + + Y   P     N+ G
Sbjct: 570 IPAIIEAWLPGEEGGRAIADVIFGDYNPGGKLPISIPRSVGQLPVYYYHKPSGGRTNWKG 629

Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
              +    P +YPFGYGLSYT+F Y                      N ++   K     
Sbjct: 630 DYVESSTKP-LYPFGYGLSYTEFLYS---------------------NLSISHPKVATQG 667

Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAG 706
            +I+             +V+N+GK+ G EVV +Y     ++ T  +K++ G++R+ +  G
Sbjct: 668 GIIE----------ISADVKNIGKVKGDEVVQLYIHREFLSVTRPVKELKGFKRITLDVG 717

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +   V F +++ + L   +     ++  G   +++G
Sbjct: 718 EQKTVIFQLSS-EQLGFYNEEMEYVVEPGRVEVMIG 752


>gi|146298537|ref|YP_001193128.1| glycoside hydrolase family 3 protein [Flavobacterium johnsoniae
           UW101]
 gi|146152955|gb|ABQ03809.1| Candidate beta-glycosidase; Glycoside hydrolase family 3
           [Flavobacterium johnsoniae UW101]
          Length = 745

 Score =  274 bits (700), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 232/767 (30%), Positives = 351/767 (45%), Gaps = 141/767 (18%)

Query: 28  LVERMTLPEKVQQM-GDLAY---GVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
           L+ +MTL EK+  + G+  +   GV RLG+P  +     L     I R   +P G   D 
Sbjct: 53  LISQMTLEEKIGMLHGNSMFANAGVKRLGIPELKMADGPLGVREEISRDNWAPAGWTNDF 112

Query: 84  EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRW 143
               AT +P      A++N  +    G ++  E RA            SP IN+VR P  
Sbjct: 113 ----ATYYPAGGALAATWNAEMAHTFGTSLGEELRA-----RDKDMLLSPAINMVRTPLG 163

Query: 144 GRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWE 203
           GR  E   EDP++  + A+  V GLQ+ +              + AC KHYAA   +N E
Sbjct: 164 GRTYEYMSEDPFLNKKIAVPLVVGLQEKD--------------VMACVKHYAA---NNQE 206

Query: 204 GNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIR 263
            N  F  D ++ E+ ++E ++  FE  V E    S+M +YN+  G   C +  +LN+ +R
Sbjct: 207 TNRDF-VDVQIDERTLREIYLPAFEATVKEAKAYSIMGAYNKFRGEYLCENDYMLNKILR 265

Query: 264 GDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD-------YYTNF 316
            +W F G +VSD  ++ +                A+ LK GLD++ G        +  + 
Sbjct: 266 DEWGFKGVVVSDWAAVHS---------------TAKSLKNGLDIEMGTPKPFNEFFLADK 310

Query: 317 TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAAR 376
            + AV+ G+++E +ID  ++ +  VL ++    G  +     K +I    H + A + A 
Sbjct: 311 LIAAVKSGEVSEKEIDLHVKRILRVLFQVKAMGGGER----AKGSIATEAHYQDAYKIAA 366

Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC-RYTSPMDGF---YA 432
           + I+LLKN+N ALPL    +K++A++G +A    A+ G   G    R  +P++G      
Sbjct: 367 EAIILLKNENNALPLKLDGVKSIAVIGNNATKKNALGGFGAGVKTKREVTPLEGLKNRLP 426

Query: 433 YSKVINYAPGCADIVCQNN--------------------SMIPAAIDAAKNADATVIVAG 472
            S  INYA G  +   + N                    + +  A++AAK +D  +I AG
Sbjct: 427 SSVKINYAEGYLEKYEEKNKGNLGNITSTGPVTIDKLDPAKVQEAVEAAKKSDVAIIFAG 486

Query: 473 LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-VDINFAKNNPKIKS 531
            +   E E  DR DL LP  Q ELI KV +A   P T+V+M AGA  D+N  + + K  +
Sbjct: 487 SNRDYETEASDRRDLHLPFGQEELIKKVIEA--NPKTIVVMIAGAPFDLN--EVSQKSSA 542

Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRT-- 589
           ++W  + G EGG A+ADVI GK NP G+LP  W     +K      P    N+FPG    
Sbjct: 543 LVWSWFNGSEGGNALADVILGKVNPSGKLP--WTMPKQLK----DSPAHATNSFPGDKAV 596

Query: 590 ---------YKFFDGPVV---YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
                    Y++FD   V   YPFGYGLSYT F    A        K DKD   +     
Sbjct: 597 NYAEGILIGYRWFDTKNVAPLYPFGYGLSYTTFALDNA--------KTDKDSYAQ----- 643

Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI- 696
                        +DV          ++V+N GK+DG EVV +Y+       T   Q + 
Sbjct: 644 -------------NDV------IEVTVDVKNTGKVDGKEVVQLYTSKSDSKITRAAQELK 684

Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS-LLASGAHTILVG 742
           G+++  + AG S K+   +   K L   D AA    +  G +TI +G
Sbjct: 685 GFKKADVKAGGSEKITIKV-PVKELAYYDVAAKKWTVEPGKYTIKLG 730


>gi|423226625|ref|ZP_17213090.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392628884|gb|EIY22909.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 863

 Score =  274 bits (700), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 172/456 (37%), Positives = 240/456 (52%), Gaps = 49/456 (10%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY +  L   ERA DLV R+TL EK   M + +  +PRLG+  Y+WW+EALHGV   G  
Sbjct: 25  PYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALHGVGRAGL- 83

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN------- 125
                          AT FP  I   ASFN  L   +   VS EARA     +       
Sbjct: 84  ---------------ATVFPQAIGMGASFNNELLYDVFTAVSDEARAKNTEFSKEGGLKR 128

Query: 126 -AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLT W+PNIN+ RDPRWGR  ET GEDPY+ G+  +  VRGLQ  EG +Y        
Sbjct: 129 YQGLTMWTPNINIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEGEKYD------- 181

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            K+ AC KHYA +    W   +R  F++  +  +D+ ET++  F+  V +  V  VMC+Y
Sbjct: 182 -KLHACAKHYAVHSGPEW---NRHSFNAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAY 237

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLND-TKEDAVARVLK 302
           NR  G P C   +LL Q +R +W +   +VSDC +I           D  K+ A A+ + 
Sbjct: 238 NRFEGEPCCGSNRLLMQILRDEWGYKEIVVSDCWAISDFYNKDAHETDPDKQHASAKAVL 297

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           +G D++CGD Y +    AV++G I E  ID SL+ L      LG  D   Q  +  +  +
Sbjct: 298 SGTDVECGDSYASLPE-AVKEGLIDEKQIDISLKRLMKARFELGEMDEPSQVSWAQIPYS 356

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + + +H ELA   AR+ +VLL+N+   LPLN  N+K +A+VGP+AN +    GNY G P
Sbjct: 357 VVDSKEHRELALRMARESLVLLQNNQSLLPLNK-NLK-VAVVGPNANDSVMQWGNYNGFP 414

Query: 421 CRYTSPMDGFYAY---SKVINYAPGC---ADIVCQN 450
               + ++G   Y   S++I Y PGC   +D+  Q+
Sbjct: 415 SHTITLLEGIREYLPESQII-YEPGCDLTSDVTLQS 449



 Score =  117 bits (293), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 90/300 (30%), Positives = 136/300 (45%), Gaps = 53/300 (17%)

Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
           +   +D  K AD  +   G+  +VE E          G DR  + LP  Q+ L+ ++  A
Sbjct: 590 LKQTVDKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKA 649

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K    +V ++     I     +    +IL   YPG+ GG AIA+V+FG YNP GRLP+T
Sbjct: 650 GK---KIVFVNFSGSAIALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVT 706

Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
           +Y++         +P     +  GRTY++     ++PFG+GLSYT F+Y  AS   S +I
Sbjct: 707 FYKST------KQLPDFEDYSMKGRTYRYMTENPLFPFGHGLSYTTFQYGNASLNTS-EI 759

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
           K D +Q                               T  I V N GK DG EVV VY +
Sbjct: 760 K-DGEQ------------------------------VTLTIPVSNTGKYDGEEVVQVYLR 788

Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
            PG        +  ++RV IA G +  V   ++  ++ +  D + N++    G + IL G
Sbjct: 789 HPGDKEGPSHALRAFKRVAIAKGATNNVTIPLSK-ENFEWFDTSTNTMRPIEGDYEILYG 847


>gi|224537384|ref|ZP_03677923.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521009|gb|EEF90114.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 863

 Score =  274 bits (700), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 172/456 (37%), Positives = 240/456 (52%), Gaps = 49/456 (10%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY +  L   ERA DLV R+TL EK   M + +  +PRLG+  Y+WW+EALHGV   G  
Sbjct: 25  PYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALHGVGRAGL- 83

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN------- 125
                          AT FP  I   ASFN  L   +   VS EARA     +       
Sbjct: 84  ---------------ATVFPQAIGMGASFNNELLYDVFTAVSDEARAKNTEFSKEGGLKR 128

Query: 126 -AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLT W+PNIN+ RDPRWGR  ET GEDPY+ G+  +  VRGLQ  EG +Y        
Sbjct: 129 YQGLTMWTPNINIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEGEKYD------- 181

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            K+ AC KHYA +    W   +R  F++  +  +D+ ET++  F+  V +  V  VMC+Y
Sbjct: 182 -KLHACAKHYAVHSGPEW---NRHSFNAENIDPRDLWETYLPAFKNLVQKAHVKEVMCAY 237

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLND-TKEDAVARVLK 302
           NR  G P C   +LL Q +R +W +   +VSDC +I           D  K+ A A+ + 
Sbjct: 238 NRFEGEPCCGSNRLLMQILRDEWGYKEIVVSDCWAISDFYNKGAHETDPDKQHASAKAVL 297

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           +G D++CGD Y +    AV++G I E  ID SL+ L      LG  D   Q  +  +  +
Sbjct: 298 SGTDVECGDSYASLPE-AVKEGLIDEKQIDISLKRLMKARFELGEMDEPSQVSWAQIPYS 356

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + + +H ELA   AR+ +VLL+N+   LPLN  N+K +A+VGP+AN +    GNY G P
Sbjct: 357 VVDSKEHRELALRMARESLVLLQNNQSLLPLNK-NLK-VAVVGPNANDSVMQWGNYNGFP 414

Query: 421 CRYTSPMDGFYAY---SKVINYAPGC---ADIVCQN 450
               + ++G   Y   S++I Y PGC   +D+  Q+
Sbjct: 415 SHTITLLEGIREYLPESQII-YEPGCDLTSDVTLQS 449



 Score =  117 bits (293), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 90/300 (30%), Positives = 136/300 (45%), Gaps = 53/300 (17%)

Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
           +   +D  K AD  +   G+  +VE E          G DR  + LP  Q+ L+ ++  A
Sbjct: 590 LKQTVDKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKA 649

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K    +V ++     I     +    +IL   YPG+ GG AIA+V+FG YNP GRLP+T
Sbjct: 650 GK---KIVFVNFSGSAIALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVT 706

Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
           +Y++         +P     +  GRTY++     ++PFG+GLSYT F+Y  AS   S +I
Sbjct: 707 FYKST------KQLPDFEDYSMKGRTYRYMTENPLFPFGHGLSYTTFQYGNASLNTS-EI 759

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
           K D +Q                               T  I V N GK DG EVV VY +
Sbjct: 760 K-DGEQ------------------------------VTLTIPVSNTGKYDGEEVVQVYLR 788

Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
            PG        +  ++RV IA G +  V   ++  ++ +  D + N++    G + IL G
Sbjct: 789 HPGDKEGPSHALRAFKRVAIAKGATNNVTIPLSK-ENFEWFDTSTNTMRPIEGDYEILYG 847


>gi|265765465|ref|ZP_06093740.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
 gi|263254849|gb|EEZ26283.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
          Length = 814

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 235/813 (28%), Positives = 356/813 (43%), Gaps = 153/813 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
           Y +   P   R + L+ +MTL EKV QM      +  LG P+YE                
Sbjct: 49  YENPSAPVEYRVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEDIRLTPQLEKEI 102

Query: 58  ----------------WWSEALH---GVSFIGRRTNSPPG---THFDSEVP--------- 86
                           W    LH     S   R +N        H    +P         
Sbjct: 103 GEYHIGSLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPH 162

Query: 87  -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
                G T FPT I   +++N  L +++G+ ++ EA A           + P +++ RDP
Sbjct: 163 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDP 217

Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
           RW RV ET GEDPY+ G      VRG Q     E   D  S    + A  KH+A+Y    
Sbjct: 218 RWSRVEETYGEDPYLNGAMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---G 266

Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
           W         + + E++++E    PF   V  G + SVM SYN ++G P      LL   
Sbjct: 267 WTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDI 325

Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
           ++  W F G++VSD  ++  + E     ND   +A  + + AG+D D G + Y    + A
Sbjct: 326 LKDRWQFKGFVVSDLYAVGGLREHGVAGNDY--EAAIKAVNAGVDSDLGTNVYAEQLVAA 383

Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
           V++G +A A ID ++R +  +  ++G FD     +      + + +H  LA E ARQ IV
Sbjct: 384 VKRGDVAVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIV 443

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
           LLKN +  LPL   +I+TLA++GP+A+    M+G+Y     +GT       +    +   
Sbjct: 444 LLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKET 502

Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
            + YA GCA +   + +    AI+ A+NADA V+V G     D S E             
Sbjct: 503 RVLYAKGCA-VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQ 561

Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
                  EG DR  L L G Q EL+ +++   K PV LV++    + +  A    +    
Sbjct: 562 ISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVD 620

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
            W  YPG +GG A+ADV+FG YNP GRL ++      V      +P+       G   ++
Sbjct: 621 AW--YPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTRRKGNRSRY 672

Query: 593 FDGPVV--YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
            + P    YPFGYGLSYT F Y         D+K         +  T G++         
Sbjct: 673 IEEPGTPRYPFGYGLSYTTFSY--------TDMK---------VQVTEGSD--------- 706

Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSA 709
                 D +    + ++N G  DG EV  +Y +    +  T  KQ+  + R+ + AG+S 
Sbjct: 707 ------DCRVDVTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESR 760

Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +V FT++  KSL +       ++  G  TI+VG
Sbjct: 761 EVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 792


>gi|167521708|ref|XP_001745192.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776150|gb|EDQ89770.1| predicted protein [Monosiga brevicollis MX1]
          Length = 614

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 189/586 (32%), Positives = 282/586 (48%), Gaps = 73/586 (12%)

Query: 48  VPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWK 107
           V R+GLP Y+W   A+HGV     + +       D  V   TSFP  +    ++N S + 
Sbjct: 72  VSRIGLPEYDWGMNAIHGVQSSCIKDD-------DGTVYCPTSFPNPVNYGFTWNYSAYL 124

Query: 108 KIGQTVSTEARAMYNLG-----------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYV 156
           ++G+ +  E RA++  G           + GL  WSPNIN+ R P WGR  E PGEDP++
Sbjct: 125 ELGRIIGVETRALWLAGAVEASTWSGRPHIGLDTWSPNINIARSPLWGRNQEVPGEDPFM 184

Query: 157 VGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTE 216
            G++   Y  GLQ           D   L+     KH+ AY L++ +G  R +F++ V+ 
Sbjct: 185 NGQFGKAYTLGLQG---------DDDTYLQAIVTLKHWDAYSLEDSDGATRHNFNAIVSN 235

Query: 217 QDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDC 276
             + +T+   F + V EG    VMCSYN VNGIPTCA P LL   +R  W F GY+ SD 
Sbjct: 236 FSLMDTYWPAFRVAVTEGKAKGVMCSYNAVNGIPTCAHP-LLRTVLRDLWKFDGYVSSDT 294

Query: 277 DSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLR 336
            +++ I ++HK+       A A +     D+D G  Y    +  V +G     D+D +LR
Sbjct: 295 GAVEDISDNHKYTPSWATAACAAIRDGQTDIDSGAVYMKSLLQGVSEGHCRMEDVDNALR 354

Query: 337 FLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAA---EAAR--------QGIVLLKND 385
               +   LG FD                 H+ LAA    A+R        + +VLL+N 
Sbjct: 355 NTLRLRFELGLFDPVENQSYW---------HVPLAAVNTNASRATNMLHTLESMVLLQNK 405

Query: 386 NGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR------YTSPMDGFYAY--SKVI 437
           N  LPL   N K +AL+GPHA A + M+GNY G  C         SP D   +   +  +
Sbjct: 406 NNVLPL-ASNTK-VALIGPHAKAQEDMVGNYLGQLCPDNNFDCVVSPHDALVSILGTDAV 463

Query: 438 NYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELI 497
            YAPG     C + S I  A+  A  AD  V++ G+D S+EAE  DR  + LP  Q +L 
Sbjct: 464 TYAPGTNVTTC-SQSHIDEAVSVATAADVAVLMLGIDESIEAESNDRKSIDLPECQHQLA 522

Query: 498 NKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG 557
           + +    K P  +V+++ G + I   K   +  +I+  GYPG  GG AIA  + G+    
Sbjct: 523 SAIFAVGK-PTVIVLLNGGMLAIENEKQ--QADAIIEAGYPGFYGGTAIAQTLTGQNEHL 579

Query: 558 GRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
           G         +Y+   + +M    + + PGRTY+++    ++ F +
Sbjct: 580 G---------DYIN--WINMSDMEMTSGPGRTYRYYKNETLWAFHF 614


>gi|423258860|ref|ZP_17239783.1| hypothetical protein HMPREF1055_02060 [Bacteroides fragilis
           CL07T00C01]
 gi|423264169|ref|ZP_17243172.1| hypothetical protein HMPREF1056_00859 [Bacteroides fragilis
           CL07T12C05]
 gi|387776440|gb|EIK38540.1| hypothetical protein HMPREF1055_02060 [Bacteroides fragilis
           CL07T00C01]
 gi|392706435|gb|EIY99558.1| hypothetical protein HMPREF1056_00859 [Bacteroides fragilis
           CL07T12C05]
          Length = 805

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 235/813 (28%), Positives = 356/813 (43%), Gaps = 153/813 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
           Y +   P   R + L+ +MTL EKV QM      +  LG P+YE                
Sbjct: 40  YENPSAPVEYRVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEDIRLTPQLEKEI 93

Query: 58  ----------------WWSEALH---GVSFIGRRTNSPPG---THFDSEVP--------- 86
                           W    LH     S   R +N        H    +P         
Sbjct: 94  GEYHIGSLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPH 153

Query: 87  -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
                G T FPT I   +++N  L +++G+ ++ EA A           + P +++ RDP
Sbjct: 154 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDP 208

Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
           RW RV ET GEDPY+ G      VRG Q     E   D  S    + A  KH+A+Y    
Sbjct: 209 RWSRVEETYGEDPYLNGAMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---G 257

Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
           W         + + E++++E    PF   V  G + SVM SYN ++G P      LL   
Sbjct: 258 WTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDI 316

Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
           ++  W F G++VSD  ++  + E     ND   +A  + + AG+D D G + Y    + A
Sbjct: 317 LKDRWQFKGFVVSDLYAVGGLREHGVAGNDY--EAAIKAVNAGVDSDLGTNVYAEQLVAA 374

Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
           V++G +A A ID ++R +  +  ++G FD     +      + + +H  LA E ARQ IV
Sbjct: 375 VKRGDVAVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIV 434

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
           LLKN +  LPL   +I+TLA++GP+A+    M+G+Y     +GT       +    +   
Sbjct: 435 LLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKET 493

Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
            + YA GCA +   + +    AI+ A+NADA V+V G     D S E             
Sbjct: 494 RVLYAKGCA-VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQ 552

Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
                  EG DR  L L G Q EL+ +++   K PV LV++    + +  A    +    
Sbjct: 553 ISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVD 611

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
            W  YPG +GG A+ADV+FG YNP GRL ++      V      +P+       G   ++
Sbjct: 612 AW--YPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTRRKGNRSRY 663

Query: 593 FDGPVV--YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
            + P    YPFGYGLSYT F Y         D+K         +  T G++         
Sbjct: 664 IEEPGTPRYPFGYGLSYTTFSY--------TDMK---------VQVTEGSD--------- 697

Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSA 709
                 D +    + ++N G  DG EV  +Y +    +  T  KQ+  + R+ + AG+S 
Sbjct: 698 ------DCRVDVTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESR 751

Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +V FT++  KSL +       ++  G  TI+VG
Sbjct: 752 EVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 783


>gi|312621303|ref|YP_004022916.1| glycoside hydrolase family 3 domain-containing protein
           [Caldicellulosiruptor kronotskyensis 2002]
 gi|312201770|gb|ADQ45097.1| glycoside hydrolase family 3 domain protein [Caldicellulosiruptor
           kronotskyensis 2002]
          Length = 770

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 211/709 (29%), Positives = 341/709 (48%), Gaps = 111/709 (15%)

Query: 87  GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
           GAT FP  I    +F+  + +++ + + T+ +A+           +P I+V RD RWGRV
Sbjct: 102 GATVFPQSIGVACTFDNEIVEELAKVIKTQMKAV-----GAHQALAPLIDVARDARWGRV 156

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD----NW 202
            ET GEDPY+V   A++YV+G+Q           D     I A  KH+  Y +     NW
Sbjct: 157 EETFGEDPYLVANMAVSYVKGIQ----------GDDIKDGIVATGKHFVGYAMSEGGMNW 206

Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
                      + E++++E ++ PFE+ V    + S+M +Y+ ++GIP  A+ KLL    
Sbjct: 207 A-------PVHIPERELREVYLYPFEVAVKVAGLKSIMPAYHEIDGIPCHANRKLLTDIA 259

Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG--DYYTNFTMGA 320
           RG+W F G  VSD   ++ I++ HK +  T  +A    L AGLD++    + +T   + A
Sbjct: 260 RGEWGFDGIFVSDYAGVRNILDYHKAVK-TYAEAAYISLWAGLDIELPKIECFTEEFIKA 318

Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC-NPQHIELAAEAARQGI 379
           +++GK   A +D +++ +  +  RLG FD +P  K  G   +  N +  EL+ + A++ +
Sbjct: 319 LKEGKFDMAVVDAAVKRVLEMKFRLGLFD-NPYIKTEGILELFDNKEQRELSRKVAQESM 377

Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYS----- 434
           VLLKNDN  LPL + ++K +A++GP+A++ + ++G+Y   P  + + ++ F+        
Sbjct: 378 VLLKNDN-FLPL-SNDVKKIAVIGPNADSVRNLLGDY-SYPA-HIATLEMFFIKEDKGVG 433

Query: 435 -------KVIN-------------------YAPGCADIVCQNNSMIPAAIDAAKNADATV 468
                  KVIN                   YA GC D+  Q+ S    A  AA+ AD  +
Sbjct: 434 NEEEFVRKVINIKSILEAIKDRVQNKAEVVYAKGC-DVNNQDESGFEEAKKAAQGADVVI 492

Query: 469 IV----AGLDLS-VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFA 523
           +V    AGL L     E +DR  L LPG Q +LI +V+   +    +V++      +   
Sbjct: 493 LVVGDKAGLRLDCTSGESRDRASLKLPGVQEKLIEEVSKVNE---NIVVVLVNGRPVALE 549

Query: 524 KNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPV 582
               K K+IL   +PGEEG  A+ADV+FG YNPGG+L I++  +   V + Y   P    
Sbjct: 550 GIWQKAKAILEAWFPGEEGAEAVADVLFGDYNPGGKLAISFPRDVGQVPVYYGHKPSGGK 609

Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
           + + G   +    P + PFGYGLSYT F+YK                     N+ +   K
Sbjct: 610 SCWHGDYVEMSTKPFL-PFGYGLSYTTFEYK---------------------NFAIEKEK 647

Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERV 701
                         D      +EVEN GK  G E+V +Y++      T  +K++  Y+RV
Sbjct: 648 ISM-----------DESIKISVEVENTGKYAGDEIVQLYTRKEEFLVTRPVKELKAYKRV 696

Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
            +  G+  KV F +         D   N +++ G   ++VG     + F
Sbjct: 697 HLKPGEKKKVVFEIFP-DQFAYYDYDMNRVISPGTVEVMVGASSEDIKF 744


>gi|423281958|ref|ZP_17260843.1| hypothetical protein HMPREF1204_00381 [Bacteroides fragilis HMW
           615]
 gi|404582445|gb|EKA87139.1| hypothetical protein HMPREF1204_00381 [Bacteroides fragilis HMW
           615]
          Length = 805

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 235/813 (28%), Positives = 356/813 (43%), Gaps = 153/813 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
           Y +   P   R + L+ +MTL EKV QM      +  LG P+YE                
Sbjct: 40  YENPSAPVEYRVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEDIRLTPQLEKEI 93

Query: 58  ----------------WWSEALH---GVSFIGRRTNSPPG---THFDSEVP--------- 86
                           W    LH     S   R +N        H    +P         
Sbjct: 94  GEYHIGSLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPH 153

Query: 87  -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
                G T FPT I   +++N  L +++G+ ++ EA A           + P +++ RDP
Sbjct: 154 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDP 208

Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
           RW RV ET GEDPY+ G      VRG Q     E   D  S    + A  KH+A+Y    
Sbjct: 209 RWSRVEETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---G 257

Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
           W         + + E++++E    PF   V  G + SVM SYN ++G P      LL   
Sbjct: 258 WTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDI 316

Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
           ++  W F G++VSD  ++  + E     ND   +A  + + AG+D D G + Y    + A
Sbjct: 317 LKDRWQFKGFVVSDLYAVGGLREHGVAGNDY--EAAIKAVNAGVDSDLGTNVYAEQLVAA 374

Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
           V++G +A A ID ++R +  +  ++G FD     +      + + +H  LA E ARQ IV
Sbjct: 375 VKRGDVAVATIDKAVRRILSLKFQMGLFDDPFVDEKQAAQLVASSEHTGLAREVARQSIV 434

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
           LLKN +  LPL   +I+TLA++GP+A+    M+G+Y     +GT       +    +   
Sbjct: 435 LLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKET 493

Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
            + YA GCA +   + +    AI+ A+NADA V+V G     D S E             
Sbjct: 494 RVLYAKGCA-VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQ 552

Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
                  EG DR  L L G Q EL+ +++   K PV LV++    + +  A    +    
Sbjct: 553 ISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVD 611

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
            W  YPG +GG A+ADV+FG YNP GRL ++      V      +P+       G   ++
Sbjct: 612 AW--YPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTRRKGNRSRY 663

Query: 593 FDGPVV--YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
            + P    YPFGYGLSYT F Y         D+K         +  T G++         
Sbjct: 664 IEEPGTPRYPFGYGLSYTTFSY--------TDMK---------VQVTEGSD--------- 697

Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSA 709
                 D +    + ++N G  DG EV  +Y +    +  T  KQ+  + R+ + AG+S 
Sbjct: 698 ------DCRVDVTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESR 751

Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +V FT++  KSL +       ++  G  TI+VG
Sbjct: 752 EVTFTLDK-KSLALYMQEGEWVVEPGLFTIMVG 783


>gi|374312362|ref|YP_005058792.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
 gi|358754372|gb|AEU37762.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
          Length = 874

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 170/482 (35%), Positives = 246/482 (51%), Gaps = 53/482 (10%)

Query: 24  RAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
           R  +L+ +MT+ E++ Q+ D A  + RLGLP Y WW+E LHG++  G             
Sbjct: 38  RIDELIAKMTVSERIAQLQDRAPAIERLGLPSYNWWNEGLHGLARDGY------------ 85

Query: 84  EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA-MYNLGN------AGLTFWSPNIN 136
               AT FP  I   A+++  L  ++G  VSTEARA  Y+ G        GLT WSPNIN
Sbjct: 86  ----ATVFPQAIGLAATWDAPLLHEVGDVVSTEARAKFYSHGGENTPRFGGLTVWSPNIN 141

Query: 137 VVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAA 196
           + RDPRWGR  ET GEDP++       +V G+Q          +D   LK  A  KH+AA
Sbjct: 142 IFRDPRWGRGQETYGEDPFLTATLGTQFVEGVQG---------NDPFYLKADATPKHFAA 192

Query: 197 YDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPK 256
           +     EG D F  ++ V+  D+ +T++  F         +++MCSYN ++G P+CA   
Sbjct: 193 HSGPE-EGRDSF--NAVVSPHDLADTYLPAFHALTTNAHAAALMCSYNEIDGTPSCASGN 249

Query: 257 LLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNF 316
            L   +R  W F GY+VSDCD++  I   H F  D    A A  L AG+DLDCG+ Y   
Sbjct: 250 NLQDLVRERWGFKGYVVSDCDAVGNIAGYHHFATDNAHGA-ADALNAGVDLDCGNTYAAL 308

Query: 317 TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICNPQHIELAAEA 374
           +  ++ Q    EA ++ +L  L +  +RLG  D      Y+++G   + +P H  LA  A
Sbjct: 309 SK-SLDQNLTTEAKLNQALHRLLLARVRLGMLDPLSCSPYRDIGAEELDSPAHHTLALRA 367

Query: 375 ARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYS 434
           A + IVLLKND G LPL     K ++++GP A+  K +  NY GT     +P+DGF +  
Sbjct: 368 AEESIVLLKND-GVLPLQASTQK-VSVIGPTADMVKVLEANYHGTALHPITPLDGFRSRF 425

Query: 435 KVINYAPGCADIVCQNNSMIPAAIDA--AKNADATVIVAGLDLSVEAEGKDRVDLL-LPG 491
             ++YA G         S++   + A   +NA       G    ++AE  D+  L   P 
Sbjct: 426 HDVSYAQG---------SLLAEGVSAPVPRNALRVAAAPGSSAGLQAEYFDKASLEGTPA 476

Query: 492 FQ 493
           FQ
Sbjct: 477 FQ 478



 Score =  123 bits (309), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 88/304 (28%), Positives = 140/304 (46%), Gaps = 58/304 (19%)

Query: 452 SMIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVA 501
           +++  A+  A  +D  V   GL   +E E          G DR  L LP  Q  L++++ 
Sbjct: 593 ALLDQAVQTAAKSDVIVAFVGLSPDLEGEALQLRLKGFNGGDRTSLDLPEAQRTLLSRLT 652

Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIK---SILWVGYPGEEGGRAIADVIFGKYNPGG 558
              K PV +V+ S   V +      P+ K    +L   YPGE GG A+A ++ G  NP G
Sbjct: 653 QLHK-PVIIVLTSGSGVALG-----PEAKDAAGVLEAWYPGEAGGEALAGILAGNVNPSG 706

Query: 559 RLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSP 618
           RLP+T+Y +         +P     +   RTY++FDGPV++PFGYGLSY+ F+Y      
Sbjct: 707 RLPVTFYRS------VDDLPAFTDYSMAHRTYRYFDGPVLFPFGYGLSYSHFQYG----- 755

Query: 619 KSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVV 678
                      Q R   + + T++P  A V                 V N  + +G+EV 
Sbjct: 756 -----------QLRLSTHMLKTSEPLVAMV----------------TVHNESQREGTEVA 788

Query: 679 MVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHT 738
            +Y +PP  +G     + G +RV +  G++ ++ F + A   L  VD +    + +G + 
Sbjct: 789 ELYLQPPQASGAPRLTLQGVQRVALRPGETRELTFKL-APGQLSTVDTSGARTVRAGEYK 847

Query: 739 ILVG 742
           + VG
Sbjct: 848 LFVG 851


>gi|375357172|ref|YP_005109944.1| putative beta-glucosidase [Bacteroides fragilis 638R]
 gi|301161853|emb|CBW21397.1| putative beta-glucosidase [Bacteroides fragilis 638R]
          Length = 814

 Score =  273 bits (698), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 236/813 (29%), Positives = 358/813 (44%), Gaps = 153/813 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
           Y +   P   R + L+ +MTL EKV QM      +  LG P+YE                
Sbjct: 49  YENPSAPVEYRVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEDIRLTPQLEKEI 102

Query: 58  ----------------WWSEALH-GV--SFIGRRTNSPPG---THFDSEVP--------- 86
                           W    LH G+  S   R +N        H    +P         
Sbjct: 103 GEYHIGSLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPH 162

Query: 87  -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
                G T FPT I   +++N  L +++G+ ++ EA A           + P +++ RDP
Sbjct: 163 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDP 217

Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
           RW RV ET GEDPY+ G      VRG Q     E   D  S    + A  KH+A+Y    
Sbjct: 218 RWSRVEETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---G 266

Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
           W         + + E++++E    PF   V  G + SVM SYN ++G P      LL   
Sbjct: 267 WTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDI 325

Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
           ++  W F G++VSD  ++  + E     ND   +A  + + AG+D D G + Y    + A
Sbjct: 326 LKDRWQFKGFVVSDLYAVGGLREHGVAGNDY--EAAIKAVNAGVDSDLGTNVYAEQLVAA 383

Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
           V++G +A A ID ++R +  +  ++G FD     +      + + +H  LA E ARQ IV
Sbjct: 384 VKRGDVAVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIV 443

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
           LLKN +  LPL   +I+TLA++GP+A+    M+G+Y     +GT       +    +   
Sbjct: 444 LLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKET 502

Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
            + YA GCA +   + +    AI+ A+NADA V+V G     D S E             
Sbjct: 503 RVLYAKGCA-VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQ 561

Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
                  EG DR  L L G Q EL+ +++   K PV LV++    + +  A    +    
Sbjct: 562 ISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVD 620

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
            W  YPG +GG A+ADV+FG YNP GRL ++      V      +P+       G   ++
Sbjct: 621 AW--YPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTRRKGNRSRY 672

Query: 593 FDGPVV--YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
            + P    YPFGYGLSYT F Y         D+K         +  T G++         
Sbjct: 673 IEEPGTPRYPFGYGLSYTTFSY--------TDMK---------VQVTEGSD--------- 706

Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSA 709
                 D +    + ++N G  DG EV  +Y +    +  T  KQ+  + R+ + AG+S 
Sbjct: 707 ------DCRVDVTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESR 760

Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +V FT++  KSL +       ++  G  TI+VG
Sbjct: 761 EVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 792


>gi|189468358|ref|ZP_03017143.1| hypothetical protein BACINT_04755 [Bacteroides intestinalis DSM
           17393]
 gi|189436622|gb|EDV05607.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 865

 Score =  273 bits (698), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 162/451 (35%), Positives = 245/451 (54%), Gaps = 43/451 (9%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY +  L   ERA DL++RMTL EK+ QM + +  + RLG+P Y+WW+EALHGV+  G+ 
Sbjct: 24  PYRNPNLSPSERAWDLLKRMTLEEKISQMKNGSPAIERLGIPAYDWWNEALHGVARAGK- 82

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LG 124
                          AT FP  I   A+F+     +    VS EARA Y+         G
Sbjct: 83  ---------------ATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHDFQRKGERGG 127

Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLTFW+PNIN+ RDPRWGR +ET GEDPY+     +  V+GLQ     +Y        
Sbjct: 128 YKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGNGAGKYD------- 180

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            K  AC KHYA +    W   +R  FDS+ ++++D+ ET++  F+  V EG V  VMC+Y
Sbjct: 181 -KAHACAKHYAVHSGPEW---NRHSFDSKNISQRDLWETYLPAFKTLVTEGKVKEVMCAY 236

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI-VESHKFLNDTKEDAVARVLK 302
           NR  G P C++ +LL + +R DW +   +VSDC +I      +H   + + E A A  + 
Sbjct: 237 NRFEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPSAEAASADAVV 296

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKN 360
           +G DL+CG  Y++    AV++G I E  I+ S+  L     +LG FD      +  +  +
Sbjct: 297 SGTDLECGGSYSSLNE-AVKKGLITEDKINESVFRLLRARFQLGMFDDDTLVSWSEIPYS 355

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + + +H++ A E AR+ +VLL N N +LPL+  +I+ +A++GP+AN +  +  NY G P
Sbjct: 356 VVESKEHVDKALEMARKSMVLLTNKNNSLPLSK-SIRKVAVLGPNANDSVMLWANYNGFP 414

Query: 421 CRYTSPMDGFYAY--SKVINYAPGCADIVCQ 449
            +  + ++G  +      + Y  GC  +  Q
Sbjct: 415 TKSVTILEGIRSKLPEGAVYYEKGCDFVSTQ 445



 Score =  126 bits (316), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 91/294 (30%), Positives = 138/294 (46%), Gaps = 53/294 (18%)

Query: 468 VIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA 517
           + V GL  ++E E            DR ++ LP  Q E++  +    K PV  V+ S   
Sbjct: 605 IFVGGLSSALEGEEMPVDLPGFKKGDRTNIDLPRVQEEMLKALKKTGK-PVIFVVCSGST 663

Query: 518 VDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSM 577
           + + +   N  + ++L   YPG++GG A+ADV+FG YNP GRLP+T+Y ++      + +
Sbjct: 664 LALPWEAEN--LDAMLEAWYPGQQGGTAVADVLFGDYNPAGRLPLTFYASD------SDL 715

Query: 578 PLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
           P     N   RTY++F G  ++PFGYGLSYT F Y  A        K+DK          
Sbjct: 716 PDFEDYNMSNRTYRYFKGKPLFPFGYGLSYTTFDYGKA--------KVDKKS-------- 759

Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIG 697
                          +K  D   T  I ++N GKMDG EVV VY + P      IK +  
Sbjct: 760 ---------------IKTGD-SMTLTIPLKNTGKMDGDEVVQVYLRNPADKEGPIKMLRA 803

Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVGEGVGGVSF 750
           + RV + AGQ+  +   + A  + +  + A N + +  G + +L G    G S 
Sbjct: 804 FRRVSLKAGQAENIQIELPAS-TFECFNPATNRMEILPGNYELLYGGTSDGKSL 856


>gi|373951852|ref|ZP_09611812.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373888452|gb|EHQ24349.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 871

 Score =  273 bits (698), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 159/457 (34%), Positives = 241/457 (52%), Gaps = 52/457 (11%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           ++  + SD+PY +  L +  R  DLV+RMTL EKV QM + +  +PRL +P Y+WW+E L
Sbjct: 18  AVIAQTSDYPYQNYHLDFTTRVNDLVKRMTLEEKVSQMLNSSPAIPRLKIPAYDWWNEVL 77

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HGV+           T F       T +P  I   A+F+     ++    + E RA++N 
Sbjct: 78  HGVA----------RTPFK-----VTVYPQAIAMAATFDRQSLNQMADYAALEGRAVHNK 122

Query: 124 G---------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
                       GLT+W+PNIN+ RDPRWGR  ET GEDP++ G     +V GLQ     
Sbjct: 123 ALQMRKPGEKYLGLTYWTPNINIFRDPRWGRGQETYGEDPFLTGAMGSAFVSGLQ----- 177

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVN 232
                +D + LK +AC KHYA +      G +  R  F++ ++  D+ +T++  F+  V 
Sbjct: 178 ----GNDPKYLKAAACAKHYAVH-----SGPEPLRHVFNADISTYDLWDTYLPAFKKLVV 228

Query: 233 EGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDT 292
           +  V+ VMC+YN     P C    L+   +R  W F GY+ SDC  I    ++HK  + T
Sbjct: 229 DDKVAGVMCAYNAFKTQPCCGSDLLMVDILRNQWKFSGYVTSDCGGIDDFFKNHK-THAT 287

Query: 293 KEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP 352
            EDA    +  G D++CG       + AV++GKI+E  ID S++ L+++  RLG FD S 
Sbjct: 288 AEDASTDAVLHGTDIECGTDAYKSLVAAVKEGKISETQIDISVKRLFMIRFRLGMFDPSD 347

Query: 353 --QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATK 410
             +Y     + + +P+H   A + ARQ +VLLKN N  LPL+   I+ + ++GP+A+   
Sbjct: 348 VVKYAQTPVSVLESPEHQAHALKMARQSVVLLKNANHTLPLSK-TIRKIVVLGPNADNPI 406

Query: 411 AMIGNYEGTPCRYTSPMDGF--------YAYSKVINY 439
           A++GNY GTP   T+   G           Y K +N+
Sbjct: 407 AILGNYNGTPSNLTTVYQGIRQKLPQAEVVYEKAVNF 443



 Score =  116 bits (290), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 85/274 (31%), Positives = 128/274 (46%), Gaps = 53/274 (19%)

Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
           + A +    +ADA V V G+   +E E          G DR  + LP  QT L+ K   A
Sbjct: 593 VAALVKRVADADAIVYVGGISPQLEGEEMQVNYPGFNGGDRTSIQLPAAQTNLM-KTLQA 651

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
              PV  V+M+  A+   +   N  I +I+   Y G+  G A+ADV+FG YNP GRLP+T
Sbjct: 652 TGKPVVFVMMTGSALATPWEAEN--IPAIVNAWYGGQAAGTAVADVLFGDYNPAGRLPVT 709

Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
           +Y+++      T +P     +   RTY++F G  +Y FGYGLSYTQFKY     P +V  
Sbjct: 710 FYKSD------TDLPDFTDYSMTNRTYRYFKGIPLYGFGYGLSYTQFKYDKLIVPATV-- 761

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
                +  + I+ +V                           V N G++ G EVV +Y K
Sbjct: 762 -----KSGKAIHLSV--------------------------TVTNSGQIAGDEVVQIYMK 790

Query: 684 PPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTMN 716
                    +K + G+ RV++ AG+   + F ++
Sbjct: 791 HHSQRIKVPLKALKGFARVYLKAGERRTLNFILS 824


>gi|189467437|ref|ZP_03016222.1| hypothetical protein BACINT_03826 [Bacteroides intestinalis DSM
           17393]
 gi|189435701|gb|EDV04686.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 863

 Score =  273 bits (698), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 158/427 (37%), Positives = 226/427 (52%), Gaps = 38/427 (8%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           + + +LP  ER  DLV R+TL EK+ QM + A  + RLG+P Y WW+E LHGV+    R+
Sbjct: 26  FLNPELPIVERVNDLVGRLTLEEKISQMLNNAPAIDRLGIPAYNWWNECLHGVA----RS 81

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
             P            TSFP  I   A+++     ++    S E RA+Y+           
Sbjct: 82  PYP-----------VTSFPQAIAMAATWDTESVHQMAVYASDEGRAIYHDATRKGTPGIF 130

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT+WSPNIN+ RDPRWGR  ET GEDP++     +++V+GLQ           D   L
Sbjct: 131 RGLTYWSPNINIFRDPRWGRGQETYGEDPFLTASIGVSFVKGLQG---------DDPVYL 181

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K SAC KHYA +    W   +R  +D++V   D+ +T++  F+  V EG V+ VMC+YN 
Sbjct: 182 KSSACAKHYAVHSGPEW---NRHTYDAKVNNHDLWDTYLPAFKELVVEGKVTGVMCAYNS 238

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
             G P C +  L+   +R  W F GY+ SDC +++    +HK   D    +   VL  G 
Sbjct: 239 FFGQPCCGNDLLMMDILRNHWKFGGYVTSDCGAVEDFYNTHKTHQDAAAASADAVLH-GT 297

Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNIC 363
           D +CG+        AV +G I E  ID SL+ L+ +  RLG FD   +  Y N+  + + 
Sbjct: 298 DCECGNGAYRALADAVLRGLITEKQIDESLKKLFEIRFRLGMFDPDDRVPYSNIPLSVLE 357

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
              H   A + ARQ IVLLKN +  LPLN   IK +A+VGP+A+    ++ NY G P   
Sbjct: 358 CDAHKAHALKIARQSIVLLKNQDQLLPLNKNKIKKIAVVGPNADDKSVLLANYYGYPSHI 417

Query: 424 TSPMDGF 430
           T+ ++G 
Sbjct: 418 TTALEGI 424



 Score =  132 bits (331), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 93/295 (31%), Positives = 141/295 (47%), Gaps = 54/295 (18%)

Query: 460 AAKNADATVIVAGL-------DLSVEAEG---KDRVDLLLPGFQTELINKVADAAKGPVT 509
           A K+AD  + V GL       ++ VE EG    DR  + +P  Q  L+ ++    K PV 
Sbjct: 595 AVKDADVIIFVGGLSAKVEGEEMGVEIEGFKRGDRTSISIPSVQQNLLKELYATGK-PVV 653

Query: 510 LVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANY 569
            V+M+  A+ + +   +  + +IL   Y G+ GG+AIADV+FG YNP GRLP+T+Y++  
Sbjct: 654 FVMMTGSALGLEW--ESAHLPAILNAWYGGQAGGQAIADVLFGDYNPSGRLPLTFYKS-- 709

Query: 570 VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQ 629
                  +P     +   RTY++F G  VYPFGYGLSYT F+Y          +KL    
Sbjct: 710 ----VNDLPDFEDYSMENRTYRYFTGTPVYPFGYGLSYTTFQYS--------SLKLQPSP 757

Query: 630 QCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG 689
             R +  T                           ++ N GKM+G EV  +Y   P    
Sbjct: 758 DKRSVKVTA--------------------------KITNTGKMEGEEVAQLYVSNPRDFV 791

Query: 690 THIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
           T I+ + G++R+ +  G+S  V F + + K L +VD +  S+   G   I +G G
Sbjct: 792 TPIRALKGFKRINLKPGESQTVEFVLTS-KELSVVDISGKSVPMKGKVQISLGGG 845


>gi|402307522|ref|ZP_10826545.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
 gi|400378572|gb|EJP31427.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
          Length = 858

 Score =  273 bits (698), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 170/478 (35%), Positives = 247/478 (51%), Gaps = 42/478 (8%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           S+       PYC+  L   ERA+DL+ R+TL EK + M D +  +PRLG+  + WWSEAL
Sbjct: 14  SLSATAQLLPYCNPDLSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWSEAL 73

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN- 122
           HG + +G                G T FP  +   ASFN+ L +++    S E RA YN 
Sbjct: 74  HGAANMG----------------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNR 117

Query: 123 -LGNAG-------LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
            + N G       L+ W+PN+N+ RDPRWGR  ET GEDPY+        VRGLQ  E  
Sbjct: 118 RMLNGGEDEKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPETA 177

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
           +Y         K+ AC KHYA +    +  +     D  V+ +D+ ET++  F+  V E 
Sbjct: 178 KYR--------KLWACAKHYAVHSGPEYTRHTANVAD--VSPRDLWETYLPAFKTLVTEA 227

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
            V  VMC+Y R++  P C++ +LL Q +R +W F+  +VSDC ++  I  +HK  +D   
Sbjct: 228 KVREVMCAYQRLDDDPCCSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVH 287

Query: 295 DAVARVLKAGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP- 352
            A      AG D++CG  Y   T+  AV++G I EA++D  +  L      LG  D    
Sbjct: 288 AAAK-AAVAGTDVECGFGYAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMDDPKL 346

Query: 353 -QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKA 411
            ++  +  + + +  H +LA + ARQ +VLL+N  G LPL  G  + +A++GP+A+    
Sbjct: 347 VEWSKIPASVMDSKAHRQLALDMARQSLVLLQNKGGVLPLKAGG-EPIAVIGPNADDGPM 405

Query: 412 MIGNYEGTPCRYTSPMDGFYAYSKVINYAPGC--ADIVCQNNSMIPAAIDAAKNADAT 467
           M GNY GTP R  + +DG  A  K + Y  GC   D    N+ +   AID  K    T
Sbjct: 406 MWGNYNGTPNRTVTILDGIKARHKRVTYLKGCDLTDTKTVNSLLPQCAIDGRKGLRGT 463



 Score = 99.8 bits (247), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 74/286 (25%), Positives = 121/286 (42%), Gaps = 57/286 (19%)

Query: 456 AAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAK 505
           A I   +     V V G+  ++E E          G DR ++ LP  Q + +  + +A K
Sbjct: 591 AIIRKLQGIRKVVFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK 650

Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
              T+V ++     I          +IL   Y G+EGG A++DV+FG  NP G+LP+T+Y
Sbjct: 651 ---TVVFVNCSGSAIALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNPSGKLPVTFY 707

Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
           +       Y    +R      GRTY++F  P ++ FGYGLSYT F++  A +        
Sbjct: 708 KRTDQLPDYEDYSMR------GRTYRYFSDP-LFAFGYGLSYTTFRFGRAHA-------- 752

Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
                                       +  +  +   + + N G   G EVV VY +  
Sbjct: 753 ----------------------------EAAEGGYRLSVPLTNTGTRPGEEVVQVYIRRV 784

Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL 731
                 +K +  + RV + AG+S  V   ++  KS +  D + N++
Sbjct: 785 ADTNGPLKSLRAFRRVALKAGESTTVEIPLSR-KSFECFDESTNTM 829


>gi|383113360|ref|ZP_09934132.1| hypothetical protein BSGG_3064 [Bacteroides sp. D2]
 gi|382948727|gb|EFS32364.2| hypothetical protein BSGG_3064 [Bacteroides sp. D2]
          Length = 954

 Score =  273 bits (698), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 231/764 (30%), Positives = 362/764 (47%), Gaps = 119/764 (15%)

Query: 6   KVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSE 61
           K K++D  Y DA LP  ER + L+  MT PE   ++    +G+P  G+P LY       E
Sbjct: 162 KGKVTDRRYMDASLPVEERVESLLAVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVE 218

Query: 62  ALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY 121
           A+HG S+         G+       GAT FP  +   A++N  L +++   +  E  A  
Sbjct: 219 AVHGFSY---------GS-------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA- 261

Query: 122 NLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
           N   A    WSP ++V +D RWGR  ET GEDP +V +    +++G Q            
Sbjct: 262 NTKQA----WSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------ 305

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
           SR L  +   KH+  +         R   D  ++E++M+E  ++PF   +   D  S+M 
Sbjct: 306 SRGLFTTP--KHFGGHGAPL---GGRDSHDIGLSEREMREVHLVPFRHAIRNYDCQSLMM 360

Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
           +Y+   GIP     +LL Q +R +W F+G+IVSDC +I  +     +    K +A  + L
Sbjct: 361 AYSDYMGIPVAKSTELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQAL 420

Query: 302 KAGLDLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
            AG+  +CGD Y N   + A + G+I   ++D   R +   + R   F+ +P  K L   
Sbjct: 421 AAGIATNCGDTYNNKEVIQAAKDGRINMENLDNVCRTMLSTMFRNELFEKNP-CKPLDWK 479

Query: 361 NIC----NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
            I     +  H E+A +AAR+ IV+L+N    LPL T N++T+A++GP A+  +   G+Y
Sbjct: 480 KIYPGWNSDSHKEMARQAARESIVMLENKENLLPL-TKNLRTIAVLGPGADDLQP--GDY 536

Query: 417 --EGTPCRYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV 470
             +  P +  S + G        +KV+ Y  GC D    + + IP A+ AA  +D  V+V
Sbjct: 537 TPKLLPGQLKSVLTGIKEAVGKQTKVL-YEQGC-DFTNPDETNIPKAVKAASQSDVVVMV 594

Query: 471 AGLDLSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDIN 521
            G   + EA         E  D   L+LPG Q EL+  V    K PV L++ +    DI 
Sbjct: 595 LGDCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI- 652

Query: 522 FAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP 581
             K +   K+IL    PG+EGG A+ADV+FG YNPGGRLP+T+            +PL  
Sbjct: 653 -LKASEMCKAILVNWLPGQEGGPAMADVLFGDYNPGGRLPMTFPRH------VGQLPLYY 705

Query: 582 VNNFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
                GR Y++ D     +Y FG+GLSYT F+Y         D+K+ +            
Sbjct: 706 NFKTSGRRYEYVDMEYYPLYRFGFGLSYTSFEYS--------DLKIQE------------ 745

Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGY 698
             KP             +   T Q  V+N+G   G EV  +Y +       T + ++  +
Sbjct: 746 --KP-------------NGNVTVQATVKNIGSRAGDEVAQLYVTDMYASVKTRVMELKDF 790

Query: 699 ERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +R+++  G+S  V F +     + ++++  + ++  G   I VG
Sbjct: 791 DRIYLQPGESKTVSFELTPY-DISLLNDHMDRVVEKGEFKICVG 833


>gi|299149395|ref|ZP_07042452.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298512582|gb|EFI36474.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 950

 Score =  273 bits (698), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 231/764 (30%), Positives = 362/764 (47%), Gaps = 119/764 (15%)

Query: 6   KVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSE 61
           K K++D  Y DA LP  ER + L+  MT PE   ++    +G+P  G+P LY       E
Sbjct: 158 KGKVTDRRYMDASLPVEERVESLLAVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVE 214

Query: 62  ALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY 121
           A+HG S+         G+       GAT FP  +   A++N  L +++   +  E  A  
Sbjct: 215 AVHGFSY---------GS-------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA- 257

Query: 122 NLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
           N   A    WSP ++V +D RWGR  ET GEDP +V +    +++G Q            
Sbjct: 258 NTKQA----WSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------ 301

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
           SR L  +   KH+  +         R   D  ++E++M+E  ++PF   +   D  S+M 
Sbjct: 302 SRGLFTTP--KHFGGHGAPL---GGRDSHDIGLSEREMREVHLVPFRHAIRNYDCQSLMM 356

Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
           +Y+   GIP     +LL Q +R +W F+G+IVSDC +I  +     +    K +A  + L
Sbjct: 357 AYSDYMGIPVAKSTELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQAL 416

Query: 302 KAGLDLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
            AG+  +CGD Y N   + A + G+I   ++D   R +   + R   F+ +P  K L   
Sbjct: 417 AAGIATNCGDTYNNKEVIQAAKDGRINMENLDNVCRTMLSTMFRNELFEKNP-CKPLDWK 475

Query: 361 NIC----NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
            I     +  H E+A +AAR+ IV+L+N    LPL T N++T+A++GP A+  +   G+Y
Sbjct: 476 KIYPGWNSDSHKEMARQAARESIVMLENKENLLPL-TKNLRTIAVLGPGADDLQP--GDY 532

Query: 417 --EGTPCRYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV 470
             +  P +  S + G        +KV+ Y  GC D    + + IP A+ AA  +D  V+V
Sbjct: 533 TPKLLPGQLKSVLTGIKEAVGKQTKVL-YEQGC-DFTNPDETNIPKAVKAASQSDVVVMV 590

Query: 471 AGLDLSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDIN 521
            G   + EA         E  D   L+LPG Q EL+  V    K PV L++ +    DI 
Sbjct: 591 LGDCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI- 648

Query: 522 FAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP 581
             K +   K+IL    PG+EGG A+ADV+FG YNPGGRLP+T+            +PL  
Sbjct: 649 -LKASEMCKAILVNWLPGQEGGPAMADVLFGDYNPGGRLPMTFPRH------VGQLPLYY 701

Query: 582 VNNFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
                GR Y++ D     +Y FG+GLSYT F+Y         D+K+ +            
Sbjct: 702 NFKTSGRRYEYVDMEYYPLYRFGFGLSYTSFEYS--------DLKIQE------------ 741

Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGY 698
             KP             +   T Q  V+N+G   G EV  +Y +       T + ++  +
Sbjct: 742 --KP-------------NGNVTVQATVKNIGSRAGDEVAQLYVTDMYASVKTRVMELKDF 786

Query: 699 ERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +R+++  G+S  V F +     + ++++  + ++  G   I VG
Sbjct: 787 DRIYLQPGESKTVSFELTPY-DISLLNDHMDRVVEKGEFKICVG 829


>gi|423269263|ref|ZP_17248235.1| hypothetical protein HMPREF1079_01317 [Bacteroides fragilis
           CL05T00C42]
 gi|423273173|ref|ZP_17252120.1| hypothetical protein HMPREF1080_00773 [Bacteroides fragilis
           CL05T12C13]
 gi|392701685|gb|EIY94842.1| hypothetical protein HMPREF1079_01317 [Bacteroides fragilis
           CL05T00C42]
 gi|392708205|gb|EIZ01313.1| hypothetical protein HMPREF1080_00773 [Bacteroides fragilis
           CL05T12C13]
          Length = 805

 Score =  273 bits (698), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 235/813 (28%), Positives = 356/813 (43%), Gaps = 153/813 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
           Y +   P   R + L+ +MTL EKV QM      +  LG P+YE                
Sbjct: 40  YENPSAPVEYRVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEDIRLTPQLEKEI 93

Query: 58  ----------------WWSEALH---GVSFIGRRTNSPPG---THFDSEVP--------- 86
                           W    LH     S   R +N        H    +P         
Sbjct: 94  GEYHIGSLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPH 153

Query: 87  -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
                G T FPT I   +++N  L +++G+ ++ EA A           + P +++ RDP
Sbjct: 154 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDP 208

Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
           RW RV ET GEDPY+ G      VRG Q     E   D  S    + A  KH+A+Y    
Sbjct: 209 RWSRVEETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---G 257

Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
           W         + + E++++E    PF   V  G + SVM SYN ++G P      LL   
Sbjct: 258 WTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDI 316

Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
           ++  W F G++VSD  ++  + E     ND   +A  + + AG+D D G + Y    + A
Sbjct: 317 LKDRWQFKGFVVSDLYAVGGLREHGVAGNDY--EAAIKAVNAGVDSDLGTNVYAEQLVAA 374

Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
           V++G +A A ID ++R +  +  ++G FD     +      + + +H  LA E ARQ IV
Sbjct: 375 VKRGDVAVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIV 434

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
           LLKN +  LPL   +I+TLA++GP+A+    M+G+Y     +GT       +    +   
Sbjct: 435 LLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKET 493

Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
            + YA GCA +   + +    AI+ A+NADA V+V G     D S E             
Sbjct: 494 RVLYAKGCA-VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQ 552

Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
                  EG DR  L L G Q EL+ +++   K PV LV++    + +  A    +    
Sbjct: 553 ISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVD 611

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
            W  YPG +GG A+ADV+FG YNP GRL ++      V      +P+       G   ++
Sbjct: 612 AW--YPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTRRKGNRSRY 663

Query: 593 FDGPVV--YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
            + P    YPFGYGLSYT F Y         D+K         +  T G++         
Sbjct: 664 IEEPGTPRYPFGYGLSYTTFSY--------TDMK---------VQVTEGSD--------- 697

Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSA 709
                 D +    + ++N G  DG EV  +Y +    +  T  KQ+  + R+ + AG+S 
Sbjct: 698 ------DCRVDVTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESR 751

Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +V FT++  KSL +       ++  G  TI+VG
Sbjct: 752 EVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 783


>gi|336417087|ref|ZP_08597416.1| hypothetical protein HMPREF1017_04524 [Bacteroides ovatus
           3_8_47FAA]
 gi|335936712|gb|EGM98630.1| hypothetical protein HMPREF1017_04524 [Bacteroides ovatus
           3_8_47FAA]
          Length = 954

 Score =  273 bits (698), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 231/764 (30%), Positives = 362/764 (47%), Gaps = 119/764 (15%)

Query: 6   KVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSE 61
           K K++D  Y DA LP  ER + L+  MT PE   ++    +G+P  G+P LY       E
Sbjct: 162 KGKVTDRRYMDASLPVEERVESLLAVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVE 218

Query: 62  ALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY 121
           A+HG S+         G+       GAT FP  +   A++N  L +++   +  E  A  
Sbjct: 219 AVHGFSY---------GS-------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA- 261

Query: 122 NLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
           N   A    WSP ++V +D RWGR  ET GEDP +V +    +++G Q            
Sbjct: 262 NTKQA----WSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------ 305

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
           SR L  +   KH+  +         R   D  ++E++M+E  ++PF   +   D  S+M 
Sbjct: 306 SRGLFTTP--KHFGGHGAPL---GGRDSHDIGLSEREMREVHLVPFRHAIRNYDCQSLMM 360

Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
           +Y+   GIP     +LL Q +R +W F+G+IVSDC +I  +     +    K +A  + L
Sbjct: 361 AYSDYMGIPVAKSTELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQAL 420

Query: 302 KAGLDLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
            AG+  +CGD Y N   + A + G+I   ++D   R +   + R   F+ +P  K L   
Sbjct: 421 AAGIATNCGDTYNNKEVIQAAKDGRIDMENLDNVCRTMLSTMFRNELFEKNP-CKPLDWK 479

Query: 361 NIC----NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
            I     +  H E+A +AAR+ IV+L+N    LPL T N++T+A++GP A+  +   G+Y
Sbjct: 480 KIYPGWNSDSHKEMARQAARESIVMLENKENLLPL-TKNLRTIAVLGPGADDLQP--GDY 536

Query: 417 --EGTPCRYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV 470
             +  P +  S + G        +KV+ Y  GC D    + + IP A+ AA  +D  V+V
Sbjct: 537 TPKLLPGQLKSVLTGIKEAVGKQTKVL-YEQGC-DFTNPDETNIPKAVKAASQSDVVVMV 594

Query: 471 AGLDLSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDIN 521
            G   + EA         E  D   L+LPG Q EL+  V    K PV L++ +    DI 
Sbjct: 595 LGDCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI- 652

Query: 522 FAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP 581
             K +   K+IL    PG+EGG A+ADV+FG YNPGGRLP+T+            +PL  
Sbjct: 653 -LKASEMCKAILVNWLPGQEGGPAMADVLFGDYNPGGRLPMTFPRH------VGQLPLYY 705

Query: 582 VNNFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
                GR Y++ D     +Y FG+GLSYT F+Y         D+K+ +            
Sbjct: 706 NFKTSGRRYEYVDMEYYPLYRFGFGLSYTSFEYS--------DLKIQE------------ 745

Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGY 698
             KP             +   T Q  V+N+G   G EV  +Y +       T + ++  +
Sbjct: 746 --KP-------------NGNVTVQATVKNIGSRAGDEVAQLYVTDMYASVKTRVMELKDF 790

Query: 699 ERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +R+++  G+S  V F +     + ++++  + ++  G   I VG
Sbjct: 791 DRIYLQPGESKTVSFELTPY-DISLLNDHMDRVVEKGEFKICVG 833


>gi|253574420|ref|ZP_04851761.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
 gi|251846125|gb|EES74132.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
          Length = 782

 Score =  273 bits (697), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 235/813 (28%), Positives = 375/813 (46%), Gaps = 144/813 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQM-------------GDLA--------------- 45
           Y D+  P PER + L+  MTL EK  Q+             G++                
Sbjct: 20  YKDSSKPIPERVEHLLGLMTLEEKAGQLVQPFGWQTYEHKDGEIKLTEAFKAQVKNGGVG 79

Query: 46  --YGV----PRLGLPLYEWWS--EALHGVSFIGRRT--NSPPG---------THFDSEVP 86
             YGV    P  G+ L    S  E    V+ I R    NS  G         +H    + 
Sbjct: 80  SLYGVLRADPWTGVTLETGLSPREGTEAVNAIQRYAIENSRLGIPILIGEECSHGHMAI- 138

Query: 87  GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
           GAT FP  +   +++N  L++++ + V+ E RA       G   +SP ++VVRDPRWGR 
Sbjct: 139 GATVFPVPLSLGSTWNVELYREMCRAVARETRA-----QGGAVTYSPVLDVVRDPRWGRT 193

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
            E  GED Y++   A+  V GLQ     E     DS    ++A  KH+  Y   + EG  
Sbjct: 194 EECFGEDAYLISEMAVASVEGLQG----ESLDGEDS----VAATLKHFVGYG--SSEGG- 242

Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
           R      +  +++ E  +LPF   V  G  +S+M +YN ++G+P   + +LL+  +RG+W
Sbjct: 243 RNAGPVHMGRRELLEVDLLPFRKAVEAG-AASIMPAYNEIDGVPCTTNEELLDGVLRGEW 301

Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQGK 325
            F G +++DC +I  +   H    D + DA  + ++AG+D++  G  +    + AV+ G+
Sbjct: 302 GFDGMVITDCGAIDMLASGHDVAEDGR-DAAIQAIRAGIDMEMSGVMFGKHLVEAVRSGQ 360

Query: 326 IAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKND 385
           + E  +D ++R +  +  RLG F+         +  I + +H+ELA + A +G+VLLKN 
Sbjct: 361 LEEEVLDRAVRRVLTLKFRLGLFERPYADPERAERVIGSAEHVELARQLASEGVVLLKNK 420

Query: 386 NGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR--YTSPMDGFYA----YSKVINY 439
           +G LPL + +  T+A++GP+A+A    +G+Y     R   T+ + G  +      + + Y
Sbjct: 421 DGVLPL-SADAGTIAVIGPNADAGYNQLGDYTSPQPRSKVTTVLGGIRSKLAETPERVLY 479

Query: 440 APGCADIVCQNNSMIPAAIDAAKNADATVIVAG-----------LDLSVEA--------- 479
           APGC  I   +      A+  A+ AD  V+V G           +DL   A         
Sbjct: 480 APGCR-INGNSREGFDVALSCAEKADTVVMVVGGSSARDFGEGTIDLRTGASKVTDNAES 538

Query: 480 -----EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
                EG DR++L L G Q ELI ++    K P+ +V ++   +   +   +    +IL 
Sbjct: 539 DMDCGEGIDRMNLSLSGVQLELIQEIHKLGK-PLVVVYINGRPIAEPWIDEHA--DAILE 595

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
             YPG+EGG AIAD++FG  NP GRL I+       V + Y     R      G+ Y   
Sbjct: 596 AWYPGQEGGHAIADILFGDVNPSGRLTISIPKHVGQVPVYYHGKRSR------GKRYLEG 649

Query: 594 DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
           D    YPFGYGLSYT+F Y         ++KL+ D     IN                  
Sbjct: 650 DSQPRYPFGYGLSYTEFTYN--------NLKLESDT----IN------------------ 679

Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVG 712
             KD      +EV N+G+  G+EV+ +Y        T   K++ G+ ++F+  G++  V 
Sbjct: 680 --KDGSTKVTVEVTNVGERAGAEVIQLYITDVASKVTRPAKELKGFRKIFLQPGETQTVE 737

Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
           FT+   + L+ +      ++  G   + VG+ V
Sbjct: 738 FTVGP-EQLQYIGQNYKPVVEPGEFRVHVGKNV 769


>gi|410634080|ref|ZP_11344720.1| beta-glucosidase [Glaciecola arctica BSs20135]
 gi|410146740|dbj|GAC21587.1| beta-glucosidase [Glaciecola arctica BSs20135]
          Length = 772

 Score =  273 bits (697), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 201/606 (33%), Positives = 312/606 (51%), Gaps = 72/606 (11%)

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           ++P ++V RDPRWGR+ E  GED Y+    A   V+G Q         D  S+P  I A 
Sbjct: 176 FAPMVDVARDPRWGRISEGSGEDVYLTTAIARARVQGFQG--------DDLSQPHTILAT 227

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
            KH+AAY      G D    D  ++++++++T++ PF+  V+ G V+S M S+N +NG+P
Sbjct: 228 AKHFAAYG-QGQAGRDYHTTD--MSDRELRDTYLPPFKAAVDAG-VTSFMTSFNELNGVP 283

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC- 309
             A+  LL   +R +W+F G++V+D  SI  +V+ H F  D  + A    +KAG+D+D  
Sbjct: 284 ASANKYLLTDILRDEWSFEGFVVTDYTSINEMVK-HGFARDN-DHAGELAVKAGVDMDMQ 341

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK--NNICNPQH 367
           G  Y ++    V QGK++   ID + R +  +  RLG F+   +Y N  +    I    +
Sbjct: 342 GSVYFDYLANQVTQGKVSPQQIDNAARRILEMKYRLGLFEDPYRYSNEEREAQEIYKEYN 401

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP- 426
           ++ A + AR+ +VLLKN+N  LPL+  ++ T+A++GP A++ + +IG++     RY  P 
Sbjct: 402 LQAAQDVARKSMVLLKNENQQLPLSKSDL-TIAVIGPLADSKEDLIGSWSAAGDRYEKPI 460

Query: 427 --MDGFYAY----SKVINYAPGCA-DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
             + G  A     SKV+ YA G + +   Q+NS   AAI  AK AD  V+  G    +  
Sbjct: 461 TLLTGIKAKVADPSKVL-YAKGASYEFSHQDNSGFEAAIAIAKKADVIVLAMGEKWDMTG 519

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
           E   R  L  PG Q  L+ ++   AK P+ LV+M+   + I +A  N  + +IL   YPG
Sbjct: 520 EATSRTSLDFPGNQLALMQQLKKLAK-PMVLVLMNGRPMTIEWADQN--VDAILEAWYPG 576

Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPY------TSMPLRPVNNFPGRTYKFF 593
             GG AIADV+FG YNP G+LP+T +  N  +IP       T  P    N       ++ 
Sbjct: 577 TMGGPAIADVLFGDYNPSGKLPVT-FPRNVGQIPLYYNMKNTGRPYSKDNAEQKYVSRYI 635

Query: 594 DG--PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID 651
           D     +Y FG+GLSYT F Y   S  K+V                              
Sbjct: 636 DSLNTPLYHFGHGLSYTTFDYSKISLNKAV------------------------------ 665

Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAK 710
            +  K+ K T  I+V N G  DG EVV +Y +   G     +KQ+ G++++F+  G++  
Sbjct: 666 -ITAKE-KLTASIDVTNSGNYDGEEVVQLYIRDRIGSVTRPVKQLKGFKKIFLHKGETKT 723

Query: 711 VGFTMN 716
           V F+++
Sbjct: 724 VSFSIS 729


>gi|409197445|ref|ZP_11226108.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
           21150]
          Length = 737

 Score =  272 bits (696), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 211/727 (29%), Positives = 349/727 (48%), Gaps = 99/727 (13%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGL---PLYEWWSEALHGV 66
           + +P+ +A L    R  DL+ RMTL EKV  +      VPRLG+   P  E      HGV
Sbjct: 38  TSYPFQNADLDMETRVDDLLSRMTLEEKVSALSTDP-SVPRLGIKGAPHIE----GYHGV 92

Query: 67  SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN---L 123
           +  G    +P G   D  VP  T FP      A++N  L +K G+  S EAR ++    +
Sbjct: 93  AMGGPANWAPKG---DERVP-TTQFPQAYGMGATWNPELIRKAGEIESIEARYIFQNPEI 148

Query: 124 GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
              GL   +PN ++ RDPRWGR  E  GEDP++VG  +  + +GLQ           D +
Sbjct: 149 SKGGLVVRAPNADLGRDPRWGRTEEVLGEDPFLVGTLSTAFTKGLQ---------GDDEK 199

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
             + ++  KH+ A   +N   +   +FD+++      E +   F   + EG  ++ M +Y
Sbjct: 200 YWRTASLLKHFLANSNENTRDSSSSNFDTQL----FYEYYGATFRRAILEGGSNAYMTAY 255

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N VNG+P    P +  +     W  +G I +D      +V +HK  +D    A   V+KA
Sbjct: 256 NAVNGVPAHIHP-MHKEISMARWGVNGIICTDGGGYTLLVRAHKAYDDYYR-AAEGVIKA 313

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGK 359
           GL+    D Y     GA+  G +AE D+D  L+ +Y V+++LG  D  PQ    Y ++G+
Sbjct: 314 GLN-QFLDNYREGVWGALAHGYLAEEDLDEVLKGVYRVMIKLGQLD--PQDKVPYASIGR 370

Query: 360 NN----ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGN 415
           +       +P+H E A + AR+ +VLLKN+   LPL    +  +A++G  A+    ++  
Sbjct: 371 DGKPAPWTSPEHQEAALQMARESVVLLKNEKQTLPLAGDELGKVAVIGHLADTI--LLDW 428

Query: 416 YEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL 475
           Y G P   ++P+DG       I    G   ++   ++   AA++AA  AD  ++V G   
Sbjct: 429 YSGMPPFMSTPLDG-------IKEKMGADKVLFAPDNDYNAAVEAASQADVAIVVLGNHP 481

Query: 476 SVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
             ++E          G++ VD        E + +    A     LV+ S+    IN+++ 
Sbjct: 482 YCDSERWGDCPDPGMGREAVDRKTLRLTDEWLAQRVFEANPNTILVLQSSFPYGINWSQE 541

Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
           N  + +I+ + + G+  G A+ADV+FG YNPGG+L  TW ++           +R     
Sbjct: 542 N--LPAIVHITHNGQSTGTALADVLFGDYNPGGKLTQTWPKSEEQLPDMMEYDIR----- 594

Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
            G TY +F+G  +YPFG+GLSYT F++        VD+++             G++    
Sbjct: 595 KGHTYMYFNGEPLYPFGFGLSYTSFEW--------VDMEI------------TGSS---- 630

Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI-KQVIGYERVFIA 704
                  VK  + +    ++++N+G++ G EV+ +Y+  P  +     K + G++RV + 
Sbjct: 631 -------VKSNEEEVIVTVKLKNVGQVKGDEVIQLYASFPETSSRRPDKALKGFKRVTLE 683

Query: 705 AGQSAKV 711
            G+S  V
Sbjct: 684 PGESKNV 690


>gi|397691065|ref|YP_006528319.1| glycoside hydrolase family 3 protein [Melioribacter roseus P3M]
 gi|395812557|gb|AFN75306.1| glycoside hydrolase family 3 protein [Melioribacter roseus P3M]
          Length = 769

 Score =  272 bits (696), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 216/697 (30%), Positives = 340/697 (48%), Gaps = 110/697 (15%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+P+  +  E LHG++                    ATS+P  I   A+FN  L +KI
Sbjct: 110 RLGIPVI-FHEECLHGLA-----------------AKDATSYPVPIGLAATFNPELIEKI 151

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
              ++ +AR+            +P ++VVRDPRWGRV ET GED Y+V +  I  V+GLQ
Sbjct: 152 FSAIAEDARS-----RGAHQALTPVVDVVRDPRWGRVEETFGEDTYLVSQMGIASVKGLQ 206

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
               +  +        K+ A  KH+AA+       N      +  +E+ +++TF++PF+ 
Sbjct: 207 GDGSLNNNN-------KVIATLKHFAAHGQPESGTN---CAPANFSERFLRDTFLMPFKE 256

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            +++  V SVM SYN ++GIP+ A+  LL + +R +WNF G++VSD  +I  +    + +
Sbjct: 257 AIDKAGVISVMASYNEIDGIPSHANKWLLRKVLRDEWNFKGFVVSDYYAITELFHKEETV 316

Query: 290 ND----TKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLM 343
           +      K +A    L+AG++++    D Y N T   V+ G   E+DID  +  +     
Sbjct: 317 SHGVAANKVEAAKLALEAGVNIEFPNPDCYPNLTE-MVKGGLADESDIDALVLPMLKYKF 375

Query: 344 RLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG 403
            LG FD        G+      Q  ELA +AAR+ I LLKN+   LPL   + K +A++G
Sbjct: 376 ELGLFDNPYVEAEPGQFENKLEQDRELALQAARETITLLKNEGNLLPLK--DFKKIAVIG 433

Query: 404 PHANATKAMIGNYEGTPCRYTSPMDGF---YAYSKVINYAPGCADIV------------- 447
           P  NA + ++G Y GTP  YTS   G       +  + Y+ GC   V             
Sbjct: 434 P--NADRTLLGGYHGTPKYYTSVYQGIKDKVGKNGEVFYSEGCKITVGGSWNDDEVILPD 491

Query: 448 -CQNNSMIPAAIDAAKNADATVIVAG--LDLSVEAEGK----DRVDLLLPGFQTELINKV 500
             ++  +I  A+  A+ +D  V+V G     S EA  K    DR  L L G Q +L+ ++
Sbjct: 492 PAEDEKLINEAVAVAQKSDVAVLVLGGNEQTSREAWNKKHLGDRPSLELVGRQNKLVEEI 551

Query: 501 ADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRL 560
               K PV +++ +     I F K+N  + +IL   Y G+E GRA+ADV+FG YNP G+L
Sbjct: 552 LKTGK-PVVVLLFNGRPNSIGFIKDN--VPAILECWYLGQETGRAVADVLFGDYNPSGKL 608

Query: 561 PITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPK 619
           P++    A ++   Y+  P         R Y F D   ++ FGYGLSYT+F +       
Sbjct: 609 PVSIPRSAGHIPAHYSHKP------SARRGYLFDDVSPLFAFGYGLSYTKFSFD------ 656

Query: 620 SVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVM 679
             +++L KD        T+                  D K +  IEV+N G + G EVV 
Sbjct: 657 --NLRLSKD--------TISA----------------DEKVSVSIEVKNEGAIAGEEVVQ 690

Query: 680 VYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTM 715
           +Y +    + T  +K++ G+ ++ +A GQ++ V F +
Sbjct: 691 LYIRDKVSSVTRPVKELKGFRKITLAPGQTSTVVFEL 727


>gi|316980598|dbj|BAJ51947.1| putative beta-D-xylosidase [Glycyrrhiza uralensis]
          Length = 285

 Score =  272 bits (696), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 133/290 (45%), Positives = 188/290 (64%), Gaps = 9/290 (3%)

Query: 472 GLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKS 531
           GLD S+EAE +DRV LLLPG Q EL+++VA  A+GPV LV+MS G +D++FAKN+PKI +
Sbjct: 2   GLDQSIEAEFRDRVGLLLPGHQQELVSRVARVARGPVILVLMSGGPIDVSFAKNDPKISA 61

Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGR 588
           ILWVGYPG+ GG AIADVIFG  NPGGRLP+TWY  NY+ K+P T+M +R  P   +PGR
Sbjct: 62  ILWVGYPGQAGGTAIADVIFGTTNPGGRLPMTWYPQNYLAKVPMTNMDMRPNPATGYPGR 121

Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
           TY+F+ GPVV+PFG+GLSYT+F + +A +PK V +     Q     N TV T+K    AV
Sbjct: 122 TYRFYKGPVVFPFGHGLSYTRFTHSLAIAPKQVSVPFATLQAF--TNSTVSTSK----AV 175

Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQS 708
            +    C   +  F ++V+N G MDG+  ++V+SKPP    +  KQ++ + + ++ AG  
Sbjct: 176 RVSHANCDAMEVGFHVDVKNEGSMDGTNTLLVFSKPPPGKWSATKQLVSFHKTYVPAGSK 235

Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLNH 758
            +V   ++ CK L +VD      +  G H + +G+    +S   Q  + H
Sbjct: 236 QRVKVGVHVCKHLSVVDEFGIRRIPMGEHELQIGDLKHSISVQTQEEIKH 285


>gi|298387490|ref|ZP_06997042.1| beta-glucosidase [Bacteroides sp. 1_1_14]
 gi|298259697|gb|EFI02569.1| beta-glucosidase [Bacteroides sp. 1_1_14]
          Length = 853

 Score =  272 bits (696), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 160/421 (38%), Positives = 231/421 (54%), Gaps = 45/421 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y +   P  ER  DL+ R+T+ EK+  +   + G+PRLG+  Y   +EALHGV   GR  
Sbjct: 30  YKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-- 87

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
                          T FP  I   A++N  L K++   +S EARA +N  + G      
Sbjct: 88  --------------FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQ 133

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFWSP +N+ RDPRWGR  ET GEDP++ G     +V GLQ           D  
Sbjct: 134 FSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQG---------DDPH 184

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LKI +  KH+AA    N E ++RF  + +++E+ ++E +   FEMCV EG  +S+M +Y
Sbjct: 185 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAY 240

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N +N +P   +  LL + +R DW F GY+VSDC     +V +HK++  TKE A    +KA
Sbjct: 241 NALNDVPCTLNSWLLQKVLRQDWGFQGYVVSDCGGPALLVNAHKYVK-TKEAAATLSIKA 299

Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           GLDL+CG D Y    + A +Q  +++ADID++   +    M+LG FD   +  Y  +  +
Sbjct: 300 GLDLECGDDVYDGPLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDSGERNPYTKISPS 359

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            I + +H ++A +AARQ IVLLKN    LPLN   +K++A+VG   NA K   G+Y G P
Sbjct: 360 VIGSKEHQQIALDAARQCIVLLKNQKNRLPLNADKLKSIAVVG--INAGKCEFGDYSGAP 417

Query: 421 C 421
            
Sbjct: 418 V 418



 Score =  133 bits (335), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 92/300 (30%), Positives = 152/300 (50%), Gaps = 49/300 (16%)

Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-V 518
           A +  +  V V G++ S+E EG+DR D+ LP  Q E + ++      P  +V++ AG+ +
Sbjct: 600 AVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAGSSL 657

Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
            +N+   +  + +I+   YPGE+GG A+A+V+FG YNP GRLP+T+Y++   ++P    P
Sbjct: 658 AVNWMDEH--VPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS-LDELP----P 710

Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV 638
               +   GRTYK+F G V+YPFGYGLSY+ F Y               D Q +D     
Sbjct: 711 FDDYDITKGRTYKYFKGDVLYPFGYGLSYSSFTY--------------SDLQVKDGG--- 753

Query: 639 GTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG-THIKQVIG 697
                       D+V       T    ++N GK +G EV  VY + P   G   +K++ G
Sbjct: 754 ------------DEV-------TVSFRLKNTGKRNGDEVAQVYVRIPETGGIVPLKELKG 794

Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVD-NAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
           + RV + +G+S +V   ++  + L+  D      ++  GA  ++VG     +     ++L
Sbjct: 795 FRRVPLKSGESRRVEIKLDK-EQLRYWDVEKGQFVVPKGAFDVMVGASSKDIRLQTVIDL 853


>gi|410097219|ref|ZP_11292201.1| hypothetical protein HMPREF1076_01379 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409224537|gb|EKN17469.1| hypothetical protein HMPREF1076_01379 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 805

 Score =  272 bits (696), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 225/772 (29%), Positives = 359/772 (46%), Gaps = 136/772 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDL-AYG-VPRLGLPLYEW------------- 58
           Y D   P  +R +DL+++MT+ EK  Q+G +  YG V +  LP  EW             
Sbjct: 61  YEDLSQPIDKRVEDLLKQMTVEEKTCQLGTIYGYGAVLKDTLPTDEWKTRIWKDGIGNID 120

Query: 59  ------W------------SEALHGVS--FIGRRTNSPPGTHFDSEVPG-----ATSFPT 93
                 W            +EA++ V   F+       P    +  + G     +T FP 
Sbjct: 121 EHLNGEWKRTSLDFPYSNHAEAMNKVQAFFVEETRLGIPADLTNEGIRGLKHEKSTFFPA 180

Query: 94  VILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETPGE 152
            I    ++++ L  +IG+    EA+A+      G T  +SP +++ RDPRWGR +E+ GE
Sbjct: 181 QIGQGCTWDKELIYEIGRITGEEAKAL------GYTNIYSPILDLSRDPRWGRTVESYGE 234

Query: 153 DPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDS 212
           D Y+ G        G Q V G++ +R        + +  KH+A Y +     +     D 
Sbjct: 235 DSYLAGEL------GRQQVLGIQSNR--------VVSTPKHFAIYGIPGGGRDCYSRTDP 280

Query: 213 RVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYI 272
             + Q++ E  + PF +   E      MCS+N  NG P  A   L+ + +R  W F GY+
Sbjct: 281 HASPQEVHELHLEPFRIAFQEAGALGTMCSHNDYNGTPVSASHYLMTELLRNQWGFKGYV 340

Query: 273 VSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL----DCGDYYTNFTMGAVQQGKIAE 328
           VSD  +I   V+ +  + DT+E+AVA  L AGL++    +  + +      A+Q+G + E
Sbjct: 341 VSDSWAIDKNVKFYHIV-DTEEEAVASELNAGLNVRTFFEQSEVFIEALRRALQKGLVEE 399

Query: 329 ADIDTSLRFLYIVLMRLGYFDGSPQYKN--LGKNNICNPQHIELAAEAARQGIVLLKNDN 386
           + +D  +R +  V   LG FD  P  K+  L    + + ++ E++  AAR+ IVLLKN+N
Sbjct: 400 STLDQRVREVLYVKFWLGLFD-DPYVKDTKLADKIVNSDKNREVSLRAARESIVLLKNEN 458

Query: 387 GALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG---FYAYSKVINYAPGC 443
             LPL +  +K +A++GP A+  K++   Y        + + G       +  + YA GC
Sbjct: 459 NTLPL-SKTLKNIAVIGPQADEVKSLTSRYGSHNPNVITGLQGLKNLLGENVNLMYAKGC 517

Query: 444 ---------ADIVC-----QNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLL 489
                    +D++      +    I  A++ AK A+  +I  G D     E + RV+L L
Sbjct: 518 NVRDKNFPQSDVMYFELSDKEKEEIDEAVEIAKKAEVAIIYVGDDFRTIGESRSRVNLDL 577

Query: 490 PGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADV 549
            G Q EL+  V  A   PV LV+ +   V +N+   N  + +I+   YPGE  G+A+A+V
Sbjct: 578 SGRQKELVRAV-QATGTPVVLVLFNGRPVTLNWEDAN--LPAIVEAWYPGEFSGQAVAEV 634

Query: 550 IFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
           +FG YNPGG+L  T +  +  +IP+ + P +P  N  G+ +   DG  +YPFGYGLSYT 
Sbjct: 635 LFGDYNPGGKLSTT-FPKSVGQIPW-AFPFKP--NATGKGFARVDGE-LYPFGYGLSYTT 689

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID----DVKCKDYKFTFQIE 665
           F+                            +N  P A  + D     V CK         
Sbjct: 690 FEI---------------------------SNLQPSATKIADGDTLTVTCK--------- 713

Query: 666 VENMGKMDGSEVVMVYSKPPGIAGTHI-KQVIGYERVFIAAGQSAKVGFTMN 716
           V+N G + G EVV +Y      + +   K++ G+ERV +  G+   V F +N
Sbjct: 714 VKNTGSVKGDEVVQLYLNDETSSISRFEKELCGFERVALEPGEEKTVTFKVN 765


>gi|237721943|ref|ZP_04552424.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
 gi|229448812|gb|EEO54603.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
          Length = 792

 Score =  272 bits (696), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 231/800 (28%), Positives = 362/800 (45%), Gaps = 141/800 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
           Y D   P   R  DL+ +MTL EK  QM  L YG  R+     P   W    W + +   
Sbjct: 48  YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDACPTAGWLAEIWKDGIGNI 106

Query: 64  ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
               +G+   G   + P                       P    +  + G     AT F
Sbjct: 107 DEQANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMF 166

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
           P      A++N+ L ++I +  + EA+A+      G T  +SP +++ +DPRWGRV+E+ 
Sbjct: 167 PAQCGQGATWNKKLIREIAKVTANEAKAL------GYTNIYSPILDIAQDPRWGRVVESY 220

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+ G      + GLQ+ EG             I A  KH+A Y +     +     
Sbjct: 221 GEDPYLAGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 266

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
           D  V  ++M+  ++ PF   + E     VM SYN  +G P       L + +R  W F G
Sbjct: 267 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 326

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
           Y+VSD ++++ +   H+ +  T+E+  A+V+ AGL++      TNFT           A+
Sbjct: 327 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAI 380

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
            +GK++   +D  +  +  V   +G FD   P      +  + N  H  ++ +AA + +V
Sbjct: 381 DEGKVSLHTLDQRVSEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESVV 440

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
           LLKN+N  LPL+  N K +A++GP+A   K +   Y        +   G   Y  +  + 
Sbjct: 441 LLKNENQMLPLSK-NFKKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVR 499

Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
           YA GC DI+                Q  +MI  A++ AK +D  ++V G +     E   
Sbjct: 500 YAKGC-DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFS 558

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R +L L G Q +L+  V    K PV LV++   A  IN+A  N  I +I+   +PGE  G
Sbjct: 559 RTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYIPAIIHAWFPGEFMG 615

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
            AIA V+FG YNPGGRL +T +  +  +IP+ + P +P ++  G+     DG  +YPFGY
Sbjct: 616 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKGKVR--VDG-ALYPFGY 670

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           GLSYT F Y         D+K+ K          +G  +                  T  
Sbjct: 671 GLSYTTFGYS--------DLKISKP--------VIGPQE----------------NITLS 698

Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
             V+N GK  G EVV +Y +    + T   +V+ G+ER+ +  G+   V FT+   + L 
Sbjct: 699 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVSFTLTP-QDLG 757

Query: 723 IVDNAANSLLASGAHTILVG 742
           + D      +  G+ +++VG
Sbjct: 758 LWDKNNRFTVEPGSFSVMVG 777


>gi|423248809|ref|ZP_17229825.1| hypothetical protein HMPREF1066_00835 [Bacteroides fragilis
           CL03T00C08]
 gi|423253758|ref|ZP_17234689.1| hypothetical protein HMPREF1067_01333 [Bacteroides fragilis
           CL03T12C07]
 gi|392655387|gb|EIY49030.1| hypothetical protein HMPREF1067_01333 [Bacteroides fragilis
           CL03T12C07]
 gi|392657750|gb|EIY51381.1| hypothetical protein HMPREF1066_00835 [Bacteroides fragilis
           CL03T00C08]
          Length = 805

 Score =  272 bits (696), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 235/813 (28%), Positives = 356/813 (43%), Gaps = 153/813 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
           Y +   P   R + L+ +MTL EKV QM      +  LG P+YE                
Sbjct: 40  YENPSAPVEYRVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEDIRLTPQLEKEI 93

Query: 58  ----------------WWSEALH---GVSFIGRRTNSPPG---THFDSEVP--------- 86
                           W    LH     S   R +N        H    +P         
Sbjct: 94  GEYHIGSLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPH 153

Query: 87  -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
                G T FPT I   +++N  L +++G+ ++ EA A           + P +++ RDP
Sbjct: 154 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDP 208

Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
           RW RV ET GEDPY+ G      VRG Q     E   D  S    + A  KH+A+Y    
Sbjct: 209 RWSRVEETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---G 257

Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
           W         + + E++++E    PF   V  G + SVM SYN ++G P      LL   
Sbjct: 258 WTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDI 316

Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
           ++  W F G++VSD  ++  + E     ND   +A  + + AG+D D G + Y    + A
Sbjct: 317 LKDRWQFKGFVVSDLYAVGGLREHGVAGNDY--EAAIKAVNAGVDSDLGTNVYAEQLVAA 374

Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
           V++G +A A ID ++R +  +  ++G FD     +      + + +H  LA E ARQ IV
Sbjct: 375 VKRGDVAVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIV 434

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
           LLKN +  LPL   +I+TLA++GP+A+    M+G+Y     +GT       +    +   
Sbjct: 435 LLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKET 493

Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
            + YA GCA +   + +    AI+ A+NADA V+V G     D S E             
Sbjct: 494 RVLYAKGCA-VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQ 552

Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
                  EG DR  L L G Q EL+ +++   K PV LV++    + +  A    +    
Sbjct: 553 ISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVD 611

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
            W  YPG +GG A+ADV+FG YNP GRL ++      V      +P+       G   ++
Sbjct: 612 AW--YPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTRRKGNRSRY 663

Query: 593 FDGPVV--YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
            + P    YPFGYGLSYT F Y         D+K         +  T G++         
Sbjct: 664 VEEPGTPRYPFGYGLSYTTFSY--------TDMK---------VQVTEGSD--------- 697

Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSA 709
                 D +    + ++N G  DG EV  +Y +    +  T  KQ+  + R+ + AG+S 
Sbjct: 698 ------DCRVDVTVTIQNQGTADGDEVAQLYFQDDVSSFTTPAKQLRAFSRIHLKAGESR 751

Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +V FT++  KSL +       ++  G  TI+VG
Sbjct: 752 EVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 783


>gi|336412663|ref|ZP_08593016.1| hypothetical protein HMPREF1017_00124 [Bacteroides ovatus
           3_8_47FAA]
 gi|335942709|gb|EGN04551.1| hypothetical protein HMPREF1017_00124 [Bacteroides ovatus
           3_8_47FAA]
          Length = 735

 Score =  272 bits (695), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 213/760 (28%), Positives = 356/760 (46%), Gaps = 97/760 (12%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLYEW 58
           Y D K P  +R  DL+ RMTL EKV Q+     G              VP  +G  +Y  
Sbjct: 30  YKDPKAPIEKRVNDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGEEVKKVPAEIGSLIYFE 89

Query: 59  WSEALHGV----SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
            + AL       +    R   P    +D+     T +P  +    S+N  L ++     +
Sbjct: 90  TNPALRNSMQKKAMEKSRLGIPIIFGYDAIHGFRTVYPISLAQACSWNPDLVEQACAVSA 149

Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
            EAR    +     TF SP I+V RDPRWGRV E  GEDPY  G +    V+G       
Sbjct: 150 QEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFGAASVKG------- 197

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
            Y  D  S   +++AC KHY  Y      G D  +  + +++Q + +T++LP+EM V  G
Sbjct: 198 -YQGDDLSAENRMAACLKHYVGYGASE-AGRDYVY--TEISKQTLWDTYLLPYEMGVKAG 253

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
             +++M S+N ++G+P  A+P ++ + ++  W   G+IVSD  +I+ +   ++ L  TK+
Sbjct: 254 -AATLMSSFNDISGVPGSANPYIMTEILKKRWGHDGFIVSDWGAIEQL--KNQGLAATKK 310

Query: 295 DAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
           +A      AGL++D   + Y       V++G+++ A +D ++R + ++  RLG F+    
Sbjct: 311 EAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLFERPYT 370

Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
                K     PQ +++AA  A + +VLLKN+N  LPL   + K +A++GP A     ++
Sbjct: 371 PATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLT--DKKKIAVIGPMAKNGWDLL 428

Query: 414 GNY--EGTPCRYTSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
           G++   G         +G    +A    + YA GCA     N      A++AA+ +D  V
Sbjct: 429 GSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGCA-TKGDNREGFAEALEAARWSDVVV 487

Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
           +  G  ++   E   R  + LP  Q EL  ++  A K P+ LV+++   +++N  +  P 
Sbjct: 488 LCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPLELN--RLEPI 544

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS--MPLRPVNNFP 586
             +IL +  PG  G   +A ++ G+ NP G+L +T+        PY++  +P+       
Sbjct: 545 SDAILEIWQPGVNGALPMAGILSGRINPSGKLAMTF--------PYSTGQIPIYYNRRKS 596

Query: 587 GRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
           GR ++ F   +    +YPFG+GLSYT+FKY                          GT  
Sbjct: 597 GRGHQGFYKDITSDPLYPFGHGLSYTEFKY--------------------------GTVT 630

Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERV 701
           P    V   D      + + ++ V N+G  DG+E V  +   P  + T  +K++  +E+ 
Sbjct: 631 PSATKVKRGD------RLSVEVTVTNVGARDGAETVHWFISDPYCSITRPVKELKHFEKQ 684

Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
            I AG++    F ++  +    V+      L +G + ILV
Sbjct: 685 LIKAGETKTFRFDIDMERDFGFVNEDGKRFLEAGEYHILV 724


>gi|383302737|gb|AFH08276.1| hypothetical protein [uncultured bacterium]
          Length = 768

 Score =  272 bits (695), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 229/790 (28%), Positives = 368/790 (46%), Gaps = 125/790 (15%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQM------------------GDLAYGVPRLGLP- 54
           Y D      ER +DL+ RMTL EKV QM                  GDL     +   P 
Sbjct: 27  YKDPSASVSERVEDLLSRMTLEEKVGQMNQFVGIEHIKANSAVLTEGDLFNNTAQAFYPG 86

Query: 55  -----LYEWWSEALHG----------VSFIGRRTNSP----------PGTHFDSEVPGAT 89
                +  W  E L G           + + R   S              H ++  P  T
Sbjct: 87  ITGDTVIRWTREGLVGSFLHVLTIEEANMLQRHAMSSRLAIPILFGIDAIHGNANAPDNT 146

Query: 90  SFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLET 149
            +PT I   +SF+  +  KI +  + E RAM    N   TF +PN++VVRDPRWGRV ET
Sbjct: 147 VYPTNIGLASSFDPEMAYKIARQTAAEMRAM----NLHWTF-NPNVDVVRDPRWGRVGET 201

Query: 150 PGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFH 209
            GEDPY++       V G + V+G +   D+   P  + AC KH+       +  N    
Sbjct: 202 FGEDPYLIS------VLGAESVKGYQGTLDT---PNDVLACIKHFVG---GGFPANGTNG 249

Query: 210 FDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFH 269
             + V+E+ ++E  + PFE  V  G   S+M S+N VNGIP  ++  L+   +RG+W F 
Sbjct: 250 SPTDVSERTLREVLLPPFEAGVEAG-AGSLMTSHNEVNGIPAHSNEWLMRDVLRGEWGFK 308

Query: 270 GYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC-GDYYTNFTMGAVQQGKIAE 328
           G++VSD   I+ I + H+   + KE A  + + AG+D+   G Y+       V++G+I E
Sbjct: 309 GFVVSDWMDIEHIYDLHRTAENLKE-AFYQSIMAGMDMHMHGIYWNELVCELVREGRIPE 367

Query: 329 ADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGA 388
           + ID S+R +  V  RLG F+     +        +P H   A EAAR  IVLLKND G 
Sbjct: 368 SRIDESVRRILDVKFRLGIFENPYADEARTMEVRLSPGHRATALEAARNSIVLLKND-GV 426

Query: 389 LPLNTGNIKTLALVGPHANATKAMIGNYEGT--PCRYTSPMDGFYAYSKVINYAPGCADI 446
           LPL+    K + + G +A+  + ++G++  +  P   T+ ++G    +   ++     D 
Sbjct: 427 LPLDASKYKRVMVTGINAD-DENILGDWSASQRPENVTTILEGLREVAPDTHFE--FVDQ 483

Query: 447 VCQNNSMIPA----AIDAAKNADATVIVAG-------LDLSVEAEGKDRVDLLLPGFQTE 495
                +M PA    A + A++AD  ++VAG         L    E  DR D+ L G Q E
Sbjct: 484 GWNPQTMSPAQVEKAAEHARHADLNIVVAGEYMMRHRWALRTGGEDTDRSDIDLVGLQNE 543

Query: 496 LINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYN 555
           LI KVA + K P  L++++   + + +A  N  + +I+    PG  GG+A+A++++G  N
Sbjct: 544 LIEKVAASGK-PTILILVNGRQLGVEWAAEN--LPAIVEAWEPGMYGGQAVAEILYGTVN 600

Query: 556 PGGRLPITW-YEANYVKIPYTSMPLRPVNNF-PGRTYKFFDGPVVYPFGYGLSYTQFKYK 613
           P  +LP+T       +++ Y   P    + +  G++        ++PFG+GLSYT ++Y 
Sbjct: 601 PSAKLPVTIPRSVGQIQMYYNHKPSLYFHPYAAGKS-----SSPLWPFGFGLSYTTYEYS 655

Query: 614 VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMD 673
                   D++L  D+   D     GT                       + V+N G  D
Sbjct: 656 --------DLRLSSDEIAAD-----GT-------------------LDVTVRVKNTGSRD 683

Query: 674 GSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL 732
           G E++ +Y +    + T  +K++  + RV + AG++  + FT+   K L+ +D     ++
Sbjct: 684 GVEIIQLYIRDLYSSVTRPVKELKDFGRVALKAGETKDITFTITPDK-LQFLDKDLRPVV 742

Query: 733 ASGAHTILVG 742
             G   ++VG
Sbjct: 743 EPGEFVVMVG 752


>gi|189461690|ref|ZP_03010475.1| hypothetical protein BACCOP_02354 [Bacteroides coprocola DSM 17136]
 gi|189431577|gb|EDV00562.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           coprocola DSM 17136]
          Length = 499

 Score =  271 bits (694), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 160/420 (38%), Positives = 231/420 (55%), Gaps = 45/420 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y +   P  ER  DL+ R+T+ EKV  +   + G+ RL +P Y   +EALHGV   GR  
Sbjct: 28  YKNEDAPLHERIMDLLSRLTVEEKVSLLRATSPGISRLDIPKYYHGNEALHGVVRPGR-- 85

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
                          T FP  I   A++N  L  ++   +S EARA +N  + G      
Sbjct: 86  --------------FTVFPQAIGLAATWNPELQYQVATVISDEARARWNELDQGKLQKGQ 131

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFWSP +N+ RDPRWGR  ET GEDPY+ G     +VRGLQ           D+R
Sbjct: 132 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGTMGTAFVRGLQG---------DDAR 182

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LK+ +  KH+AA    N E ++RF  + +++E+ ++E ++  FE C+ +G  +S+M +Y
Sbjct: 183 YLKVVSTPKHFAA----NNEEHNRFECNPQISEKQLREYYLPAFEACIKDGKAASIMSAY 238

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N +N +P   +  LL + +R DW F GY+VSDC     +V +HK++  TKE A    +KA
Sbjct: 239 NAINNVPCTLNSWLLTKVLRHDWGFQGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSIKA 297

Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           GLDL+CG D Y    + A +Q  +++ADID++   +    MRLG FD      Y  +  +
Sbjct: 298 GLDLECGDDVYYEPLLNAYKQYMVSDADIDSTAYHVLKARMRLGLFDNGKNNPYTKISPS 357

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            I +  H  +A EAARQ IVLLKN N  LPL+T  +K++A+VG   NA     G+Y G+P
Sbjct: 358 IIGSKLHQRVALEAARQCIVLLKNHNWVLPLDTKKLKSIAVVG--INAGNCEFGDYSGSP 415


>gi|423214394|ref|ZP_17200922.1| hypothetical protein HMPREF1074_02454 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692809|gb|EIY86045.1| hypothetical protein HMPREF1074_02454 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 800

 Score =  271 bits (694), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 231/800 (28%), Positives = 361/800 (45%), Gaps = 141/800 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
           Y D   P   R  DL+ +MTL EK  QM  L YG  R+     P   W    W + +   
Sbjct: 56  YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNI 114

Query: 64  ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
               +G+   G   + P                       P    +  + G     AT F
Sbjct: 115 DEQANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMF 174

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
           P      A++N+ L ++I +  + EA+A+      G T  +SP +++ +DPRWGRV+E+ 
Sbjct: 175 PAQCGQGATWNKKLIREIAKVTANEAKAL------GYTNIYSPILDIAQDPRWGRVVESY 228

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+ G      + GLQ  EG             I A  KH+A Y +     +     
Sbjct: 229 GEDPYLAGELGKQMILGLQS-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 274

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
           D  V  ++M+  ++ PF   + E     VM SYN  +G P       L + +R  W F G
Sbjct: 275 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 334

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
           Y+VSD ++++ +   H+ +  T+E+  A+V+ AGL++      TNFT           A+
Sbjct: 335 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRHAI 388

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
            +GK++   +D  +  +  V   +G FD   P      +  + N  H  ++ +AA + +V
Sbjct: 389 NEGKVSLHTLDQRVSEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESVV 448

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
           LLKN N  LPL+  N K +A++GP+A   K +   Y        +   G   Y  +  + 
Sbjct: 449 LLKNKNQMLPLSK-NFKKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVR 507

Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
           YA GC DI+                Q  +MI  A++ AK +D  ++V G +     E   
Sbjct: 508 YAKGC-DIIDKYFPESELYNVPLDTQEQAMIQEAVELAKASDIAILVLGGNEKTVREEFS 566

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R +L L G Q +L+  V    K PV LV++   A  IN+A  N  + +I+   +PGE  G
Sbjct: 567 RTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMG 623

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
            AIA V+FG YNPGGRL +T +  +  +IP+ + P +P ++  G+     DG V+YPFGY
Sbjct: 624 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKGKVR--VDG-VLYPFGY 678

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           GLSYT F Y         D+K+ K          +G  +                  T  
Sbjct: 679 GLSYTTFGYS--------DLKISKP--------VIGPQE----------------NITLS 706

Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
             V+N GK  G EVV +Y +    + T   +V+ G+ER+ +  G+   V FT+   + L 
Sbjct: 707 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLG 765

Query: 723 IVDNAANSLLASGAHTILVG 742
           + D      +  G+ +++VG
Sbjct: 766 LWDKNNQFTVEPGSFSVMVG 785


>gi|365122193|ref|ZP_09339098.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363642907|gb|EHL82241.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 853

 Score =  271 bits (694), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 163/431 (37%), Positives = 234/431 (54%), Gaps = 45/431 (10%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           S+ V  +   Y D   P  ER  DL+ R+T+ EK+  +   + G+PRLG+  Y   +EAL
Sbjct: 20  SVAVAQTKELYKDMNAPQHERIMDLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEAL 79

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HGV          PG          T FP  I   + +N  L  +I   +S EAR  +N 
Sbjct: 80  HGVV--------RPGNF--------TVFPQAIGLASMWNPELLYEISTAISDEARGRWNE 123

Query: 124 GNAG----------LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEG 173
            N G          LTFWSP +N+ RDPRWGR  ET GEDP++ G+  + +V+GLQ    
Sbjct: 124 LNRGKDQKGFFSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVAFVKGLQG--- 180

Query: 174 VEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNE 233
                 +D R LKI +  KH+AA    N E ++RF  +  ++E++++E ++  FE C+ E
Sbjct: 181 ------NDPRYLKIVSTPKHFAA----NNEEHNRFECNPHISERNLREYYLPAFESCIKE 230

Query: 234 GDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTK 293
           G   S+M +YN +N +P   +P LL Q +R +W F+GY+VSDC     +V  HK++  T 
Sbjct: 231 GKAQSIMSAYNAINDVPCTLNPWLLTQVLRKEWGFNGYVVSDCGGPGFLVTHHKYVK-TP 289

Query: 294 EDAVARVLKAGLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP 352
           E A    +KAGLDL+CGD  Y    M A +Q  + +ADIDT+   +    M LG FD   
Sbjct: 290 EAAATLSIKAGLDLECGDNVYIEPLMNAYKQCMVTDADIDTAAYRILRARMMLGLFDDPE 349

Query: 353 Q--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATK 410
           +  Y  +  + +   +H +LA EAARQ +VLLKN+   LPLN   +K++A+VG   NA  
Sbjct: 350 KNPYNAISPSIVGCEKHRQLALEAARQSLVLLKNEKNFLPLNPKKVKSIAVVG--INAGN 407

Query: 411 AMIGNYEGTPC 421
              G+Y GTP 
Sbjct: 408 CEFGDYSGTPV 418



 Score =  136 bits (342), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 102/287 (35%), Positives = 144/287 (50%), Gaps = 51/287 (17%)

Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-V 518
           A +  D TV V G++ S+E EG+DR  + LP  Q   I +   A   P T+V++ AG+ +
Sbjct: 600 AIRECDVTVAVLGINKSIEREGQDRYTIELPADQQLFIKEAYKA--NPNTVVVLVAGSSL 657

Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
            IN+   N  I +IL   YPGE+GG A+A+ +FG YNPGGRLP+T+Y +      +    
Sbjct: 658 AINWIDEN--IPAILNAWYPGEQGGTAVAEALFGDYNPGGRLPLTYYRSLDELPAFDDYD 715

Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV 638
           ++      GRTY +F+   +YPFGYGLSYT+F YK   S  S D                
Sbjct: 716 IQ-----KGRTYMYFENKPLYPFGYGLSYTRFDYKNLKSEVSDD---------------- 754

Query: 639 GTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVI 696
                             + KFT    V+N GK  G EV  VY + P  GI    +KQ+ 
Sbjct: 755 ----------------AVNLKFT----VKNTGKYAGDEVAQVYVRFPESGIK-VPLKQLK 793

Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
           G+ERV I  G+SA+V  ++   K L++ D         SG +  +VG
Sbjct: 794 GFERVHIGKGKSAQVSVSIPK-KELRLWDEKDGKFYTPSGNYIFMVG 839


>gi|332982620|ref|YP_004464061.1| glycoside hydrolase [Mahella australiensis 50-1 BON]
 gi|332700298|gb|AEE97239.1| glycoside hydrolase family 3 domain protein [Mahella australiensis
           50-1 BON]
          Length = 753

 Score =  271 bits (694), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 217/676 (32%), Positives = 334/676 (49%), Gaps = 84/676 (12%)

Query: 87  GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
           GAT FP  I   ++++    + +   +  + +A     + GL   SP ++V RDPRWGRV
Sbjct: 108 GATVFPQAIGLASTWDAEAIEAMAGVIRQQMKAAG--AHQGL---SPVLDVARDPRWGRV 162

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
            ET GEDPY+V   A++YVRGLQ   G +  +        I A  KH+A +     EG  
Sbjct: 163 EETFGEDPYLVASMAVSYVRGLQ---GQDLTK-------GIFATLKHFAGHSFS--EGG- 209

Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
           R      V E+++ + F+ PFE  V E +  SVM +Y+ ++G+P  A  +LL   +RG +
Sbjct: 210 RNCAPVHVGERELWDIFLFPFEAAVREANAKSVMNAYHDIDGVPCAASRELLTDILRGHF 269

Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQG 324
            F G +VSD D+I  + ++H F    K++A  + L+AG+D++    D Y    M AV++G
Sbjct: 270 GFDGIVVSDYDAIDRLRKAH-FTAGNKKEAAVQALEAGIDIELPKMDCYGQPLMDAVKEG 328

Query: 325 KIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKN 384
            I+EA I+ S+  +      LG FDG     +        P+  E++ + AR+ IVLLKN
Sbjct: 329 MISEATINESVERVLTAKFELGLFDGVYVDVDSVPGLFETPEQREMSRDIARKSIVLLKN 388

Query: 385 DNGALPLNTGNIKTLALVGPHANATKAMIGN--------YEGTPCRYTSPMDGFYAY--- 433
           DN  LPL+  +IK++A++GP+A+  + M+G+        Y+ T     + ++G       
Sbjct: 389 DN-VLPLSK-DIKSIAVIGPNADNARNMLGDYAFMAHRSYDKTSVHIVTVLEGIKNKVLD 446

Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSV-----EAEGKDRVDLL 488
           S  I YA GC DI+  +      A++AA+ ADA ++V G +  +       E  DR D+ 
Sbjct: 447 SCRITYAKGC-DIIDPSTDGFVEAVNAARAADAAIVVVGDNSGIFGKGTSGENDDRTDIT 505

Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
           LPG Q +L+  + D  K PV +V+++  A       +N       W  YPGEEGG A+AD
Sbjct: 506 LPGVQMQLVKAIKDTGK-PVIVVLINGRAFAAKELADNASALMEAW--YPGEEGGNAVAD 562

Query: 549 VIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSY 607
           V+FG YNP GRLPI+   E   + I Y   P   +N     T   F       FGYG+SY
Sbjct: 563 VLFGDYNPAGRLPISLPCEVGQIPINYNLKPASYINYLSTETKPAF------AFGYGMSY 616

Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
           T F Y   S   +V                      P A            K     +V 
Sbjct: 617 TTFGYSDLSITPAV---------------------APSAG-----------KVDISFKVT 644

Query: 668 NMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDN 726
           N G++ G EVV +Y +    +    +K++ G++RV +  G++ ++ FT+ A   L   D 
Sbjct: 645 NAGQLAGDEVVQLYIRDEVSSIVRPVKELKGFKRVNLQPGETKEITFTLYA-DQLAFHDK 703

Query: 727 AANSLLASGAHTILVG 742
               ++  G   I+VG
Sbjct: 704 DMRLVVEPGTFKIMVG 719


>gi|329962030|ref|ZP_08300041.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
 gi|328530678|gb|EGF57536.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
          Length = 941

 Score =  271 bits (694), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 227/809 (28%), Positives = 360/809 (44%), Gaps = 144/809 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW------------ 58
           Y D       R +DL+++MTL EK  QM  L YG  R+    LP  EW            
Sbjct: 53  YEDPAATVDARVEDLLKQMTLDEKTCQMVTL-YGYKRVLKDALPTPEWKQMLWKDGIGAI 111

Query: 59  --------------------WSEALHGVS-------FIGRRTNSPPGTHFDSEVPG---- 87
                               W  + H  +       F+       P    +  + G    
Sbjct: 112 DEHLNGFQQWGLPPSDNENVWPASRHAWALNEVQRFFVEETRLGIPVDFTNEGIRGVESY 171

Query: 88  -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
            AT+FPT +    ++N +L  K+G     EAR +      G T  ++P ++V RD RWGR
Sbjct: 172 KATNFPTQLGLGHTWNRALIHKVGLITGREARML------GYTNVYAPILDVGRDQRWGR 225

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
             E  GE PY+V    I  VRGLQ                 ++A  KH+AAY  +     
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGLQQ---------------HVAATGKHFAAYSNNKGARE 270

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
                D + +  +++   I PF   + E  +  VM SYN  +GIP       L   +R +
Sbjct: 271 GMARVDPQTSPHEVENIHIYPFRRVIKEAGLLGVMSSYNDYDGIPIQGSYYWLTTRLRDE 330

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
             F GY+VSD D+++ +   H    D KE AV + ++AGL++ C     D +       V
Sbjct: 331 MGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLRELV 389

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIE-LAAEAARQGIV 380
           ++G + E  ++  +R +  V   +G FD   Q    G +     +  E +A +A+R+ +V
Sbjct: 390 KEGGLDEETVNDRVRDILRVKFLIGLFDAPYQTDLAGADKEVEKEENEAVALQASRESVV 449

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY----AYSKV 436
           LLKN+N  LPLN   +K +A+ GP+A+     + +Y       T+ + G        ++V
Sbjct: 450 LLKNENSTLPLNINTVKKIAVCGPNADEDGYALTHYGPLAVEVTTVLKGIQDKVNGKAEV 509

Query: 437 INYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
           + Y  GC D+V  N                + I  A++ A+ AD  V+V G       E 
Sbjct: 510 L-YTKGC-DLVDANWPESEIIDYPLTPDEQAEINKAVENARRADVAVVVLGGGQRTCGEN 567

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
           K R  L LPG Q +L+  V    K PV L++++   + +N+A  +  + +IL   YPG +
Sbjct: 568 KSRSSLDLPGRQLQLLQAVQATGK-PVVLILINGRPLSVNWA--DKYVPAILEAWYPGSK 624

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV---- 597
           GG A+AD++FG YNPGG+L +T +     +IP+ + P +P +   G      DG +    
Sbjct: 625 GGVALADILFGDYNPGGKLTVT-FPKTVGQIPF-NFPCKPASQIDGGKNAGPDGNMSRIN 682

Query: 598 --VYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
             +YPFGYGLSYT F+Y  +  +PK                                 V 
Sbjct: 683 GALYPFGYGLSYTTFEYSNLEITPK---------------------------------VI 709

Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGF 713
             + K T +++V N GK  G EVV +Y++       T+ K + G+ER+ +  G++ +V F
Sbjct: 710 TPNEKATVRLKVTNTGKYAGDEVVQLYTRDVLSSVTTYEKNLAGFERIHLEPGETKEVTF 769

Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVG 742
            ++  K L+++D     ++  G   I+ G
Sbjct: 770 ILDR-KHLELLDADMKRVVEPGDFAIMAG 797


>gi|60680320|ref|YP_210464.1| beta-glucosidase [Bacteroides fragilis NCTC 9343]
 gi|60491754|emb|CAH06512.1| putative beta-glucosidase [Bacteroides fragilis NCTC 9343]
          Length = 814

 Score =  271 bits (694), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 235/813 (28%), Positives = 357/813 (43%), Gaps = 153/813 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
           Y +   P   R + L+ +MTL EKV QM      +  LG P+YE                
Sbjct: 49  YENPSAPVEYRVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEDIRLTPQLEKEI 102

Query: 58  ----------------WWSEALH-GV--SFIGRRTNSPPG---THFDSEVP--------- 86
                           W    LH G+  S   R +N        H    +P         
Sbjct: 103 GEYHIGSLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPH 162

Query: 87  -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
                G T FPT I   +++N  L +++G+ ++ EA A           + P +++ RDP
Sbjct: 163 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDP 217

Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
           RW RV ET GEDPY+ G      VRG Q     E   D  S    + A  KH+A+Y    
Sbjct: 218 RWSRVEETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---G 266

Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
           W         + + E++++E    PF   V  G + SVM SYN ++G P      LL   
Sbjct: 267 WTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDI 325

Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
           ++  W F G++VSD  ++  + E     ND   +A  + + AG+D D G + Y    + A
Sbjct: 326 LKDRWQFKGFVVSDLYAVGGLREHGVAGNDY--EAAIKAVNAGVDSDLGTNVYAEQLVAA 383

Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
           V++G +A A ID ++R +  +  ++G FD     +      + + +H  LA E ARQ IV
Sbjct: 384 VKRGDVAVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIV 443

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
           LLKN +  LPL   +I+TLA++GP+A+    M+G+Y     +GT       +    +   
Sbjct: 444 LLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKET 502

Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
            + YA GC  +   + +    AI+ A+NADA V+V G     D S E             
Sbjct: 503 RVLYAKGCT-VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQ 561

Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
                  EG DR  L L G Q EL+ +++   K PV LV++    + +  A    +    
Sbjct: 562 ISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVD 620

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
            W  YPG +GG A+ADV+FG YNP GRL ++      V      +P+       G   ++
Sbjct: 621 AW--YPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTRRKGNRSRY 672

Query: 593 FDGPVV--YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
            + P    YPFGYGLSYT F Y         D+K         +  T G++         
Sbjct: 673 IEEPGTPRYPFGYGLSYTTFSY--------TDMK---------VQVTEGSD--------- 706

Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSA 709
                 D +    + ++N G  DG EV  +Y +    +  T  KQ+  + R+ + AG+S 
Sbjct: 707 ------DCRVDVTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESR 760

Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +V FT++  KSL +       ++  G  TI+VG
Sbjct: 761 EVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 792


>gi|297543748|ref|YP_003676050.1| glycoside hydrolase family 3 domain-containing protein
           [Thermoanaerobacter mathranii subsp. mathranii str. A3]
 gi|296841523|gb|ADH60039.1| glycoside hydrolase family 3 domain protein [Thermoanaerobacter
           mathranii subsp. mathranii str. A3]
          Length = 787

 Score =  271 bits (694), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 224/816 (27%), Positives = 376/816 (46%), Gaps = 156/816 (19%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL---------- 63
           Y D K P  ++ ++L+ +MT+ EK+ Q+          G+ +YE   + +          
Sbjct: 6   YLDPKQPVEKKVENLLAQMTIEEKIAQLS---------GIWVYEILDDMMKFSYKKANRL 56

Query: 64  --HGVSFIGR---------------------------RTNSPPGTHFDS----EVPGATS 90
             HG+  I R                           R   P   H +S       GAT 
Sbjct: 57  MTHGIGQITRLGGASNLSPQETVKIANQIQKYLVENTRLGIPALIHEESCSGYMAKGATI 116

Query: 91  FPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETP 150
           FP  I   +++N  L +K+   +  + +A+           +P ++V RDPRWGR  ET 
Sbjct: 117 FPQTIGVASTWNPKLVEKMASVIREQMKAV-----GARQALAPLLDVTRDPRWGRTEETF 171

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+V    ++Y+RGLQ          +++    + A  KH+  Y   N EG   +  
Sbjct: 172 GEDPYLVMHMGVSYIRGLQ----------TENLKEGVIATGKHFVGYG--NSEGGMNWA- 218

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
            + +  +++ E F+ PFE  V E  + S+M  Y+ ++GIP     +LL   +R +W F G
Sbjct: 219 PAHIPMRELYEIFLYPFEAAVKEAKLGSIMPGYHELDGIPCHKSKQLLTDILRKNWGFDG 278

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQGKIAE 328
            +VSD  +I  + E H+  ++ KE A    L+AG+D++    D Y       ++QG I  
Sbjct: 279 IVVSDYFAINQLYEYHRLASNKKE-AAKLALEAGVDVELPSTDCYGLPIKELIEQGDIDI 337

Query: 329 ADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQ-HIELAAEAARQGIVLLKNDNG 387
             ++ ++R +      LG F+ +P         I + Q   +LA + A++ IVLLKN++ 
Sbjct: 338 DFVNDAVRRILKAKFLLGLFE-NPYVDEKRVVEIFDTQEQRQLAYKIAQESIVLLKNESN 396

Query: 388 ALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR-------------YTSPM-DGFYA- 432
            LPL   +++++A++GP+A+  + MIG+Y   PC              + +P+ +G  A 
Sbjct: 397 LLPLKK-DLQSIAVIGPNADNIRNMIGDY-AYPCHIESLLEMREKDNVFNTPLPEGLEAK 454

Query: 433 -------------------YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG- 472
                               +KVI YA GC D++  + +    A++ AK AD  ++V G 
Sbjct: 455 DIYVPIVSVLQGIKEKVSPKTKVI-YAKGC-DVISDDTAGFNKAVEIAKQADVAIVVVGD 512

Query: 473 ----LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
                D     E +DR DL LPG Q +L+  + +    PV +V+++   + I  ++   K
Sbjct: 513 RAGLTDGCTSGESRDRADLNLPGVQEQLVKAIYETGT-PVVVVLINGRPMSI--SRLAEK 569

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPG 587
           I +I+    PGEEGGRAIADVIFG YNPGG+LPI+       + + Y   P     N+ G
Sbjct: 570 IPAIIEAWLPGEEGGRAIADVIFGDYNPGGKLPISIPCSVGQLPVYYYHKPSGGRTNWKG 629

Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
              +    P +YPFGYGLSYT+F Y                      N  +   K     
Sbjct: 630 DYVESSTKP-LYPFGYGLSYTEFLYS---------------------NLNISNPK----- 662

Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAG 706
                V  ++      ++V+N+GK+ G EVV +Y     ++ T  +K++ G++R+ +  G
Sbjct: 663 -----VSTQEGIIEISVDVKNIGKVKGDEVVQLYIHREFLSVTRPVKELKGFKRITLDVG 717

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +   V F +++ + L   +     ++  G   +++G
Sbjct: 718 EQKTVIFQLSS-EQLGFYNEEMEYVVEPGRVEVMIG 752


>gi|294675412|ref|YP_003576028.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
 gi|294472176|gb|ADE81565.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
          Length = 875

 Score =  271 bits (694), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 158/446 (35%), Positives = 237/446 (53%), Gaps = 42/446 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY +A L   +RA DL+ R+TL EKV  M D +  +PRLG+P ++WW+EALHG+   G 
Sbjct: 23  LPYQNANLSAAQRADDLLSRLTLDEKVSLMMDTSPAIPRLGIPQFQWWNEALHGIGRNGF 82

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
                           AT FP  +   AS++++L  ++   VS EAR             
Sbjct: 83  ----------------ATVFPITMAMAASWDDALLHQVFTAVSDEARVKAQQAKCTGDIK 126

Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS--D 181
               L+FW+PNIN+ RDPRWGR  ET GEDPY+  +  +  VRGLQ   GV Y+ +    
Sbjct: 127 RYQSLSFWTPNINIFRDPRWGRGQETYGEDPYLTAKMGLAVVRGLQ---GVGYNGEDLGV 183

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFD-SRVTEQDMQETFILPFEMCVNEGDVSSVM 240
           S+  K+ AC KH+A +    W   +R  F+   + E+D+ ET++  F+  V EG V+ VM
Sbjct: 184 SKYRKLLACAKHFAVHSGPEW---NRHEFNIENLPERDLWETYLPAFKALVQEGKVAEVM 240

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
           C+Y R++G   CA  +   Q +R +W F G I SDC +I+  +     ++    +A A+ 
Sbjct: 241 CAYQRIDGQACCAQTRYEQQILRDEWGFDGLITSDCGAIRDFLPRWHNVSKDGAEASAKA 300

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLG 358
           + AG D++CG  Y +    AV++G + EADID SLR L I    LG  D      +  + 
Sbjct: 301 VLAGTDVECGSEYKHLPE-AVRRGDVKEADIDRSLRRLLIARFELGDMDSDDLNAWTKIP 359

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPL------NTGNIKTLALVGPHANATKAM 412
           +  + +  H +LA + A + IVLL+N    LPL        G+ K + ++GP+AN +  M
Sbjct: 360 ETVVASQAHKDLALKMALKSIVLLQNKIKVLPLGNPLNAGAGSDKDIVVMGPNANDSVMM 419

Query: 413 IGNYEGTPCRYTSPMDGFYAYSKVIN 438
            GNY G P    + +DG    +K ++
Sbjct: 420 WGNYAGYPTHTVTALDGITRMAKTLS 445



 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 72/321 (22%), Positives = 130/321 (40%), Gaps = 61/321 (19%)

Query: 433 YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----------GK 482
           Y +   Y     DI  + N      +    N    + V G+  ++E E          G 
Sbjct: 589 YVQETGYGALNFDIKKRVNPTAEELLAQIGNTQTIIFVGGISPNLEGEEMRVNEPGFKGG 648

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
           DR  + LP  Q +L+  +  A K    ++ ++     +  A       +IL   Y GE+G
Sbjct: 649 DRTSIELPQAQRDLLAVLHKAGK---KVIFVNCSGSAMALAPELETCDAILQWWYGGEQG 705

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
           G A+A  +FG   P G+LP+T+Y++         +P         RTY++++G  ++PFG
Sbjct: 706 GAALATTLFGMVAPSGKLPVTFYKST------DELPDFLDYTMKNRTYRYYEGEPLFPFG 759

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
           +GL YT F         ++D  + K+ +                                
Sbjct: 760 FGLGYTTF---------NIDKPIYKNNKV------------------------------- 779

Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
           Q+ V+N+G   G+E V VY +         K +  Y++V + A ++  +   +   KS +
Sbjct: 780 QVRVKNLGTTAGTETVQVYIRHLADKEGPKKSLRAYQQVTLNAAEAKTISIEL-PRKSFE 838

Query: 723 IVDNAANSL-LASGAHTILVG 742
             D   N++ +  G + ++VG
Sbjct: 839 GWDVKTNTMRVVPGKYEVMVG 859


>gi|29350122|ref|NP_813625.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
           [Bacteroides thetaiotaomicron VPI-5482]
 gi|29342034|gb|AAO79819.1| periplasmic beta-glucosidase precursor, xylosidase/arabinosidase
           [Bacteroides thetaiotaomicron VPI-5482]
          Length = 769

 Score =  271 bits (694), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 227/728 (31%), Positives = 345/728 (47%), Gaps = 124/728 (17%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+PL+    EA HG   IG                  T FPT I   A+++ +L +++
Sbjct: 114 RLGIPLF-LAEEAPHGHMAIG-----------------TTVFPTGIGMAATWSPTLIEEV 155

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           G  ++ E R+           + P +++ RDPRW RV ET GEDP + GR     + GL 
Sbjct: 156 GNVIAKEIRS-----QGAHISYGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMILGLG 210

Query: 170 DVE-GVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFE 228
             +   EY            A  KH+ AY +   EG    ++ S V  +D+ E F+ PF 
Sbjct: 211 SGDLSCEY---------ATIATLKHFLAYAVP--EGGQNGNYAS-VGTRDLHENFLPPFR 258

Query: 229 MCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKF 288
             ++ G +S VM SYN ++G+P  A+  LL Q +R +W F G++VSD  SI+ + ESH F
Sbjct: 259 EAIDAGALS-VMTSYNSIDGVPCTANHYLLTQLLRNEWRFRGFVVSDLYSIEGVHESH-F 316

Query: 289 LNDTKEDAVARVLKAGLDLDC-GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
           +  T E+A  + + AG D+D  GD + N T  AVQ GKI+EA IDT++  +  +   +G 
Sbjct: 317 VAPTIEEAAMQAVSAGADIDLGGDAFMNLTH-AVQFGKISEAVIDTAVCRVLRMKFEIGL 375

Query: 348 FDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
           F+            + +  HI+LA + A+  IVLLKN+N  LPLN   IK +A+VGP+A+
Sbjct: 376 FEHPYVNPKTATKIVRSKDHIKLARKVAQSSIVLLKNENSILPLNK-KIKKVAVVGPNAD 434

Query: 408 ATKAMIGNYEG--TPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAK 462
               M+G+Y          + +DG  +    SKV  Y  GCA I     + I  A++AA 
Sbjct: 435 NRYNMLGDYTAPQEDENIKTVLDGVISKLSPSKV-EYVRGCA-IRDTTVNEIAEAVEAAS 492

Query: 463 NADATVIVAGLDLSVE-----------------------AEGKDRVDLLLPGFQTELINK 499
            ++  + V G   + +                        EG DR  L L G Q +L+  
Sbjct: 493 RSEVIIAVVGGSSARDFKTSYQETGAAIADEKSISDMECGEGFDRATLTLLGKQQDLLIA 552

Query: 500 VADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGR 559
           +    K P+ +V +    +D  +A       ++L   YPG+EGG AIADV+FG YNP GR
Sbjct: 553 LKATGK-PLIVVYIEGRPLDKVWASEYA--DALLTASYPGQEGGYAIADVLFGDYNPAGR 609

Query: 560 LPITWYEANYVKIPYTSMPLRPV--NNFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVA 615
           LP++        IP  S+   PV  N    R + + +     +Y FGYGLSYT F+Y   
Sbjct: 610 LPVS--------IP-RSVGQIPVYYNKKAPRNHDYVEQAASPLYTFGYGLSYTTFEYS-- 658

Query: 616 SSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGS 675
                 D+++                K PC              F    +V+N G  DG 
Sbjct: 659 ------DLQV--------------IRKSPC-------------HFEVSFKVKNTGSYDGE 685

Query: 676 EVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS 734
           EV  +Y +    +    ++Q+  +ER F+  G+  ++ FT+   K L I+D     ++ +
Sbjct: 686 EVAQLYLRDEYASVVQPLRQLKCFERFFLKRGEEKEIFFTLTE-KDLSIIDRNMKRVVET 744

Query: 735 GAHTILVG 742
           G   I++G
Sbjct: 745 GDFRIMIG 752


>gi|329956938|ref|ZP_08297506.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
 gi|328523695|gb|EGF50787.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
          Length = 944

 Score =  271 bits (694), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 231/810 (28%), Positives = 368/810 (45%), Gaps = 144/810 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
           Y D      +R ++L+++MTL EK  QM  L YG  R+    LP  EW    W +   G+
Sbjct: 53  YEDPAATLDDRIENLLQQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKD---GI 108

Query: 67  SFIGRRTNS---------------PPGTH----------------------FDSE-VPG- 87
             I    N                P   H                      F +E + G 
Sbjct: 109 GAIDEHLNGFQQWGLPPSDNENVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGV 168

Query: 88  ----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPR 142
               AT+FPT +    ++N  L +++G     EAR +      G T  ++P ++V RD R
Sbjct: 169 ESYKATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQR 222

Query: 143 WGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNW 202
           WGR  E  GE PY+V    I  VRGLQ       H        +++A  KH+AAY  +  
Sbjct: 223 WGRYEEVYGESPYLVAELGIEMVRGLQ-------HNH------QVAATAKHFAAYSNNKG 269

Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
                   D +++ ++++   I PF+  + E  +  +M SYN  +GIP       L   +
Sbjct: 270 AREGMSRVDPQMSPREVENIHIYPFKRVIRETGLLGIMSSYNDYDGIPVQGSYYWLTTRL 329

Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTM 318
           R +  F GY+VSD D+++ +   H    D KE AV + ++AGL++ C     D +     
Sbjct: 330 RQEMGFRGYVVSDSDAVEYLYTKHNTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLR 388

Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIE-LAAEAARQ 377
             V++G ++E  I+  +R +  V   +G FD   Q    G +N       E +A +A+R+
Sbjct: 389 ELVKEGGLSEEVINDRVRDILRVKFLIGLFDSPYQTDLAGADNEVEKAANEAVALQASRE 448

Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK-- 435
            +VLLKN +  LPLN   IK +A+ GP+A+     + +Y       T+ ++G    ++  
Sbjct: 449 SVVLLKNADNTLPLNIDKIKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIREKAQGK 508

Query: 436 -VINYAPGC---------ADIV-----CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
             + Y  GC         ++I+         + I  A   A+ AD  V+V G       E
Sbjct: 509 AEVLYTKGCDLVDAHWPESEIIEYPLTPDEQAEIDRAAANARQADVAVVVLGGGQRTCGE 568

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
            K R  L LPG Q +L+  V    K PV LV+++   + +N+A  +  + +IL   YPG 
Sbjct: 569 NKSRTSLDLPGHQLKLLQAVQATGK-PVVLVLINGRPLSVNWA--DKFVPAILEAWYPGS 625

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV--- 597
           +GG A+AD++FG YNPGG+L +T +     +IP+ + P +P +   G      DG +   
Sbjct: 626 KGGTAVADILFGDYNPGGKLTVT-FPKTVGQIPF-NFPCKPASQIDGGKNPGADGNMSRI 683

Query: 598 ---VYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
              +YPFGYGLSYT F+Y  +  SPK                                 V
Sbjct: 684 NGALYPFGYGLSYTTFEYSDLEISPK---------------------------------V 710

Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVG 712
              D K T +++V N GK  G EVV +Y++       T+ K + G+ER+ +  G++ +V 
Sbjct: 711 ITPDQKATVRLKVTNTGKRAGDEVVQLYTRDILSSITTYEKNLAGFERIRLKPGETKEVT 770

Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVG 742
           FT++  K L++++     ++  G   I+ G
Sbjct: 771 FTLDR-KHLELLNADMKWIVEPGEFAIMAG 799


>gi|170731072|ref|YP_001776505.1| beta-glucosidase [Xylella fastidiosa M12]
 gi|167965865|gb|ACA12875.1| Beta-glucosidase [Xylella fastidiosa M12]
          Length = 882

 Score =  271 bits (693), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 170/451 (37%), Positives = 237/451 (52%), Gaps = 52/451 (11%)

Query: 20  PYPER-AKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPG 78
           P PE+ A  LV +MT  EK+ Q  + A  +PRLG+P Y+WWSE LHG++  G        
Sbjct: 28  PSPEQHAAALVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNGY------- 80

Query: 79  THFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLT 129
                    AT FP  I   AS+N  L + +G   STEARA +NL           AGLT
Sbjct: 81  ---------ATVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLT 131

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
            WSPNIN+ RDPRWGR +ET GEDPY+  + A++++RGLQ         +    P  I A
Sbjct: 132 LWSPNINIFRDPRWGRGMETYGEDPYLTSQLAVSFIRGLQG--------NIPDHPRTI-A 182

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
             KH+A +         R  FD  V+  D++ T+   F   + +G   SVMC+YN ++G 
Sbjct: 183 TPKHFAVHSGPE---PGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGT 239

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           P CA   LLN  +R DW F+G++VSDCD+I+ +   H F  D    A A  LK+G DL+C
Sbjct: 240 PACASDWLLNTRLRNDWGFNGFVVSDCDAIEDMTRFHFFRQDNAS-ASAAALKSGDDLNC 298

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQH 367
           G+ Y +    A+ +G I E+ +D +L  L+    RLG         Y  +G  +I  P H
Sbjct: 299 GNTYRDLNQ-AIARGDIDESTLDQALIRLFTARQRLGTLQPREHDPYAAIGIKHIDTPAH 357

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
             LA +AA Q +VLLKN    LPL  G   TLA++GP A++  A+  NY+GT     +P+
Sbjct: 358 RALALQAAAQSLVLLKNSGNTLPLTPGT--TLAVLGPDADSLTALEANYQGTSSTPVTPL 415

Query: 428 DGFYA--------YSKVINYAPGCADIVCQN 450
            G           Y++  + APG    + + 
Sbjct: 416 IGLRTRFGTAKVHYAQGASLAPGVPSTITET 446



 Score =  143 bits (360), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 96/295 (32%), Positives = 138/295 (46%), Gaps = 52/295 (17%)

Query: 460 AAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVT 509
           A  +ADA V   GL   VE E          G DR  + LP  Q  L+  V    K P+ 
Sbjct: 607 AVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK-PLI 665

Query: 510 LVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANY 569
           +V+MS  AV +N+A+++    +IL   YPG+ GG AIA  + G  NPGGRLP+T+Y +  
Sbjct: 666 VVLMSGSAVALNWAQHH--ADAILAAWYPGQSGGTAIAQALAGDVNPGGRLPMTFYRSTQ 723

Query: 570 VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQ 629
              PY S       +  GRTY++F G  +YPFGYGLSYTQF Y+      +         
Sbjct: 724 DLPPYISY------DMTGRTYRYFKGQPLYPFGYGLSYTQFAYEAPQLSTAT-------- 769

Query: 630 QCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG 689
                                  +K  D   T    V N G   G EVV +Y +PP    
Sbjct: 770 -----------------------LKAGD-TLTVTAHVRNTGTRAGDEVVQLYLEPPHSPQ 805

Query: 690 THIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
             ++ ++G++RV +  G+S  + FT++A + L  V       + +G + + VG G
Sbjct: 806 APLRNLVGFKRVTLRPGESRLLTFTLDA-RQLSSVQQTGQRSVEAGHYHLFVGGG 859


>gi|160882671|ref|ZP_02063674.1| hypothetical protein BACOVA_00625 [Bacteroides ovatus ATCC 8483]
 gi|423289150|ref|ZP_17268000.1| hypothetical protein HMPREF1069_03043 [Bacteroides ovatus CL02T12C04]
 gi|423298450|ref|ZP_17276507.1| hypothetical protein HMPREF1070_05172 [Bacteroides ovatus CL03T12C18]
 gi|156111986|gb|EDO13731.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
            ovatus ATCC 8483]
 gi|392662991|gb|EIY56545.1| hypothetical protein HMPREF1070_05172 [Bacteroides ovatus CL03T12C18]
 gi|392667846|gb|EIY61351.1| hypothetical protein HMPREF1069_03043 [Bacteroides ovatus CL02T12C04]
          Length = 1049

 Score =  271 bits (693), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 219/767 (28%), Positives = 358/767 (46%), Gaps = 100/767 (13%)

Query: 16   DAKLPYPERA----KDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG- 70
            ++KLP+   A    KDL+ RMT+ EK+ Q+     G   L  P  E+ S++L     +G 
Sbjct: 328  NSKLPHTPEADSFVKDLLSRMTVEEKIGQLSQYV-GRTLLTGPESEYLSDSLIARGLVGS 386

Query: 71   ----------RRTNSPPGTHFDSEVP----------GATSFPTVILTTASFNESLWKKIG 110
                      R        H   ++P            T FPT +  + S++ +  ++  
Sbjct: 387  VLNISGAKTLRDLQEKNMRHSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAA 446

Query: 111  QTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
            +  + E+ A      AGL + ++P +++ RD RWGRV+E  GED Y+    A   V G Q
Sbjct: 447  KIAAIESSA------AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQ 500

Query: 170  DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                  +  +S      + AC KH+ AY L    G D    D  ++E+ + +T++ PF+ 
Sbjct: 501  ---WNLWENNS------VLACAKHWVAYGLPQ-AGRDYAPVD--MSERTLFDTYLPPFKA 548

Query: 230  CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            C++ G V + M ++N +NGIP  A P LL   +RG WNF+G++VSD ++++ +V      
Sbjct: 549  CIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLVAQGVAE 607

Query: 290  NDTKEDAVARVLKAGLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
            +D  +DA      +G+D+D  D  Y  +    ++ GKI+  D+D S+  +  +   LG F
Sbjct: 608  DD--KDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF 665

Query: 349  DGSPQYKN--LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHA 406
                ++ N       I   + ++ A + A +  VLLKNDN  LPL   N++++A+VGP A
Sbjct: 666  VDPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPL-AKNVRSIAVVGPLA 724

Query: 407  NATKAMIGNY--EGTPCRYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDA 460
            +    ++G++   G     T+ + G           + YA GC D   ++ S    A+  
Sbjct: 725  DNQTELLGSWRARGEDRHVTTVLQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKEAVKL 783

Query: 461  AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
            A  +D  + V G    +  E + R  L LPG Q ELI ++    K PV +V+M+   + I
Sbjct: 784  ASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGRPLSI 842

Query: 521  NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN-YVKIPYT-SMP 578
             +   N  + +IL   + G   G AIAD++FG YNP GRL I++      V I Y     
Sbjct: 843  EWVDKN--VSAILETWFLGTSAGTAIADILFGDYNPSGRLTISFPRVEGQVPIYYNYKKS 900

Query: 579  LRPVNNFPGRTYKFFDGP--VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINY 636
             RP +     T +  D P   +YPFGYGLSYT F Y   S+P+S   +  + +       
Sbjct: 901  GRPGDMLHSSTTRHIDVPNAPLYPFGYGLSYTTFSY---SAPQSTQKEYTRQET------ 951

Query: 637  TVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQV 695
                                    +  + V N G  DG E V +Y      +    +K++
Sbjct: 952  -----------------------ISVSVTVTNTGDRDGEETVQLYVNDKVASVVRPVKEL 988

Query: 696  IGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
              ++++F+ AG+S  V F ++   +L   D A N ++  G   I+ G
Sbjct: 989  KAFKKIFLKAGESKTVQFDISPL-ALGFYDAAMNYVVEPGEFEIMTG 1034


>gi|298482587|ref|ZP_07000772.1| xylosidase [Bacteroides sp. D22]
 gi|336405443|ref|ZP_08586122.1| hypothetical protein HMPREF0127_03435 [Bacteroides sp. 1_1_30]
 gi|295085727|emb|CBK67250.1| Beta-glucosidase-related glycosidases [Bacteroides xylanisolvens
           XB1A]
 gi|298271294|gb|EFI12870.1| xylosidase [Bacteroides sp. D22]
 gi|335938024|gb|EGM99918.1| hypothetical protein HMPREF0127_03435 [Bacteroides sp. 1_1_30]
          Length = 800

 Score =  271 bits (693), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 231/800 (28%), Positives = 361/800 (45%), Gaps = 141/800 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
           Y D   P   R  DL+ +MTL EK  QM  L YG  R+     P   W    W + +   
Sbjct: 56  YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNI 114

Query: 64  ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
               +G+   G   + P                       P    +  + G     AT F
Sbjct: 115 DEQANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMF 174

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
           P      A++N+ L ++I +  + EA+A+      G T  +SP +++ +DPRWGRV+E+ 
Sbjct: 175 PAQCGQGATWNKKLIREIAKVTANEAKAL------GYTNIYSPILDIAQDPRWGRVVESY 228

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+ G      + GLQ  EG             I A  KH+A Y +     +     
Sbjct: 229 GEDPYLAGELGKQMILGLQS-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 274

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
           D  V  ++M+  ++ PF   + E     VM SYN  +G P       L + +R  W F G
Sbjct: 275 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 334

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
           Y+VSD ++++ +   H+ +  T+E+  A+V+ AGL++      TNFT           A+
Sbjct: 335 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAI 388

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
            +GK++   +D  +  +  V   +G FD   P      +  + N  H  ++ +AA + +V
Sbjct: 389 DEGKVSLHTLDQRVSEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESVV 448

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
           LLKN N  LPL+  N K +A++GP+A   K +   Y        +   G   Y  +  + 
Sbjct: 449 LLKNKNQMLPLSK-NFKKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVR 507

Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
           YA GC DI+                Q  +MI  A++ AK +D  ++V G +     E   
Sbjct: 508 YAKGC-DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFS 566

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R +L L G Q +L+  V    K PV LV++   A  IN+A  N  + +I+   +PGE  G
Sbjct: 567 RTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMG 623

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
            AIA V+FG YNPGGRL +T +  +  +IP+ + P +P ++  G+     DG V+YPFGY
Sbjct: 624 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKGKVR--VDG-VLYPFGY 678

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           GLSYT F Y         D+K+ K          +G  +                  T  
Sbjct: 679 GLSYTTFGYS--------DLKISKP--------VIGPQE----------------NITLS 706

Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
             V+N GK  G EVV +Y +    + T   +V+ G+ER+ +  G+   V FT+   + L 
Sbjct: 707 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVSFTLTP-QDLG 765

Query: 723 IVDNAANSLLASGAHTILVG 742
           + D      +  G+ +++VG
Sbjct: 766 LWDKNNQFTVEPGSFSVMVG 785


>gi|153807033|ref|ZP_01959701.1| hypothetical protein BACCAC_01310 [Bacteroides caccae ATCC 43185]
 gi|423219984|ref|ZP_17206480.1| hypothetical protein HMPREF1061_03253 [Bacteroides caccae
           CL03T12C61]
 gi|149130153|gb|EDM21363.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           caccae ATCC 43185]
 gi|392624247|gb|EIY18340.1| hypothetical protein HMPREF1061_03253 [Bacteroides caccae
           CL03T12C61]
          Length = 786

 Score =  271 bits (693), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 228/799 (28%), Positives = 360/799 (45%), Gaps = 139/799 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
           Y D   P   R  DL+ +MTL EK  QM  L YG  R+     P  EW    W + +   
Sbjct: 42  YEDPAAPIEARVADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAEWSKEIWKDGIGNI 100

Query: 64  ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
               +G+   G   + P                       P    +  + G     AT F
Sbjct: 101 DEQANGLGKFGSELSYPYANSVKNRHEIQRWFVEQTRLGIPVDFTNEGIRGLCHNRATMF 160

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
           P      A++N+ L ++I +  + EA+A+      G T  ++P +++ +DPRWGRV+E+ 
Sbjct: 161 PAQCGQGATWNKKLIREIAKVTADEAKAL------GYTNIYAPILDIAQDPRWGRVVESY 214

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+ G      + GLQ  EG             ++A  KH+A Y +     +     
Sbjct: 215 GEDPYLAGELGKQMILGLQ-AEG-------------LAATPKHFAVYSIPVGGRDGGTRT 260

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
           D  V  ++M+  ++ PF   + E     VM SYN  +G P       L + +R  W F G
Sbjct: 261 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 320

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
           Y+VSD ++++ +   H+ +  T+E+  A+V+ AGL++      TNFT           A+
Sbjct: 321 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAI 374

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
            +GKI+   +D  +  +  V   LG FD   P      +  + N  H E++ +AA + IV
Sbjct: 375 SEGKISLHTLDQRVGEILRVKFMLGLFDNPYPGDDRHPETVVHNAAHQEVSMKAALESIV 434

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
           LLKN+N  LPL+  ++  +A++GP+A   K +   Y        +   G   Y  +  ++
Sbjct: 435 LLKNENQMLPLSK-SLNKIAVIGPNAEEVKELTCRYGPAHAPIKTVYQGIKEYLPNAEVS 493

Query: 439 YAPGC--------------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           YA GC                +  Q  +MI  A++ AK +D  ++V G +     E   R
Sbjct: 494 YAKGCNIIDKYFPESELYNVPLDTQEQAMINEAVELAKVSDIAILVLGGNEKTVREEFSR 553

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
             L L G Q +L+  V    K PV LV++   A  IN+A  N  + +I+   +PGE  G 
Sbjct: 554 TSLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIVHAWFPGEFMGN 610

Query: 545 AIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYG 604
           AIA V+FG YNPGGRL +T +  +  ++P+ + P +P ++  GR     DG V+YPFGYG
Sbjct: 611 AIAKVLFGDYNPGGRLAVT-FPKSVGQVPF-AFPFKPGSDSKGRVR--VDG-VLYPFGYG 665

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
           LSYT F+Y      K V                +G  +                  T   
Sbjct: 666 LSYTTFEYSALKISKPV----------------IGPQE----------------NMTLSC 693

Query: 665 EVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKI 723
            V+N GK  G EVV +Y +       T+ K + G+ER+ +  G+   + FT+   + L +
Sbjct: 694 IVKNTGKRAGDEVVQLYIRDDFSSVTTYDKMLRGFERIHLQPGEEQTISFTLTP-QDLGL 752

Query: 724 VDNAANSLLASGAHTILVG 742
            D      +  G+ +I++G
Sbjct: 753 WDKNNQFTVEPGSFSIMIG 771


>gi|322437617|ref|YP_004219707.1| glycoside hydrolase family protein [Granulicella tundricola
           MP5ACTX9]
 gi|321165510|gb|ADW71213.1| glycoside hydrolase family 3 domain protein [Granulicella
           tundricola MP5ACTX9]
          Length = 892

 Score =  271 bits (692), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 167/451 (37%), Positives = 237/451 (52%), Gaps = 51/451 (11%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY D  L   +R  DLV RMTL EKV Q  + A  + RL +P Y++WSE LHG++  G 
Sbjct: 32  LPYMDPALTTQQRVDDLVSRMTLEEKVSQTINSAPAISRLNVPEYDYWSEGLHGIARSGY 91

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG------- 124
                           AT FP  I   A+++  L ++IG  +S EARA +N         
Sbjct: 92  ----------------ATMFPQAIGMAATWDAPLLQQIGDVISIEARAKFNEAIRHNIHS 135

Query: 125 -NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
              GLT WSPNIN+ RDPRWGR  ET GEDP++ GR  + +V+G+Q           D  
Sbjct: 136 IYYGLTIWSPNINIFRDPRWGRGQETYGEDPFLTGRLGVAFVKGIQG---------PDPN 186

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
             +  A  KH+A +   +   + R   +   T  D+ +T++  F   + E    S+MC+Y
Sbjct: 187 YFRAIATPKHFAVH---SGPESTRHSANIEPTPHDLHDTYLPAFRATITEAHADSIMCAY 243

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQ----TIVESHKFLNDTKEDAVAR 299
           N V G P CA   LL  T+R DW F G++ SDC +I     T   SH    D KE A A 
Sbjct: 244 NAVEGSPACASKLLLQDTLRRDWGFKGFVTSDCGAIDDFYATDYPSHHTSPD-KEAAAAA 302

Query: 300 VLKAGLDLDCGDYYTNFTMG-AVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKN 356
            +KAG D +CG  Y   T+G AV++G + EA+IDT+L+ L+    +LG FD + +  +  
Sbjct: 303 GIKAGTDSNCGQTY--LTLGSAVKKGLVTEAEIDTALKHLFTARFQLGLFDPAAKVAFNA 360

Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
           +  + + +P H  LA +AA + IVLLKND   LP    +++T+A++GP A     + GNY
Sbjct: 361 IPFSEVNSPAHQALALKAAEESIVLLKNDAHTLPFKP-SVRTIAVIGPSAATLNNLEGNY 419

Query: 417 EGTPCRYTSPMDGF---YAYSKVINYAPGCA 444
              P     P+DG    +  SKV+ YA G +
Sbjct: 420 NAIPLHPVLPLDGILTQFKSSKVL-YAQGSS 449



 Score =  113 bits (282), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 79/261 (30%), Positives = 124/261 (47%), Gaps = 42/261 (16%)

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
           G DR D+ LP  Q +++  VA   K P+ +V+++  A+ +N+A  N    +IL   YPG+
Sbjct: 650 GGDRTDIKLPAAQQQMLEAVAATGK-PLVVVLLNGSALAVNWA--NDHAAAILEAWYPGQ 706

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
            GG AIA+ + GK NP GRLP+T+Y +         +P     +   RTY++     ++ 
Sbjct: 707 AGGTAIAETLAGKNNPAGRLPVTFYSS------IDQIPAFDDYSMANRTYRYSKAKPLFE 760

Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
           FGYGLSYT F Y         +IKL           T+    P                 
Sbjct: 761 FGYGLSYTTFTYS--------NIKLSTQ--------TLHAGDP----------------L 788

Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
           T + +V N G++ G EV  +Y  PP  A +  + +  + RV +A G+   V FT++  ++
Sbjct: 789 TVEADVRNTGRVAGDEVAELYLTPPHTAVSPQRALSAFTRVHLAPGELRHVTFTLDP-RT 847

Query: 721 LKIVDNAANSLLASGAHTILV 741
           L  VD      +  G +T+ V
Sbjct: 848 LSQVDEKGARAVTPGNYTLSV 868


>gi|262405837|ref|ZP_06082387.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294647798|ref|ZP_06725350.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294806192|ref|ZP_06765039.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
 gi|345510348|ref|ZP_08789916.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
 gi|262356712|gb|EEZ05802.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|292636706|gb|EFF55172.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294446448|gb|EFG15068.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
 gi|345454537|gb|EEO48843.2| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
          Length = 800

 Score =  271 bits (692), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 231/800 (28%), Positives = 361/800 (45%), Gaps = 141/800 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
           Y D   P   R  DL+ +MTL EK  QM  L YG  R+     P   W    W + +   
Sbjct: 56  YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNI 114

Query: 64  ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
               +G+   G   + P                       P    +  + G     AT F
Sbjct: 115 DEQANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDLTNEGIRGLCHDRATMF 174

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
           P      A++N+ L ++I +  + EA+A+      G T  +SP +++ +DPRWGRV+E+ 
Sbjct: 175 PAQCGQGATWNKKLIREIAKVTANEAKAL------GYTNIYSPILDIAQDPRWGRVVESY 228

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+ G      + GLQ  EG             I A  KH+A Y +     +     
Sbjct: 229 GEDPYLAGELGKQMILGLQS-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 274

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
           D  V  ++M+  ++ PF   + E     VM SYN  +G P       L + +R  W F G
Sbjct: 275 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 334

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
           Y+VSD ++++ +   H+ +  T+E+  A+V+ AGL++      TNFT           A+
Sbjct: 335 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRHAI 388

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
            +GK++   +D  +  +  V   +G FD   P      +  + N  H  ++ +AA + +V
Sbjct: 389 NEGKVSLHTLDQRVSEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESVV 448

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
           LLKN N  LPL+  N K +A++GP+A   K +   Y        +   G   Y  +  + 
Sbjct: 449 LLKNKNQMLPLSK-NFKKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVR 507

Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
           YA GC DI+                Q  +MI  A++ AK +D  ++V G +     E   
Sbjct: 508 YAKGC-DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFS 566

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R +L L G Q +L+  V    K PV LV++   A  IN+A  N  + +I+   +PGE  G
Sbjct: 567 RTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMG 623

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
            AIA V+FG YNPGGRL +T +  +  +IP+ + P +P ++  G+     DG V+YPFGY
Sbjct: 624 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKGKVR--VDG-VLYPFGY 678

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           GLSYT F Y         D+K+ K          +G  +                  T  
Sbjct: 679 GLSYTTFGYS--------DLKISKP--------VIGPQE----------------NITLS 706

Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
             V+N GK  G EVV +Y +    + T   +V+ G+ER+ +  G+   V FT+   + L 
Sbjct: 707 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLG 765

Query: 723 IVDNAANSLLASGAHTILVG 742
           + D      +  G+ +++VG
Sbjct: 766 LWDKNNQFTVEPGSFSVMVG 785


>gi|336412865|ref|ZP_08593218.1| hypothetical protein HMPREF1017_00326 [Bacteroides ovatus
           3_8_47FAA]
 gi|335942911|gb|EGN04753.1| hypothetical protein HMPREF1017_00326 [Bacteroides ovatus
           3_8_47FAA]
          Length = 800

 Score =  271 bits (692), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 234/800 (29%), Positives = 365/800 (45%), Gaps = 141/800 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
           Y D   P   R  DL+ +MTL EK  QM  L YG  R+     P   W    W + +   
Sbjct: 56  YEDLSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNI 114

Query: 64  ----HGVSFIGRR-----TNSPPGTH-----------------FDSE-VPG-----ATSF 91
               +G+   G        NS    H                 F +E + G     AT F
Sbjct: 115 DEQANGLGKFGSEISYSYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMF 174

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
           P      A++N+ L ++I +  + EA+A+      G T  +SP +++ +DPRWGRV+E+ 
Sbjct: 175 PAQCGQGATWNKKLIREIAKVTANEAKAL------GYTNIYSPILDIAQDPRWGRVVESY 228

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+VG      + GLQ+ EG             I A  KH+A Y +     +     
Sbjct: 229 GEDPYLVGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 274

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
           D  V  ++M+  ++ PF   + E     VM SYN  +G P       L + +R  W F G
Sbjct: 275 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 334

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
           Y+VSD ++++ +   H+ +  T+E+  A+V+ AGL++      TNFT           A+
Sbjct: 335 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAI 388

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
            +GK++   ++  +  +  V   +G FD   P      +  + N  H  ++ +AA + IV
Sbjct: 389 DEGKVSLHTLNQRVSEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESIV 448

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
           LLKN+N  LPL+  N K +A++GP+    K +   Y        +   G   Y  +  + 
Sbjct: 449 LLKNENQMLPLSK-NFKKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVR 507

Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
           YA GC DI+                Q  +MI  A++ AK +D  ++V G +     E   
Sbjct: 508 YAKGC-DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFS 566

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R +L L G Q +L+  V    K PV LV++   A  IN+A  N  + +I+   +PGE  G
Sbjct: 567 RTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMG 623

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
            AIA V+FG YNPGGRL +T +  +  +IP+ + P +P ++  G+     DG V+YPFGY
Sbjct: 624 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPDSDSKGKVR--VDG-VLYPFGY 678

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           GLSYT F Y         D+K+ K          +G  +                  T  
Sbjct: 679 GLSYTIFGYS--------DLKISKP--------VIGPQE----------------NITLS 706

Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
             V+N GK  G EVV +Y +    + T   +V+ G+ER+ +  G+   V FT+   + L 
Sbjct: 707 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVSFTLTP-QDLG 765

Query: 723 IVDNAANSLLASGAHTILVG 742
           + D      +  G+ +++VG
Sbjct: 766 LWDKNNQFTVEPGSFSVMVG 785


>gi|354580734|ref|ZP_08999639.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
 gi|353203165|gb|EHB68614.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
          Length = 766

 Score =  270 bits (691), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 215/695 (30%), Positives = 327/695 (47%), Gaps = 105/695 (15%)

Query: 87  GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
           GAT FP  +   +++N  L++ + + V+ E R+       G   +SP ++VVRDPRWGR 
Sbjct: 123 GATVFPVPLTIGSTWNPELFRSMCRAVAAETRS-----QGGAATYSPVLDVVRDPRWGRT 177

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
            ET GEDP++V  +A+  V+GLQ         D       + A  KH+A Y       N 
Sbjct: 178 EETFGEDPHLVAEFAVAAVQGLQG--------DRLDAEDSLLATLKHFAGYGASEGGRNG 229

Query: 207 R-FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
              H   R    ++ E  +LPF   V  G   SVM +YN ++G+P  +   LL+  +R  
Sbjct: 230 APVHMGLR----ELHEIDLLPFRKAVEAG-AQSVMTAYNEIDGVPCTSSRYLLHDVLREA 284

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQG 324
           W F G++++DC +I  +   H     + E+A A+ L AG+D++  G  +  +   A++QG
Sbjct: 285 WGFDGFVITDCGAIDMLKSGHN-TAASGEEAAAQALTAGVDMEMSGSMFRVYLRQALEQG 343

Query: 325 KIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKN 384
            I E D++T++  +  +  RLG FD         +  I   +HIELA   A +GIVLLKN
Sbjct: 344 HITEDDLNTAVGRVLAMKFRLGLFDRPYTDPERAEKVIGCEEHIELARRVAAEGIVLLKN 403

Query: 385 DNGALPLN--TGNIKTLALVGPHANATKAMIGNYEG--TPCRYTSPMDGFYAY------S 434
           +   LPLN  TG I   A++GP+ANA    +G+Y     P +  + ++G   +      +
Sbjct: 404 EGNVLPLNPKTGKI---AVIGPNANAPYNQLGDYTSPQPPGQIITVLEGIRRHIGEDADT 460

Query: 435 KVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG-----------LDLSVEA---- 479
           +V+ YAPGC  I   +   +  A+  A  AD  V+  G           +DL   A    
Sbjct: 461 RVL-YAPGC-RIQGDSREGLSHALACAAEADVIVMAIGGSSARDFGEGTIDLRTGASVVT 518

Query: 480 ----------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
                     EG DR  L L G Q EL+ ++    K PV +V ++   +   +   +  I
Sbjct: 519 GLAQSDMECGEGIDRSTLHLMGVQLELLQEIHKLGK-PVVVVYINGRPITEPWIDEH--I 575

Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGR 588
            +IL   YPG+EGG AIAD++FG  NP GRL +T   E   + I Y +   R      G+
Sbjct: 576 PAILEAWYPGQEGGSAIADILFGDVNPSGRLTLTIPKEVGQLPINYNAKRTR------GK 629

Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
            Y   D    YPFGYGLSYT F Y                      N +V     P    
Sbjct: 630 RYLETDLEPRYPFGYGLSYTDFHYG---------------------NLSVEPAVIPA--- 665

Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQ 707
                   D     +I V N G  DG+EVV +Y      + T  ++ +  + +VF+ AG+
Sbjct: 666 --------DGSAAVRIVVTNTGPRDGAEVVQLYVSDLAASVTRPEKALKAFSKVFLKAGE 717

Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           S +V FT+   + L+++     +++  G   I VG
Sbjct: 718 SREVTFTVGP-EQLELIGPDMKAVVEPGEFRIRVG 751


>gi|393781221|ref|ZP_10369422.1| hypothetical protein HMPREF1071_00290 [Bacteroides salyersiae
           CL02T12C01]
 gi|392677556|gb|EIY70973.1| hypothetical protein HMPREF1071_00290 [Bacteroides salyersiae
           CL02T12C01]
          Length = 946

 Score =  270 bits (691), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 231/807 (28%), Positives = 365/807 (45%), Gaps = 138/807 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
           Y D   P   R +DL+ +M L EK  QM  L YG  R+    LP  EW    W + +  +
Sbjct: 53  YEDPTAPIDARIEDLLSQMNLNEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKDGMGAI 111

Query: 67  S-----------------------------------FIGRRTNSPPGTHFDSEVPG---- 87
                                               F+       P    +  + G    
Sbjct: 112 DEHLNGFQQWGLPPSDNEYVWPASRHAWALNEVQRFFVEETRLGIPVDFTNEGIRGVESY 171

Query: 88  -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
            AT+FPT +    ++N  L  +IG     EAR +      G T  ++P ++V RD RWGR
Sbjct: 172 KATNFPTQLGLGHTWNRKLIHQIGLITGREARML------GYTNVYAPILDVGRDQRWGR 225

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
             E  GE PY+V    I  VRG+Q       H        +++A  KH+ AY  +     
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGMQ-------HNH------QVAATGKHFIAYSNNKGARE 272

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
                D +++ ++++   + PF+  + E  +  VM SYN  +G P  +    L   +RG 
Sbjct: 273 GMARVDPQMSPREVEMIHVYPFKRVIQEAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGQ 332

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
             F GY+VSD D+++ +   H    D KE AV + ++AGL++ C     D Y       V
Sbjct: 333 MGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 391

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIE-LAAEAARQGIV 380
           Q+G ++E  I+  +R +  V   +G FD   Q    G ++    +  E +A +A+R+ IV
Sbjct: 392 QEGGLSEEVINDRVRDILRVKFLVGLFDAPYQTDLKGADDEVEKEENEAVALQASRESIV 451

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY---SKVI 437
           LLKN+N  LPL+  ++K +A+ GP+A      + +Y       T+ +DG          +
Sbjct: 452 LLKNENNTLPLDITSVKKIAVCGPNAAEKAYALTHYGPLAVEVTTVVDGLREKLNGKAEV 511

Query: 438 NYAPGC---------ADIV-----CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
            Y  GC         ++I+         S I  A+  A+ AD  V+V G       E K 
Sbjct: 512 LYTKGCDLVDAHWPESEIIDYPLSKDEQSEIDKAVAQAQEADVAVVVLGGGQRTCGENKS 571

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R  L LPG Q +L+  V    K PV LV+++   + +N+A  +  + +IL   YPG +GG
Sbjct: 572 RSSLDLPGRQLDLLKAVQATGK-PVILVLINGRPLSVNWA--DKFVPAILEAWYPGSKGG 628

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV------ 597
            AIADV+FG YNPGG+L +T +  +  +IP+ + P +P +   G       G +      
Sbjct: 629 TAIADVLFGDYNPGGKLTVT-FPKSVGQIPF-NFPHKPSSQIDGGKNPGTKGDMSRVNGA 686

Query: 598 VYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
           +YPFGYGLSYT F+Y  +  SPK +                      P   V    V+CK
Sbjct: 687 LYPFGYGLSYTTFEYSDINISPKVI---------------------TPNQKV---QVRCK 722

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
                    V N GK  G EVV +Y +       T+ K + G+ER+ +  G++ +V FT+
Sbjct: 723 ---------VTNTGKHAGDEVVQLYVRDLISSVTTYEKNLEGFERIHLQPGETKEVSFTL 773

Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
           +  K+L++++   + ++  G  +I++G
Sbjct: 774 DR-KALELLNAKNDWVVEPGDFSIMLG 799


>gi|160884749|ref|ZP_02065752.1| hypothetical protein BACOVA_02738 [Bacteroides ovatus ATCC 8483]
 gi|156109784|gb|EDO11529.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
          Length = 800

 Score =  270 bits (690), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 233/800 (29%), Positives = 361/800 (45%), Gaps = 141/800 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
           Y D   P   R  DL+ +MTL EK  QM  L YG  R+     P   W    W + +   
Sbjct: 56  YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTTGWSTEIWKDGIGNI 114

Query: 64  ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
               +G+   G   + P                       P    +  + G     AT F
Sbjct: 115 DEQANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMF 174

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
           P      A++N+ L  +I +  + EA+A+      G T  +SP +++ +DPRWGRV+E+ 
Sbjct: 175 PAQCGQGATWNKKLIGEIAKVTADEAKAL------GYTNIYSPILDIAQDPRWGRVVESY 228

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+VG      + GLQ+ EG             I A  KH+A Y +     +     
Sbjct: 229 GEDPYLVGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 274

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
           D  V  ++M+  ++ PF   + E     VM SYN  +G P       L + +R  W F G
Sbjct: 275 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 334

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
           YIVSD ++++ +   H+ +  T+E+  A+V+ AGL++      TNFT           A+
Sbjct: 335 YIVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAI 388

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
            +GK++   +D  +  +  V   +G FD   P      +  + N  H  ++ +AA + IV
Sbjct: 389 NEGKVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESIV 448

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
           LLKN+N  LPL+  N   +A++GP+    K +   Y        +   G   Y  +  + 
Sbjct: 449 LLKNENQMLPLSK-NFSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVR 507

Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
           YA GC DI+                Q  +MI  A++ AK +D  ++V G +     E   
Sbjct: 508 YAKGC-DIIDKYFPESELYNVPLDTQEQAMIQEAVELAKASDIAILVLGGNEKTVREEFS 566

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R +L L G Q +L+  V    K PV LV++   A  IN+A  N  I +I+   +PGE  G
Sbjct: 567 RTNLDLCGRQQQLLEAVYATGK-PVILVMVDGRAATINWA--NKYIPAIIHAWFPGEFMG 623

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
            AIA V+FG YNPGGRL +T +  +  +IP+ + P +P ++  G+     DG V+YPFGY
Sbjct: 624 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKGKVR--VDG-VLYPFGY 678

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           GLSYT F Y         D+K+ K          +G  +                  T  
Sbjct: 679 GLSYTTFGYS--------DLKISKP--------VIGPQE----------------NITLS 706

Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
             V+N GK  G EVV +Y +    + T   +V+ G+ER+ +  G+   V FT+   + L 
Sbjct: 707 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLG 765

Query: 723 IVDNAANSLLASGAHTILVG 742
           + D      +  G+ +++VG
Sbjct: 766 LWDKNNQFTVEPGSFSVMVG 785


>gi|383115617|ref|ZP_09936373.1| hypothetical protein BSGG_2514 [Bacteroides sp. D2]
 gi|313694979|gb|EFS31814.1| hypothetical protein BSGG_2514 [Bacteroides sp. D2]
          Length = 946

 Score =  270 bits (690), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 234/806 (29%), Positives = 369/806 (45%), Gaps = 136/806 (16%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
           Y D   P   R +DL+++MTL EK  QM  L YG  R+    LP  EW ++    G+  I
Sbjct: 53  YEDPSAPVDARIEDLLKQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKNQLWKDGIGAI 111

Query: 70  GRRTNS------PPG-------------------------------THFDSE-VPG---- 87
               N       PP                                T F +E + G    
Sbjct: 112 DEHLNGFQQWGLPPSDNEYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 171

Query: 88  -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
            AT+FPT +    ++N  L +++G     EAR +      G T  ++P ++V RD RWGR
Sbjct: 172 KATNFPTQLGLGHTWNRELIRQVGVITGREARML------GYTNVYAPILDVGRDQRWGR 225

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
             E  GE PY+V    I  VRG+Q             +  +++A  KH+ AY  +     
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGMQ-------------QDYQVAATGKHFIAYSNNKGGRE 272

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
                D +++ ++++   + PF+  + E  +  VM SYN  +G P  +    L   +RG+
Sbjct: 273 GMSRVDPQMSPREVEMVHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGE 332

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
             F GY+VSD D+++ +   H    D KE AV + ++AGL++ C     D Y       V
Sbjct: 333 MGFRGYVVSDSDAVEYLYTKHNTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 391

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN-NICNPQHIELAAEAARQGIV 380
           ++G ++E  I+  +R +  V   +G FD   Q    G +  +   ++ E+A +A+R+ IV
Sbjct: 392 KEGGLSEEVINDRVRDILRVKFLVGLFDHPYQTDLKGADEEVEKAENEEVALQASRESIV 451

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYS--KV-I 437
           LLKND   LPL+   IK +A+ GP+A+     +G+Y       TS + G    +  KV +
Sbjct: 452 LLKNDQDVLPLDISGIKKIAVCGPNADECSYALGHYGPLAVEVTSVLKGIQEKTDGKVEV 511

Query: 438 NYAPGCA--------------DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
            Y+ GC                +  +    I  A+  AK AD  V+V G       E K 
Sbjct: 512 LYSKGCELVDANWPESELIDFPLTEEEQKEIDRAVSQAKEADVAVVVLGGGQRTCGENKS 571

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R  L LPG Q +L+  V    K PV LV+++   + IN+A  +  + +IL   YPG +GG
Sbjct: 572 RSSLDLPGRQLDLLKAVVATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGAKGG 628

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV------ 597
           +A+ADV+FG YNPGG+L +T +     +IP+ + P +P +   G      DG +      
Sbjct: 629 KAVADVLFGDYNPGGKLTVT-FPKTVGQIPF-NFPCKPSSQIDGGKNPGMDGNMSRANGA 686

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           +Y FG+GLSYT F+Y                    D+  T     P         V CK 
Sbjct: 687 LYAFGHGLSYTSFEYS-------------------DLKITPAVITPNQKTY----VTCK- 722

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
                   V N GK  G EVV +Y +       T+ K + G+ER+ +  G++ +V F ++
Sbjct: 723 --------VTNTGKRAGDEVVQLYVRDVLSSVTTYEKNLAGFERIHLKPGETKEVFFPID 774

Query: 717 ACKSLKIVDNAANSLLASGAHTILVG 742
             K+L++++   + ++  G  T++VG
Sbjct: 775 R-KALELLNADMHWVVEPGDFTLMVG 799


>gi|403744211|ref|ZP_10953568.1| glycoside hydrolase family 3 domain-containing protein
           [Alicyclobacillus hesperidum URH17-3-68]
 gi|403122228|gb|EJY56463.1| glycoside hydrolase family 3 domain-containing protein
           [Alicyclobacillus hesperidum URH17-3-68]
          Length = 789

 Score =  270 bits (690), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 232/784 (29%), Positives = 352/784 (44%), Gaps = 141/784 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDL-AYGV-PRLGLPLYEWWSEALHGVSFIGR 71
           Y +  LP  ER + L+  MT+ EK  Q+  + AY V   L     +  S   HG+  I R
Sbjct: 5   YQNPNLPIEERVELLLSEMTIEEKAAQLTSVWAYEVLDDLVFSDAKAASLFEHGIGQITR 64

Query: 72  ---------------------------RTNSPPGTHFDS----EVPGATSFPTVILTTAS 100
                                      R   P   H +S       GAT FP  I   ++
Sbjct: 65  IGGATNLDPADVARLSNRIQQHLLTQTRLAIPALVHEESCSGYMAKGATCFPQSIGIAST 124

Query: 101 FNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRY 160
           +++ + +KIG+ + T+ RA+           +P ++V RDPRWGRV ET GEDPY+V + 
Sbjct: 125 WDQDIARKIGEVIRTQMRAV-----GAQQALAPLLDVTRDPRWGRVEETFGEDPYLVAQM 179

Query: 161 AINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD----NWEGNDRFHFDSRVTE 216
            I YV GLQ           D     + A  KH+  Y       NW         + + E
Sbjct: 180 GIGYVGGLQ----------GDDLRDGVIATGKHFVGYGASEGGMNWA-------PAHIPE 222

Query: 217 QDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDC 276
           ++++E ++ PFE  V E  + S+M  Y+ ++G+P   +  LL +T+R  W F G +VSD 
Sbjct: 223 RELREVYLYPFEAVVREAKLQSIMPGYHELDGVPCHHNRDLLVETLRNRWGFEGIVVSDY 282

Query: 277 DSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQGKIAEADIDTS 334
            ++  + E H+   D  E AV  V +AG+D++    D Y    + AV QG++    +D  
Sbjct: 283 FAVNQLFEYHQVARDKVEAAVFAV-EAGVDVELPSRDVYGQPLVEAVNQGRLRIEQVDAL 341

Query: 335 LRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTG 394
           +R +     RLG F+     +    N   N +  +LA EAA + IVLLKN+   LPL   
Sbjct: 342 VRRVLTAKFRLGLFERPFVDEGRAPNLFDNHEQRQLAREAAAKSIVLLKNEGNLLPLE-- 399

Query: 395 NIKTLALVGPHANATKAMIGNYEGTPCR------------YTSPM-------DGFYAYSK 435
           N   +A++GP+A++ + M+G+Y   PC             + SPM       D F     
Sbjct: 400 NRGKIAVIGPNADSIRNMVGDY-AYPCHIESLLEQSEDNVFHSPMPKGMKSVDDFIEMKT 458

Query: 436 VIN-------------YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG-----LDLSV 477
           ++              YA GC D++  + S I  A   A+ AD  ++V G      D   
Sbjct: 459 IVQAIRDKVGDGAEVLYAKGC-DVLGDDTSGIAEAEHVARQADVAIVVVGDRAGLTDGCT 517

Query: 478 EAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGY 537
             E +DR  L L G Q EL+ +V  A   P  +V++    + I +   +  + +IL    
Sbjct: 518 TGESRDRATLTLLGAQQELVERVV-ATGTPTVVVLVGGRPLSITWIAEH--VPAILEAWL 574

Query: 538 PGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
           PGEEG  AIADV+FG  NP G+LPIT       V I Y   P    +++ G      + P
Sbjct: 575 PGEEGAPAIADVVFGDMNPSGKLPITIPRSVGQVPIYYGHKPSGGRSHWKGVYVDESNKP 634

Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
           + Y FG+GLSYT F Y+        ++ L K +    I+ TV             DV C 
Sbjct: 635 L-YAFGHGLSYTTFAYR--------ELALSKSEI--GIHDTV-------------DVSCV 670

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTM 715
                    +EN G   G EVV +Y        T  ++++ G+ RV + A ++A V F +
Sbjct: 671 ---------IENTGDRVGEEVVQLYVYDRAADVTRPVQELRGFARVHLEAKEAALVTFRL 721

Query: 716 NACK 719
           +A +
Sbjct: 722 SAHQ 725


>gi|28199699|ref|NP_780013.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
 gi|182682443|ref|YP_001830603.1| beta-glucosidase [Xylella fastidiosa M23]
 gi|417557804|ref|ZP_12208815.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
 gi|28057820|gb|AAO29662.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
 gi|182632553|gb|ACB93329.1| Beta-glucosidase [Xylella fastidiosa M23]
 gi|338179587|gb|EGO82522.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
          Length = 882

 Score =  270 bits (690), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 167/447 (37%), Positives = 236/447 (52%), Gaps = 51/447 (11%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           + A  LV +MT  EK+ Q  + A  +PRLG+P Y+WWSE LHG++  G            
Sbjct: 32  QHAAALVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNGY----------- 80

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N  L + +G   STEARA +NL           AGLT WSP
Sbjct: 81  -----ATVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSP 135

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDPY+  + A++++RGLQ         D+   P  I A  KH
Sbjct: 136 NINIFRDPRWGRGMETYGEDPYLTSQLAVSFIRGLQG--------DTPDHPRTI-ATPKH 186

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           +A +   +     R  FD  V+  D++ T+   F   + +G   SVMC+YN ++G P CA
Sbjct: 187 FAVH---SGPEQGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACA 243

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              LLN  +R DW F+G++VSDCD+I+ +   H F  D    A A  LK+G DL+CG+ Y
Sbjct: 244 SDWLLNTRLRNDWGFNGFVVSDCDAIEDMTRFHFFRQDNAS-ASAAALKSGNDLNCGNTY 302

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
            +    A+ +G I E+ +D +L  L+    RLG         Y  +G  +I  P H  LA
Sbjct: 303 RDLNQ-AIARGDIDESTLDQALIRLFTARQRLGTLQPREHDPYAAIGIKHIDTPAHRALA 361

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
            +AA Q +VLLKN    LPL      TLA++GP A++  A+  NY+GT     +P+ G  
Sbjct: 362 LQAAAQSLVLLKNSGNTLPLPPET--TLAVLGPDADSLTALEANYQGTSSTPVTPLTGLR 419

Query: 432 A--------YSKVINYAPGCADIVCQN 450
                    Y++  + APG  + + + 
Sbjct: 420 TRFGTAKVHYAQGASLAPGVPNTIPET 446



 Score =  142 bits (358), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 96/295 (32%), Positives = 136/295 (46%), Gaps = 52/295 (17%)

Query: 460 AAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVT 509
           A  +ADA V   GL   VE E          G DR  + LP  Q  L+  V    K P+ 
Sbjct: 607 AVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK-PLI 665

Query: 510 LVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANY 569
           +V+MS  AV +N+A+++    +IL   YPG+ GG AIA  + G  NPGGRLP+T+Y +  
Sbjct: 666 VVLMSGSAVALNWAQHH--ADAILAAWYPGQSGGTAIAQALAGDVNPGGRLPVTFYRSTQ 723

Query: 570 VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQ 629
              PY S       +  GRTY++F G  +YPFGYGLSYTQF Y+                
Sbjct: 724 DLPPYISY------DMTGRTYRYFKGQPLYPFGYGLSYTQFAYEAP-------------- 763

Query: 630 QCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG 689
           Q        G                     T    V N G   G EVV +Y +PP    
Sbjct: 764 QLSTATLKAGNT------------------LTVTTHVRNTGTRAGDEVVQLYLEPPYSPQ 805

Query: 690 THIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
             ++ ++G++RV +  G+S  + FT++A + L  V       + +G + + VG G
Sbjct: 806 APLRSLVGFKRVTLRPGESRLLTFTLDA-RQLSSVQQTGQRSVEAGHYHLFVGGG 859


>gi|364284956|gb|AEW47953.1| GHF3 protein [uncultured bacterium D1_14]
 gi|364284964|gb|AEW47958.1| GHF3 protein [uncultured bacterium E2_1]
          Length = 752

 Score =  270 bits (690), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 221/734 (30%), Positives = 351/734 (47%), Gaps = 98/734 (13%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYG-----VPRL------GLPLYEWWSE---ALHGVSF 68
           +R + L+ +MTL EK+ QM  +++      V RL      G  L E   E   AL  V+ 
Sbjct: 35  KRVESLLTKMTLEEKIGQMNQVSFSGNIEEVSRLIKNGEVGSILNEVDPERVNALQRVAI 94

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
              R   P     D      T FP  +   ASFN  + +K  +  + EA ++      G+
Sbjct: 95  EESRLGIPILIGRDVIHGFKTIFPIPLGQAASFNPQIVEKGARVSAVEASSV------GV 148

Query: 129 TF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
            + ++P I++ RDPRWGR+ E+ GEDPY+        V+G Q         DS + P  I
Sbjct: 149 RWTFTPMIDISRDPRWGRIAESCGEDPYLTSVMGAAMVKGFQG--------DSLNNPNSI 200

Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
           +AC KH+  Y     EG  R +  + +TE+ ++  ++ PFE  V +G V++ M S+N  +
Sbjct: 201 AACAKHFVGYGAA--EGG-RDYNTTCITERQLRNVYLPPFEAAVKQG-VATFMTSFNAND 256

Query: 248 GIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL 307
           GIP+  +P +L + +R +W F G++VSD  SI  +V +H F  D K DA  + + AG+D+
Sbjct: 257 GIPSSGNPFILKKVLRDEWGFDGFVVSDWASIIEMV-AHGFCTDDK-DAAMKAVNAGVDM 314

Query: 308 DCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQ 366
           +   Y Y N       + K++E  ID ++R +  V  RLG FD +P       + I + +
Sbjct: 315 EMVSYTYMNHLKDLKNENKVSEETIDNAVRNILRVKFRLGLFD-NPYVDEKAPSPIYSKE 373

Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGN--YEGTPCRYT 424
           ++ +A EAA Q  +LLKND   LP+N  ++KT+A+VGP A+A    +G   ++G      
Sbjct: 374 NLAIAKEAAIQSAILLKNDKQILPINE-SVKTIAVVGPMADAPYEQMGTWAFDGEKSMTQ 432

Query: 425 SPMDG---FYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
           +P+     FY       + PG A    +N S I  A+ AA  AD  +   G +  +  E 
Sbjct: 433 TPLMALRQFYGDKVNFIFEPGLAYTRDKNTSGISKAVSAANRADLVLAFVGEEAILSGEA 492

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
               +L L G Q++LIN +A   K  VT+VI       +   K     K++L+  +PG  
Sbjct: 493 HCLANLNLQGAQSDLINALAKTGKPIVTVVI---AGRPLTIGKEAELSKAVLYSFHPGTM 549

Query: 542 GGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPL-RPVNNFP------------- 586
           GG AIAD++FGK  P G+ P+T+  E   + I Y+     RP N                
Sbjct: 550 GGPAIADLLFGKAVPSGKTPVTFPKEVGQIPIYYSHYNTGRPANRNEILLDNIAVGAGQT 609

Query: 587 --GRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
             G T  + D     +YPFG+GLSYT F+Y         ++KL  ++             
Sbjct: 610 SLGNTSFYLDAGFDPLYPFGFGLSYTTFEYS--------NLKLSSNE------------- 648

Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERV 701
                     +  KD + T   +++N G  +G+EV  +Y +   G     +K++  + R+
Sbjct: 649 ----------LSAKD-ELTVTFDLKNTGNYEGAEVAQLYVRDMVGSVVRPVKELKRFNRI 697

Query: 702 FIAAGQSAKVGFTM 715
            +  G++  V  T 
Sbjct: 698 TLKPGETRNVSMTF 711


>gi|423293673|ref|ZP_17271800.1| hypothetical protein HMPREF1070_00465 [Bacteroides ovatus
           CL03T12C18]
 gi|392677631|gb|EIY71047.1| hypothetical protein HMPREF1070_00465 [Bacteroides ovatus
           CL03T12C18]
          Length = 800

 Score =  270 bits (690), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 233/800 (29%), Positives = 361/800 (45%), Gaps = 141/800 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
           Y D   P   R  DL+ +MTL EK  QM  L YG  R+     P   W    W + +   
Sbjct: 56  YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTTGWSTEIWKDGIGNI 114

Query: 64  ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
               +G+   G   + P                       P    +  + G     AT F
Sbjct: 115 DEQANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMF 174

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
           P      A++N+ L  +I +  + EA+A+      G T  +SP +++ +DPRWGRV+E+ 
Sbjct: 175 PAQCGQGATWNKKLIGEIAKVTADEAKAL------GYTNIYSPILDIAQDPRWGRVVESY 228

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+VG      + GLQ+ EG             I A  KH+A Y +     +     
Sbjct: 229 GEDPYLVGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 274

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
           D  V  ++M+  ++ PF   + E     VM SYN  +G P       L + +R  W F G
Sbjct: 275 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 334

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
           YIVSD ++++ +   H+ +  T+E+  A+V+ AGL++      TNFT           A+
Sbjct: 335 YIVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNI-----RTNFTPPQDFILPLRRAI 388

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
            +GK++   +D  +  +  V   +G FD   P      +  + N  H  ++ +AA + IV
Sbjct: 389 NEGKVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESIV 448

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
           LLKN+N  LPL+  N   +A++GP+    K +   Y        +   G   Y  +  + 
Sbjct: 449 LLKNENQMLPLSK-NFSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVR 507

Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
           YA GC DI+                Q  +MI  A++ AK +D  ++V G +     E   
Sbjct: 508 YAKGC-DIIDKYFPESELNNVPLDTQEQAMIQEAVELAKASDIAILVLGGNEKTVREEFS 566

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R +L L G Q +L+  V    K PV LV++   A  IN+A  N  I +I+   +PGE  G
Sbjct: 567 RTNLDLCGRQQQLLEAVYATGK-PVILVMVDGRAATINWA--NKYIPAIIHAWFPGEFMG 623

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
            AIA V+FG YNPGGRL +T +  +  +IP+ + P +P ++  G+     DG V+YPFGY
Sbjct: 624 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKGKVR--VDG-VLYPFGY 678

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           GLSYT F Y         D+K+ K          +G  +                  T  
Sbjct: 679 GLSYTTFGYS--------DLKISKP--------VIGPQE----------------NITLS 706

Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
             V+N GK  G EVV +Y +    + T   +V+ G+ER+ +  G+   V FT+   + L 
Sbjct: 707 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLG 765

Query: 723 IVDNAANSLLASGAHTILVG 742
           + D      +  G+ +++VG
Sbjct: 766 LWDKNNRFTVEPGSFSVMVG 785


>gi|288925400|ref|ZP_06419334.1| beta-glucosidase [Prevotella buccae D17]
 gi|288337871|gb|EFC76223.1| beta-glucosidase [Prevotella buccae D17]
          Length = 858

 Score =  270 bits (690), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 168/478 (35%), Positives = 246/478 (51%), Gaps = 42/478 (8%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           S+       PYC+  L   ERA+DL+ R+TL EK + M D +  +PRLG+  + WWSEAL
Sbjct: 14  SLSATAQLLPYCNPDLSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWSEAL 73

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN- 122
           HG + +G                G T FP  +   ASFN+ L +++    S E RA YN 
Sbjct: 74  HGAANMG----------------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNR 117

Query: 123 -LGNAG-------LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
            + N G       L+ W+PN+N+ RDPRWGR  ET GEDPY+        VRGLQ  E  
Sbjct: 118 RMLNGGEDEKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPETA 177

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
           +Y         K+ AC KHYA +    +  +     D  V+ +D+ ET++  F+  V E 
Sbjct: 178 KYR--------KLWACAKHYAVHSGPEYTRHTANVAD--VSPRDLWETYLPAFKTLVTEA 227

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
            V  VMC+Y R++  P C++ +LL Q +R +W F+  +VSDC ++  I  +HK  +D   
Sbjct: 228 KVREVMCAYQRLDDDPCCSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVH 287

Query: 295 DAVARVLKAGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP- 352
            A      AG D++CG  Y   T+  AV++G I EA++D  +  L      LG  D    
Sbjct: 288 AAAK-AAVAGTDVECGFGYAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMDDPKL 346

Query: 353 -QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKA 411
            ++  +  + + +  H +LA + ARQ +VLL+N  G LPL  G  + +A++GP+A+    
Sbjct: 347 VEWSKIPASVMDSKAHRQLALDMARQSLVLLQNKGGVLPLKAGG-EPIAVIGPNADDGPM 405

Query: 412 MIGNYEGTPCRYTSPMDGFYAYSKVINYAPGC--ADIVCQNNSMIPAAIDAAKNADAT 467
           M GNY GTP R  + ++G     K + Y  GC   D    N+ +   AID  K    T
Sbjct: 406 MWGNYNGTPNRTVTILNGIKVRHKRVTYLKGCDLTDTKTVNSLLPQCAIDGRKGLRGT 463



 Score = 99.8 bits (247), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 74/286 (25%), Positives = 121/286 (42%), Gaps = 57/286 (19%)

Query: 456 AAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAK 505
           A I   +     V V G+  ++E E          G DR ++ LP  Q + +  + +A K
Sbjct: 591 AIIRKLQGIRKVVFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK 650

Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
              T+V ++     I          +IL   Y G+EGG A++DV+FG  NP G+LP+T+Y
Sbjct: 651 ---TVVFVNCSGSAIALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNPSGKLPVTFY 707

Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
           +       Y    +R      GRTY++F  P ++ FGYGLSYT F++  A +        
Sbjct: 708 KRTDQLPDYEDYSMR------GRTYRYFSDP-LFAFGYGLSYTTFRFGRARA-------- 752

Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
                                       +  +  +   + + N G   G EVV VY +  
Sbjct: 753 ----------------------------EAAEGGYRLSVPLTNTGTRPGEEVVQVYIRRV 784

Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL 731
                 +K +  + RV + AG+S  V   ++  KS +  D + N++
Sbjct: 785 ADTNGPLKSLRAFRRVALKAGESTTVEIPLSR-KSFECFDESTNTM 829


>gi|335433420|ref|ZP_08558246.1| glycoside hydrolase family 3 domain protein [Halorhabdus tiamatea
           SARL4B]
 gi|335434171|ref|ZP_08558974.1| glycoside hydrolase family 3 domain protein [Halorhabdus tiamatea
           SARL4B]
 gi|334898028|gb|EGM36149.1| glycoside hydrolase family 3 domain protein [Halorhabdus tiamatea
           SARL4B]
 gi|334898759|gb|EGM36857.1| glycoside hydrolase family 3 domain protein [Halorhabdus tiamatea
           SARL4B]
          Length = 783

 Score =  270 bits (690), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 206/692 (29%), Positives = 331/692 (47%), Gaps = 101/692 (14%)

Query: 86  PGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGR 145
           PG T FP  I   ++++ +L + I  ++ T   A+       +   SP ++V RD RWGR
Sbjct: 121 PGGTIFPQSIGLASTWSPALVESITDSIRTRLDAV-----GTVQALSPVLDVSRDMRWGR 175

Query: 146 VLETPGEDPYVVGRYAINYVRGLQ-DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEG 204
           V ET GEDP +VG     YV GLQ D EG             I A  KH+AA+   + EG
Sbjct: 176 VEETYGEDPQLVGALGAAYVAGLQSDGEG-------------IDATLKHFAAHG--SGEG 220

Query: 205 NDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRG 264
             +     ++ E++++E  + PFE+ + E D  +VM +Y+ ++G+P  +   LL   +RG
Sbjct: 221 G-KNRSSVQIGERELREVHLYPFEVAIQEADARAVMNAYHDIDGVPCASSEWLLTDVLRG 279

Query: 265 DWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQ 322
           +W F G++V+D  S+  + E H  + DT+ +A    L+AGLD++    D Y      AV+
Sbjct: 280 EWGFDGHVVADYFSVDLLKEEHG-IADTQREAGVAALEAGLDVELPATDCYDENLRKAVE 338

Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLL 382
            G+++EA +DT++R +    +  G FD      +         +  ELAA AAR+ I LL
Sbjct: 339 DGELSEATVDTAVRRVLRAKIESGVFDDPYVDPDAATEPFDTDEQTELAARAARESITLL 398

Query: 383 KNDNGALPLNTGNIKTLALVGPHANATKAMIGNY---------EGTPCRYTSPMDGFYAY 433
           +ND G LPL  G + ++ALVGP A+  +A +G+Y         E       +P D   A 
Sbjct: 399 END-GLLPLAGGELDSVALVGPQADDGRAQVGDYTHAARFDTEEAGDFESVTPRDALEAR 457

Query: 434 SKV----INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL---------------- 473
            +     + Y  G A +   +     AA +   +AD  V   G                 
Sbjct: 458 GETAGFDVEYVEG-ATMTGPSTDGFDAAEETVADADLAVACVGARSDIDFADRENPAELP 516

Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI-NFAKNNPKIKSI 532
           D+    E  D  DL LPG Q  L++++A+    P+ +V +S     I   A++ P   ++
Sbjct: 517 DVPTSGENCDVTDLELPGVQEALVDRLAE-TDTPLIVVQVSGKPHAIPEIAESVP---AL 572

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA-NYVKIPYTSMPLRPVNNFPGRTYK 591
           L    PG+EGG AIADV+FG+YNP G LP++  ++     + Y+  P     N     + 
Sbjct: 573 LHAWLPGQEGGTAIADVLFGEYNPSGHLPVSVPKSVGQQPVYYSRKP-----NSANEEHV 627

Query: 592 FFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID 651
           + DG  +Y FGYGLSYT F+Y         D+++D +         +GT           
Sbjct: 628 YMDGEPLYSFGYGLSYTDFEYG--------DLEVDAETVA-----PMGT----------- 663

Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAK 710
                    T  + V N G + G +VV +Y      +    +++++G+ERV +  G++ +
Sbjct: 664 --------LTASVTVTNAGDVAGDDVVQLYQHAENPSQARPVQELLGFERVHLEPGETKR 715

Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           V F+ +A + L   D   N  +  G + + VG
Sbjct: 716 VTFSFDATQ-LAYHDLDMNLAVEEGPYELRVG 746


>gi|298386950|ref|ZP_06996504.1| beta-glucosidase [Bacteroides sp. 1_1_14]
 gi|298260100|gb|EFI02970.1| beta-glucosidase [Bacteroides sp. 1_1_14]
          Length = 846

 Score =  270 bits (689), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 164/438 (37%), Positives = 236/438 (53%), Gaps = 46/438 (10%)

Query: 7   VKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGV 66
           V ++   Y +   P  ER +DL+ ++T+ EK+  +   + G+ R+G+  Y   +EALHG+
Sbjct: 16  VSMAQDLYKNMNAPIHERIQDLLSKLTIEEKISLLRATSPGIERMGIDKYYMGNEALHGI 75

Query: 67  SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
              G+                 T FP  I   + +N  L   I   +S EARA +N    
Sbjct: 76  IRPGK----------------FTVFPQAIGLASMWNPELHHIIASVISDEARARWNELER 119

Query: 127 G----------LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEY 176
           G          LTFWSP +N+ RDPRWGR  ET GEDPY+ G     +V+GLQ       
Sbjct: 120 GKKQKDQFSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQG------ 173

Query: 177 HRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDV 236
                 R LK  +  KH+AA    N E ++RF+ D+ +TE DM+E ++  FE C+ EG  
Sbjct: 174 ---DHPRYLKSVSTPKHFAA----NNEEHNRFYCDAAITETDMREYYLPAFEKCIREGKA 226

Query: 237 SSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDA 296
            S+M +YN +NG+P  A+  LLN+ ++ DW F+GYIVSDC +   ++  H+++  T E A
Sbjct: 227 ESIMTAYNAINGVPCTANNWLLNKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVK-TPEAA 285

Query: 297 VARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-- 353
               +KAGLDL+CGDY +    + A +Q  ++ A+ID++   +    MRLG FD   +  
Sbjct: 286 AMIAIKAGLDLECGDYVFGAPLLNAYKQYMVSTAEIDSAAYHVLRARMRLGMFDDPEKNP 345

Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
           Y +L    +   +H ELA EAARQ IVLLKN    LPLN   IK++A+VG   NA     
Sbjct: 346 YNHLSPEIVGCEKHKELALEAARQSIVLLKNQKNTLPLNAKKIKSIAVVG--INAANCEF 403

Query: 414 GNYEGTPCRY-TSPMDGF 430
           G+Y GTP     S +DG 
Sbjct: 404 GDYSGTPVNAPVSVLDGI 421



 Score =  126 bits (317), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 88/286 (30%), Positives = 144/286 (50%), Gaps = 51/286 (17%)

Query: 462 KNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-VDI 520
           + +D  + V G++ S+E EG+DR  + LP  Q   I +   A   P T+V++ AG+ + +
Sbjct: 595 RESDVVIAVMGINQSIEREGQDRSSIELPKDQQIFIREAYKA--NPNTIVVLVAGSSMAV 652

Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP-L 579
            +   N  I +I+   YPGE+GG AIA+V+FG YNP GRLP+T+Y +         +P  
Sbjct: 653 GWMDQN--IPAIIDAWYPGEQGGTAIAEVLFGDYNPAGRLPLTFYNS------IEDLPAF 704

Query: 580 RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
              N    RTY +F+G  +Y FGYGLSYT+F Y+        ++ + +D Q   +N++  
Sbjct: 705 NDYNVKNNRTYMYFEGKPLYAFGYGLSYTKFDYR--------NLNIKQDSQNITLNFS-- 754

Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGY 698
                                     V+N GK +G EV  VY + P +   T +KQ+ G+
Sbjct: 755 --------------------------VKNSGKYNGDEVAQVYVQFPDLGIKTPLKQLKGF 788

Query: 699 ERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVGE 743
           +RV I  G + ++   +   + L++ D+        SG +  +VG+
Sbjct: 789 KRVHIKKGATEQISIEI-PKEELRLWDDQKKQFYTPSGTYNFMVGK 833


>gi|427411073|ref|ZP_18901275.1| hypothetical protein HMPREF9718_03749 [Sphingobium yanoikuyae ATCC
           51230]
 gi|425710258|gb|EKU73280.1| hypothetical protein HMPREF9718_03749 [Sphingobium yanoikuyae ATCC
           51230]
          Length = 791

 Score =  270 bits (689), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 215/720 (29%), Positives = 345/720 (47%), Gaps = 107/720 (14%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+P+  +  E LHG + +G                 ATSFP  I   +S++ ++ +++
Sbjct: 138 RLGIPIL-FHEEGLHGYAAVG-----------------ATSFPQSIAMASSWDPAMLRQV 179

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
            Q ++ E RA            SP +++ RDPRWGR+ ET GEDPY+VG   +  V GLQ
Sbjct: 180 NQVIAREIRA-----RGVPMVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQ 234

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
            V      R    +P  + A  KH   +       N      + V+E++++E F  PFE 
Sbjct: 235 GV-----GRSRTLQPNHVFATLKHLTGHGQPESGTN---IGPAPVSERELRENFFPPFEQ 286

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            V    + +VM SYN ++G+P+ A+  LL+  +R +W F G +VSD  ++  ++  H   
Sbjct: 287 VVKRTGIEAVMASYNEIDGVPSHANRWLLDNVLRQEWGFRGAVVSDYSAVDQLMSIHHIA 346

Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGA-VQQGKIAEADIDTSLRFLYIVLMRLGYF 348
            +  E+A  R L AG+D D  +  +  T+G  V++GK++EA +D ++R +  +  R G F
Sbjct: 347 ANL-EEAAMRALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVRRMLELKFRAGLF 405

Query: 349 DGSPQYKNLGKNNICNPQHIE-LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
           + +P         I N +    LA  AA++ I LLKND G LPL      T+A++GP  +
Sbjct: 406 E-NPYADANAAAAITNNEDARALARTAAQRSITLLKND-GMLPLKPEG--TIAVIGP--S 459

Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAYSKV---INYAPGC---------ADIV-----CQN 450
           A  A +G Y G P    S ++G  A       I +A G          AD V      +N
Sbjct: 460 AAVARLGGYYGQPPHSVSILEGIKARVGTKANIVFAQGVKITEDDDWWADSVTKSDPAEN 519

Query: 451 NSMIPAAIDAAKNADATVIVAGLDLSVEAEG------KDRVDLLLPGFQTELINKVADAA 504
             +I  A++AA+N D  ++  G       EG       DR  L L G Q EL + +    
Sbjct: 520 RKLIAQAVEAARNVDRIILTLGDTEQSSREGWADNHLGDRPSLDLVGEQQELFDALKALG 579

Query: 505 KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW 564
           K P+T+V+++      +  K + +  +IL   Y GE+GG A+AD++FG  NPGG+LP+T 
Sbjct: 580 K-PITVVLINGRPA--STVKVSEQANAILEGWYLGEQGGNAVADILFGDVNPGGKLPVTV 636

Query: 565 -YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
              A  + + Y   P         R Y F     +YPFG+GLSYT F     S+P+    
Sbjct: 637 PRSAGQLPLFYNMKP------SARRGYLFDTTDPLYPFGFGLSYTSFSL---SAPRLSAT 687

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
           +             +GT                  K +  ++V N G  +G EVV +Y +
Sbjct: 688 R-------------IGTGG----------------KTSVSVDVRNTGAREGDEVVQLYIR 718

Query: 684 PPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
               + T  +K++ G++RV +  G+S  + FT+   ++L++ ++    ++  G   I+ G
Sbjct: 719 DKVSSVTRPVKELKGFQRVTLKPGESRTITFTVGP-EALQMWNDQMRRVVEPGDFEIMTG 777


>gi|254295141|ref|YP_003061164.1| glycoside hydrolase [Hirschia baltica ATCC 49814]
 gi|254043672|gb|ACT60467.1| glycoside hydrolase family 3 domain protein [Hirschia baltica ATCC
           49814]
          Length = 897

 Score =  270 bits (689), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 178/499 (35%), Positives = 254/499 (50%), Gaps = 65/499 (13%)

Query: 6   KVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHG 65
           + K S+F + D  L   ERA DLV  MTL EK  QM D A  +PRLGL  Y WW+EALHG
Sbjct: 36  EAKSSEFRFMDPSLSPKERALDLVSHMTLEEKAAQMYDKAAAIPRLGLHEYNWWNEALHG 95

Query: 66  VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
           V+  G                 AT FP  I   A+++E L  ++   +S E RA ++   
Sbjct: 96  VARAGH----------------ATVFPQAIGMAATWDEDLMLEVANVISDEGRAKHHFYA 139

Query: 126 --------AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYH 177
                    GLTFWSPNIN+ RDPRWGR  ET GEDPY+ GR A+N++ GLQ        
Sbjct: 140 NEDVYAMYGGLTFWSPNINIFRDPRWGRGQETYGEDPYLTGRMAVNFINGLQ-------- 191

Query: 178 RDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRV-TEQDMQETFILPFEMCVNEGDV 236
              D +  K  A  KHYA +           H D+ + T+ D+ ET++  F+   +E +V
Sbjct: 192 -GDDDKYFKSVATVKHYAVHS----GPEPSRHRDNYIATDADLYETYLPAFKTAFDETEV 246

Query: 237 SSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI-------------V 283
           +SVMC+YN V G P C   +L+   +R +  F GY+VSDC +I                 
Sbjct: 247 ASVMCAYNAVWGDPACGSERLMKDLLREELGFDGYVVSDCGAIGDFYYDEEKKAEGTAPY 306

Query: 284 ESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMG---AVQQGKIAEADIDTSLRFLYI 340
            +H  + DT+  A A  +  G DL+CGD   N       AV++G I E  ID S+  LY 
Sbjct: 307 AAHDHV-DTRAQAAALSVNMGTDLNCGDGEGNKMDALPQAVKEGLITEETIDQSVVRLYS 365

Query: 341 VLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKT 398
            L +LG +D      + N+  + + +P H+E + EAAR  +VLLKND G LPL       
Sbjct: 366 ALFKLGMYDDPSLVPWSNISIDTVASPSHLEKSEEAARASLVLLKND-GILPLKPDT--K 422

Query: 399 LALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVINYAPGC--ADIVCQNNSMI 454
           +A++GP+A+    ++ NY G P    + + G  A   ++ ++Y+ G   A  +  N   +
Sbjct: 423 VAVIGPNADNWWTLVANYYGQPTAPVTALKGIKAKIGAENVSYSVGSTIAGDIYSNYKAV 482

Query: 455 PAAIDAAKNADATVIVAGL 473
           P+     KN +A  +V G+
Sbjct: 483 PSNTLFHKN-EAGELVPGV 500



 Score =  101 bits (252), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 79/296 (26%), Positives = 131/296 (44%), Gaps = 54/296 (18%)

Query: 468 VIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA 517
           +   G+D ++E E          G DR  + LP  Q +L+ ++    K PV LV  S  A
Sbjct: 634 LFFGGIDANLEGEEMGVELDGFLGGDRTHINLPAPQEKLLKELHATGK-PVVLVNFSGSA 692

Query: 518 VDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSM 577
           + +N+   N  + +I+   YPGE+ G AIAD+++G+++P GRLP+T+Y++         M
Sbjct: 693 MALNWEDEN--LPAIVQAFYPGEKSGTAIADLLWGEFSPSGRLPVTFYKS------LEGM 744

Query: 578 PLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
           P     +   RTYK+++G  +YPFG+GLSYT F+Y         D+KL       +  Y 
Sbjct: 745 PAFDDYSMENRTYKYYEGEQLYPFGHGLSYTSFEYS--------DLKL-------ETAYA 789

Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQV-- 695
              N                      ++V N G     E+V  Y     +A     +V  
Sbjct: 790 ANEN------------------LQVSVKVTNSGDKASREIVQAYVTRDTLANVSTPRVEL 831

Query: 696 IGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFP 751
             ++ + +A  +S  V  ++         +N   +    G+ T+ +G G  G   P
Sbjct: 832 AAFDAIELAPKESQTVTLSIKPDAIGYFNENGKLTFPEDGSFTLSIGGGQPGFDAP 887


>gi|217968103|ref|YP_002353609.1| glycoside hydrolase family 3 [Dictyoglomus turgidum DSM 6724]
 gi|217337202|gb|ACK42995.1| glycoside hydrolase family 3 domain protein [Dictyoglomus turgidum
           DSM 6724]
          Length = 756

 Score =  270 bits (689), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 208/674 (30%), Positives = 333/674 (49%), Gaps = 97/674 (14%)

Query: 87  GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
           G+T FP  I   +++N  L  ++   +  E R+            SP IN+ RDPR GR 
Sbjct: 147 GSTIFPQAIGMASTWNPELIYQVATAIGKETRS-----RGIHQVLSPTINIARDPRCGRT 201

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
            ET GEDPY+  R A+ Y++G+Q+ +GV              A  KH+AA  + +  G D
Sbjct: 202 EETYGEDPYLASRMAVAYIKGVQE-QGV-------------IATPKHFAANFVGDG-GRD 246

Query: 207 RF--HFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRG 264
            +  HF  R+    ++E +   F+  + E    S+M +YN ++GIP  ++  LL   +R 
Sbjct: 247 SYPIHFSERL----LREVYFPAFKASIKEAGALSLMAAYNSLDGIPCSSNKWLLTDVLRK 302

Query: 265 DWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL-----DCGDYYTNFTMG 319
           +W F GY+VSD  S+  ++  HK + ++K +A    L+AGLD+     DC +   N   G
Sbjct: 303 EWGFKGYVVSDYFSVLHLMTKHK-VAESKAEAARLALEAGLDMELPDSDCFEEMINLVKG 361

Query: 320 AVQQGKIAEADIDTSLRFLYIVLMRLGYFDG---SPQYKNLGKNNICNPQHIELAAEAAR 376
               GK++E  I+ ++R +  V    G FD     P Y    + N C  +H ELA   AR
Sbjct: 362 ----GKLSEETINEAVRRILGVKFWAGLFDNPFVDPDYAE--RVNDC-AEHRELALRVAR 414

Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF---YAY 433
           + IVLLKN+ G LPL+  +I ++A++GP  NA    +G Y G   +  +P++G       
Sbjct: 415 ESIVLLKNE-GILPLSK-DIGSIAVIGP--NAAVPRLGGYSGYGVKIVTPLEGIKNKMEN 470

Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL-SVEAEGKDRVDLLLPGF 492
              I +A GC  +   + S    AI  A+ +D  ++  G  +   E E +DR +L LPG 
Sbjct: 471 KAKIYFAEGCG-LNDTSKSGFDEAIKIAQKSDVAILFVGNSVPETEGEQRDRHNLNLPGV 529

Query: 493 QTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFG 552
           Q ELI ++ +    PV +V+++  A  I       K+++++   YPGEEGG AIADV+FG
Sbjct: 530 QEELIKEICNT-NTPVIVVLINGSA--ITMMNWIDKVQAVIEAWYPGEEGGNAIADVLFG 586

Query: 553 KYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD---GPVVYPFGYGLSYTQ 609
            YNPGG+LPIT+ + +      + +PL   +   GR   + D      ++PFGYGLSYT+
Sbjct: 587 DYNPGGKLPITFPKYS------SQLPLYYNHKPSGRVDDYVDLRSPQYLFPFGYGLSYTE 640

Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
           F+Y                      N  +   + P            D + T   EVEN+
Sbjct: 641 FRYS---------------------NLRITPEEIPM-----------DGEITITFEVENI 668

Query: 670 GKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAA 728
           GK  G EVV +Y      +    +K++  ++R+ +A G+   V F ++  + L+ ++   
Sbjct: 669 GKYKGDEVVQLYLHDEFASVVRPVKELKRFKRITLAVGEKKTVSFKLDR-RDLEFLNIDM 727

Query: 729 NSLLASGAHTILVG 742
             ++  G   + +G
Sbjct: 728 EPIVEPGRFEVFIG 741


>gi|344995394|ref|YP_004797737.1| glycoside hydrolase family protein [Caldicellulosiruptor
           lactoaceticus 6A]
 gi|343963613|gb|AEM72760.1| glycoside hydrolase family 3 domain protein [Caldicellulosiruptor
           lactoaceticus 6A]
          Length = 770

 Score =  270 bits (689), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 209/709 (29%), Positives = 338/709 (47%), Gaps = 111/709 (15%)

Query: 87  GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
           GAT FP  I    +F+  + +++ + +  + +A            +P I+V RD RWGRV
Sbjct: 102 GATVFPQSIGVACTFDNEIVEELAKVIRIQMKA-----TGSHQALAPLIDVARDARWGRV 156

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD----NW 202
            ET GEDPY+V   A++YV+G+Q           D     I A  KH+  Y +     NW
Sbjct: 157 EETFGEDPYLVANMAVSYVKGIQ----------GDDIKDGIVATGKHFVGYAMSEGGMNW 206

Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
                      + E++++E ++ PFE+ V    + S+M +Y+ ++GIP  A+ KLL    
Sbjct: 207 A-------PVHIPERELREVYLYPFEVAVKVAGLKSIMPAYHEIDGIPCHANRKLLTDIA 259

Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG--DYYTNFTMGA 320
           RG+W F G  VSD   ++ I++ HK +  T  +A    L AGLD++    + +T   + A
Sbjct: 260 RGEWGFDGIYVSDYSGVRNILDYHKAVK-TYAEAAYISLWAGLDIELPKIECFTEEFIKA 318

Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC-NPQHIELAAEAARQGI 379
           +++GK   A +D +++ +  +  RLG FD +P  K  G   +  N +  EL+ + A++ +
Sbjct: 319 LKEGKFDMAVVDAAVKRVLEMKFRLGLFD-NPYIKTEGILELFDNKEQRELSRKVAQESM 377

Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYS----- 434
           VLLKNDN  LPL+  ++K +A++GP+A++ + ++G+Y   P  + + ++ F+        
Sbjct: 378 VLLKNDN-FLPLSK-DVKKIAVIGPNADSVRNLLGDY-SYPA-HIATLEMFFIKEDRGVG 433

Query: 435 -------KVIN-------------------YAPGCADIVCQNNSMIPAAIDAAKNADATV 468
                  KVIN                   YA GC D+  Q+ S    A  AA+ AD  +
Sbjct: 434 NEEEFVRKVINMKSIFEAVKDRVQNKAEVVYAKGC-DVNTQDESGFEEAKKAAQGADVVI 492

Query: 469 IV----AGLDLS-VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFA 523
           +V    AGL L     E +DR  L LPG Q +LI +V+   +    +V++      +   
Sbjct: 493 LVVGDKAGLRLDCTSGESRDRASLKLPGVQEKLIEEVSKVNE---NIVVVLVNGRPVALE 549

Query: 524 KNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPV 582
               K K+IL   +PGEEG  A+ADV+FG YNPGG+L I++  +   V + Y   P    
Sbjct: 550 GIWQKAKAILEAWFPGEEGAEAVADVLFGDYNPGGKLAISFPRDVGQVPVYYGHKPSGGK 609

Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
           + + G   +    P + PFGYGLSYT F+YK                     N+ +   K
Sbjct: 610 SCWHGDYVEMSTKPFL-PFGYGLSYTTFEYK---------------------NFAIEKEK 647

Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERV 701
                         D      +EVEN GK  G E+V +Y++      T  +K++  Y+RV
Sbjct: 648 ISM-----------DESIKISVEVENTGKYAGDEIVQLYTRKEEFLVTRPVKELKAYKRV 696

Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
            +  G+  KV F +         D     +++ G   ++VG     + F
Sbjct: 697 HLKPGEKKKVVFEIFP-DQFAYYDYDMKRVISPGTVEVMVGASSEDIKF 744


>gi|423289663|ref|ZP_17268513.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
           CL02T12C04]
 gi|423298156|ref|ZP_17276215.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
           CL03T12C18]
 gi|392663697|gb|EIY57244.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
           CL03T12C18]
 gi|392667374|gb|EIY60884.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
           CL02T12C04]
          Length = 850

 Score =  270 bits (689), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 157/420 (37%), Positives = 231/420 (55%), Gaps = 45/420 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y +   P  ER  DL+ R+T+ EK+  +   + G+PRLG+  Y   +EALHGV   GR  
Sbjct: 27  YKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-- 84

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
                          T FP  I   A++N  L +K+   +S EARA +N  + G      
Sbjct: 85  --------------FTVFPQAIGLAATWNPVLQQKVATVISDEARARWNELDQGRNQKEQ 130

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFWSP +N+ RDPRWGR  ET GEDP++ G     +V+GLQ           D R
Sbjct: 131 FSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQG---------EDPR 181

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LKI +  KH+ A    N E ++RF  + +++E+ ++E +   FEMCV +G  +S+M +Y
Sbjct: 182 YLKIVSTPKHFVA----NNEEHNRFICNPQISEKQLREYYFPAFEMCVKKGKAASIMTAY 237

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N +N +P   +  LL + +R DW F GY+VSDC     +V +HK++  TKE A    +KA
Sbjct: 238 NALNDVPCTLNAWLLQKVLRQDWGFRGYVVSDCGGPSLLVNAHKYVK-TKETAATLSIKA 296

Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           GLDL+CG D Y  + + A +Q  +++ADID++   +    M+LG FD   +  Y  +  +
Sbjct: 297 GLDLECGDDVYDEYLLNAYKQYMVSDADIDSAACHVLAARMKLGMFDSKERNPYARISPS 356

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            I +  H ++A +AAR+ IVLLKN    LPLN   +K++A+VG   NA     G+Y G P
Sbjct: 357 VIGSKDHQQVALDAARECIVLLKNQKNMLPLNVDKLKSIAVVG--INAGTCEFGDYSGAP 414



 Score =  137 bits (345), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 93/288 (32%), Positives = 144/288 (50%), Gaps = 53/288 (18%)

Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-V 518
           A    +  V V G++ S+E EG+DR D+ LP  Q E + ++      P  +V++ AG+ +
Sbjct: 597 AVSECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAGSSL 654

Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
            +N+   +  I +I+   YPGE+GG A+ADV+FG YNP GRLP+T+Y++   ++P    P
Sbjct: 655 AVNWMDEH--IPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS-LDELP----P 707

Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV 638
               +   GRTYK+F G V+YPFGYGLSY+ FKY                          
Sbjct: 708 FDDYDITKGRTYKYFKGDVLYPFGYGLSYSSFKY-------------------------- 741

Query: 639 GTNKPPCAAVLIDDVKCKDY--KFTFQIEVENMGKMDGSEVVMVYSKPPGIAG-THIKQV 695
                        D+K KD   K T    ++N G+  G EV  VY + P   G   IK++
Sbjct: 742 ------------SDLKVKDSTDKVTVSFRLKNTGRRKGDEVAQVYVRIPETGGIVPIKEL 789

Query: 696 IGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS-LLASGAHTILVG 742
            G+ RV +  G+S  +   ++  + L+  D      +L +G   ++VG
Sbjct: 790 KGFRRVPLEPGESRAIDIELDK-EQLRYWDTTKEQFILPAGTFDVMVG 836


>gi|423223731|ref|ZP_17210200.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638106|gb|EIY31959.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 854

 Score =  270 bits (689), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 161/421 (38%), Positives = 229/421 (54%), Gaps = 45/421 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D K P  ER  DL+ R+T+ EK+  +   + G+ RL +P Y   +EALHGV   GR  
Sbjct: 28  YKDEKAPMHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPGR-- 85

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
                          T FP  I   A++N  L  ++   +S EARA +N  + G      
Sbjct: 86  --------------FTVFPQAIGLAATWNPELQLQVATVISDEARARWNELDQGREQKSQ 131

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFWSP +N+ RDPRWGR  ET GEDPY+ G     +V+GLQ           D R
Sbjct: 132 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQG---------DDDR 182

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LKI +  KH+AA    N E ++RF  + +++E+ ++E ++  FE CV +G  +S+M +Y
Sbjct: 183 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMSAY 238

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N +N +P   +  LL + +R DW F GY+VSDC     +V +HK++  TKE A A  +KA
Sbjct: 239 NALNDVPCTLNAWLLTKVLRKDWGFKGYVVSDCGGPSLLVNAHKYVK-TKEAAAALSIKA 297

Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           GLDL+CG D Y    + A +Q  + +ADID++   +    M LG FD   Q  Y  +   
Sbjct: 298 GLDLECGDDVYDQPLLSAYRQYMVTDADIDSAAYRVLRARMELGLFDSGEQNPYTKISPA 357

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            I + +H E+A  AAR+ IVLLKN    LPLN   +K++A+VG   NA  +  G+Y G P
Sbjct: 358 VIGSAEHQEVALNAARECIVLLKNQKKMLPLNAKKVKSIAVVG--INAGSSEFGDYSGLP 415

Query: 421 C 421
            
Sbjct: 416 V 416



 Score =  141 bits (356), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 99/289 (34%), Positives = 151/289 (52%), Gaps = 55/289 (19%)

Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-V 518
           A +  +  V V G++ S+E EG+DR D+ LP  Q E + ++      P  +V++ AG+ +
Sbjct: 598 AVRECETVVAVLGINKSIEREGQDRYDIQLPADQQEFLQEIYKV--NPNIVVVLVAGSSL 655

Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
            IN+   +  I +I+   YPGE GG+A+A+V+FG YNPGGRLP+T+Y +   ++P    P
Sbjct: 656 AINWMDEH--IPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS-LDELP----P 708

Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKY---KVASSPKSVDIKLDKDQQCRDIN 635
               +   GRTYK+F G V+YPFGYGLSYT FKY   +VA   + +++            
Sbjct: 709 FDDYDITKGRTYKYFKGDVLYPFGYGLSYTTFKYSNLQVADGEEEINV------------ 756

Query: 636 YTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQ 694
                                    +FQ+  +N GK  G EV  VY K P       IK+
Sbjct: 757 -------------------------SFQL--KNSGKYAGDEVAQVYVKLPERDEVMPIKE 789

Query: 695 VIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
           + G+ERV + +G++ KV   +     L+  D A +  +  SG +TI+VG
Sbjct: 790 LKGFERVTLKSGENKKVTLKLRK-DLLRYWDEAKDKFVCPSGDYTIMVG 837


>gi|386821036|ref|ZP_10108252.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
           19592]
 gi|386426142|gb|EIJ39972.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
           19592]
          Length = 725

 Score =  270 bits (689), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 222/729 (30%), Positives = 343/729 (47%), Gaps = 90/729 (12%)

Query: 8   KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
           K  D+P+ + K+   +R  +L+  MT+ EKV  +      VPRLG+       E LHG++
Sbjct: 26  KSYDYPFQNPKIATEKRVDNLLSLMTIDEKVNALSTNP-EVPRLGVK-GTGHVEGLHGLA 83

Query: 68  FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEAR-AMYNLGNA 126
             G             E    T+FP       +++  L K+I +    EAR A+   G  
Sbjct: 84  LGGPAGWG----GKGKEPLPTTTFPQAYGLGETWDTELLKEIAKIEGYEARYALQKYGRG 139

Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK 186
           GL   +PN ++ RDPRWGR  E+ GED +  G+  + +V+GLQ          SD    +
Sbjct: 140 GLVIRAPNADLARDPRWGRTEESYGEDAFFNGKMTVAFVKGLQ---------GSDKTYWQ 190

Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
            ++  KH+ A    N   + R +  S   E+  +E + LPF+M V EG   + M +YN+V
Sbjct: 191 TASLMKHFLA----NSNEDGRTYTSSDFDERLWREYYALPFKMGVVEGGSRAYMAAYNKV 246

Query: 247 NGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD 306
           NGIP    P L + T+  +W  +G I +D  + + ++  HK+  D K    A  +KAG++
Sbjct: 247 NGIPAMVHPMLKDITV-DEWGQNGIICTDGGAYKLLLSDHKYYKD-KYLGAAATIKAGIN 304

Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS---PQYKNLGKNNIC 363
               D +T    GA+  G + EAD+D  LR  Y V+++LG  D S   P  K   + +  
Sbjct: 305 QFLDD-FTEGVYGALANGYLTEADLDEVLRGNYRVMIKLGMLDSSANNPYAKIGAEADSM 363

Query: 364 NPQHIE----LAAEAARQGIVLLKNDNGA--LPLNTGNIKTLALVGPHANATKAMIGNYE 417
           +P  +E    LA EA  + IVLLKND     LPL    +K +A++G +A+A   ++  Y 
Sbjct: 364 DPWELEAHKKLALEATEKSIVLLKNDPAKRLLPLQKKKVKKIAIIGEYADAV--LLDWYS 421

Query: 418 GTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----- 472
           GTP    SP+ G      + N      +++   N+    A++ AKNAD  ++  G     
Sbjct: 422 GTPPYTISPLQG------IKNKVGENVEVLFAKNNADGKAVEIAKNADVAIVFIGNHPTC 475

Query: 473 ----LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
                   V + GK+ VD      + E + K+   A     + ++S+    IN+ + N  
Sbjct: 476 NAGWAQCPVPSNGKEAVDRQALNSEYEDLVKLVYKANPNTVVGLISSFPYTINWTQEN-- 533

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGR 588
           I +I  V    +E G AIA+V+FG YNP GRL  TW     VK      PL   N   GR
Sbjct: 534 IPAIFHVTQNSQELGTAIANVLFGAYNPAGRLTQTW-----VKDISDLPPLMDYNIRNGR 588

Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
           TY +F G  +Y FG+GLSYT FKYK    PK +                           
Sbjct: 589 TYMYFKGKPLYAFGHGLSYTTFKYKDMEIPKQIK-------------------------- 622

Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQ 707
                  ++ + + ++ + N G++DG EVV +Y K         IK++  ++R+ I AG+
Sbjct: 623 -------ENEEVSVKVNITNAGEVDGDEVVQLYVKHINSTVERPIKELKSFKRIHIKAGE 675

Query: 708 SAKVGFTMN 716
           +  V   +N
Sbjct: 676 TKTVSLLLN 684


>gi|255532174|ref|YP_003092546.1| glycoside hydrolase family protein [Pedobacter heparinus DSM 2366]
 gi|255345158|gb|ACU04484.1| glycoside hydrolase family 3 domain protein [Pedobacter heparinus
           DSM 2366]
          Length = 799

 Score =  269 bits (688), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 221/807 (27%), Positives = 366/807 (45%), Gaps = 148/807 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WS------ 60
           Y D   P   R  +L+ +MTL EK  QM  L YG  R+    LP  EW    W       
Sbjct: 48  YEDPLQPLNARIDNLLSQMTLEEKTCQMATL-YGWKRVLKDSLPTKEWKTAIWKDGIANI 106

Query: 61  -EALHGVSFIGRRTNSPPGTHFDSEVPG-------------------------------- 87
            E L+G    G  + S   T     V                                  
Sbjct: 107 DEHLNGFLTWGVTSTSELVTDIKKHVWAMNETQRFFIEQTRLGIPVDFTNEGIRGVEAYE 166

Query: 88  ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
           AT FPT +    ++N +L +K+G+    EARA+      G T  ++P ++V RD RWGR+
Sbjct: 167 ATGFPTQLNMGMTWNRNLIRKMGRITGQEARAL------GYTNVYAPILDVARDQRWGRL 220

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
            E  GEDPY+V R  +    G+Q+               +I++  KH+A Y  +      
Sbjct: 221 EEVYGEDPYLVARLGVEMTLGMQENN-------------QIASTAKHFAVYSANKGAREG 267

Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
               D +V+ +++++  + PF+  + E  +  VM SYN  NGIP       L Q +R D+
Sbjct: 268 LARTDPQVSPREVEDIMLYPFKKVIQEAGIMGVMSSYNDYNGIPITGSEYWLTQRLRKDF 327

Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAVQ 322
            F GY+VSD D+++ +   H    + KE AV +   AGL++       D    +    V 
Sbjct: 328 GFGGYVVSDSDALEYLYNKHHVAANLKE-AVFQAFMAGLNVRTTFRPPDSIIIYARQLVN 386

Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN--ICNPQHIELAAEAARQGIV 380
           +G+I    I++ ++ +  V  +LG FD  P  K+   +   + +  H  +A +A+++ IV
Sbjct: 387 EGRIPIETINSRVKDVLRVKFKLGLFD-QPYVKDAAASEKLVNSIAHQAVALQASKESIV 445

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
           LLKN+N  LPL+  ++K +A++GP+A        +Y     + T+ ++G      +  + 
Sbjct: 446 LLKNNNQILPLSR-SLKKIAVIGPNAADNDYAHTHYGPLQSKSTNILEGIRNKIGADKVW 504

Query: 439 YAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
           YA GC ++V +N                ++I  A++ A  AD  ++V G +     E K 
Sbjct: 505 YAKGC-ELVDKNWPESEIFPEDPDATAIALIEDAVNTAMKADVAIVVLGGNTKTAGENKS 563

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R  L LPGFQ  LI  +    K PV  V++    + IN+   +  I  I++ GYPG +GG
Sbjct: 564 RTTLELPGFQLNLIKAIQKTGK-PVVAVMIGTQPMGINWI--DKYIDGIVYAGYPGVKGG 620

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD-------GP 596
            A+ADV+FG YNPGG+L +T+ ++         +PL    NFP +     D         
Sbjct: 621 IAVADVLFGDYNPGGKLTLTFPKS------VGQLPL----NFPSKPNAQTDEGELAKIKG 670

Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
           ++YPFG+GLSYT F Y         ++K+   +Q +D N ++                  
Sbjct: 671 LLYPFGFGLSYTTFAYS--------NLKISPIEQEKDGNISI------------------ 704

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
                  +++ N  K++G E+V +Y +       T+ K + G+ER+ +   ++  + FT+
Sbjct: 705 ------SVDITNTAKLEGDEIVQLYIRDVLSTVTTYEKILRGFERISLKPNETKTLKFTL 758

Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
                LK+ +     ++  G   +++G
Sbjct: 759 -FPDDLKLWNREMQHVIEPGTFKVMIG 784


>gi|423300729|ref|ZP_17278753.1| hypothetical protein HMPREF1057_01894 [Bacteroides finegoldii
           CL09T03C10]
 gi|408472616|gb|EKJ91142.1| hypothetical protein HMPREF1057_01894 [Bacteroides finegoldii
           CL09T03C10]
          Length = 735

 Score =  269 bits (688), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 216/765 (28%), Positives = 351/765 (45%), Gaps = 87/765 (11%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP 49
           S K K     Y DAK P  +R  DL+ RMTL EKV Q+     G              VP
Sbjct: 20  SAKDKKGGALYKDAKAPIEKRVDDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGEEVKKVP 79

Query: 50  -RLGLPLYEWWSEALHG----VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNES 104
             +G  +Y   +  L       +    R   P    +D+     T +P  +    S+N  
Sbjct: 80  AEIGSLIYFETNPELRNNMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSWNPD 139

Query: 105 LWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINY 164
           L ++     + EAR    +     TF SP I+V RDPRWGRV E  GEDPY  G +    
Sbjct: 140 LVEQACAVSAQEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYANGVFGAAS 194

Query: 165 VRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFI 224
           VRG        Y  D+ S   +++AC KHY  Y      G D  +  + +++Q + +T++
Sbjct: 195 VRG--------YQGDNMSAENRVAACLKHYVGYGASE-AGRDYVY--TEISKQTLWDTYL 243

Query: 225 LPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE 284
           LP++M V  G  +++M S+N ++G+P  A+P  + + ++  W   G+IVSD  +I+ +  
Sbjct: 244 LPYKMGVKAG-AATLMSSFNDISGVPGSANPYTMTEILKNRWRHDGFIVSDWGAIEQL-- 300

Query: 285 SHKFLNDTKEDAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLM 343
            ++ L  TK++A      AGL++D   + Y       V++GK++ A +D ++R + ++  
Sbjct: 301 KNQGLAATKKEAARHAFTAGLEMDMMSHAYDRHLQELVEEGKVSMAQVDEAVRRVLLLKF 360

Query: 344 RLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG 403
           RLG F+         K     PQ +++AA  A + +VLLKN+N  LPL   + K +A++G
Sbjct: 361 RLGLFERPYTPVTTEKERFLRPQSMDIAARLAAESMVLLKNENNVLPL--ADKKKIAVIG 418

Query: 404 PHANATKAMIGNYEGTPCRYTSPM--DGF---YAYSKVINYAPGCADIVCQNNSMIPAAI 458
           P A     ++G++ G        M  DG    +A    + YA GC +    N      A+
Sbjct: 419 PMAKNGWDLLGSWRGHGKDTDVVMLYDGLAAEFAGKAELRYALGC-NTKGDNREGFAEAL 477

Query: 459 DAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAV 518
            AA+ +D  V+  G  ++   E   R  + LP  Q EL  ++    K PV L++++   +
Sbjct: 478 GAARWSDVVVLCLGEMMTWSGENASRSSIALPQMQEELAKELKKVGK-PVVLILVNGRPL 536

Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSM 577
           ++N  +  P   +IL +  PG  G   +A ++ G+ NP G+L +T+ Y    + I Y   
Sbjct: 537 ELN--RLEPVSDAILEIWQPGVNGALPMAGILSGRINPSGKLAMTFPYSTGQIPIYYNR- 593

Query: 578 PLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
             R         YK      +YPFG+GLSYT+FKY                         
Sbjct: 594 --RKSGRGHQGFYKDMTSDPLYPFGHGLSYTEFKY------------------------- 626

Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVI 696
            GT  P    V       +  K + ++ V N+G  DG+E V  +   P  + T  +K++ 
Sbjct: 627 -GTVTPSATKV------KRGEKLSAEVTVTNIGARDGAETVHWFISDPYCSITRPVKELK 679

Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
            +E+  I AG++    F ++  +    V+      L +G + I V
Sbjct: 680 HFEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLETGEYNIHV 724


>gi|423291211|ref|ZP_17270059.1| hypothetical protein HMPREF1069_05102 [Bacteroides ovatus
           CL02T12C04]
 gi|392663822|gb|EIY57367.1| hypothetical protein HMPREF1069_05102 [Bacteroides ovatus
           CL02T12C04]
          Length = 800

 Score =  269 bits (688), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 230/800 (28%), Positives = 360/800 (45%), Gaps = 141/800 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
           Y D   P   R  DL+ +MTL EK  QM  L YG  R+     P   W    W + +   
Sbjct: 56  YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNI 114

Query: 64  ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
               +G+   G   + P                       P    +  + G     AT F
Sbjct: 115 DEQANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMF 174

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
           P      A++N+ L ++I +  + EA+A+      G T  +SP +++ +DPRWGRV+E+ 
Sbjct: 175 PAQCGQGATWNKKLIREIAKVTANEAKAL------GYTNIYSPILDIAQDPRWGRVVESY 228

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+ G      + GLQ  EG             I A  KH+A Y +     +     
Sbjct: 229 GEDPYLAGELGKQMILGLQS-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 274

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
           D  V  ++M+  ++ PF   + E     VM SYN  +G P       L + +R  W F G
Sbjct: 275 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 334

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
           Y+VSD ++++ +   H+ +  T+E+  A+V+ AGL++      TNFT           A+
Sbjct: 335 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAI 388

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
            +GK++   +D  +  +  V   +G FD   P      +  + N  H  ++ +AA + IV
Sbjct: 389 NEGKVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESIV 448

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
           LLKN+N  LPL+  N   +A++GP+    K +   Y        +   G   Y  +  + 
Sbjct: 449 LLKNENQMLPLSK-NFSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVR 507

Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
           YA GC DI+                Q  +MI  A++ AK +D  ++V G +     E   
Sbjct: 508 YAKGC-DIIDKYFPESELYNVPLDTQEQAMIQEAVELAKASDVAILVLGGNEKTVREEFS 566

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R +L L G Q +L+  V    K PV LV++   A  IN+A  N  + +I+   +PGE  G
Sbjct: 567 RTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMG 623

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
            AIA V+FG YNPGGRL +T +  +  +IP+ + P +P ++  G+     DG V+YPFGY
Sbjct: 624 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKGKVR--VDG-VLYPFGY 678

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           GLSYT F Y         D+K+ K          +G  +                  T  
Sbjct: 679 GLSYTTFGYS--------DLKISKP--------VIGPQE----------------NITLS 706

Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
             V+N GK  G EVV +Y +    + T   +V+ G+ER+ +  G+   V FT+   + L 
Sbjct: 707 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLG 765

Query: 723 IVDNAANSLLASGAHTILVG 742
           + D      +  G+ +++VG
Sbjct: 766 LWDKNNQFTVEPGSFSVMVG 785


>gi|409195436|ref|ZP_11224099.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
           21150]
          Length = 867

 Score =  269 bits (688), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 160/420 (38%), Positives = 225/420 (53%), Gaps = 43/420 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           ERA DL++ +TL EKV  M D    + RLG+  Y WW+EALHGV+  G+           
Sbjct: 35  ERADDLLKELTLEEKVSLMVDRNTAIERLGIEEYNWWNEALHGVARAGQ----------- 83

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
                AT FP  +   A+F+  +   +    S EARA ++            GLT W+PN
Sbjct: 84  -----ATVFPQPVGMAAAFDRDMVLDVFSAASDEARAKHHFFKERGERGRYQGLTMWTPN 138

Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
           INV RDPRWGR +E  GEDP++ G      V+GLQ         D   +  K+ AC KHY
Sbjct: 139 INVFRDPRWGRGMEAYGEDPFMNGVLGTAVVKGLQG--------DRSGKYDKLHACAKHY 190

Query: 195 AAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           A +    W   +R  F++  +  +D+ ET++  F+  V +GDV  VMC+YNR  G P C 
Sbjct: 191 AVHSGPEW---NRHSFNAENIRPRDLHETYLPAFKKLVIDGDVRMVMCAYNRFEGEPCCG 247

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLKAGLDLDCGD 311
           + +LL   +R +W F G +VSDC +I      ++H    D K  +   VL AG DL+CGD
Sbjct: 248 NNQLLRDILRNEWGFDGVVVSDCWAINDFFNKDAHAMYPDAKTASTDAVL-AGTDLNCGD 306

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIE 369
            Y +  + AV+QG I E  +D SLR L I    LG  D     ++  +  + + +P H E
Sbjct: 307 SYPSL-VEAVEQGLITEEQLDISLRRLLIARFELGEMDPDEEVEWSKIPHSVVSSPTHSE 365

Query: 370 LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG 429
           +A EAAR+ + LL N NGALPL    + T+A++GP+AN +    GNY GTP   T+ + G
Sbjct: 366 MALEAARKSMTLLMNKNGALPLKKEGL-TVAVMGPNANDSLMQWGNYNGTPATTTTILQG 424



 Score =  130 bits (326), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 80/272 (29%), Positives = 129/272 (47%), Gaps = 51/272 (18%)

Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
           IP+++    +AD  V  +G+   +E E          G DR D+ LP  Q E++  +  A
Sbjct: 593 IPSSVAKVADADVVVFASGISPFLEGEEMGVDLPGFKGGDRTDIALPAIQKEMLKALHKA 652

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K    +++++     I F +      +IL   YPG+ GG+A+A+V+FG YNP GRLP+T
Sbjct: 653 GK---EIILVNCSGSAIGFEEATDYSSAILQAWYPGQAGGQAVAEVLFGDYNPAGRLPVT 709

Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
           +Y++         +P     N   RTY++F+G  +YPFGYGLSYT F Y           
Sbjct: 710 FYKS------VDQLPDFQDYNMTNRTYRYFEGEPLYPFGYGLSYTTFSY----------- 752

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
                            ++P  +   I      + + + ++ V N G  DG EVV +Y +
Sbjct: 753 -----------------DQPELSQTSI----STEEEASLKVSVANTGDYDGEEVVQLYLQ 791

Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
            P         + G++RVFI  G++ +V F +
Sbjct: 792 KPDDTEGPSLTLRGFQRVFIPKGETVEVEFQL 823


>gi|397691073|ref|YP_006528327.1| beta-glucosidase [Melioribacter roseus P3M]
 gi|395812565|gb|AFN75314.1| beta-glucosidase [Melioribacter roseus P3M]
          Length = 923

 Score =  269 bits (688), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 159/430 (36%), Positives = 237/430 (55%), Gaps = 42/430 (9%)

Query: 21  YPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTH 80
           Y ER  DL+  MT  EK++Q+ + A  +PRLGL  Y +W+E+LHGV              
Sbjct: 113 YKERLNDLISLMTTEEKIKQLTNQADSIPRLGLRAYNYWNESLHGVL------------- 159

Query: 81  FDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRD 140
                 GATSFP  I   A+++  L  ++   VS EARA+  L   GLT+WSP IN+ RD
Sbjct: 160 ----AEGATSFPQAIALGATWDPRLVNRVATAVSDEARALNRLYGKGLTYWSPTINIARD 215

Query: 141 PRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD 200
           PRWGR  E+  EDPY++ R  + +++G+Q      Y+       LK  A  KH+ A    
Sbjct: 216 PRWGRNEESYSEDPYLLSRMGVAFIKGMQGDH--PYY-------LKTVATPKHFIA---- 262

Query: 201 NWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQ 260
           N E   R    S V  +++ E ++  F+  + E    S+M +YN +N +P+ A+  L+  
Sbjct: 263 NNEEERRHTGSSDVDMRNLYEYYLPAFKSAIVEARAYSIMGAYNELNHVPSNANMFLMTD 322

Query: 261 TIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGA 320
            +R  W F GY+VSDC +I  ++  HKF   T  +AVAR + AG DL+CG  Y  F   A
Sbjct: 323 LLRRQWGFEGYVVSDCGAIHDMLYGHKFFK-TGAEAVARSILAGCDLNCGQAYREFIKDA 381

Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQ 377
           + +G + E DID++L  +     RLG FD  P+   Y ++GK+ + + ++  LA +AAR+
Sbjct: 382 LDEGLLREKDIDSALFRVLSARFRLGEFD-PPELVPYSSIGKDKLDSKENRRLALDAARK 440

Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVI 437
            IVLLKN N  LP++   IK++A++GP  NA +A +G Y G P    SP++G    +  +
Sbjct: 441 SIVLLKN-NDILPIDKSKIKSIAVIGP--NAREAQLGIYSGFPNVLISPLEGIKNKADSL 497

Query: 438 N----YAPGC 443
           +    Y  GC
Sbjct: 498 DIRVGYVKGC 507



 Score =  100 bits (248), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 83/285 (29%), Positives = 131/285 (45%), Gaps = 43/285 (15%)

Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
           AA+N D  ++V G+   +  E  DR ++ LP  Q EL+ + A+     + +V+++ G V 
Sbjct: 665 AAEN-DLVILVLGITPGISQEELDRKEIELPSVQRELVKQTAEVNPN-IVIVLVNGGPVA 722

Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
           +  A+   K     W  Y GE GG+A+ADV+FG YNPGG+LP T+Y +     P +   +
Sbjct: 723 LAGAEKYAKAIVENW--YNGEFGGQALADVLFGDYNPGGKLPQTFYASTEQLPPMSDYDI 780

Query: 580 RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
             +NN   RTY + +   ++PFG+GLSYT FKY    S K V   L++            
Sbjct: 781 --INN--PRTYMYLNEQALFPFGHGLSYTTFKY---DSLKIVSNTLNETDT--------- 824

Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGY 698
                                + Q  + N+G  +G EVV +Y+           KQ+  +
Sbjct: 825 --------------------LSLQFRLTNVGNRNGDEVVQIYASCKDAKFKVPRKQLKRF 864

Query: 699 ERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
            R+ +  G+S  + F +     L       N  +   GA  IL+G
Sbjct: 865 RRLTLQTGESKVLEFKI-PVDELAFYSTYENDFVVEKGAWEILIG 908


>gi|423289665|ref|ZP_17268515.1| hypothetical protein HMPREF1069_03558 [Bacteroides ovatus
           CL02T12C04]
 gi|423298158|ref|ZP_17276217.1| hypothetical protein HMPREF1070_04882 [Bacteroides ovatus
           CL03T12C18]
 gi|392663699|gb|EIY57246.1| hypothetical protein HMPREF1070_04882 [Bacteroides ovatus
           CL03T12C18]
 gi|392667376|gb|EIY60886.1| hypothetical protein HMPREF1069_03558 [Bacteroides ovatus
           CL02T12C04]
          Length = 955

 Score =  269 bits (688), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 229/764 (29%), Positives = 363/764 (47%), Gaps = 119/764 (15%)

Query: 6   KVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSE 61
           K +++D  Y DA LP  ER + L+  MT PE   ++    +G+P  G+P LY       E
Sbjct: 163 KGEVTDRRYMDASLPVEERVESLLAVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVE 219

Query: 62  ALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY 121
           A+HG S+         G+       GAT FP  +   A++N  L +++   +  E   + 
Sbjct: 220 AVHGFSY---------GS-------GATIFPQALAMGATWNRKLTEEVAMVIGDET-VVA 262

Query: 122 NLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
           N   A    WSP ++V +D RWGR  ET GEDP +V +    +++G Q            
Sbjct: 263 NTKQA----WSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------ 306

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
           SR L  +   KH+  +         R   D  ++E++M+E  ++PF   V   D  S+M 
Sbjct: 307 SRGLFTTP--KHFGGHGA---PLGGRDSHDIGLSEREMREVHLVPFRHVVRNYDCQSLMM 361

Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
           +Y+   GIP     +LL Q +R +W F+G+IVSDC +I  +     +    K +A  + L
Sbjct: 362 AYSDYMGIPVAGSTELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQAL 421

Query: 302 KAGLDLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
            AG+  +CGD Y +   + A + G+I   ++D   R +   + R   F+ +P  K L  N
Sbjct: 422 AAGIATNCGDTYNDKEVIQAAKDGRINMVNLDNVCRTMLATMFRNELFEKNP-CKPLDWN 480

Query: 361 NIC----NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
            I     + +H E+A +AAR+ IV+L+N +  LPL+   +KT+A++GP A+  +   G+Y
Sbjct: 481 KIYPGWNSDRHREMARQAARESIVMLENKDNLLPLSK-TLKTIAVLGPGADDLQP--GDY 537

Query: 417 --EGTPCRYTSPMDGFYA----YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV 470
             +  P +  S + G  A     +KV+ Y  GC D    + + IP A+ AA  +D  V+V
Sbjct: 538 TPKLQPGQLKSVLSGIKAAVGKQTKVL-YEQGC-DFTTPDATNIPKAVKAASQSDVVVMV 595

Query: 471 AGLDLSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDIN 521
            G   + EA         E  D   L+LPG Q EL+  V    K PV L++ +    D+ 
Sbjct: 596 LGDCSTSEATNNVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVVLILQAGRPYDL- 653

Query: 522 FAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP 581
             K +   K+IL    PG+EGG A ADV+FG YNPGGRLP+T+            +PL  
Sbjct: 654 -LKASEMCKAILVNWLPGQEGGPATADVLFGDYNPGGRLPMTFPRH------VGQLPLYY 706

Query: 582 VNNFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
                GR Y++ D     +Y FGYGLSYT F+Y         D+K+   Q+  + N  V 
Sbjct: 707 NFKTSGRRYEYVDMEFYPLYRFGYGLSYTSFEYS--------DLKI---QEKSNGNVMV- 754

Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGY 698
                                  Q  V+N+G   G EV  +Y +       T + ++  +
Sbjct: 755 -----------------------QATVKNVGGCAGDEVAQLYITDMYASVKTRVMELKDF 791

Query: 699 ERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            R+ +  G+S  V F +     + ++++  + ++  G   ++VG
Sbjct: 792 TRIHLQPGESKNVSFELTPY-DISLLNDRMDRVVEKGEFKVMVG 834


>gi|317477144|ref|ZP_07936385.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316906687|gb|EFV28400.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 814

 Score =  269 bits (688), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 219/727 (30%), Positives = 336/727 (46%), Gaps = 117/727 (16%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+PL+    E  HG   IG                  T FPT I   +++N  L +++
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------------TTVFPTSIGQASTWNPELIRRM 190

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           G+ ++TEA A           + P +++ RDPRW RV ET GED Y+ G      V+G Q
Sbjct: 191 GRAIATEASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDAYLNGVMGAALVKGFQ 245

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                E+ R       K+ A  KH+AAY    W         + V  ++M+E    PF  
Sbjct: 246 G----EFPRTKG----KVIATLKHFAAY---GWTEGGHNGGSAHVGNREMEEAIYPPFRE 294

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            V  G +S VM SYN ++GIP  A+  LL   ++  W F G++VSD  +I  + E    +
Sbjct: 295 AVAAGALS-VMSSYNEIDGIPCTANSNLLTGLLKKRWQFKGFVVSDLYAIGGLREHG--V 351

Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
            DT  +A  + + AG+D D G + Y    + AV++G + E  I+ ++  +  +   +G F
Sbjct: 352 ADTDYEAAVKAVNAGVDSDLGTNVYAGQLVNAVKRGDVQEVVINKAVSRILALKFHMGLF 411

Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
           D     +   +  + + +H+ELA E ARQ I+LLKN N  LPLN   +KT+A++GP+A+ 
Sbjct: 412 DHPFVDEREPEQVVASTEHLELAREVARQSIILLKNKNELLPLNK-KMKTIAVIGPNADN 470

Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYAYSKVIN-----YAPGCADIVCQNNSMIPAAIDAA 461
              M+G+Y    +     + +DG     KV N     YA GCA +   + S    AI+AA
Sbjct: 471 IYNMLGDYTAPQSESSVVTVLDGIR--QKVSNDTHIIYAKGCA-VRDSSKSGFQEAIEAA 527

Query: 462 KNADATVIVAG----LDLSVE-------------------AEGKDRVDLLLPGFQTELIN 498
           + +D  V+V G     D S +                    EG DR  L L G Q ELI 
Sbjct: 528 RQSDVVVMVMGGSSARDFSSKYEETGAAKVSDSHISDMESGEGYDRSTLELLGRQRELIR 587

Query: 499 KVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGG 558
           +V    K P+ LV++    + +   +   ++ +I+   YPG +GG A+ADV+FG YNP G
Sbjct: 588 EVGKLNK-PIVLVLIKGRPLLLEGIE--AEVDAIVDAWYPGMQGGNAVADVLFGDYNPAG 644

Query: 559 RLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFF--DGPVVYPFGYGLSYTQFKYKVAS 616
           RL I+      V      +P+       G   K+   +G   YPFGYGLSYT F Y    
Sbjct: 645 RLTIS------VPRSVGQLPVYYNTKRKGNRSKYIEEEGTPRYPFGYGLSYTSFNYS--- 695

Query: 617 SPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSE 676
                           D+   V   +  C                  ++V N G  DG E
Sbjct: 696 ----------------DLKAEVVEAEDSCLV-------------NISVKVRNEGSRDGDE 726

Query: 677 VVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASG 735
           VV +Y +    +  T  KQ+ G++R+ +  G++ ++ F ++  KSL +        +  G
Sbjct: 727 VVQLYLRDEVASFTTPFKQLCGFQRIHLKVGETKEITFRLDK-KSLALYMQNEEWAVEPG 785

Query: 736 AHTILVG 742
             T+++G
Sbjct: 786 RFTLMLG 792


>gi|322371968|ref|ZP_08046510.1| glycoside hydrolase family 3 domain protein [Haladaptatus
           paucihalophilus DX253]
 gi|320548390|gb|EFW90062.1| glycoside hydrolase family 3 domain protein [Haladaptatus
           paucihalophilus DX253]
          Length = 776

 Score =  269 bits (688), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 228/789 (28%), Positives = 361/789 (45%), Gaps = 136/789 (17%)

Query: 24  RAKDLVERMTLPEKVQQMGD---------------------LAYGV---PRLG----LPL 55
           R ++L+E MT+ EKV Q+G                      L  G+    RLG    LP 
Sbjct: 17  RVEELLEEMTITEKVAQLGSVNANKLLDDDGSLDRKAVEELLENGIGHLTRLGGEGSLPP 76

Query: 56  YEWWSEALHGVSFIGRRTNS--PPGTHFDSEV----PGATSFPTVILTTASFNESLWKKI 109
            E          F+G  T    P   H +       P  T+FP ++   ++++  L  +I
Sbjct: 77  REAAKRTNELQDFLGSETRLGIPAIPHEECLSGYMGPSGTTFPQMLGVASTWSPDLVAEI 136

Query: 110 GQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGL 168
             T+  +  A+      G T   SP +++ RD RWGRV ET GEDPY+V   A  YV GL
Sbjct: 137 TDTIRGQLEAI------GTTHALSPVLDIARDLRWGRVEETFGEDPYLVAAMARGYVNGL 190

Query: 169 QDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFE 228
           Q         D D     ISA  KH+A +      G +R   +  V  ++++ET + PFE
Sbjct: 191 QG--------DGDG----ISATLKHFAGHGAGEG-GKNRSSVN--VGRRELRETHLFPFE 235

Query: 229 MCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKF 288
             +   D  SVM +Y+ ++GIP  +D  LL   +RG+W F G +VSD  S++  ++S   
Sbjct: 236 AVIKTADAESVMNAYHDIDGIPCASDGWLLTDVLRGEWGFDGTVVSDYYSVE-FLQSEHG 294

Query: 289 LNDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLG 346
           +  +K+ A    ++AGLD++    D Y +  + AV+ G +AEA ++T++R +       G
Sbjct: 295 VAASKQAAGVMAVEAGLDVELPYTDCYGDHLVNAVEDGDVAEATVNTAVRRVLRAKAEKG 354

Query: 347 YFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHA 406
             D      +            +L   AAR+ + LLKN++  LP +   ++T+A+VGP A
Sbjct: 355 LLDDPTVDVDAAAAPFNTENARDLTTRAARESMTLLKNEDDFLPFDGEELETVAVVGPKA 414

Query: 407 NATKAMIGNYEGTPCRY---------TSPMDGFYAYSKV----INYAPGCADIVCQNNSM 453
           +  + ++G+Y   P  Y         T+P+D   A  +     + Y  GC          
Sbjct: 415 DNAQELMGDY-AYPAHYPTEEVDLDATTPLDAIEARGEHAGFDVRYEQGCTTTGSSTEDF 473

Query: 454 IPAAIDAAKNADATVIV---AGLDLS-------------VEAEGKDRVDLLLPGFQTELI 497
             AA  A     A   V   + +D S                EG D VDL LPG Q EL+
Sbjct: 474 DSAAEAAEAADVAVTFVGARSAVDFSDIDEKQADLPSVPTSGEGCDVVDLDLPGVQQELV 533

Query: 498 NKVADAAKGPVTLVIMSAGAVDINF-AKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNP 556
            +V +    P+ +V++S     + + A+  P   ++L+   PGE GG  IA+V+FG++NP
Sbjct: 534 ERVHETGT-PLVVVVVSGKPHSVEWIAEEAP---ALLYAWLPGERGGEGIAEVLFGEHNP 589

Query: 557 GGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVA 615
           GGRLP++       + + Y   P     N     + + +   +YPFG+GLSYT F+Y   
Sbjct: 590 GGRLPVSIPRSVGQLPVYYNRKP-----NTANEEHVYTESTPLYPFGHGLSYTDFEYG-- 642

Query: 616 SSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGS 675
                 D+ L  D               P   V            + ++ V N G  DG 
Sbjct: 643 ------DLSLSTDSIA------------PSGRV------------SAEVTVSNTGDRDGH 672

Query: 676 EVVMVY--SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA 733
           EVV +Y  +K P  A   +++++G+ER+F+AAG+S ++ F ++A + L   D   N  + 
Sbjct: 673 EVVQLYASAKSPSQA-RPVQELVGFERIFLAAGESKRIIFEIDASQ-LAFHDRDMNLAVE 730

Query: 734 SGAHTILVG 742
            G + + VG
Sbjct: 731 RGPYELRVG 739


>gi|218130696|ref|ZP_03459500.1| hypothetical protein BACEGG_02285 [Bacteroides eggerthii DSM 20697]
 gi|217987040|gb|EEC53371.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           eggerthii DSM 20697]
          Length = 858

 Score =  269 bits (688), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 161/421 (38%), Positives = 234/421 (55%), Gaps = 45/421 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y + K P  ER  DL+ R+T+ EK+  +   + G+ RL +P Y   +EALHGV   GR  
Sbjct: 29  YKNEKAPIHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPGR-- 86

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
                          T FP  I   A++N  L K++   +S EARA +N  + G      
Sbjct: 87  --------------FTVFPQAIGLAATWNPVLQKQVATVISDEARARWNELDQGREQNSQ 132

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFWSP +N+ RDPRWGR  ET GEDPY+ G     +V+GLQ          +DSR
Sbjct: 133 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQG---------NDSR 183

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LKI +  KH+AA    N E ++RF  + +++E+ ++E ++  FE CV EG  +S+M +Y
Sbjct: 184 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYLPAFEACVKEGKSASIMSAY 239

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N +N +P   +  LL + +R DW F GY+VSDC     +V +HK++  TKE A    +KA
Sbjct: 240 NALNDVPCTLNAWLLTKVLREDWGFKGYVVSDCGGPALLVNAHKYVK-TKEAAATLSIKA 298

Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           GLDL+CG D Y    + A +Q  + +ADID++   +    M+LG FD      Y  +   
Sbjct: 299 GLDLECGDDVYDAPLLNAYRQYMVTDADIDSAAYRVLRARMQLGLFDSGENNPYTKISPK 358

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            I + +H ++A +AAR+ IVLLKN N  LPL+   IK++A+VG   NA ++  G+Y G P
Sbjct: 359 VIGSKEHQKVALDAARECIVLLKNQNKMLPLDAKKIKSIAVVG--INAGRSEFGDYSGLP 416

Query: 421 C 421
            
Sbjct: 417 V 417



 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 96/284 (33%), Positives = 145/284 (51%), Gaps = 49/284 (17%)

Query: 462 KNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-VDI 520
           +  +  V V G++ ++E EG+DR D+ LP  Q E + ++      P  +V++ AG+ + I
Sbjct: 601 RECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKV--NPNIVVVLVAGSSLSI 658

Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR 580
           N+   +  I +I+   YPGE GG+A+A+V+FG YNPGGRLP+T+Y +   ++P    P  
Sbjct: 659 NWMDEH--IPAIINAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS-LDELP----PFD 711

Query: 581 PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGT 640
             +   GRTY++F G V+YPFGYGLSYT FKY               D Q  D N  V  
Sbjct: 712 DYDITKGRTYQYFKGNVLYPFGYGLSYTSFKY--------------SDLQVTDGNQEV-- 755

Query: 641 NKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYE 699
           N   C                    ++N+GK  G EV  +Y K P       IK++ G+E
Sbjct: 756 NVSFC--------------------LKNVGKYAGDEVAQIYVKLPERDKIMPIKELKGFE 795

Query: 700 RVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
           R+ +  G+S KV   +     L+  D      +  SG +TI++G
Sbjct: 796 RISLKRGESRKVTIRLKK-DLLRYWDEEKECFVHPSGDYTIMIG 838


>gi|346226406|ref|ZP_08847548.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
          Length = 775

 Score =  269 bits (688), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 197/659 (29%), Positives = 324/659 (49%), Gaps = 84/659 (12%)

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVL 147
           T+FP  +    S++  L ++  +  + EA A      +G+ + ++P I++ RDPRWGRV+
Sbjct: 129 TTFPIPLAEACSWDLELMEQSARIAAEEATA------SGIAWNFAPMIDIARDPRWGRVM 182

Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
           E  GEDPY+    A   VRG Q   G+E ++D  S+   + A  KH+  Y      G D 
Sbjct: 183 EGAGEDPYLGSLVARARVRGFQ---GIETYKDF-SKINTMMATSKHFVGYGAVQ-AGRDY 237

Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
              D  V  + + ET++ PF+  V+EG V++ M ++N +NG+P   +  L  + +R  W 
Sbjct: 238 HSVDMSV--RTLHETYLPPFKAAVDEG-VTAFMTAFNDLNGVPCTGNKYLFKEILRDRWG 294

Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQGKI 326
           F G +V+D  +IQ +V +H F  D K  A    + AG+D+D   + +  +    V++GK+
Sbjct: 295 FGGMVVTDYTAIQEMV-AHGFARDLKH-ATELAIDAGIDMDMISEGFVTYLKELVEEGKV 352

Query: 327 AEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN--ICNPQHIELAAEAARQGIVLLKN 384
           +E  ID ++  +  +   LG FD   +Y N  +    + NP+H++ A E A++ IVLL+N
Sbjct: 353 SEKQIDVAVSRILEMKFLLGLFDDPFKYCNAERQKEVVMNPEHLKAAREVAQRSIVLLEN 412

Query: 385 DNGALPLNTGNIKTLALVGPHANATKAMIGNY--EGTPCRYTSPMDGF---YAYSKV-IN 438
            N  LPL     K +AL+GP     +++ G +  +G P +  + M+G    Y  S+V  +
Sbjct: 413 KNNVLPLKKNEPKRVALIGPFVKERESLTGEWAIKGDPDKSVTLMEGLEEKYKDSQVKFS 472

Query: 439 YAPGCA----DIVCQ--------NNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
           YA G +    D   Q        + S    AI+ A+ +D  ++  G       E   R D
Sbjct: 473 YAKGTSLPVIDRTTQKVSTTRVPDRSGFSEAINLARTSDVILVAMGEKFHWSGEAASRTD 532

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           + LPG Q EL+ ++    K P+ LV+ +   +D+++   N  + +I+   YPG   G A+
Sbjct: 533 ITLPGNQRELLKELKKTGK-PIILVLFNGRPLDLSWEAEN--VDAIVEAWYPGIMAGHAV 589

Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPY------TSMPLRPVNNFPGRTYKFFDGP--VV 598
           ADV+ G YNP  +L +T +  N  +IP       T  P    N    R+  + D P   +
Sbjct: 590 ADVLSGDYNPSAKLVMT-FPRNVGQIPIFYNVKNTGRPFDEDNPADYRS-SYIDCPNSPL 647

Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
           YPFGYGLSYT F+Y                      N  + + K     +L         
Sbjct: 648 YPFGYGLSYTSFEYD---------------------NAKISSKKLERGGIL--------- 677

Query: 659 KFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
             T  ++V N G MDG EVV +Y     G     +K++ G++++ +  G++  V FT++
Sbjct: 678 --TVSVDVTNTGTMDGEEVVQLYIHDKVGSVVRPVKELKGFKKIHLKKGETKTVEFTID 734


>gi|317474225|ref|ZP_07933501.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316909535|gb|EFV31213.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 858

 Score =  269 bits (688), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 161/421 (38%), Positives = 234/421 (55%), Gaps = 45/421 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y + K P  ER  DL+ R+T+ EK+  +   + G+ RL +P Y   +EALHGV   GR  
Sbjct: 29  YKNEKAPIHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPGR-- 86

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
                          T FP  I   A++N  L K++   +S EARA +N  + G      
Sbjct: 87  --------------FTVFPQAIGLAATWNPVLQKQVATVISDEARARWNELDQGREQNSQ 132

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFWSP +N+ RDPRWGR  ET GEDPY+ G     +V+GLQ          +DSR
Sbjct: 133 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQG---------NDSR 183

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LKI +  KH+AA    N E ++RF  + +++E+ ++E ++  FE CV EG  +S+M +Y
Sbjct: 184 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYLPAFEACVKEGKSASIMSAY 239

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N +N +P   +  LL + +R DW F GY+VSDC     +V +HK++  TKE A    +KA
Sbjct: 240 NALNDVPCTLNAWLLTKVLREDWGFKGYVVSDCGGPALLVNAHKYVK-TKEAAATLSIKA 298

Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           GLDL+CG D Y    + A +Q  + +ADID++   +    M+LG FD      Y  +   
Sbjct: 299 GLDLECGDDVYDAPLLNAYRQYMVTDADIDSAAYRVLRARMQLGLFDSGENNPYTKISPK 358

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            I + +H ++A +AAR+ IVLLKN N  LPL+   IK++A+VG   NA ++  G+Y G P
Sbjct: 359 VIGSKEHQKVALDAARECIVLLKNQNKMLPLDAKKIKSIAVVG--INAGRSEFGDYSGLP 416

Query: 421 C 421
            
Sbjct: 417 V 417



 Score =  133 bits (335), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 92/284 (32%), Positives = 147/284 (51%), Gaps = 49/284 (17%)

Query: 462 KNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-VDI 520
           +  +  V V G++ ++E EG+DR D+ LP  Q E + ++      P  +V++ AG+ + I
Sbjct: 601 RECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKV--NPNIVVVLVAGSSLSI 658

Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR 580
           N+   +  I +I+   YPGE GG+A+A+V+FG YNPGGRLP+T+Y +   ++P    P  
Sbjct: 659 NWMDEH--IPAIINAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS-LDELP----PFD 711

Query: 581 PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGT 640
             +   GRTY++F G V+YPFGYGLSYT FKY         D+++ +  Q  ++++    
Sbjct: 712 DYDITKGRTYQYFKGNVLYPFGYGLSYTSFKYS--------DLQVTEGNQEVNVSFC--- 760

Query: 641 NKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYE 699
                                    ++N+GK  G EV  +Y K P       IK++ G+E
Sbjct: 761 -------------------------LKNVGKYAGDEVAQIYVKLPERDKIMPIKELKGFE 795

Query: 700 RVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
           R+ +  G S KV   +     L+  D      +  SG +TI+VG
Sbjct: 796 RISLKRGGSRKVTIRLKK-DLLRYWDEEKGCFVHPSGDYTIMVG 838


>gi|408369545|ref|ZP_11167326.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
 gi|407745291|gb|EKF56857.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
          Length = 881

 Score =  269 bits (688), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 162/423 (38%), Positives = 235/423 (55%), Gaps = 50/423 (11%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +P+ + +L    R  DL+ER+T+ EK+ Q+   +  + RLG+P Y WW+E+LHGV+  G 
Sbjct: 27  YPFQNPELDDSARVADLLERLTVEEKIDQLLYTSPAIERLGIPEYNWWNESLHGVARAGY 86

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGN-- 125
                           AT FP  I   A+++  L K++   +S EARA ++     G   
Sbjct: 87  ----------------ATVFPQSITIAAAWDSDLLKEVADAISDEARAKHHEYIRRGQRG 130

Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
              GLTFWSPNIN+ RDPRWGR  ET GEDPY+ G+  I YV+GLQ          +D  
Sbjct: 131 IYQGLTFWSPNINIFRDPRWGRGHETYGEDPYLTGQLGIAYVKGLQ---------GNDPN 181

Query: 184 PLKISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
            LK+ A  KH+A +      G +  R  FD   +++D+ ET++  F   V +GDV SVM 
Sbjct: 182 YLKLVATAKHFAVH-----SGPEPLRHEFDVSPSKRDLWETYLPAFRYLVKQGDVKSVMT 236

Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
           +YNRV G    A   L    +R  W+F GY+VSDC +I  I + HK   D  E +   V+
Sbjct: 237 AYNRVYGEAASASDTLFT-ILRDYWDFDGYVVSDCFAISDIWKYHKIAKDAAEASAMAVI 295

Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNL 357
           + G DL+CGD Y      A QQG + E DID +L  L    ++LG FD  P+    Y  +
Sbjct: 296 E-GCDLNCGDSYEKLNQ-AYQQGMVTEKDIDIALSRLMEARIKLGMFD--PEQLVPYAQI 351

Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
             N   + +H +LA +AA++ IVLLKN    LPL + ++K++A++GP+A+  +++ GNY 
Sbjct: 352 PFNVNTSEKHNQLALKAAKESIVLLKNQGDLLPL-SKDLKSVAVIGPNADNIQSLWGNYN 410

Query: 418 GTP 420
           G P
Sbjct: 411 GNP 413



 Score =  145 bits (367), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 90/272 (33%), Positives = 139/272 (51%), Gaps = 45/272 (16%)

Query: 473 LDLSVEA-EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKS 531
           +D+ VE   G DR  L LP  Q  L+ +VA   K P+ LV+++  A+ IN+A  N  I +
Sbjct: 620 MDVVVEGFAGGDRTALDLPASQRTLLKEVAKTGK-PIVLVLLNGSALSINWAAEN--IPA 676

Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYK 591
           I+  GY G++GG A+A+V+FG YNP  RLP+T+Y++         +P     N  GRTY+
Sbjct: 677 IMTAGYAGQQGGNAVAEVLFGDYNPAARLPVTYYKS------VEDLPDFEDYNMDGRTYR 730

Query: 592 FFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID 651
           +F+   +YPFGYGLSYT F Y     P  +D+                            
Sbjct: 731 YFEKEPLYPFGYGLSYTTFDYSKFQLPSKIDM---------------------------- 762

Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAK 710
                +      +EV N G  DG EVV VY +   G     I++++G++R+ +  G+S K
Sbjct: 763 -----NESIELSVEVTNTGAYDGDEVVQVYLTDEKGSTPRPIRELVGFKRIHLKKGESQK 817

Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           V FT+   + L ++D+  + ++  G  +I VG
Sbjct: 818 VQFTIEP-RQLSMIDDKGDLVIEPGVFSISVG 848


>gi|336415919|ref|ZP_08596257.1| hypothetical protein HMPREF1017_03365 [Bacteroides ovatus
           3_8_47FAA]
 gi|335939822|gb|EGN01694.1| hypothetical protein HMPREF1017_03365 [Bacteroides ovatus
           3_8_47FAA]
          Length = 782

 Score =  269 bits (688), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 221/723 (30%), Positives = 344/723 (47%), Gaps = 114/723 (15%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+P++    EA HG   IG                 AT FPT I   A+++  L K++
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------------ATVFPTGIGMAATWSPELVKEV 170

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           GQ ++ E R+       G   + P +++ RDPRW RV ET GEDP + G    + V GL 
Sbjct: 171 GQVIAKEIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLG 225

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                     + S+     A  KH+ AY +   EG    ++ S V  +D+ + F+ PF  
Sbjct: 226 G--------GNLSQKYATIATLKHFLAYAVP--EGGQNGNYAS-VGIRDLHQNFLPPFRK 274

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            ++ G +S VM SYN ++GIP  ++  LL Q +R +W F G++VSD  SI+ I ESH F+
Sbjct: 275 AIDAGALS-VMTSYNSIDGIPCTSNHNLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FV 332

Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
             TKE+A  + + AG+D+D G D YTN    AVQ G++ +A IDT++  +  +   +G F
Sbjct: 333 APTKENAAIQSVTAGVDVDLGGDAYTNLCH-AVQSGQMDKAVIDTAVCRVLRMKFEMGLF 391

Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
           +       +    +   +HIELA + A+  I LLKN+N  LPL+   I  +A++GP+A+ 
Sbjct: 392 EHPYVDPKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSK-TINKVAVIGPNADN 450

Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYA--YSKVINYAPGCADIVCQNNSMIPAAIDAAKNA 464
              M+G+Y          + +DG         + Y  GCA I     + I  AI+AA+ +
Sbjct: 451 RYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCA-IRDTTVNEIEQAIEAARRS 509

Query: 465 D----------------------ATVIVAGLDLSVE-AEGKDRVDLLLPGFQTELINKVA 501
           +                      A V   G    +E  EG DR  L L G Q EL+  + 
Sbjct: 510 EVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQ 569

Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
              K P+ +V +    ++ N+A       ++L   YPG+EGG AIADV+FG YNP GRLP
Sbjct: 570 KTGK-PLIVVYIEGRPLEKNWASEYA--DALLTAYYPGQEGGNAIADVLFGDYNPSGRLP 626

Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
           I+    +  +IP       P N+     Y       +Y FGYG+SYT F+Y         
Sbjct: 627 IS-VPRSVGQIPVYYNQKAPRNH----DYVEVSSSPLYSFGYGMSYTTFEYS-------- 673

Query: 622 DIK-LDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
           D++ + K  +C ++++                            +V+N GK DG EV  +
Sbjct: 674 DLQVVQKSARCFEVSF----------------------------KVKNTGKYDGEEVSQL 705

Query: 681 YSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
           Y +    +    +KQ+  +ER  +  G+  KV F +   +   +V+     ++ SG   +
Sbjct: 706 YMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGNFHL 764

Query: 740 LVG 742
           ++G
Sbjct: 765 MIG 767


>gi|336408356|ref|ZP_08588849.1| hypothetical protein HMPREF1018_00864 [Bacteroides sp. 2_1_56FAA]
 gi|335937834|gb|EGM99730.1| hypothetical protein HMPREF1018_00864 [Bacteroides sp. 2_1_56FAA]
          Length = 805

 Score =  269 bits (688), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 233/813 (28%), Positives = 355/813 (43%), Gaps = 153/813 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
           Y +   P   R + L+ +MTL EKV QM      +  LG P+Y+                
Sbjct: 40  YENPSAPVEYRVEHLLSQMTLEEKVGQM------LTSLGWPMYKRVGEDIRLTPQLEKEI 93

Query: 58  ----------------WWSEALH---GVSFIGRRTNSPPG---THFDSEVP--------- 86
                           W    LH     S   R +N        H    +P         
Sbjct: 94  GEYHIGSLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPH 153

Query: 87  -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
                G T FPT I   +++N  L +++G+ ++ EA A           + P +++ RDP
Sbjct: 154 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDP 208

Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
           RW RV ET GEDPY+ G      VRG Q     E   D  S    + A  KH+A+Y    
Sbjct: 209 RWSRVEETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---G 257

Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
           W         + + E++++E    PF   V  G + SVM SYN ++G P      LL   
Sbjct: 258 WTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDI 316

Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
           ++  W F G++VSD  ++  + E     ND   +A  + + AG+D D G + Y    + A
Sbjct: 317 LKDRWQFKGFVVSDLYAVGGLREHGVAGNDY--EAAIKAVNAGVDSDLGTNVYAEQLVAA 374

Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
           V++G +A A ID ++R +  +  ++G FD     +      + + +H  LA E ARQ IV
Sbjct: 375 VKRGDVAVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIV 434

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
           LLKN +  LPL   +I+TLA++GP+A+    M+G+Y     +GT       +    +   
Sbjct: 435 LLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKET 493

Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
            + YA GCA +   + +    AI+ A+NADA V+V G     D S E             
Sbjct: 494 RVLYAKGCA-VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQ 552

Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
                  EG DR  L L G Q EL+ +++   K PV LV++    + +  A    +    
Sbjct: 553 ISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVD 611

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
            W  YPG +GG A+ADV+FG YNP GRL ++      V      +P+       G   ++
Sbjct: 612 AW--YPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTRRKGNRSRY 663

Query: 593 FDGPVV--YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
            + P    YPFGYGLSYT F Y         D+K         +  T G++         
Sbjct: 664 IEEPGTPRYPFGYGLSYTTFSY--------TDMK---------VQVTEGSD--------- 697

Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSA 709
                 D +    + ++N G  DG EV  +Y +    +  T  KQ+  + R+ + A +S 
Sbjct: 698 ------DCRVDVTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAAESR 751

Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +V FT++  KSL +       ++  G  TI+VG
Sbjct: 752 EVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 783


>gi|293373755|ref|ZP_06620101.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292631245|gb|EFF49877.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 800

 Score =  269 bits (687), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 230/800 (28%), Positives = 361/800 (45%), Gaps = 141/800 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
           Y D   P   R  DL+ +MTL EK  QM  L YG  R+     P   W    W + +   
Sbjct: 56  YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNI 114

Query: 64  ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
               +G+   G   + P                       P    +  + G     AT F
Sbjct: 115 DEQANGLGKFGSEISYPYANSVKNRHTIQRWFMEQTRLGIPVDFTNEGIRGLCHDRATMF 174

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
           P      A++N+ L ++I +  + EA+A+      G T  +SP +++ +DPRWGRV+E+ 
Sbjct: 175 PAQCGQGATWNKKLIREIAKVTADEAKAL------GYTNIYSPILDIAQDPRWGRVVESY 228

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+VG      + GLQ+ EG             I A  KH+A Y +     +     
Sbjct: 229 GEDPYLVGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 274

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
           D  V  ++M+  ++ PF   + E     VM SYN  +G P       L + +R  W F G
Sbjct: 275 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 334

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
           Y+VSD ++++ +   H+ +  T+E+  A+V+ AGL++      TNFT           A+
Sbjct: 335 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAI 388

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
            +GK++   +D  +  +  V   +G FD   P      +  + N  H  ++ +AA + IV
Sbjct: 389 NEGKVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESIV 448

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
           LLKN+N  LPL+  N   +A++GP+    K +   Y        +   G   Y  +  + 
Sbjct: 449 LLKNENQMLPLSK-NFSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVR 507

Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
           Y  GC DI+                Q  +MI  A++ AK +D  ++V G +     E   
Sbjct: 508 YVKGC-DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDVAILVLGGNEKTVREEFS 566

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R +L L G Q +L+  V    K PV LV++   A  IN+A  N  + +I+   +PGE  G
Sbjct: 567 RTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMG 623

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
            AIA V+FG YNPGGRL +T +  +  +IP+ + P +P ++  G+     DG V+YPFGY
Sbjct: 624 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKGKVR--VDG-VLYPFGY 678

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           GLSYT F Y         D+K+ K          +G  +                  T  
Sbjct: 679 GLSYTTFGYS--------DLKISKP--------VIGPQE----------------NITLS 706

Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
             V+N GK  G EVV +Y +    + T   +V+ G+ER+ +  G+   V FT+   + L 
Sbjct: 707 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLG 765

Query: 723 IVDNAANSLLASGAHTILVG 742
           + D      +  G+ +++VG
Sbjct: 766 LWDKNNRFTVEPGSFSVMVG 785


>gi|189464583|ref|ZP_03013368.1| hypothetical protein BACINT_00926 [Bacteroides intestinalis DSM
           17393]
 gi|189436857|gb|EDV05842.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 879

 Score =  269 bits (687), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 169/456 (37%), Positives = 238/456 (52%), Gaps = 49/456 (10%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY +  L   ERA DLV R+TL EK   M + +  +PRLG+  Y+WW+EALHGV   G  
Sbjct: 41  PYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALHGVGRAGL- 99

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------ 126
                          AT FP  I   ASFN  L   +   +S EARA     +       
Sbjct: 100 ---------------ATVFPQAIGMGASFNNELLYDVFTAISDEARAKNTEFSKEGGLKR 144

Query: 127 --GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLT W+PNIN+ RDPRWGR  ET GEDPY+  +  +  VRGLQ  EG +Y        
Sbjct: 145 YQGLTMWTPNINIFRDPRWGRGQETYGEDPYLTSQMGMAVVRGLQGPEGEKYD------- 197

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            K+ AC KHYA +    W   +R  F++  +  +D+ ET++  F+  V +  V  VMC+Y
Sbjct: 198 -KLHACAKHYAVHSGPEW---NRHSFNAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAY 253

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLND-TKEDAVARVLK 302
           NR  G P C   +LL   +R +W +   +VSDC +I           D  K+ A A+ + 
Sbjct: 254 NRFEGEPCCGSNRLLMHILRDEWGYKEIVVSDCWAISDFYNKGAHETDPDKQHASAKAVL 313

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           +G D++CGD Y +    AV++G I E  ID SL+ L      LG  D   Q  +  +  +
Sbjct: 314 SGTDIECGDSYGSLPE-AVKEGLIDEKQIDISLKRLMKARFELGEMDEPSQVSWAQIPYS 372

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + + +H ELA   AR+ +VLL+N+   LPLN  N+K +A+VGP+AN +    GNY G P
Sbjct: 373 VVDSKEHRELALRMARESLVLLQNNQSLLPLNK-NLK-VAVVGPNANDSVMQWGNYNGFP 430

Query: 421 CRYTSPMDGFYAY---SKVINYAPGC---ADIVCQN 450
               + ++G   Y   S++I Y PGC   +D+  Q+
Sbjct: 431 SHTITLLEGIREYLPESQII-YEPGCDLTSDVTLQS 465



 Score =  113 bits (282), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 84/300 (28%), Positives = 135/300 (45%), Gaps = 53/300 (17%)

Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
           +   ++  K AD  +   G+  +VE E          G DR  + LP  Q+ L+ ++  A
Sbjct: 606 LKQTVNKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKA 665

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K    +V ++     I     +    +IL   YPG+ GG AIA+V+FG YNP GRLP+T
Sbjct: 666 GK---KIVFVNFSGSAIALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVT 722

Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
           +Y++       + +P     +   RTY++     ++PFG+GLSYT F+Y  AS       
Sbjct: 723 FYKST------SQLPGFEDYSMKERTYRYMTEAPLFPFGHGLSYTTFRYGDASL------ 770

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
                Q+ +D   T+ T                       I V N+G+ DG EVV VY +
Sbjct: 771 ---NTQEVKDGEQTILT-----------------------IPVSNVGEYDGEEVVQVYLR 804

Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
            PG        +  ++R  IA G ++ V  +++  +  +  D   N++    G + IL G
Sbjct: 805 RPGDKEGPSHALRAFKRANIAKGATSNVTVSLSK-EDFEWFDTETNTMRPIEGDYEILYG 863


>gi|345881765|ref|ZP_08833275.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
 gi|343918424|gb|EGV29187.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
          Length = 1552

 Score =  269 bits (687), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 223/763 (29%), Positives = 338/763 (44%), Gaps = 133/763 (17%)

Query: 12   FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG-----------------VPRLGLP 54
             PY +A LP   R  DL++RMTL EK+ QM  + +                     +   
Sbjct: 719  LPYQNAALPSAIRVHDLLQRMTLDEKLAQMRHIHFKHYNTDGHVDLTKLRNNYTHSMSFG 778

Query: 55   LYEWW----SEALHGVSFIGRRTNSPPGTHFDSEV------------PGATSFPTVILTT 98
             +E +    ++    VS I  + N+   T F   V             G T FP  I   
Sbjct: 779  CFEAFPYSSTQYRQAVSTI--QQNAADSTRFGIPVIPVIEGIHGIVQDGCTIFPQAIAQG 836

Query: 99   ASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVG 158
            A+FN  L  ++ Q + TE RA+           +P++++ R+ RWGRV ET GEDPY++ 
Sbjct: 837  ATFNPQLVFRMAQHIGTEMRAI-----GARQVLAPDLDIAREQRWGRVEETFGEDPYLIS 891

Query: 159  RYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAY-------DLDNWEGNDRFHFD 211
            R   NYV+G+Q   G+                 KH+ A+       +L + +G  R  FD
Sbjct: 892  RMGYNYVKGIQSRGGI--------------PTLKHFVAHGTPQGGLNLASVKGGQRELFD 937

Query: 212  SRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGY 271
                       ++ PFE  +      SVM  Y+  +     + P  L   +R   +F GY
Sbjct: 938  ----------VYVKPFEYVIRHTKAGSVMNCYSAYDNEAITSSPFFLRTLLRDSLHFKGY 987

Query: 272  IVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADI 331
            I SD  SI  +   H    D++ +A  + + AG+DL+ G  Y       + QG + +A I
Sbjct: 988  IYSDWGSIPMLRYFHH-TADSETEAAQQAINAGVDLEAGSDYYRTAPTLIAQGLLDKARI 1046

Query: 332  DTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPL 391
            D++   +       G FD         +  I  P+ + +A + A + +VLL+N N  LPL
Sbjct: 1047 DSAAAHVLYTKFEAGLFDELASDTLHWRQQIHTPEAVAVAKQLADESLVLLENRNHFLPL 1106

Query: 392  NTGNIKTLALVGPHANATKAMIGNYEGTP-CRY-TSPMDGFYAYSKV---INYAPGCADI 446
            +   + ++A+VGP  NA +   G+Y  T   R+  +P+ G    + +   + Y  GC D 
Sbjct: 1107 DLNRLHSIAVVGP--NAAQVQFGDYSWTADNRHGITPLAGIQQVAGMRTKVRYVKGC-DY 1163

Query: 447  VCQNNSMIPAAIDAAKNADATVIVAGLDL---------SVEAEGKDRVDLLLPGFQTELI 497
              QN   I  A+  AK +D TV+V G            S   EG D  DL+LPG Q +LI
Sbjct: 1164 YSQNTDSIDEAVALAKQSDVTVVVVGTQSMLLARPSQPSTSGEGYDLSDLILPGVQQQLI 1223

Query: 498  NKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG 557
             ++  AA G   +V+M  G   +  A  N K  ++L   Y GE+ G ++A  +FG+ NP 
Sbjct: 1224 ERI--AATGKPFIVVMVTGRPLLTEAFKN-KADALLVQWYGGEQAGLSLAQALFGQLNPS 1280

Query: 558  GRLPITWYEAN-YVKIPYTSMPL-------RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
            GRLPI++ +A   + + Y  +P        +   + PGR Y F D    YPFGYGLSYT 
Sbjct: 1281 GRLPISFPKATGQLPVYYNHLPTDKGYYNKKGTPDKPGRDYVFADPYPAYPFGYGLSYTT 1340

Query: 610  FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
            FKY          + L K Q          TN+    AV            TF+  V+N 
Sbjct: 1341 FKYS--------QLALSKKQ----------TNENDTIAV------------TFR--VQNT 1368

Query: 670  GKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKV 711
            GK  G EV  +Y +       T IKQ+ G+E+  +  G++  +
Sbjct: 1369 GKRAGKEVAQLYIRDMKSSVATPIKQLFGFEKCALQPGETKTI 1411


>gi|423215778|ref|ZP_17202304.1| hypothetical protein HMPREF1074_03836 [Bacteroides xylanisolvens
            CL03T12C04]
 gi|392691421|gb|EIY84666.1| hypothetical protein HMPREF1074_03836 [Bacteroides xylanisolvens
            CL03T12C04]
          Length = 1049

 Score =  269 bits (687), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 219/767 (28%), Positives = 357/767 (46%), Gaps = 100/767 (13%)

Query: 16   DAKLPYPERA----KDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG- 70
            ++KLP+   A    KDL+ RMT+ EK+ Q+     G   L  P  E+ S++L     +G 
Sbjct: 328  NSKLPHTPEADSFVKDLLSRMTVEEKIGQLSQYV-GRTLLTGPESEYLSDSLIARGLVGS 386

Query: 71   ----------RRTNSPPGTHFDSEVP----------GATSFPTVILTTASFNESLWKKIG 110
                      R        H   ++P            T FPT +  + S++ +  ++  
Sbjct: 387  VLNISGAKTLRDLQEKNMRHSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAA 446

Query: 111  QTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
            +  + E+ A      AGL + ++P +++ RD RWGRV+E  GED Y+    A   V G Q
Sbjct: 447  KIAAIESSA------AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQ 500

Query: 170  DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                  ++   ++  L   AC KH+ AY L    G D    D  ++E+ + +T++ PF+ 
Sbjct: 501  ------WNLWENNSVL---ACAKHWVAYGLPQ-AGRDYAPVD--MSERTLFDTYLPPFKA 548

Query: 230  CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            C++ G V + M ++N +NGIP  A P LL   +RG WNF+G++VSD ++++ +V      
Sbjct: 549  CIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLVAQGVAE 607

Query: 290  NDTKEDAVARVLKAGLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
            +D  +DA      +G+D+D  D  Y  +    ++ GKI+  D+D S+  +  +   LG F
Sbjct: 608  DD--KDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF 665

Query: 349  DGSPQYKN--LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHA 406
                ++ N       I   + ++ A + A +  VLLKNDN  LPL   N++++A+VGP A
Sbjct: 666  VDPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPL-AKNVRSIAVVGPLA 724

Query: 407  NATKAMIGNY--EGTPCRYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDA 460
            +    ++G++   G     T+ + G           + YA GC D   ++ S    A+  
Sbjct: 725  DNQTELLGSWRARGEDRHVTTVLQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKEAVKL 783

Query: 461  AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
            A  +D  + V G    +  E + R  L LPG Q ELI ++    K PV +V+M+   + I
Sbjct: 784  ASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGRPLSI 842

Query: 521  NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN-YVKIPYT-SMP 578
             +   N  + +IL   + G   G AIAD++FG YNP GRL I++      V + Y     
Sbjct: 843  EWVDKN--VSAILETWFLGTSAGTAIADILFGDYNPSGRLTISFPRVEGQVPVYYNYKKS 900

Query: 579  LRPVNNFPGRTYKFFDGP--VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINY 636
             RP +     T +  D P   +YPFGYGLSYT F Y V  S +                Y
Sbjct: 901  GRPGDMPHSSTTRHIDVPNAPLYPFGYGLSYTTFSYSVPQSTQK--------------EY 946

Query: 637  TVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQV 695
            T                  +    +  + V N G  DG E V +Y      +    +K++
Sbjct: 947  T------------------RQETISVSVTVTNTGDRDGEETVQLYVNDKVASVVRPVKEL 988

Query: 696  IGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
              ++++F+ AG+S  V F ++   +L   D A N ++  G   I+ G
Sbjct: 989  KAFKKIFLKAGESKTVQFDISPL-ALGFYDAAMNYVVEPGEFEIMTG 1034


>gi|410096880|ref|ZP_11291865.1| hypothetical protein HMPREF1076_01043 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409225497|gb|EKN18416.1| hypothetical protein HMPREF1076_01043 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 799

 Score =  269 bits (687), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 241/814 (29%), Positives = 364/814 (44%), Gaps = 155/814 (19%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WS------ 60
           Y D + P   R +DL+ +MTL EK  QM  L YG  R+    LP   W    W       
Sbjct: 41  YEDPEAPIEARVQDLLNQMTLEEKSCQMATL-YGFGRVLKDSLPTEGWKNEIWKDGIANI 99

Query: 61  -EALHGVSFIGRRTNS---PPGTH----------------------FDSE-VPG-----A 88
            E L+GV    RRT     P   H                      F +E + G     A
Sbjct: 100 DEQLNGVGSARRRTPDLIYPFSNHAEAINKTQRWFIEETRLGIPVDFSNEGIHGLNHTKA 159

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAM-YNLGNAGLTFWSPNINVVRDPRWGRVL 147
           T  P  I   +++N  L  + G     EA+A+ YN        ++P ++V RDPRWGRVL
Sbjct: 160 TPLPAPINIGSTWNRDLVHQAGDIAGKEAKALGYN------NVYAPILDVARDPRWGRVL 213

Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
           ET GEDPY+VG   I  V+G+Q   GV             ++  KH+A Y +     +  
Sbjct: 214 ETYGEDPYLVGELGIQMVKGIQQ-NGV-------------ASTLKHFAVYSIPKGGRDAA 259

Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
              D  V  +++ E  + PF+  V +     VM SYN  +G+P  A    L Q +R ++ 
Sbjct: 260 VRTDPHVAPRELHEIHLYPFKRVVQKAHPKGVMSSYNDWDGVPVTASYYFLTQLLRQEYG 319

Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------M 318
           F GYIVSD ++++  V++   + D+ E+AV +V++AGL++      TNFT          
Sbjct: 320 FKGYIVSDSEAVE-FVQTKHHVADSYEEAVRQVVEAGLNV-----RTNFTHPKDYILPVR 373

Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKN--LGKNNICNPQHIELAAEAAR 376
             V++GK++   +D  +  +  V   LG FD SP  K+       +   +H +   +  +
Sbjct: 374 KLVKEGKLSMKSVDRMVADVLRVKFELGLFD-SPYVKDPKAADKIVGADKHRDFVLDMQK 432

Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--- 433
           Q +VLLKN+N  LPL+    K + + GP A  T  MI  Y        +  DG   Y   
Sbjct: 433 QSLVLLKNENNLLPLDKNQTKKVLIAGPLAKETNYMISRYGPQGLDNITVYDGIKDYLGN 492

Query: 434 SKVINYAPGC--ADIVCQNNSMIPAAI-DAAK-----------NADATVIVAGLDLSVEA 479
              + YA GC   D    ++ ++P  + D  K           + D  + V G D S   
Sbjct: 493 QTEVVYAKGCEVKDANWPDSEIVPTPLTDEEKKGIAEAATAAADCDVIIAVLGEDESCTG 552

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
           E K R  L LPG Q +L+  +    K PV LV+++   + IN+A  N  I SIL   +PG
Sbjct: 553 ESKSRTGLDLPGRQQQLLEALHATGK-PVVLVLINGQPLTINWADRN--IPSILEAWFPG 609

Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP--- 596
           + GG AIA  +FG YNPGGRL +T +  +  +I + + P +P +    +  ++F+GP   
Sbjct: 610 QLGGEAIAQTLFGDYNPGGRLSVT-FPRSIGQIEF-NFPFKPGS----QDGQYFEGPNGS 663

Query: 597 -------VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
                   +YPFGYGLSYT F Y                      N +V    P   +  
Sbjct: 664 GRTRVNGALYPFGYGLSYTTFAYS---------------------NLSVKQETPYSQS-- 700

Query: 650 IDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQS 708
                      T  ++V N GK  G EVV +Y +    +    + V+ G+ER+ +  G++
Sbjct: 701 ---------PVTVTVDVTNTGKRAGDEVVQLYIRDKVSSVIAYESVLRGFERISLQPGET 751

Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
             V F +   + L+I+D      +  G   + +G
Sbjct: 752 KTVSFVL-LPEDLQILDRHMEWTVEPGEFEVRIG 784


>gi|383115340|ref|ZP_09936096.1| hypothetical protein BSGG_2785 [Bacteroides sp. D2]
 gi|313695250|gb|EFS32085.1| hypothetical protein BSGG_2785 [Bacteroides sp. D2]
          Length = 735

 Score =  269 bits (687), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 212/760 (27%), Positives = 355/760 (46%), Gaps = 97/760 (12%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLYEW 58
           Y D K P  +R  DL+ RMTL EKV Q+     G              VP  +G  +Y  
Sbjct: 30  YKDPKAPIEKRVNDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGEEVKKVPAEIGSLIYFE 89

Query: 59  WSEALHGV----SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
            + AL       +    R   P    +D+     T +P  +    S+N  L ++     +
Sbjct: 90  TNPALRNSMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSWNPDLVEQACAVSA 149

Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
            EAR    +     TF SP I+V RDPRWGRV E  GEDPY  G +    V+G       
Sbjct: 150 QEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFGAASVKG------- 197

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
            Y  D  S   +++AC KHY  Y      G D  +  + +++Q + +T++LP+EM V  G
Sbjct: 198 -YQGDDLSAENRMAACLKHYVGYGASE-AGRDYVY--TEISKQTLWDTYLLPYEMGVKAG 253

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
             +++M S+N ++G+P  A+  ++ + ++  W   G+IVSD  +I+ +   ++ L  TK+
Sbjct: 254 -AATLMSSFNDISGVPGSANSYIMTEILKKRWGHDGFIVSDWGAIEQL--KNQGLAATKK 310

Query: 295 DAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
           +A      AGL++D   + Y       V++G+++ A +D ++R + ++  RLG F+    
Sbjct: 311 EAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLFERPYT 370

Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
                K     PQ +++AA  A + +VLLKN+N  LPL   + K +A++GP A     ++
Sbjct: 371 PATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLT--DKKKIAVIGPMAKNGWDLL 428

Query: 414 GNY--EGTPCRYTSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
           G++   G         +G    +A    + YA GCA     N      A++AA+ +D  V
Sbjct: 429 GSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGCA-TKGDNKEGFAEALEAARWSDVVV 487

Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
           +  G  ++   E   R  + LP  Q EL  ++  A K P+ LV+++   +++N  +  P 
Sbjct: 488 LCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPLELN--RLEPI 544

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS--MPLRPVNNFP 586
             +IL +  PG  G   +A ++ G+ NP G+L +T+        PY++  +P+       
Sbjct: 545 SDAILEIWQPGVNGALPMAGILSGRINPSGKLAMTF--------PYSTGQIPIYYNRRKS 596

Query: 587 GRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
           GR ++ F   +    +YPFG+GLSYT+FKY                          GT  
Sbjct: 597 GRGHQGFYKDITSDPLYPFGHGLSYTEFKY--------------------------GTVT 630

Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERV 701
           P    V   D      + + ++ V N+G  DG+E V  +   P  + T  +K++  +E+ 
Sbjct: 631 PSVTKVKRGD------RLSVEVTVTNVGARDGAETVHWFISDPYCSITRPVKELKHFEKQ 684

Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
            I AG++    F ++  +    V+      L +G + ILV
Sbjct: 685 LIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724


>gi|380692997|ref|ZP_09857856.1| beta-glucosidase [Bacteroides faecis MAJ27]
          Length = 837

 Score =  269 bits (687), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 163/431 (37%), Positives = 233/431 (54%), Gaps = 46/431 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y +   P  ER +DL+ ++T+ EKV  +   + G+ R+G+  Y   +EALHG+   G+  
Sbjct: 14  YKNMNAPIHERVQDLLSKLTIEEKVSLLRATSPGIERMGIDKYYMGNEALHGIIRPGK-- 71

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
                          T FP  I   + +N  L   I   +S EARA +N    G      
Sbjct: 72  --------------FTVFPQAIGLASMWNPELHHIIAGVISDEARARWNELERGKKQKDQ 117

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFWSP +N+ RDPRWGR  ET GEDPY+ G     +V+GLQ             R
Sbjct: 118 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQG---------DHPR 168

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LK  A  KH+AA    N E ++RF+ D+ +TE D++E +   FE C+ EG   S+M +Y
Sbjct: 169 YLKAVATPKHFAA----NNEEHNRFYCDAAITETDLREYYFPAFEKCIREGKAESIMTAY 224

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N +NG+P  A+  LLN+ ++ DW F+GYIVSDC +   ++  H+++  T E A    +KA
Sbjct: 225 NAINGVPCTANNWLLNKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVK-TPEAAAMIAIKA 283

Query: 304 GLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           GLD++CGDY + N  + A +Q  ++ A+ID++   +    MRLG FD   +  Y +L   
Sbjct: 284 GLDVECGDYVFANPLLNAYKQYMVSAAEIDSAAYRVLRARMRLGMFDDPEKNPYNHLSPE 343

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            +   +H +LA EAARQ IVLLKN    LPLN   IK++A+VG   NA     G+Y GTP
Sbjct: 344 IVGCKKHHDLALEAARQSIVLLKNQQNTLPLNAQKIKSIAVVG--INAANCEFGDYSGTP 401

Query: 421 CRY-TSPMDGF 430
                S +DG 
Sbjct: 402 VNAPVSVLDGI 412



 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 89/286 (31%), Positives = 143/286 (50%), Gaps = 51/286 (17%)

Query: 462 KNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDIN 521
           + +D  + V G++ S+E EG+DR  + LP  Q   I +   A   P T+V++ AG+  + 
Sbjct: 586 RESDVVIAVMGINQSIEREGQDRNSIELPKDQQIFIREAYKA--NPNTIVVLVAGS-SMA 642

Query: 522 FAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP-LR 580
               +  I +I+   YPGE+GG AIA+V+FG YNP GRLP+T+Y +         +P   
Sbjct: 643 IGWMDQHIPAIIDAWYPGEQGGTAIAEVLFGDYNPAGRLPLTFYNS------IEDLPAFD 696

Query: 581 PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGT 640
             N    RTY +F+G  +Y FGYGLSYT+F Y+        ++ + +D Q   +N++   
Sbjct: 697 DYNVKNNRTYMYFEGKPLYAFGYGLSYTKFDYR--------NLNIKQDTQNVTLNFS--- 745

Query: 641 NKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGY 698
                                    ++N GK +G EV  VY K P  GI  T +KQ+ G+
Sbjct: 746 -------------------------IKNSGKYNGDEVAQVYVKFPDQGIK-TPLKQLKGF 779

Query: 699 ERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVGE 743
           +RV I  G + ++   +   + L++ D+        SG +  +VG+
Sbjct: 780 KRVHIKKGATEQISIEI-PKEELRLWDDQKKQFYTPSGTYHFMVGK 824


>gi|399025517|ref|ZP_10727513.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
           CF314]
 gi|398077894|gb|EJL68841.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
           CF314]
          Length = 875

 Score =  268 bits (686), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 158/434 (36%), Positives = 232/434 (53%), Gaps = 44/434 (10%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +P+ +  LP  +R ++L+  +T+ EK+  M D +  VPRL +P Y WW+EALHGV+  G 
Sbjct: 23  YPFRNPNLPVEQRIENLLGLLTVDEKIGMMMDNSKAVPRLEIPAYGWWNEALHGVARAGT 82

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG------- 124
                           AT FP  I   A+++     K  + +S EARA YN         
Sbjct: 83  ----------------ATVFPQAIGMAAAWDVPEHLKTFEMISDEARAKYNKSFDEASKT 126

Query: 125 --NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
               GLTFW+PNIN+ RDPRWGR  ET GEDPY+     +  V+GLQ          +D 
Sbjct: 127 GRYEGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQG---------NDP 177

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
           +  K  AC KH+A +    W   +R  +++ V+++D+ ET++  F+  V EG+V  VMC+
Sbjct: 178 KYFKTHACAKHFAVHSGPEW---NRHSYNAEVSKRDLYETYLPAFKSLVLEGNVREVMCA 234

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES--HKFLNDTKEDAVARV 300
           YN  +G P CA   LLN+ +RG W + G +VSDC ++    +   H    D K  A A  
Sbjct: 235 YNAFDGQPCCASNTLLNEILRGKWKYDGMVVSDCWALADFYQEKYHGTHPDEKSTA-ADA 293

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLG 358
           LK   DL+CGD Y N    ++  G I E DID S+R +      LG  D   S  +  + 
Sbjct: 294 LKHSTDLECGDTYNNLNK-SLAGGLITEKDIDISMRRILKGWFELGMLDPKSSVLWNQIP 352

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
            + + + +H + A + A++ IVL+KN+N  LP N  NIK +A+VGP+A+     +GNY G
Sbjct: 353 YSVVDSDEHKKQALKMAQKSIVLMKNENNILPFNK-NIKKIAVVGPNADDEMMQLGNYNG 411

Query: 419 TPCRYTSPMDGFYA 432
           TP    + ++G  A
Sbjct: 412 TPSSIVTILEGIKA 425



 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 88/300 (29%), Positives = 136/300 (45%), Gaps = 53/300 (17%)

Query: 459 DAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPV 508
           +  K+AD  V   GL  S+E E          G D+  + LP  Q EL+ ++    K PV
Sbjct: 597 EKVKDADVIVFAGGLSPSLEGEEMLVNAEGFKGGDKTSIELPKVQRELLAELRKTGK-PV 655

Query: 509 TLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN 568
             V+ +  ++ +   + N  +    W G  G+ GG A+ADV+ G YNP GRLP+T+Y+ N
Sbjct: 656 VFVLCTGSSLGLEQDEKNYDVLLNAWYG--GQSGGTAVADVLAGDYNPSGRLPVTFYK-N 712

Query: 569 YVKIPYTSMPLRPVNNFP-----GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
             ++            F      GRTY++     +Y FG+GLSY++F Y  A        
Sbjct: 713 LEQLDNALSKTSKHQGFENYDMQGRTYRYMTENPLYAFGHGLSYSKFNYGNA-------- 764

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
           KL K+               P   ++I             + V N+   DG EVV VY K
Sbjct: 765 KLSKNSIS------------PNEDIII------------TVPVTNISDRDGEEVVQVYVK 800

Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
                   +K +  +ERV I + ++  +  T++  +S K  D  A+ L++ SG +TIL G
Sbjct: 801 RNNDVLAPVKTLRAFERVLIRSKETKNIQLTISK-ESFKFYDEKADDLISKSGDYTILYG 859


>gi|373956830|ref|ZP_09616790.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373893430|gb|EHQ29327.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 823

 Score =  268 bits (686), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 225/799 (28%), Positives = 365/799 (45%), Gaps = 133/799 (16%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WS------ 60
           Y D+  P   R  DL+ +MTL EK  Q+  L YG  R+    +P  EW    W       
Sbjct: 73  YEDSTQPIEARLNDLIGQMTLEEKTCQLATL-YGYKRILKDSVPTPEWKNEIWKDGIANI 131

Query: 61  -EALHGVSFIGRRTNSPPGTHFDSEVPG-------------------------------- 87
            E L+G    G+ ++ P  T     V                                  
Sbjct: 132 DEHLNGFITWGKTSDLPLVTDVKKHVWAMNQTQRFFIEQTRLGIPVDFTNEGIRGVEAYQ 191

Query: 88  ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
           AT+FPT +    ++++ L  ++G     EARA+      G T  ++P ++V RD RWGR+
Sbjct: 192 ATAFPTQLNMGMTWDKPLVNQMGNITGMEARAL------GYTNVYAPILDVARDQRWGRL 245

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
            E  GEDPY+V R  +   +G+Q                +I+A  KH+A Y  +      
Sbjct: 246 EEVYGEDPYLVARLGVEMAKGMQQNN-------------QIAATAKHFAVYSANKGGREG 292

Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
               D +V  ++++   + PF+  + E  +  VM SYN  +GIP       L Q +R ++
Sbjct: 293 LARTDPQVAPREVENILLYPFKKVIKEAGLMGVMSSYNDYDGIPISGSSYWLIQRLRQEF 352

Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAVQ 322
            F GY+VSD D+++ +   H    D K DAV +   AG+++       D    +    V+
Sbjct: 353 GFKGYVVSDSDALEYLYNKHHVAADLK-DAVYQAFMAGMNVRTTFRTPDSIIIYARQLVK 411

Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICN-PQHIELAAEAARQGIVL 381
           +GK+    I++ +R +  V  +LG FD            + N   +  +A +A+++ IVL
Sbjct: 412 EGKLPIDTINSRVRDVLRVKFKLGLFDHPYVQDAEASAKLVNCAANQAVALQASKESIVL 471

Query: 382 LKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY---SKVIN 438
           LKN    LPL+    +TLA++GP+A        +Y     +  + ++G  A     KV+ 
Sbjct: 472 LKNKGAILPLSKQ--QTLAVIGPNALNDDYAHTHYGPLASKSINILEGIQAKVGAGKVL- 528

Query: 439 YAPGC---------ADIVCQN-----NSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
           YA GC         ++I+ Q+      + I +A+  A++AD  V+V G +     E K R
Sbjct: 529 YALGCNLVDKHWPESEILPQDPDQAEQAKIDSAVTIARHADVAVVVLGGNTQTAGENKSR 588

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
             L LPG+Q  L+  V    K PV +V++ +  + IN+   +  I  I++ GYPG +GG 
Sbjct: 589 TSLDLPGYQLRLVKAVKATGK-PVVVVLIGSQPMTINWIDQH--IDGIIYAGYPGTQGGT 645

Query: 545 AIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYG 604
           A+ADV+FG YNPGG+L +T +  +  ++P+ + P +P +           G ++YPFG+G
Sbjct: 646 AVADVLFGDYNPGGKLTLT-FPKSVGQLPF-NFPTKPNSETDEGELAKIKG-LLYPFGFG 702

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
           LSYT F Y         D+K+    Q    N TV                CK        
Sbjct: 703 LSYTTFAYS--------DLKISPAIQSDQGNVTVS---------------CK-------- 731

Query: 665 EVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKI 723
            V N GK+ G EVV +Y +       T+ K + G++R+ +  G++ +V FT+     LK+
Sbjct: 732 -VTNTGKVAGDEVVQLYLRDVLSTVTTYEKVLRGFDRLSLKPGETKEVMFTI-VPDDLKL 789

Query: 724 VDNAANSLLASGAHTILVG 742
            +     ++  G   ++VG
Sbjct: 790 YNRQMKYVVEPGEFKVMVG 808


>gi|315606832|ref|ZP_07881841.1| beta-glucosidase [Prevotella buccae ATCC 33574]
 gi|315251497|gb|EFU31477.1| beta-glucosidase [Prevotella buccae ATCC 33574]
          Length = 858

 Score =  268 bits (686), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 168/478 (35%), Positives = 244/478 (51%), Gaps = 42/478 (8%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           S+       PYC+  L   ERA+DL+ R+TL EK + M D +  +PRLG+  + WWSEAL
Sbjct: 14  SLSATAQLLPYCNPALSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWSEAL 73

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN- 122
           HG + +G                G T FP  +   ASFN+ L +++    S E RA YN 
Sbjct: 74  HGAANMG----------------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNR 117

Query: 123 -LGNAG-------LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
            + N G       L+ W+PN+N+ RDPRWGR  ET GEDPY+        VRGLQ  E  
Sbjct: 118 RMLNGGEDEKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPETA 177

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
           +Y         K+ AC KHYA +    +  +     D  V+ +D+ ET++  F+  V E 
Sbjct: 178 KYR--------KLWACAKHYAVHSGPEYTRHTANVAD--VSPRDLWETYLPAFKTLVTEA 227

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
            V  VMC+Y R++  P C++ +LL Q +R +W F+  +VSDC ++  I  +HK  +D   
Sbjct: 228 KVREVMCAYQRLDDDPCCSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVH 287

Query: 295 DAVARVLKAGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP- 352
            A      AG D++CG  Y   T+  AV++G I EA++D  +  L      LG  D    
Sbjct: 288 AAAK-AAVAGTDVECGFGYAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMDDPKL 346

Query: 353 -QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKA 411
            ++  +  + + +  H +LA + ARQ +VLL+N  G LPL  G    + ++GP+A+    
Sbjct: 347 VEWSKIPASVMDSKAHRQLALDMARQSLVLLQNKGGVLPLKAGG-DPITVIGPNADDGPM 405

Query: 412 MIGNYEGTPCRYTSPMDGFYAYSKVINYAPGC--ADIVCQNNSMIPAAIDAAKNADAT 467
           M GNY GTP R  + +DG  A    + Y  GC   D    N+ +   AID  K    T
Sbjct: 406 MWGNYNGTPNRTVTILDGIKARHTRVTYLKGCDLTDTKTVNSLLPQCAIDGRKGLRGT 463



 Score = 99.8 bits (247), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 74/286 (25%), Positives = 121/286 (42%), Gaps = 57/286 (19%)

Query: 456 AAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAK 505
           A I   +     V V G+  ++E E          G DR ++ LP  Q + +  + +A K
Sbjct: 591 AIIRKLQGIRKVVFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK 650

Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
              T+V ++     I          +IL   Y G+EGG A++DV+FG  NP G+LP+T+Y
Sbjct: 651 ---TVVFVNCSGSAIALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNPSGKLPVTFY 707

Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
           +       Y    +R      GRTY++F  P ++ FGYGLSYT F++  A +        
Sbjct: 708 KRTDQLPDYEDYSMR------GRTYRYFSDP-LFAFGYGLSYTTFRFGRARA-------- 752

Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
                                       +  +  +   + + N G   G EVV VY +  
Sbjct: 753 ----------------------------EAAEGGYRLSVPLTNTGTRPGEEVVQVYIRRV 784

Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL 731
                 +K +  + RV + AG+S  V   ++  KS +  D + N++
Sbjct: 785 ADTNGPLKSLRAFRRVALKAGESTTVEIPLSR-KSFECFDESTNTM 829


>gi|299149090|ref|ZP_07042152.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298513851|gb|EFI37738.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 1049

 Score =  268 bits (686), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 218/767 (28%), Positives = 358/767 (46%), Gaps = 100/767 (13%)

Query: 16   DAKLPYPERA----KDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG- 70
            ++KLP+   A    KDL+ RMT+ EK+ Q+     G   L  P  E+ S++L     +G 
Sbjct: 328  NSKLPHTPEADSFVKDLLSRMTVEEKIGQLSQYV-GRTLLTGPESEYLSDSLIARGLVGS 386

Query: 71   ----------RRTNSPPGTHFDSEVP----------GATSFPTVILTTASFNESLWKKIG 110
                      R        H   ++P            T FPT +  + S++ +  ++  
Sbjct: 387  VLNISGAKTLRDLQEKNMRHSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAA 446

Query: 111  QTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
            +  + E+ A      AGL + ++P +++ RD RWGRV+E  GED Y+    A   V G Q
Sbjct: 447  KIAAIESSA------AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQ 500

Query: 170  DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                  +  +S      + AC KH+ AY L    G D    D  ++E+ + +T++ PF+ 
Sbjct: 501  ---WNLWENNS------VLACAKHWVAYGLPQ-AGRDYAPVD--MSERTLFDTYLPPFKA 548

Query: 230  CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            C++ G V + M ++N +NGIP  A P LL   +RG WNF+G++VSD ++++ +V      
Sbjct: 549  CIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLVAQGVAE 607

Query: 290  NDTKEDAVARVLKAGLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
            +D  +DA      +G+D+D  D  Y  +    ++ GKI+  D+D S+  +  +   LG F
Sbjct: 608  DD--KDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF 665

Query: 349  DGSPQYKN--LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHA 406
                ++ N       I   + ++ A + A +  VLLKNDN  LPL   N++++A+VGP A
Sbjct: 666  VDPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPL-AKNVRSIAVVGPLA 724

Query: 407  NATKAMIGNY--EGTPCRYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDA 460
            +    ++G++   G     T+ + G           + YA GC D   ++ S    A+  
Sbjct: 725  DNQTELLGSWRARGEDRHVTTVLQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKEAVKL 783

Query: 461  AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
            A  +D  + V G    +  E + R  L LPG Q ELI ++    K PV +V+M+   + I
Sbjct: 784  ASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGRPLSI 842

Query: 521  NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN-YVKIPYT-SMP 578
             +   N  + +IL   + G   G AIAD++FG YNP GRL I++      V + Y     
Sbjct: 843  EWVDKN--VSAILETWFLGTSAGTAIADILFGDYNPSGRLTISFPRVEGQVPVYYNYKKS 900

Query: 579  LRPVNNFPGRTYKFFDGP--VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINY 636
             RP +     T +  D P   +YPFGYGLSYT F Y   S+P+S   +  + +       
Sbjct: 901  GRPGDMPHSSTTRHIDVPNAPLYPFGYGLSYTTFSY---SAPQSTQKEYTRQET------ 951

Query: 637  TVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQV 695
                                    +  + V N G  DG E V +Y      +    +K++
Sbjct: 952  -----------------------ISVSVTVTNTGDRDGEETVQLYVNDKVASVVRPVKEL 988

Query: 696  IGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
              ++++F+ AG+S  V F ++   +L   D A N ++  G   I+ G
Sbjct: 989  KAFKKIFLKAGESKTVQFDISPL-ALGFYDAAMNYVVEPGEFEIMTG 1034


>gi|300777563|ref|ZP_07087421.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
 gi|300503073|gb|EFK34213.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
          Length = 896

 Score =  268 bits (686), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 157/431 (36%), Positives = 230/431 (53%), Gaps = 44/431 (10%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +P+ +  LP  ER ++L+  +T  EK+  M D +  VPRL +P Y WW+EALHGV+  G 
Sbjct: 44  YPFRNPDLPVNERIENLLTLLTTEEKIGMMMDNSQAVPRLEIPAYGWWNEALHGVARAGI 103

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG------- 124
                           AT FP  I   A+++     K  + +S EARA YN         
Sbjct: 104 ----------------ATVFPQAIGMAATWDVPEHFKTFEMISDEARAKYNRSFDEALKT 147

Query: 125 --NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
               GLTFW+PNIN+ RDPRWGR  ET GEDPY+     +  V+GLQ          +D 
Sbjct: 148 GRYEGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQG---------NDP 198

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
           +  K  AC KH+A +    W   +R  +++ ++++D+ ET++  F+  V EG+V  VMC+
Sbjct: 199 KFFKTHACAKHFAVHSGPEW---NRHSYNAEISKRDLYETYLPAFKALVQEGNVREVMCA 255

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES--HKFLNDTKEDAVARV 300
           YN  +G P CA+  LL + +RG W + G +VSDC ++    +   H    D K  A A  
Sbjct: 256 YNAFDGQPCCANNTLLTEILRGKWKYDGMVVSDCWALADFFQKKYHGTHPDEKTTA-ADA 314

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLG 358
           LK   DL+CGD Y N    ++  G I E DID S+R +      LG  D   S  +  + 
Sbjct: 315 LKHSTDLECGDTYNNLNK-SLASGLITEKDIDESMRRILKGWFELGMLDPKSSVHWNTIP 373

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
            + + + +H + A + A++ IVL+KN+   LPLN  NIK +A+VGP+A+     +GNY G
Sbjct: 374 YSVVDSEEHKKQALKMAQKSIVLMKNEKNILPLNR-NIKKIAVVGPNADDGLMQLGNYNG 432

Query: 419 TPCRYTSPMDG 429
           TP    + +DG
Sbjct: 433 TPSSIVTILDG 443



 Score =  104 bits (260), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 86/300 (28%), Positives = 134/300 (44%), Gaps = 53/300 (17%)

Query: 459 DAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPV 508
           +  KNAD  V   GL  S+E E          G D+  + LP  Q +L+ ++    K PV
Sbjct: 618 EKVKNADVIVFAGGLSPSLEGEEMMVNAEGFKGGDKTSIALPKVQRDLLAELRKTGK-PV 676

Query: 509 TLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN 568
             V+ +  A+ +   + N    ++L   Y G+ GG A+ADV+ G YNP G+LPIT+Y+ N
Sbjct: 677 VFVLCTGSALGLEQDEKN--YDALLNAWYGGQSGGTAVADVLAGDYNPSGKLPITFYK-N 733

Query: 569 YVKIPYTSMPLRPVNNFP-----GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
             ++            F      GRTY++     +YPFG+GLSY++F Y         D 
Sbjct: 734 LEQLDNALSKTSKHEGFENYDMQGRTYRYMTEKPLYPFGHGLSYSKFVYG--------DS 785

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
           KL K+    + N T+                         I V N+ + +G EVV VY K
Sbjct: 786 KLSKNSISVNENVTI------------------------TIPVTNISEREGEEVVQVYIK 821

Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
               A   +K +  +ER  I + ++  +   ++   S    D  A+ L++  G +TI  G
Sbjct: 822 RNNDAQAPVKTLRAFERTPIKSKETKNIQLILSK-DSFAFYDEKADDLVSKPGDYTIFYG 880


>gi|299144988|ref|ZP_07038056.1| xylosidase [Bacteroides sp. 3_1_23]
 gi|298515479|gb|EFI39360.1| xylosidase [Bacteroides sp. 3_1_23]
          Length = 800

 Score =  268 bits (686), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 230/800 (28%), Positives = 361/800 (45%), Gaps = 141/800 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
           Y D   P   R  DL+ +MTL EK  QM  L YG  R+     P   W    W + +   
Sbjct: 56  YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNI 114

Query: 64  ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
               +G+   G   + P                       P    +  + G     AT F
Sbjct: 115 DEQANGLGKFGSEISYPYANSVKNRHTIQRWFMEQTRLGIPVDFTNEGIRGLCHDRATMF 174

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
           P      A++N+ L ++I +  + EA+A+      G T  +SP +++ +DPRWGRV+E+ 
Sbjct: 175 PAQCGQGATWNKKLIREIAKVTADEAKAL------GYTNIYSPILDIAQDPRWGRVVESY 228

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+VG      + GLQ+ EG             I A  KH+A Y +     +     
Sbjct: 229 GEDPYLVGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 274

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
           D  V  ++M+  ++ PF   + E     VM SYN  +G P       L + +R  W F G
Sbjct: 275 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 334

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
           Y+VSD ++++ +   H+ +  T+E+  A+V+ AGL++      TNFT           A+
Sbjct: 335 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAI 388

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
            +GK++   +D  +  +  V   +G FD   P      +  + N  H  ++ +AA + IV
Sbjct: 389 NEGKVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPEAVVHNDAHKAVSMKAALESIV 448

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
           LLKN+N  LPL+  N   +A++GP+    K +   Y        +   G   Y  +  + 
Sbjct: 449 LLKNENQMLPLSK-NFSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVR 507

Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
           Y  GC DI+                Q  +MI  A++ AK +D  ++V G +     E   
Sbjct: 508 YVKGC-DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDVAILVLGGNEKTVREEFS 566

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R +L L G Q +L+  V    K PV LV++   A  IN+A  N  + +I+   +PGE  G
Sbjct: 567 RTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMG 623

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
            AIA V+FG YNPGGRL +T +  +  +IP+ + P +P ++  G+     DG V+YPFGY
Sbjct: 624 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKGKVR--VDG-VLYPFGY 678

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           GLSYT F Y         D+K+ K          +G  +                  T  
Sbjct: 679 GLSYTTFGYS--------DLKISKP--------VIGPQE----------------NITLS 706

Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
             V+N GK  G EVV +Y +    + T   +V+ G+ER+ +  G+   V FT+   + L 
Sbjct: 707 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLG 765

Query: 723 IVDNAANSLLASGAHTILVG 742
           + D      +  G+ +++VG
Sbjct: 766 LWDKNNRFTVEPGSFSVMVG 785


>gi|313204584|ref|YP_004043241.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312443900|gb|ADQ80256.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 727

 Score =  268 bits (686), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 220/741 (29%), Positives = 349/741 (47%), Gaps = 105/741 (14%)

Query: 7   VKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGV 66
           V  + FP+ +  LP  ER  +L+  MTL EKV  +     GVPRLG+      SE LHG+
Sbjct: 20  VSQTTFPFQNTGLPDNERLDNLLSLMTLDEKVNALST-NLGVPRLGI-RNTGHSEGLHGM 77

Query: 67  SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTA-----SFNESLWKKIGQTVSTEAR--- 118
           +  G      PG    SE   A ++PT I   A     +++  L +K+    +TE R   
Sbjct: 78  ALGG------PGNWGGSERGVAKTYPTTIFPQAYGLGETWDTELIQKVADIEATEIRFYA 131

Query: 119 AMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHR 178
              NL   G+   +PN ++ RDPRWGR  E+ GED ++  R  + +V+GLQ         
Sbjct: 132 QNANLQKGGMVMRAPNADLARDPRWGRTEESYGEDAFLGSRLTVAFVKGLQ--------- 182

Query: 179 DSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSS 238
            +D +  K ++  KH+ A   ++   +   +FD R+     +E +  PF   + EG   +
Sbjct: 183 GNDPKYWKSASLMKHFLANSNEDGRDSTSSNFDERL----FREYYSFPFYKGITEGGSRA 238

Query: 239 VMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVA 298
            M SYN  NG+P   +P +L +  R +W  +G I +D  ++  +V +H       E A A
Sbjct: 239 FMASYNAWNGVPMTVNP-ILKKIARDEWGNNGIICTDGGALSLLVNAHHAFPTLTEGAAA 297

Query: 299 RVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YK 355
            V+KA +     D + ++   A+++G + E +ID  +R  + V ++LG  D       Y 
Sbjct: 298 -VVKASVG-QFLDNFRSYIYEALKKGLLTEKNIDNVIRGNFYVALKLGLLDADQSKVPYT 355

Query: 356 NLGKNNICNPQHIE----LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKA 411
            +G  +  +P + +       +   + +VLLKN  G LPLN   IK++A++GP AN  + 
Sbjct: 356 GIGVTDTVSPWNKQDTKAFVRKVTAKSVVLLKNTAGLLPLNKSKIKSIAVIGPRAN--EV 413

Query: 412 MIGNYEGTPCRYTSPMDGFY-AYSKVIN--YAPGCADIVCQNNSMIPAAIDAAKNADATV 468
           ++  Y GTP    S + G   A  K I   YAP        ++ M  A + AA+ AD  +
Sbjct: 414 LLDWYSGTPPYAVSILQGIKNAVGKDIEVFYAP--------SDEMDKATL-AARKADVAI 464

Query: 469 IVAG---------LDLS-VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAV 518
           +  G           +S V ++G++ VD      + E + K+   A     +V++S    
Sbjct: 465 VCVGNHPYGTDARWKISPVPSDGREAVDRKSITLEQEDLVKLVMQANPKTVMVLVSNFPF 524

Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
            IN+++ N  + +IL V    +E G  +ADVIFG  +P GR   TW ++    +P    P
Sbjct: 525 AINWSQEN--VPAILHVTNNSQELGNGLADVIFGDVSPAGRTTQTWVKS-ITDLP----P 577

Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV 638
           +   +   GRTY++F    +YPFG+GLSYT F+Y                         +
Sbjct: 578 MMDYDIRHGRTYQYFKSKPLYPFGFGLSYTSFEYS-----------------------GL 614

Query: 639 GTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIG 697
            T+ P     +   VK K           N+GK DG EV+ +Y S P       +KQ+ G
Sbjct: 615 ETSNPTLTDSIFVSVKVK-----------NIGKRDGDEVIQLYVSYPDSKVERPMKQLKG 663

Query: 698 YERVFIAAGQSAKVGFTMNAC 718
           ++RVFI AG+S  V   + A 
Sbjct: 664 FKRVFIPAGKSKTVEIPLKAS 684


>gi|423301682|ref|ZP_17279705.1| hypothetical protein HMPREF1057_02846 [Bacteroides finegoldii
            CL09T03C10]
 gi|408471675|gb|EKJ90206.1| hypothetical protein HMPREF1057_02846 [Bacteroides finegoldii
            CL09T03C10]
          Length = 1365

 Score =  268 bits (686), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 234/807 (28%), Positives = 355/807 (43%), Gaps = 162/807 (20%)

Query: 12   FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDL--------------------------- 44
             PY  A LP  ER KDL++RMT  EK+ Q+  +                           
Sbjct: 534  LPYQRADLPIEERVKDLLQRMTPEEKLAQIRHIHSWEIFNGQALDERKLEEKAQGMSWGF 593

Query: 45   AYGVP---------------------RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
              G P                     RLG+P++   +E+LHGV                 
Sbjct: 594  VEGFPLTAENCAKNMLAIQRFMVEKTRLGIPIFTV-AESLHGVVH--------------- 637

Query: 84   EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRW 143
               GAT FP  I   ++F+  L  +    ++ E  A+           SP I+VVRD RW
Sbjct: 638  --EGATVFPQNIALGSTFDTDLAYRKTSMIADELHAV-----GMRQVLSPCIDVVRDLRW 690

Query: 144  GRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWE 203
            GRV E+ GEDPY+ GR+ I  V+G  D                IS   KHY  +      
Sbjct: 691  GRVEESFGEDPYLCGRFGIAEVKGYMDN--------------GISPMLKHYGPH------ 730

Query: 204  GNDRFHFDSRVTE---QDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQ 260
            GN     +    E   +D+ E ++ PFEM + +    +VM +YN  N IP  A   LL  
Sbjct: 731  GNPLSGLNLASVETSIRDLHEVYLKPFEMVMKQAPTLAVMSAYNSWNRIPNSASHYLLTD 790

Query: 261  TIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGA 320
             +R +W F GY+ SD  +I+ +   H F     E+A  + L AGLD++          G 
Sbjct: 791  VLRKEWGFKGYVYSDWGAIEMLKNFH-FTARNSEEAALQALTAGLDVEASSDCYPAIPGL 849

Query: 321  VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
            +++G++    +D ++R +     R+G FD  P  +   K  I + + I L+ + A +  V
Sbjct: 850  IERGELNREIVDEAVRRVLYAKFRIGLFD-DPYGEKFAKGAIHSGKAIALSKKIADESTV 908

Query: 381  LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT-PCRY-TSPMDGFYAYSKV-- 436
            LLKND   LPL+ G +K++A++GP  NA +   G+Y  T   R+  +P+ G   ++    
Sbjct: 909  LLKNDRQLLPLSIGKLKSIAVIGP--NADQIQFGDYTWTRDNRFGVTPLQGIRKWAGTNV 966

Query: 437  -INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG---------LDLSVEAEGKDRVD 486
             +NY  GC+ +V  + S I  A++AA+ +D  V+  G            S   EG D  D
Sbjct: 967  KVNYVKGCS-LVSMDESGIRQAVEAAEQSDVCVLFCGSASAALARDYKSSTCGEGFDLND 1025

Query: 487  LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
            L L G Q  LI  V    K PV LV+++     I + K N  I +IL   Y GE+ G +I
Sbjct: 1026 LTLTGAQPALIKAVQATGK-PVILVLVTGKPFAIPWEKKN--IPAILVQWYAGEQSGNSI 1082

Query: 547  ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF---------PGRTYKFFDGPV 597
            AD++FGK +P GRL  ++ E+    +P     LR    F         PGR Y  F  PV
Sbjct: 1083 ADILFGKVSPSGRLTFSFPEST-GHLPVFYNHLRSDRGFYKSPGSYDSPGRDY-VFSAPV 1140

Query: 598  -VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
             ++ FG+GL+YT F+Y            L  D+    +N TV                  
Sbjct: 1141 PLWSFGHGLTYTTFEYS----------NLQTDRTSYLLNDTVHV---------------- 1174

Query: 657  DYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
                  +I+++N GK +G EVV +Y S         + Q+  + +V + AG++  V  ++
Sbjct: 1175 ------RIDLKNTGKREGKEVVQLYVSDVYSSVAMPVHQLRDFRKVALQAGETQTVRLSI 1228

Query: 716  NACKSLKIVDNAANSLLASGAHTILVG 742
                 L I++    +++  G   I VG
Sbjct: 1229 -PVSELTILNEKNEAIVEPGEFEIQVG 1254


>gi|237718444|ref|ZP_04548925.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
 gi|229452377|gb|EEO58168.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
          Length = 746

 Score =  268 bits (686), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 219/767 (28%), Positives = 355/767 (46%), Gaps = 100/767 (13%)

Query: 16  DAKLPYPERA----KDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           ++KLP+   A    KDL+ RMT+ EK+ Q+     G   L  P  E+ S++L     +G 
Sbjct: 25  NSKLPHTPEADSFVKDLLSRMTVEEKIGQLSQYV-GRTLLTGPESEYLSDSLIARGLVGS 83

Query: 72  ---------------------RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIG 110
                                R   P     D      T FPT +  + S++ +  ++  
Sbjct: 84  VLNISGAKTLRDLQEKNMRYSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAA 143

Query: 111 QTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           +  + E+ A      AGL + ++P +++ RD RWGRV+E  GED Y+    A   V G Q
Sbjct: 144 KIAAIESSA------AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQ 197

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                 ++   ++  L   AC KH+ AY L    G D    D  ++E+ + +T++ PF+ 
Sbjct: 198 ------WNLWENNSVL---ACAKHWVAYGLPQ-AGRDYAPVD--MSERTLFDTYLPPFKA 245

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
           C++ G V + M ++N +NGIP  A P LL   +RG WNF+G++VSD ++++ +V      
Sbjct: 246 CIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLVAQGVAE 304

Query: 290 NDTKEDAVARVLKAGLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
           +D  +DA      +G+D+D  D  Y  +    ++ GKI+  D+D S+  +  +   LG F
Sbjct: 305 DD--KDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF 362

Query: 349 DGSPQYKN--LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHA 406
               ++ N       I   + ++ A + A +  VLLKNDN  LPL   N++++A+VGP A
Sbjct: 363 VDPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPL-AKNVRSIAVVGPLA 421

Query: 407 NATKAMIGNY--EGTPCRYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDA 460
           +    ++G++   G     T+ + G           + YA GC D   ++ S    A+  
Sbjct: 422 DNQTELLGSWRARGEDRHVTTVLQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKEAVKL 480

Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
           A  +D  + V G    +  E + R  L LPG Q ELI ++    K PV +V+M+   + I
Sbjct: 481 ASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGRPLSI 539

Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN-YVKIPYT-SMP 578
            +   N  + +IL   + G   G AIAD++FG YNP GRL I++      V + Y     
Sbjct: 540 EWVDKN--VSAILETWFLGTSAGTAIADILFGDYNPSGRLTISFPRVEGQVPVYYNYKKS 597

Query: 579 LRPVNNFPGRTYKFFDGP--VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINY 636
            RP +     T +  D P   +YPFGYGLSYT F Y V  S +                Y
Sbjct: 598 GRPGDMPHSSTTRHIDVPNAPLYPFGYGLSYTTFSYSVPQSTQK--------------EY 643

Query: 637 TVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQV 695
           T                  +    +  + V N G  DG E V +Y      +    +K++
Sbjct: 644 T------------------RQETISVSVTVTNTGDRDGEETVQLYVNDKVASVVRPVKEL 685

Query: 696 IGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
             ++++F+ AG+S  V F ++   +L   D A N ++  G   I+ G
Sbjct: 686 KAFKKIFLKAGESKTVQFDISPL-ALGFYDAAMNYVVEPGEFEIMTG 731


>gi|218132023|ref|ZP_03460827.1| hypothetical protein BACEGG_03648 [Bacteroides eggerthii DSM 20697]
 gi|217985783|gb|EEC52123.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           eggerthii DSM 20697]
          Length = 762

 Score =  268 bits (686), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 233/781 (29%), Positives = 356/781 (45%), Gaps = 125/781 (16%)

Query: 20  PYPERAKDLVERMTLPEKVQQMGDLAYGVPRL-GLPLYEWWSEALHGVSF---IGRR--- 72
           P   R  DL++RMTL EK+ QM DL +    + G          L G+S+    G R   
Sbjct: 34  PVEVRVADLLKRMTLEEKIAQMQDLKFKDFSVDGKVDTVKMDSVLKGMSYASVFGSRLSV 93

Query: 73  ---------TNSPPGTHFDSEVP--------------GATSFPTVILTTASFNESLWKKI 109
                     N     H    +P              GAT FP  I  +++FN  +  ++
Sbjct: 94  EQMQESMFAINKYMAEHNRLGIPVLGEAESLHGLIHDGATIFPQSIALSSTFNPDITHRV 153

Query: 110 GQTVSTEARAMYNLGNAGL-TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGL 168
              ++ EA+A       G+    SP +++ R+ RWGRV ET GEDPY+VGR  + YV   
Sbjct: 154 ATVIAQEAKA------TGVDQVLSPVLDLARELRWGRVEETYGEDPYLVGRMGVAYVSAF 207

Query: 169 QDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVT--EQDMQETFILP 226
              EGV                 KH+ A+       N      + VT  E+D++  ++ P
Sbjct: 208 NK-EGV-------------MTTLKHFLAHGSPTGGLNL-----ASVTGCERDLRSLYLKP 248

Query: 227 FEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESH 286
           F+  + E    SVM SYN    +P  A   +L+  +RG+  F GYI SD  S++ +   H
Sbjct: 249 FQDVMREAMPYSVMNSYNSYESVPVAASHWILDDILRGEMGFKGYISSDWGSVEMLRSLH 308

Query: 287 KFLNDTKEDAVARVLKAGLDLDC-GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRL 345
               D K DA  + + AG+D++  GD Y       V+ G + E +ID  +  +      +
Sbjct: 309 HTAKD-KADAACQAVIAGVDVEVDGDCYETLD-SLVRSGVLPEKEIDKCVSRVLTAKFAM 366

Query: 346 GYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPH 405
           G FD     +      +  P+ +ELA  AAR+  +L+KN+N  LPL+   ++++A++GP 
Sbjct: 367 GLFDKDYTKRANLSQTVHTPEAVELALVAARESAILVKNENSLLPLDANKLRSVAVIGP- 425

Query: 406 ANATKAMIGNYEGTPCRY--TSPMDGFYAYSK---VINYAPGCADIVCQNNSMIPAAIDA 460
            NA +   G+Y  T       +P+ G  A ++    INYA GC +I  Q+ S    A+ A
Sbjct: 426 -NAAQVQFGDYMWTNSNEYGITPLQGIEAVTQGKVKINYAKGC-EIHTQDRSGFSQAVTA 483

Query: 461 AKNADATVIVAGLDL---------SVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLV 511
           A+N+D  ++  G            SV  E  D  D+ LPG Q  LI  V   A G  T+V
Sbjct: 484 ARNSDVALLFVGAMSGSPGRPWPNSVSGESFDLSDISLPGCQEALIRAV--KATGKPTIV 541

Query: 512 IMSAGA-VDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN-Y 569
           ++ AG    I + K+N +   + W  Y GE+ GRAIA+++FG+ NP GRL +++ ++  +
Sbjct: 542 VLVAGKPFAIPWVKDNCEAVIVQW--YGGEQEGRAIAEILFGEVNPSGRLNVSFPQSTGH 599

Query: 570 VKIPYTSMPL-------RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVD 622
           + + Y   P              PGR Y F     V+ FG+GLSYT FKY          
Sbjct: 600 LPVFYNYYPSDKGFYHDHGTLEKPGRDYVFSSPDPVWAFGHGLSYTTFKY---------- 649

Query: 623 IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY- 681
               K  Q  +  +T             DD  C+       +EV N GK DG EVV +Y 
Sbjct: 650 ----KSMQISNKEFT-------------DDDTCE-----ITVEVANTGKRDGKEVVQLYV 687

Query: 682 SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
           +       T +K++  +E+VFI AG++  V F +   K L + +     ++  G   + V
Sbjct: 688 NDIVSSVVTPVKELRRFEKVFIPAGETRTVKFNL-PIKELALWNTDMKEVVEPGDFELQV 746

Query: 742 G 742
           G
Sbjct: 747 G 747


>gi|383117091|ref|ZP_09937838.1| hypothetical protein BSHG_0805 [Bacteroides sp. 3_2_5]
 gi|382973702|gb|EES87886.2| hypothetical protein BSHG_0805 [Bacteroides sp. 3_2_5]
          Length = 805

 Score =  268 bits (685), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 234/813 (28%), Positives = 354/813 (43%), Gaps = 153/813 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
           Y +   P   R + L+ +MTL EKV QM      +  LG P+YE                
Sbjct: 40  YENPSAPVEYRVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEDIRLTPQLEKEI 93

Query: 58  ----------------WWSEALH---GVSFIGRRTNSPPG---THFDSEVP--------- 86
                           W    LH     S   R +N        H    +P         
Sbjct: 94  GEYHIGSLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPH 153

Query: 87  -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
                G T FPT I   +++N  L +++G+ ++ EA A           + P +++ RDP
Sbjct: 154 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDP 208

Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
           RW RV ET GEDPY+ G      VRG Q     E   D  S    + A  KH+A+Y    
Sbjct: 209 RWSRVEETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---G 257

Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
           W         + + E++++E    PF   V  G + SVM SYN ++G P      LL   
Sbjct: 258 WTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDI 316

Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
           ++  W F G++VSD  ++  + E     ND   +A  + + AG+D D G + Y    + A
Sbjct: 317 LKDRWQFKGFVVSDLYAVGGLREHGVAGNDY--EAAIKAVNAGVDSDLGTNVYAEQLVAA 374

Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
           V++G +A A ID ++R +  +  ++G FD     +      + + +H  LA E ARQ IV
Sbjct: 375 VKRGDVAVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIV 434

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
           LLKN +  LPL   +I+TLA++GP+A+    M+G+Y     +GT       +    +   
Sbjct: 435 LLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKET 493

Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
            + YA GCA +   + +    AI+ A+NAD  V+V G     D S E             
Sbjct: 494 RVLYAKGCA-VRDSSRTGFKDAIETARNADTVVMVMGGSSARDFSSEYEETGAAKVTINQ 552

Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
                  EG DR  L L G Q EL+ +++   K PV LV++    + +  A    +    
Sbjct: 553 ISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVD 611

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
            W  YPG +GG A+ADV+FG YNP GRL ++      V      +P+       G   ++
Sbjct: 612 AW--YPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTRRKGNRSRY 663

Query: 593 FDGPVV--YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
            + P    YPFGYGLSYT F Y         D+K         +  T G++         
Sbjct: 664 VEEPGTPRYPFGYGLSYTTFSY--------TDMK---------VQVTEGSD--------- 697

Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSA 709
                 D      + ++N G  DG EV  +Y +    +  T  KQ+  + R+ + AG+S 
Sbjct: 698 ------DCWVDVTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESR 751

Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +V FT++  KSL +       ++  G  TI+VG
Sbjct: 752 EVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 783


>gi|427385138|ref|ZP_18881643.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727306|gb|EKU90166.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
           12058]
          Length = 863

 Score =  268 bits (685), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 171/456 (37%), Positives = 238/456 (52%), Gaps = 49/456 (10%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY +  L   ERA DLV R+TL EK   M + +  +PRLG+  Y+WW+EALHGV   G  
Sbjct: 25  PYKNPALTPEERAADLVGRLTLEEKASLMQNTSPAIPRLGIKAYDWWNEALHGVGRAGL- 83

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN------- 125
                          AT FP  I   ASFN  L   +   VS EARA     +       
Sbjct: 84  ---------------ATVFPQAIGMGASFNNDLLYDVFTAVSDEARAKTAEFSKEGGLKR 128

Query: 126 -AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLT W+PN+N+ RDPRWGR  ET GEDPY+ G+  +  VRGLQ  EG +Y        
Sbjct: 129 YQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEGGKYD------- 181

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            K+ AC KH+A +    W   +R  FD+  V  +D+ ET++  F+  V +  V  VMC+Y
Sbjct: 182 -KLHACAKHFAVHSGPEW---NRHSFDAENVDPRDLWETYLPAFKDLVQKAHVKEVMCAY 237

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARVL 301
           NR  G P C   +LL Q +R +W + G IVSDC +I       +H+   D KE A A+ +
Sbjct: 238 NRFEGEPCCGSNRLLVQILRDEWAYDGIIVSDCWAINDFFNKGAHETEPD-KEHASAKAV 296

Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGK 359
             G D++CG+ Y +    AV+ G I E  ID SL+ L      LG  D      +  +  
Sbjct: 297 LTGTDVECGESYASLPQ-AVKAGLIDEKKIDISLKRLMKARFELGEMDNPELVSWAQIPY 355

Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
           + + + +H ELA   AR+ +VLL+N+   LPLN  ++K +A+VGP+AN +    GNY G 
Sbjct: 356 SVVDSKEHRELALRMARESLVLLQNNQNVLPLNK-SLK-VAVVGPNANDSVMQWGNYNGF 413

Query: 420 PCRYTSPMDGFYAY--SKVINYAPGC---ADIVCQN 450
           P    + ++G   Y     + Y PGC   +D+  Q+
Sbjct: 414 PGHTVTLLEGIRQYLPEAQLIYEPGCDLTSDVTLQS 449



 Score =  113 bits (283), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 90/299 (30%), Positives = 136/299 (45%), Gaps = 59/299 (19%)

Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
           I   K+AD  V   G+  +VE E          G DR  + LP  Q+ L+ ++  A K  
Sbjct: 594 IQRVKDADIIVFAGGISPAVEGEEMRVTIPGFKGGDRETIELPSIQSRLLAELKKAGK-K 652

Query: 508 VTLVIMSAGAVDINFAKNNPKIKS---ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW 564
           V  V  S  A+ +      P+ K+   IL   YPG+ GG AIA+V+FG YNP GRLP+T+
Sbjct: 653 VVFVNFSGSAIALT-----PETKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVTF 707

Query: 565 YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIK 624
           Y++       + +P     +  GRTY++     ++PFG+GLSYT F+Y  AS        
Sbjct: 708 YKST------SQLPDFEDYSMKGRTYRYMAEAPLFPFGHGLSYTTFRYGDASL------- 754

Query: 625 LDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP 684
               Q+ ++    + T                       I V N G+ DG EVV VY + 
Sbjct: 755 --STQEVKEGEQAILT-----------------------IPVSNTGERDGEEVVQVYLRR 789

Query: 685 PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
           PG        +  ++RV IA G +  V  +++  +  +  D   N++    G + IL G
Sbjct: 790 PGDKEGPSHALRAFKRVNIAKGTTGNVTISLSK-EDFEWFDTETNTMRPIEGDYEILYG 847


>gi|189464211|ref|ZP_03012996.1| hypothetical protein BACINT_00548 [Bacteroides intestinalis DSM
           17393]
 gi|189438001|gb|EDV06986.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 814

 Score =  268 bits (685), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 219/727 (30%), Positives = 335/727 (46%), Gaps = 117/727 (16%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+PL+    E  HG   IG                  T FPT I   +++N  L +++
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------------TTVFPTSIGQASTWNPELIRRM 190

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           G+ ++TEA A           + P +++ RDPRW RV ET GED Y+ G      V+G Q
Sbjct: 191 GRAIATEASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDAYLNGVMGAALVKGFQ 245

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                E+ R       K+ A  KH+AAY    W         + V  ++M+E    PF  
Sbjct: 246 G----EFPRTKG----KVIATLKHFAAY---GWTEGGHNGGSAHVGNREMEEAIYPPFRE 294

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            V  G +S VM SYN ++GIP  A+  LL   ++  W F G++VSD  +I  + E    +
Sbjct: 295 AVAAGALS-VMSSYNEIDGIPCTANSNLLTGLLKERWQFKGFVVSDLYAIGGLREHG--V 351

Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
            DT  +A  + + AG+D D G + Y    + AV++G + E  I+ ++  +  +   +G F
Sbjct: 352 ADTDYEAAVKAVNAGVDSDLGTNVYAGQLVNAVKRGDVQEVVINKAVSRILALKFHMGLF 411

Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
           D     +   +  + + +H+ELA E ARQ I+LLKN N  LPLN    KT+A++GP+A+ 
Sbjct: 412 DHPFVDEREPEQVVASTEHLELAREVARQSIILLKNKNELLPLNK-KTKTIAVIGPNADN 470

Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYAYSKVIN-----YAPGCADIVCQNNSMIPAAIDAA 461
              M+G+Y    +     + +DG     KV N     YA GCA +   + S    AI+AA
Sbjct: 471 IYNMLGDYTAPQSESSVVTVLDGIR--QKVSNDTHIIYAKGCA-VRDSSKSGFQEAIEAA 527

Query: 462 KNADATVIVAG----LDLSVE-------------------AEGKDRVDLLLPGFQTELIN 498
           + +D  V+V G     D S +                    EG DR  L L G Q ELI 
Sbjct: 528 RQSDVVVMVMGGSSARDFSSKYEETGAAKVSDSHISDMESGEGYDRSTLELLGRQRELIR 587

Query: 499 KVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGG 558
           +V    K P+ LV++    + +   +   ++ +I+   YPG +GG A+ADV+FG YNP G
Sbjct: 588 EVGKLNK-PIVLVLIKGRPLLLEGIE--AEVDAIVDAWYPGMQGGNAVADVLFGDYNPAG 644

Query: 559 RLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFF--DGPVVYPFGYGLSYTQFKYKVAS 616
           RL I+      V      +P+       G   K+   +G   YPFGYGLSYT F Y    
Sbjct: 645 RLTIS------VPRSVGQLPVYYNTKRKGNRSKYIEEEGTPRYPFGYGLSYTSFNYS--- 695

Query: 617 SPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSE 676
                           D+   V   +  C                  ++V N G  DG E
Sbjct: 696 ----------------DLKAEVVEAEDSCLV-------------NISVKVRNEGSRDGDE 726

Query: 677 VVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASG 735
           VV +Y +    +  T  KQ+ G++R+ +  G++ ++ F ++  KSL +        +  G
Sbjct: 727 VVQLYLRDEVASFTTPFKQLCGFQRIHLKVGETKEITFRLDK-KSLALYMQNEEWAVEPG 785

Query: 736 AHTILVG 742
             T+++G
Sbjct: 786 RFTLMLG 792


>gi|423212854|ref|ZP_17199383.1| hypothetical protein HMPREF1074_00915 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392694712|gb|EIY87939.1| hypothetical protein HMPREF1074_00915 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 782

 Score =  268 bits (685), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 220/723 (30%), Positives = 343/723 (47%), Gaps = 114/723 (15%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+P++    EA HG   IG                 AT FPT I   A+++  L K++
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------------ATVFPTGIGMAATWSPELVKEV 170

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           GQ ++ E R+       G   + P +++ RDPRW RV ET GEDP + G    + V GL 
Sbjct: 171 GQVIAKEIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLG 225

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                     + S+     A  KH+ AY +   EG    ++ S V  +D+ + F+ PF  
Sbjct: 226 G--------GNLSQKYATIATLKHFLAYAVP--EGGQNGNYAS-VGIRDLHQNFLPPFRK 274

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            ++ G +S VM SYN ++GIP  ++  LL Q +R +W F G++VSD  SI+ I ESH F+
Sbjct: 275 AIDAGALS-VMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FV 332

Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
             TKE+A  + + AG+D+D G D YTN    AVQ G++ +  IDT++  +  +   +G F
Sbjct: 333 APTKENAAIQSVTAGVDVDLGGDAYTNLCH-AVQSGQMDKTVIDTAVCRVLRMKFEMGLF 391

Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
           +       +    +   +HIELA + A+  I LLKN+N  LPL+   I  +A++GP+A+ 
Sbjct: 392 EHPYVDPKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSK-TINKVAVIGPNADN 450

Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYAYSK--VINYAPGCADIVCQNNSMIPAAIDAAKNA 464
              M+G+Y          + +DG         + Y  GCA I     + I  AI+AA+ +
Sbjct: 451 RYNMLGDYTAPQEDSNVKTVLDGILTKLSPFRVEYVRGCA-IRDTTVNEIEQAIEAARRS 509

Query: 465 D----------------------ATVIVAGLDLSVE-AEGKDRVDLLLPGFQTELINKVA 501
           +                      A V   G    +E  EG DR  L L G Q EL+  + 
Sbjct: 510 EVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQ 569

Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
              K P+ +V +    ++ N+A       ++L   YPG+EGG AIADV+FG YNP GRLP
Sbjct: 570 KTGK-PLIVVYIEGRPLEKNWASEYA--DALLTAYYPGQEGGNAIADVLFGDYNPSGRLP 626

Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
           I+    +  +IP       P N+     Y       +Y FGYG+SYT F+Y         
Sbjct: 627 IS-VPRSVGQIPVYYNKKAPRNH----DYVEVSSSPLYSFGYGMSYTTFEYS-------- 673

Query: 622 DIK-LDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
           D++ + K  +C ++++                            +V+N GK DG EV  +
Sbjct: 674 DLQVVQKSARCFEVSF----------------------------KVKNTGKYDGEEVSQL 705

Query: 681 YSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
           Y +    +    +KQ+  +ER  +  G+  KV F +   +   +V+     ++ SG   +
Sbjct: 706 YMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGTFQV 764

Query: 740 LVG 742
           ++G
Sbjct: 765 MIG 767


>gi|398386387|ref|ZP_10544389.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
 gi|397718418|gb|EJK79007.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
          Length = 791

 Score =  268 bits (685), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 217/721 (30%), Positives = 342/721 (47%), Gaps = 109/721 (15%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+P+  +  E LHG + +G                 ATSFP  I   +S++ ++ +++
Sbjct: 138 RLGIPIL-FHEEGLHGYAAVG-----------------ATSFPQSIAMASSWDPTMLRQV 179

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
            Q +  E RA            SP +++ RDPRWGR+ ET GEDPY+VG   +  V GLQ
Sbjct: 180 NQVIGREIRA-----RGVPMVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQ 234

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
             EG    R    RP  + A  KH   +       N      + V+E++++E F  PFE 
Sbjct: 235 G-EG----RSRLLRPGHVFATLKHLTGHGQPESGTN---VGPAPVSERELRENFFPPFEQ 286

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            V    + +VM SYN ++G+P+ A+  LL+  +R +W F G +VSD  ++  ++  H   
Sbjct: 287 VVKRTGIEAVMASYNEIDGVPSHANRWLLDNVLRQEWGFRGAVVSDYSAVDQLMSIHHIA 346

Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGA-VQQGKIAEADIDTSLRFLYIVLMRLGYF 348
            +  E+A  R L AG+D D  +  +  T+G  V++GK++EA +D ++R +  +  R G F
Sbjct: 347 ANL-EEAAMRALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVRRMLELKFRAGLF 405

Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
           +      N       N +   LA  AA++ I LLKND G LPL      T+A++GP  +A
Sbjct: 406 ENPYADANAAAAITNNDEARALARTAAQRSITLLKND-GMLPLKPEG--TIAVIGP--SA 460

Query: 409 TKAMIGNYEGTPCRYTSPMDGFYAYSKV---INYAPGCA---------DIV-----CQNN 451
             A +G Y G P    S ++G  A       I +A G           D V      +N 
Sbjct: 461 AVARLGGYYGQPPHSVSILEGIKARVGTKANIVFAQGVKITENDDWWEDKVVKSDPAENR 520

Query: 452 SMIPAAIDAAKNADATVIVAGLDLSVEAEG------KDRVDLLLPGFQTELINKVADAAK 505
            +I  A++AA+N D  ++  G       EG       DR  L L G Q EL + +    K
Sbjct: 521 KLIAQAVEAARNVDRIILTLGDTEQSSREGWADNHLGDRPSLDLVGEQQELFDALKALGK 580

Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
            P+T+V+++      +  K + +  +IL   Y GE+GG A+AD++FG  NPGG+LP+T  
Sbjct: 581 -PITVVLINGRPA--STVKVSEQANAILEGWYLGEQGGNAVADILFGDVNPGGKLPVT-- 635

Query: 566 EANYVKIPYTSMPLRPVNNF---PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVD 622
                 +P +   L    N      R Y F     +YPFG+GLSYT F     S+P+   
Sbjct: 636 ------VPRSVGQLPMFYNMKPSARRGYLFDTTDPLYPFGFGLSYTNFSL---SAPRLSA 686

Query: 623 IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYS 682
            K             +GT                  K +  ++V N G  +G EVV +Y 
Sbjct: 687 TK-------------IGTGG----------------KTSVSVDVRNTGAREGDEVVQLYI 717

Query: 683 KPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
           +    + T  +K++ G++RV +  G+S  V FT+   ++L++ ++    ++  G   I+ 
Sbjct: 718 RDKVSSVTRPVKELKGFQRVTLKPGESRTVTFTV-GPEALQMWNDQMRRVVEPGDFEIMT 776

Query: 742 G 742
           G
Sbjct: 777 G 777


>gi|409730324|ref|ZP_11271901.1| beta-glucosidase [Halococcus hamelinensis 100A6]
 gi|448724096|ref|ZP_21706609.1| beta-glucosidase [Halococcus hamelinensis 100A6]
 gi|445786548|gb|EMA37314.1| beta-glucosidase [Halococcus hamelinensis 100A6]
          Length = 747

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 208/706 (29%), Positives = 339/706 (48%), Gaps = 99/706 (14%)

Query: 86  PGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWG 144
           P  T+FP  I   +S++  L +++ +   +E  A+      G T   SP ++V RD RWG
Sbjct: 88  PEGTTFPQSIGMASSWDPDLMRQVMERTRSEMAAI------GTTHALSPVLDVARDLRWG 141

Query: 145 RVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEG 204
           RV ET GEDPY+V   A  YV GLQ            S    ISA  KH+AA+      G
Sbjct: 142 RVEETFGEDPYLVAAMASAYVAGLQ----------GPSIEDGISATLKHFAAHSASEG-G 190

Query: 205 NDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRG 264
            +R   +  V  ++++ET + P+E  +      SVM +Y+ ++GIP+ ++  LL   +RG
Sbjct: 191 KNRASVN--VGPRELRETHLFPYEAAITTAGAESVMNAYHDIDGIPSASNEWLLTDLLRG 248

Query: 265 DWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQ 322
           +  F G +VSD  S+  + E H   +  +E AV   L+AG+D++    D Y +    A++
Sbjct: 249 ELGFDGTVVSDYYSVDFLREEHGVSDSDRESAVM-ALEAGIDVELPATDCYEHLPE-AIE 306

Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLL 382
            G+++EA +D ++R +  +  R G  D S    ++  +        EL   AAR+ IVLL
Sbjct: 307 NGELSEATLDEAVRRVLRMKFRKGLVDDSTVDASVAADAFNTEAATELTERAARESIVLL 366

Query: 383 KNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY---------TSPMDGF--Y 431
           KN+N  LPL+  +  +LA+VGP A+  + M+G+Y   P  Y         T+P+D    +
Sbjct: 367 KNENELLPLD--DTDSLAVVGPKADDGQEMMGDY-AYPAHYPEAEVSLDATTPLDAIRVH 423

Query: 432 AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV---AGLDLS------------ 476
           A    I Y  GC       +    A   AA        V   + +D S            
Sbjct: 424 ADGTEIAYEEGCTTSGPSTDGFDAAVEAAAGADVTLAFVGARSAVDFSDPDAEDVTNPAL 483

Query: 477 -VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
               EG D  DL LPG QTEL+ +V +    P+ +V++S     I +     ++ +++  
Sbjct: 484 PTSGEGSDVTDLGLPGVQTELLERVHETGT-PLVVVVVSGKPHSIEWVAE--EVPAVVQA 540

Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEA-NYVKIPYTSMPLRPVNNFPGRTYKFFD 594
             PGEEGG  IADV+FG YNPGG LP++   +   + + Y   P     N   + + + +
Sbjct: 541 WLPGEEGGTGIADVLFGDYNPGGHLPVSLARSVGQLPVHYDRRP-----NSANKDHVYTE 595

Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
              +Y FG+GLSYT+F+Y         D ++  D        T+G +             
Sbjct: 596 SEPLYSFGHGLSYTEFEYD--------DFEVSTD--------TLGASG------------ 627

Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVFIAAGQSAKVG 712
                 T  +   N+G   GS+VV +Y  ++ P  A   +++++G+ERV + AG+S ++ 
Sbjct: 628 ----SVTASVTATNVGGRGGSDVVQLYAHAESPDQA-RPVQELVGFERVSLDAGESTRIS 682

Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLNH 758
           F ++A + L   D   N  +  G++ + VG     ++    +N+N+
Sbjct: 683 FEIDATQ-LAYHDRDMNLRVHDGSYELRVGHSASDIAATGSVNINN 727


>gi|375149998|ref|YP_005012439.1| Beta-glucosidase [Niastella koreensis GR20-10]
 gi|361064044|gb|AEW03036.1| Beta-glucosidase [Niastella koreensis GR20-10]
          Length = 875

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 160/455 (35%), Positives = 233/455 (51%), Gaps = 49/455 (10%)

Query: 5   IKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALH 64
           ++ + S FP+ + +L + +R  DLV R+TL EKV QM + A G+PRL +P Y+WW+E LH
Sbjct: 21  LQAQNSKFPFQNYRLSFEDRVNDLVSRLTLEEKVAQMLNAAPGIPRLDIPAYDWWNETLH 80

Query: 65  GVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
           GV+    RT               T FP  I   A+++ +   ++    + E R ++N  
Sbjct: 81  GVA----RTPY-----------NVTVFPQAIAMAATWDTAALYRMADCSALEGRVIHNKA 125

Query: 125 NA---------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
            A         GLT+W+PNIN+ RDPRWGR  ET GEDPY+    A  +VRGLQ      
Sbjct: 126 IAAGKEKDRYLGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAALADAFVRGLQ------ 179

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
               +D + LK +AC KHYA +   +     R  FD  VT  D+ +T++  F+  V   +
Sbjct: 180 ---GNDPKYLKAAACAKHYAVH---SGPEPSRHVFDVDVTPYDLWDTYLPSFKKLVTVSN 233

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKED 295
           V+ VMC+YN     P CA   L+   +R  W+F GY+ SDC +I     +HK   D    
Sbjct: 234 VAGVMCAYNAFRKQPCCASDVLMTDILRNQWSFKGYVTSDCGAIDDFYRNHKTHPDAAAA 293

Query: 296 AVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQ 353
           +   V   G D+DCG+      + AV++ KI E  ID S++ L+++  RLG FD     +
Sbjct: 294 SADAVFH-GTDIDCGNEAYRALVQAVKENKITEKQIDISVKRLFMIRFRLGMFDPPSMVK 352

Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
           Y       + +  H + A   A + IVLLKN N  LPL  G +K + ++GP+A    A +
Sbjct: 353 YAQTPATELESAAHAKHALLMAHESIVLLKNANNTLPLKKG-LKKIVVLGPNATNVIAPL 411

Query: 414 GNYEGTPCRYTSPMDGF---------YAYSKVINY 439
           GNY GTP +  +   G            Y K +NY
Sbjct: 412 GNYSGTPSKLITLFQGIKEKAGAATQVVYEKAVNY 446



 Score =  115 bits (289), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 94/327 (28%), Positives = 141/327 (43%), Gaps = 60/327 (18%)

Query: 433 YSKVINYAPGCADIVCQNNSMIPAAID------AAKNADATVIVAGLDLSVEAE------ 480
           Y+ V+ Y  G      + ++  PA  D         +ADA +   G+   +E E      
Sbjct: 570 YNLVLEYWQGEGKATIKMHTGHPAVTDFNALVKKYSDADAFIFAGGISPQLEGEEMKVSD 629

Query: 481 ----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
               G DR  +LLP  QTEL+ K   A+  PV  V+M+  A+   +   N  I +I+   
Sbjct: 630 PGFKGGDRTTILLPAIQTELM-KALQASGKPVVFVMMTGSALATPWESEN--IPAIVNAW 686

Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
           Y G+  G A+ADV+FG YNP GRLP+T+Y ++        +P     +   RTY++F G 
Sbjct: 687 YGGQAAGTALADVLFGDYNPSGRLPVTFYGSD------NDLPSFEDYSMKNRTYRYFTGK 740

Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
            +Y FGYGLSYT F+Y   + P +        Q  + +  TV                  
Sbjct: 741 PLYGFGYGLSYTTFRYDQLTMPVTA-------QNGKPVKVTV------------------ 775

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTM 715
                    V N GK  G EV  +Y      +  T +K + G++R+ +   +S  V F +
Sbjct: 776 --------RVTNTGKTTGDEVAQIYVVNENTSIQTALKTLKGFQRISLRPAESKMVSFVL 827

Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
            +   L  VD        +G   I VG
Sbjct: 828 QS-DDLTYVDADGQRKPLTGKIQICVG 853


>gi|294777452|ref|ZP_06742903.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
 gi|294448520|gb|EFG17069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
          Length = 864

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 165/448 (36%), Positives = 240/448 (53%), Gaps = 47/448 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D+ L   ERA+DL++++TL EKV  M D +  V RLG+  Y WW+EALHGV+  G   
Sbjct: 24  YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA---MYNLGNA---- 126
                         AT FP  I   ASF       I   VS EARA    Y+   +    
Sbjct: 82  --------------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERY 127

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT W+P +N+ RDPRWGR +ET GEDPY+     +N V+GLQ +       D++ +  
Sbjct: 128 QGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCM-------DANQKYD 180

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
           KI AC KH+A +    W   +R  F++  +  +D+ ET+++PFE  V E  V  VMC+YN
Sbjct: 181 KIHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAYN 237

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
           R+ G P C   +LL Q +R DW + G ++SDC +I      + HK   D +  + A VL 
Sbjct: 238 RLEGDPCCGSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVL- 296

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
           +G DL+CG  Y      A ++G I+E DID S++ L      LG  D     ++  +  +
Sbjct: 297 SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPDKVEWTKIPYS 355

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            +C+ +H  L+ + AR+ + LL N N  LPL  G  +T+A++GP+AN +    GNY GTP
Sbjct: 356 VVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTP 414

Query: 421 CRYTSPMDGFYAY----SKVINYAPGCA 444
               + ++G  +      K+I Y  GC+
Sbjct: 415 KHTITLLEGIRSAMGENDKLI-YEQGCS 441



 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 95/300 (31%), Positives = 136/300 (45%), Gaps = 54/300 (18%)

Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
           I   +   K+AD  +   G+  S+E E            DR D+ LP  Q ELI  + DA
Sbjct: 592 IKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDA 651

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K    ++ ++     I         ++IL   YPG+ GG+A A+V+FG YNP GRLP+T
Sbjct: 652 GK---KVIFVNFSGSPIAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVT 708

Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
           +Y           +P     N  GRTY++F G  ++PFGYGLSYT F Y         +I
Sbjct: 709 FYRNT------AQLPDFEDYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYG--------NI 754

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
           KL++            T K    A +I             + V N G  DG EVV VY K
Sbjct: 755 KLEQ------------TIKVGETAKII-------------VPVTNTGNRDGEEVVQVYLK 789

Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
               A   +K +  ++RV I AG++  V   +   K L+  D   N++   +G   I+VG
Sbjct: 790 KQEDAEGPVKTLRAFKRVQIPAGKTVNVELELTP-KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|383114908|ref|ZP_09935668.1| hypothetical protein BSGG_5166 [Bacteroides sp. D2]
 gi|382948422|gb|EIC71783.1| hypothetical protein BSGG_5166 [Bacteroides sp. D2]
          Length = 782

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 220/722 (30%), Positives = 341/722 (47%), Gaps = 112/722 (15%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+P++    EA HG   IG                 AT FPT I   A+++  L K++
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------------ATVFPTGIGMAATWSPELVKEV 170

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           GQ ++ E R+       G   + P +++ RDPRW RV ET GEDP + G    + V GL 
Sbjct: 171 GQVIAKEIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLG 225

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                     + S+     A  KH+ AY +   EG    ++ S V  +D+ + F+ PF  
Sbjct: 226 G--------GNLSQKYATIATLKHFLAYAVP--EGGQNGNYAS-VGIRDLHQNFLPPFRK 274

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            ++ G +S VM SYN ++GIP  ++  LL Q +R +W F G++VSD  SI+ I ESH F+
Sbjct: 275 AIDSGALS-VMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FV 332

Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
             TKE+A  + + AG+D+D G D YTN    AVQ G++ +A IDT++  +  +   +G F
Sbjct: 333 ALTKENAAIQSVTAGVDVDLGGDAYTNLCH-AVQSGQMDKAVIDTAVCRVLRMKFEMGLF 391

Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
           +       +    +   +HIELA + A+  I LLKN+N  LPL+   I  +A++GP+A+ 
Sbjct: 392 EHPYVDPKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSK-TINKVAVIGPNADN 450

Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYA--YSKVINYAPGCADIVCQNNSMIPAAIDAAKNA 464
              M+G+Y          + +DG         + Y  GCA I     + I  AI+AA+ +
Sbjct: 451 RYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCA-IRDTTVNEIEQAIEAARRS 509

Query: 465 D----------------------ATVIVAGLDLSVE-AEGKDRVDLLLPGFQTELINKVA 501
           +                      A V   G    +E  EG DR  L L G Q EL+  + 
Sbjct: 510 EVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQ 569

Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
              K P+ +V +    ++ N+A       ++L   YPG+EGG AIADV+FG YNP GRLP
Sbjct: 570 KTGK-PLIVVYIEGRPLEKNWASEYA--DALLTAYYPGQEGGNAIADVLFGDYNPSGRLP 626

Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
           I+    +  +IP       P N+     Y       +Y FGYG+SYT F+Y         
Sbjct: 627 IS-VPRSVGQIPVYYNKKAPRNH----DYVEVSSSPLYSFGYGMSYTTFEYSALQV---- 677

Query: 622 DIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY 681
              + K  +C ++++                            +V+N GK DG EV  +Y
Sbjct: 678 ---VQKSARCFEVSF----------------------------KVKNTGKYDGEEVSQLY 706

Query: 682 SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
            +    +    +KQ+  +ER  +  G+  KV F +   +   +V+     ++ SG   ++
Sbjct: 707 MRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGNFHLM 765

Query: 741 VG 742
           +G
Sbjct: 766 IG 767


>gi|293372493|ref|ZP_06618877.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|299144770|ref|ZP_07037838.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|292632676|gb|EFF51270.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|298515261|gb|EFI39142.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 735

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 212/760 (27%), Positives = 355/760 (46%), Gaps = 97/760 (12%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLYEW 58
           Y D K P  +R  DL+ RMTL EKV Q+     G              VP  +G  +Y  
Sbjct: 30  YKDPKAPIEKRVNDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGEEVKKVPAEIGSLIYFE 89

Query: 59  WSEALHGV----SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
            + AL       +    R   P    +D+     T +P  +    S+N  L ++     +
Sbjct: 90  TNPALRNSMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSWNPDLVEQACAVSA 149

Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
            EAR    +     TF SP I+V RDPRWGRV E  GEDPY  G +    V+G       
Sbjct: 150 QEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFGAASVKG------- 197

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
            Y  D  S   +++AC KHY  Y      G D  +  + +++Q + +T++LP+EM V  G
Sbjct: 198 -YQGDDLSAENRMAACLKHYVGYGASE-AGRDYVY--TEISKQTLWDTYLLPYEMGVKAG 253

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
             +++M S+N ++G+P  A+P ++ + ++  W   G+IVSD  +I+ +   ++ L  TK+
Sbjct: 254 -AATLMSSFNDISGVPGSANPYIMTEILKKRWGHDGFIVSDWGAIEQL--KNQGLAATKK 310

Query: 295 DAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
           +A      AGL++D   + Y       V++G+++ A +D ++R + ++  RLG F+    
Sbjct: 311 EAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLFERPYT 370

Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
                K     PQ +++AA  A + +VLLKN+N  LPL   + K +A++GP A     ++
Sbjct: 371 PATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLT--DKKKIAVIGPMAKNGWDLL 428

Query: 414 GNY--EGTPCRYTSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
           G++   G         +G    +A    + YA GCA     N      A++AA+ +D  V
Sbjct: 429 GSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGCA-TKGDNKEGFAEALEAARWSDVVV 487

Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
           +  G  ++   E   R  + LP  Q EL  ++  A K P+ LV+++   +++N  +    
Sbjct: 488 LCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPLELN--RLELI 544

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS--MPLRPVNNFP 586
             +IL +  PG  G   +A ++ G+ NP G+L +T+        PY++  +P+       
Sbjct: 545 SDAILEIWQPGVNGALPMAGILSGRINPSGKLAMTF--------PYSTGQIPIYYNRRKS 596

Query: 587 GRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
           GR ++ F   +    +YPFG+GLSYT+FKY                          GT  
Sbjct: 597 GRGHQGFYKDITSDPLYPFGHGLSYTEFKY--------------------------GTVT 630

Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERV 701
           P    V   D      + + ++ V N+G  DG+E V  +   P  + T  +K++  +E+ 
Sbjct: 631 PSVTKVKRGD------RLSVEVTVTNVGARDGAETVHWFISDPYCSITRPVKELKHFEKQ 684

Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
            I AG++    F ++  +    V+      L +G + ILV
Sbjct: 685 LIRAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724


>gi|237709184|ref|ZP_04539665.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
 gi|229456880|gb|EEO62601.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
          Length = 864

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 164/448 (36%), Positives = 239/448 (53%), Gaps = 47/448 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y ++ L   ERA+DL++++TL EKV  M D +  V RLG+  Y WW+EALHGV+  G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   ASF       I   VS EARA     +A       
Sbjct: 82  --------------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERY 127

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT W+P +N+ RDPRWGR +ET GEDPY+     +N V+GLQ         D++ +  
Sbjct: 128 QGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCT-------DANQKYD 180

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
           KI AC KH+A +    W   +R  F++  +  +D+ ET+++PFE  V EG V  VMC+YN
Sbjct: 181 KIHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYN 237

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
           R+ G P C   +LL Q +R +W + G ++SDC +I      + HK   D +  + A VL 
Sbjct: 238 RLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVL- 296

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
           +G DL+CG  Y      A ++G I+E DID S++ L      LG  D     ++  +  +
Sbjct: 297 SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYS 355

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            +C+ +H  L+ + AR+ + LL N N  LPL  G  +T+A++GP+AN +    GNY GTP
Sbjct: 356 VVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTP 414

Query: 421 CRYTSPMDGFYAY----SKVINYAPGCA 444
               + ++G  +      K+I Y  GC+
Sbjct: 415 KHTITLLEGIRSAMGENDKLI-YEQGCS 441



 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 94/300 (31%), Positives = 133/300 (44%), Gaps = 54/300 (18%)

Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
           I   +   K+AD  +   G+  S+E E            DR D+ LP  Q ELI  + DA
Sbjct: 592 IKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDA 651

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K    ++ ++     I         ++IL   YPG+ GG+A A+V+FG YNP GRLP+T
Sbjct: 652 GK---KVIFVNFSGSPIAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVT 708

Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
           +Y           +P     N  GRTY++F G  ++PFGYGLSYT F Y         +I
Sbjct: 709 FYRN------IAQLPDFEDYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYD--------NI 754

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
           KLD+  +  +    V                         I V N G  DG EVV VY K
Sbjct: 755 KLDQTIKVGETAKMV-------------------------IPVTNAGNRDGEEVVQVYLK 789

Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
               A    K +  ++RV I AG++  V   +   K L+  D   N++   +G   I+VG
Sbjct: 790 KQEDAEGPAKTLRAFKRVQIPAGKTVNVELELTP-KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|227536644|ref|ZP_03966693.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
           33300]
 gi|227243445|gb|EEI93460.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
           33300]
          Length = 777

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 212/716 (29%), Positives = 334/716 (46%), Gaps = 114/716 (15%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+P++    EA HG   IG                  T FPT I   +++N +L +K+
Sbjct: 126 RLGIPVF-LAEEAPHGHMAIG-----------------TTVFPTGIGQASTWNPALLQKM 167

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
             TV+ E R            + P +++ RDPRW RV E+ GEDP + G  A   V GL 
Sbjct: 168 SATVAKEVRQ-----QGAHISYGPVLDLSRDPRWSRVEESYGEDPVLTGTLAAAIVTGLG 222

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                     + S P       KH+ AY +     N      + + E++++E F+ PF+ 
Sbjct: 223 S--------GNLSDPFATIPTLKHFVAYGIPEGGHNGSA---ASIGERELREYFLPPFQS 271

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            V  G   SVM +YN V+GIP  ++  LL   +R +WNF+G+ VSD  SI+ I  SH+  
Sbjct: 272 AVAAG-AKSVMAAYNSVDGIPCSSNKFLLTDILRKEWNFNGFTVSDLGSIEGIKGSHRVA 330

Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
            D K+ A+   ++AGLD D G       + AV+QG++ E  ID ++  +  +   +G F+
Sbjct: 331 KDHKQAAIL-AIEAGLDADLGGNAYVRLIEAVKQGEVQENSIDQAVSRVLALKFEMGLFE 389

Query: 350 GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT 409
                    K  +    +I L+ + AR+ IVLL+N N  LPL   ++K +A++GP+A+  
Sbjct: 390 KPFVDAKTAKKEVKTEANIALSRQVARESIVLLENKNNILPLRK-DVK-IAIIGPNADNI 447

Query: 410 KAMIGNY-----EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNA 464
             M+G+Y     +G        +      ++V +Y  GC+ I    NS IPAA+ AA+ +
Sbjct: 448 YNMLGDYTAPQPDGAVTTVRQAISARLPKAQV-SYVKGCS-IRDTTNSDIPAAVTAAQQS 505

Query: 465 DATVIVAG----LDLSVE-------------------AEGKDRVDLLLPGFQTELINKVA 501
           D  V V G     D   E                    EG DR  L L G Q EL+  + 
Sbjct: 506 DIIVAVVGGSSARDFKTEYISTGAAVASDKSVSDMESGEGFDRSTLDLLGRQMELLKALK 565

Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
              K P+ ++ +    +++N+A  +       W  YPG+EGG AIADV+FG YNP G++P
Sbjct: 566 QTGK-PLVVIYIQGRPLNMNWAATHADALLCAW--YPGQEGGHAIADVLFGDYNPAGKMP 622

Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
           ++    +  +IP       P+++     Y       +Y FGYG SY+ F+YK        
Sbjct: 623 LS-VPRSVGQIPVHYNRKSPLDH----RYVEEAATPLYAFGYGKSYSDFEYK-------- 669

Query: 622 DIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY 681
           D+K+ KD                           KDY+ +F +   N GK DG EV  +Y
Sbjct: 670 DLKIQKDN--------------------------KDYRVSFTLT--NTGKYDGDEVAQLY 701

Query: 682 SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGA 736
            +    + +  ++Q+  +ER+ +  G+S  V F + A   L +++     +L  G+
Sbjct: 702 IRNQYASVSQPVQQLKHFERIHLKTGESKTVSFVLTAG-DLSVINTQMKKVLEPGS 756


>gi|423230604|ref|ZP_17217008.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
           CL02T00C15]
 gi|423244313|ref|ZP_17225388.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
           CL02T12C06]
 gi|392630748|gb|EIY24734.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
           CL02T00C15]
 gi|392642494|gb|EIY36260.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
           CL02T12C06]
          Length = 864

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 164/448 (36%), Positives = 239/448 (53%), Gaps = 47/448 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y ++ L   ERA+DL++++TL EKV  M D +  V RLG+  Y WW+EALHGV+  G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   ASF       I   VS EARA     +A       
Sbjct: 82  --------------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERY 127

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT W+P +N+ RDPRWGR +ET GEDPY+     +N V+GLQ         D++ +  
Sbjct: 128 QGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCT-------DANQKYD 180

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
           KI AC KH+A +    W   +R  F++  +  +D+ ET+++PFE  V EG V  VMC+YN
Sbjct: 181 KIHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYN 237

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
           R+ G P C   +LL Q +R +W + G ++SDC +I      + HK   D +  + A VL 
Sbjct: 238 RLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVL- 296

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
           +G DL+CG  Y      A ++G I+E DID S++ L      LG  D     ++  +  +
Sbjct: 297 SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYS 355

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            +C+ +H  L+ + AR+ + LL N N  LPL  G  +T+A++GP+AN +    GNY GTP
Sbjct: 356 VVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTP 414

Query: 421 CRYTSPMDGFYAY----SKVINYAPGCA 444
               + ++G  +      K+I Y  GC+
Sbjct: 415 KHTITLLEGIRSAMGENDKLI-YEQGCS 441



 Score =  122 bits (306), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 92/300 (30%), Positives = 132/300 (44%), Gaps = 54/300 (18%)

Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
           I   +   K+AD  +   G+  S+E E            DR D+ LP  Q ELI  + DA
Sbjct: 592 IKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDA 651

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K    ++ ++     I         ++IL   YPG+ GG+A A+V+FG YNP GRLP+T
Sbjct: 652 GK---KVIFVNFSGSPIAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVT 708

Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
           +Y           +P     N  GRTY++F G  ++PFGYGLSYT F Y         +I
Sbjct: 709 FYRN------IAQLPDFEDYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYD--------NI 754

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
           KL++  +  +    V                         I V N G  DG EVV VY K
Sbjct: 755 KLEQTIKVGETAKMV-------------------------IPVTNTGNRDGEEVVQVYLK 789

Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
                    K +  ++RV I AG++  V   +   K L+  D   N++   +G   I+VG
Sbjct: 790 KQEDTEGPTKTLRAFKRVQIPAGKTVNVELELTP-KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|393781488|ref|ZP_10369683.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
           CL02T12C01]
 gi|392676551|gb|EIY69983.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
           CL02T12C01]
          Length = 850

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 162/448 (36%), Positives = 236/448 (52%), Gaps = 47/448 (10%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           +  PY +  L   +RA DL++R+T+ EK+  M + + G+PRLG+  YEWW+EALHGV+  
Sbjct: 12  AQLPYQNPDLTPEQRATDLLQRLTVEEKISLMQNNSPGIPRLGIRPYEWWNEALHGVARA 71

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---- 125
           G                 AT FP  I   ASFN+SL +K+   VS EARA     N    
Sbjct: 72  GL----------------ATVFPQTIGMAASFNDSLVQKVFTAVSDEARAKNRAFNDQGQ 115

Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
                GLT W+PN+N+ RDPRWGR  ET GEDPY+  R  +  V+GLQ  +   Y     
Sbjct: 116 YKRYQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPDSARYD---- 171

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVM 240
               K+ AC KH+A +    W   +R  F++  +  +D+ ET++  F+  V E DV  VM
Sbjct: 172 ----KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKTLVQEADVKEVM 224

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV--- 297
           C+YNR  G P C   +LL Q +R +W F+G +VSDC +I     + K  ++T  DA    
Sbjct: 225 CAYNRFEGDPCCGSNRLLTQILRDEWGFNGIVVSDCGAISDFWGAKK--HNTHPDAAHAS 282

Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL 357
           A  + +G DL+CG  Y   T  AV+ G I+E  ID S++ L      LG  + S  +  L
Sbjct: 283 ADAVLSGTDLECGSNYRKLT-DAVKAGIISEEQIDISVKRLLKARFELGEMEESHPWA-L 340

Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
             + +  P+H  LA + A + + LL+N    LPL+      +A++GP+AN +    GNY 
Sbjct: 341 PYSIVDCPEHRHLALQIAHETMTLLQNKENILPLDKH--AKVAVIGPNANDSVMQWGNYN 398

Query: 418 GTPCRYTSPMDGFYAY--SKVINYAPGC 443
           GTP   ++ +    +   +  + Y P C
Sbjct: 399 GTPSHTSTLLSALRSKLPAAQLIYEPVC 426



 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 87/299 (29%), Positives = 135/299 (45%), Gaps = 55/299 (18%)

Query: 456 AAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAK 505
           A ++  K+ +  +   G+   +E E          G DR D+ LP  Q  ++  +  A K
Sbjct: 579 ATLEKLKDTEIVIFAGGISPLLEGEEMKVSAAGFKGGDRTDIELPAVQRNVLAALKKAGK 638

Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
             V  V  S  A+ +     N    +IL   YPG+EGG A+ADV+FG YNP GRLP+T+Y
Sbjct: 639 -KVIFVNFSGSAMALTPETEN--CDAILQAWYPGQEGGTAVADVLFGDYNPAGRLPVTFY 695

Query: 566 EANYVKIP-YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIK 624
           + N  ++P +    ++      GRTY++     ++PFGYGLSYT F Y  A + K     
Sbjct: 696 K-NMEQLPDFEDYSMQ------GRTYRYMKEAPLFPFGYGLSYTTFTYGKARADKK---- 744

Query: 625 LDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP 684
                        + T +                K T  I V N+G  DG EVV VY + 
Sbjct: 745 ------------RISTGE----------------KMTLTIPVSNIGSRDGEEVVQVYLRR 776

Query: 685 PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
                   K +  ++RV I  G+S  V   +    + +  DN+ +++ +  G + +L G
Sbjct: 777 EDDPEGPTKTLRAFKRVEITKGKSLNVKIEL-PYTAFEWFDNSTHTMHSMKGEYEVLYG 834


>gi|390957160|ref|YP_006420917.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
           18391]
 gi|390412078|gb|AFL87582.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
           18391]
          Length = 908

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 168/444 (37%), Positives = 233/444 (52%), Gaps = 45/444 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y +  L   +RA DLV RMTL EK  QM + A  +PRL +P Y++W+E LHGV+  G   
Sbjct: 24  YLNPALTPQQRAADLVGRMTLEEKSLQMVNGAAAIPRLNVPAYDYWNEGLHGVARSGY-- 81

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   A+++  L K+IG  ++TEARA  N           
Sbjct: 82  --------------ATMFPQAIGMAATWDAPLLKQIGDVIATEARAKNNEALRRNNHDIY 127

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLTFWSPNIN+ RDPRWGR  ET GEDP++  +  +N++ GLQ          +D +  
Sbjct: 128 FGLTFWSPNINIFRDPRWGRGQETYGEDPHLTTQLGVNFIEGLQ---------GTDPKFY 178

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K+ A  KH+A +     EG  R  FD   T  D+ +T++  F   + +    S+MC+YNR
Sbjct: 179 KVIATPKHFAVHSGPE-EG--RHKFDVEPTPHDLWDTYLPQFRAAIVDAKADSIMCAYNR 235

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARVLKA 303
           ++G P C    LL   +R DW F G++ SDC +I       +H+   D  E A    L A
Sbjct: 236 IDGQPACGSKLLLVDILRNDWKFQGFVTSDCGAIDDFFRPNTHQTEPDA-EHADKAALLA 294

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
           G D +CG  Y      AV+ G I E+DID SLR L+   +RLG FD  GS  Y  +  + 
Sbjct: 295 GTDTNCGSTYRKLG-DAVKSGLIKESDIDVSLRRLFEARVRLGLFDPAGSVPYAQIPFSQ 353

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           + +P +  +A  AA + +VLLKND G LPL  G  KT+A++GP+  +  ++ GNY G   
Sbjct: 354 VNSPANAAVAKRAAEESMVLLKND-GILPLKAGKYKTIAVIGPNGASLSSLEGNYNGMAH 412

Query: 422 RYTSPMDGFYAYSKVIN--YAPGC 443
               P+D   +     N  YAPG 
Sbjct: 413 DPRMPVDALRSALSGTNVVYAPGA 436



 Score =  132 bits (331), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 96/304 (31%), Positives = 145/304 (47%), Gaps = 55/304 (18%)

Query: 452 SMIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVA 501
           +++P A++AA  +D  V + GL   +E E          G DR D+ LP  Q  L+  + 
Sbjct: 619 TLLPEALEAANKSDLVVAMLGLSPDLEGEEMPVKLPGFVGGDRTDISLPASQQALLQGLI 678

Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
              K P  +V+++  A+ IN A  + K  +IL   YPGE G  A+AD + G+ NP GRLP
Sbjct: 679 ATGK-PTIVVLLNGSALAINLA--DEKANAILESWYPGEAGSTALADTLVGRNNPSGRLP 735

Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
           IT+Y++       + +P     +   RTY++F G  +Y FG+GLSYT+F Y   S  K  
Sbjct: 736 ITFYKSE------SDLPGFEDYSMQNRTYRYFKGAPLYGFGFGLSYTKFAY---SGLKLA 786

Query: 622 DIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY 681
             KL+                                  T ++ V+N GK+ G EV  +Y
Sbjct: 787 KAKLNAGD-----------------------------TLTAEVTVKNTGKVAGEEVAELY 817

Query: 682 SKPP--GIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHT 738
             PP  G AG   KQ + G++RV +  G+S K+ FT+   + L  VD      +  G + 
Sbjct: 818 LLPPAEGNAGLSPKQQLEGFQRVMLKPGESRKLTFTLTP-RQLSEVDAKGTRAIQPGTYA 876

Query: 739 ILVG 742
           I +G
Sbjct: 877 IAIG 880


>gi|345514226|ref|ZP_08793739.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
 gi|229437207|gb|EEO47284.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
          Length = 864

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 164/448 (36%), Positives = 239/448 (53%), Gaps = 47/448 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y ++ L   ERA+DL++++TL EKV  M D +  V RLG+  Y WW+EALHGV+  G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   ASF       I   VS EARA     +A       
Sbjct: 82  --------------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERY 127

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT W+P +N+ RDPRWGR +ET GEDPY+     +N V+GLQ         D++ +  
Sbjct: 128 QGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCT-------DANQKYD 180

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
           KI AC KH+A +    W   +R  F++  +  +D+ ET+++PFE  V EG V  VMC+YN
Sbjct: 181 KIHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYN 237

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
           R+ G P C   +LL Q +R +W + G ++SDC +I      + HK   D +  + A VL 
Sbjct: 238 RLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVL- 296

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
           +G DL+CG  Y      A ++G I+E DID S++ L      LG  D     ++  +  +
Sbjct: 297 SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYS 355

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            +C+ +H  L+ + AR+ + LL N N  LPL  G  +T+A++GP+AN +    GNY GTP
Sbjct: 356 VVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTP 414

Query: 421 CRYTSPMDGFYAY----SKVINYAPGCA 444
               + ++G  +      K+I Y  GC+
Sbjct: 415 KHTITLLEGIRSAMGENDKLI-YEQGCS 441



 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 94/300 (31%), Positives = 133/300 (44%), Gaps = 54/300 (18%)

Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
           I   +   K+AD  +   G+  S+E E            DR D+ LP  Q ELI  + DA
Sbjct: 592 IKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDA 651

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K    ++ ++     I         ++IL   YPG+ GG+A A+V+FG YNP GRLP+T
Sbjct: 652 GK---KVIFVNFSGSPIAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVT 708

Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
           +Y           +P     N  GRTY++F G  ++PFGYGLSYT F Y         +I
Sbjct: 709 FYRN------IAQLPDFEDYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYD--------NI 754

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
           KLD+  +  +    V                         I V N G  DG EVV VY K
Sbjct: 755 KLDQTIKVGETAKMV-------------------------IPVTNAGNRDGEEVVQVYLK 789

Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
               A    K +  ++RV I AG++  V   +   K L+  D   N++   +G   I+VG
Sbjct: 790 KQEDAEGPAKTLRAFKRVQIPAGKTVNVELELTP-KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|150003731|ref|YP_001298475.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
 gi|319640047|ref|ZP_07994774.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
 gi|345517061|ref|ZP_08796539.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|149932155|gb|ABR38853.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Bacteroides vulgatus ATCC 8482]
 gi|254833833|gb|EET14142.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|317388325|gb|EFV69177.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
          Length = 864

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 165/448 (36%), Positives = 240/448 (53%), Gaps = 47/448 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D+ L   ERA+DL++++TL EKV  M D +  V RLG+  Y WW+EALHGV+  G   
Sbjct: 24  YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA---MYNLGNA---- 126
                         AT FP  I   ASF       I   VS EARA    Y+   +    
Sbjct: 82  --------------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERY 127

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT W+P +N+ RDPRWGR +ET GEDPY+     +N V+GLQ +       D++ +  
Sbjct: 128 QGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCM-------DANQKYD 180

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
           KI AC KH+A +    W   +R  F++  +  +D+ ET+++PFE  V E  V  VMC+YN
Sbjct: 181 KIHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAYN 237

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
           R+ G P C   +LL Q +R DW + G ++SDC +I      + HK   D +  + A VL 
Sbjct: 238 RLEGDPCCGSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVL- 296

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
           +G DL+CG  Y      A ++G I+E DID S++ L      LG  D     ++  +  +
Sbjct: 297 SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPDKVEWTKIPYS 355

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            +C+ +H  L+ + AR+ + LL N N  LPL  G  +T+A++GP+AN +    GNY GTP
Sbjct: 356 VVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTP 414

Query: 421 CRYTSPMDGFYAY----SKVINYAPGCA 444
               + ++G  +      K+I Y  GC+
Sbjct: 415 KHTITLLEGIRSAMGENDKLI-YEQGCS 441



 Score =  126 bits (317), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 96/300 (32%), Positives = 137/300 (45%), Gaps = 54/300 (18%)

Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
           I   +   K+AD  +   G+  S+E E            DR D+ LP  Q ELI  + DA
Sbjct: 592 IKNTVAKVKDADVVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDA 651

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K    ++ ++     I         ++IL   YPG+ GG+A A+V+FG YNP GRLP+T
Sbjct: 652 GK---KVIFVNFSGSPIAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVT 708

Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
           +Y         T +P     N  GRTY++F G  ++PFGYGLSYT F Y         +I
Sbjct: 709 FYRN------ITQLPDFEDYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYG--------NI 754

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
           KL++            T K    A +I             + V N G  DG EVV VY K
Sbjct: 755 KLEQ------------TIKVGETAKII-------------VPVTNTGNRDGEEVVQVYLK 789

Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
               A   +K +  ++RV I AG++  V   +   K L+  D   N++   +G   I+VG
Sbjct: 790 KQEDAEGPVKTLRAFKRVQIPAGKTVNVELELTP-KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|423295566|ref|ZP_17273693.1| hypothetical protein HMPREF1070_02358 [Bacteroides ovatus
           CL03T12C18]
 gi|392672275|gb|EIY65744.1| hypothetical protein HMPREF1070_02358 [Bacteroides ovatus
           CL03T12C18]
          Length = 782

 Score =  267 bits (683), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 221/723 (30%), Positives = 344/723 (47%), Gaps = 114/723 (15%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+P++    EA HG   IG                 AT FPT I   A+++  L K++
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------------ATVFPTGIGMAATWSPELVKEV 170

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           GQ ++ E R+       G   + P +++ RDPRW RV ET GEDP + G    + V GL 
Sbjct: 171 GQVIAKEIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGTLGASMVDGLG 225

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                     + S+     A  KH+ AY +   EG    ++ S V  +D+ + F+ PF  
Sbjct: 226 G--------GNLSQKYATIATLKHFLAYAVP--EGGQNGNYAS-VGIRDLHQNFLPPFRK 274

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            ++ G +S VM SYN ++GIP  ++  LL Q +R +W F G++VSD  SI+ I ESH F+
Sbjct: 275 AIDAGALS-VMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FV 332

Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
             TKE+A  + + AG+D+D G D YTN    AVQ G++ +A IDT++  +  +   +G F
Sbjct: 333 APTKENAAIQSVMAGVDVDLGGDAYTNLCH-AVQSGQMDKAVIDTAVCRVLRMKFEMGLF 391

Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
           +       +    +   +HIELA + A+  I LLKN+N  LPL+   I  +A++GP+A+ 
Sbjct: 392 EHPYVDPKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSK-MINKVAVIGPNADN 450

Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYA--YSKVINYAPGCADIVCQNNSMIPAAIDAAKNA 464
              M+G+Y          + +DG         + Y  GCA I     + I  AI+AA+ +
Sbjct: 451 RYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCA-IRDTTVNEIEQAIEAARRS 509

Query: 465 D----------------------ATVIVAGLDLSVE-AEGKDRVDLLLPGFQTELINKVA 501
           +                      A V   G    +E  EG DR  L L G Q EL+  + 
Sbjct: 510 EVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQ 569

Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
              K P+ +V +    ++ N+A       ++L   YPG+EGG AIADV+FG YNP GRLP
Sbjct: 570 KTGK-PLIVVYIEGRPLEKNWASEYA--DALLTAYYPGQEGGNAIADVLFGDYNPSGRLP 626

Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
           I+    +  +IP       P N+     Y       +Y FGYG+SYT F+Y         
Sbjct: 627 IS-VPRSVGQIPVYYNQKAPRNH----DYVEVSSSPLYSFGYGMSYTTFEYS-------- 673

Query: 622 DIK-LDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
           D++ + K  +C ++++                            +V+N GK DG EV  +
Sbjct: 674 DLQVVQKSARCFEVSF----------------------------KVKNTGKYDGEEVSQL 705

Query: 681 YSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
           Y +    +    +KQ+  +ER  +  G+  KV F +   +   +V+     ++ SG   +
Sbjct: 706 YMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGNFHL 764

Query: 740 LVG 742
           ++G
Sbjct: 765 MIG 767


>gi|212692496|ref|ZP_03300624.1| hypothetical protein BACDOR_01992 [Bacteroides dorei DSM 17855]
 gi|212664971|gb|EEB25543.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           dorei DSM 17855]
          Length = 864

 Score =  267 bits (683), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 164/448 (36%), Positives = 239/448 (53%), Gaps = 47/448 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y ++ L   ERA+DL++++TL EKV  M D +  V RLG+  Y WW+EALHGV+  G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   ASF       I   VS EARA     +A       
Sbjct: 82  --------------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERY 127

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT W+P +N+ RDPRWGR +ET GEDPY+     +N V+GLQ         D++ +  
Sbjct: 128 QGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCT-------DANQKYD 180

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
           KI AC KH+A +    W   +R  F++  +  +D+ ET+++PFE  V EG V  VMC+YN
Sbjct: 181 KIHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYN 237

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
           R+ G P C   +LL Q +R +W + G ++SDC +I      + HK   D +  + A VL 
Sbjct: 238 RLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVL- 296

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
           +G DL+CG  Y      A ++G I+E DID S++ L      LG  D     ++  +  +
Sbjct: 297 SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYS 355

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            +C+ +H  L+ + AR+ + LL N N  LPL  G  +T+A++GP+AN +    GNY GTP
Sbjct: 356 VVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTP 414

Query: 421 CRYTSPMDGFYAY----SKVINYAPGCA 444
               + ++G  +      K+I Y  GC+
Sbjct: 415 KHTITLLEGIRSAMGENDKLI-YEQGCS 441



 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 94/300 (31%), Positives = 133/300 (44%), Gaps = 54/300 (18%)

Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
           I   +   K+AD  +   G+  S+E E            DR D+ LP  Q ELI  + DA
Sbjct: 592 IKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDA 651

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K    ++ ++     I         ++IL   YPG+ GG+A A+V+FG YNP GRLP+T
Sbjct: 652 GK---KVIFVNFSGSPIAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVT 708

Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
           +Y           +P     N  GRTY++F G  ++PFGYGLSYT F Y         +I
Sbjct: 709 FYRN------IAQLPDFEDYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYD--------NI 754

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
           KLD+  +  +    V                         I V N G  DG EVV VY K
Sbjct: 755 KLDQTIKVGETAKMV-------------------------IPVTNAGNRDGEEVVQVYLK 789

Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
               A    K +  ++RV I AG++  V   +   K L+  D   N++   +G   I+VG
Sbjct: 790 KQEDAEGPAKTLRAFKRVQIPAGKTVNVELELTP-KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|423313129|ref|ZP_17291065.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
           CL09T03C04]
 gi|392686343|gb|EIY79649.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
           CL09T03C04]
          Length = 864

 Score =  267 bits (683), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 165/448 (36%), Positives = 240/448 (53%), Gaps = 47/448 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D+ L   ERA+DL++++TL EKV  M D +  V RLG+  Y WW+EALHGV+  G   
Sbjct: 24  YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA---MYNLGNA---- 126
                         AT FP  I   ASF       I   VS EARA    Y+   +    
Sbjct: 82  --------------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERY 127

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT W+P +N+ RDPRWGR +ET GEDPY+     +N V+GLQ +       D++ +  
Sbjct: 128 QGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCM-------DANQKYD 180

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
           KI AC KH+A +    W   +R  F++  +  +D+ ET+++PFE  V E  V  VMC+YN
Sbjct: 181 KIHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAYN 237

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
           R+ G P C   +LL Q +R DW + G ++SDC +I      + HK   D +  + A VL 
Sbjct: 238 RLEGDPCCGSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVL- 296

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
           +G DL+CG  Y      A ++G I+E DID S++ L      LG  D     ++  +  +
Sbjct: 297 SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPDKVEWTKIPYS 355

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            +C+ +H  L+ + AR+ + LL N N  LPL  G  +T+A++GP+AN +    GNY GTP
Sbjct: 356 VVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTP 414

Query: 421 CRYTSPMDGFYAY----SKVINYAPGCA 444
               + ++G  +      K+I Y  GC+
Sbjct: 415 KHTITLLEGIRSAMGENDKLI-YEQGCS 441



 Score =  129 bits (323), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 96/300 (32%), Positives = 138/300 (46%), Gaps = 54/300 (18%)

Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
           I   +   K+AD  +   G+  S+E E            DR D+ LP  Q ELI  + DA
Sbjct: 592 IKNTVAKVKDADVVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDA 651

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K    ++ ++     I         ++IL   YPG+ GG+A+A+V+FG YNP GRLP+T
Sbjct: 652 GK---KVIFVNFSGSPIAMEPETKYCQAILQAWYPGQSGGKAVAEVLFGDYNPAGRLPVT 708

Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
           +Y         T +P     N  GRTY++F G  ++PFGYGLSYT F Y         +I
Sbjct: 709 FYRN------ITQLPNFEDYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYG--------NI 754

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
           KL++            T K    A +I             + V N G  DG EVV VY K
Sbjct: 755 KLEQ------------TIKVGETAKII-------------VPVTNTGNRDGEEVVQVYLK 789

Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
               A   +K +  ++RV I AG++  V   +   K L+  D   N++   +G   I+VG
Sbjct: 790 KQEDAEGPVKTLRAFKRVQIPAGKTVNVELELTP-KQLEWWDTQTNTMRTLAGNFDIMVG 848


>gi|167765093|ref|ZP_02437206.1| hypothetical protein BACSTE_03479 [Bacteroides stercoris ATCC
           43183]
 gi|167696721|gb|EDS13300.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           stercoris ATCC 43183]
          Length = 944

 Score =  267 bits (683), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 228/811 (28%), Positives = 366/811 (45%), Gaps = 146/811 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
           Y D       R ++L+++MTL EK  QM  L YG  R+    LP  EW    W +   G+
Sbjct: 53  YEDPAATLDARIENLLQQMTLEEKTCQMVTL-YGYKRVLKDALPTPEWKQMLWKD---GI 108

Query: 67  SFIGRRTNS---------------PPGTH----------------------FDSE-VPG- 87
             I    N                P   H                      F +E + G 
Sbjct: 109 GAIDEHLNGFQQWGLPPSDNENVWPASRHAWALNEIQRFFVEDTRLGIPVDFTNEGIRGV 168

Query: 88  ----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPR 142
               AT+FPT +    ++N  L +++G     EAR +      G T  ++P ++V RD R
Sbjct: 169 ESYKATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQR 222

Query: 143 WGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNW 202
           WGR  E  GE PY+V    I  VRGLQ       H        +++A  KH+AAY  +  
Sbjct: 223 WGRYEEVYGESPYLVAELGIEMVRGLQ-------HNH------QVAATAKHFAAYSNNKG 269

Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
                   D ++  ++++   I PF+  + E  +  VM SYN  +GIP       L   +
Sbjct: 270 AREGMARVDPQMPPREVENIHIYPFKRVIREAGLLGVMSSYNDYDGIPIQGSYYWLTTRL 329

Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTM 318
           R +  F GY+VSD D+++ +   H    D KE AV + ++AGL++ C     D +     
Sbjct: 330 RKEMGFRGYVVSDSDAVEYLYTKHNTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLR 388

Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIE-LAAEAARQ 377
             V++G ++E  I+  +R +  V   +G FD   Q    G ++    +  E +A +A+R+
Sbjct: 389 ELVKEGGLSEEVINDRVRDILRVKFLIGLFDAPYQTDLAGADDEVEKEANEAVALQASRE 448

Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK-- 435
            IVLLKN +  LPLN   IK +A+ GP+A+     + +Y       T+ ++G    ++  
Sbjct: 449 SIVLLKNTDNTLPLNIDKIKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIREKAQGK 508

Query: 436 -VINYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEA 479
             + Y  GC D+V  +                + I  A+  A+ AD  V+V G       
Sbjct: 509 AEVLYTKGC-DLVDAHWPESEIMEYPLTPDEQAEIDRAVANARQADVAVVVLGGGQRTCG 567

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
           E K R  L LPG Q +L+  V    K PV L++++   + +N+A  +  + +IL   YPG
Sbjct: 568 ENKSRTSLELPGHQLKLLQAVQATGK-PVILILINGRPLSVNWA--DKFVPAILEAWYPG 624

Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV-- 597
            +GG  +AD++FG YNPGG+L +T +     +IP+ + P +P +   G      DG +  
Sbjct: 625 SKGGTVVADILFGDYNPGGKLTVT-FPKTVGQIPF-NFPYKPASQIDGGKNPGPDGNMSR 682

Query: 598 ----VYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
               +YPFGYGLSYT F+Y  +  +PK +                               
Sbjct: 683 INGALYPFGYGLSYTTFEYSDLEITPKVI------------------------------- 711

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKV 711
               + K T +++V N GK  G EVV +Y++       T+ K + G+ER+ +  G+S ++
Sbjct: 712 --TPNQKATIRLKVTNTGKRAGDEVVQLYTRDILSSVTTYEKNLAGFERIHLKPGESKEI 769

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            FT++  K L++++      +  G   I+ G
Sbjct: 770 VFTLDR-KHLELLNADMKWTVEPGEFAIMAG 799


>gi|53712134|ref|YP_098126.1| beta-glucosidase [Bacteroides fragilis YCH46]
 gi|52214999|dbj|BAD47592.1| periplasmic beta-glucosidase precursor [Bacteroides fragilis YCH46]
          Length = 812

 Score =  267 bits (683), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 235/817 (28%), Positives = 358/817 (43%), Gaps = 163/817 (19%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
           Y +   P   R + L+ +MTL EKV QM      +  LG P+YE                
Sbjct: 49  YENPSAPVEYRVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEDIRLTPQLEKEI 102

Query: 58  ----------------WWSEALH---GVSFIGRRTNSPPG---THFDSEVP--------- 86
                           W    LH     S   R +N        H    +P         
Sbjct: 103 GEYHIGSLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPH 162

Query: 87  -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
                G T FPT I   +++N  L +++G+ ++ EA A           + P +++ RDP
Sbjct: 163 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDP 217

Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
           RW RV ET GEDPY+ G      VRG Q     E   D  S    + A  KH+A+Y    
Sbjct: 218 RWSRVEETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---G 266

Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
           W         + + E++++E    PF   V  G + SVM SYN ++G P      LL   
Sbjct: 267 WTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDI 325

Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
           ++  W F G++VSD  ++  + E     ND   +A  + + AG+D D G + Y    + A
Sbjct: 326 LKDRWQFKGFVVSDLYAVGGLREHGVAGNDY--EAAIKAVNAGVDSDLGTNVYAEQLVAA 383

Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
           V++G +A A ID ++R +  +  ++G FD     +      + + +H  LA E ARQ IV
Sbjct: 384 VKRGDVAVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIV 443

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
           LLKN +  LPL   +I+TLA++GP+A+    M+G+Y     +GT       +    +   
Sbjct: 444 LLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKET 502

Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
            + YA GCA +   + +    AI+ A+NADA V+V G     D S E             
Sbjct: 503 RVLYAKGCA-VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQ 561

Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLV----IMSAGAVDINFAKNNPK 528
                  EG DR  L L G Q EL+ +++   K PV L+    ++  GA+         +
Sbjct: 562 ISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLIKGRPLLMEGAIQ--------E 612

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGR 588
            ++I+   YPG +GG A+ADV+FG YNP GRL ++      V      +P+       G 
Sbjct: 613 AEAIVDAWYPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTRRKGN 666

Query: 589 TYKFFDGPVV--YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
             ++ + P    YPFGYGLSYT F Y         D+K         +  T G++     
Sbjct: 667 RSRYVEEPGTPRYPFGYGLSYTTFSY--------TDMK---------VQVTEGSD----- 704

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAA 705
                     D      + ++N G  DG EV  +Y +    +  T  KQ+  + R+ + A
Sbjct: 705 ----------DCWVDVTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKA 754

Query: 706 GQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           G+S +V FT++  KSL +       ++  G  TI+VG
Sbjct: 755 GESREVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 790


>gi|265752711|ref|ZP_06088280.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           3_1_33FAA]
 gi|263235897|gb|EEZ21392.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           3_1_33FAA]
          Length = 864

 Score =  267 bits (683), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 164/448 (36%), Positives = 239/448 (53%), Gaps = 47/448 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y ++ L   ERA+DL++++TL EKV  M D +  V RLG+  Y WW+EALHGV+  G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   ASF       I   VS EARA     +A       
Sbjct: 82  --------------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERY 127

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT W+P +N+ RDPRWGR +ET GEDPY+     +N V+GLQ         D++ +  
Sbjct: 128 QGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCT-------DANQKYD 180

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
           KI AC KH+A +    W   +R  F++  +  +D+ ET+++PFE  V EG V  VMC+YN
Sbjct: 181 KIHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYN 237

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
           R+ G P C   +LL Q +R +W + G ++SDC +I      + HK   D +  + A VL 
Sbjct: 238 RLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVL- 296

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
           +G DL+CG  Y      A ++G I+E DID S++ L      LG  D     ++  +  +
Sbjct: 297 SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYS 355

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            +C+ +H  L+ + AR+ + LL N N  LPL  G  +T+A++GP+AN +    GNY GTP
Sbjct: 356 VVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTP 414

Query: 421 CRYTSPMDGFYAY----SKVINYAPGCA 444
               + ++G  +      K+I Y  GC+
Sbjct: 415 KHTITLLEGIRSAMGENDKLI-YEQGCS 441



 Score =  121 bits (304), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 92/300 (30%), Positives = 132/300 (44%), Gaps = 54/300 (18%)

Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
           I   +   K+AD  +   G+  S+E E            DR D+ LP  Q ELI  + DA
Sbjct: 592 IKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDA 651

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K    ++ ++     I         ++IL   YPG+ GG+A A+V+FG YNP GRLP+T
Sbjct: 652 GK---KVIFVNFSGSPIAMEPETQYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVT 708

Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
           +Y           +P     N  GRTY++F G  ++PFGYGLSYT F Y         +I
Sbjct: 709 FYRN------IAQLPDFEDYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYG--------NI 754

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
           KL++  +  +    V                         I V N G  DG EVV VY K
Sbjct: 755 KLEQTIKVGETAKMV-------------------------IPVTNTGNRDGEEVVQVYLK 789

Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
                    K +  ++RV I AG++  V   +   K L+  D   N++   +G   I+VG
Sbjct: 790 KQEDTEGPAKTLRAFKRVQIPAGKTVNVELELTP-KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|365121873|ref|ZP_09338785.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363644185|gb|EHL83481.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 850

 Score =  267 bits (683), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 161/430 (37%), Positives = 234/430 (54%), Gaps = 46/430 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D   P  ER  DL+ R+T+ EK+  +   + G+PRL +  Y   +EALHG+       
Sbjct: 27  YKDMDAPQHERIMDLLSRLTIEEKISLLRATSPGIPRLEIEKYYHGNEALHGIV------ 80

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
              PG          T FP  I   + +N     +I   +S EARA +N  N G      
Sbjct: 81  --RPGNF--------TVFPQAIGLASMWNPDFLYEISTVISDEARARWNELNRGKDQKRL 130

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFWSP +N+ RDPRWGR  ET GEDP++ G+  + +V+GLQ          +D R
Sbjct: 131 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVAFVKGLQG---------NDPR 181

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LK+ +  KH+AA    N E ++RF  + +++E+D++E ++  FE C+ +G   S+M +Y
Sbjct: 182 YLKVVSTPKHFAA----NNEEHNRFECNPQISERDLREYYLPAFERCIIDGKAQSIMTAY 237

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N +N +P   +  LL + +R DW F+GY+VSDC +   +V  HK++  T E A    LKA
Sbjct: 238 NAINDVPCTLNTWLLKKVLRTDWGFNGYVVSDCGAPSLLVTHHKYVK-TPEAAATLALKA 296

Query: 304 GLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           GLDL+CGD  Y    M A +Q  ++EA+IDT+   +    M LG FD   +  Y  L  +
Sbjct: 297 GLDLECGDNVYIEPLMNAYKQYMVSEAEIDTAAYRILRARMMLGLFDDPAKNPYNALSPS 356

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            +   +H  +A EAARQ +VLLKN+N  LP+N   IK++A+VG   NA     G+Y G P
Sbjct: 357 IVGCEKHKNMALEAARQSLVLLKNENNFLPINPKKIKSIAVVG--INAGNCEFGDYSGKP 414

Query: 421 CRY-TSPMDG 429
                S +DG
Sbjct: 415 VNVPVSVLDG 424



 Score =  128 bits (322), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 93/288 (32%), Positives = 143/288 (49%), Gaps = 47/288 (16%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAG 516
           A  A +  D T+ V G++ S+E EG+DR  + LP  Q EL  + A      + +V+++  
Sbjct: 594 AKKAIQECDMTIAVMGINKSIEREGRDRDHIELPKDQ-ELFIEEAYKLNPKMAVVLVAGS 652

Query: 517 AVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS 576
           ++ +N+   +  + +IL   YPGE+GG A+A+ +FG YNP GRLP+T+Y +     P+  
Sbjct: 653 SLAVNWMDEH--VPAILNAWYPGEQGGTAVAEALFGDYNPAGRLPLTYYRSLDDLPPFDD 710

Query: 577 MPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINY 636
             ++       RTY +F G  +Y FGYGLSYT+F Y+          KL  DQ   ++  
Sbjct: 711 YAVQ-----KNRTYMYFTGKPLYAFGYGLSYTKFDYR----------KLSVDQDAENV-- 753

Query: 637 TVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQV 695
                                 + +F I  +N GK +G EV  VY + P I     IKQ+
Sbjct: 754 ----------------------RLSFTI--KNSGKYNGDEVAQVYVQFPEIGVKVPIKQL 789

Query: 696 IGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
            G+ERV IA G++  V  T+   K L+I +         SG +  +VG
Sbjct: 790 KGFERVHIAKGKTLPVTITV-PKKELRIWNERKGEFFTPSGNYVFMVG 836


>gi|255690486|ref|ZP_05414161.1| periplasmic beta-glucosidase [Bacteroides finegoldii DSM 17565]
 gi|260623937|gb|EEX46808.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
            finegoldii DSM 17565]
          Length = 1365

 Score =  267 bits (683), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 233/807 (28%), Positives = 360/807 (44%), Gaps = 162/807 (20%)

Query: 12   FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDL--------------------------- 44
             PY  A LP  ER KDL++RMT  EK+ Q+  +                           
Sbjct: 534  LPYQRADLPIEERVKDLLQRMTPEEKLAQIRHIHSWEIFNGQALDERKLEEKAQGMSWGF 593

Query: 45   AYGVP---------------------RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
              G P                     RLG+P++   +E+LHGV                 
Sbjct: 594  VEGFPLTAENCAKNMLAIQRFMVEKTRLGIPIFTV-AESLHGVVH--------------- 637

Query: 84   EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRW 143
               GAT FP  I   ++F+  L  +    ++ E  A+           SP I+VVRD RW
Sbjct: 638  --EGATVFPQNIALGSTFDTDLAYRKTSMIADELHAV-----GMRQVLSPCIDVVRDLRW 690

Query: 144  GRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWE 203
            GRV E+ GEDPY+ GR+ I  V+G  D                IS   KHY  +      
Sbjct: 691  GRVEESFGEDPYLCGRFGIAEVKGYMDN--------------GISPMLKHYGPH------ 730

Query: 204  GNDRFHFDSRVTE---QDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQ 260
            GN     +    E   +D+ E ++ PFEM + +    +VM +YN  N IP  A   LL  
Sbjct: 731  GNPLSGLNLASVETSIRDLHEVYLKPFEMVMKQAPTLAVMSAYNSWNRIPNSASHYLLTD 790

Query: 261  TIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGA 320
             +R +W F GY+ SD  +I+ +   H F     E+A  + L AGLD++          G 
Sbjct: 791  VLRKEWGFKGYVYSDWGAIEMLKNFH-FTARNSEEAALQALTAGLDVEASSDCYPAIPGL 849

Query: 321  VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
            +++G++    +D ++R +     R+G FD  P  +   K  I + + I L+ + A +  V
Sbjct: 850  IERGELNREIVDEAVRRVLYAKFRIGLFD-DPYGEKFAKGAIHSGKAIALSKKIADESTV 908

Query: 381  LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT-PCRY-TSPMDGFYAYSKV-- 436
            LLKN+   LPL+ G +K++A++GP  NA +   G+Y  T   R+  +P+ G   ++    
Sbjct: 909  LLKNERQLLPLSIGKLKSIAVIGP--NADQIQFGDYTWTRDNRFGVTPLQGIRKWAGTNV 966

Query: 437  -INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG---------LDLSVEAEGKDRVD 486
             +NYA GC+ +V  + S I  A++AA+ +D  V+  G            S   EG D  D
Sbjct: 967  KVNYAKGCS-LVSMDESGIRQAVEAAEQSDVCVLFCGSASAALARDYKSSTCGEGFDLND 1025

Query: 487  LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
            L L G Q  LI  V    K PV LV+++     I + K N  I +IL   Y GE+ G +I
Sbjct: 1026 LTLTGAQPALIKAVQATGK-PVILVLVTGKPFAIPWEKKN--IPAILVQWYAGEQSGNSI 1082

Query: 547  ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF---------PGRTYKFFDGPV 597
            AD++FGK +P GRL  ++ E+    +P     LR    F         PGR Y  F  PV
Sbjct: 1083 ADILFGKVSPSGRLTFSFPEST-GHLPVYYNHLRSDRGFYKSPGSYDSPGRDY-VFSAPV 1140

Query: 598  -VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
             ++ FG+GL+YT F+Y         +++ D+                  A+ L++D    
Sbjct: 1141 PLWSFGHGLTYTTFEYS--------NLQTDR------------------ASYLLNDT--- 1171

Query: 657  DYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
                  +I ++N GK +G EVV +Y S         ++Q+  + +V + AG++  V  ++
Sbjct: 1172 ---VHVRIGLKNTGKCEGKEVVQLYVSDVCSSVAMPVRQLRDFRKVALQAGETQIVRLSI 1228

Query: 716  NACKSLKIVDNAANSLLASGAHTILVG 742
                 L I++    +++  G   I VG
Sbjct: 1229 -PVSELTILNEKNEAIVEPGEFEIQVG 1254


>gi|393786908|ref|ZP_10375040.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
           CL02T12C05]
 gi|392658143|gb|EIY51773.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
           CL02T12C05]
          Length = 854

 Score =  267 bits (683), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 160/421 (38%), Positives = 232/421 (55%), Gaps = 45/421 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D   P  ER  DL+ ++T+ EK+  +   + G+PRL +  Y   +EALHGV       
Sbjct: 28  YLDMNAPRHERILDLLSKLTIEEKISLLRATSPGIPRLHIDKYYHGNEALHGVV------ 81

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
              PG          T FP  I   A +N  L  +I   +S EARA +N    G      
Sbjct: 82  --RPGNF--------TVFPQAIGLAAMWNPQLLNEISTVISDEARARWNELEQGKKQLGQ 131

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFWSP +N+ RDPRWGR  ET GEDP++ G+  +++V+GLQ           D R
Sbjct: 132 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVSFVKGLQG---------DDPR 182

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LKI +  KH+AA    N E ++RF  +  ++E+D++E ++  FE C+ EG  +S+M +Y
Sbjct: 183 YLKIVSTPKHFAA----NNEEHNRFECNPIISEKDLREYYLPAFEKCIIEGKAASIMTAY 238

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N +N +P   +  LL + +R DW F GY+VSDC     +V  HK++  T E A A  ++A
Sbjct: 239 NAINDVPCTLNNWLLKKVLRHDWGFDGYVVSDCGGPSFLVTHHKYVK-TLEAAAALSIQA 297

Query: 304 GLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKN 360
           GLDL+CGD  Y    + A +Q  ++EA+ID++   +    MRLG FD      Y  +  +
Sbjct: 298 GLDLECGDEVYMEPLLNAYKQYMVSEAEIDSAAYHVLRARMRLGLFDDPALNPYNKISPS 357

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            +   +H +LA EAARQ IVLLKN+   LPL++  IK++A+VG   NA  +  G+Y GTP
Sbjct: 358 IVGCEKHSKLALEAARQSIVLLKNEKKFLPLDSKKIKSIAVVG--INAGNSEFGDYSGTP 415

Query: 421 C 421
            
Sbjct: 416 V 416



 Score =  122 bits (305), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 84/263 (31%), Positives = 129/263 (49%), Gaps = 49/263 (18%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAG 516
           A D  +  D TV V G++ S+E EG+DR  + LP  Q   I +       P T+V++ AG
Sbjct: 595 AGDIMRKCDLTVAVLGINKSIEREGQDRYSIELPKDQQIFIEEAYKI--NPNTVVVLVAG 652

Query: 517 A-VDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYT 575
           + + IN+   +  I +I+   YPGE GG A+A+V+FG YNPGG+LP+T+Y +      + 
Sbjct: 653 SSLAINWMDEH--IPAIVNAWYPGEAGGTAVAEVLFGDYNPGGKLPLTYYRSLDELPAFD 710

Query: 576 SMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDIN 635
              +R      GRTY+FF+G  +Y FG+GLSYT F YK  S   + D+            
Sbjct: 711 DYDIR-----KGRTYQFFEGDPLYAFGHGLSYTTFSYKKLSIDAAGDV------------ 753

Query: 636 YTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI---AGTHI 692
                                    +    ++N GK +G EV  +Y K  G        +
Sbjct: 754 ------------------------VSVSFTLKNTGKYEGDEVAQLYVKYQGSDSQVKLPL 789

Query: 693 KQVIGYERVFIAAGQSAKVGFTM 715
           KQ+ G+ER+ +  G+S ++  T+
Sbjct: 790 KQLKGFERIHLKKGESKQINLTV 812


>gi|315500297|ref|YP_004089100.1| glycoside hydrolase family 3 domain protein [Asticcacaulis
           excentricus CB 48]
 gi|315418309|gb|ADU14949.1| glycoside hydrolase family 3 domain protein [Asticcacaulis
           excentricus CB 48]
          Length = 882

 Score =  267 bits (683), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 166/478 (34%), Positives = 244/478 (51%), Gaps = 49/478 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y DA  P   RA DLV RMTL EK  Q+ + A  +PRL +  Y WW+E LHGV+  G   
Sbjct: 35  YQDASKPPEARAADLVSRMTLEEKTAQLINDAPAIPRLNVREYNWWNEGLHGVAAAGY-- 92

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY-----NLGNA-- 126
                         AT FP  +   A+++E L  ++ +T+S E RA Y       G +  
Sbjct: 93  --------------ATVFPQAVGLAATWDEPLIHRVAETISVEFRAKYLKERHRFGGSDW 138

Query: 127 --GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLT WSPNIN+ RDPRWGR  ET GEDPY+  R  + +VRGLQ  + V Y        
Sbjct: 139 FGGLTVWSPNINIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQGDDPVYY-------- 190

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
            +  A  KHYA +         R   +   +  D+ +T++  F   + EG   S+MC+YN
Sbjct: 191 -RTVATPKHYAVHSGPE---AGRHRDNVNPSPYDLADTYLPAFRATITEGQAGSIMCAYN 246

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
            +NG P CA+  LL + +R DW F GY+VSDCD++  I    SH +   T E+ V    +
Sbjct: 247 AINGQPACANEDLLVKYLRKDWGFKGYVVSDCDAVGDIYYKTSHAY-RPTPEEGVTAAYQ 305

Query: 303 AGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKN 360
            G DL CG+    +    AV+QG + E  +DT+L  L+    +LG FD   + +  +   
Sbjct: 306 VGTDLICGNANEADHLTRAVRQGLLPEKTLDTALIRLFTARFKLGQFDPPAKVFPKITAE 365

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
           +   P + + + + A   +VLLKN+N  LPL  G  + +A++GP+A++  +++GNY G P
Sbjct: 366 DYDTPANRDFSQKVAESAMVLLKNENNLLPLK-GEPRQIAVIGPNADSMDSLVGNYNGDP 424

Query: 421 CRYTSPMDGFYAY--SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
               + + G  A      + YAPG   I    + ++ A  D+A   D      G+ +S
Sbjct: 425 SHPVTVLSGIRARFPKATVTYAPGSGLI----DPVMTAVPDSAFCRDEACTQTGVTVS 478



 Score =  137 bits (346), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 97/299 (32%), Positives = 145/299 (48%), Gaps = 52/299 (17%)

Query: 456 AAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAK 505
           +A+ AAK AD  V VAGL   VE E          G DR  L LP  Q +++ +V+ A K
Sbjct: 598 SAVAAAKEADLVVFVAGLSQRVEGEEMRVETEGFSGGDRTTLNLPPAQQKVLEQVSAAGK 657

Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
            PV LV+++  A+ IN+A  N  + +I+   YPG +GG A+A +I G Y+P GRLP+T+Y
Sbjct: 658 -PVVLVLINGSALGINWADKN--VPAIIEAWYPGGQGGAAVARLIAGDYSPAGRLPVTFY 714

Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
            +         +P     N  GRTY++F G  +YPFGYGLS+T F+Y          + L
Sbjct: 715 RSA------DQLPAFNDYNMKGRTYRYFKGEALYPFGYGLSFTTFRY--------APLTL 760

Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
              Q   D   +V                          +V N G  D  EVV +Y   P
Sbjct: 761 SARQVAGDGQVSVSA------------------------DVTNSGSRDSDEVVQLYVSYP 796

Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
           G     I+ +  +ER+ + AG++  V FT++  ++L  V+   +  +  G   + +G G
Sbjct: 797 GQKLAPIRALARFERIHLKAGETKTVRFTLDP-QALSTVNADGSRSVKPGKVELWLGGG 854


>gi|293371439|ref|ZP_06617870.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
            ovatus SD CMC 3f]
 gi|292633636|gb|EFF52194.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
            ovatus SD CMC 3f]
          Length = 1049

 Score =  267 bits (682), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 219/767 (28%), Positives = 354/767 (46%), Gaps = 100/767 (13%)

Query: 16   DAKLPYPERA----KDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            ++KLP+   A    KDL+ RMT+ EK+ Q+     G   L  P  E+ S++L     +G 
Sbjct: 328  NSKLPHTPEADSFVKDLLSRMTVEEKIGQLSQYV-GRTLLTGPESEYLSDSLIARGLVGS 386

Query: 72   ---------------------RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIG 110
                                 R   P     D      T FPT +  + S++ +  ++  
Sbjct: 387  VLNISGAKTLRDLQEKNMRYSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAA 446

Query: 111  QTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
            +  + E+ A      AGL + ++P +++ RD RWGRV+E  GED Y+    A   V G Q
Sbjct: 447  KIAAIESSA------AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQ 500

Query: 170  DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                  +  +S      + AC KH+ AY L    G D    D  ++E+ + +T++ PF+ 
Sbjct: 501  ---WNLWENNS------VLACAKHWVAYGLPQ-AGRDYAPVD--MSERTLFDTYLPPFKA 548

Query: 230  CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            C++ G V + M ++N +NGIP  A P LL   +RG WNF+G++VSD ++++ +V      
Sbjct: 549  CIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLVAQGVAE 607

Query: 290  NDTKEDAVARVLKAGLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
            +D  +DA      +G+D+D  D  Y  +    ++ GKI+  D+D S+  +  +   LG F
Sbjct: 608  DD--KDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF 665

Query: 349  DGSPQYKN--LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHA 406
                ++ N       I   + ++ A + A +  VLLKNDN  LPL   N++++A+VGP A
Sbjct: 666  VDPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPL-AKNVRSIAVVGPLA 724

Query: 407  NATKAMIGNY--EGTPCRYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDA 460
            +    ++G++   G     T+ + G           + YA GC D   ++ S    A+  
Sbjct: 725  DNQTELLGSWRARGEDRHVTTVLQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKEAVKL 783

Query: 461  AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
            A  +D  + V G    +  E + R  L LPG Q ELI ++    K PV +V+M+   + I
Sbjct: 784  ASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGRPLSI 842

Query: 521  NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN-YVKIPYT-SMP 578
             +   N  + +IL   + G   G AIAD++FG YNP GRL I++      V + Y     
Sbjct: 843  EWVDKN--VSAILETWFLGTSAGTAIADILFGDYNPSGRLTISFPRVEGQVPVYYNYKKS 900

Query: 579  LRPVNNFPGRTYKFFDGP--VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINY 636
             RP +     T +  D P   +YPFGYGLSYT F Y V  S +                Y
Sbjct: 901  GRPGDMPHSSTTRHIDVPNAPLYPFGYGLSYTTFSYSVPQSTQK--------------EY 946

Query: 637  TVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQV 695
            T                  +    +  + V N G  DG E V +Y      +    +K++
Sbjct: 947  T------------------RQETISVSVTVTNTGDRDGEETVQLYVNDKVASVVRPVKEL 988

Query: 696  IGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
              ++++F+ AG+S  V F ++   +L   D A N ++  G   I+ G
Sbjct: 989  KAFKKIFLKAGESKTVQFDISPL-ALGFYDAAMNYVVEPGEFEIMTG 1034


>gi|395492941|ref|ZP_10424520.1| glycoside hydrolase family protein [Sphingomonas sp. PAMC 26617]
          Length = 865

 Score =  267 bits (682), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 168/431 (38%), Positives = 229/431 (53%), Gaps = 44/431 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D   P   R  DL+ RMTL EK  QM ++A  +PRLG+P Y++W+EALHGV+  G   
Sbjct: 14  YFDPGQPIEARVDDLMRRMTLEEKAAQMQNVAPAIPRLGIPPYDYWNEALHGVARAGE-- 71

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   A+++  +    GQTV+TE RA YN   A       
Sbjct: 72  --------------ATVFPQAIGMAATWDRDMMLAEGQTVATEGRAKYNQAQAQKNYDRY 117

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLTFWSPNIN+ RDPRWGR  ET GEDPY+ G  A+ +V G+Q          +D+  L
Sbjct: 118 YGLTFWSPNINIFRDPRWGRGQETLGEDPYLTGTMAVPFVHGVQ---------GTDANYL 168

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
           K  A  KH+A +         R  F+   + +D+ ET++  F   + +G   S+MC+YN 
Sbjct: 169 KAIATPKHFAVHSGPE---QLRHQFNVDPSPRDLSETYLPAFRRAIVDGRAESLMCAYNA 225

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
           V+    CA+  LL  T+RG W F G++ SDC +I  I   H   + T  +  A  +KAG 
Sbjct: 226 VDTKAACANTMLLKDTLRGAWGFKGFVTSDCGAIDDITTGHHN-SPTNPEGAALAVKAGT 284

Query: 306 DLDCGDYYTNF--TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNN 361
           D  C D+         AV+ G + E D+D +LR L+   M+LG FD + +  +  +    
Sbjct: 285 DTGC-DFKDEMLDLPRAVKAGYLTEGDMDVALRRLFTARMKLGMFDPAARVPFSTISIAE 343

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
             +P H  LA  AAR+ IVLLKND G LPL  G  + +A+VGP A +  A+ GNY GTP 
Sbjct: 344 NHSPAHRALALRAARESIVLLKND-GVLPLAAG-ARRIAVVGPTAASLIALEGNYNGTPV 401

Query: 422 RYTSPMDGFYA 432
               P+DG  A
Sbjct: 402 GAVLPVDGMTA 412



 Score =  119 bits (299), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 83/271 (30%), Positives = 126/271 (46%), Gaps = 42/271 (15%)

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
           G DR  + LP  Q++L++ +    K P+ +V+ S  A  I       K +++L   YPGE
Sbjct: 621 GGDRTAIALPAAQSQLLDALFATGK-PLVIVLQSGSA--IALGAQEAKARAVLEAWYPGE 677

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
            GG+AIA+V+ G  NP GRLP+T+Y +         +P         RTY++F G V YP
Sbjct: 678 AGGQAIAEVLSGTVNPSGRLPVTFYAST------DQLPAFDDYRMANRTYRYFAGRVEYP 731

Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
           FG+GLSYT+F Y  A  P +  +   +           GT                    
Sbjct: 732 FGHGLSYTRFAYS-ALRPATSSVAAGQ-----------GT-------------------- 759

Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
           +  + V N G + G EV  +Y   PG  G  I+ + GY+RV +AAG++  + F +   + 
Sbjct: 760 SVSVAVRNTGVLAGDEVAQLYLSVPGREGAPIRSLKGYQRVHLAAGETKTLTFALEP-RD 818

Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVSFP 751
           L + + A    +    + I VG G  G   P
Sbjct: 819 LALANAAGAMAVTKATYQIWVGGGQPGTGAP 849


>gi|313203744|ref|YP_004042401.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312443060|gb|ADQ79416.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 1286

 Score =  266 bits (681), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 158/442 (35%), Positives = 234/442 (52%), Gaps = 35/442 (7%)

Query: 2   FESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSE 61
           F   KV      Y +    + ERA DL+ R+TL EK   +G+    +PRLG+     WSE
Sbjct: 21  FMPAKVSTKKPIYLNTSYSFEERAADLISRLTLEEKESLLGNSMAAIPRLGIKSMNVWSE 80

Query: 62  ALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY 121
           ALHG+  +G       G +    + G TSFP  +   ++++ +L ++    ++ EARA+ 
Sbjct: 81  ALHGI--LG-------GANQSVGISGPTSFPNSVALGSAWDPALMQREAMAIADEARAIN 131

Query: 122 NLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
             G  GLT+WSP +  +RDPRWGR  E+ GEDP++    A  +VRG+           +D
Sbjct: 132 QTGTKGLTYWSPVVEPIRDPRWGRTGESYGEDPFLAAEIAGGFVRGMV---------GND 182

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
              LK   C KHY A    N    DR    S +  +DM+E ++ P++  + + ++ S+M 
Sbjct: 183 PTYLKSVPCAKHYFA----NNSEFDRHVSSSNMDSRDMREFYLAPYKKLIEQDNLPSIMS 238

Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
           SYN VNG+PT A    L+   R  +   GYI  DC +I+ I   H ++  T E+A A+ L
Sbjct: 239 SYNAVNGVPTSASQLYLDTIARRTYGLKGYITGDCAAIEDIYTGHYYVK-TAEEATAKGL 297

Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGK 359
           KAG+D DCG  Y  + + A+++G I  ADID +L  ++IV MR G FD   +  Y     
Sbjct: 298 KAGVDSDCGSIYQRYAIAALKKGLITMADIDRALLNIFIVRMRTGEFDPPAKVLYAQFQP 357

Query: 360 NNICNPQHIELAAEAARQGIVLLKN------DNGALPLNTGNIKTLALVGPHANATKAMI 413
           N + +P +  LA E A +  VLLKN      +  ALPLN  ++K +AL+GPHA+  K  +
Sbjct: 358 NIVNSPANKALAKEIATKTPVLLKNNISLKTNRKALPLNPADLKKIALIGPHAD--KVEL 415

Query: 414 GNYEGTPCR--YTSPMDGFYAY 433
           G Y G P +    +P  G   Y
Sbjct: 416 GPYSGRPAQENMITPFAGIKKY 437



 Score =  139 bits (350), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 96/263 (36%), Positives = 134/263 (50%), Gaps = 39/263 (14%)

Query: 472 GLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIM-SAGAVDINFAKNNPKIK 530
           G D     E  DR+ LLLPG Q ELI  VA  A  P T+V+M + G V++   KN   I 
Sbjct: 619 GTDEKTATEEADRLTLLLPGNQVELIKAVA--AVNPNTIVVMQTLGCVEVEEFKNLQNIP 676

Query: 531 SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTY 590
            I+WVGY G+  G AIA V+FG+ NPGG+L  TWY++       T   LR  N   GRT+
Sbjct: 677 GIIWVGYNGQAQGDAIASVLFGEVNPGGKLNGTWYKSVKDLPEITDYTLRGGNGKNGRTF 736

Query: 591 KFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
            +FD  V Y FG+G+SYT F+Y                      N+ +  N     +++ 
Sbjct: 737 WYFDKDVSYEFGFGMSYTTFEYS---------------------NFRISKN-----SIIP 770

Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH---IKQVIGYERVFIAAGQ 707
            D      K T  ++V+N GK++G EV+ VY K P    +    IK++ G++RV + AGQ
Sbjct: 771 HD------KITVSVDVKNTGKVEGDEVIQVYMKTPDSPASLQRPIKRLKGFKRVTLPAGQ 824

Query: 708 SAKVGFTMNACKSLKIVDNAANS 730
           +  V   +N C  L   D   N+
Sbjct: 825 TKTVNIDIN-CADLWFWDMDKNT 846


>gi|237721771|ref|ZP_04552252.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
 gi|229448640|gb|EEO54431.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
          Length = 735

 Score =  266 bits (681), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 211/760 (27%), Positives = 355/760 (46%), Gaps = 97/760 (12%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLYEW 58
           Y D K P  +R  DL+ RMTL EK+ Q+     G              VP  +G  +Y  
Sbjct: 30  YKDPKAPIEKRVNDLLSRMTLEEKMMQLNQYTLGRNNNVNNVGEEVKKVPAEIGSLIYFE 89

Query: 59  WSEALHGV----SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
            + AL       +    R   P    +D+     T +P  +    S+N  L ++     +
Sbjct: 90  TNPALRNSMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSWNPDLVEQACAVSA 149

Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
            EAR    +     TF SP I+V RDPRWGRV E  GEDPY  G +    V+G       
Sbjct: 150 QEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFGAASVKG------- 197

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
            Y  D  S   +++AC KHY  Y      G D  +  + +++Q + +T++LP+EM V  G
Sbjct: 198 -YQGDDLSAENRMAACLKHYVGYGASE-AGRDYVY--TEISKQTLWDTYLLPYEMGVKAG 253

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
             +++M S+N ++G+P  A+P ++ + ++  W   G+IVSD  +I+ +   ++ L  TK+
Sbjct: 254 -AATLMSSFNDISGVPGSANPYIMTEILKKRWGHDGFIVSDWGAIEQL--KNQGLAATKK 310

Query: 295 DAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
           +A      AGL++D   + Y       V++G+++ A +D ++R + ++  RLG F+    
Sbjct: 311 EAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLFERPYT 370

Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
                K     PQ +++AA  A + +VLLKN+N  LPL   + K +A++GP A     ++
Sbjct: 371 PATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLT--DKKKIAVIGPMAKNGWDLL 428

Query: 414 GNY--EGTPCRYTSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
           G++   G         +G    +A    + YA GCA     N      A++AA+ +D  V
Sbjct: 429 GSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGCA-TKGDNKEGFAEALEAARWSDVVV 487

Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
           +  G  ++   E   R  + LP  Q EL  ++  A K P+ LV+++   +++N  +    
Sbjct: 488 LCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPLELN--RLELI 544

Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS--MPLRPVNNFP 586
             +IL +  PG  G   +A ++ G+ NP G+L +T+        PY++  +P+       
Sbjct: 545 SDAILEIWQPGVNGALPMAGILSGRINPSGKLAMTF--------PYSTGQIPIYYNRRKS 596

Query: 587 GRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
           GR ++ F   +    +YPFG+GLSYT+FKY                          GT  
Sbjct: 597 GRGHQGFYKDITSDPLYPFGHGLSYTEFKY--------------------------GTVT 630

Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERV 701
           P    V   D      + + ++ V N+G  DG+E V  +   P  + T  +K++  +E+ 
Sbjct: 631 PSVTKVKRGD------RLSVEVTVTNVGARDGAETVHWFISDPYCSITRPVKELKHFEKQ 684

Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
            I AG++    F ++  +    V+      L +G + ILV
Sbjct: 685 LIRAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724


>gi|435848436|ref|YP_007310686.1| beta-glucosidase-like glycosyl hydrolase [Natronococcus occultus
           SP4]
 gi|433674704|gb|AGB38896.1| beta-glucosidase-like glycosyl hydrolase [Natronococcus occultus
           SP4]
          Length = 771

 Score =  266 bits (681), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 206/704 (29%), Positives = 334/704 (47%), Gaps = 98/704 (13%)

Query: 86  PGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWG 144
           P AT+FP +I   ++++  L +++ +T+  E  A+      G T   SP ++V RD RWG
Sbjct: 113 PEATTFPQMIGMASTWDPELLEEVTETIRGELEAL------GTTHALSPVLDVARDLRWG 166

Query: 145 RVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEG 204
           RV ET GEDP +V   A  YV GLQ           D R   +SA  KH+  +   +  G
Sbjct: 167 RVEETFGEDPLLVAAMACGYVSGLQ----------GDGRADGVSATLKHFVGHGATDG-G 215

Query: 205 NDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRG 264
            +R   +  V  ++++E  + P+E  +   D  SVM +Y+ ++GIP  +   LL   +RG
Sbjct: 216 KNRSSLN--VGPRELREVHLFPYEAAIRTADAESVMNAYHDIDGIPCASSEWLLTDLLRG 273

Query: 265 DWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQ 322
           ++ F G +VSD  S++ +V  H   N TK +A    L+AGLD++    DYY    + AV+
Sbjct: 274 EFGFDGTVVSDYYSVRHLVTEHGTAN-TKPEAATAALEAGLDVELPYTDYYGEHLITAVE 332

Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLL 382
            G+++E  +D S+R +     R G  D          +     +   L   AAR+ + LL
Sbjct: 333 NGELSEKTLDESVRRVLREKARKGLLDDPSVDAEAAADAFRTDEAAALNRRAARRSMTLL 392

Query: 383 KNDNGALPLNTGNIKTLALVGPHANATKAMIGNY--------EGTPCRYTSPMDGFYAYS 434
           KN+N  LPL   ++   A++GP A+A K ++G+Y        E      T+P+    +  
Sbjct: 393 KNENELLPLTADSV---AVIGPKADAKKELLGDYAYAAHYPEEEYASDATTPLAALESRD 449

Query: 435 KV-INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE--------------- 478
            + ++Y  GC       +   PAA   A++AD  +   G   +V+               
Sbjct: 450 GLEVSYEQGCTVSGPSTDGFEPAA-QVAEDADVALAFVGARSAVDFSDGDASKEEKPSVP 508

Query: 479 --AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
              EG D  DL LPG Q ELI+++ +    P+ +VI+S     I   +    + ++L+  
Sbjct: 509 TSGEGCDVTDLGLPGVQEELIDRLQETGT-PLAVVIVSGRPHSIE--RITADVPAVLYAW 565

Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEA-NYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
            PG+EGG AI DV+FG++NP GRLP++  ++   + + Y         N   ++Y + DG
Sbjct: 566 LPGDEGGSAIVDVLFGEHNPSGRLPVSLPKSVGQLPVYYNRKA-----NTANKSYVYTDG 620

Query: 596 PVVYPFGYGLSYTQFKYKVAS-SPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
             VYPFG+GLSYT+F+Y   S S K V                      P   V+     
Sbjct: 621 EPVYPFGHGLSYTEFEYGTLSLSEKRVS---------------------PLETVVA---- 655

Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGF 713
                    + V N G   G+EVV +Y+     +    ++++IG+ERV + AG++ +V F
Sbjct: 656 --------SVPVTNEGDRSGAEVVQLYAHAANPSQARPVQELIGFERVPLEAGETKRVSF 707

Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLN 757
            ++  + L   D +    +  G + I VG     +     L +N
Sbjct: 708 ELSPTQ-LAFHDESMTLTVEEGPYEIRVGRSASDIVATDDLEVN 750


>gi|224535242|ref|ZP_03675781.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224523140|gb|EEF92245.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 864

 Score =  266 bits (681), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 163/460 (35%), Positives = 248/460 (53%), Gaps = 43/460 (9%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           ++ V   + PY + +L   ERA DL++RMTL EKV QM + +  + RLG+P Y+WW+EAL
Sbjct: 14  TLNVTAQNEPYKNPELSPSERAWDLLKRMTLEEKVSQMKNGSPAIERLGIPAYDWWNEAL 73

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN- 122
           HGV+  G+                AT FP  I   A+F+     +    VS EARA Y+ 
Sbjct: 74  HGVARAGK----------------ATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHD 117

Query: 123 -------LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
                   G  GLTFW+PNIN+ RDPRWGR +ET GEDPY+     +  V+GLQ     +
Sbjct: 118 FQRKGERDGYKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGGGTGK 177

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEG 234
           Y         K  AC KHYA +    W   +R  FD++ ++++D+ ET++  F+  V EG
Sbjct: 178 YD--------KAHACAKHYAVHSGPEW---NRHSFDAKNISQRDLWETYLSAFKTLVKEG 226

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI-VESHKFLNDTK 293
            V  VMC+YNR  G P C++ +LL + +R DW +   +VSDC +I      +H   + T 
Sbjct: 227 KVKEVMCAYNRFEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTA 286

Query: 294 EDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP- 352
             A A  + +G DL+CG  Y++    AV++G I+E  I+ S+  L     +LG FD    
Sbjct: 287 AAASADAVVSGTDLECGGSYSSLNE-AVRKGLISEEKINESVFRLLRARFQLGMFDDDAL 345

Query: 353 -QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKA 411
             +  +  + + + +H+  A E AR+ +VLL N N  LPL+  +I+ +A++GP+AN +  
Sbjct: 346 VSWSEIPYSVVESKEHVTKALEMARKSMVLLTNKNHTLPLSK-SIRKVAVLGPNANDSVM 404

Query: 412 MIGNYEGTPCRYTSPMDGFYAY--SKVINYAPGCADIVCQ 449
           +  NY G P +  + ++G  +      + Y  GC  +  Q
Sbjct: 405 LWANYNGFPTKSVTILEGIKSKLPEGTVYYEKGCDYVNTQ 444



 Score =  126 bits (317), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 90/295 (30%), Positives = 139/295 (47%), Gaps = 53/295 (17%)

Query: 459 DAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPV 508
           D A  ADA + V GL  ++E E            DR ++ LP  Q E++  +    K PV
Sbjct: 595 DKAAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDLPHVQAEMLKALKKTGK-PV 653

Query: 509 TLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN 568
             V+ S   + + +   N  + +IL   YPG++GG A+ADV+FG YNP GRLP+T+Y ++
Sbjct: 654 IFVLCSGSTLALPWEAEN--LDAILEAWYPGQQGGTAVADVLFGDYNPAGRLPLTFYASS 711

Query: 569 YVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKD 628
                   +P     +   RTY++F G  ++PFG+GLSYT F Y  A        K+DK 
Sbjct: 712 ------NDLPDFEDYDMSNRTYRYFKGKALFPFGHGLSYTIFDYGKA--------KVDKQ 757

Query: 629 QQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA 688
                 N   G                     T  I ++N GK+DG EV+ VY + P   
Sbjct: 758 ------NVRAGEG------------------MTLTIPLKNTGKLDGDEVIQVYLRNPADK 793

Query: 689 GTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
              IK +  + RV + AGQ+  +   + A  + +  + + N + +  G + +L G
Sbjct: 794 EGPIKTLRAFRRVSLPAGQTENIRIELPAS-TFECFNPSTNRMEILPGKYELLYG 847


>gi|381200965|ref|ZP_09908097.1| beta-glucosidase [Sphingobium yanoikuyae XLDN2-5]
          Length = 774

 Score =  266 bits (681), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 217/720 (30%), Positives = 344/720 (47%), Gaps = 107/720 (14%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+P+  +  E LHG + +G                 ATSFP  I   +S++ ++ +++
Sbjct: 121 RLGIPIL-FHEEGLHGYAAVG-----------------ATSFPQSIAMASSWDPAMLRQV 162

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
            Q ++ E RA            SP +++ RDPRWGR+ ET GEDPY+VG   +  V GLQ
Sbjct: 163 NQVIAREIRA-----RGVPMVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQ 217

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
              GV   R   S    + A  KH   +       N      + V+E++++E F  PFE 
Sbjct: 218 ---GVGRSRTLQSN--HVFATLKHLTGHGQPESGTN---IGPAPVSERELRENFFPPFEQ 269

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            V    + +VM SYN ++G+P+ A+  LL   +R +W F G +VSD  ++  ++  H   
Sbjct: 270 VVKRTGIEAVMASYNEIDGVPSHANRWLLENILREEWGFRGAVVSDYSAVDQLMSIHHIA 329

Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGA-VQQGKIAEADIDTSLRFLYIVLMRLGYF 348
            +  E+A  R L AG+D D  +  +  T+G  V++GK++EA +D ++R +  +  R G F
Sbjct: 330 ANL-EEAAMRALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVRRMLELKFRAGLF 388

Query: 349 DGSPQYKNLGKNNICNPQHIE-LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
           + +P         I N +    LA  AA++ I LLKND G LPL      T+A++GP  +
Sbjct: 389 E-NPYADANAAAAITNNEDARALARTAAQRSITLLKND-GMLPLKPEG--TIAVIGP--S 442

Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAYSKV---INYAPGC---------ADIV-----CQN 450
           A  A +G Y G P    S ++G  A       I +A G          AD V      +N
Sbjct: 443 AAVARLGGYYGQPPHSVSILEGIKARVGTKANIVFAQGVKITEDDDWWADSVTKSDPAEN 502

Query: 451 NSMIPAAIDAAKNADATVIVAGLDLSVEAEG------KDRVDLLLPGFQTELINKVADAA 504
             +I  A++AA+N D  ++  G       EG       DR  L L   Q EL + +    
Sbjct: 503 RKLIAQAVEAARNVDRIILTLGDTEQSSREGWADNHLGDRPSLDLVSEQQELFDALKALG 562

Query: 505 KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW 564
           K P+T+V+++      +  K + +  +IL   Y GE+GG A+AD++FG  NPGG+LP+T 
Sbjct: 563 K-PITVVLINGRPA--STVKVSEQANAILEGWYLGEQGGNAVADILFGDVNPGGKLPVTV 619

Query: 565 -YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
              A  + + Y   P         R Y F     +YPFG+GLSYT F     S+P+    
Sbjct: 620 PRSAGQLPLFYNMKP------SARRGYLFDTTDPLYPFGFGLSYTSFSL---SAPRLSAT 670

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
           K             +GT                  K +  ++V N G  +G EVV +Y +
Sbjct: 671 K-------------IGTGG----------------KTSVSVDVRNTGAREGDEVVQLYIR 701

Query: 684 PPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
               + T  +K++ G++RV +  G+S  V FT+   ++L++ ++  + ++  G   I+ G
Sbjct: 702 DKVSSVTRPVKELKGFQRVTLKPGESRTVTFTV-GPEALQMWNDQMHRVVEPGDFEIMTG 760


>gi|189464310|ref|ZP_03013095.1| hypothetical protein BACINT_00651 [Bacteroides intestinalis DSM
           17393]
 gi|189438100|gb|EDV07085.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 864

 Score =  266 bits (681), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 158/421 (37%), Positives = 229/421 (54%), Gaps = 45/421 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D K P  ER  DL+ R+T+ EK+  +   + G+PRL +P Y   +EALHGV   GR  
Sbjct: 28  YKDEKAPMHERIMDLLSRLTVEEKISLLRATSPGIPRLDIPKYYHGNEALHGVVRPGR-- 85

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
                          T FP  I   A++N  L  ++   +S EARA +N  + G      
Sbjct: 86  --------------FTVFPQAIGLAATWNPELQLQVATVISDEARARWNELDQGREQKSQ 131

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFWSP +N+ RDPRWGR  ET GEDPY+ G     +V+GLQ           D R
Sbjct: 132 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGVMGTAFVKGLQG---------DDDR 182

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LKI +  KH+AA    N E ++RF  + +++E+ ++E ++  FE CV +G  +S+M +Y
Sbjct: 183 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMSAY 238

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N +N +P   +  LL + +R DW F GY+VSDC     +V +HK++  TKE A    +KA
Sbjct: 239 NALNDVPCTLNAWLLTKVLREDWGFKGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSIKA 297

Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           GLDL+CG D +    + A +Q  +  ADID++   +    M+LG FD   +  Y  +   
Sbjct: 298 GLDLECGDDVFDEPLLSAYRQYMVTNADIDSAAYRVLRARMQLGLFDSGEKNPYTKISPA 357

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + + +H E+A  AAR+ IVLLKN    LPLN   +K++A+VG   NA     G+Y G+P
Sbjct: 358 VVGSAKHQEVALNAARECIVLLKNQKKMLPLNAKKVKSIAVVG--INAGNCEFGDYSGSP 415

Query: 421 C 421
            
Sbjct: 416 V 416



 Score =  137 bits (345), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 95/289 (32%), Positives = 150/289 (51%), Gaps = 55/289 (19%)

Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-V 518
           A +  +  V V G++ S+E EG+DR D+ LP  Q E + ++      P  +V++ AG+ +
Sbjct: 598 AVRECETVVAVLGINKSIEREGQDRYDIQLPADQMEFLQEIYKV--NPNIVVVLVAGSSL 655

Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
            +N+   +  + +I+   YPGE GG+A+A+V+FG YNPGGRLP+T+Y +   ++P    P
Sbjct: 656 AVNWMDEH--VPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS-LDELP----P 708

Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKY---KVASSPKSVDIKLDKDQQCRDIN 635
               +   GRTYK+F G V+YPFGYGLSYT FKY   +VA   + +++            
Sbjct: 709 FDDYDITKGRTYKYFKGDVLYPFGYGLSYTTFKYSNLQVADGEEEINV------------ 756

Query: 636 YTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQ 694
                                    +FQ+  +N GK  G EV  VY K P       +K+
Sbjct: 757 -------------------------SFQL--KNAGKYAGDEVAQVYVKLPERDEVMPVKE 789

Query: 695 VIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
           + G+ERV + +G++ K+   +     L+  D A    +  SG +TI+VG
Sbjct: 790 LKGFERVALKSGENKKMTLKLRK-DLLRYWDEAKGKFVYPSGDYTIMVG 837


>gi|325104789|ref|YP_004274443.1| glycoside hydrolase family protein [Pedobacter saltans DSM 12145]
 gi|324973637|gb|ADY52621.1| glycoside hydrolase family 3 domain protein [Pedobacter saltans DSM
           12145]
          Length = 802

 Score =  266 bits (681), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 233/824 (28%), Positives = 362/824 (43%), Gaps = 150/824 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
           + D   P  +R +DL+ +MT+ EK  Q   L YG  R+    +P  EW    W +   G+
Sbjct: 48  FEDQSQPIEKRVEDLLSQMTVAEKTNQTATL-YGYGRVLKDEMPTSEWKKSIWKD---GI 103

Query: 67  SFIGRRTNSPP---------------------------------GTHFDSEVPG------ 87
           + +    NS P                                 G   D    G      
Sbjct: 104 ANMDEALNSLPNNKKAQTEYSFPYSKHATAINTLQKWFIEETRLGIPVDFTNEGIHGLCH 163

Query: 88  --ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWG 144
             AT F   I   +S+N++L +K G+    E +A+      G T  ++P +++ RDPRWG
Sbjct: 164 DRATPFCAPIGIGSSWNKNLVRKAGEIAGREGKAL------GYTNVYAPILDLARDPRWG 217

Query: 145 RVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEG 204
           RV+E  GEDP++VG    N V GLQ                 I+A  KHYA Y +     
Sbjct: 218 RVVECYGEDPFLVGELGKNMVSGLQSN--------------GIAATLKHYAVYSVPKGGR 263

Query: 205 NDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRG 264
           +     D  VT +++ +  + PF+  V E     VM SYN  +GIP       L + +R 
Sbjct: 264 DGHARTDPHVTPRELHQIHLYPFKKVVQEAKPLGVMSSYNDWDGIPVTGSYYFLTELLRK 323

Query: 265 DWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGA 320
            + F+GY+VSD ++++ I   H+   D KE +V   LKAGL++       D Y N    +
Sbjct: 324 QYGFNGYVVSDSEAVEFIASKHRVAKDFKEASVI-ALKAGLNVWTNFRQPDNYINNLRAS 382

Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN--ICNPQHIELAAEAARQG 378
           V  G +    ++  +R +  V  RLG FD  P  +N   ++  +  P+  + A +  ++ 
Sbjct: 383 VADGSLDMETLNQRVREVLSVKFRLGLFD-RPFTENPAASDKKVQTPEDKKFAEQMNKES 441

Query: 379 IVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK--- 435
           IVLLKN N  LPL+    + + + GP A      I  Y  +    TS +DG   Y+    
Sbjct: 442 IVLLKNGNDFLPLDKNKNQKILVTGPLAAEVGYTISRYGPSNNPSTSILDGLKQYNNGKL 501

Query: 436 VINYAPGC--------------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
            I+YA GC                +  +  +MI  A+  AKN D  + V G +  +  E 
Sbjct: 502 NIDYAKGCEIVNEGWPGTEIIDEPVTEKEKAMIADAVAKAKNVDVIIAVVGENEKIVGES 561

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
             R  L LPG Q EL+ K   A   PV +V+++   + IN+   N  + +IL   + G  
Sbjct: 562 LSRTSLNLPGRQLELL-KALHATGKPVVMVLVNGRPLTINW--ENHYLTAILETWFLGPS 618

Query: 542 GGRAIADVIFGKYNPGGRLPITW------YEANYVKIP--YTSMPLRPVNNFPGRTYKFF 593
            G+ +A+ +FG YNPGG+L +T+       E N+   P  + + P    N F G++    
Sbjct: 619 AGKVVAETLFGDYNPGGKLSVTFPKSIGQIEMNFPFKPGSHANQPSSGDNGF-GKSR--V 675

Query: 594 DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
           +G V+YPFGYGLSYT+F Y         D+KLD              +KP   +      
Sbjct: 676 NG-VLYPFGYGLSYTKFSYS--------DLKLD-------------FSKPDSISA----- 708

Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVG 712
                       ++N+GK DG EVV +Y +       T+  Q+  +ER+ + AG++ ++ 
Sbjct: 709 ---------SFVLKNIGKRDGDEVVQLYFRDLISSVITYDTQLRAFERIHLKAGETKQLN 759

Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
               A K L I+D   N  +  G   +L+G     +    +  L
Sbjct: 760 LKF-ARKDLAILDKDMNWAVEPGDFEVLIGSSSEDIRLKEKFTL 802


>gi|427384377|ref|ZP_18880882.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727638|gb|EKU90497.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
           12058]
          Length = 1050

 Score =  266 bits (681), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 159/421 (37%), Positives = 231/421 (54%), Gaps = 45/421 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y +   P  ER  DL+ R+T+ EK+  +   + G+ RL +P Y   +EALHGV   GR  
Sbjct: 29  YKNENAPTHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPGR-- 86

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
                          T FP  I   A++N  L +++   +S EARA +N  + G      
Sbjct: 87  --------------FTVFPQAIGLAATWNPVLQEQVATVISDEARARWNELDQGREQKSQ 132

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFWSP +N+ RDPRWGR  ET GEDPY+ G     +V+GLQ          +DSR
Sbjct: 133 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQG---------NDSR 183

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LKI +  KH+AA    N E ++RF  + +++E+ ++E ++  FE CV +G  +S+M +Y
Sbjct: 184 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMSAY 239

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N +N +P   +  LL + +R DW F GY+VSDC     +V +HK++  TKE A    +KA
Sbjct: 240 NALNDVPCTLNAWLLTKVLRNDWGFKGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSIKA 298

Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           GLDL+CG D Y    + A +Q  + +ADID++   +    M+LG FD   +  Y  +   
Sbjct: 299 GLDLECGDDVYDEPLLSAYRQYMVTDADIDSAAYRVLRARMQLGLFDSGEKNPYTKISPA 358

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            I + +H E+A  AAR+ IVLLKN    LPLN   IK++A+VG   NA  +  G+Y G P
Sbjct: 359 VIGSKEHQEVALNAARECIVLLKNQKKMLPLNAKKIKSIAVVG--INAGSSEFGDYSGLP 416

Query: 421 C 421
            
Sbjct: 417 V 417



 Score =  137 bits (346), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 96/289 (33%), Positives = 148/289 (51%), Gaps = 55/289 (19%)

Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-V 518
           A +  +  V V G++ S+E EG+DR D+ LP  Q E + ++      P  +V++ AG+ +
Sbjct: 599 AVRECETVVAVLGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIVVVLVAGSSL 656

Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
            +N+   +  + +I+   YPGE GG+A+A+V+FG YNPGGRLP+T+Y +   ++P    P
Sbjct: 657 AVNWMDEH--VPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS-LDELP----P 709

Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKY---KVASSPKSVDIKLDKDQQCRDIN 635
               +   GRTYK+F G V+YPFGYGLSYT FKY   +VA   + V +            
Sbjct: 710 FDDYDITKGRTYKYFKGDVLYPFGYGLSYTSFKYSNLQVADGEEEVSV------------ 757

Query: 636 YTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQ 694
                                    +FQ+  +N G+  G EV  VY K P       +K+
Sbjct: 758 -------------------------SFQL--KNTGRYAGDEVAQVYVKLPEREEVMPVKE 790

Query: 695 VIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
           + G+ERV + +G+S KV   +     L+  D A    +  SG + I+VG
Sbjct: 791 LKGFERVSLKSGESKKVTIKLRK-DLLRYWDEAKGKFIYPSGNYNIMVG 838


>gi|219118959|ref|XP_002180246.1| beta-xylosidase [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217408503|gb|EEC48437.1| beta-xylosidase [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 682

 Score =  266 bits (681), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 199/616 (32%), Positives = 296/616 (48%), Gaps = 64/616 (10%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMG---------DLAYGVPRLGLPLYEWWSEA 62
            PYCD  L   ER +DL+  +TL EKV  +G              V R+GLP Y W  E 
Sbjct: 70  LPYCDMSLSIDERLEDLLSHLTLDEKVDMIGADPTQDVCMTHTMNVSRIGLPDYYWLVE- 128

Query: 63  LHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
                     TN+  G+   +E   AT F   +   ASFN S W   G    TE RA+ N
Sbjct: 129 ----------TNTAVGSACIAENKCATEFSGPLSIAASFNRSSWFLKGSVFGTEQRALMN 178

Query: 123 LG----------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVE 172
           +           + GLT + PNIN  RDPR+GR  E PGEDP++ G+YA + V+G+Q+  
Sbjct: 179 VHGERFHTHSGRHIGLTAFGPNINQQRDPRFGRSSELPGEDPFLSGQYAAHMVQGMQE-- 236

Query: 173 GVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVN 232
                RD++  P K+ A  KH+ AY  +   GND    D  ++  D+ +T++  +EM + 
Sbjct: 237 -----RDANGYP-KVLAYLKHFTAYSREEGRGND----DYNISMYDLFDTYLPQYEMGMV 286

Query: 233 EGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFH-GYIVSDCDSIQTIVESHKFLND 291
           +G  + VMCSYN VNGIP CA+  LLN+ +R  WN    ++ +DC ++  +         
Sbjct: 287 QGGATGVMCSYNAVNGIPACANDYLLNKILRQRWNRSDAHVTTDCGAVNNL-RGKPIQAA 345

Query: 292 TKEDAVARVLKAGLDLDCGD--YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
            +  A A  L  G D++ G   +  N T  A+  G   E  ++ ++R  Y      G FD
Sbjct: 346 DEAQAAAMALMNGADIEMGSTLFVHNLTT-AITLGYATEEAVNQAIRRSYRPHFIAGRFD 404

Query: 350 GS--PQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
                ++ +LG ++I + +H E+  EAA QG+VLLK+++  LP+  G    LA++GP   
Sbjct: 405 DPTLSEWFSLGLDDIQSKKHQEIQLEAALQGLVLLKHEDSILPIAAGT--KLAVLGPLGM 462

Query: 408 ATKAMIGNYE--------GTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAID 459
               ++ +YE        G  C  T      +   K    A    D+  +N S +   + 
Sbjct: 463 TRSGLMSDYESDQSCFGGGHDCIPTLAESIGFINGKEFTVAAAGVDVDSRNTSDVERILQ 522

Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
            A + D  V+  G   + E EG DR D  LPG Q  L   V    K PV LV+++ G + 
Sbjct: 523 LAADRDLIVLCLGNTKTQEQEGFDRKDTALPGQQYALFEAVLTLRK-PVVLVLVNGGQIA 581

Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
           ++     P   +I+    P   GG A+A  +FG+ N  G+LP T Y   Y  +    M  
Sbjct: 582 LDGMTGYP--SAIIEAFNPNGIGGTALAASLFGQENRWGKLPYTIYP--YSVMQSFDMKD 637

Query: 580 RPVNNFPGRTYKFFDG 595
             ++  PGRTY++F G
Sbjct: 638 HSMSAPPGRTYRYFTG 653


>gi|254786805|ref|YP_003074234.1| glycoside hydrolase family 3 domain-containing protein
           [Teredinibacter turnerae T7901]
 gi|237686035|gb|ACR13299.1| glycoside hydrolase family 3 domain protein [Teredinibacter
           turnerae T7901]
          Length = 888

 Score =  266 bits (680), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 167/450 (37%), Positives = 239/450 (53%), Gaps = 53/450 (11%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D  L    R  DLV RM L EK+ QM + +  +  LG+  Y+WW+EALHGV+  G+  
Sbjct: 47  YMDTTLDIDTRVDDLVSRMDLAEKISQMYNESPAIEHLGIAEYDWWNEALHGVARAGK-- 104

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LGN 125
                         AT FP  I   A ++      I + VS EARA ++           
Sbjct: 105 --------------ATVFPQAIGMAAMWDRETMFDIAEAVSDEARAKHHYFVENGVHFRY 150

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLTFWSPNIN+ RDPRWGR  ET GEDPY+ G  A+ Y+ GLQ           + + L
Sbjct: 151 TGLTFWSPNINIFRDPRWGRGQETYGEDPYLTGELALPYISGLQG---------ENPKYL 201

Query: 186 KISACCKHYAAYDLDNWEGNDRF-HFDSRV-TEQDMQETFILPFEMCVNEGDVSSVMCSY 243
           K +A  KH+A +      G ++  H D+ + + +D+ ET++  FE  V EGDV SVMC+Y
Sbjct: 202 KTAAMAKHFAVH-----SGPEKSRHSDNYIASPKDLNETYLPAFEKAVVEGDVESVMCAY 256

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVL 301
           NRVN  P C +  LL +T+RG W F G++VSDC +I      E+H  +      A   V 
Sbjct: 257 NRVNDEPACGNDMLLKETLRGKWGFKGHVVSDCGAIADFYAPEAHHVVMAPAAAAAWAV- 315

Query: 302 KAGLDLDCG-DYYTNFT--MGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKN 356
           ++G DL+CG D  + F     A+Q+  I + +ID S++ L     +LG FD   Q  Y  
Sbjct: 316 RSGTDLNCGTDRLSTFANLHFALQREMITQDEIDQSVKRLMKTRFKLGMFDPDDQVPYSK 375

Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
           +  + + +  H+ L  +AA +  VLLKN +G LPL   +   +A++GP+A     ++GNY
Sbjct: 376 IPMDVVGSQAHLALTQKAAEKSFVLLKN-SGILPLKKSS--KVAIIGPNATNPTVLVGNY 432

Query: 417 EGTPCRYTSPMDGFYAY--SKVINYAPGCA 444
            G P +  +P+DG   Y   + + YAPG A
Sbjct: 433 FGDPIKPVTPLDGIQQYLGEENVFYAPGSA 462



 Score =  122 bits (307), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 94/288 (32%), Positives = 140/288 (48%), Gaps = 65/288 (22%)

Query: 470 VAGLDLSVEAEG---KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
           + G ++SVE EG    DR D+ LP  Q +L+  +    K P+ LV  S  A+ +N+A NN
Sbjct: 634 LEGEEMSVEIEGFDHGDRTDIRLPEPQRKLLATLKKLNK-PIVLVNFSGSAIALNWANNN 692

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
             + +IL   YPGE  G A+A +++G+ +P GRLPIT+Y              R +++ P
Sbjct: 693 --VDAILQGFYPGEATGTALARILWGEVSPSGRLPITFY--------------RSLDDLP 736

Query: 587 G--------RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV 638
           G        RTYK++ G V+YPFGYGLSYTQF Y   S+P                  T+
Sbjct: 737 GFKDYAMTNRTYKYYQGDVLYPFGYGLSYTQFAYSELSAPA-----------------TM 779

Query: 639 GTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVI 696
            + +P                     +V N GK+   EVV VY   K PG++    +++ 
Sbjct: 780 ASGEP----------------LAITAQVSNSGKVASDEVVQVYVSMKVPGLSLPQ-RELK 822

Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
            ++R+++  G S  V F++ A K L  VD+        G  T+ VG G
Sbjct: 823 EFKRIYLEPGASQTVEFSI-AGKDLSYVDDQGVRHPYHGPLTLSVGGG 869


>gi|384146876|ref|YP_005529692.1| beta-glucosidase [Amycolatopsis mediterranei S699]
 gi|340525030|gb|AEK40235.1| beta-glucosidase [Amycolatopsis mediterranei S699]
          Length = 671

 Score =  266 bits (680), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 232/773 (30%), Positives = 346/773 (44%), Gaps = 146/773 (18%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQM--------GDLAYGVPRLGLPLYEWWSEALH 64
           P+ DA+     RA +LV  MTL EK+ Q+              +PRLG+P +        
Sbjct: 13  PWRDARQSPDRRAAELVAAMTLDEKISQLHLQPDAEHQRFVPPIPRLGVPGF-------- 64

Query: 65  GVSFIGRRTNSPPGTHFDSEVPG--ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
                 R  N P G     + P   AT+ P  +   ++F+  L ++ G+ +  E RA+ +
Sbjct: 65  ------RIANGPAGMGPADDKPQKPATALPATMALASTFDTGLARRYGRLIGDETRALAH 118

Query: 123 LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
             + G     P+IN+ R PR GR  E  GEDP + G  A   +RG+Q+   +        
Sbjct: 119 NVSEG-----PDINMARVPRNGRTFEGMGEDPVLAGALAAADIRGIQENGTI-------- 165

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
                 A  KHYAA    N +  DR   D  + E+ + E ++  FE  V EG   SVMC+
Sbjct: 166 ------AEVKHYAA----NNQETDRHGIDEHIDERTLNEIYLPHFEQAVTEGHAGSVMCA 215

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           Y ++NG+ TC +P LL   +R DW F G++ SD  +  + V S                 
Sbjct: 216 YPKINGVFTCENPALLQDKLRDDWGFKGFVQSDWGAAHSTVGS---------------AN 260

Query: 303 AGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           AG++L+   G +Y      AV  G+++E  +   L   +  +   G FD  P    L   
Sbjct: 261 AGMNLEMIDGTWYGEKMKQAVLAGQVSEQRVGELLLPRFRTMFAFGQFDHPPVASPL--- 317

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
                QH   A E A +G+VLL+N++  LPL+ G +K++AL+GP   AT+A  G    + 
Sbjct: 318 --PTAQHDAAAKEFAERGMVLLRNEHAQLPLDPG-VKSIALIGPF--ATRAKTGGGGSSA 372

Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
              TS +D      + +   PG   +   + S    A   A+ A+ +V++ G +   EAE
Sbjct: 373 VIPTSTVDPLAGLQQRV---PGAV-VTLDDGSDPARAAALARTAEVSVVMVGDN---EAE 425

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIM-SAGAVDINFAKNNPKIKSILWVGYPG 539
           GKDR  L L G Q  L+  VA+A   P T+V++ S G V + +     ++ +IL   YPG
Sbjct: 426 GKDRPSLALDGNQDALVTAVAEA--NPHTVVVVKSGGPVLMPWVS---RVPAILQAWYPG 480

Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGR----------- 588
           ++ G A+A V+FG  NP G+LPIT+  A+         P      FPG            
Sbjct: 481 QQDGAAVAGVLFGDVNPSGKLPITFPAAD------ADTPANTPAQFPGVGGVATYSEGLQ 534

Query: 589 -TYKFFDG---PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPP 644
             Y++FD      ++PFG+GLSYT F Y   +   S D                      
Sbjct: 535 IGYRWFDAQGRAPLFPFGHGLSYTTFAYSGLAVHNSGD---------------------- 572

Query: 645 CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIA 704
                           T    V N G   G+EV  VY   P  AG   +Q+ G+ERV +A
Sbjct: 573 --------------GATATFTVRNTGSRAGAEVAQVYLGFPVAAGEPPRQLKGFERVSLA 618

Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVGEGVGGVSFPLQLNL 756
            GQ+ +V   ++  +   + D AA++   A GA T+ VG      S PLQ  L
Sbjct: 619 PGQARRVTIRLDK-RDFSVWDTAAHAWQPARGAFTVSVGG--SSRSLPLQAPL 668


>gi|423227459|ref|ZP_17213920.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392623089|gb|EIY17195.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 864

 Score =  266 bits (680), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 163/460 (35%), Positives = 248/460 (53%), Gaps = 43/460 (9%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           ++ V   + PY + +L   ERA DL++RMTL EKV QM + +  + RLG+P Y+WW+EAL
Sbjct: 14  TLNVTAQNEPYKNPELSPSERAWDLLKRMTLEEKVSQMKNGSPAIERLGIPAYDWWNEAL 73

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN- 122
           HGV+  G+                AT FP  I   A+F+     +    VS EARA Y+ 
Sbjct: 74  HGVARAGK----------------ATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHD 117

Query: 123 -------LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
                   G  GLTFW+PNIN+ RDPRWGR +ET GEDPY+     +  V+GLQ     +
Sbjct: 118 FQRKGERDGYKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGGGTGK 177

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEG 234
           Y         K  AC KHYA +    W   +R  FD++ ++++D+ ET++  F+  V EG
Sbjct: 178 YD--------KAHACAKHYAVHSGPEW---NRHSFDAKNISQRDLWETYLPAFKTLVKEG 226

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI-VESHKFLNDTK 293
            V  VMC+YNR  G P C++ +LL + +R DW +   +VSDC +I      +H   + T 
Sbjct: 227 KVKEVMCAYNRFEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTA 286

Query: 294 EDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP- 352
             A A  + +G DL+CG  Y++    AV++G I+E  I+ S+  L     +LG FD    
Sbjct: 287 AAASADAVVSGTDLECGGSYSSLNE-AVRKGLISEEKINESVFRLLRARFQLGMFDDDAL 345

Query: 353 -QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKA 411
             +  +  + + + +H+  A E AR+ +VLL N N  LPL+  +I+ +A++GP+AN +  
Sbjct: 346 VSWSEIPYSVVESKEHVAKALEMARKSMVLLTNKNHTLPLSK-SIRKVAVLGPNANDSVM 404

Query: 412 MIGNYEGTPCRYTSPMDGFYAY--SKVINYAPGCADIVCQ 449
           +  NY G P +  + ++G  +      + Y  GC  +  Q
Sbjct: 405 LWANYNGFPTKSVTILEGIKSKLPEGTVYYEKGCDYVNTQ 444



 Score =  126 bits (317), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 90/295 (30%), Positives = 139/295 (47%), Gaps = 53/295 (17%)

Query: 459 DAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPV 508
           D A  ADA + V GL  ++E E            DR ++ LP  Q E++  +    K PV
Sbjct: 595 DKAAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDLPHVQAEMLKALKKTGK-PV 653

Query: 509 TLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN 568
             V+ S   + + +   N  + +IL   YPG++GG A+ADV+FG YNP GRLP+T+Y ++
Sbjct: 654 IFVLCSGSTLALPWEAEN--LDAILEAWYPGQQGGTAVADVLFGDYNPAGRLPLTFYASS 711

Query: 569 YVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKD 628
                   +P     +   RTY++F G  ++PFG+GLSYT F Y  A        K+DK 
Sbjct: 712 ------DDLPDFEDYDMSNRTYRYFKGKALFPFGHGLSYTIFDYGKA--------KVDKQ 757

Query: 629 QQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA 688
                 N   G                     T  I ++N GK+DG EV+ VY + P   
Sbjct: 758 ------NVRAGEG------------------MTLTIPLKNTGKLDGDEVIQVYLRNPADK 793

Query: 689 GTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
              IK +  + RV + AGQ+  +   + A  + +  + + N + +  G + +L G
Sbjct: 794 EGPIKTLRAFRRVSLPAGQTENIRIELPAS-TFECFNPSTNRMEILPGKYELLYG 847


>gi|329923020|ref|ZP_08278536.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
 gi|328941793|gb|EGG38078.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
          Length = 763

 Score =  266 bits (680), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 213/691 (30%), Positives = 334/691 (48%), Gaps = 100/691 (14%)

Query: 87  GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
           GAT FP  +   +++N  L++ I + V+ E RA       G   +SP ++VVRDPRWGR 
Sbjct: 123 GATVFPVPLTIGSTWNTELFRSISRAVAAETRA-----QGGAATYSPVLDVVRDPRWGRT 177

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
            ET GEDP++V  +A+  V+GLQ  E ++ H         + A  KH+A Y       N 
Sbjct: 178 EETFGEDPHLVAEFAVAAVQGLQG-ERLDSH-------TSLLATLKHFAGYGASEGGRNG 229

Query: 207 R-FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
              H   R    ++ E  +LPF   V  G + S+M +YN ++G+P  +   LL   +R  
Sbjct: 230 APVHMGLR----ELHEVDLLPFRKAVESGAL-SIMTAYNEIDGVPCTSSRYLLQNVLREA 284

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQG 324
           W F G++++DC +I  +   H       E A  + LKAG+D++  G  +      A++QG
Sbjct: 285 WGFDGFVITDCGAIHMLACGHNTAGSGVE-AATQSLKAGVDMEMSGTMFRAHLQQALEQG 343

Query: 325 KIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKN 384
            I E D++ +   +  +  RLG FD         +  I   +HI LA +AA +GIVLLKN
Sbjct: 344 LITEDDLNRAAGRVLELKFRLGLFDRPYVDPAWAEQVIGCKEHIALAYQAAAEGIVLLKN 403

Query: 385 DNGALPLNTGNIKTLALVGPHANATKAMIGNYEG--TPCRYTSPMDGF---YAYSKVINY 439
           +   LPL++ +  T+A++GP+A+     +G+Y     P +  + +DG       S+V+ Y
Sbjct: 404 EGNLLPLDSSS-GTIAVIGPNAHTPYHQLGDYTSPQPPGQIVTVLDGIRRRLGDSRVL-Y 461

Query: 440 APGCADIVCQNNSMIPAAIDAAKNADATVIVAG-----------LDLSVEA--------- 479
           APGC  I   +    P A+  A+ AD  V+V G           +DL   A         
Sbjct: 462 APGC-RIQGDSREGFPRALACAEQADVIVMVLGGSSARDFGEGTIDLRTGASVVTGDAKS 520

Query: 480 -----EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
                EG DR  L L G Q EL+ ++    K PV +V ++   +   +   +  I +I+ 
Sbjct: 521 DMECGEGIDRSTLTLMGVQLELLQELQKLGK-PVIVVYINGRPITEPWI--DEFIPAIIE 577

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
             YPG+EGG AIAD++FG  NP GRLP++   E   + I Y +   R      G+ Y   
Sbjct: 578 AWYPGQEGGGAIADMLFGDINPSGRLPLSIPKEVGQLPISYNARRTR------GKRYLET 631

Query: 594 DGPVVYPFGYGLSYTQFKY-KVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
           D    YPFG+GLSYT+F+Y ++   P  V I  +                          
Sbjct: 632 DLAPRYPFGFGLSYTEFRYGRLTVEPAVVPIGGEA------------------------- 666

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKV 711
                   T +I+V N G  DG+EVV +Y      + T  ++ + G+ +VF+ AG++ +V
Sbjct: 667 --------TVRIDVTNAGARDGAEVVQLYVSDLAASVTRPEKALKGFRKVFLKAGETQEV 718

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            FT+ + + L+++      ++  G   I VG
Sbjct: 719 TFTIGS-EQLELIGLDLKPVVEPGEFRIQVG 748


>gi|300783640|ref|YP_003763931.1| beta-glucosidase [Amycolatopsis mediterranei U32]
 gi|399535524|ref|YP_006548186.1| beta-glucosidase [Amycolatopsis mediterranei S699]
 gi|299793154|gb|ADJ43529.1| beta-glucosidase [Amycolatopsis mediterranei U32]
 gi|398316294|gb|AFO75241.1| beta-glucosidase [Amycolatopsis mediterranei S699]
          Length = 684

 Score =  266 bits (680), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 232/773 (30%), Positives = 346/773 (44%), Gaps = 146/773 (18%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQM--------GDLAYGVPRLGLPLYEWWSEALH 64
           P+ DA+     RA +LV  MTL EK+ Q+              +PRLG+P +        
Sbjct: 26  PWRDARQSPDRRAAELVAAMTLDEKISQLHLQPDAEHQRFVPPIPRLGVPGF-------- 77

Query: 65  GVSFIGRRTNSPPGTHFDSEVPG--ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
                 R  N P G     + P   AT+ P  +   ++F+  L ++ G+ +  E RA+ +
Sbjct: 78  ------RIANGPAGMGPADDKPQKPATALPATMALASTFDTGLARRYGRLIGDETRALAH 131

Query: 123 LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
             + G     P+IN+ R PR GR  E  GEDP + G  A   +RG+Q+   +        
Sbjct: 132 NVSEG-----PDINMARVPRNGRTFEGMGEDPVLAGALAAADIRGIQENGTI-------- 178

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
                 A  KHYAA    N +  DR   D  + E+ + E ++  FE  V EG   SVMC+
Sbjct: 179 ------AEVKHYAA----NNQETDRHGIDEHIDERTLNEIYLPHFEQAVTEGHAGSVMCA 228

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           Y ++NG+ TC +P LL   +R DW F G++ SD  +  + V S                 
Sbjct: 229 YPKINGVFTCENPALLQDKLRDDWGFKGFVQSDWGAAHSTVGS---------------AN 273

Query: 303 AGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           AG++L+   G +Y      AV  G+++E  +   L   +  +   G FD  P    L   
Sbjct: 274 AGMNLEMIDGTWYGEKMKQAVLAGQVSEQRVGELLLPRFRTMFAFGQFDHPPVASPL--- 330

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
                QH   A E A +G+VLL+N++  LPL+ G +K++AL+GP   AT+A  G    + 
Sbjct: 331 --PTAQHDAAAKEFAERGMVLLRNEHAQLPLDPG-VKSIALIGPF--ATRAKTGGGGSSA 385

Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
              TS +D      + +   PG   +   + S    A   A+ A+ +V++ G +   EAE
Sbjct: 386 VIPTSTVDPLAGLQQRV---PGAV-VTLDDGSDPARAAALARTAEVSVVMVGDN---EAE 438

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIM-SAGAVDINFAKNNPKIKSILWVGYPG 539
           GKDR  L L G Q  L+  VA+A   P T+V++ S G V + +     ++ +IL   YPG
Sbjct: 439 GKDRPSLALDGNQDALVTAVAEA--NPHTVVVVKSGGPVLMPWVS---RVPAILQAWYPG 493

Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGR----------- 588
           ++ G A+A V+FG  NP G+LPIT+  A+         P      FPG            
Sbjct: 494 QQDGAAVAGVLFGDVNPSGKLPITFPAAD------ADTPANTPAQFPGVGGVATYSEGLQ 547

Query: 589 -TYKFFDG---PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPP 644
             Y++FD      ++PFG+GLSYT F Y   +   S D                      
Sbjct: 548 IGYRWFDAQGRAPLFPFGHGLSYTTFAYSGLAVHNSGD---------------------- 585

Query: 645 CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIA 704
                           T    V N G   G+EV  VY   P  AG   +Q+ G+ERV +A
Sbjct: 586 --------------GATATFTVRNTGSRAGAEVAQVYLGFPVAAGEPPRQLKGFERVSLA 631

Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVGEGVGGVSFPLQLNL 756
            GQ+ +V   ++  +   + D AA++   A GA T+ VG      S PLQ  L
Sbjct: 632 PGQARRVTIRLDK-RDFSVWDTAAHAWQPARGAFTVSVGG--SSRSLPLQAPL 681


>gi|423303577|ref|ZP_17281576.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
           CL03T00C23]
 gi|423307700|ref|ZP_17285690.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
           CL03T12C37]
 gi|392687941|gb|EIY81232.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
           CL03T00C23]
 gi|392689569|gb|EIY82846.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
           CL03T12C37]
          Length = 942

 Score =  266 bits (680), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 229/811 (28%), Positives = 370/811 (45%), Gaps = 146/811 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
           Y D   P   R ++L+++MTL EK  QM  L YG  R+    LP  EW    W +   G+
Sbjct: 53  YEDPSAPLEARIENLLQQMTLDEKTCQMVTL-YGYKRVLKDDLPTPEWKELLWKD---GI 108

Query: 67  SFIGRRTNS---------------PPGTH----------------------FDSE-VPG- 87
             I    N                P   H                      F +E + G 
Sbjct: 109 GAIDEHLNGFQQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGV 168

Query: 88  ----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPR 142
               AT+FPT +    ++N  L +++G     EAR +      G T  ++P ++V RD R
Sbjct: 169 ESYRATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQR 222

Query: 143 WGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNW 202
           WGR  E  GE PY+V    I  VRGLQ       H        +++A  KH+AAY  +  
Sbjct: 223 WGRYEEVYGESPYLVAELGIEMVRGLQ-------HNH------QVAATGKHFAAYSNNKG 269

Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
                   D +++ ++++   I PF+  + E  +  VM SYN  +GIP       L   +
Sbjct: 270 AREGMARVDPQMSPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRL 329

Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTM 318
           RG+  F GY+VSD D+++ +   H    D KE AV + ++AGL++ C     D +     
Sbjct: 330 RGEMGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLR 388

Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIE-LAAEAARQ 377
             V++G ++E  I+  +R +  V   +G FD   Q    G +     +  E +A +A+ +
Sbjct: 389 ELVKEGGLSEEVINDRVRDILRVKFLIGLFDAPYQTDLAGADREVEKEENEAIALQASHE 448

Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV- 436
            +VLLKN +  LPL+  + K +A+ GP+AN     + +Y       T+ ++G    +K  
Sbjct: 449 SVVLLKNADELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKSK 508

Query: 437 --INYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEA 479
             + Y  GC D+V  +                + I  A++ A+ AD  V+V G       
Sbjct: 509 AEVLYTKGC-DLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAVVVLGGGQRTCG 567

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
           E K R  L LPG Q +L+  +    K PV L++++   + IN+A  +  + +IL   YPG
Sbjct: 568 ENKSRTSLDLPGRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPG 624

Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF-----PGRTYKF-- 592
            +GG A+AD++FG YNPGG+L +T +     +IP+ + P +P +       PG T     
Sbjct: 625 SKGGTALADILFGDYNPGGKLTVT-FPKTVGQIPF-NFPCKPSSQIDGGKNPGPTGNMSR 682

Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
            +G  +YPFGYGLSYT F+Y                    D++ T     P  +A     
Sbjct: 683 ING-ALYPFGYGLSYTTFEYS-------------------DLDITPRVITPNESA----- 717

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKV 711
                   T +++V N GK  G EVV +Y +       T+ K + G++R+ +  G++ ++
Sbjct: 718 --------TVRLKVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQEL 769

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            FT++  K L+++D     ++  G   ++ G
Sbjct: 770 SFTIDR-KHLELLDADMKWVVEPGDFVLMAG 799


>gi|423214254|ref|ZP_17200782.1| hypothetical protein HMPREF1074_02314 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392693199|gb|EIY86434.1| hypothetical protein HMPREF1074_02314 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 735

 Score =  266 bits (680), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 223/774 (28%), Positives = 356/774 (45%), Gaps = 105/774 (13%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP 49
           S K K S   Y DAK P  +R  DL+ RMTL EK+ Q+     G              VP
Sbjct: 20  SAKDKKSIPLYKDAKAPIEKRVDDLLSRMTLEEKILQLNQYTMGRNNNVNNIGEEVKKVP 79

Query: 50  -RLGLPLYEWWSEALHG----VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNES 104
             +G  +Y   + AL       +    R   P    +D+     T +P  +    S+N  
Sbjct: 80  AEIGSLIYYDTNPALRNNVQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLGQACSWNPE 139

Query: 105 LWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINY 164
           L +K     + EAR    +     TF SP I+V RDPRWGRV E  GEDPY  G +A   
Sbjct: 140 LVEKACAVTAQEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYANGVFAAAS 194

Query: 165 VRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFI 224
           VRG        Y  D  S   +I+AC KHY  Y      G D  +  + ++ Q + +T++
Sbjct: 195 VRG--------YQGDDMSAEDRIAACLKHYIGYGASE-AGRDYVY--TEISAQTLWDTYL 243

Query: 225 LPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE 284
           LP+EM V  G  +++M S+N ++G+P  A+   + + ++  W   G+IVSD  +I+ +  
Sbjct: 244 LPYEMGVKAG-AATLMSSFNDISGVPGSANHYTMTEILKERWGHDGFIVSDWGAIEQL-- 300

Query: 285 SHKFLNDTKEDAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLM 343
            ++ L   K++A      AGL++D   + Y  +    V++GKI  A +D S+R +  V  
Sbjct: 301 KNQGLAANKKEAAVYAFNAGLEMDMMSHAYDRYMKELVEEGKITMAQVDESVRRVLRVKF 360

Query: 344 RLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG 403
           RLG F+         K     PQ +++AA+ A + +VLLKN+NG LPL   + K +A+VG
Sbjct: 361 RLGLFERPYTPVTNEKERFFRPQSMDIAAQLAAESMVLLKNENGILPLT--DKKKIAVVG 418

Query: 404 PHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV---------INYAPGCADIVCQNNSMI 454
           P A     ++G++    C +    D    Y+ +         + YA GC+     N    
Sbjct: 419 PMAKNGWDLLGSW----CGHGKDTDVAMLYNGLATEFVGKAELRYALGCS-TQGDNRKGF 473

Query: 455 PAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMS 514
             A++AA+ +D  V+  G  ++   E   R  + LP  Q EL  ++  A K P+ LV+++
Sbjct: 474 EEALEAARWSDVVVLCLGEMMTWSGENASRSSIALPQIQEELAKELKKAGK-PIVLVLVN 532

Query: 515 AGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPY 574
              +++N  +  P   +IL +  PG  G   +A ++ G+ NP G+L +T+        PY
Sbjct: 533 GRPLELN--RLEPISDAILEIWQPGVNGALPMAGILSGRINPSGKLAMTF--------PY 582

Query: 575 TS--MPLRPVNNFPGRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKSVDIKLDKD 628
           ++  +P+       GR ++ F   +    +Y FG+GLSYT+FKY                
Sbjct: 583 STGQIPIYYNRRKSGRGHQGFYKDITSEPLYSFGHGLSYTEFKY---------------- 626

Query: 629 QQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA 688
                     GT  P    V       +  K + ++ V N GK DG E V  +   P  +
Sbjct: 627 ----------GTVTPSVTTV------KRGGKLSVEVSVSNTGKRDGLETVHWFISDPYCS 670

Query: 689 GTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
            T  +K++  +E+  I AG++    F ++  +    V+      L  G + I V
Sbjct: 671 ITRPVKELKHFEKQLIKAGETKVFRFDVDLERDFGFVNGNGKRFLEIGEYYIQV 724


>gi|295132888|ref|YP_003583564.1| beta-glucosidase [Zunongwangia profunda SM-A87]
 gi|294980903|gb|ADF51368.1| beta-glucosidase [Zunongwangia profunda SM-A87]
          Length = 855

 Score =  266 bits (680), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 161/451 (35%), Positives = 237/451 (52%), Gaps = 46/451 (10%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY +  L   ERA+DLV R+TL EK   M D++  +PRLG+  + WWSEALHG +     
Sbjct: 13  PYQNPNLSPEERAEDLVNRLTLEEKASLMFDVSEAIPRLGIKKFNWWSEALHGFA----- 67

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--LGNA---- 126
                           T FP  +   ASF++ L  ++    S E RA Y+  L N     
Sbjct: 68  -----------NNDDVTVFPEPVGMAASFDDELVYQVFDATSDEVRAKYHEALRNGEENK 116

Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               L+ W+PN+N+ RDPRWGR  ET GEDPY+  R  +  V+GLQ  E  +Y       
Sbjct: 117 RFLSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGPEDAKYK------ 170

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFD-SRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             K+ AC KHYA +    W    R   + + V+++D+ ET++  F++ V + +V  VMC+
Sbjct: 171 --KLLACAKHYAVHSGPEW---SRHELNLNNVSQRDLWETYLPAFKVLVQDANVRQVMCA 225

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           Y R++  P C   +LL Q +R  W F   +VSDC +IQ    SH   +D    A A+ + 
Sbjct: 226 YQRLDDEPCCGSDRLLQQILREKWGFEHLVVSDCGAIQDFYTSHNVSSDAVH-AAAKAVL 284

Query: 303 AGLDLDCGDYYTNFTM--GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLG 358
           AG D++C     N+ +   AV++G + E DID S++ + I    LG  D      Y  + 
Sbjct: 285 AGTDVECQWDKHNYKLLPEAVEKGLVKEEDIDRSVKRVLIGRFELGEMDPDEIVPYAQIP 344

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
            + I N +H +LA + AR+ + LL+N N  LPL+ G  + +A++GP+A+    + GNY G
Sbjct: 345 ASVINNEEHRQLALKMARESMTLLQNKNNILPLSKGQDR-IAVIGPNADDEPMLWGNYNG 403

Query: 419 TPCRYTSPMDGFYAY--SKVINYAPGCADIV 447
           TP R  S +DG  +    K I Y   C D+V
Sbjct: 404 TPVRTISILDGITSKIGEKSIVYDKAC-DLV 433



 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 83/292 (28%), Positives = 126/292 (43%), Gaps = 53/292 (18%)

Query: 462 KNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLV 511
           K  +  + V GL   +E E          G DR D+ LP  Q   +  + DA K    ++
Sbjct: 590 KGIETVIFVGGLSTKLEGEEMPVSYPGFKGGDRTDIALPSVQRNCLKTLKDAGK---KVI 646

Query: 512 IMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK 571
            ++     I          +IL   Y GE GG+A+ADV+FG YNP G+LP+T+Y+     
Sbjct: 647 FVNNSGSAIGLVPETTSCDAILQAWYGGESGGQAVADVLFGDYNPSGKLPVTFYKDT--- 703

Query: 572 IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQC 631
              T +P     +  GRTY+F     ++PFG+GLSYT FK   A        +LDK +  
Sbjct: 704 ---TQLPDFEDYSMNGRTYRFMKAEPLFPFGHGLSYTNFKIGEA--------QLDKSE-- 750

Query: 632 RDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH 691
             I+ +   N                      I + N GK +G E++ VY    G+    
Sbjct: 751 --IDTSSSVN--------------------ITISISNEGKTEGVEIIQVYVHKQGLEEGP 788

Query: 692 IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
           IK + G++RV +   +   V   +    S +  D  A S+ +  G + I  G
Sbjct: 789 IKTLKGFKRVNLKPNEMKNVTINL-PSNSFEFYDKKARSMKVMPGNYEIFYG 839


>gi|293371677|ref|ZP_06618088.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292633374|gb|EFF51944.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 783

 Score =  266 bits (680), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 214/725 (29%), Positives = 324/725 (44%), Gaps = 119/725 (16%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+PL+    EA HG   IG                 AT FPT I   A+++  L +++
Sbjct: 131 RLGIPLF-LAEEAPHGHMAIG-----------------ATVFPTGIGMAATWSPQLIREV 172

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           G+ +  E R        G   + P +++ RDPRW RV ET GEDP + G      V GL 
Sbjct: 173 GKAIGKEIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAIVEGLG 227

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                       SRP    A  KH+ AY +     N    F      +++ E F+ PF  
Sbjct: 228 G--------GDLSRPYSTLATLKHFLAYGISESGQNGNPSFAGI---RELHENFLPPFRQ 276

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            ++ G +S VM SYN ++G+P  A+  LL + +R +W F G +VSD  SI+ I +SH F+
Sbjct: 277 AIDAGALS-VMTSYNSMDGVPCTANHSLLTELLRNEWKFSGIVVSDLYSIEGIHQSH-FV 334

Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
             T E+A    L AG+D+D G D Y N  M AV  G+I +  +D S+  +  +   +G F
Sbjct: 335 APTMEEAAVLALSAGVDVDLGGDAYMNL-MNAVNTGRIGKTALDASVARVLRLKFEMGLF 393

Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
           +         K  + + + + LA   A+  I LLKN++  LPLN    + +AL+GP+A+ 
Sbjct: 394 ENPYVDPEKAKKEVRSEESVTLARRVAQASITLLKNEHSLLPLNKN--RKVALIGPNADN 451

Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYA--YSKVINYAPGCA--DIVCQNNSMIPAAIDAAK 462
              M+G+Y          + +DG      S  + Y  GC+  D V  +   I  A+ AA+
Sbjct: 452 RYNMLGDYTAPQEEANIKTVLDGIRTKLSSSQVEYVKGCSIRDTVTTD---IEQAVAAAQ 508

Query: 463 NADATVIVAGLDLSVE-----------------------AEGKDRVDLLLPGFQTELINK 499
            ++  + V G   + +                        EG DR  L L G Q EL+  
Sbjct: 509 RSEIIIAVVGGSSARDFKTSYKETGAAIANEKTISDMECGEGFDRATLSLLGKQQELLKA 568

Query: 500 VADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGR 559
           +    K P+ +V +    +D N+A  N    ++L   YPG+EGG AIADV+FG +NP GR
Sbjct: 569 LKTTGK-PLVVVYIEGRPLDKNWASENAD--AVLTAYYPGQEGGIAIADVLFGDFNPAGR 625

Query: 560 LPITWYEANYVKIPYTSMPLRPVNNFP-GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSP 618
           LP +      V      +PL      P    Y       +YPFGYGLSYT F Y      
Sbjct: 626 LPFS------VPRSVGQIPLYYNKKAPQSHDYVEMSASPLYPFGYGLSYTSFDYS----- 674

Query: 619 KSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVV 678
                         D++ +  T +                 F    +V N GK DG EV 
Sbjct: 675 --------------DLHLSALTPR----------------SFEVSFKVRNTGKYDGEEVA 704

Query: 679 MVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAH 737
            +Y +    +    +KQ+  + R ++  G+  +V F ++  +   +VD    S++  G  
Sbjct: 705 QLYLRDEYASVVQPLKQLKHFARFYLKRGEEREVKFILSE-EDFSLVDRNLKSIVEPGTF 763

Query: 738 TILVG 742
            I++G
Sbjct: 764 QIMIG 768


>gi|423240769|ref|ZP_17221883.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
           CL03T12C01]
 gi|392643731|gb|EIY37480.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
           CL03T12C01]
          Length = 864

 Score =  266 bits (679), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 163/448 (36%), Positives = 238/448 (53%), Gaps = 47/448 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y ++ L   ERA+DL++++TL EKV  M D +  V RLG+  Y WW+EALHGV+  G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   ASF       I   VS EARA     +A       
Sbjct: 82  --------------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERY 127

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT W+P +N+ RDPRWGR +ET GEDPY+     +N V+GLQ         D++ +  
Sbjct: 128 QGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCT-------DANQKYD 180

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
           KI AC KH+A +    W   +R  F++  +  +D+ ET+++PFE  V EG V  VMC+YN
Sbjct: 181 KIHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYN 237

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
           R+ G P C   +LL Q +R +W + G ++SDC +I      + HK  +   E A A  + 
Sbjct: 238 RLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHK-THPNAESASAAAVL 296

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
           +G DL+CG  Y      A ++G I+E DID S++ L      LG  D     ++  +  +
Sbjct: 297 SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYS 355

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            +C+ +H  L+ + AR+ + LL N N  LPL  G  +T+A++GP+AN +    GNY GTP
Sbjct: 356 VVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTP 414

Query: 421 CRYTSPMDGFYAY----SKVINYAPGCA 444
               + ++G  +      K+I Y  GC+
Sbjct: 415 KHTITLLEGIRSAMGENDKLI-YEQGCS 441



 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 94/300 (31%), Positives = 133/300 (44%), Gaps = 54/300 (18%)

Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
           I   +   K+AD  +   G+  S+E E            DR D+ LP  Q ELI  + DA
Sbjct: 592 IKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDA 651

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K    ++ ++     I         ++IL   YPG+ GG+A A+V+FG YNP GRLP+T
Sbjct: 652 GK---KVIFVNFSGSPIAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVT 708

Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
           +Y           +P     N  GRTY++F G  ++PFGYGLSYT F Y         +I
Sbjct: 709 FYRN------IAQLPDFEDYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYD--------NI 754

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
           KLD+  +  +    V                         I V N G  DG EVV VY K
Sbjct: 755 KLDQTIKVGETAKMV-------------------------IPVTNAGNRDGEEVVQVYLK 789

Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
               A    K +  ++RV I AG++  V   +   K L+  D   N++   +G   I+VG
Sbjct: 790 KQEDAEGPAKTLRAFKRVQIPAGKTVNVELELTP-KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|315499711|ref|YP_004088514.1| beta-glucosidase [Asticcacaulis excentricus CB 48]
 gi|315417723|gb|ADU14363.1| Beta-glucosidase [Asticcacaulis excentricus CB 48]
          Length = 869

 Score =  266 bits (679), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 169/428 (39%), Positives = 230/428 (53%), Gaps = 49/428 (11%)

Query: 29  VERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGA 88
           + RMT+ +K  QM + A  +P  GL  YEWW+E LHGV+  G                 A
Sbjct: 40  IARMTVEQKAAQMQNRAPDLPSAGLTAYEWWNEGLHGVARAGE----------------A 83

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPNINVVRD 140
           T FP  I   A++N +L K++G  VSTEARA +N  +         GLT WSPNIN+ RD
Sbjct: 84  TVFPQAIGLAATWNPALLKQVGDVVSTEARAKFNSTDPAGDHQRYYGLTLWSPNINIFRD 143

Query: 141 PRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD 200
           PRWGR  ET GEDP++  R A  +V GLQ           D +  K+ A  KH A +   
Sbjct: 144 PRWGRGQETYGEDPFLTSRLAEGFVTGLQG---------PDPQHPKVVASVKHLAVHSGP 194

Query: 201 NWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQ 260
                 R  F + V+  D++ T++  F   V      SVMC+YN V G+P CA   LL  
Sbjct: 195 E---AGRHGFAASVSPYDLEMTYLPAFRYSVMTTKAQSVMCAYNAVGGVPACASDLLLKT 251

Query: 261 TIRGDWNFHGYIVSDCDSIQTIVESHKF-LNDTKEDAVARVLKAGLDLDCGDYYTNFTMG 319
            +R  W F GY+V+DCD+I  +   H + LND +  A +  LKAG+DL+CG+ Y      
Sbjct: 252 YVREAWGFKGYVVTDCDAIYDMTRFHFYRLNDAESSAES--LKAGVDLNCGNAYAALPE- 308

Query: 320 AVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICNPQHIELAAEAARQG 378
           AVQ+G I E+ +D SL  L  V  RLG  DG+P  +  +    I  PQ   LA +AA Q 
Sbjct: 309 AVQKGLIPESLMDQSLNRLLDVRKRLG-IDGAPSPWARISPEAINTPQAQGLALQAAEQS 367

Query: 379 IVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY---SK 435
           +VLLKN NG LPL  G  +T+A++GP+A+  + + GNY G   +  +P+ G  A    +K
Sbjct: 368 LVLLKN-NGVLPLKPG--QTVAVIGPNADTEETLRGNYNGIARQPVTPLTGLRAQLGAAK 424

Query: 436 VINYAPGC 443
           V+ YA G 
Sbjct: 425 VL-YAQGA 431



 Score =  120 bits (300), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 91/290 (31%), Positives = 134/290 (46%), Gaps = 49/290 (16%)

Query: 468 VIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
           ++V G D        DR DL LP  Q +L+  V    K P+ +V++S  AV +N+A  + 
Sbjct: 620 ILVPGFDRG------DRTDLGLPRTQEDLLKAVKATGK-PLVVVLLSGSAVALNWADAHA 672

Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG 587
                 W  YPGE GG AIA  + G+ NP GRLP+T+Y +     P+    +       G
Sbjct: 673 DAVVAAW--YPGEAGGTAIARTLTGEANPSGRLPVTFYRSVQDLPPFIDYRME------G 724

Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
           RTY++F G  +YPFG+GLSYTQF Y         D+KLD          T+   +P    
Sbjct: 725 RTYRYFKGKPLYPFGHGLSYTQFSYS--------DLKLDTS--------TLTAGQP---- 764

Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQ 707
                           + V N G+  G EVV +Y K P   G +   +  + RV + AG+
Sbjct: 765 ------------LRVSVRVRNNGQRAGDEVVQLYVKRPDTFGLN-ASLAAFARVSLKAGE 811

Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLN 757
           S  V  T++  + L  V       + +GA+ + VG G  G +  L  + +
Sbjct: 812 SRTVVMTIDP-RDLSTVTLEGERAIRAGAYGLSVGGGQPGFAPTLNADFS 860


>gi|94497563|ref|ZP_01304132.1| xylosidase/arabinosidase [Sphingomonas sp. SKA58]
 gi|94422980|gb|EAT08012.1| xylosidase/arabinosidase [Sphingomonas sp. SKA58]
          Length = 774

 Score =  266 bits (679), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 227/752 (30%), Positives = 348/752 (46%), Gaps = 114/752 (15%)

Query: 20  PYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGT 79
           P   R +D  + + L   +Q+    A    RLG+P+  +  E LHG + +G         
Sbjct: 93  PRVARGRDPRQTVALVNALQKW---AMTQTRLGIPIL-FHEEGLHGYAAVG--------- 139

Query: 80  HFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVR 139
                   ATSFP  I   +S++  L +++   ++ E R             SP +++ R
Sbjct: 140 --------ATSFPQSIALASSWDPHLVQQVNSVIAREIRV-----RGVPMVLSPVVDIAR 186

Query: 140 DPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDL 199
           DPRWGR+ ET GEDPY+VG   +  V GLQ  EG    R  D RP K+ A  KH   +  
Sbjct: 187 DPRWGRIEETYGEDPYLVGEMGVAAVEGLQG-EG----RSHDLRPGKVFATLKHLTGHGQ 241

Query: 200 DNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLN 259
                N      + ++E++++E F  PFE  V    +++VM SYN ++G+P+  +  LL+
Sbjct: 242 PESGTN---VGPAPISERELRENFFPPFEQVVKRTGINAVMASYNEIDGVPSHMNRWLLD 298

Query: 260 QTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMG 319
             +RG+W F G +VSD   +  ++  H       E A  R L AG+D D  +  +  T+G
Sbjct: 299 DVLRGEWGFRGAVVSDYSGVDQLMNIHHVAGSLDE-AARRALDAGVDADLPEGLSYATLG 357

Query: 320 -AVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIE-LAAEAARQ 377
             V+ GK++EA +D ++R +  +  R G F+  P         + N      LA  AA++
Sbjct: 358 DQVRAGKVSEAQVDKAVRRMLELKFRAGLFE-HPYADAAQAVALTNDAEARALARTAAQR 416

Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SK 435
            I LLKND G LPL      ++A++GP  +A  A +G Y G P    S +DG  A    +
Sbjct: 417 SITLLKND-GMLPLKVEG--SIAVIGP--SAAVARLGGYYGQPPHVVSILDGIKARVGDR 471

Query: 436 V-INYAPGC---------ADIV-----CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
           V I +A G          AD V      +N  +I  A++AA+N D  V+  G       E
Sbjct: 472 VRIVFAQGVKITQDDDWWADKVDKADPAENRRLIAQAVEAARNVDRIVLTLGDTEQSSRE 531

Query: 481 G------KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
           G       DR  L L G Q EL + +    K P+T+V+++      +  K + +  ++L 
Sbjct: 532 GWAANHLGDRPSLDLVGEQQELFDALKTLGK-PITVVLINGRPA--STVKVSEEANALLE 588

Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF---PGRTYK 591
             Y GE+GG A+AD++FG  NPGG+LP+T        +P +   L    N     GR Y 
Sbjct: 589 GWYLGEQGGHAVADILFGDVNPGGKLPVT--------VPRSVGQLPAFYNVKPSAGRGYL 640

Query: 592 FFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID 651
           F     +YPFG+GLSYT F                             T  PP    L  
Sbjct: 641 FDTNAPLYPFGFGLSYTNF-----------------------------TLSPPR---LAQ 668

Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAK 710
                    +  ++V N G  DG EVV +Y      + T  IK++ G+ERV +  G+   
Sbjct: 669 SSIGPGGTTSVTVDVRNDGARDGDEVVQLYIHDKVSSVTRPIKELKGFERVSLKPGEVRT 728

Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           V FT+   +SL++ ++  + ++  G   I+ G
Sbjct: 729 VRFTIT-PESLQMWNDKMHRVVEPGEFEIMTG 759


>gi|86142030|ref|ZP_01060554.1| putative beta-glucosidase [Leeuwenhoekiella blandensis MED217]
 gi|85831593|gb|EAQ50049.1| putative beta-glucosidase [Leeuwenhoekiella blandensis MED217]
          Length = 803

 Score =  266 bits (679), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 229/731 (31%), Positives = 347/731 (47%), Gaps = 123/731 (16%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+PL+    EA+HG   IG                  T FP+ I   ++FN  L KK+
Sbjct: 135 RLGIPLF-LAEEAMHGHMAIG-----------------TTEFPSAIGQASTFNPQLNKKM 176

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           G  V+ E RA           + P +++ R+PRW RV ET GEDPY++    +  + G Q
Sbjct: 177 GAAVAKELRA-----QGAHIGYGPILDLAREPRWSRVEETFGEDPYLISEMGLGVIEGFQ 231

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND-RFHFDSRVTEQDMQETFILPFE 228
             EG+E        P  + +  KH+AAY +     N    H   R   QD    ++ PF+
Sbjct: 232 G-EGIE-------NPESVISTLKHFAAYGVSEGGHNGGAVHIGQRELMQD----YMYPFK 279

Query: 229 MCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKF 288
             ++ G V SVM +Y+ V+GIP+ ++  LL   +R  W F G++VSD  SI+ I   H  
Sbjct: 280 KAIDAG-VLSVMTAYSSVDGIPSTSNKALLTGLLREQWGFEGFVVSDLASIEGIKGDHH- 337

Query: 289 LNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
              T EDA A  + AG+D D G + + +  + A + GK++EA +D +++++  +  ++G 
Sbjct: 338 AAATFEDAAALAMNAGVDADLGGNGFDDELLNAFKNGKVSEARLDEAVKYVLRLKFKMGL 397

Query: 348 FDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
           F+     +   K  + +  HI +A E A +G+ LLKN+NG LPL+   +K +A++GP+A+
Sbjct: 398 FENPYVEEKAPKKVVRSAAHIAIAKEMALEGVTLLKNENGLLPLSK-ELKKIAVIGPNAD 456

Query: 408 ATKAMIGNYEG--TPCRYTSPMDGFYAY--SKVINYAPGCADIVCQNNSMIPAAIDAAKN 463
                +G+Y     P    +P++G  A      I Y  G A I     + IPAA+ AAK+
Sbjct: 457 MMYNQLGDYTAPQEPEFIVTPLEGIRAKMPKAEITYVKGTA-IRDTTQTDIPAAVAAAKS 515

Query: 464 ADATVIVAG----LDLSVE----------------------AEGKDRVDLLLPGFQTELI 497
           A+  ++V G     D   E                       EG DR  L L G Q EL+
Sbjct: 516 AEVAIVVLGGSSARDFKTEYLETGAATVSSKEDQVLSDMESGEGYDRSTLDLMGKQLELL 575

Query: 498 NKVADAAKGPVTLVIMSAGAVDINF-AKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNP 556
             V +A   P  LV+++   + IN+ AK+ P I    W  YPG +GG A+ADV+FG YNP
Sbjct: 576 QAV-EATGTPTILVLITGRPLLINWPAKHIPAIIDT-W--YPGSQGGHALADVLFGDYNP 631

Query: 557 GGRLPITWYEANYVKIPYTSMPLRPV--NNF--PGRTYKFFDGPVVYPFGYGLSYTQFKY 612
            GRLP++        IP  S+   PV  N++    R Y       +Y FG+GLSYT F Y
Sbjct: 632 AGRLPVS--------IP-KSVGQSPVYYNHWWPKRRDYVEETSAPLYAFGHGLSYTTFDY 682

Query: 613 KVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKM 672
                    D+K+ +     +    V                         +EV N G  
Sbjct: 683 S--------DLKISQSGNATNTTIEV------------------------SVEVTNTGDR 710

Query: 673 DGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL 731
           DG EVV +Y S       T +KQ+ G+ER+ +  G+S  V F +   + L + D   N +
Sbjct: 711 DGDEVVQLYLSDVVSSVVTPVKQLRGFERIHLDKGESKTVTFILTPAE-LALFDAEMNHV 769

Query: 732 LASGAHTILVG 742
             +G   + +G
Sbjct: 770 AEAGEFEVQLG 780


>gi|393784569|ref|ZP_10372732.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
           CL02T12C01]
 gi|392665550|gb|EIY59074.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
           CL02T12C01]
          Length = 929

 Score =  265 bits (678), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 149/422 (35%), Positives = 238/422 (56%), Gaps = 38/422 (9%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           P+ D  L + ERAK+LV  +TL EK+ Q+G     +PRL +  Y +W+EA+HGV+  G  
Sbjct: 41  PFQDESLSFHERAKNLVSLLTLEEKINQVGHQTLAIPRLNIKGYNYWNEAIHGVARSGL- 99

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
                          ATSFP     +++++  L        S EAR   N  + GL +W 
Sbjct: 100 ---------------ATSFPVSKAMSSTWDLPLIFDCAVATSDEARVYSNTKDKGLIYWC 144

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           P IN+ RDPRWGR  E  GEDP++ G+ A+ Y++G+Q           D +  K  A  K
Sbjct: 145 PTINMSRDPRWGRDEENYGEDPFLTGKIAVEYIKGMQ---------GDDPKYYKTIATAK 195

Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           H+AA   +N+E   R    S +  ++++E ++  FEM V EG+V SVM +YN +NGIP  
Sbjct: 196 HFAA---NNYE-KGRHSTSSDMDARNLREYYLPAFEMAVKEGNVRSVMSAYNALNGIPCG 251

Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES--HKFLNDTKEDAVARVLKAGLDLDCG 310
           A+ +LL   +R +W F+G++ SDC ++  + +S  H F+N T  +A A  +  G DL+CG
Sbjct: 252 ANHELLIDILRTEWGFNGFVTSDCGAVDDVYQSNRHHFVN-TAAEASAVSIVNGEDLNCG 310

Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHI 368
           + + ++   A+++G + EAD+DT+L  ++     +G FD +    ++++  + +   +H 
Sbjct: 311 NTFQDYCKEAIEKGYMQEADLDTALVRVFEARFSVGEFDNASNVPWRSISDDVLDCEEHR 370

Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
           +LA +AA++ IVLLKNDN  LPL+    K++A++GP  N     +G Y G+P   T+P  
Sbjct: 371 QLAYKAAQEAIVLLKNDNNILPLD--KTKSVAVIGPFGNTI--TLGGYSGSPTALTTPFG 426

Query: 429 GF 430
           G 
Sbjct: 427 GI 428



 Score =  115 bits (288), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 83/274 (30%), Positives = 133/274 (48%), Gaps = 47/274 (17%)

Query: 442 GCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVA 501
           GCA +     + +  A + A  AD  +  AG DL+V  E  DR +L LPG Q +L+  V 
Sbjct: 592 GCA-VTGTAETNLERAKEIAAKADVVIFAAGTDLTVSDESHDRTNLNLPGDQQKLLEAVY 650

Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
            +A   V L++ +  +V IN+AK +  + +I+   Y G+  G+AIADV++G YNP G+L 
Sbjct: 651 -SANPNVILLLQTCSSVTINWAKEH--VPAIIEAWYGGQAQGKAIADVLYGDYNPSGKLT 707

Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGR----TYKFFDGPVVYPFGYGLSYTQFKYKVASS 617
            TWY A       + +P   + N+  R    TY + D   +YPFGYG+SYT F+Y+  + 
Sbjct: 708 STWYNA------LSDLP-NGMLNYDIRDAKYTYMYHDKTPLYPFGYGMSYTTFEYQKLNI 760

Query: 618 PKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEV 677
            KS   +L   ++                                  ++ N GK  G+E+
Sbjct: 761 SKS---RLAAGEE-----------------------------LIVSADITNTGKYAGAEI 788

Query: 678 VMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKV 711
           V +Y+         +KQ++G+ RV +  G++  V
Sbjct: 789 VQLYAHVNSSIERPLKQLVGFARVELEPGETKTV 822


>gi|270296173|ref|ZP_06202373.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|270273577|gb|EFA19439.1| conserved hypothetical protein [Bacteroides sp. D20]
          Length = 942

 Score =  265 bits (678), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 229/811 (28%), Positives = 371/811 (45%), Gaps = 146/811 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
           Y D   P   R ++L+++MTL EK  Q+  L YG  R+    LP  EW    W +   G+
Sbjct: 53  YEDPSAPLEARIENLLQQMTLDEKTCQVVTL-YGYKRVLKDDLPTPEWKELLWKD---GI 108

Query: 67  SFIGRRTNS------PPGTH-------------------------------FDSE-VPG- 87
             I    N       PP  +                               F +E + G 
Sbjct: 109 GAIDEHLNGFQQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGV 168

Query: 88  ----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPR 142
               AT+FPT +    ++N  L +++G     EAR +      G T  ++P ++V RD R
Sbjct: 169 ESYRATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQR 222

Query: 143 WGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNW 202
           WGR  E  GE PY+V    I  VRGLQ       H        +++A  KH+AAY  +  
Sbjct: 223 WGRYEEVYGESPYLVAELGIEMVRGLQ-------HNH------QVAATGKHFAAYSNNKG 269

Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
                   D +++ ++++   I PF+  + E  +  VM SYN  +GIP       L   +
Sbjct: 270 AREGMARVDPQMSPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRL 329

Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTM 318
           RG+  F GY+VSD D+++ +   H    D KE AV + ++AGL++ C     D +     
Sbjct: 330 RGEMGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLR 388

Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIE-LAAEAARQ 377
             V++G ++E  I+  +R +  V   +G FD   Q    G +     +  E +A +A+R+
Sbjct: 389 ELVKEGGLSEEVINDRVRDILRVKFLIGLFDAPYQTDLAGADREVEKEENEAIALQASRE 448

Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK-- 435
            IVLLKN    LPL+  + K +A+ GP+AN     + +Y       T+ ++G    +K  
Sbjct: 449 SIVLLKNAGELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGK 508

Query: 436 -VINYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEA 479
             + Y  GC D+V  +                + I  A++ A+ AD  ++V G       
Sbjct: 509 AEVLYTKGC-DLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCG 567

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
           E K R  L LPG Q +L+  +    K PV L++++   + IN+A  +  + +IL   YPG
Sbjct: 568 ENKSRTSLDLPGRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPG 624

Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF-----PGRTYKF-- 592
            +GG A+AD++FG YNPGG+L +T +     +IP+ + P +P +       PG T     
Sbjct: 625 SKGGTALADILFGDYNPGGKLTVT-FPKTVGQIPF-NFPCKPSSQIDGGKNPGPTGNMSR 682

Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
            +G  +YPFGYGLSYT F+Y                    D++ T     P  +A     
Sbjct: 683 ING-ALYPFGYGLSYTTFEYS-------------------DLDITPRVITPNESA----- 717

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKV 711
                   T +++V N GK  G EVV +Y +       T+ K + G++R+ +  G++ ++
Sbjct: 718 --------TVRLKVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQEL 769

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            FT++  K L+++D     ++  G   ++ G
Sbjct: 770 SFTIDR-KHLELLDADMKWVVEPGDFVLMAG 799


>gi|399029285|ref|ZP_10730258.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
 gi|398072895|gb|EJL64089.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
          Length = 871

 Score =  265 bits (678), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 160/429 (37%), Positives = 231/429 (53%), Gaps = 42/429 (9%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
           +F + +  L   +R  DLV RM++ EK+ Q+ D +  + RLG+P Y WW+E+LHGV+  G
Sbjct: 23  NFAFKNPNLTTEQRVDDLVSRMSIDEKISQLMDSSPAIERLGVPEYNWWNESLHGVARAG 82

Query: 71  RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGN- 125
                            AT FP  I   +S++  L   +   +S EARA ++     G  
Sbjct: 83  Y----------------ATVFPQSISIASSWDRQLIFDVANVISDEARAKHHEYLRRGQH 126

Query: 126 ---AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
               GLTFWSPN+N+ RDPRWGR  ET GEDP++ G+  + YV GLQ          ++ 
Sbjct: 127 GMYQGLTFWSPNVNIFRDPRWGRGHETYGEDPFLTGQLGLKYVNGLQ---------GTNE 177

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
           + LK+ A  KHYA +         R  F++  ++ D+ ET++  F   V EG V SVM +
Sbjct: 178 KYLKVIATAKHYAVHSGPE---PSRHLFNAETSDIDLYETYLPAFRTLVKEGHVYSVMGA 234

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           YNR  G    A P L N  +R  W F GYIVSDC ++  I + HK   D    A A  LK
Sbjct: 235 YNRFRGESCSASPFLFN-ILRNVWGFDGYIVSDCGAVTDIWKYHKITGDAAT-ASALALK 292

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKN 360
            GLDL+CG  + +    A+ +  I+EADID +++ L+    +LG FD      Y  +  +
Sbjct: 293 DGLDLECGSSFKSLKE-AIDRKLISEADIDIAVKRLFTARFKLGMFDPEEIVSYAQIPYS 351

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
              N  H  LA  A+++ IVLLKN N  LPL+  +IKT+A++GP+AN  +++ GNY G P
Sbjct: 352 VNNNSAHDWLARVASQKSIVLLKNQNNTLPLSR-DIKTVAVIGPNANDVQSLWGNYSGVP 410

Query: 421 CRYTSPMDG 429
               + + G
Sbjct: 411 SNPITVLKG 419



 Score =  156 bits (395), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 108/327 (33%), Positives = 160/327 (48%), Gaps = 60/327 (18%)

Query: 433 YSKVINYAPGCADIVCQ------NNSMIPAAIDAAKNADATVIVAGL-------DLSVEA 479
           Y   + Y     D + Q        +++  A+  A  ADA V+V GL       ++ VEA
Sbjct: 562 YKITVKYQNFYGDAIAQLLWAEPQENVLQEAVQVAGQADAIVLVLGLNERLEGEEMKVEA 621

Query: 480 ---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
              EG DR  L LP  Q EL+ K   A   PV LV+++  A+ IN+A  N  + +IL  G
Sbjct: 622 DGFEGGDRTSLDLPSNQEELM-KAMTATGKPVILVLINGSALSINWA--NDHVPAILTAG 678

Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
           YPG++GG AIADV+FG YNP GRLP+T+Y++         +P     +  GRTY++F   
Sbjct: 679 YPGQQGGNAIADVLFGDYNPAGRLPVTYYKST------EQLPAFENYDMKGRTYRYFQKK 732

Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
            +YPFG+GLSYT+FKY     P +V  + D                              
Sbjct: 733 PLYPFGFGLSYTKFKYSNLKLPTNVTPEKD------------------------------ 762

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTM 715
              F   ++V N+G+ DG EV+ +Y K    +    I Q+ G+ERV +  G++  V FT+
Sbjct: 763 ---FEILVDVTNIGERDGDEVIELYLKDEKASTPRPILQLEGFERVNLKKGETKTVRFTI 819

Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
              + L +++     ++  G  TI VG
Sbjct: 820 TP-RQLSLINKKGQRVIEPGWFTISVG 845


>gi|255689951|ref|ZP_05413626.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
 gi|260624557|gb|EEX47428.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 735

 Score =  265 bits (678), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 223/774 (28%), Positives = 358/774 (46%), Gaps = 105/774 (13%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP 49
           S K K S   Y DAK+P  +R  DL+ RMTL EK+ Q+     G              VP
Sbjct: 20  SAKDKKSIPLYKDAKVPIEKRVDDLLSRMTLEEKILQLNQYTMGRNNNVNNIGEEVKKVP 79

Query: 50  -RLGLPLYEWWSEALHG----VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNES 104
             +G  +Y   +  L       +    R   P    +D+     T +P  +    S+N  
Sbjct: 80  AEIGSLIYYDTNPTLRNNVQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLGQACSWNPE 139

Query: 105 LWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINY 164
           L +K     + EAR    +     TF SP I+V RDPRWGRV E  GEDPY  G +A   
Sbjct: 140 LVEKACAVTAQEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFAAAS 194

Query: 165 VRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFI 224
           VRG        Y  D  S   +I+AC KHY  Y      G D  +  + ++ Q + +T++
Sbjct: 195 VRG--------YQGDDMSAEDRIAACLKHYIGYGASE-AGRDYVY--TEISRQTLWDTYL 243

Query: 225 LPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE 284
           LP+EM V  G  +++M S+N ++GIP  A+   + + ++  W   G+IVSD  +I+ +  
Sbjct: 244 LPYEMGVKAG-AATLMSSFNDISGIPGSANHYTMTEILKERWGHDGFIVSDWGAIEQL-- 300

Query: 285 SHKFLNDTKEDAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLM 343
            ++ L   K++A      AGL++D   + Y  +    V++GKI  A +D S+R +  V  
Sbjct: 301 KNQGLAANKKEAAVYAFNAGLEMDMMSHAYDRYMKELVEEGKITMAQVDESVRRVLRVKF 360

Query: 344 RLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG 403
           RLG F+         K     PQ +++AA+ A + +VLLKN+N  LPL   + K +A+VG
Sbjct: 361 RLGLFERPYTPVTSEKERFFRPQSMDIAAQLAAESMVLLKNENQILPLT--DKKKIAVVG 418

Query: 404 PHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV---------INYAPGCADIVCQNNSMI 454
           P A     ++G++    C +    D    Y+ +         + YA GC      N    
Sbjct: 419 PMAKNGWDLLGSW----CGHGKDTDVVMLYNGLATEFVGKAELRYALGCR-TQGDNRKGF 473

Query: 455 PAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMS 514
             A++AA+ +D  V+  G  ++   E   R  + LP  Q EL  ++    K P+ LV+++
Sbjct: 474 EEALEAARWSDVVVLCLGEMMTWSGENASRSSIALPQIQEELAKELKKVGK-PIVLVLVN 532

Query: 515 AGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPY 574
              +++N  +  P   +IL +  PG  G   +A ++ G+ NP G+L +T+        PY
Sbjct: 533 GRPLELN--RLEPISDAILEIWQPGVNGALPMAGILSGRINPSGKLAMTF--------PY 582

Query: 575 TS--MPLRPVNNFPGRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKSVDIKLDKD 628
           ++  +P+       GR ++ F   +    +YPFG+GLSYT+FKY V +   S   K+ + 
Sbjct: 583 SNGQIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTEFKYGVVTLSAS---KVKRG 639

Query: 629 QQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA 688
           +                             K + ++ V N GK DG E V  +   P  +
Sbjct: 640 E-----------------------------KLSAEVTVTNTGKRDGLETVHWFISDPYCS 670

Query: 689 GTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
            T  +K++  +E+  I AG++    F ++  + L  VD      L +G + I V
Sbjct: 671 ITRPVKELKYFEKQSIKAGETKIFRFDIDLERDLGFVDGNGKRFLEAGEYYIQV 724


>gi|160884764|ref|ZP_02065767.1| hypothetical protein BACOVA_02753 [Bacteroides ovatus ATCC 8483]
 gi|156109799|gb|EDO11544.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
          Length = 746

 Score =  265 bits (678), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 194/629 (30%), Positives = 320/629 (50%), Gaps = 73/629 (11%)

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           ++P I+V RDPRWGRVLE  GED ++  R A   VRG Q      ++  S+   L   AC
Sbjct: 158 FAPMIDVSRDPRWGRVLEGAGEDTWLTSRVAEAKVRGYQ------WNLGSNESVL---AC 208

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
            KH+AAY L    G D    D  ++E+ ++E ++ PF+  V  G V++ M ++N + G+P
Sbjct: 209 AKHFAAYGLPQ-AGKDYGTVD--ISERTLEEIYLPPFKAAVEAG-VATFMPAFNDIAGVP 264

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
             A+  LL + +R  W F G +VSD  +I  +V  H   + +K+ AV   + AG+D+D  
Sbjct: 265 CTANKWLLTEVLRNRWKFKGVVVSDWGAIWQLV-PHGMAHGSKQ-AVELSINAGVDMDMA 322

Query: 311 D-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN--ICNPQH 367
           D  Y    +  + +GK+    ID  +R +  +  +LG FD   ++ ++ +    I N   
Sbjct: 323 DGEYNRHALALINEGKVTVGQIDEMVRRILRMKFKLGLFDDPFRFCDVKREKRVIRNCDF 382

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY---EGTPCRYT 424
           I  A +AA++ IVLLKN+N  LPL   +IK++A+VGP A+  K  + +Y   +G    Y 
Sbjct: 383 IAEARKAAQKSIVLLKNENHLLPL-AKDIKSIAVVGPLAD-NKQYLRDYWAGKGEVNDYV 440

Query: 425 SPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
           + ++G      ++ K INYA GC D+   + S    A++AA  ++  +   G   S+  E
Sbjct: 441 TLLEGLKNNLPSHIK-INYAKGC-DVTGTDCSFFSEAVEAANQSELVIAAIGERASMSGE 498

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
              R D+ +PG Q EL+  + D  K PV +V+M+   + I  +K   ++ +I+   + G 
Sbjct: 499 DASRADISIPGVQEELVQALLDTGK-PVVVVLMNGRPLTI--SKLTEQVPAIVEGWFLGT 555

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIP----YTSMPLRPVNNFPGRTYKFFDGP 596
           E G AIADV+ GKYNP G+L ++ +  N  +IP    Y        +     T +F D P
Sbjct: 556 ETGNAIADVLLGKYNPSGKLTMS-FPRNVGQIPVFYNYRQSGRPGTDKLTKWTNRFIDSP 614

Query: 597 V--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
           V  +YPFGYGLSYT F Y   S+P+    +   ++  +                      
Sbjct: 615 VSPLYPFGYGLSYTTFSY---SAPRVSQKEFSTNEILK---------------------- 649

Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGF 713
                    ++V N G+ DG E + +Y +    + T  +K++ G++++F+  G++  VGF
Sbjct: 650 -------VSVDVTNTGQYDGEETIQLYIRDVIASVTRPVKELKGFKKIFLRKGETRTVGF 702

Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVG 742
            + A + L  +      ++ SG   ++ G
Sbjct: 703 ELRA-EDLSFLSQDMEPVIESGEFILMTG 730


>gi|325918730|ref|ZP_08180824.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325535054|gb|EGD06956.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 391

 Score =  265 bits (677), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 154/384 (40%), Positives = 209/384 (54%), Gaps = 41/384 (10%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
           +RA  LV +M+  EKV Q  + A  +PRL +P YEWWSE LHG++  G            
Sbjct: 34  QRAAALVAQMSRDEKVAQAMNDAPAIPRLDIPAYEWWSEGLHGIARNGY----------- 82

Query: 83  SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
                AT FP  I   AS+N +L +++G  VSTEARA +N            AGLT WSP
Sbjct: 83  -----ATVFPQAIGLAASWNTALMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 137

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
           NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ         D  + P  I A  KH
Sbjct: 138 NINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQG--------DDLNHPRTI-ATPKH 188

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
            A +         R  FD  V+ +DM+ T+   F   + +G   SVMC+YN ++G P CA
Sbjct: 189 IAVHSGPE---PGRHGFDVDVSPRDMEATYTPAFRAALVDGQAWSVMCAYNSLHGTPACA 245

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              LLN  +RGDW F G++VSDCD++  + + H F  D    + A  LKAG DL+CG  Y
Sbjct: 246 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 304

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
                 A+++G++ EA +D SL  L+    RLG  +   +  Y  LG  ++ N  H  LA
Sbjct: 305 RELGT-AIERGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 363

Query: 372 AEAARQGIVLLKNDNGALPLNTGN 395
            +AA + IVLLKN    LPL  G 
Sbjct: 364 LQAAAESIVLLKNTATTLPLKAGT 387


>gi|160892207|ref|ZP_02073210.1| hypothetical protein BACUNI_04671 [Bacteroides uniformis ATCC 8492]
 gi|156858685|gb|EDO52116.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           uniformis ATCC 8492]
          Length = 990

 Score =  265 bits (677), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 229/812 (28%), Positives = 372/812 (45%), Gaps = 148/812 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
           Y D   P   R ++L+++MTL EK  QM  L YG  R+    LP  EW    W +   G+
Sbjct: 101 YEDPSAPLEARIENLLQQMTLDEKTCQMVTL-YGYKRVLKDDLPTPEWKELLWKD---GI 156

Query: 67  SFIGRRTNS---------------PPGTH----------------------FDSE-VPG- 87
             I    N                P   H                      F +E + G 
Sbjct: 157 GAIDEHLNGFQQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGV 216

Query: 88  ----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPR 142
               AT+FPT +    ++N  L +++G     EAR +      G T  ++P ++V RD R
Sbjct: 217 ESYRATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQR 270

Query: 143 WGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNW 202
           WGR  E  GE PY+V    I  VRGLQ       H        +++A  KH+AAY  +  
Sbjct: 271 WGRYEEVYGESPYLVAELGIEMVRGLQ-------HNH------QVAATGKHFAAYSNNKG 317

Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
                   D +++ ++++   I PF+  + E  +  VM SYN  +GIP       L   +
Sbjct: 318 AREGMARVDPQMSPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRL 377

Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTM 318
           RG+  F GY+VSD D+++ +   H    D KE AV + ++AGL++ C     D +     
Sbjct: 378 RGEMGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLR 436

Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL--GKNNICNPQHIELAAEAAR 376
             V++G ++E  I+  +R +  V   +G FD +P   +L      +   ++  +A +A+R
Sbjct: 437 ELVKEGGLSEEVINDRVRDILRVKFLIGLFD-APYQTDLADADREVEKEENEAIALQASR 495

Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK- 435
           + IVLLKN    LPL+  + K +A+ GP+AN     + +Y       T+ ++G    +K 
Sbjct: 496 ESIVLLKNAGELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKG 555

Query: 436 --VINYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVE 478
              + Y  GC D+V  +                + I  A++ A+ AD  ++V G      
Sbjct: 556 KAEVLYTKGC-DLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTC 614

Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
            E K R  L LPG Q +L+  +    K PV L++++   + IN+A  +  + +IL   YP
Sbjct: 615 GENKSRTSLDLPGRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYP 671

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF-----PGRTYKF- 592
           G +GG A+AD++FG YNPGG+L +T +     +IP+ + P +P +       PG T    
Sbjct: 672 GSKGGTALADILFGDYNPGGKLTVT-FPKTVGQIPF-NFPCKPSSQIDGGKNPGPTGNMS 729

Query: 593 -FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID 651
             +G  +YPFGYGLSYT F+Y                    D++ T     P  +A    
Sbjct: 730 RING-ALYPFGYGLSYTTFEYS-------------------DLDITPRVITPNESA---- 765

Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAK 710
                    T +++V N GK  G EVV +Y +       T+ K + G++R+ +  G++ +
Sbjct: 766 ---------TVRLKVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQE 816

Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           + FT++  K L+++D     ++  G   ++ G
Sbjct: 817 LSFTIDR-KHLELLDADMKWVVEPGDFVLMAG 847


>gi|317480750|ref|ZP_07939836.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides sp. 4_1_36]
 gi|316903091|gb|EFV24959.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides sp. 4_1_36]
          Length = 942

 Score =  265 bits (677), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 229/812 (28%), Positives = 373/812 (45%), Gaps = 148/812 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
           Y D   P   R ++L+++MTL EK  QM  L YG  R+    LP  EW    W +   G+
Sbjct: 53  YEDPSAPLEARIENLLQQMTLDEKTCQMVTL-YGYKRVLKDDLPTPEWKELLWKD---GI 108

Query: 67  SFIGRRTNS------PPGTH-------------------------------FDSE-VPG- 87
             I    N       PP  +                               F +E + G 
Sbjct: 109 GAIDEHLNGFQQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGV 168

Query: 88  ----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPR 142
               AT+FPT +    ++N  L +++G     EAR +      G T  ++P ++V RD R
Sbjct: 169 ESYRATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQR 222

Query: 143 WGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNW 202
           WGR  E  GE PY+V    I  VRGLQ       H        +++A  KH+AAY  +  
Sbjct: 223 WGRYEEVYGESPYLVAELGIEMVRGLQ-------HNH------QVAATGKHFAAYSNNKG 269

Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
                   D +++ ++++   I PF+  + E  +  VM SYN  +GIP       L   +
Sbjct: 270 AREGMARVDPQMSPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRL 329

Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTM 318
           RG+  F GY+VSD D+++ +   H    D KE AV + ++AGL++ C     D +     
Sbjct: 330 RGEMGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLR 388

Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL--GKNNICNPQHIELAAEAAR 376
             V++G ++E  I+  +R +  V   +G FD +P   +L      +   ++  +A +A+R
Sbjct: 389 ELVKEGGLSEEVINDRVRDILRVKFLIGLFD-APYQTDLADADREVEKEENEAIALQASR 447

Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK- 435
           + IVLLKN    LPL+  + K +A+ GP+AN     + +Y       T+ ++G    +K 
Sbjct: 448 ESIVLLKNAGELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKG 507

Query: 436 --VINYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVE 478
              + Y  GC D+V  +                + I  A++ A+ AD  ++V G      
Sbjct: 508 KAEVLYTKGC-DLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTC 566

Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
            E K R  L LPG Q +L+  +    K PV L++++   + IN+A  +  + +IL   YP
Sbjct: 567 GENKSRTSLDLPGRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYP 623

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF-----PGRTYKF- 592
           G +GG A+AD++FG YNPGG+L +T +     +IP+ + P +P +       PG T    
Sbjct: 624 GSKGGTALADILFGDYNPGGKLTVT-FPKTVGQIPF-NFPCKPSSQIDGGKNPGPTGNMS 681

Query: 593 -FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID 651
             +G  +YPFGYGLSYT F+Y                    D++ T     P  +A    
Sbjct: 682 RING-ALYPFGYGLSYTTFEYS-------------------DLDITPRVITPNESA---- 717

Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAK 710
                    T +++V N GK  G EVV +Y +       T+ K + G++R+ +  G++ +
Sbjct: 718 ---------TVRLKVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQE 768

Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           + FT++  K L+++D     ++  G   ++ G
Sbjct: 769 LSFTIDR-KHLELLDADMKWVVEPGDFVLMAG 799


>gi|441498970|ref|ZP_20981160.1| Beta-glucosidase [Fulvivirga imtechensis AK7]
 gi|441437215|gb|ELR70569.1| Beta-glucosidase [Fulvivirga imtechensis AK7]
          Length = 752

 Score =  265 bits (677), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 219/736 (29%), Positives = 348/736 (47%), Gaps = 104/736 (14%)

Query: 28  LVERMTLPEKVQQM----GDLAYGVPRLGLPLYEWWSEAL-----------HGVSFIGR- 71
           L+ +MTL EKV Q+    GDL    P +     + + + +           HG ++ GR 
Sbjct: 36  LIRQMTLEEKVGQLNFYVGDLFNTGPTVRTTESDKFDQLIREGKLTGLFNVHGAAYTGRL 95

Query: 72  --------RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
                   R   P     D      T FP  + + AS++    +K  +  + E+ A    
Sbjct: 96  QKIAVEESRLGIPLLFGADVIHGFKTVFPIPLASAASWDLEAIEKAERVAAIESTA---- 151

Query: 124 GNAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
             AG+ F ++P +++ RDPRWGR+ E  GEDP++    A   VRG Q+         S +
Sbjct: 152 --AGINFNFAPMVDISRDPRWGRIAEGAGEDPFLGSEVAKARVRGFQE--------QSLT 201

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
            P  ++AC KH+AAY   +  G D    D  ++E+ ++E ++ P++  ++ G  +++M S
Sbjct: 202 DPQTMAACVKHFAAYGAPDG-GRDYNTVD--MSERLLREMYLPPYKAGIDAG-AATIMTS 257

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           +N +NGI       LL   +R +W F G +VSD  S+  +V +H        +A    LK
Sbjct: 258 FNELNGIAASGSQFLLRDILRKEWGFKGMVVSDWQSVNEMV-AHG-NAANNAEAAMMALK 315

Query: 303 AGLDLD-CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL--GK 359
           AG+D+D  GD Y       V +GK+    +D ++R +  +   LG FD   +Y +    K
Sbjct: 316 AGVDMDMMGDVYLEEVPRLVNEGKLDIKFVDEAVRNVLKLKYDLGLFDDPYRYSDTIREK 375

Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHA------NATKAMI 413
           NNI   +H+E A + A++ IVLLKN    LPL   +I T+A++GP A      N T +  
Sbjct: 376 NNIRAVEHLEAARDVAKKSIVLLKNKEKLLPLKK-SIGTIAVIGPLADNQADMNGTWSFF 434

Query: 414 GNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL 473
           G  +          D     S+V+ YA GC ++  ++      A++ AK AD  ++  G 
Sbjct: 435 GEAQHPITFLQGIKDAVSGQSRVL-YAEGC-NLYDRSKDKFAEAVNIAKKADVVILAVGE 492

Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
              +  E   R D+ LPG Q EL+ ++A   K PV  ++MS   +D+++   N  I +IL
Sbjct: 493 SAVMNGEAGSRSDIRLPGIQPELVMEIAKTGK-PVVALVMSGRPLDLSWLDEN--IPAIL 549

Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPIT----------WYEANYVKIPYTSMPLRPVN 583
            V   G E G A ADV+FG YNP G+LP+T          +Y       PY      P++
Sbjct: 550 EVWTLGSEAGNAAADVLFGDYNPSGKLPVTFPRNVGQVPIYYNHKNTGRPYEGDYSEPLS 609

Query: 584 NFPGRT-YKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
               R+ Y+      +YPFGYGLSY+ F+Y         DI L  D        T+   +
Sbjct: 610 ERIYRSKYRDVQNSPLYPFGYGLSYSTFEYS--------DITLSAD--------TLNAGE 653

Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERV 701
              A+V                 + N G  DG EVV +Y +   G     +K++ G++++
Sbjct: 654 SITASV----------------SITNEGPYDGEEVVQLYIRDLVGSVTRPVKELKGFKKL 697

Query: 702 FIAAGQSAKVGFTMNA 717
            I  G++ KV FT+++
Sbjct: 698 MIKNGETVKVDFTLSS 713


>gi|393781366|ref|ZP_10369565.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
           CL02T12C01]
 gi|392676859|gb|EIY70281.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
           CL02T12C01]
          Length = 854

 Score =  265 bits (676), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 162/425 (38%), Positives = 230/425 (54%), Gaps = 53/425 (12%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y D   P  ER  DL+ ++T+ EK+  +   + G+PRL +  Y   +EALHGV       
Sbjct: 28  YLDMNAPQHERILDLLSKLTIEEKISLLRATSPGIPRLQIDKYYHGNEALHGVV------ 81

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
              PG          T FP  I   A +N  L  +I   +S EARA +N    G      
Sbjct: 82  --RPGNF--------TVFPQAIGLAAMWNPQLLNEISTAISDEARARWNELEQGKKQLGQ 131

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFWSP +N+ RDPRWGR  ET GEDP++ G+  +++V+GLQ           D R
Sbjct: 132 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVSFVKGLQG---------DDPR 182

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LKI +  KH+AA    N E ++RF  +  ++E+D++E ++  FE C+ EG  +S+M +Y
Sbjct: 183 YLKIVSTPKHFAA----NNEEHNRFECNPIISEKDLREYYLPAFEKCIIEGKAASIMTAY 238

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N +N +P   +  LL + +R DW F GY+VSDC +   +V  HK++  T E A    ++A
Sbjct: 239 NAINDVPCTLNNWLLKKVLRHDWGFDGYVVSDCGAPDFLVTHHKYVK-TLEAAATLSIQA 297

Query: 304 GLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNI 362
           GLDL+CGD  Y    + A +Q  + EA+ID++   +    MRLG FD      NL   N 
Sbjct: 298 GLDLECGDNVYMEPLLNAYKQYMVTEAEIDSAAYHILRARMRLGLFDDP----NLNPYNK 353

Query: 363 CNP------QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
            +P      +H +LA EAARQ IVLLKN+   LPL+   IK++A+VG   NA     G+Y
Sbjct: 354 ISPSVVGCEKHSQLALEAARQSIVLLKNEKKFLPLDLKKIKSIAVVG--INAGNCEFGDY 411

Query: 417 EGTPC 421
            GTP 
Sbjct: 412 SGTPV 416



 Score =  129 bits (323), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 97/300 (32%), Positives = 145/300 (48%), Gaps = 51/300 (17%)

Query: 456 AAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSA 515
           AA DA +  D T+ V G++ S+E EG+DR  + LP  Q   I +       P T+V++ A
Sbjct: 594 AAGDAMRKCDLTIAVVGINKSIEREGQDRYSIELPKDQQIFIEEAYKI--NPNTVVVLVA 651

Query: 516 GA-VDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPY 574
           G+ + IN+   +  I +I+   YPGE GG A+A+V+FG YNPGG+LP+T+Y +      +
Sbjct: 652 GSSLAINWMDEH--IPAIVNAWYPGEAGGTAVAEVLFGDYNPGGKLPLTYYRSLDELPAF 709

Query: 575 TSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDI 634
               +R      GRTY+FF+G  +Y FG+GLSYT F YK          KL+ D      
Sbjct: 710 DDYDIR-----KGRTYQFFEGNPLYAFGHGLSYTTFSYK----------KLNIDSTG--- 751

Query: 635 NYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG---IAGTH 691
                           D VK           ++N GK DG EV  +Y K  G   +    
Sbjct: 752 ----------------DAVKV-------SFALKNTGKYDGDEVAQLYVKYQGNDSLVKLP 788

Query: 692 IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVGEGVGGVSF 750
           +KQ+ G+ERV +  G+S +V  T+   + L+  D         +G +  +VG     +  
Sbjct: 789 LKQLKGFERVHLKKGESKRVTLTVPKSE-LRFWDEEKGEFYTPAGDYLFMVGTASDAIQL 847


>gi|329963878|ref|ZP_08301220.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
 gi|328527131|gb|EGF54137.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
          Length = 766

 Score =  265 bits (676), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 235/802 (29%), Positives = 378/802 (47%), Gaps = 149/802 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---------------------- 51
           Y D + P  ER +DL+ RMTL EKV QM     G+  +                      
Sbjct: 25  YKDPEAPVKERVEDLLGRMTLEEKVGQMNQFV-GLEHIKANSAVMTEEELKNNTANAFYP 83

Query: 52  GLPLYE--WWSEA------LHGVSF----------IGRRTNSP-----PGTHFDSEVPGA 88
           G+   E   W+E       LH ++           +  R   P        H ++  PG 
Sbjct: 84  GITDKEVAAWTEQGLIGSFLHVLTIEEANYLQSLAMKSRLQIPIIFGIDAIHGNANAPGN 143

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLE 148
           T +PT I    SF+  +  +I +  + E RAM    N   TF +PN+ V RD RWGRV E
Sbjct: 144 TVYPTNINLACSFDTLMAYRIARETAKEMRAM----NMHWTF-NPNVEVARDARWGRVGE 198

Query: 149 TPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY--AAYDLDNWEGND 206
           T GEDPY+V R  +  V+G        Y    DS+   + AC KH+   +  ++   G+ 
Sbjct: 199 TFGEDPYLVTRMGVQSVKG--------YQGSLDSKE-DVLACIKHFVGGSEPINGTNGS- 248

Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
                + ++E+ ++E F  PFE  V  G + S+M ++N +NG+P  ++  L+   +RG+W
Sbjct: 249 ----PADLSERTLREVFFPPFEAGVKAGAM-SLMTAHNELNGVPCHSNEWLMADVLRGEW 303

Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC-GDYYTNFTMGAVQQGK 325
           NF G++VSD   I+   + H    + KE A  + + +G+D+   G ++    +  V++G+
Sbjct: 304 NFPGFVVSDWMDIEHTHDLHATAENLKE-AFYQSIMSGMDMHMHGIHWNEMVVELVKEGR 362

Query: 326 IAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIVLLKN 384
           I E+ ID S+R +  +  RLG F+      +   K  +C  +H   A EAAR GIVLLKN
Sbjct: 363 IPESRIDESVRRILDIKFRLGLFEQPYADVEETMKIRLCG-EHRATALEAARNGIVLLKN 421

Query: 385 DNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR--YTSPMDGFYAYSKVINYAPG 442
           + G LPL+    K + + G +A+  + ++G++         T+ ++G    +    +   
Sbjct: 422 E-GVLPLDPSKYKKIMVTGINAD-DQNILGDWSAPEKEENVTTILEGLRMIAPDTQF--- 476

Query: 443 CADIVCQN---NSMIPAAIDA----AKNADATVIVAGLDL-------SVEAEGKDRVDLL 488
             D V Q     +M P  +D     AKNAD  ++VAG  +         + E  DR DL 
Sbjct: 477 --DFVDQGWDPRNMDPKKVDEAAAHAKNADLNIVVAGEYMMRFRWNDRTDGEDTDRSDLD 534

Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
           L G Q ELI KVA + K P  LV+++   + + +A  N  + +I+    PG +GG+A+A+
Sbjct: 535 LVGLQEELIEKVAASGK-PTVLVLVNGRPLSVRWAAEN--LPAIVEAWAPGMQGGQAVAE 591

Query: 549 VIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV-------VYPF 601
           +++GK NP  +L IT        IP++   L+ + N   +  ++F   V       +YPF
Sbjct: 592 ILYGKVNPSAKLAIT--------IPHSVGQLQMIYNH--KPSQYFHPYVAGKPSTPLYPF 641

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           GYGLSYT +KY+        D+ LD+ +  +D   +VG +                    
Sbjct: 642 GYGLSYTTYKYE--------DLNLDRKEIEKD--GSVGVS-------------------- 671

Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKS 720
             ++V N G  DG E+V +Y +      T  +K++  + RV + AG+S  V F +   K 
Sbjct: 672 --VKVTNTGSRDGVEIVQLYIRDKFSCVTRPVKELKDFARVPLKAGESRVVNFKITPDK- 728

Query: 721 LKIVDNAANSLLASGAHTILVG 742
           L   D     ++  G   ++VG
Sbjct: 729 LAFYDIKMKKVVEPGEFIVMVG 750


>gi|157363220|ref|YP_001469987.1| glycoside hydrolase family protein [Thermotoga lettingae TMO]
 gi|157313824|gb|ABV32923.1| glycoside hydrolase family 3 domain protein [Thermotoga lettingae
           TMO]
          Length = 779

 Score =  265 bits (676), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 233/812 (28%), Positives = 374/812 (46%), Gaps = 156/812 (19%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDL-AYGVPRLGLPLYEWWSEAL--HGVSFIG 70
           Y +A LP   R KDL+ RMTL EKV Q+G + +Y +           +EAL  +G+  I 
Sbjct: 7   YKNASLPVDIRVKDLLSRMTLDEKVAQLGSVWSYELLDDQGNFSNEKAEALLKNGIGQIT 66

Query: 71  R---------------------------RTNSPPGTHFDSEVP----GATSFPTVILTTA 99
           R                           R   P   H +        GAT+FP  I   +
Sbjct: 67  RPGGATNLSAKEVARLINQIQKYLIEQTRLGIPAIMHEECLTGYMGLGATNFPQAIAMAS 126

Query: 100 SFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR 159
           +++  L +K+  T+  + R M    + GL   +P ++VVRDPRWGR  E+ GE  Y+V +
Sbjct: 127 TWDPELIEKMTSTIREDMRQMGI--HQGL---APVLDVVRDPRWGRTEESFGESAYLVAK 181

Query: 160 YAINYVRGLQDVEGVEYHRDSDSRPLK--ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQ 217
             ++Y+ GLQ             + +K  + A  KH+  Y     EG   +   + + E+
Sbjct: 182 MGVSYIIGLQ------------GKDIKNGVIATAKHFVGYGAS--EGGKNWA-PTNIPER 226

Query: 218 DMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCD 277
           +++E F+ PFE  V E  V SVM SY+ ++GIP  +  +L    +R +W F G +VSD  
Sbjct: 227 ELREIFMFPFEAAVKEASVMSVMNSYSEIDGIPCASSKELFTGVLRKNWGFSGIVVSDYF 286

Query: 278 SIQTIVESHKFLNDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSL 335
           +I  + E H+   D KE A    L+AG+D++    D YT      V+QG I+E+ ++ + 
Sbjct: 287 AIDMLREYHRLAKDKKE-AAKYALQAGIDVELPKADCYTTIRE-LVEQGLISESTVNQAT 344

Query: 336 RFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGN 395
             +  +   LG FD    Y ++ K  +   +H  +A E AR+ IVLLKND G LPL    
Sbjct: 345 SRVLQIKFMLGLFD--KPYVDVEKIEL--KKHYSIATEIARKSIVLLKND-GILPLKKD- 398

Query: 396 IKTLALVGPHANATKAMIGNY-----------EGTPCRYTSPMDGFYAYSKVIN------ 438
              +ALVGP+A+  + ++G+Y                 + +P        KVIN      
Sbjct: 399 -AKIALVGPNASEVRNLLGDYAYLAHIKVLLDSVNQTTFNAPKFNLKNVEKVINESIEKI 457

Query: 439 ---------------YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG-----LDLSVE 478
                          +A GC DI+  +      A+ A KNAD  V+V G      +    
Sbjct: 458 PSILDSMKAEGVIFTHAIGC-DILNSSTEGFSEALHAVKNADIAVVVVGDRSGLTEDCTS 516

Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN-NPKIKSILWVGY 537
            E +D  +L LPG Q EL+ ++A   K P+ LV+++     +   KN   ++ +I+ +  
Sbjct: 517 GESRDSANLKLPGVQEELVLEIAKCGK-PIVLVLVTGRPYSL---KNIVSRVNAIIEMWL 572

Query: 538 PGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTY---KFF 593
           PGE GG A+ DV+FGK NPGG+LPI++   A  + + +   P        GR++    + 
Sbjct: 573 PGEVGGMALVDVLFGKVNPGGKLPISFPRSAGQIPVYHDVKP------SGGRSHWHKDYV 626

Query: 594 DGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID 651
           D  V  ++ FG+GLSYT+F++                      N  +   K P       
Sbjct: 627 DELVEPLFSFGHGLSYTKFEFS---------------------NLVIEPQKIPS------ 659

Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAK 710
                D + T +++V+N G+++G EVV +Y      + T  IK++ G++R+ +  G+S  
Sbjct: 660 -----DGQVTIKVDVKNSGEVEGDEVVQLYLTREHASVTRPIKELKGFKRITLKPGESRT 714

Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
             F ++    L   D     ++  G    ++G
Sbjct: 715 TVFKIH-TDVLAYYDRGMELVVEPGVFKAMIG 745


>gi|319901343|ref|YP_004161071.1| glycoside hydrolase 3 [Bacteroides helcogenes P 36-108]
 gi|319416374|gb|ADV43485.1| glycoside hydrolase family 3 domain protein [Bacteroides helcogenes
           P 36-108]
          Length = 781

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 243/823 (29%), Positives = 373/823 (45%), Gaps = 175/823 (21%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQM------------GDLAYGVPRL------GLPL 55
           Y  A  P   R KDL+ RMT+ EKV Q+            G     V  L        P+
Sbjct: 26  YKQAGAPIEYRVKDLIGRMTVEEKVAQLCCPLGWEMYTKTGKNTVEVSALYKEKMKDAPV 85

Query: 56  YEWWS-------------------------EALHGVSFIGRRTNSPPGTHFDSEVP---- 86
             +W+                          AL   +    R   P    F  E P    
Sbjct: 86  GSFWAVLRADPWTQKTLETGLNPELAAKALNALQKYAVEETRLGIP--VLFAEECPHGHM 143

Query: 87  --GATSFPTVILTTASFNESLWKKIGQTVSTEARAM-YNLGNAGLTFWSPNINVVRDPRW 143
             GAT FPT +   ++++ESL +++G+ ++ EAR    N+G      + P ++V R+PRW
Sbjct: 144 AIGATVFPTALSAASTWDESLMQQMGEAIALEARLQGANIG------YGPVLDVAREPRW 197

Query: 144 GRVLETPGEDPYVVGRYAINYVRGLQ-DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNW 202
            R+ ET GEDP +     +  ++G+Q DV+    H         + +  KH+AAY +   
Sbjct: 198 SRMEETFGEDPVLTSVMGVALMKGMQGDVQNDGKH---------LYSTLKHFAAYGVP-- 246

Query: 203 EGNDRFHFDSRVTE--QDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQ 260
              +  H  SR     + +   ++ PF+  V  G   ++M SYN ++G+P  ++  LL +
Sbjct: 247 ---ESGHNGSRANSGMRQLFSEYLPPFKKAVEAG-AGTIMTSYNSIDGVPCTSNKFLLTE 302

Query: 261 TIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC-GDYYTNFTMG 319
            +R  W F G++ SD  SI+ IV   +   D KE A A+ L+AGLD+D  GD +      
Sbjct: 303 VLRNQWGFKGFVYSDLISIEGIV-GMRAAKDNKE-AAAKALRAGLDMDLGGDAFGRNLKQ 360

Query: 320 AVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGI 379
           A ++G I   D+D ++  +  +  ++G F+           +I + +H ELA   AR+G+
Sbjct: 361 AYEEGLITMDDLDRAVSNVLRLKFQMGLFENPYVSPEQAGKHIRSREHKELARRVAREGV 420

Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR--YTSPMDGFYA---YS 434
           VLLKND G LPL+  ++K +A++GP+A+     +G+Y     R    + +DG  A    +
Sbjct: 421 VLLKND-GVLPLDK-HLKRIAVIGPNADMMYNQLGDYTAPQDRKEIVTVLDGVRAAVSKT 478

Query: 435 KVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG--------------------LD 474
             + Y  GCA +     S IPAA+ AA+ ADA ++V G                     D
Sbjct: 479 TQVVYVKGCA-VRDTTESDIPAAVAAAQRADAVILVVGGSSARDFKTKYISTGAATVSED 537

Query: 475 LSVE-----AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
           + V       EG DR  L L G Q +LIN VA   K P+ ++ ++  A+++N A +  K 
Sbjct: 538 IKVLPDMDCGEGFDRSSLRLLGDQEKLINAVAATGK-PLVVIYIAGRAMNMNLAAD--KA 594

Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG-- 587
           +++L   YPGE+GG  IAD++FG YNP GRLP++        IP +   L PV    G  
Sbjct: 595 RALLAAWYPGEQGGAGIADILFGDYNPAGRLPVS--------IPRSEGQL-PVFYSQGTQ 645

Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
           R Y    G  +Y FGYGLSYT+F Y      K  D++  +   C                
Sbjct: 646 RDYVEEKGTPLYAFGYGLSYTKFVYSALEMRKGTDVETLQTVSC---------------- 689

Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY--------SKPPGIAGTHIKQVIGYE 699
                             V N G  DG EVV +Y        S+PP +       +  + 
Sbjct: 690 -----------------TVTNTGDRDGEEVVQLYICDEVASVSQPPIL-------LKAFR 725

Query: 700 RVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           R+F+  G+S KV F +     L I D+  N ++  G   ++VG
Sbjct: 726 RIFLKKGESRKVTFLLKK-DDLAIYDDEMNYVVEPGDFKVMVG 767


>gi|255693561|ref|ZP_05417236.1| periplasmic beta-glucosidase(Cellobiase) [Bacteroides finegoldii
           DSM 17565]
 gi|260620626|gb|EEX43497.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 800

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 228/800 (28%), Positives = 359/800 (44%), Gaps = 141/800 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
           Y D   P   R  DL+ +MTL EK  QM  L YG  R+     P   W    W + +   
Sbjct: 56  YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTDGWSTEIWKDGIGNI 114

Query: 64  ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
               +G+   G   + P                       P    +  + G     AT F
Sbjct: 115 DEQANGLGKFGSEISYPYANSVKNRHTVQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMF 174

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
           P      A++N+ L ++I +  + EA+A+      G T  ++P +++ +DPRWGRV+E+ 
Sbjct: 175 PAQCGQGATWNKKLIREIAKVTADEAKAL------GYTNIYAPILDIAQDPRWGRVVESY 228

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+VG      + GLQ  EG             I A  KH+A Y +     +     
Sbjct: 229 GEDPYLVGELGKQMILGLQS-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 274

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
           D  V  ++M+  ++ PF   + E     VM SYN  +G P       L + +R  W F G
Sbjct: 275 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 334

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
           Y+VSD ++++ +   H+ +  T+E+  A+V+ AGL++      TNFT           A+
Sbjct: 335 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAI 388

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
            +GK++   +D  +  +  V   +G FD   P      +  + N  H +++  AA + IV
Sbjct: 389 SEGKVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPEVVVHNAAHQDVSMRAALESIV 448

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
           LLKN+   LPL+  +   +A++GP+A   K +   Y        +   G   Y  +  + 
Sbjct: 449 LLKNEKEMLPLSK-SFSKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNAEVR 507

Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
           YA GC DI+                Q  +MI  A++ AK +D  ++V G +     E   
Sbjct: 508 YAKGC-DIIDKYFPESELYNVPLDTQEQAMINEAVELAKASDVAILVLGGNEKTVREEFS 566

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R +L L G Q +L+  V    K PV LV++   A  IN+A  N  + +I+   +PGE  G
Sbjct: 567 RTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMG 623

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
            AIA V+FG YNPGGRL +T +  +  +IP+ + P +P ++  G   K     V+YPFGY
Sbjct: 624 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKG---KVRVAGVLYPFGY 678

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           GLSYT F Y         D+K+               +KP   A             T  
Sbjct: 679 GLSYTTFGYS--------DLKI---------------SKPVIGA---------QENITLS 706

Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
             V+N GK  G EVV +Y +    + T   +V+ G+ER+ +  G+   + FT+   + L 
Sbjct: 707 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTISFTLTP-QDLG 765

Query: 723 IVDNAANSLLASGAHTILVG 742
           + D      +  G+ +++VG
Sbjct: 766 LWDKNNQFTVEPGSFSVMVG 785


>gi|260062042|ref|YP_003195122.1| beta-glucosidase [Robiginitalea biformata HTCC2501]
 gi|88783604|gb|EAR14775.1| beta-glucosidase [Robiginitalea biformata HTCC2501]
          Length = 763

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 193/614 (31%), Positives = 313/614 (50%), Gaps = 79/614 (12%)

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           ++P +++ RDPRWGRV+E  GEDPY+  R  +  VRG Q         D  S PL I+AC
Sbjct: 163 FAPMVDISRDPRWGRVMEGAGEDPYLGSRVGVARVRGFQG--------DDLSDPLTIAAC 214

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
            KH+A Y     EG  R +  +      +    + PF+  V+ G  ++VM S+N +NGIP
Sbjct: 215 LKHFAGYGFA--EGG-RDYNTADFGLSTLYNVVLPPFQAGVDAG-AATVMNSFNVLNGIP 270

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV-ARVLKAGLDLDC 309
             AD  L    ++  W+F G++VSD  SI  ++  H +  D  E A+ A V  + +D++ 
Sbjct: 271 ATADAFLQRDILKAAWDFQGFVVSDWGSIGEMI-PHGYARDRNEAALRAAVAGSDMDMES 329

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN--ICNPQH 367
           G Y T      V+ GK+ E+ +D ++  +  +   LG F    +Y +  +    + NP  
Sbjct: 330 GMYLTELPE-LVRDGKVPESLVDEAVLRILGLKYDLGLFADPYRYADAEREKRILSNPAR 388

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT--PCRYTS 425
           +E   + AR+ IVLLKN+ G LPL+  N  ++AL+GP A+   + +G++  T  P    S
Sbjct: 389 LETVRDMARKSIVLLKNEGGVLPLSK-NGGSIALIGPLASDKDSPLGSWRLTAEPNSAVS 447

Query: 426 PMDGFYAYS-KVINYAPGC------------ADIVCQNNSMIPAAIDAAKNADATVIVAG 472
            ++G  AYS   + Y  G               I   + S IPAA++ A++++  V+V G
Sbjct: 448 VLEGMQAYSGNTLAYERGVPLAEGETAFVFETKINTTDRSGIPAAVELARSSETVVMVLG 507

Query: 473 LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
                  EG+ R  L LPG Q EL+  V  A    + LV+M+   + IN+A  +  + +I
Sbjct: 508 EHGFQSGEGRSRAALGLPGLQQELLEAV-HAVNPNIVLVLMNGRPLTINWAAEH--VPAI 564

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA-NYVKIPYTSMPL-RPVNNFPGRTY 590
           L   + G E G AIA+V++G YNP G+LP+T+ ++   + + Y+ +   RP   +PG   
Sbjct: 565 LEAWHLGTESGHAIAEVLYGDYNPSGKLPMTFPKSVGQIPVYYSHLATGRP--EYPGNDL 622

Query: 591 KFFDGPV------VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPP 644
            F+   +      +YPFG+GLSY+ F+Y         D+KL    Q  +I         P
Sbjct: 623 VFWSHYIDQVNEPLYPFGHGLSYSDFRY--------ADLKL----QTTEIR--------P 662

Query: 645 CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFI 703
             ++ +             + +EN     G+E+V +Y +   G     ++++ G+E+VF+
Sbjct: 663 GGSLEV------------SVRLENASDTPGTEIVQLYVRDHFGSRARPVRELKGFEKVFL 710

Query: 704 AAGQSAKVGFTMNA 717
            AG SA+V FT++A
Sbjct: 711 EAGGSAEVSFTLSA 724


>gi|423287910|ref|ZP_17266761.1| hypothetical protein HMPREF1069_01804 [Bacteroides ovatus
           CL02T12C04]
 gi|392671925|gb|EIY65396.1| hypothetical protein HMPREF1069_01804 [Bacteroides ovatus
           CL02T12C04]
          Length = 782

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 221/723 (30%), Positives = 343/723 (47%), Gaps = 114/723 (15%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+P++    EA HG   IG                 AT FPT I   A+++  L K++
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------------ATVFPTGIGMAATWSLELVKEV 170

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           GQ ++ E R+       G   + P +++ RDPRW RV ET GEDP + G    + V GL 
Sbjct: 171 GQVIAKEIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLG 225

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                     + S+     A  KH+ AY +   EG    ++ S V  +D+ + F+ PF  
Sbjct: 226 G--------GNLSQKYATIATLKHFLAYAVP--EGGQNGNYAS-VGIRDLHQNFLPPFRK 274

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            ++ G +S VM SYN ++GIP  ++  LL Q +R +W F G++VSD  SI+ I ESH F+
Sbjct: 275 AIDSGALS-VMTSYNSIDGIPCTSNHYLLTQLLRNEWKFCGFVVSDLYSIEGIHESH-FV 332

Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
             TKE+A  + + AG+D+D G D YTN    AVQ G++ +A IDT++  +  +   +G F
Sbjct: 333 ALTKENAAIQSVTAGVDVDLGGDAYTNLCH-AVQSGQMDKAVIDTAVCRVLRMKFEMGLF 391

Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
           +       +    +   +HIELA + A+  I LLKN+N  LPL+   I  +A++GP+A+ 
Sbjct: 392 EHPYVDPKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSK-TINKVAVIGPNADN 450

Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYAYSK--VINYAPGCADIVCQNNSMIPAAIDAAKNA 464
              M+G+Y          + +DG         + Y  GCA I     + I  AI AA+ +
Sbjct: 451 RYNMLGDYTAPQEDSNVKTVLDGILTKLSPFRVEYVRGCA-IRDTTVNEIEQAIKAARRS 509

Query: 465 D----------------------ATVIVAGLDLSVE-AEGKDRVDLLLPGFQTELINKVA 501
           +                      A V   G    +E  EG DR  L L G Q EL+  + 
Sbjct: 510 EVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQ 569

Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
              K P+ +V +    ++ N+A       ++L   YPG+EGG AIADV+FG YNP GRLP
Sbjct: 570 KTGK-PLIVVYIEGRPLEKNWASEYA--DALLTAYYPGQEGGNAIADVLFGDYNPSGRLP 626

Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
           I+    +  +IP       P N+     Y       +Y FGYG+SYT F+Y         
Sbjct: 627 IS-VPRSVGQIPVYYNKKAPRNH----DYVEMSSFPLYSFGYGMSYTTFEYS-------- 673

Query: 622 DIK-LDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
           D++ + K  +C ++++                            +V+N GK DG EV  +
Sbjct: 674 DLQVVQKSARCFEVSF----------------------------KVKNTGKYDGEEVSQL 705

Query: 681 YSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
           Y +    +    +KQ+  +ER  +  G+  KV F +   +   +V+     ++ SG   +
Sbjct: 706 YMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGNFHL 764

Query: 740 LVG 742
           ++G
Sbjct: 765 MIG 767


>gi|313204470|ref|YP_004043127.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312443786|gb|ADQ80142.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 746

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 214/677 (31%), Positives = 331/677 (48%), Gaps = 81/677 (11%)

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLE 148
           T+FP  +  TAS++ +L +K  +  +TEA A         TF +P +++ RDPRWGRV+E
Sbjct: 113 TTFPIPLGETASWDLALIEKSARIAATEASAY----GVQWTF-APMVDIARDPRWGRVME 167

Query: 149 TPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRF 208
             GED Y+    A   V G Q         +       I AC KH+AAY      G D  
Sbjct: 168 GAGEDTYLGSLVAKARVHGFQG--------NGLGNVDAIMACAKHFAAYGA-AIGGRDYN 218

Query: 209 HFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNF 268
             D  ++ + + ET++ PF+  V E +V++ M S+N +NGIP  A+  +    ++G WNF
Sbjct: 219 SVD--MSLRQLNETYLPPFKAAV-EANVATFMNSFNDINGIPATANKYIQRDILKGQWNF 275

Query: 269 HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIA 327
            G++VSD  SI  ++ +H +  D+  DA  + + AG D+D     Y N     VQ GK+ 
Sbjct: 276 KGFVVSDWGSIGEMI-AHGYAKDSY-DAAMKAINAGSDMDMESRCYRNNLKQLVQDGKVD 333

Query: 328 EADIDTSLRFLYIVLMRLGYFDGSPQYKNLG--KNNICNPQHIELAAEAARQGIVLLKND 385
            + ID +++ + +    LG FD   ++ N    K    NP++   A E  ++ IVLLKN+
Sbjct: 334 ISVIDEAVKRILVKKFELGLFDDPYRFCNAAREKKQTNNPENRAFAREIGKKSIVLLKNE 393

Query: 386 ---NGA--LPLNTGNIKTLALVGPHANATKAMIG----NYEGTPCRYTSPMDGF---YAY 433
              NG   LPL +   KT+AL+GP   ATKA  G     +     R  S   G       
Sbjct: 394 PLSNGKTLLPL-SKQTKTVALIGPLFKATKANHGFWSIAFPDDSTRIISQYQGIKNQLDK 452

Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQ 493
           S  I YA GC +I   + +    AI+AAK+AD  ++  G    +  E K + +L LPG Q
Sbjct: 453 SSSIVYAKGC-NINDNDKTGFAEAINAAKSADVVIMSLGEAADMSGEAKSKSNLQLPGVQ 511

Query: 494 TELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGK 553
            EL+ ++    K PV L++ +   +  N+A +N  I SIL+  + G E G AIADV+FG 
Sbjct: 512 EELLKEIYKTGK-PVVLLLNAGRPLIFNWASDN--IPSILYTWWLGTEAGNAIADVLFGD 568

Query: 554 YNPGGRLPITW-YEANYVKIPY----TSMPLRPVN--NFPGRTYKFFDGPVVYPFGYGLS 606
           YNP G+LPI++      + I Y    T  P +  N  N+        + P  YPFGYGLS
Sbjct: 569 YNPAGKLPISFPRTEGQIPIYYNHFNTGRPAKDENDKNYVSAYIDLQNSP-KYPFGYGLS 627

Query: 607 YTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEV 666
           YT+F           ++KL  D+                             K T  +++
Sbjct: 628 YTKFDIS--------NLKLSSDKL------------------------SSGNKLTVTVDI 655

Query: 667 ENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVD 725
            N G  DG EVV +Y +   G     +K++ G++++ +  G++ ++ FT+   + LK  +
Sbjct: 656 ANTGNYDGEEVVQLYVRDLVGSVVRPVKELKGFQKLMLKKGETKQLTFTLTP-EDLKFFN 714

Query: 726 NAANSLLASGAHTILVG 742
           N    +  +G + + VG
Sbjct: 715 NEIQYINEAGDYELFVG 731


>gi|299140913|ref|ZP_07034051.1| periplasmic beta-glucosidase [Prevotella oris C735]
 gi|298577879|gb|EFI49747.1| periplasmic beta-glucosidase [Prevotella oris C735]
          Length = 767

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 205/699 (29%), Positives = 324/699 (46%), Gaps = 98/699 (14%)

Query: 88  ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
           ATSFP       +++ +L ++I    + EA A+      G T  ++P ++V RDPRWGRV
Sbjct: 119 ATSFPAQCGQGVTWDRALIRQIANVTAQEASAL------GYTNVYAPILDVSRDPRWGRV 172

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
           +E   E PY+ G      V GLQ+               +I +  KH+A Y L     ++
Sbjct: 173 VECYSESPYLAGELGKQMVLGLQEN--------------RIVSTPKHFAVYSLPVGGRDE 218

Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
               D  V  ++M+   + PF   + EG    VM SYN  +G P    P  L + +R  W
Sbjct: 219 GTRTDPHVAPKEMKTLLLEPFRKAIQEGGALGVMSSYNDYDGEPITGSPYFLTELLRHQW 278

Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTM-------- 318
            FHGY+VSD ++++ +   H    + +E+  A  + AGLD+      TNF+M        
Sbjct: 279 GFHGYVVSDSEAVEFLSSKHHVAAN-REEGAAMAINAGLDVR-----TNFSMPETFILPL 332

Query: 319 -GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN--ICNPQHIELAAEAA 375
             A+  G ++   +D  ++ +  V   LG FD +P   N+ + +  + +  H +L+  AA
Sbjct: 333 RQALTDGLVSMQILDARVKDVLYVKFWLGLFD-NPYRGNVNEVDQVVHSKAHQQLSLRAA 391

Query: 376 RQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY-- 433
            + IVLLKN+N  LPL+  ++K +A++GP+A+AT A +  Y        S + G      
Sbjct: 392 LESIVLLKNENNLLPLSK-SLKRIAVIGPNADATTAHVCRYGPANAPIKSVLSGIRESMP 450

Query: 434 SKVINYAPGCA--------------DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
              + YA GC+               +      MI  A+  A+ +D  V+V G       
Sbjct: 451 GAEVRYAKGCSIVDKHFPESELYEVALDTTEQRMIDEAVGVARQSDVAVVVLGGSEETVR 510

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
           E   R DL L G Q +L+  V    K PV LV++   A  IN+A  N  + +I+   +PG
Sbjct: 511 EEYSRTDLNLMGRQEQLLRAVYATGK-PVVLVLLDGRAATINWA--NQYVPAIVHGWFPG 567

Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
           E  G A+A V+FG YNPGG+L +T +  +  +IPY + P +P  +  G      DG  +Y
Sbjct: 568 EFTGTAVAKVLFGDYNPGGKLAVT-FPKSVGQIPY-AFPFKPGADSKGPVR--VDG-ALY 622

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           PFGYGLSYT F Y         D  +               +KP        +V CK   
Sbjct: 623 PFGYGLSYTTFAYS--------DFHI---------------SKPVIGIQGETEVSCK--- 656

Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
                 V N G+ +G E+V +Y +       T+ K + G+ER+ + AG+   V F +   
Sbjct: 657 ------VRNTGQREGDEIVQLYIRDDISSVTTYQKSLRGFERIHLKAGEETTVRFMLTP- 709

Query: 719 KSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLN 757
           + L + +     ++  G  TI++G     +    +L +N
Sbjct: 710 RDLSLWNKHEEFVVEPGTFTIMIGRSSEDICLHGKLTVN 748


>gi|448415866|ref|ZP_21578437.1| beta-glucosidase [Halosarcina pallida JCM 14848]
 gi|445680029|gb|ELZ32480.1| beta-glucosidase [Halosarcina pallida JCM 14848]
          Length = 765

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 218/792 (27%), Positives = 357/792 (45%), Gaps = 142/792 (17%)

Query: 24  RAKDLVERMTLPEKVQQMGD-----LAYGVPRLGL-PLYEWWSEALHGVSFIGRRTNSPP 77
           R ++L++RM L EK  Q+G      L  G   L    + E  S+ +  ++ IG   + PP
Sbjct: 6   RVEELLDRMALTEKAAQLGSVNADKLLDGDGNLDENAVEEHLSDGIGHLTRIGGEGSLPP 65

Query: 78  G----------THFDSEV------------------PGATSFPTVILTTASFNESLWKKI 109
                      T+   E                   P  T+FP  I   ++++ SL ++I
Sbjct: 66  TEAARVTNELQTYLREETRLGIPAIPHEECLSGYMGPEGTTFPQSIGLASTWDPSLVEEI 125

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
             T+ T+  A   +G A     SP ++V RD RWGRV ET GEDPY+V   A  YV GLQ
Sbjct: 126 TGTIRTQLEA---IGTA--HALSPVLDVARDLRWGRVEETFGEDPYLVASMACGYVDGLQ 180

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                    D D     ISA  KH+A + +    G +R   +  +  ++++ET + PFE 
Sbjct: 181 G--------DGDG----ISATLKHFAGHSVGEG-GKNRSSVN--LGRRELRETHLFPFEA 225

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            V   D  SVM +Y+ ++GIP  +D  LL   +RG+W F G +VSD  S++ +   H   
Sbjct: 226 AVRTSDAESVMNAYHDIDGIPCASDEWLLTDVLRGEWGFDGTVVSDYYSVEFLRSEHGVA 285

Query: 290 NDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
            D +E+A A  L+AG+D++    D Y +  +  V+ G ++E  +D ++R +    +R G 
Sbjct: 286 AD-EEEAGAMALEAGIDVELPYTDCYGDSLVKGVESGHLSEETVDHAVRRVLRAKVRKGL 344

Query: 348 FDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
           FD      +            EL   AAR+ + LLKN+   LPL      ++A++GP A+
Sbjct: 345 FDDPTVDPDAASEPFGTDAADELTTRAARESMTLLKNEGDLLPLAGSETDSVAVIGPKAD 404

Query: 408 ATKAMIGNY--------EGTPCRYTSPMDGFYA----YSKVINYAPGCA----------- 444
             + ++G+Y        E      T+P+D   +    +   +++  GC            
Sbjct: 405 DGQELMGDYAYAAHYPEEEVELDATTPLDAIRSRGDEFGFEVSHEQGCTMTGPGTGGFDA 464

Query: 445 ------------DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGF 492
                         V   +++  + +D  +   +TV  +G       EG D VDL LPG 
Sbjct: 465 AASAAAEADVAVAFVGARSAVDLSDMDKEQENRSTVPTSG-------EGCDVVDLDLPGV 517

Query: 493 QTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFG 552
           Q EL+ +V D    P+ +V++S     I        + +++    PGE GG  IA  +FG
Sbjct: 518 QQELVERV-DQTGTPLVVVVVSGKPHSIEAISE--AVPAVVQAWLPGERGGEGIAATLFG 574

Query: 553 KYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFK 611
           ++NPGG LP++       + + Y+  P     N     + + D   +YPFG+GLSYT F+
Sbjct: 575 EHNPGGHLPVSIPRTVGQIPVHYSRKP-----NSANEDHVYVDSDPLYPFGHGLSYTDFE 629

Query: 612 YKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGK 671
           Y         D+ L  D+             PP   +            T  + VEN G+
Sbjct: 630 YG--------DLALSDDE------------IPPAGTI------------TAAVTVENAGE 657

Query: 672 MDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS 730
             G +VV +Y +    +    +++++G+ERV + AG + +V F ++A + L   D   + 
Sbjct: 658 RAGHDVVQLYVRAENPSQARPVQELVGFERVSLDAGDARRVSFEIDASQ-LAYHDRNFDL 716

Query: 731 LLASGAHTILVG 742
            +  G + + VG
Sbjct: 717 TVEEGPYQLRVG 728


>gi|293371041|ref|ZP_06617583.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292633971|gb|EFF52518.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 791

 Score =  264 bits (674), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 213/726 (29%), Positives = 337/726 (46%), Gaps = 120/726 (16%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+P++    EA HG   IG                  T FPT I   A+++  L K++
Sbjct: 138 RLGIPMF-LAEEAPHGHMAIG-----------------ITVFPTGIGMAATWSPELVKEV 179

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           GQ ++ E R+       G   + P +++ RDPRW RV ET GEDP + G      V GL 
Sbjct: 180 GQVIAKEIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGTLGAAMVDGLI 234

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
           +         + SR     A  KH+ AY +     N      + V  +++ E F+ PF+ 
Sbjct: 235 N--------GNISRKNSTIATLKHFLAYAVPEGGQNGN---QALVGMRELHENFLPPFKK 283

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            ++ G +S VM SYN ++GIP  A+  LLNQ +R +W F G++VSD  SI+ I ESH + 
Sbjct: 284 AIDAGALS-VMTSYNSIDGIPCTANSYLLNQLLRNEWKFRGFVVSDLYSIEGIYESH-YT 341

Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
             + EDA  + + AG+D+D G + YTN    AV++ +++EA ID  +  +  +   +G F
Sbjct: 342 ASSIEDAAIQAVSAGVDVDLGGEAYTNIYR-AVKEKRLSEAIIDEVVCRVLRLKFEMGLF 400

Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
           +       +    + N  HI  A   A+  + LLKN +  LPL + NI+ +A++GP+A+ 
Sbjct: 401 ENPYVDPQIAIERVRNANHIANARRMAQASVTLLKNRHDILPL-SKNIRKVAVIGPNADN 459

Query: 409 TKAMIGNYEGTPCR---YTSPMDGFYAYSKV--INYAPGCADIVCQNNSMIPAAIDAAKN 463
              M+G+Y   P +     + +DG  +   +  + Y  GCA I    N+ I  A++AA  
Sbjct: 460 CYNMLGDYTA-PQKDENIKTVLDGIISKLSLSRVEYVRGCA-IRDTTNNEIAKAVEAANR 517

Query: 464 ADATVIVAGLDLSVE-----------------------AEGKDRVDLLLPGFQTELINKV 500
           AD  + V G   + +                        EG DR  L L G Q EL+  +
Sbjct: 518 ADVVIAVVGGSSARDFKTTYKETGAAIADKSQISDMECGEGFDRATLSLLGKQLELLESL 577

Query: 501 ADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRL 560
               K P+ +V +    ++ N+A  +    ++L   YPG+EGG AIADV+FG YNP GRL
Sbjct: 578 KSTRK-PLIVVYIEGRPLNKNWAAEHAD--ALLTAYYPGQEGGDAIADVLFGDYNPAGRL 634

Query: 561 PITWYEANYVKIPYT--SMPLRPVNNFPG-RTYKFFDGPVVYPFGYGLSYTQFKYKVASS 617
           P++        +P +   +P+      P    Y       +Y FGYGLSY+ F+Y     
Sbjct: 635 PVS--------VPRSEGQIPVYYNKKTPKCHDYVEMSASPLYSFGYGLSYSTFEYS---- 682

Query: 618 PKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEV 677
                            N  V    P                F    +VEN GK DG EV
Sbjct: 683 -----------------NLKVTQQAP--------------LHFEISFDVENTGKYDGEEV 711

Query: 678 VMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGA 736
             +Y +    +    ++Q+  ++R F+  G+   + FT+   + L I++     ++  G+
Sbjct: 712 AQLYIRDEYASVVRALRQLKHFKRFFLKQGEKKTIVFTL-VEEDLSIINQKMERIVEPGS 770

Query: 737 HTILVG 742
             +++G
Sbjct: 771 FQLMIG 776


>gi|393787054|ref|ZP_10375186.1| hypothetical protein HMPREF1068_01466 [Bacteroides nordii
           CL02T12C05]
 gi|392658289|gb|EIY51919.1| hypothetical protein HMPREF1068_01466 [Bacteroides nordii
           CL02T12C05]
          Length = 958

 Score =  264 bits (674), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 227/808 (28%), Positives = 366/808 (45%), Gaps = 140/808 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
           Y D   P   R +DL+ +M L EK  QM  L YG  R+    LP  EW    W + +  +
Sbjct: 65  YEDPTAPIDARIEDLLSQMNLNEKTCQMVTL-YGYKRVLKDALPTPEWKQMLWKDGMGAI 123

Query: 67  S-----------------------------------FIGRRTNSPPGTHFDSEVPG---- 87
                                               FI       P    +  + G    
Sbjct: 124 DEHLNGFQQWGLPPSDNENVWPASRHAWALNEVQRFFIEETRLGIPVDFTNEGIRGVESY 183

Query: 88  -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
            AT+FPT +    ++N  L  ++G     EAR +      G T  ++P ++V RD RWGR
Sbjct: 184 KATNFPTQLGLGHTWNRKLIHQVGLITGREARML------GYTNVYAPILDVGRDQRWGR 237

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
             E  GE PY+V    I  V+G+Q                +++A  KH+ AY  +     
Sbjct: 238 YEEVYGESPYLVAELGIEMVKGMQ-------------HNYQVAATGKHFIAYSNNKGARE 284

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
                D +++ ++++   + PF+  + E  +  VM SYN  +G+P  +    L   +RG 
Sbjct: 285 GMARVDPQMSPREVEMIHVYPFKRVIQEAGLLGVMSSYNDYDGLPVQSSYYWLMTRLRGQ 344

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
             F GY+VSD D+++ +   H    D KE AV + ++AGL++ C     D Y       V
Sbjct: 345 MGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 403

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAA-EAARQGIV 380
           Q+G ++E  I+  +R +  V   +G FD   Q    G +     +  E+ A +A+R+ IV
Sbjct: 404 QEGGLSEEIINDRVRDILRVKFLVGLFDTPYQTDLKGADEEVEKEENEIVALQASRESIV 463

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY---SKVI 437
           LLKND  ALPL+  +I+ +A+ GP+A+ T   + +Y       T+ + G          +
Sbjct: 464 LLKNDKNALPLDVASIRKIAVCGPNADETAYALTHYGPLAVDVTTVLSGIRQKVDGKAEV 523

Query: 438 NYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
            Y  GC ++V  N                + I  A+  AK AD  V+V G       E K
Sbjct: 524 LYTKGC-ELVDANWPESEIIDYPLTNDEQNKIDKAVAQAKEADVAVVVLGGGQRTCGENK 582

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
            R  L LPG Q +L+  V    K PV LV+++   + +N+A  +  + +I+   YPG +G
Sbjct: 583 SRSSLDLPGRQLDLLKAVQATGK-PVVLVLINGRPLSVNWA--DKFVPAIIEAWYPGSKG 639

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV----- 597
           G A+ADV+FG YNPGG+L +T +  +  +IP+ + P +P +   G       G +     
Sbjct: 640 GTAVADVLFGDYNPGGKLTVT-FPKSVGQIPF-NFPCKPSSQIDGGKNPGPKGNMSRVNG 697

Query: 598 -VYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
            +YPFG+GLSYT F+Y  ++ SPK +                      P   V    V+C
Sbjct: 698 ALYPFGHGLSYTTFEYSDISISPKVIT---------------------PNQKV---QVRC 733

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
           K         + N GK  G EVV +Y +       T+ K + G+ER+ +  G++ +V FT
Sbjct: 734 K---------ITNTGKRAGDEVVQLYVRDILSSVTTYEKNLEGFERIHLQPGETKEVSFT 784

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
           ++  K+L++++   + ++  G  +I++G
Sbjct: 785 LDR-KALELLNAKNDWVVEPGDFSIMLG 811


>gi|313145353|ref|ZP_07807546.1| periplasmic beta-glucosidase [Bacteroides fragilis 3_1_12]
 gi|313134120|gb|EFR51480.1| periplasmic beta-glucosidase [Bacteroides fragilis 3_1_12]
          Length = 802

 Score =  264 bits (674), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 238/833 (28%), Positives = 362/833 (43%), Gaps = 161/833 (19%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
           Y +  +P  ER + L+ +MTL EKV QM      +  LG P+YE                
Sbjct: 37  YENPSVPVEERVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEEIRLTARLEKEI 90

Query: 58  ----------------WWSEALH---GVSFIGRRTNSPPG---THFDSEVP--------- 86
                           W    LH     S   R +N        H    +P         
Sbjct: 91  SEYHIGALWGFMRADPWTQRTLHTGLNPSLAARASNRLQAFVMEHSRLGIPLFLAEECPH 150

Query: 87  -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
                G T FPT I   +++N  L +++G+ ++TEA A           + P +++ RDP
Sbjct: 151 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIATEASA-----QGAHIGYGPVLDLARDP 205

Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
           RW RV ET GEDPY+ G      VRG Q         D+      + A  KH+A+Y    
Sbjct: 206 RWSRVEETYGEDPYLNGVMGAALVRGFQG--------DTLRGRKSVIATLKHFASY---G 254

Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
           W         + + E++++E    PF   V  G +S VM SYN ++G P      LL   
Sbjct: 255 WTEGGHNGGTAHLGERELEEAIFPPFREAVGAGALS-VMSSYNEIDGNPCTGSRYLLTDI 313

Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
           ++  W F G++VSD  +I  + E H       E AV + + AG+D D G + Y    + A
Sbjct: 314 LKDRWQFKGFVVSDLYAIGGLRE-HGVAGSDYEAAV-KAVNAGVDSDLGTNVYAEQLVAA 371

Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
           V++G +A   +D ++R +  +   +G FD            + +P+HI LA E ARQ IV
Sbjct: 372 VRKGDVAMETVDKAVRRILFLKFHMGLFDAPFVDDKRPAQLVASPEHIGLAREVARQSIV 431

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
           LLKN++  LPL   +I+TLA++GP+A+    M+G+Y     +G+       +    +   
Sbjct: 432 LLKNEDKLLPLKK-DIRTLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIRQKVSKDT 490

Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
            + YA GCA +   + +    AI+AA++AD  V+V G     D S E             
Sbjct: 491 RVLYAKGCA-VRDSSRTGFADAIEAARSADVVVMVVGGSSARDFSSEYEETGAAKVSANR 549

Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
                  EG DR  L L G Q EL+ +V    K P+ LV++    + +       +  +I
Sbjct: 550 VSDMESGEGYDRATLHLMGRQLELLEEVRKLGK-PMVLVLIKGRPLLMEGVIQ--EADAI 606

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
           L   YPG +GG A+ADV+FG YNP GRL ++      V      +P+       G   ++
Sbjct: 607 LDAWYPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTKRKGNRSRY 660

Query: 593 FD--GPVVYPFGYGLSYTQFKY---KVASSPKSVDIKLDKDQQCR-DINYTVGTNKPPCA 646
            +  G   YPFGYGLSYT F Y   KV  S +S          CR D++ T         
Sbjct: 661 IEEAGTPRYPFGYGLSYTTFSYTGMKVRVSEES--------NHCRVDVSVT--------- 703

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAA 705
                              V N G +DG EVV +Y +   G   T  +Q+  + RV + A
Sbjct: 704 -------------------VRNQGTVDGDEVVQLYLRDEVGSFTTPDRQLRAFSRVRLKA 744

Query: 706 GQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLNH 758
           G++ ++ FT++  KSL +        +  G  T++ G     ++   +  +N 
Sbjct: 745 GETREITFTLDK-KSLALYMRDGEWAVEPGRFTVMAGGSSEDIACQQEFEINR 796


>gi|29347190|ref|NP_810693.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
 gi|29339089|gb|AAO76887.1| periplasmic beta-glucosidase precursor [Bacteroides
           thetaiotaomicron VPI-5482]
          Length = 950

 Score =  264 bits (674), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 226/762 (29%), Positives = 359/762 (47%), Gaps = 119/762 (15%)

Query: 8   KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSEAL 63
           K++D  Y DA LP  ER + L+  MT PE   ++    +G+P  G+P LY       EA+
Sbjct: 160 KVTDRRYMDASLPVEERVESLLAVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVEAV 216

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HG S+         G+       GAT FP  +   A++N  L +++   +  E  A  N 
Sbjct: 217 HGFSY---------GS-------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NT 259

Query: 124 GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
             A    WSP ++V +D RWGR  ET GEDP +V +    +++G Q            SR
Sbjct: 260 KQA----WSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SR 303

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            L  +   KH+  +         R   D  ++E++M+E  ++PF   +   D  S+M +Y
Sbjct: 304 GLFTTP--KHFGGHGAPL---GGRDSHDIGLSEREMREIHLVPFRHAIRNYDCQSLMMAY 358

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           +   G+P     +LL Q +R +W F+G+IVSDC +I  +     +    K +A  + L A
Sbjct: 359 SDYMGVPVAKSKELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAA 418

Query: 304 GLDLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNI 362
           G+  +CGD Y N   + A + G+I   D+D   R +   + R   F+ +P  K L    I
Sbjct: 419 GIATNCGDTYNNKEVIQAAKDGRINMEDLDNVCRTMLGTMFRNELFEKNP-CKPLDWKKI 477

Query: 363 C----NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-- 416
                +  H E+A +AAR+ IV+L+N +  LPL+   ++T+A++GP A+  +   G+Y  
Sbjct: 478 YPGWNSDSHKEMARQAARESIVMLENKDNLLPLSK-TLRTIAVLGPGADDLQP--GDYTP 534

Query: 417 EGTPCRYTSPMDGFYA----YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG 472
           +  P +  S + G        +KV+ Y  GC D    + + IP A+ AA  +D  ++V G
Sbjct: 535 KLLPGQLKSVLTGIKGAVGKQTKVL-YEQGC-DFTNPDETNIPKAVKAASQSDVVIMVLG 592

Query: 473 LDLSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFA 523
              + EA         E  D   L+LPG Q EL+  V    K PV L++ +    DI   
Sbjct: 593 DCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI--L 649

Query: 524 KNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVN 583
           K +   K+IL    PG+EGG A+ADV+FG YNP GRLP+T+            +PL    
Sbjct: 650 KASEMCKAILVNWLPGQEGGPAMADVLFGDYNPAGRLPMTFPRH------VGQLPLYYNF 703

Query: 584 NFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTN 641
              GR Y++ D     +Y FG+GLSYT F+Y         ++K+   Q+  + N  V   
Sbjct: 704 KTSGRRYEYVDMEYYPLYRFGFGLSYTSFEYS--------NLKI---QEKANGNVEV--- 749

Query: 642 KPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYER 700
                                Q  V+N+G   G EV  +Y +       T + ++  + R
Sbjct: 750 ---------------------QATVKNVGSRAGDEVAQLYVTDMYASVKTRVMELKDFAR 788

Query: 701 VFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           + +  G+S  V F M     + ++++  + ++  G   I+VG
Sbjct: 789 IHLQPGESKTVSFEMTPY-DISLLNDRMDRVVEKGEFKIMVG 829


>gi|383125188|ref|ZP_09945842.1| hypothetical protein BSIG_4348 [Bacteroides sp. 1_1_6]
 gi|382983435|gb|EES66611.2| hypothetical protein BSIG_4348 [Bacteroides sp. 1_1_6]
          Length = 954

 Score =  264 bits (674), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 226/762 (29%), Positives = 359/762 (47%), Gaps = 119/762 (15%)

Query: 8   KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSEAL 63
           K++D  Y DA LP  ER + L+  MT PE   ++    +G+P  G+P LY       EA+
Sbjct: 164 KVTDRRYMDASLPVEERVESLLAVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVEAV 220

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HG S+         G+       GAT FP  +   A++N  L +++   +  E  A  N 
Sbjct: 221 HGFSY---------GS-------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NT 263

Query: 124 GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
             A    WSP ++V +D RWGR  ET GEDP +V +    +++G Q            SR
Sbjct: 264 KQA----WSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SR 307

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            L  +   KH+  +         R   D  ++E++M+E  ++PF   +   D  S+M +Y
Sbjct: 308 GLFTTP--KHFGGHGAPL---GGRDSHDIGLSEREMREIHLVPFRHAIRNYDCQSLMMAY 362

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           +   G+P     +LL Q +R +W F+G+IVSDC +I  +     +    K +A  + L A
Sbjct: 363 SDYMGVPVAKSKELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAA 422

Query: 304 GLDLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNI 362
           G+  +CGD Y N   + A + G+I   D+D   R +   + R   F+ +P  K L    I
Sbjct: 423 GIATNCGDTYNNKEVIQAAKDGRINMEDLDNVCRTMLGTMFRNELFEKNP-CKPLDWKKI 481

Query: 363 C----NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-- 416
                +  H E+A +AAR+ IV+L+N +  LPL+   ++T+A++GP A+  +   G+Y  
Sbjct: 482 YPGWNSDSHKEMARQAARESIVMLENKDNLLPLSK-TLRTIAVLGPGADDLQP--GDYTP 538

Query: 417 EGTPCRYTSPMDGFYA----YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG 472
           +  P +  S + G        +KV+ Y  GC D    + + IP A+ AA  +D  ++V G
Sbjct: 539 KLLPGQLKSVLTGIKGAVGKQTKVL-YEQGC-DFTNPDETNIPKAVKAASQSDVVIMVLG 596

Query: 473 LDLSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFA 523
              + EA         E  D   L+LPG Q EL+  V    K PV L++ +    DI   
Sbjct: 597 DCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI--L 653

Query: 524 KNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVN 583
           K +   K+IL    PG+EGG A+ADV+FG YNP GRLP+T+            +PL    
Sbjct: 654 KASEMCKAILVNWLPGQEGGPAMADVLFGDYNPAGRLPMTFPRH------VGQLPLYYNF 707

Query: 584 NFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTN 641
              GR Y++ D     +Y FG+GLSYT F+Y         ++K+   Q+  + N  V   
Sbjct: 708 KTSGRRYEYVDMEYYPLYRFGFGLSYTSFEYS--------NLKI---QEKANGNVEV--- 753

Query: 642 KPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYER 700
                                Q  V+N+G   G EV  +Y +       T + ++  + R
Sbjct: 754 ---------------------QATVKNVGSRAGDEVAQLYVTDMYASVKTRVMELKDFAR 792

Query: 701 VFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           + +  G+S  V F M     + ++++  + ++  G   I+VG
Sbjct: 793 IHLQPGESKTVSFEMTPY-DISLLNDRMDRVVEKGEFKIMVG 833


>gi|154493932|ref|ZP_02033252.1| hypothetical protein PARMER_03276 [Parabacteroides merdae ATCC
           43184]
 gi|154086192|gb|EDN85237.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Parabacteroides merdae ATCC 43184]
          Length = 955

 Score =  264 bits (674), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 226/808 (27%), Positives = 362/808 (44%), Gaps = 140/808 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW------------ 58
           Y D  +P   R +DL+ +M + EK  QM  L YG  R+    LP  +W            
Sbjct: 61  YEDPTVPIDARVEDLLSQMNVEEKTCQMVTL-YGYKRVLKDDLPTSDWKKQLWKDGIGAI 119

Query: 59  --------------------WSEALHGVS------FIGRRTNSPPGTHFDSE-VPG---- 87
                               W  + H  +      F    T     T F +E + G    
Sbjct: 120 DEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 179

Query: 88  -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
            AT+FPT +    ++N +L  K+G     E R +      G T  ++P ++V RD RWGR
Sbjct: 180 IATNFPTQLGLGHTWNRNLVHKVGYITGREGRLL------GYTNVYAPILDVGRDQRWGR 233

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
             E  GE PY+V    I   +G+Q          +D    +++A  KHY AY  +     
Sbjct: 234 YEEVYGESPYLVAELGIEMAKGMQ----------TDH---QVAATSKHYIAYSNNKGGRE 280

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
                D +++ ++++   + P++  + E  +  VM SYN  +G P  +    L   +RG+
Sbjct: 281 GMARVDPQMSPREVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGE 340

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
           + F GY+VSD D+++ +   H    D KE  +  VL AGL++ C     D Y       +
Sbjct: 341 FGFRGYVVSDSDAVEYLFSKHGTAADMKESVLQSVL-AGLNIRCTFRSPDSYVLPLRELI 399

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELAAEAARQGI 379
            +G I  + ID  +R +  V   +G FD   Q   K   K   C    + +A +A+++ +
Sbjct: 400 AEGAIPMSTIDDRVRDILRVKFLVGLFDHPYQIDLKETDKEVNCAENQL-VALQASKESL 458

Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV--- 436
           VLLKN +  LPL+   I  +A+ GP+A+     + +Y       T+ ++G     K    
Sbjct: 459 VLLKNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTD 518

Query: 437 INYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
           + +  GC D+V                +  S I  A++ AK +D TV+V G       E 
Sbjct: 519 VLFTKGC-DLVDANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSNRTCGEN 577

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
           K R  L LPG Q +L+  V    K PV LV+++   + IN+A  +  + +IL   YPG +
Sbjct: 578 KSRSSLDLPGRQLDLLQAVVATGK-PVVLVLINGRPLSINWA--DKYVPAILEAWYPGSQ 634

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV---- 597
           GG AIAD +FG YNPGG+L +T +     +IP+ + P +P     G   K  DG +    
Sbjct: 635 GGTAIADALFGDYNPGGKLTVT-FPKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRVN 692

Query: 598 --VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
             +YPFGYGLSYT F+Y   S                 I   + T   P        V+C
Sbjct: 693 GPLYPFGYGLSYTTFEYSDIS-----------------IQPAIVTQVQPVT------VRC 729

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
           K         V N GK  G EVV +Y +       T+ K ++G++R+ +  G++ ++ FT
Sbjct: 730 K---------VTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFT 780

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
           +   + L+++++  + ++  G   ++VG
Sbjct: 781 IEP-RDLQLLNSDNHWVVEPGDFKVMVG 807


>gi|423346097|ref|ZP_17323785.1| hypothetical protein HMPREF1060_01457 [Parabacteroides merdae
           CL03T12C32]
 gi|409220895|gb|EKN13848.1| hypothetical protein HMPREF1060_01457 [Parabacteroides merdae
           CL03T12C32]
          Length = 955

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 225/808 (27%), Positives = 364/808 (45%), Gaps = 140/808 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW------------ 58
           Y D  +P   R +DL+ +M + EK  QM  L YG  R+    LP  +W            
Sbjct: 61  YEDPTVPIDARVEDLLSQMNVEEKTCQMVTL-YGYKRVLKDDLPTSDWKKQLWKDGIGAI 119

Query: 59  --------------------WSEALHGVS------FIGRRTNSPPGTHFDSE-VPG---- 87
                               W  + H  +      F    T     T F +E + G    
Sbjct: 120 DEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 179

Query: 88  -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
            AT+FPT +    ++N +L  K+G     E R +      G T  ++P ++V RD RWGR
Sbjct: 180 IATNFPTQLGLGHTWNRNLVHKVGYITGREGRLL------GYTNVYAPILDVGRDQRWGR 233

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
             E  GE PY+V    I   +G+Q          +D    +++A  KHY AY  +     
Sbjct: 234 YEEVYGESPYLVAELGIEMAKGMQ----------TDH---QVAATSKHYIAYSNNKGGRE 280

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
                D +++ ++++   + P++  + E  +  VM SYN  +G P  +    L   +RG+
Sbjct: 281 GMARVDPQMSPREVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGE 340

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
           + F GY+VSD D+++ +   H    D KE  +  VL AGL++ C     D Y       +
Sbjct: 341 FGFRGYVVSDSDAVEYLFSKHGTAADMKESVLQSVL-AGLNIRCTFRSPDSYVLPLRELI 399

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELAAEAARQGI 379
            +G I  + ID  +R +  V   +G FD   Q   K   K   C  ++ ++A +A+++ +
Sbjct: 400 AEGAIPMSTIDDRVRDILRVKFLVGLFDHPYQIDLKETDKEVNC-AENQQVALQASKESL 458

Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV--- 436
           VLLKN +  LPL+   I  +A+ GP+A+     + +Y       T+ ++G     K    
Sbjct: 459 VLLKNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTN 518

Query: 437 INYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
           + +  GC D+V                +  S I  A++ AK +D TV+V G       E 
Sbjct: 519 VLFTKGC-DLVDANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSDRTCGEN 577

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
           K R  L LPG Q +L+  V    K PV L++++   + IN+A  +  + +IL   YPG +
Sbjct: 578 KSRSSLDLPGRQLDLLQAVVATGK-PVVLILINGRPLSINWA--DKYVPAILEAWYPGSQ 634

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV---- 597
           GG AIAD +FG YNPGG+L +T +     +IP+ + P +P     G   K  DG +    
Sbjct: 635 GGTAIADALFGDYNPGGKLTVT-FPKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRVN 692

Query: 598 --VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
             +YPFGYGLSYT F+Y   S                 I   + T   P        V+C
Sbjct: 693 GPLYPFGYGLSYTTFEYSDIS-----------------IQPAIVTQVQPVT------VRC 729

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
           K         V N GK  G EVV +Y +       T+ K ++G++R+ +  G++ ++ FT
Sbjct: 730 K---------VTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFT 780

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
           +   + L+++++  + ++  G   ++VG
Sbjct: 781 IEP-RDLQLLNSDNHWVVEPGDFKVMVG 807


>gi|255689965|ref|ZP_05413640.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
 gi|260624572|gb|EEX47443.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 688

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 203/724 (28%), Positives = 340/724 (46%), Gaps = 100/724 (13%)

Query: 33  TLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFP 92
           T PE    M   A    RLG+P+  +  +A+HG                       T +P
Sbjct: 43  TNPELRNNMQKKAMEESRLGIPII-FGYDAIHGFR---------------------TVYP 80

Query: 93  TVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGE 152
             +    S+N  L ++     + EAR    +     TF SP I+V RDPRWGRV E  GE
Sbjct: 81  ISLAQACSWNPDLVEQACAVSAQEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGE 135

Query: 153 DPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDS 212
           DPY  G +    VRG        Y  D+ S   +++AC KHY  Y      G D  +  +
Sbjct: 136 DPYANGVFGAASVRG--------YQGDNMSAENRVAACLKHYVGYGASE-AGRDYVY--T 184

Query: 213 RVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYI 272
            +++Q + +T++LP+EM V  G  +++M S+N ++G+P  A+P  + + ++  W   G+I
Sbjct: 185 EISQQTLWDTYLLPYEMGVKAG-AATLMSSFNDISGVPGSANPYTMTEILKNRWRHDGFI 243

Query: 273 VSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADI 331
           VSD  +I+ +   ++ L  TK++A      AGL++D   + Y       V++GK++ A +
Sbjct: 244 VSDWGAIEQL--KNQGLAATKKEAARYAFTAGLEMDMMSHAYDRHLQELVEEGKVSMAQV 301

Query: 332 DTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPL 391
           D ++R + ++  RLG F+         K     P+ +++AA  A + +VLLKN+N  LPL
Sbjct: 302 DEAVRRVLLLKFRLGLFERPYTPATTEKERFFRPKSMDIAARLAAESMVLLKNENNVLPL 361

Query: 392 NTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM--DGF---YAYSKVINYAPGCADI 446
              + K +A++GP A     ++G++ G        M  DG    +A    + YA GC + 
Sbjct: 362 T--DKKKIAVIGPMAKNGWDLLGSWRGHGKDTDVAMLYDGLAAEFAGKAELRYALGC-NT 418

Query: 447 VCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG 506
              N      A++AA+ +D  V+  G  ++   E   R  + LP  Q EL  ++  A K 
Sbjct: 419 QGDNREGFAEALEAARWSDVVVLCLGEMMTWSGENASRSSIALPQMQEELAKELKKAGK- 477

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
           PV LV+++   +++N  +  P   +IL +  PG  G   +A ++ G+ NP G+L +T+  
Sbjct: 478 PVVLVLVNGRPLELN--RLEPVSDAILEIWQPGVNGALPMAGILSGRINPSGKLAMTF-- 533

Query: 567 ANYVKIPYTS--MPLRPVNNFPGRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKS 620
                 PY++  +P+       GR ++ F   +    +YPFG+GLSYT+FKY        
Sbjct: 534 ------PYSTGQIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTEFKY-------- 579

Query: 621 VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
                             GT  P    V       +  K + ++ V N+G  DG+E V  
Sbjct: 580 ------------------GTVTPSATKV------KRGEKLSAEVTVTNIGARDGAETVHW 615

Query: 681 YSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
           +   P  + T  +K++  +E+  I AG++    F ++  +    V+      L +G + I
Sbjct: 616 FISDPYCSITRPVKELKHFEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLETGEYNI 675

Query: 740 LVGE 743
            V E
Sbjct: 676 HVLE 679


>gi|423300893|ref|ZP_17278917.1| hypothetical protein HMPREF1057_02058 [Bacteroides finegoldii
           CL09T03C10]
 gi|408472228|gb|EKJ90756.1| hypothetical protein HMPREF1057_02058 [Bacteroides finegoldii
           CL09T03C10]
          Length = 798

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 231/800 (28%), Positives = 363/800 (45%), Gaps = 141/800 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
           Y D   P   R  DL+ +MTL EK  QM  L YG  R+     P   W    W + +   
Sbjct: 54  YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTDGWSTEIWKDGIGNI 112

Query: 64  ----HGVSFIGRR-----TNSPPGTH-----------------FDSE-VPG-----ATSF 91
               +G+   G        NS    H                 F +E + G     AT F
Sbjct: 113 DEQANGLGKFGSEISYPYANSAKNRHTVQRWFVEKTRLGIPVDFTNEGIRGLCHDRATMF 172

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
           P      A++N+ L ++I +  + EA+A+      G T  ++P +++ +DPRWGRV+E+ 
Sbjct: 173 PAQCGQGATWNKKLIREIAKVTADEAKAL------GYTNIYAPILDIAQDPRWGRVVESY 226

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+VG      + GLQ+ EG             I A  KH+A Y +     +     
Sbjct: 227 GEDPYLVGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 272

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
           D  V  ++M+  ++ PF   + E     VM SYN  +G P       L + +R  W F G
Sbjct: 273 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 332

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
           Y+VSD ++++ +   H+ +  T+E+  A+V+ AGL++      TNFT           A+
Sbjct: 333 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAI 386

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
            +GK++   +D  +  +  V   +G FD   P      +  + N  H +++  AA + IV
Sbjct: 387 SEGKVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPEVVVHNAAHQDVSMRAALESIV 446

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
           LLKN+   LPL+  +   +A++GP+A   K +   Y        +   G   Y  +  + 
Sbjct: 447 LLKNEKEMLPLSK-SFNKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNAEVR 505

Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
           YA GC DI+                Q  +MI  A++ AK +D  ++V G +     E   
Sbjct: 506 YAKGC-DIIDKYFPESELYNVPLDTQEKAMINEAVELAKASDVAILVLGGNEKTVREEFS 564

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R +L L G Q +L+  V    K PV LV++   A  IN+A  N  + +I+   +PGE  G
Sbjct: 565 RTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMG 621

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
            AIA V+FG YNPGGRL +T +  +  +IP+ + P +P ++  G   K     V+YPFGY
Sbjct: 622 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKG---KVRVAGVLYPFGY 676

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           GLSYT F Y         D+K+               +KP   A             T  
Sbjct: 677 GLSYTTFGYS--------DLKV---------------SKPVIGA---------QENITLS 704

Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
             V+N GK  G EVV +Y +    + T   +V+ G+ER+ +  G+   + FT+   + L 
Sbjct: 705 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEERTISFTLTP-QDLG 763

Query: 723 IVDNAANSLLASGAHTILVG 742
           + D   +  +  G+ +++VG
Sbjct: 764 LWDKNNHFTVEPGSFSVMVG 783


>gi|313204103|ref|YP_004042760.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312443419|gb|ADQ79775.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 1278

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 161/432 (37%), Positives = 237/432 (54%), Gaps = 41/432 (9%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y +    + ERA DLV RMTL EK  Q+G+    +PRLG+  Y+ W EALHGV  +GR  
Sbjct: 39  YLNTAYSFKERAADLVSRMTLEEKQSQLGNTMPPIPRLGVNKYDVWGEALHGV--VGRNN 96

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
           NS  G         ATSFP  +   ++++ +L K+    V+ EAR   +     LT+WSP
Sbjct: 97  NS--GMI-------ATSFPNSVAVGSTWDPALIKRETSVVADEARGFNHDLIFTLTYWSP 147

Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
            I   RDPRWGR  ET GEDP++V +    +V+GL            D   LK   C KH
Sbjct: 148 VIEPARDPRWGRTAETFGEDPFLVSQIGSGFVQGLMG---------DDPTYLKTVPCGKH 198

Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           Y A   +N E N R +  + + ++DM+E ++ P+   + +  + S+M +Y+ VNG+P  A
Sbjct: 199 YFA---NNSEFN-RHNGSANMDDRDMREFYLTPYRTLIQKDKLPSIMTAYSAVNGVPMSA 254

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
              L++   +  +   GY+  DCD++  +V SH++   +K +A A  LK G+D DCG  Y
Sbjct: 255 SKFLVDTIAKRTYGLDGYVTGDCDAVADVVNSHRYAK-SKAEAAAMGLKTGVDSDCGGIY 313

Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQHIE 369
               + A++QG I+EAD+D +L  +Y + MRLG FD  PQ    Y  +  + I +P H +
Sbjct: 314 QTSALEALKQGLISEADMDKALVNIYTIRMRLGEFD--PQNIVPYAGIKPSIINDPSHND 371

Query: 370 LAAEAARQGIVLLKND------NGALPLNTGNIKTLALVGPHANATKAMIGNYEGT--PC 421
           LA E A +  VLLKN+        ALPLN G IK +A++GP A+  K  +G+Y G   P 
Sbjct: 372 LALEIATKSPVLLKNNLVGKSGKKALPLNAGTIKKIAVLGPQAD--KVELGDYSGEADPK 429

Query: 422 RYTSPMDGFYAY 433
              +P++G   Y
Sbjct: 430 YKITPLEGIKNY 441



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 95/284 (33%), Positives = 139/284 (48%), Gaps = 43/284 (15%)

Query: 446 IVCQNNSMIPAA----IDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVA 501
           ++    S +PA     +D A +AD  V+  G D +   E  DR  + LPG Q ELI  +A
Sbjct: 593 VLVYRESEVPATDKETLDMAASADVAVVFVGTDQTTGREESDRFAITLPGNQNELIKSIA 652

Query: 502 DAAKGPVTLVIMSA-GAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRL 560
             A  P T+V++   G V++   KNNP +  I++ GY G+  G A+A V+FG  NPGG+ 
Sbjct: 653 --AVNPNTIVVIQGMGMVEVEQFKNNPNVAGIIFTGYNGQAQGTAMAKVLFGDVNPGGKT 710

Query: 561 PITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS 620
            +TWY++       T   LR      GRTY +F+  V Y FGYGLSYT F Y        
Sbjct: 711 SLTWYKSINDLPALTDYTLRGGAGKNGRTYMYFNKDVSYEFGYGLSYTTFAYS------- 763

Query: 621 VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
                         N+ +        ++  +D      K T  ++V+N G +DG EVV +
Sbjct: 764 --------------NFNISK-----TSITPND------KVTVTVDVKNTGTVDGDEVVQI 798

Query: 681 YSKPPGIAGTH---IKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
           Y K P    +    IK++ G++RV I AGQ+  V   ++ C  L
Sbjct: 799 YVKTPDSPASLERPIKRLKGFKRVAIPAGQTKTVSIEVD-CADL 841


>gi|423722678|ref|ZP_17696831.1| hypothetical protein HMPREF1078_00891 [Parabacteroides merdae
           CL09T00C40]
 gi|409241951|gb|EKN34716.1| hypothetical protein HMPREF1078_00891 [Parabacteroides merdae
           CL09T00C40]
          Length = 955

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 225/808 (27%), Positives = 362/808 (44%), Gaps = 140/808 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW------------ 58
           Y D  +P   R +DL+ +M + EK  QM  L YG  R+    LP  +W            
Sbjct: 61  YEDPTVPIDARVEDLLSQMNVEEKTCQMVTL-YGYKRVLKDDLPTSDWKKQLWKDGIGAI 119

Query: 59  --------------------WSEALHGVS------FIGRRTNSPPGTHFDSE-VPG---- 87
                               W  + H  +      F    T     T F +E + G    
Sbjct: 120 DEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 179

Query: 88  -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
            AT+FPT +    ++N +L  K+G     E R +      G T  ++P ++V RD RWGR
Sbjct: 180 IATNFPTQLGLGHTWNRNLVHKVGYITGREGRLL------GYTNVYAPILDVGRDQRWGR 233

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
             E  GE PY+V    +   +G+Q     +Y         +++A  KHY AY  +     
Sbjct: 234 YEEVYGESPYLVAELGVEMAKGMQ----TDY---------QVAATSKHYIAYSNNKGGRE 280

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
                D +++ ++++   + P++  + E  +  VM SYN  +G P  +    L   +RG+
Sbjct: 281 GMARVDPQMSPREVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGE 340

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
           + F GY+VSD D+++ +   H    D KE  +  VL AGL++ C     D Y       +
Sbjct: 341 FGFRGYVVSDSDAVEYLFSKHGTAADMKESVLQSVL-AGLNIRCTFRSPDSYVLPLRELI 399

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELAAEAARQGI 379
            +G I  + ID  +R +  V   +G FD   Q   K   K   C    + +A +A+++ +
Sbjct: 400 AEGAIPMSTIDDRVRDILRVKFLVGLFDHPYQIDLKETDKEVNCAENQL-VALQASKESL 458

Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV--- 436
           VLLKN +  LPL+   I  +A+ GP+A+     + +Y       T+ ++G     K    
Sbjct: 459 VLLKNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTD 518

Query: 437 INYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
           + +  GC D+V                +  S I  A++ AK +D TV+V G       E 
Sbjct: 519 VLFTKGC-DLVDANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSNRTCGEN 577

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
           K R  L LPG Q +L+  V    K PV LV+++   + IN+A  +  + +IL   YPG +
Sbjct: 578 KSRSSLDLPGRQLDLLQAVVATGK-PVVLVLINGRPLSINWA--DKYVPAILEAWYPGSQ 634

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV---- 597
           GG AIAD +FG YNPGG+L +T +     +IP+ + P +P     G   K  DG +    
Sbjct: 635 GGTAIADALFGDYNPGGKLTVT-FPKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRVN 692

Query: 598 --VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
             +YPFGYGLSYT F+Y   S                 I   + T   P        V+C
Sbjct: 693 GPLYPFGYGLSYTTFEYSDIS-----------------IQPAIVTQVQPVT------VRC 729

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
           K         V N GK  G EVV +Y +       T+ K ++G++R+ +  G++ ++ FT
Sbjct: 730 K---------VTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFT 780

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
           +   + L+++++  + ++  G   ++VG
Sbjct: 781 IEP-RDLQLLNSDNHWVVEPGDFKVMVG 807


>gi|224536377|ref|ZP_03676916.1| hypothetical protein BACCELL_01251 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224522015|gb|EEF91120.1| hypothetical protein BACCELL_01251 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 954

 Score =  263 bits (672), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 227/760 (29%), Positives = 357/760 (46%), Gaps = 119/760 (15%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSEALHG 65
           +   Y D  LP  ER + L+  MT PE   ++    +G+P  G+P LY       EA+HG
Sbjct: 166 TSLRYMDPTLPVEERVESLLSVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVEAVHG 222

Query: 66  VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
            S+         G+       GAT FP  +   A++N+ L + +   V  E      L  
Sbjct: 223 FSY---------GS-------GATIFPQALAMGATWNKKLTEDVAMAVGDE-----TLAA 261

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
             +  WSP ++V +D RWGR  ET GEDP +V +    +++G Q            S+ L
Sbjct: 262 GTMQAWSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SKGL 309

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
             +   KH+  +         R   D  ++E++M+E  ++PF   +   D  SVM +Y+ 
Sbjct: 310 FTTP--KHFGGHGAPL---GGRDSHDIGLSEREMREVHLVPFRHVIRNYDCQSVMMAYSD 364

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
             G+P     +LL+  +R +W F G+IVSDC +I  +     +    K +A  + L AG+
Sbjct: 365 YLGVPVAKSRELLHSILREEWGFDGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGI 424

Query: 306 DLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC- 363
             +CGD Y +   + A + G+I   ++D   R +  ++ R   F+ +P  K L  N I  
Sbjct: 425 ATNCGDTYNDKEVIQAAKDGRINMENLDEVCRTMLRMMFRNELFEKTPN-KPLDWNKIYP 483

Query: 364 ---NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY--EG 418
              +  H E+A +AAR+ IV+L+N +  LPL   +++T+A+VGP A+  +   G+Y  + 
Sbjct: 484 GWNSDSHKEMARQAARESIVMLENKDNILPL-AKDMRTIAVVGPGADDLQP--GDYTPKL 540

Query: 419 TPCRYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
            P +  S + G        +KV+ Y  GC D    N + IP A+ AA  +D  V+V G  
Sbjct: 541 LPGQLKSVLTGIKQAVGKQTKVV-YEQGC-DFTSSNGTNIPKAVKAASQSDVVVLVLGDC 598

Query: 475 LSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
            + E+         E  D   L+LPG Q EL+  V   A G   ++I+ AG    N +K 
Sbjct: 599 STSESTTDVYKTSGENHDYATLILPGKQQELLEAV--CATGKPVILILQAGR-PYNLSKA 655

Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
           +   K+IL    PG+EGG A ADV+FG YNP GRLP+T+    +V      +PL      
Sbjct: 656 SELCKAILVNWLPGQEGGPATADVLFGDYNPAGRLPMTF--PRHV----GQLPLYYNFKT 709

Query: 586 PGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKP 643
            GR Y++ D     +Y FGYGLSYT F+Y              K Q+  + N  +     
Sbjct: 710 SGRRYEYSDMEFYPLYYFGYGLSYTSFEYSGL-----------KIQEKDNGNVAI----- 753

Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF 702
                              Q  V+N+G+  G EVV +Y +       T I ++  + RV 
Sbjct: 754 -------------------QATVKNVGQRAGDEVVQLYITDMYASVKTRITELKDFTRVH 794

Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +  G+S  V F +   + L ++++  + ++  G   ILVG
Sbjct: 795 LQPGESKIVSFELTPYE-LSLLNDRMDRVVEKGEFKILVG 833


>gi|427387362|ref|ZP_18883418.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
           12058]
 gi|425725523|gb|EKU88394.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
           12058]
          Length = 865

 Score =  263 bits (672), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 160/451 (35%), Positives = 242/451 (53%), Gaps = 43/451 (9%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
           PY +  L   ERA DL++RMTL EK+ QM + +  + RLG+P Y WW+EALHGV+  G+ 
Sbjct: 24  PYKNPDLTPSERAWDLLKRMTLEEKISQMKNGSPAIERLGIPAYNWWNEALHGVARAGK- 82

Query: 73  TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LG 124
                          AT FP  I   A+F+     +    VS EARA Y+         G
Sbjct: 83  ---------------ATVFPQAIGLAATFDNQAVHETFSIVSDEARAKYHDFQRKGERDG 127

Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
             GLTFW+PNIN+ RDPRWGR +ET GEDPY+     +  V+GLQ         D   + 
Sbjct: 128 YKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQG--------DGTGKY 179

Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            K  AC KHYA +    W   +R  FD++ ++++D+ ET++  F+  V EG V  VMC+Y
Sbjct: 180 DKTHACAKHYAVHSGPEW---NRHSFDAKNISQRDLWETYLPAFKTLVTEGKVKEVMCAY 236

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI-VESHKFLNDTKEDAVARVLK 302
           NR  G P C++ +LL + +R DW +   +VSDC +I      +H   + T   A A  + 
Sbjct: 237 NRYEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVV 296

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKN 360
           +G DL+CG  Y++    AV++G I+E  I+ S+  L     +LG FD +    +  +  +
Sbjct: 297 SGTDLECGGSYSSLNE-AVRKGLISEDKINESVFRLLRARFQLGMFDDNTLVSWSEIPYS 355

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            + + +H+  A E AR+ +VLL N N  LPL+  +++ +A++GP+AN +  +  NY G P
Sbjct: 356 VVESKEHVAKALEMARKSMVLLTNKNNILPLSK-SVRKVAVLGPNANDSVMLWANYNGFP 414

Query: 421 CRYTSPMDGFYAY--SKVINYAPGCADIVCQ 449
            +  + ++G         + Y  GC  +  Q
Sbjct: 415 TKSVTILEGIRNKLPEGAVYYEKGCDFVNTQ 445



 Score =  126 bits (317), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 97/328 (29%), Positives = 149/328 (45%), Gaps = 59/328 (17%)

Query: 432 AYSKVINY--APGCA----DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----- 480
           AY  V+ Y  A G A    DI  +         D A  AD  + V GL  S+E E     
Sbjct: 563 AYKVVLEYFQAGGEASLKFDIGIKKEINYKEMADKAAEADVIIFVGGLSSSLEGEEMPVD 622

Query: 481 -----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
                  DR ++ LP  Q E++  +    K PV  V+ S   + + +   N  + +I+  
Sbjct: 623 LPGFRKGDRTNIDLPQVQEEMLKALKKTGK-PVVFVLCSGSTLALPWEAEN--LDAIIEA 679

Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
            YPG++GG A+ADV+FG YNP GRLP+T+Y ++      + +P     +   RTY++F G
Sbjct: 680 WYPGQQGGTAVADVLFGDYNPAGRLPLTFYASS------SDLPDFEDYDMSNRTYRYFKG 733

Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
             ++PFG+GLSYT F Y  A + K +                 G                
Sbjct: 734 RPLFPFGHGLSYTTFDYGKAKADKKI--------------LRAGEG-------------- 765

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
                T  I ++N+GK+ G EVV VY + PG     IK +  + R+ + AGQ+  V F +
Sbjct: 766 ----LTLTIPLKNIGKLSGDEVVQVYLRNPGDKEGPIKTLRAFRRISLEAGQAEDVLFEL 821

Query: 716 NACKSLKIVDNAANSL-LASGAHTILVG 742
               + +  + A N + +  G + +L G
Sbjct: 822 -PVSTFEWFNPATNRMEVLPGKYELLYG 848


>gi|423217451|ref|ZP_17203947.1| hypothetical protein HMPREF1061_00720 [Bacteroides caccae
           CL03T12C61]
 gi|392628610|gb|EIY22636.1| hypothetical protein HMPREF1061_00720 [Bacteroides caccae
           CL03T12C61]
          Length = 946

 Score =  263 bits (672), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 232/807 (28%), Positives = 363/807 (44%), Gaps = 138/807 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
           Y D   P   R +DL+ +MTL EK  QM  L YG  R+    LP  EW ++    G+  I
Sbjct: 53  YEDPTAPIDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTSEWKNQLWKDGIGAI 111

Query: 70  GRRTNS------PPG-------------------------------THFDSE-VPG---- 87
               N       PP                                T F +E + G    
Sbjct: 112 DEHLNGFQQWGLPPSDNEYVWPASKHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 171

Query: 88  -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
            AT+FPT +    ++N  L  ++G     EAR +      G T  ++P ++V RD RWGR
Sbjct: 172 KATNFPTQLGLGHTWNRQLIHQVGLITGREARML------GYTNVYAPILDVGRDQRWGR 225

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
             E  GE PY+V    I  VRG+Q       H        +++A  KH+ AY  +     
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGMQ-------HNH------QVAATGKHFIAYSNNKGARE 272

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
                D +++ ++++     PF+  + E  +  VM SYN  +G P  +    L   +RG+
Sbjct: 273 GMARVDPQMSPREVEMLHAYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGE 332

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
             F GY+VSD D+++ +   H    D KE AV + ++AGL++ C     D Y       V
Sbjct: 333 MGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 391

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAA-EAARQGIV 380
           ++G ++E  I+  +R +  V   +G FD   Q    G +     +  E  A +A+R+ IV
Sbjct: 392 KEGGLSEEVINDRVRDILRVKFLVGLFDTPYQTDLKGADEEVEKKENEEVALQASRESIV 451

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK---VI 437
           LLKN+   LPL+   I+ +A+ GP+A+     + +Y       TS + G     K    +
Sbjct: 452 LLKNEKNVLPLDPSKIRKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKDKADV 511

Query: 438 NYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
            Y  GC D+V  N                  I  A+  AK AD  ++V G       E K
Sbjct: 512 LYTKGC-DLVDANWPESELIDYPLTDEEQKEIDKAVSQAKQADVAIVVLGGGQRTCGENK 570

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
            R  L LPG Q +L+  V    K PV LV+++   + IN+A  +  + +IL   YPG +G
Sbjct: 571 SRSSLDLPGRQLDLLKAVVATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKG 627

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV----- 597
           G A+AD++FG YNPGG+L +T +     +IP+ + P +P +   G      DG +     
Sbjct: 628 GIAVADILFGDYNPGGKLTVT-FPKTVGQIPF-NFPCKPSSQIDGGKNPGPDGNMSRANG 685

Query: 598 -VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
            +YPFGYGLSYT F+Y                    D+  +     P   A     V CK
Sbjct: 686 ALYPFGYGLSYTTFEYS-------------------DLKISPAIITPNQKAY----VTCK 722

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
                    V N GK  G EV+ +Y +       T+ K ++G+ERV +  G++ ++ F +
Sbjct: 723 ---------VTNTGKRSGDEVIQLYVRDVLSSVTTYEKNLVGFERVHLKPGETKEITFPI 773

Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
           +  K+L++++   + ++  G  T+++G
Sbjct: 774 DR-KALELLNADMHWVVEPGDFTLMLG 799


>gi|325105296|ref|YP_004274950.1| beta-glucosidase [Pedobacter saltans DSM 12145]
 gi|324974144|gb|ADY53128.1| Beta-glucosidase [Pedobacter saltans DSM 12145]
          Length = 884

 Score =  263 bits (672), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 165/457 (36%), Positives = 241/457 (52%), Gaps = 55/457 (12%)

Query: 7   VKLSDFPYC--DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALH 64
           +K  + PY   +  LP  ER ++L+  +TL EKV  M + +  V RLG+P Y+WW+EALH
Sbjct: 23  LKSQEIPYKFRNPDLPVNERIENLLGLLTLEEKVGLMMNSSKPVGRLGIPAYDWWNEALH 82

Query: 65  GVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN-- 122
           GV+  G+                AT FP  I   A++NES  K+    +S EARA YN  
Sbjct: 83  GVARSGK----------------ATVFPQAIGMAATWNESGHKQTFDLISDEARAKYNEA 126

Query: 123 LGNA------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEY 176
           + N       GL+FW+PNIN+ RDPRWGR  ET GEDPY+  R  +  VRGLQ       
Sbjct: 127 IRNGERGRYYGLSFWTPNINIFRDPRWGRGQETYGEDPYLTARLGVAAVRGLQ------- 179

Query: 177 HRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDV 236
               D +  K  AC KH+A +    W   +R  +D+  + +D+ ET++  F+  V E +V
Sbjct: 180 --GDDPKYFKTHACAKHFAVHSGPEW---NRHSYDATASGRDLWETYLPAFKALVKEANV 234

Query: 237 SSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV-----ESHKFLND 291
             VMC+YN   G P C   +LL   +R  W + G +VSDC +I         E+HK    
Sbjct: 235 QEVMCAYNAYEGQPCCGSDRLLTDILRNRWEYKGIVVSDCWAIDDFFRKGHHETHKDAAA 294

Query: 292 TKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS 351
              DAV        DL+CG  YTN  + AV+QG I++  ID SLR +      LG  D +
Sbjct: 295 AAADAVIH----STDLECGSAYTNL-LEAVRQGLISQQQIDISLRRVLRGWFELGMLDPA 349

Query: 352 PQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT 409
            +  +  L    + + +H++ A + AR+ + LLKN+   LPL + +IK +A++GP+A  +
Sbjct: 350 ERLPWSQLPYQIVASKEHVQQALKVARESMTLLKNNGSILPL-SKSIKKIAVIGPNAADS 408

Query: 410 KAMIGNYEGTPCRYTSPMDGF---YAYSKVINYAPGC 443
             + GNY GTP    + + G      ++++I Y  GC
Sbjct: 409 VMLWGNYNGTPNSTVTILQGIKNKLPHAEII-YDKGC 444



 Score =  111 bits (278), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 88/302 (29%), Positives = 136/302 (45%), Gaps = 53/302 (17%)

Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
           I   K  DA V   GL   +E E          G D++ + LP  Q EL++ +    K P
Sbjct: 602 IKRLKEVDAIVYAGGLSPQLEGEEMPVNADGFRGGDKISIDLPKIQRELLSSLKSTGK-P 660

Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
           V  V+ +  ++ +   + N    ++L   Y G+E G A+ADV+FG YNP GRLPIT+Y++
Sbjct: 661 VVFVLCTGSSLALEQDEKN--YNALLCAWYGGQEAGTAVADVLFGDYNPAGRLPITFYKS 718

Query: 568 ------NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
                   +K   TS       +  GRTY++     +Y FG+GLSY++F Y         
Sbjct: 719 LSQLDNALLKTSDTSRQDFENYSMQGRTYRYMTEKPLYAFGHGLSYSKFNY--------- 769

Query: 622 DIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY 681
                            G  K     V I +           I + N+    G EVV VY
Sbjct: 770 -----------------GEAKLTSGTVKIGNT------LNISIPLTNISNNKGEEVVQVY 806

Query: 682 SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTIL 740
            K  G     +K + G++RV IAAG++  + F + A ++ +  D + + L   +G +TI+
Sbjct: 807 VKRNGDPDAPVKSLKGFKRVAIAAGETKHLDFQLTA-EAFEFYDPSKDELGPKAGNYTIM 865

Query: 741 VG 742
            G
Sbjct: 866 YG 867


>gi|189464325|ref|ZP_03013110.1| hypothetical protein BACINT_00666 [Bacteroides intestinalis DSM
           17393]
 gi|189438115|gb|EDV07100.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 935

 Score =  263 bits (672), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 224/760 (29%), Positives = 356/760 (46%), Gaps = 119/760 (15%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSEALHG 65
           +   Y D  LP  ER + L+  MT PE   ++    +G+P  G+P LY       EA+HG
Sbjct: 147 TSLRYMDPTLPVEERVESLLSVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVEAVHG 203

Query: 66  VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
            S+         G+       GAT FP  +   A++N+ L +++   V  E      L  
Sbjct: 204 FSY---------GS-------GATIFPQALAMGATWNKKLTEEVAMAVGDE-----TLSA 242

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
             +  WSP ++V +D RWGR  ET GEDP +V +    +++G Q               +
Sbjct: 243 GTMQAWSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS--------------M 288

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
            +    KH+  +         R   D  ++E++M+E  ++PF   +   D  S+M +Y+ 
Sbjct: 289 GLYTTPKHFGGHGAPL---GGRDSHDIGLSEREMREVHLVPFRHVIRNYDCQSLMMAYSD 345

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
             G+P     +LL+  +R +W F G+IVSDC +I  +     +    K +A  + L AG+
Sbjct: 346 FLGVPVAKSRELLHNILREEWGFSGFIVSDCGAIGNLTARKHYTAKNKIEAANQALAAGI 405

Query: 306 DLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC- 363
             +CGD Y +   + A + G+I   ++D   R +  ++ R   F+ +P  K L  N I  
Sbjct: 406 ATNCGDTYNDKEVIQAAKDGRINMENLDEVCRTMLRMMFRNELFEKAPN-KPLDWNKIYP 464

Query: 364 ---NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY--EG 418
              +  H E+A +AAR+ IVLL+N +  LPL+  +++T+A++GP AN  +   G+Y  + 
Sbjct: 465 GWNSDSHKEMARQAARESIVLLENKDNILPLSK-DMRTIAVLGPGANDLQP--GDYTPKL 521

Query: 419 TPCRYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
            P +  S + G        +KVI Y  GC D      + I  A+  A  +D  ++V G  
Sbjct: 522 QPGQLKSVLTGIKQAVGKQTKVI-YEQGC-DFTSLGENNIAKAVKVASQSDVVLLVLGDC 579

Query: 475 LSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
            + EA         E  D   L+LPG Q EL+  V   A G   ++I+ AG    N +K 
Sbjct: 580 STSEATTDVYKTSGENHDYATLILPGKQQELLEAV--CATGKPVILILQAGR-PYNLSKA 636

Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
           +   K+IL    PG+EGG A ADV+FG YNP GRLP+T+    +V      +PL      
Sbjct: 637 SELCKAILVNWLPGQEGGPATADVLFGDYNPAGRLPMTF--PRHV----GQLPLYYNFKT 690

Query: 586 PGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKP 643
            GR Y++ D     +Y FGYGLSYT F+Y              K Q+  + N TV     
Sbjct: 691 SGRRYEYSDMEYYPLYYFGYGLSYTSFEYSGL-----------KIQEKENGNITV----- 734

Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF 702
                              Q  V+N+G+  G EVV +Y +       T I ++  + R+ 
Sbjct: 735 -------------------QATVKNIGQRAGDEVVQLYVTDMYASVKTRITELKDFTRIH 775

Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +  G++  V F +   + L ++++  + ++  GA  ILVG
Sbjct: 776 LKPGEAKTVSFELTPYE-LSLLNDHMDRVVEKGAFKILVG 814


>gi|167765233|ref|ZP_02437346.1| hypothetical protein BACSTE_03621 [Bacteroides stercoris ATCC
           43183]
 gi|167696861|gb|EDS13440.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           stercoris ATCC 43183]
          Length = 818

 Score =  263 bits (671), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 224/812 (27%), Positives = 369/812 (45%), Gaps = 151/812 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
           Y D   P  +R  DL+ +M++ EK  Q+  L YG  R+    LP+  W    W + +  +
Sbjct: 59  YEDPSQPVEKRVADLLSQMSVEEKTCQLATL-YGYGRVLKDSLPVAGWKNEIWKDGIANI 117

Query: 67  ----SFIGRRTNSPPG----------------------------THFDSE-VPG-----A 88
               + +G+++   PG                              F +E + G     A
Sbjct: 118 DEMLNGVGKKSAQVPGLLYPFSNHAEAVNTVQRWFVEETRLGIPVDFTNEGIHGLNHTKA 177

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVL 147
           T  P  I   +++N+ L ++ G     EA+A+      G T  ++P +++VRDPRWGR L
Sbjct: 178 TPLPAPIAIGSTWNKELVRRAGVIAGQEAKAL------GYTNVYAPILDIVRDPRWGRTL 231

Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
           E  GE+PY++       V G+Q  +GV             +A  KHYA Y +     +  
Sbjct: 232 ECYGEEPYLIAALGTEMVNGIQS-QGV-------------AATLKHYAVYSVPKGGRDGN 277

Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
              D  V  +++ E F+ PF+  +       VM SYN  +G+P  A    L + +R ++ 
Sbjct: 278 CRTDPHVAPRELHELFLYPFKKVIQNSHPMGVMSSYNDWDGVPVSASYYFLTELLREEYG 337

Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------M 318
           F GY+VSD ++++  VES   + DT ++AV +VL+AGL++      T+FT          
Sbjct: 338 FDGYVVSDSEAVE-FVESKHHVADTYDEAVRQVLEAGLNVR-----THFTPPSDFILPIR 391

Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNP-QHIELAAEAARQ 377
             +++ KI+ A ID  +  +  V  RLG FD          + +    ++++   +  +Q
Sbjct: 392 RLLEEKKISMAVIDKRVSEVLRVKFRLGLFDQPYVADTKAADRVGGADRNMDFVKQMQQQ 451

Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVI 437
            +VLLKN+N  LPL+   IK + + GP A+    M   Y        + + G   Y K I
Sbjct: 452 ALVLLKNENNILPLDKRQIKKVLVTGPLADEDNFMTSRYGPNGLETVTVLAGLRNYLKGI 511

Query: 438 ---NYAPGC--------------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
              +YA GC              A +  Q    I  A+  A  +D  + V G D     E
Sbjct: 512 AEVDYAKGCDIVDAGWPATEILPAPMSEQEKQGIAEAVAKAGESDVIIAVLGEDEYRTGE 571

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
            + R  L LPG Q +L+  +    K PV LV+++   + +N+A  N  I +IL   +PG 
Sbjct: 572 SRSRTSLDLPGRQQQLLEALHATGK-PVILVLINGQPLTVNWA--NAYIPAILESWFPGC 628

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVN---------NFPGRTYK 591
           +GG  IA+ +FG++NPGG+L +T+ ++  V     + P +P +         N  G T  
Sbjct: 629 QGGTVIAETLFGEHNPGGKLTVTFPKS--VGQIELNFPFKPGSHGAQPHSGPNGSGATRI 686

Query: 592 FFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID 651
             +   +YPFG+GLSYT F Y         D+++   QQ     +T G            
Sbjct: 687 IGE---LYPFGFGLSYTTFAYS--------DLEVSPLQQ-----HTQG------------ 718

Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAK 710
                  ++T ++ V N GK  G EVV +Y +       T+  Q+ G+ERV +  G++ +
Sbjct: 719 -------EYTIKVNVTNTGKRAGDEVVQLYVRDKVSSVITYDSQLRGFERVSLQPGETRQ 771

Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           V F++   + L+I+D   N  +  G   +++G
Sbjct: 772 VTFSLKP-EDLQILDRNMNWTVEPGEFEVMIG 802


>gi|160887545|ref|ZP_02068548.1| hypothetical protein BACOVA_05565 [Bacteroides ovatus ATCC 8483]
 gi|156107956|gb|EDO09701.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
          Length = 736

 Score =  263 bits (671), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 219/723 (30%), Positives = 341/723 (47%), Gaps = 114/723 (15%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+P++    EA HG   IG                 AT FPT I   A+++  L K++
Sbjct: 83  RLGIPMF-LAEEAPHGHMAIG-----------------ATVFPTGIGMAATWSPELVKEV 124

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           GQ ++ E R+       G   + P +++ RDPRW RV ET GEDP + G    + V GL 
Sbjct: 125 GQVIAKEIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLG 179

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                     + S+     A  KH+ AY +   EG    ++ S V  +D+ + F+ PF  
Sbjct: 180 G--------GNLSQKYATIATLKHFLAYAVP--EGGQNGNYAS-VGIRDLHQNFLPPFRK 228

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            ++ G +S VM SYN ++G P  ++  LL Q +R +W F G++VSD  SI+ I ESH F+
Sbjct: 229 AIDAGALS-VMTSYNSIDGTPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FV 286

Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
             TKE+A  + + AG+D+D G D YTN    AVQ G++ +  IDT++  +  +   +G F
Sbjct: 287 APTKENAAIQSVMAGVDVDLGGDAYTNLCH-AVQSGQMDKTVIDTAVCRVLRMKFEMGLF 345

Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
           +       +    +   +HIELA + A+  I LLKN+N  LPL+   I  +A++GP+A+ 
Sbjct: 346 EHPYVDPKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSK-TINKVAVIGPNADN 404

Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYAYSK--VINYAPGCADIVCQNNSMIPAAIDAAKNA 464
              M+G+Y          + +DG         + Y  GCA I     + I  AI AA+ +
Sbjct: 405 RYNMLGDYTAPQEDSNVKTVLDGILTKLSPFRVEYVRGCA-IRDTTVNEIEQAIKAARRS 463

Query: 465 D----------------------ATVIVAGLDLSVE-AEGKDRVDLLLPGFQTELINKVA 501
           +                      A V   G    +E  EG DR  L L G Q EL+  + 
Sbjct: 464 EVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQ 523

Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
              K P+ +V +    ++ N+A       ++L   YPG+EGG AIADV+FG YNP GRLP
Sbjct: 524 KTGK-PLIVVYIEGRPLEKNWASEYA--DALLTAYYPGQEGGNAIADVLFGDYNPSGRLP 580

Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
           I+    +  +IP       P N+     Y       +Y FGYG+SYT F+Y         
Sbjct: 581 IS-VPRSVGQIPVYYNKKAPRNH----DYVEMSSFPLYSFGYGMSYTTFEYS-------- 627

Query: 622 DIK-LDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
           D++ + K  +C ++++                            +V+N GK DG EV  +
Sbjct: 628 DLQVVQKSARCFEVSF----------------------------KVKNTGKYDGEEVSQL 659

Query: 681 YSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
           Y +    +    +KQ+  +ER  +  G+  KV F +   +   +V+     ++ SG   +
Sbjct: 660 YMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGNFHL 718

Query: 740 LVG 742
           ++G
Sbjct: 719 MIG 721


>gi|300773468|ref|ZP_07083337.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
           33861]
 gi|300759639|gb|EFK56466.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
           33861]
          Length = 777

 Score =  263 bits (671), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 209/699 (29%), Positives = 327/699 (46%), Gaps = 117/699 (16%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+P++    EA HG   IG                  T FPT I   +++N +L +K+
Sbjct: 126 RLGIPVF-LAEEAPHGHMAIG-----------------TTVFPTGIGQASTWNPALLQKM 167

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
             TV+ E R            + P +++ RDPRW RV E+ GEDP + G  A   VRGL 
Sbjct: 168 SATVAKEVRQ-----QGAHISYGPVLDLSRDPRWSRVEESYGEDPVLTGTLAAAIVRGLG 222

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                     + S P       KH+ AY +     N      + V E++++E F+ PF+ 
Sbjct: 223 S--------GNLSDPFATIPTLKHFVAYGIPEGGHNGS---AASVGERELREYFLPPFQS 271

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            V  G   SVM +YN V+GIP  ++  LL   +R +W+F+G+ VSD  SI+ I  SH+  
Sbjct: 272 AVAAG-AKSVMAAYNSVDGIPCSSNKFLLTDILRKEWSFNGFTVSDLGSIEGIKGSHRVA 330

Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
            D K+ A+   ++AGLD D G       + AV+QG++ E  ID ++  +  +   +G F+
Sbjct: 331 KDHKQAAIL-AIEAGLDADLGGNAYVRLIEAVKQGEVQENSIDQAVSRILALKFEMGLFE 389

Query: 350 GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT 409
                    K  +    +I L+ + AR+ IVLL+N N  LPL   ++K +A+VGP+A+  
Sbjct: 390 KPFVDVKTAKKEVKTESNIALSRQVARESIVLLENKNNILPLRK-DVK-IAIVGPNADNV 447

Query: 410 KAMIGNY-----EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNA 464
             M+G+Y     +G        +      ++V +Y  GCA I    NS IPAA+ AA+ +
Sbjct: 448 YNMLGDYTAPQPDGAVTTVRQAISARLPKAQV-SYVKGCA-IRDTTNSDIPAAVTAARQS 505

Query: 465 DATVIVAG----LDLSVE-------------------AEGKDRVDLLLPGFQTELINKVA 501
           D  V V G     D   E                    EG DR  L L G Q EL+  + 
Sbjct: 506 DIIVAVVGGSSARDFKTEYISTGAAVASDKSVSDMESGEGFDRSTLDLLGRQMELLKALK 565

Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
              K P+ ++ +    +++N+A    +  ++L   YPG+EGG AIADV+FG YNP G++P
Sbjct: 566 QTGK-PLVVIYIQGRPLNMNWAAT--QADALLCAWYPGQEGGHAIADVLFGDYNPAGKMP 622

Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPK 619
           ++      V      +P+   N      +++ +     +Y FGYG SY+ F+YK      
Sbjct: 623 LS------VPRSVGQIPVH-YNRKSSLDHRYVEEAATPLYAFGYGKSYSDFEYK------ 669

Query: 620 SVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVM 679
             D+K+ K+                            DY  +F +   N GK DG EV  
Sbjct: 670 --DLKIQKEN--------------------------TDYHVSFTLT--NTGKYDGDEVPQ 699

Query: 680 VYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNA 717
           +Y +    + +  ++Q+  +ER+ +  G+S  V F + A
Sbjct: 700 LYIRNQYASVSQPVQQLKHFERIHLKTGESKTVSFVLTA 738


>gi|262405113|ref|ZP_06081663.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_22]
 gi|262355988|gb|EEZ05078.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_22]
          Length = 769

 Score =  263 bits (671), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 214/725 (29%), Positives = 326/725 (44%), Gaps = 119/725 (16%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+PL+    EA HG   IG                 AT FPT I   A+++  L +++
Sbjct: 117 RLGIPLF-LAEEAPHGHMAIG-----------------ATVFPTGIGMAATWSPQLIREV 158

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           G+ +  E R        G   + P +++ RDPRW RV ET GEDP + G      V GL 
Sbjct: 159 GKAIGKEIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAMVEGLG 213

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                       S P    A  KH+ AY +     N    F      +++ E F+ PF  
Sbjct: 214 G--------GDLSHPYSTLATLKHFLAYGISESGQNGNPSFAGI---RELHENFLPPFRQ 262

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            ++ G +S VM SYN ++G+P  A+  LL + +R +W F G +VSD  SI+ I +SH F+
Sbjct: 263 AIDAGALS-VMTSYNSMDGVPCTANHSLLTELLRNEWKFRGIVVSDLYSIEGIHQSH-FV 320

Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
             T E+A    L AG+D+D G D Y N  M AV  G+I++  +D S+  +  +   +G F
Sbjct: 321 APTMEEAAILALSAGVDVDLGGDAYMNL-MNAVNTGRISKTALDASVARVLRLKFEMGLF 379

Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
           +         K  + + + + LA   A+  I LLKN++  LPLN    + +AL+GP+A+ 
Sbjct: 380 ENPYVDPEKAKKEVRSEESVTLARRVAQASITLLKNEHSLLPLNKN--RKVALIGPNADN 437

Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYA--YSKVINYAPGCA--DIVCQNNSMIPAAIDAAK 462
              M+G+Y          + +DG  A   S  + Y  GC+  D V  +   I  A+ AA+
Sbjct: 438 RYNMLGDYTAPQEEENIKTVLDGIRAKLSSSQVEYVKGCSIRDTVTTD---IEQAVAAAQ 494

Query: 463 NADATVIVAGLDLSVE-----------------------AEGKDRVDLLLPGFQTELINK 499
            ++  + V G   + +                        EG DR  L L G Q EL+ K
Sbjct: 495 RSEVIIAVVGGSSARDFKTSYKETGAAIADEKTISDMECGEGFDRATLSLLGKQQELL-K 553

Query: 500 VADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGR 559
              A   P+ +V +    +D N+A  N    ++L   YPG+EGG AIADV+FG +NP GR
Sbjct: 554 ALKATGKPLIVVYIEGRPLDKNWASENAD--AVLTAYYPGQEGGIAIADVLFGDFNPAGR 611

Query: 560 LPITWYEANYVKIPYTSMPLRPVNNFP-GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSP 618
           LP +      V      +PL      P    Y       +YPFGYGLSYT F Y      
Sbjct: 612 LPFS------VPRSVGQIPLYYNKKAPQSHDYVEMSASPLYPFGYGLSYTSFDYS----- 660

Query: 619 KSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVV 678
              D+ L                           +  + ++ +F+  V N GK DG EV 
Sbjct: 661 ---DLHLSA-------------------------LMPRSFEISFK--VRNTGKYDGEEVA 690

Query: 679 MVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAH 737
            +Y +    +    +KQ+  + R ++  G+  +V F ++  +   +VD     ++  G  
Sbjct: 691 QLYLRDEYASVVQPLKQLKHFARFYLKRGEEREVKFILSE-EDFSLVDRNLKKIVEPGTF 749

Query: 738 TILVG 742
            I++G
Sbjct: 750 QIMIG 754


>gi|383110724|ref|ZP_09931543.1| hypothetical protein BSGG_1833 [Bacteroides sp. D2]
 gi|382949470|gb|EFS31133.2| hypothetical protein BSGG_1833 [Bacteroides sp. D2]
          Length = 783

 Score =  262 bits (670), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 214/724 (29%), Positives = 325/724 (44%), Gaps = 117/724 (16%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+PL+    EA HG   IG                 AT FPT I   A+++  L +++
Sbjct: 131 RLGIPLF-LAEEAPHGHMAIG-----------------ATVFPTGIGMAATWSPQLIREV 172

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           G+ +  E R        G   + P +++ RDPRW RV ET GEDP + G      V GL 
Sbjct: 173 GKAIGKEIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAMVEGLG 227

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                       S P    A  KH+ AY +     N    F      +++ E F+ PF  
Sbjct: 228 S--------GDLSHPYSTLATLKHFLAYGISESGQNGNPSFAGI---RELHENFLPPFRQ 276

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            ++ G +S VM SYN ++GIP  A+  LL + +R +W F G +VSD  SI+ I +SH F+
Sbjct: 277 AIDAGALS-VMTSYNSMDGIPCTANHSLLTELLRNEWKFSGIVVSDLYSIEGIHQSH-FV 334

Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
             T E A    L AG+D+D G D Y N  M AV  G+I++  +D S+  +  +   +G F
Sbjct: 335 APTMEAAAILALSAGVDVDLGGDAYMNL-MNAVNTGRISKTALDASVARVLRLKFEMGLF 393

Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
           +         K  + + + + LA   A+  I LLKN++  LPLN    + +AL+GP+A+ 
Sbjct: 394 ENPYVDPEKAKKEVRSEESVTLARRVAQASITLLKNEHSLLPLNKN--RKVALIGPNADN 451

Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYA--YSKVINYAPGCA--DIVCQNNSMIPAAIDAAK 462
              M+G+Y          + +DG      S  + Y  GC+  D V  +   I  A+ AA+
Sbjct: 452 RYNMLGDYTAPQEEENIKTVLDGIRTKLSSSQVEYVKGCSIRDTVTTD---IEQAVAAAQ 508

Query: 463 NADATVIVAGLDLSVE-----------------------AEGKDRVDLLLPGFQTELINK 499
            ++  + V G   + +                        EG DR  L L G Q EL+ K
Sbjct: 509 RSEVIIAVVGGSSARDFKTSYKETGAAIADEKTISDMECGEGFDRATLSLLGKQQELL-K 567

Query: 500 VADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGR 559
              A   P+ +V +    +D  +A  N    ++L   YPG+EGG AIADV+FG YNP GR
Sbjct: 568 ALKATGKPLIVVYIEGRPLDKTWASENAD--AVLTAYYPGQEGGNAIADVLFGDYNPAGR 625

Query: 560 LPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPK 619
           LP+T    +  +IP       P N+     Y       +Y FGYGLSYT F+Y       
Sbjct: 626 LPLT-VPRSVGQIPIYYNKKAPQNH----DYVELSASPLYAFGYGLSYTTFEYS------ 674

Query: 620 SVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVM 679
             D+++                                + F    +V+N G+ DG EV  
Sbjct: 675 --DLRVS---------------------------AISPHSFEVSFKVKNTGRYDGEEVSQ 705

Query: 680 VYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHT 738
           +Y +    +    +KQ+  +ER  +  G+  +V F ++      I+D    +++ SG   
Sbjct: 706 LYLRDEYASVVQPLKQLKHFERFCLKRGEVKEVKFVLSES-DFTIIDRNLKTVVESGTFQ 764

Query: 739 ILVG 742
           ++VG
Sbjct: 765 VMVG 768


>gi|294647557|ref|ZP_06725134.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294807095|ref|ZP_06765914.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
 gi|345508184|ref|ZP_08787819.1| periplasmic beta-glucosidase [Bacteroides sp. D1]
 gi|292637099|gb|EFF55540.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294445794|gb|EFG14442.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
 gi|345455214|gb|EEO50370.2| periplasmic beta-glucosidase [Bacteroides sp. D1]
          Length = 783

 Score =  262 bits (670), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 214/725 (29%), Positives = 326/725 (44%), Gaps = 119/725 (16%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+PL+    EA HG   IG                 AT FPT I   A+++  L +++
Sbjct: 131 RLGIPLF-LAEEAPHGHMAIG-----------------ATVFPTGIGMAATWSPQLIREV 172

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           G+ +  E R        G   + P +++ RDPRW RV ET GEDP + G      V GL 
Sbjct: 173 GKAIGKEIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAMVEGLG 227

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                       S P    A  KH+ AY +     N    F      +++ E F+ PF  
Sbjct: 228 G--------GDLSHPYSTLATLKHFLAYGISESGQNGNPSFAGI---RELHENFLPPFRQ 276

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            ++ G +S VM SYN ++G+P  A+  LL + +R +W F G +VSD  SI+ I +SH F+
Sbjct: 277 AIDAGALS-VMTSYNSMDGVPCTANHSLLTELLRNEWKFRGIVVSDLYSIEGIHQSH-FV 334

Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
             T E+A    L AG+D+D G D Y N  M AV  G+I++  +D S+  +  +   +G F
Sbjct: 335 APTMEEAAILALSAGVDVDLGGDAYMNL-MNAVNTGRISKTALDASVARVLRLKFEMGLF 393

Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
           +         K  + + + + LA   A+  I LLKN++  LPLN    + +AL+GP+A+ 
Sbjct: 394 ENPYVDPEKAKKEVRSEESVTLARRVAQASITLLKNEHSLLPLNKN--RKVALIGPNADN 451

Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYA--YSKVINYAPGCA--DIVCQNNSMIPAAIDAAK 462
              M+G+Y          + +DG  A   S  + Y  GC+  D V  +   I  A+ AA+
Sbjct: 452 RYNMLGDYTAPQEEENIKTVLDGIRAKLSSSQVEYVKGCSIRDTVTTD---IEQAVAAAQ 508

Query: 463 NADATVIVAGLDLSVE-----------------------AEGKDRVDLLLPGFQTELINK 499
            ++  + V G   + +                        EG DR  L L G Q EL+ K
Sbjct: 509 RSEVIIAVVGGSSARDFKTSYKETGAAIADEKTISDMECGEGFDRATLSLLGKQQELL-K 567

Query: 500 VADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGR 559
              A   P+ +V +    +D N+A  N    ++L   YPG+EGG AIADV+FG +NP GR
Sbjct: 568 ALKATGKPLIVVYIEGRPLDKNWASENAD--AVLTAYYPGQEGGIAIADVLFGDFNPAGR 625

Query: 560 LPITWYEANYVKIPYTSMPLRPVNNFP-GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSP 618
           LP +      V      +PL      P    Y       +YPFGYGLSYT F Y      
Sbjct: 626 LPFS------VPRSVGQIPLYYNKKAPQSHDYVEMSASPLYPFGYGLSYTSFDYS----- 674

Query: 619 KSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVV 678
              D+ L                           +  + ++ +F+  V N GK DG EV 
Sbjct: 675 ---DLHLSA-------------------------LMPRSFEISFK--VRNTGKYDGEEVA 704

Query: 679 MVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAH 737
            +Y +    +    +KQ+  + R ++  G+  +V F ++  +   +VD     ++  G  
Sbjct: 705 QLYLRDEYASVVQPLKQLKHFARFYLKRGEEREVKFILSE-EDFSLVDRNLKKIVEPGTF 763

Query: 738 TILVG 742
            I++G
Sbjct: 764 QIMIG 768


>gi|383115541|ref|ZP_09936297.1| hypothetical protein BSGG_2589 [Bacteroides sp. D2]
 gi|313695054|gb|EFS31889.1| hypothetical protein BSGG_2589 [Bacteroides sp. D2]
          Length = 800

 Score =  262 bits (670), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 227/800 (28%), Positives = 359/800 (44%), Gaps = 141/800 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
           Y D   P   R  DL+ +MTL EK  QM  L YG  R+     P   W    W + +   
Sbjct: 56  YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTDGWSTEIWKDGIGNI 114

Query: 64  ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
               +G+   G   + P                       P    +  + G     AT F
Sbjct: 115 DEQANGLGKFGSEISYPYANSVKNRHTVQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMF 174

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
           P      A++N+ L ++I +  + EA+A+      G T  ++P +++ +DPRWGRV+E+ 
Sbjct: 175 PAQCGQGATWNKKLIREIAKVTADEAKAL------GYTNIYAPILDIAQDPRWGRVVESY 228

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+VG      + GLQ  EG             I A  KH+A Y +     +     
Sbjct: 229 GEDPYLVGELGKQMILGLQS-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 274

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
           D  V  ++M+  ++ PF   + E     VM SYN  +G P       L + +R  W F G
Sbjct: 275 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 334

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
           Y+VSD ++++ +   H+ +  T+E+  A+V+ AGL++      TNFT           A+
Sbjct: 335 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAI 388

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
            +GK++   +D  +  +  V   +G FD   P      +  + N  H +++  AA + IV
Sbjct: 389 SEGKVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPEVVVHNAAHQDVSMRAALESIV 448

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
           LLKN+   LPL+  +   +A++GP+A   K +   Y        +   G   Y  +  + 
Sbjct: 449 LLKNEKEMLPLSK-SFSKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNAEVR 507

Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
           YA GC DI+                Q  +MI  A++ AK +D  ++V G +     E   
Sbjct: 508 YAKGC-DIIDKYFPESELYNVPLDTQEQAMINEAVELAKASDVAILVLGGNEKTVREEFS 566

Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
           R +L L G Q +L+  V    K PV LV++   A  IN+A  N  + +I+   +PGE  G
Sbjct: 567 RTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMG 623

Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
            AIA V+FG YNPGGRL +T +  +  +IP+ + P +P ++  G   K     V+YPFGY
Sbjct: 624 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKG---KVRVAGVLYPFGY 678

Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
           GLSYT F Y         ++K+               +KP   A             T  
Sbjct: 679 GLSYTTFNYS--------NLKI---------------SKPVIGA---------QENITLS 706

Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
             V+N GK  G EVV +Y +    + T   +V+ G+ER+ +  G+   + FT+   + L 
Sbjct: 707 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTISFTLTP-QDLG 765

Query: 723 IVDNAANSLLASGAHTILVG 742
           + D      +  G+ +++VG
Sbjct: 766 LWDKNNQFTVEPGSFSVMVG 785


>gi|237721201|ref|ZP_04551682.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
 gi|229448997|gb|EEO54788.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
          Length = 863

 Score =  262 bits (670), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 168/448 (37%), Positives = 239/448 (53%), Gaps = 46/448 (10%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S +PY D KL   +RA DL++R+TL EKV  M + +  +PRLG+  YEWW+EALHGV+  
Sbjct: 24  SKYPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARA 83

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---- 125
           G                 AT FP  I   ASFN+ L  ++   VS EARA     N    
Sbjct: 84  GL----------------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQ 127

Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
                GLT W+PN+N+ RDPRWGR  ET GEDPY+ GR  +  VRGLQ  E  EY     
Sbjct: 128 YKRYQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYD---- 183

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVM 240
               K+ AC KH+A +    W   +R  F++  +  +D+ ET++  F+  V +  V  VM
Sbjct: 184 ----KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVM 236

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV--- 297
           C+YNR  G P C   +LL Q +R DW F G +V+DC +I    +  K  ++T  DA    
Sbjct: 237 CAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKK--HETHPDAAHAS 294

Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL 357
           A  + +G DL+CG  + + T  AV++G I+E  I+TS++ L      LG  + +  + N+
Sbjct: 295 ADAVLSGTDLECGGNFKSIT-DAVKKGLISEEKINTSVKRLLKARFELGEMNSTHPWSNI 353

Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
             + I  P+H ELA + A + +VLL+N+N    L       +A++GP+AN +    GNY 
Sbjct: 354 PFSVIDCPKHKELALKMAHESLVLLQNNNNL--LPLNRQMKVAVIGPNANDSVMQWGNYN 411

Query: 418 GTPCRYTSPMDGFYAY--SKVINYAPGC 443
           G P    + ++G  A      I Y P C
Sbjct: 412 GFPSHTVTLLEGIRAKLPDAQIIYEPVC 439



 Score =  118 bits (296), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 95/296 (32%), Positives = 138/296 (46%), Gaps = 53/296 (17%)

Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
           ++  ++AD  +   G+   +E E          G DR ++ LP  Q E++  +    K  
Sbjct: 594 LNKLQSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKKT 653

Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
           V  V  S  A+ I     N    +IL   YPG+ GG A+ADV+FG YNP GRLPIT+Y++
Sbjct: 654 V-FVNFSGSAMAIVPETQN--CDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKS 710

Query: 568 NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
                 Y    ++      GRTY+F     +YPFGYGLSYT+F Y  A+  +S   KL K
Sbjct: 711 MQQLPDYEDYSMK------GRTYRFMTKTPLYPFGYGLSYTRFSYGKATLNQS---KLTK 761

Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI 687
            ++                A+L              I V N+G+ DG EVV VY   P  
Sbjct: 762 GEK----------------AILT-------------IPVSNVGQRDGEEVVQVYICRPDD 792

Query: 688 AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
                K + G++RV IA G++  V   +    S +  D A N++   +G + IL G
Sbjct: 793 KEGPQKTLRGFQRVSIAKGKTQNVQIEL-PYDSFEWFDAATNTIRPLNGTYKILYG 847


>gi|299146513|ref|ZP_07039581.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298517004|gb|EFI40885.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 736

 Score =  262 bits (670), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 218/723 (30%), Positives = 342/723 (47%), Gaps = 114/723 (15%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+P++    EA HG   IG                  T FPT I   A+++  L K++
Sbjct: 83  RLGIPMF-LAEEAPHGHMAIG-----------------TTVFPTGIGMAATWSPELVKEV 124

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           GQ ++ E R+       G   + P +++ RDPRW RV ET GEDP + G    + V GL 
Sbjct: 125 GQVIAKEIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLG 179

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                     + S+     A  KH+ AY +   EG    ++ S V  +D+ + F+ PF  
Sbjct: 180 G--------GNLSQKYATIATLKHFLAYAVP--EGGQNGNYAS-VGIRDLHQNFLPPFRK 228

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            ++ G +S VM SYN ++GIP  ++  LL + +R +W F G++VSD  SI+ I ESH F+
Sbjct: 229 AIDAGALS-VMTSYNSIDGIPCTSNHYLLTKLLRNEWKFRGFVVSDLYSIEGIHESH-FV 286

Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
             TKE+A  + + AG+D+D G D YTN    AVQ G++ +  IDT++  +  +   +G F
Sbjct: 287 APTKENAAIQSVMAGVDVDLGGDAYTNLCH-AVQSGQMDKTVIDTAVCRVLRMKFEMGLF 345

Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
           +       +    +   +HIELA + A+  I LLKN+N  LPL+   I  +A++GP+A+ 
Sbjct: 346 EHPYVDPKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSK-MINKVAVIGPNADN 404

Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYA--YSKVINYAPGCADIVCQNNSMIPAAIDAAKNA 464
              M+G+Y          + +DG         + Y  GCA I     + I  AI+AA+ +
Sbjct: 405 RYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCA-IRDTTVNEIEQAIEAARRS 463

Query: 465 D----------------------ATVIVAGLDLSVE-AEGKDRVDLLLPGFQTELINKVA 501
           +                      A V   G    +E  EG DR  L L G Q EL+  + 
Sbjct: 464 EVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQ 523

Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
              K P+ +V +    ++ N+A       ++L   YPG+EGG AIADV+FG YNP GRLP
Sbjct: 524 KTGK-PLIVVYIEGRPLEKNWASEYA--DALLTAYYPGQEGGNAIADVLFGDYNPSGRLP 580

Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
           I+    +  +IP       P N+     Y       +Y FGYG+SYT F+Y         
Sbjct: 581 IS-VPRSVGQIPVYYNQKAPRNH----DYVEVSSSPLYSFGYGMSYTTFEYS-------- 627

Query: 622 DIK-LDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
           D++ + K  +C ++++                            +V+N GK DG EV  +
Sbjct: 628 DLQVVQKSARCFEVSF----------------------------KVKNTGKYDGEEVSQL 659

Query: 681 YSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
           Y +    +    +KQ+  +ER  +  G+  KV F +   +   +V+     ++ SG   +
Sbjct: 660 YMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGNFHL 718

Query: 740 LVG 742
           ++G
Sbjct: 719 MIG 721


>gi|294146775|ref|YP_003559441.1| beta-glucosidase [Sphingobium japonicum UT26S]
 gi|292677192|dbj|BAI98709.1| beta-glucosidase [Sphingobium japonicum UT26S]
          Length = 791

 Score =  262 bits (670), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 226/770 (29%), Positives = 353/770 (45%), Gaps = 123/770 (15%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEK--------VQQMGDLAYGVPRLGLPLYEWWSEAL 63
           FP+   +   P  AK       +P +        V  +   A    RLG+P+  +  E L
Sbjct: 91  FPHGMGQFTRPSDAKGAFSPREVPGRNPRQTVALVNALQRWATTQTRLGIPIL-FHEEGL 149

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HG + +G                 ATSFP  I   +S++  L +++   ++ E R+    
Sbjct: 150 HGYAAVG-----------------ATSFPQSIAMASSWDPDLLREVNAVIAREIRSR--- 189

Query: 124 GNAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
              G++   SP +++ RDPRWGR+ ET GEDPY+VG   +  V GLQ        R    
Sbjct: 190 ---GVSLVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQG-----KGRSRLL 241

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
            P K+ A  KH   +       N      + V+E++++E F  PFE  V    + +VM S
Sbjct: 242 PPGKVFATLKHLTGHGQPESGTN---VGPAPVSERELRENFFPPFEQVVKRTGIEAVMAS 298

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           YN ++G+P+ A+  LL   +RG+W F G +VSD  ++  ++  H    D  E A  R L 
Sbjct: 299 YNEIDGVPSHANRWLLRDVLRGEWGFRGAVVSDYSAVDQLMSIHHVAADL-EQAAGRALD 357

Query: 303 AGLDLDCGDYYTNFTMG-AVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN 361
           AG+D D  D  +  T+G  V++GKI EA +D ++R +  +  R G F+ +P         
Sbjct: 358 AGVDADLPDGLSYATLGRQVREGKIGEALVDRAVRHMLELKFRAGLFE-NPYADAAASEK 416

Query: 362 ICNPQHIELAAEAARQ-GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
           I N       A  A Q  I+LLKND G LPL      ++A++GP  +A  A +G Y G P
Sbjct: 417 ITNDARARALALKAAQRSIILLKND-GMLPLKPEG--SIAVIGP--SAAVARLGGYYGQP 471

Query: 421 CRYTSPMDGFYA----YSKVINYAPGC---------ADIV-----CQNNSMIPAAIDAAK 462
               S ++G  A     +K++ +A G          AD V      +N  +I  A++AA+
Sbjct: 472 PHSVSILEGIRAKVGNRAKIV-FAQGVRITENDDWWADKVTRSDPAENRRLIAQAVEAAR 530

Query: 463 NADATVIVAGLDLSVEAEG------KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAG 516
           + D  V+  G       EG       DR  L L G Q EL + +    K P+ +V+++  
Sbjct: 531 HVDRIVLTLGDTEQSSREGWADNHLGDRPSLDLVGEQQELFDALKALGK-PIAVVLINGR 589

Query: 517 AVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS 576
               +  K + +  +IL   Y GE+GG A+ADV+FG  NPGG+LP+T        IP ++
Sbjct: 590 PA--STVKVSEQADAILEGWYLGEQGGHAVADVLFGDVNPGGKLPVT--------IPRSA 639

Query: 577 MPLRPVNNF---PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRD 633
             L    N      R Y F     +YPFG+GLSYT F     S+P+    K+      R 
Sbjct: 640 GQLPMFYNVKPSARRGYLFDTTDPLYPFGFGLSYTSFDL---SAPRLSAAKISVGGMTR- 695

Query: 634 INYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHI 692
                                         ++V N G+ +G EVV +Y +   G     I
Sbjct: 696 ----------------------------VSVDVRNSGRREGDEVVQLYVRDKVGSVTRPI 727

Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           K++ G++RV +  G+   V FT+   ++L++ ++  + ++  G   I+ G
Sbjct: 728 KELKGFQRVTLKPGEVRTVTFTI-GPEALQMWNDHMDRVVEPGDFEIMTG 776


>gi|224538282|ref|ZP_03678821.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224520107|gb|EEF89212.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 864

 Score =  262 bits (669), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 159/444 (35%), Positives = 232/444 (52%), Gaps = 43/444 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           FPY D  L   ERA DL++R+TL EK   M + +  +PRL +  Y WW+EALHG++  G 
Sbjct: 27  FPYQDTSLTAEERADDLLKRLTLEEKASLMMNGSPAIPRLSIKAYGWWNEALHGLARTGL 86

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA----MYNLGN-- 125
                           AT FP  I   ASF++SL  ++   VS EARA    + + GN  
Sbjct: 87  ----------------ATVFPQAIGMGASFDDSLLYEVFTAVSDEARAKSRRLDSKGNLT 130

Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LT W+PN+N+ RDPRWGR  ET GEDPY+  R  +  V GLQ  +   Y+      
Sbjct: 131 RYQALTVWTPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQGPDTARYN------ 184

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             K+ AC KHYA +    W   +R  F++  ++ +D+ ET++  F+  V E  V  VMC+
Sbjct: 185 --KLHACAKHYAVHSGPEW---NRHSFNAENISPRDLWETYLPAFKTLVQEAKVKEVMCA 239

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKF-LNDTKEDAVARVL 301
           YNR  G P C   +LL Q +R +W F G +VSDC ++    +  K   +     A A  +
Sbjct: 240 YNRFEGEPCCGSNRLLTQILRDEWGFDGVVVSDCGAVSDFWQKRKHETHPDAASASADAV 299

Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN 361
             G D++CG+ Y +    AV+ G I E  ID S++ L      LG  D +  +  +  + 
Sbjct: 300 LNGTDVECGNSYKSLP-DAVKAGLITENQIDISVKRLLKARFELGEMDENV-WTGISSDV 357

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           + +P+H +LA + AR+ + LL+N+N  LPL+      +AL+GP+AN +    GNY G P 
Sbjct: 358 VDSPKHRQLALQMARETMTLLQNNNNILPLSKQ--AKIALIGPNANDSVMQWGNYNGLPS 415

Query: 422 RYTSPMDGFYAYSKVIN--YAPGC 443
              + ++G   Y    N  Y P C
Sbjct: 416 HTITLLEGMQRYLPTSNLIYEPVC 439



 Score =  120 bits (300), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 86/300 (28%), Positives = 142/300 (47%), Gaps = 53/300 (17%)

Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
           +   ++  K+ D  +   G+  ++E E          G DR ++ LP  Q  ++  +  A
Sbjct: 591 VKGLLERIKDVDVVIFAGGISPALEGEEMPVDAAGFRGGDRTEIELPAVQRRVVEALKTA 650

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K  +  V  S  A+ +     N   ++IL   YPG+ GG+A+A+V+FG YNP G+LP+T
Sbjct: 651 GK-RIVFVNFSGAAIALEPESQN--CEAILQAWYPGQAGGQAVAEVLFGDYNPAGKLPLT 707

Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
           +Y  N  +IP          N  GRTY++     ++PFG+GLSYT FKY          +
Sbjct: 708 FYR-NLAQIPDFE-----DYNMTGRTYRYMKETPLFPFGHGLSYTTFKYG--------KL 753

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
           K++ D+         G N                      I V N G  DG EVV VY K
Sbjct: 754 KMNDDK------IAAGQN------------------LNLAIPVTNTGSRDGDEVVQVYLK 789

Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
                   +K +  ++RV I AG++ +V F+++  + L+  D  +N++ +  G +T+++G
Sbjct: 790 KMDDTEGPVKTLRAFKRVRIPAGKTVEVKFSLDDTQ-LEWWDEQSNTMRVCPGNYTVMIG 848


>gi|423221630|ref|ZP_17208100.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392645869|gb|EIY39591.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 864

 Score =  262 bits (669), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 159/444 (35%), Positives = 232/444 (52%), Gaps = 43/444 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           FPY D  L   ERA DL++R+TL EK   M + +  +PRL +  Y WW+EALHG++  G 
Sbjct: 27  FPYQDTSLTAEERADDLLKRLTLEEKASLMMNGSPAIPRLSIKAYGWWNEALHGLARTGL 86

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA----MYNLGN-- 125
                           AT FP  I   ASF++SL  ++   VS EARA    + + GN  
Sbjct: 87  ----------------ATVFPQAIGMGASFDDSLLYEVFTAVSDEARAKSRRLDSKGNLT 130

Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LT W+PN+N+ RDPRWGR  ET GEDPY+  R  +  V GLQ  +   Y+      
Sbjct: 131 RYQALTVWTPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQGPDTARYN------ 184

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             K+ AC KHYA +    W   +R  F++  ++ +D+ ET++  F+  V E  V  VMC+
Sbjct: 185 --KLHACAKHYAVHSGPEW---NRHSFNAENISPRDLWETYLPAFKTLVQEAKVKEVMCA 239

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKF-LNDTKEDAVARVL 301
           YNR  G P C   +LL Q +R +W F G +VSDC ++    +  K   +     A A  +
Sbjct: 240 YNRFEGEPCCGSNRLLTQILRDEWGFDGVVVSDCGAVSDFWQKRKHETHPDAASASADAV 299

Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN 361
             G D++CG+ Y +    AV+ G I E  ID S++ L      LG  D +  +  +  + 
Sbjct: 300 LNGTDVECGNSYKSLP-DAVKAGLITENQIDISVKRLLKARFELGEMDENV-WTGISSDV 357

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           + +P+H +LA + AR+ + LL+N+N  LPL+      +AL+GP+AN +    GNY G P 
Sbjct: 358 VDSPKHRQLALQMARETMTLLQNNNNILPLSKQ--AKIALIGPNANDSVMQWGNYNGLPS 415

Query: 422 RYTSPMDGFYAYSKVIN--YAPGC 443
              + ++G   Y    N  Y P C
Sbjct: 416 HTITLLEGMQRYLPTSNLIYEPVC 439



 Score =  118 bits (295), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 84/300 (28%), Positives = 141/300 (47%), Gaps = 53/300 (17%)

Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
           +   ++  K+ D  +   G+  ++E E          G DR ++ LP  Q  ++  +  A
Sbjct: 591 VKGLLERIKDVDVVIFAGGISPALEGEEMPVDAAGFRGGDRTEIELPAVQRRVVEALKTA 650

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K    +V ++     I     +   ++IL   YPG+ GG+A+A+V+FG YNP G+LP+T
Sbjct: 651 GK---RIVFVNFSGAAIALEPESLNCEAILQAWYPGQAGGQAVAEVLFGDYNPAGKLPLT 707

Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
           +Y  N  +IP          N  GRTY++     ++PFG+GLSYT FKY          +
Sbjct: 708 FYR-NLAQIPDFE-----DYNMTGRTYRYMKETPLFPFGHGLSYTTFKYG--------KL 753

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
           K++ D+         G N                      I V N G  DG EVV VY K
Sbjct: 754 KMNDDK------IAAGQN------------------LNLVIPVTNTGSRDGDEVVQVYLK 789

Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
                   +K +  ++RV I AG++ +V F+++  + L+  D  +N++ +  G +T+++G
Sbjct: 790 KMDDTEGPVKTLRAFKRVRIPAGKTVEVKFSLDDTQ-LEWWDEQSNTMRVCPGNYTVMIG 848


>gi|404406439|ref|ZP_10998023.1| glycoside hydrolase 3 [Alistipes sp. JC136]
          Length = 925

 Score =  262 bits (669), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 195/688 (28%), Positives = 332/688 (48%), Gaps = 94/688 (13%)

Query: 88  ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
           AT+FP+ +    ++N  L +K G+ V  EAR +      G T  ++P ++V RD RWGR 
Sbjct: 180 ATNFPSQLGMGHTWNRELLRKTGRIVGREARLL------GYTNIYAPVLDVGRDQRWGRY 233

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
            E  GE PY+V    +    G+Q     +Y         ++++  KH+AAY  +      
Sbjct: 234 EEVFGESPYLVAELGVAMASGMQ----TDY---------QVASTAKHFAAYSNNKGAREG 280

Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
               D ++  ++++   ++PF   +    +  VM SYN  +G+P       L + +RG+ 
Sbjct: 281 MSRVDPQMPPREVENIHLMPFREVIRRAGILGVMSSYNDYDGVPIQGSRYWLTERLRGEM 340

Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQ--- 323
            F GY+VSD  S++ +   H    + + DAV + ++AGL++ C  ++    +  ++Q   
Sbjct: 341 GFRGYVVSDSGSVEYLHNKHHTAVN-QLDAVRQSIEAGLNVRCNFWHPETYVMPLRQLLR 399

Query: 324 -GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK--NNICNPQHIELAAEAARQGIV 380
            G I E  +D+ +R +  V   +G FD  P   +L      +  P+H E+A +A+R+ IV
Sbjct: 400 EGLITEELLDSRVRDVLRVKFLVGLFD-RPYQTDLAAADREVDGPEHNEVALQASRESIV 458

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY----AYSKV 436
           LLKN+N  LPL+   I+ +A++GP+A+A    +G+Y       TS +DG      A  ++
Sbjct: 459 LLKNENSTLPLDARKIRRIAVLGPNADARGFALGHYGPLAVEVTSVLDGLKRNLGARCEI 518

Query: 437 INYAPGC--------------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
           + Y  GC               ++  +  + I  A +AA  +D  V+V G       E  
Sbjct: 519 V-YEKGCELVDAAWPLSEIFREEMTPEEKAGIRRAAEAASESDVAVVVLGGGSRTCGENC 577

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
            R  L LPG Q EL+  V +A   P  LV+++     IN+A  +  + +I+   YPG  G
Sbjct: 578 SRSSLDLPGRQEELLRAV-EATGKPTVLVMINGRPNSINWA--DAHVDAIVEAWYPGAHG 634

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF-------PGRTYKFFDG 595
           G+A+ +V+FG+YNPGG+L +T +  +  +IP+ + P +P  N        PG      +G
Sbjct: 635 GQAVYEVLFGEYNPGGKLTVT-FPRHVGQIPF-NFPYKPAANTDGGLTPGPGGNQTRING 692

Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
             +Y FGYGLSYT F+Y         D++++  Q  R                       
Sbjct: 693 -ALYDFGYGLSYTTFEY--------ADLRIEP-QTIR----------------------- 719

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
           +D  F    +V N G+ DG EVV +Y         T+ K + G++RV + AG++ +V   
Sbjct: 720 QDEPFRVSFDVTNTGQRDGDEVVQLYIHDVLSSVTTYEKNLRGFDRVHLKAGETRRVTMQ 779

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
           +   + L +++     ++  G   +L+G
Sbjct: 780 VRP-QDLSLLNERMERVVEPGDFDVLIG 806


>gi|305663349|ref|YP_003859637.1| glycoside hydrolase family protein [Ignisphaera aggregans DSM
           17230]
 gi|304377918|gb|ADM27757.1| glycoside hydrolase family 3 domain protein [Ignisphaera aggregans
           DSM 17230]
          Length = 757

 Score =  261 bits (668), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 224/788 (28%), Positives = 362/788 (45%), Gaps = 139/788 (17%)

Query: 23  ERAKDLVERMTLPEKVQQMGD--------------------LAYGV-------------- 48
           ER ++L+ RM++ EK+ Q+                      L YGV              
Sbjct: 5   ERVRELIGRMSIEEKIAQLISIPLESVLDGKKFSVEKAREVLKYGVGEILRIGGSSARLS 64

Query: 49  PRLGLPLYEWWSEALHGVSFIGRRTNS--PPGTHFDS----EVPGATSFPTVILTTASFN 102
           PR  + +Y           F+ R T    P   H +S      P AT FP  +   ++++
Sbjct: 65  PREAVEIYNAIQR------FLTRETRLGIPAIVHEESIAGLLAPTATVFPIPLALASTWD 118

Query: 103 ESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAI 162
             L  ++   +  +  A+           +P +++ R+PRWGR  ET GED Y+     I
Sbjct: 119 PDLVYRVAVAIRRQIMAI-----GSRHTLAPVLDLCREPRWGRCEETYGEDSYLAASMGI 173

Query: 163 NYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQET 222
            YV+G+Q           D     + A  KH+  + +   EG  R      V  +++ E 
Sbjct: 174 AYVKGIQ----------GDDIRYGVIATGKHFVGHGVP--EGG-RNIASIHVGLRELLEI 220

Query: 223 FILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI 282
           ++ PFE  V E ++ S+M +Y+ ++ +P  A+  LL   +RG W F G  VSD + ++ +
Sbjct: 221 YMYPFEATVKEANLLSIMPAYHDIDNVPCHANKWLLTDILRGSWGFKGIAVSDYEGVKQL 280

Query: 283 VESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYI 340
              H+   D  E AV + +KAG+D++   G+ +    + AV++G I E  I+ ++  +  
Sbjct: 281 HTIHRVARDCMEAAV-KAIKAGVDIEYPSGECFKQL-VEAVRKGLIDEDTINRAVERVLK 338

Query: 341 VLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLA 400
           +   LG F+     +      + N    ELA E AR+ IVLLKND G LPL   +IKT+A
Sbjct: 339 LKFMLGLFENPFIDETKVPTTLDNEADRELAREVARKAIVLLKND-GILPLKR-DIKTIA 396

Query: 401 LVGPHANATKAMIGNY---------EGT------PCRYTSPMDGFYAY---SKVINYAPG 442
           ++GP+AN   AM+G+Y         +GT        R  + ++   +    S  + YA G
Sbjct: 397 VIGPNANDPWAMLGDYHYDAHIGSFDGTYGKISPSVRIVTVLEAIKSRVSPSTEVLYAKG 456

Query: 443 CADIVCQNNSMIPAAIDAAKNADATVIVAG-------LDLSVEAEGKDRVDLLLPGFQTE 495
           C D +  + S    AI+ AK AD  + V G       L +    EG DR  L LPG Q E
Sbjct: 457 C-DTIGDDRSGFGEAIEIAKRADIIIAVMGDRSGLFNLKMFTSGEGVDRASLKLPGVQEE 515

Query: 496 LINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYN 555
           L+ ++A   K P+ LV+++     +  +   P + +I+    PGEEGG AIAD++FG Y+
Sbjct: 516 LLKELASLGK-PIILVLINGRP--LALSSILPYVNAIVEAWRPGEEGGNAIADILFGDYS 572

Query: 556 PGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKV 614
           PGGRLP++  Y+   + I Y+  P    N F  R Y  +    ++PFGYGLSYTQF Y+ 
Sbjct: 573 PGGRLPVSLPYDVGQLPIYYSRKP----NCF--RDYVEYPAKPLFPFGYGLSYTQFAYE- 625

Query: 615 ASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDG 674
                               N  V +           +V+  D      ++V+N+G M G
Sbjct: 626 --------------------NLVVEST----------EVRDPDTVIRVSVDVKNVGSMAG 655

Query: 675 SEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA 733
            EVV +Y S+        + ++ G++R+ +  G+   V F +   + L   D   N ++ 
Sbjct: 656 DEVVQLYISRDYASVTRPVAELKGFKRITLEPGEKKTVVFEI-PLELLAYYDMDMNYVVE 714

Query: 734 SGAHTILV 741
            G +T ++
Sbjct: 715 PGEYTFMI 722


>gi|374374543|ref|ZP_09632202.1| Beta-glucosidase [Niabella soli DSM 19437]
 gi|373233985|gb|EHP53779.1| Beta-glucosidase [Niabella soli DSM 19437]
          Length = 799

 Score =  261 bits (668), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 231/820 (28%), Positives = 366/820 (44%), Gaps = 135/820 (16%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WS------ 60
           Y D   P   R KDL+ +MT+ EK  Q   L YG  R+    LP   W    W       
Sbjct: 40  YEDPVAPVANRVKDLLSQMTVEEKTCQTATL-YGFGRVLKDELPTPGWKQEIWKDGIANI 98

Query: 61  -EALHGVS--------------------------FIGRRTNSPPGTHFDSEVPG-----A 88
            E L+G++                          FI       P    +  + G     A
Sbjct: 99  DEELNGLARNKKAQTKYSYPFSNHAEAINKIQKWFIEETRLGIPVDFTNEGIHGLNQDHA 158

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVL 147
           T+FP  I   +++N+ L  ++GQ +  EA+A+      G T  ++P ++V RD RWGRV+
Sbjct: 159 TAFPAPIGIGSTWNKELVHQMGQIIGREAKAL------GYTNVYAPILDVARDQRWGRVV 212

Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
           ET GEDP++V         G+Q+  GV             ++  KH+A Y +     +  
Sbjct: 213 ETYGEDPFLVAGLGTALAGGIQE-NGV-------------ASTLKHFAVYSVPKGGRDGN 258

Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
              D  V  ++MQ+ F+ PF   +       VM SYN  +G+P  A    L Q +R  + 
Sbjct: 259 ARTDPHVAPREMQQLFLYPFRKVIQNVHPLGVMSSYNDWDGMPVTASNYFLTQLLRQQFG 318

Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTM---GAVQQ 323
           F GY+VSD  +++ + E H    D KE AV  V++AGL++    +  +NF +     +++
Sbjct: 319 FDGYVVSDSRAVEFVYEKHHVAKDYKE-AVKMVMEAGLNVRTEFNAPSNFILPLRQLIKE 377

Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIE-LAAEAARQGIVLL 382
           G ++   ++  +  +  V  RLG FD          + I   +  E +A +  R+ +VLL
Sbjct: 378 GGLSMETLNQRVGEVLSVKFRLGLFDAPYVKDPKAADKIVATEASEAVALQMNRESLVLL 437

Query: 383 KNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG---FYAYSKVINY 439
           KND   LPL+ G  + + + GP A+  +  I  Y  +  +  S ++G   F A    INY
Sbjct: 438 KNDKNILPLSLGQYRNILVTGPLADEKEHAISRYGPSNKKVISVLEGIRHFAAKKATINY 497

Query: 440 APGC--ADIVCQNNSMIPA------------AIDAAKNADATVIVAGLDLSVEAEGKDRV 485
             GC  AD     + +I              A++AAK  D  + V G +     E   R 
Sbjct: 498 IKGCEAADATWPESEIIDTPPTPQEIAEMNKAVEAAKQNDIIIAVMGENDKQVGESLSRT 557

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            L LPG Q  L+ ++    K P+ L++++   + IN+   N  + +IL   +PG  GG A
Sbjct: 558 GLNLPGRQLRLLEELKKTGK-PMVLILINGQPLTINW--ENRYLDAILETWFPGPAGGTA 614

Query: 546 IADVIFGKYNPGGRLPITW------YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
           +A+ IFG YNPGG+L  T+       E N+   P  S   +P +   G       GP +Y
Sbjct: 615 VAEAIFGAYNPGGKLTTTFPKTTGQIEMNFPFKP-ASHAGQPGDGPNGYGKTAVVGP-LY 672

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           PFGYGLSYT F+Y         ++K+D ++     + +V                     
Sbjct: 673 PFGYGLSYTTFEY--------ANLKVDPEKARTQADISVA-------------------- 704

Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNAC 718
               ++V+N GK+ G EVV +Y K    + T  + ++ G+ERV ++ G++  V F +   
Sbjct: 705 ----VDVKNTGKVKGDEVVQLYVKQLVSSVTTYESILRGFERVSLSPGETKTVHFKLTP- 759

Query: 719 KSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLNH 758
             L I+D   N ++  GA  I+VG     +    Q+ L  
Sbjct: 760 DDLSILDKNMNFVVEPGAFDIMVGSSSVDIRLKKQIILEQ 799


>gi|427384392|ref|ZP_18880897.1| hypothetical protein HMPREF9447_01930 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727653|gb|EKU90512.1| hypothetical protein HMPREF9447_01930 [Bacteroides oleiciplenus YIT
           12058]
          Length = 954

 Score =  261 bits (668), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 226/756 (29%), Positives = 354/756 (46%), Gaps = 119/756 (15%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSEALHGVSFI 69
           Y D  LP  ER + L+  MT PE   ++    +G+P  G+P LY       EA+HG S+ 
Sbjct: 170 YMDPTLPVEERVESLLSVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVEAVHGFSY- 225

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
                   G+       GAT FP  +   A++N+ L ++I   V  E      L    + 
Sbjct: 226 --------GS-------GATIFPQALAMGATWNKKLTEEIAMAVGDE-----TLAAGTMQ 265

Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
            WSP ++V +D RWGR  ET GEDP +V +    +++G Q            S+ L  + 
Sbjct: 266 AWSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SKGLFTTP 313

Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
             KH+  +         R   D  ++E++M+E  ++PF   +   D  S+M +Y+   G+
Sbjct: 314 --KHFGGH---GAPLGGRDSHDIGLSEREMREVHLVPFRHVIRNYDCQSLMMAYSDFLGV 368

Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
           P     +LL+  +R +W F G+IVSDC +I  +     +    K +A  + L AG+  +C
Sbjct: 369 PVAKSKELLHNILREEWGFDGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNC 428

Query: 310 GDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC----N 364
           GD Y +   + A + G++   ++D   R +  ++ R   F+ +P  K L  N I     +
Sbjct: 429 GDTYNDKEVIQAAKDGRLNMENLDNVCRTMLRMMFRNELFEKAPN-KPLDWNKIYPGWNS 487

Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY--EGTPCR 422
             H E+A +AAR+ IV+L+N    LPL+ G I+++A++GP A+  +   G+Y  +  P +
Sbjct: 488 DNHKEMARQAARESIVMLENKENILPLDKG-IRSIAVLGPGADDLQP--GDYTPKLLPGQ 544

Query: 423 YTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
             S + G        +KVI Y  GC D    + + IP A+ AA  +D  V+V G   + E
Sbjct: 545 LKSVLTGIKQAVGKQTKVI-YEQGC-DFTNLSETNIPKAVKAASQSDVVVMVLGDCSTSE 602

Query: 479 A---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
           A         E  D   L+LPG Q EL+  V    K PV LV+ +      N  K +   
Sbjct: 603 ATTDVYKTSGENHDYATLILPGKQQELLEAVCATGK-PVILVLQAGRP--YNLTKASKLC 659

Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRT 589
           K+I+    PG+EGG A ADV+FG YNP GRLP+T+ +          +PL       GR 
Sbjct: 660 KAIIVNWLPGQEGGPATADVLFGDYNPAGRLPMTFPQH------VGQLPLYYNFKTSGRR 713

Query: 590 YKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
           Y++ D     +Y FGYGLSYT F+Y              K Q+  + N TV         
Sbjct: 714 YEYSDLEYYPLYYFGYGLSYTSFEYSGL-----------KVQEKDNGNITV--------- 753

Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAG 706
                          Q  V+N+G+  G EVV +Y +       T I ++  + R+ +  G
Sbjct: 754 ---------------QATVKNVGQRAGDEVVQLYVTDMYASVKTRITELKDFTRINLKPG 798

Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +S  V F +     L ++++  + ++  G   ILVG
Sbjct: 799 ESKTVSFELTPY-DLSLLNDHMDRVVEKGEFKILVG 833


>gi|153809292|ref|ZP_01961960.1| hypothetical protein BACCAC_03604 [Bacteroides caccae ATCC 43185]
 gi|149128062|gb|EDM19283.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           caccae ATCC 43185]
          Length = 946

 Score =  261 bits (667), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 231/807 (28%), Positives = 361/807 (44%), Gaps = 138/807 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
           Y D   P   R +DL+ +MTL EK  QM  L YG  R+    LP  EW ++    G+  I
Sbjct: 53  YEDPTAPIDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTSEWKNQLWKDGIGAI 111

Query: 70  GRRTNS------PPG-------------------------------THFDSE-VPG---- 87
               N       PP                                T F +E + G    
Sbjct: 112 DEHLNGFQQWGLPPSDNEYVWPASKHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 171

Query: 88  -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
            AT+FPT +    ++N  L  ++G     EAR +      G T  ++P ++V RD RWGR
Sbjct: 172 KATNFPTQLGLGHTWNRQLIHQVGLITGREARML------GYTNVYAPILDVGRDQRWGR 225

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
             E  GE PY+V    I  VRG+Q                +++A  KH+ AY  +     
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGMQHNH-------------QVAATGKHFIAYSNNKGARE 272

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
                D +++ ++++     PF+  + E  +  VM SYN  +G P  +    L   +RG+
Sbjct: 273 GMARVDPQMSPREVEMLHAYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGE 332

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
             F GY+VSD D+++ +   H    D KE AV + ++AGL++ C     D Y       V
Sbjct: 333 MGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 391

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAA-EAARQGIV 380
           ++G ++E  I+  +R +  V   +G FD   Q    G +     +  E  A +A+R+ IV
Sbjct: 392 KEGGLSEEVINDRVRDILRVKFLVGLFDTPYQTDLKGADEEVEKKENEEVALQASRESIV 451

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK---VI 437
           LLKN+   LPL+   I+ +A+ GP+A+     + +Y       TS + G     K    +
Sbjct: 452 LLKNEKNVLPLDPSKIRKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKDKADV 511

Query: 438 NYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
            Y  GC D+V  N                  I  A+  AK AD  ++V G       E K
Sbjct: 512 LYTKGC-DLVDANWPESELIDYPLTDEEQKEIDKAVSQAKQADVAIVVLGGGQRTCGENK 570

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
            R  L LPG Q +L+  V    K PV LV+++   + IN+A  +  + +IL   YPG +G
Sbjct: 571 SRSSLDLPGRQLDLLKAVVATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKG 627

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV----- 597
           G A+AD++FG YNPGG+L +T +     +IP+ + P +P +   G      DG +     
Sbjct: 628 GIAVADILFGDYNPGGKLTVT-FPKTVGQIPF-NFPCKPSSQIDGGKNPGPDGNMSRANG 685

Query: 598 -VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
            +YPFGYGLSYT F+Y                    D+  +     P   A     V CK
Sbjct: 686 ALYPFGYGLSYTTFEYS-------------------DLKISPAIITPNQKAY----VTCK 722

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
                    V N GK  G EV+ +Y +       T+ K + G+ERV +  G++ ++ F +
Sbjct: 723 ---------VTNTGKRSGDEVIQLYVRDVLSSVTTYEKNLAGFERVHLKPGETKEITFPI 773

Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
           +  K+L++++   + ++  G  T+++G
Sbjct: 774 DR-KALELLNADMHWVVEPGDFTLMLG 799


>gi|298387489|ref|ZP_06997041.1| periplasmic beta-glucosidase [Bacteroides sp. 1_1_14]
 gi|298259696|gb|EFI02568.1| periplasmic beta-glucosidase [Bacteroides sp. 1_1_14]
          Length = 950

 Score =  261 bits (667), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 226/762 (29%), Positives = 357/762 (46%), Gaps = 119/762 (15%)

Query: 8   KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSEAL 63
           K++D  Y DA LP  ER + L+  MT PE   ++    +G+P  G+P LY       EA+
Sbjct: 160 KVTDRRYMDASLPVEERVESLLAVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVEAV 216

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HG S+         G+       GAT FP  +   A++N  L +++   +  E  A  N 
Sbjct: 217 HGFSY---------GS-------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NT 259

Query: 124 GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
             A    WSP ++V +D RWGR  ET GEDP +V +    +++G Q            SR
Sbjct: 260 KQA----WSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SR 303

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            L  +   KH+  +         R   D  ++E++M+E  ++PF   +   D  S+M +Y
Sbjct: 304 GLFTTP--KHFGGHGAPL---GGRDSHDIGLSEREMREIHLVPFRHAIRNYDCQSLMMAY 358

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           +   G+P     +LL Q +R +W F+G+IVSDC +I  +     +    K +A  + L A
Sbjct: 359 SDYMGVPVAKSKELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAA 418

Query: 304 GLDLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNI 362
           G+  +CGD Y N   + A + G+I   D+D   R +   + R   F+ +P  K L    I
Sbjct: 419 GIATNCGDTYNNKEVIQAAKDGRINMEDLDNVCRTMLGTMFRNELFEKNP-CKPLDWKKI 477

Query: 363 C----NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-- 416
                +  H E+A +AAR+ IV+L+N    LPL+   + T+A++GP A+  +   G+Y  
Sbjct: 478 YPGWNSDSHKEMARQAARESIVMLENKENLLPLSK-TLCTIAVLGPGADDLQP--GDYTP 534

Query: 417 EGTPCRYTSPMDGFYA----YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG 472
           +  P +  S + G        +KV+ Y  GC D    + + IP A+ AA  +D  ++V G
Sbjct: 535 KLLPGQLKSVLTGIKGAVGKQTKVL-YEQGC-DFTNPDETNIPKAVKAASQSDVVIMVLG 592

Query: 473 LDLSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFA 523
              + EA         E  D   L+LPG Q EL+  V    K PV L++ +    DI   
Sbjct: 593 DCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI--L 649

Query: 524 KNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVN 583
           K +   K+IL    PG+EGG A+ADV+FG YNP GRLP+T+            +PL    
Sbjct: 650 KASEMCKAILVNWLPGQEGGPAMADVLFGDYNPAGRLPMTFPRH------VGQLPLYYNF 703

Query: 584 NFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTN 641
              GR Y++ D     +Y FG+GLSYT F+Y         ++K+   Q+  + N  V   
Sbjct: 704 KTSGRRYEYVDMEYYPLYRFGFGLSYTSFEYS--------NLKI---QEKANGNVEV--- 749

Query: 642 KPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYER 700
                                Q  V+N+G   G EV  +Y +       T + ++  + R
Sbjct: 750 ---------------------QATVKNVGSRAGDEVAQLYVTDMYASVKTRVMELKDFAR 788

Query: 701 VFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           + +  G+S  V F M     + ++++  + ++  G   I+VG
Sbjct: 789 IHLQPGESKTVSFEMTPY-DISLLNDRMDRVVEKGEFKIMVG 829


>gi|329956868|ref|ZP_08297436.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
 gi|328523625|gb|EGF50717.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
          Length = 864

 Score =  261 bits (667), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 169/453 (37%), Positives = 240/453 (52%), Gaps = 47/453 (10%)

Query: 9   LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
           L+   Y D      ERA+DLV+++TL EKV  M D +  V RLG+  Y WW+EALHGV+ 
Sbjct: 19  LAQSIYKDNSYSPAERAEDLVKQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVAR 78

Query: 69  IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-- 126
            G                 AT FP  I   ASF+          VS EARA     +A  
Sbjct: 79  SG----------------WATVFPQPIGMAASFSPEALHTAFVAVSDEARAKNAAYSAEG 122

Query: 127 ------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
                 GLT W+P +N+ RDPRWGR +ET GEDPY+     ++ V+GLQ +       D 
Sbjct: 123 SYKRYQGLTIWTPTVNIYRDPRWGRGIETYGEDPYLASVMGVSVVKGLQCL-------DE 175

Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSV 239
           + +  K+ AC KH+A +    W   +R  F++  ++ +D+ ET++ PFE  V EG V  V
Sbjct: 176 NEKYDKVHACAKHFAVHSGPEW---NRHSFNAENISPRDLYETYLPPFEALVKEGKVKEV 232

Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAV 297
           MC+YNR  G P C   +LLN  +R +W + G +V+DC +I      + HK   D    + 
Sbjct: 233 MCAYNRFEGEPCCGSNRLLNHILRREWGYDGIVVADCSAISDFHNDKGHKTHADAASASS 292

Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YK 355
           A VL +G DL+CG  Y + T G V++G I EADID S++ L      LG  D   Q  + 
Sbjct: 293 AAVL-SGTDLECGSNYRSLTEG-VKKGFIDEADIDRSVKRLLQARFELGEMDEPDQVRWA 350

Query: 356 NLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGN 415
            +  + +C+ +H  L+ + AR+ + LL N N ALPL  G   T+A++GP+AN +    GN
Sbjct: 351 QIPYSVVCSDKHDSLSLDMARKSMTLLLNKNNALPLERGGT-TIAVMGPNANDSVMQWGN 409

Query: 416 YEGTPCRYTSPMDGFYAY----SKVINYAPGCA 444
           Y G P R  + +DG  +      K+I Y  GC+
Sbjct: 410 YNGLPKRTITILDGIRSAMGKDDKLI-YEQGCS 441



 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 90/297 (30%), Positives = 134/297 (45%), Gaps = 53/297 (17%)

Query: 445 DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQT 494
           D+  +  + I  ++   K+AD  +   G+   +E E          G DR D+ LP  Q 
Sbjct: 583 DLGFKEEADIQRSVAKVKDADVVIFAGGISPQLEGEEMGVKLPGFRGGDRTDIELPAVQR 642

Query: 495 ELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKY 554
           E+I  + DA K    ++ ++     I         ++IL   YPG+ GG+A+A+V+FG Y
Sbjct: 643 EMIKALHDAGK---KVIFVNCSGSPIAMEPETEYCQAILQAWYPGQSGGKAVAEVLFGDY 699

Query: 555 NPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKV 614
           NP GRLP T+Y           +P     N  G TY+FF+G  ++PFGYGLSYT FKY  
Sbjct: 700 NPAGRLPATFYRN------LAQLPDFEDYNMAGHTYRFFNGEPLFPFGYGLSYTTFKYG- 752

Query: 615 ASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDG 674
                   I+L    Q                          D      + V N G  +G
Sbjct: 753 -------KIQLKSSAQT-------------------------DETVKITVPVTNTGSRNG 780

Query: 675 SEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL 731
            EVV VY K  G     +K +  ++RV+I AG++ KV   +   K L+  D+A N++
Sbjct: 781 EEVVQVYLKKQGETDGPVKTLRAFKRVYIPAGKTVKVELELTP-KQLEWWDSATNTM 836


>gi|393786770|ref|ZP_10374902.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
           CL02T12C05]
 gi|392658005|gb|EIY51635.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
           CL02T12C05]
          Length = 864

 Score =  261 bits (667), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 159/449 (35%), Positives = 233/449 (51%), Gaps = 47/449 (10%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
           S  PY D  L   +RA DL++R+T+ EKV  M + + G+ RLG+  YEWW+EALHGV+  
Sbjct: 26  SQLPYQDPNLTPEQRATDLLQRLTIEEKVSLMQNNSPGILRLGIKPYEWWNEALHGVARA 85

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--- 126
           G                 AT FP  I   ASF+++L  ++   +S EARA     N    
Sbjct: 86  GL----------------ATVFPQTIGMAASFDDTLIYEVFNAISDEARAKNRHFNTLGQ 129

Query: 127 -----GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
                GLT W+PNIN+ RDPRWGR  ET GEDPY+  R  +  V+GLQ  +   Y+    
Sbjct: 130 YKRYQGLTMWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPDSARYN---- 185

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVM 240
               K+ AC KH+A +    W   +R  F++  +  +D+ ET++  F+  V E DV  VM
Sbjct: 186 ----KLHACAKHFAVHSGPEW---NRHSFNAENIIPRDLWETYLPAFKTLVQEADVKEVM 238

Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV--- 297
           C+YNR  G P C   +LL Q +R +W F G +VSDC +I     + K  ++T  DA    
Sbjct: 239 CAYNRFEGDPCCGSNRLLTQILRNEWGFKGIVVSDCGAISDFWGTKK--HNTHPDAAHAS 296

Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL 357
           A  +  G DL+CG  Y   T  A++ G I+E  I+ S++ L      LG  +    +  L
Sbjct: 297 AEAVLNGTDLECGSNYRKLTE-AIKAGIISEKQINVSVKRLLKARFELGEMENIHPW-TL 354

Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
             + + +P+H  LA + A + + LL+N    LPL+      +A++GP+AN +    GNY 
Sbjct: 355 PYSIVDSPKHRCLALKMAHETMTLLQNKGKVLPLDKQ--ARIAIIGPNANDSVMQWGNYN 412

Query: 418 GTPCRYTSPMDGFYAYSKV--INYAPGCA 444
           GTP   ++ +  F     +  + Y P C 
Sbjct: 413 GTPSHTSTLLSAFRKRLPISHLIYEPVCG 441



 Score =  110 bits (276), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 88/303 (29%), Positives = 135/303 (44%), Gaps = 59/303 (19%)

Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
           I   ++  K+ D  +   G+  S+E E          G DR D+  P  Q +++  + +A
Sbjct: 591 ISNTLEKLKDIDIIIFAGGISPSLEGEEMNVSATGFKGGDRTDIEFPAVQRKVLAALKEA 650

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKS---ILWVGYPGEEGGRAIADVIFGKYNPGGRL 560
            K  V LV  S  A+ +      P+ KS   IL   YPGEEGG AI +V+FG YNP GRL
Sbjct: 651 GK-KVILVNFSGSAMALT-----PETKSCDAILQAWYPGEEGGMAIVNVLFGDYNPAGRL 704

Query: 561 PITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS 620
           PIT+Y++         +P     +  GRTY++     ++PFGYGLSYT F +        
Sbjct: 705 PITFYKS------IDQLPDFENYSMKGRTYRYMQEEPLFPFGYGLSYTTFAFG------- 751

Query: 621 VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
                            +  NK   +A           K T  I ++N+G  DG EVV +
Sbjct: 752 ----------------KIHINKNSLSA---------GEKVTLHIPIKNIGDRDGVEVVQI 786

Query: 681 YSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTI 739
           Y +        +K +  ++RV I  G++ +V   +    + +  D   N++    G + I
Sbjct: 787 YIQRQADKEGPVKTLRAFKRVEIPKGKTQEVKIELPYV-AFEWFDPTTNTMRPIQGEYNI 845

Query: 740 LVG 742
           L G
Sbjct: 846 LYG 848


>gi|116621797|ref|YP_823953.1| glycoside hydrolase family protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116224959|gb|ABJ83668.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
           usitatus Ellin6076]
          Length = 765

 Score =  261 bits (666), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 219/703 (31%), Positives = 338/703 (48%), Gaps = 118/703 (16%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+P+  +  E LHG + IG                  TSFP  I   A+F+  L + +
Sbjct: 104 RLGIPVI-FHEECLHGHAAIG-----------------GTSFPQPIGLGATFDPELVESL 145

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
               + EARA     +  LT   P ++V R+PRWGRV ET GEDP++V R  I  VRG Q
Sbjct: 146 FAMTAAEARARGT--HQALT---PVVDVAREPRWGRVEETYGEDPFLVSRMGIAAVRGFQ 200

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
              G    RD      ++ A  KH+AA+       N        V+ + ++ETF+ PF+ 
Sbjct: 201 ---GDATFRDKT----RVIATLKHFAAHGQPESGTN---CAPVNVSMRVLRETFLFPFKE 250

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV---ESH 286
            +++G   SVM SYN ++G+P+ A   LL   +R +W F G++VSD  +I  +    ESH
Sbjct: 251 ALDKGCAISVMASYNEIDGVPSHASRWLLRDVLRKEWGFKGFVVSDYYAIYELSYRPESH 310

Query: 287 -KFLNDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLM 343
             F+   K +A A  ++AG++++    D Y +  +  V +G + E+ +D  +  +     
Sbjct: 311 GHFVAKDKREACALAVQAGVNIELPEPDCYLHL-VDLVHKGVLQESQLDELVEPMLRWKF 369

Query: 344 RLGYFDGSPQYKNLGKNNI--CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLAL 401
           ++G FD  P         I  C+  H ELA +AAR+ I LLKND   +PL+   IKT+A+
Sbjct: 370 QMGLFD-DPYVDPAEAERIAGCD-AHRELAMQAARETITLLKNDGPVVPLDLSAIKTIAV 427

Query: 402 VGPHANATKAMIGNYEGTPCRYTSPMDGFY----AYSKVINYAPGCADIV---------- 447
           +GP+AN  ++++G Y G P    + +DG      + +KV+ YA GC   +          
Sbjct: 428 IGPNAN--RSLLGGYSGVPKHDVTVLDGIRERVGSRAKVV-YAEGCKITIGGSWVQDEVT 484

Query: 448 ----CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG------KDRVDLLLPGFQTELI 497
                ++   I  A+  AK AD  V+  G +     E        DR  L L G Q EL+
Sbjct: 485 PSDPAEDRRQIAEAVKVAKRADVIVLAIGGNEQTSREAWSPKHLGDRPSLDLVGRQEELV 544

Query: 498 NKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG 557
             +    K PV   + +   + IN+   +  + +I    Y G+E GRA+A+V+FG  NPG
Sbjct: 545 RAMVATGK-PVIAFLFNGRPISINYLAQS--VPAIFECWYLGQETGRAVAEVLFGDTNPG 601

Query: 558 GRLPITWYEANYVKIPYTSMPLRPV-NNFPG--RTYKFFDGPVVYPFGYGLSYTQFKYKV 614
           G+LPIT        IP ++  L    N+ P   R Y F +   +Y FGYGLSYT F ++ 
Sbjct: 602 GKLPIT--------IPRSAGHLPAFYNHKPSARRGYLFDEVGPLYAFGYGLSYTTFAFQ- 652

Query: 615 ASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDG 674
                  +++L K +  R+            A VL+D              V N G  +G
Sbjct: 653 -------NLRLAKKKMHRE----------STARVLVD--------------VTNTGAREG 681

Query: 675 SEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMN 716
            EVV +Y +    + T  IK++ G+ ++ +  GQ+  V F + 
Sbjct: 682 REVVQLYIRDLVSSVTRPIKELKGFRKITLQPGQTQTVEFEIT 724


>gi|319901412|ref|YP_004161140.1| glycoside hydrolase 3 [Bacteroides helcogenes P 36-108]
 gi|319416443|gb|ADV43554.1| glycoside hydrolase family 3 domain protein [Bacteroides helcogenes
           P 36-108]
          Length = 944

 Score =  261 bits (666), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 226/805 (28%), Positives = 360/805 (44%), Gaps = 138/805 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
           Y D       R +DL+++M+L EK  QM  L YG  R+    LP  EW    W + +  +
Sbjct: 53  YEDPTAAIDARIEDLLKQMSLEEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKDGIGAI 111

Query: 67  S--FIGRRTNSPPGTHFDSEVPG------------------------------------- 87
                G R    P +  ++  P                                      
Sbjct: 112 DEHLNGFRQWGLPPSDNENVWPASRHAWALNEVQRFFVEETRLGIPVDFTNEGIRGVESY 171

Query: 88  -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
            AT+FPT +    ++N  L  KIG     EAR +      G T  ++P ++V RD RWGR
Sbjct: 172 KATNFPTQLGLGHTWNRELIHKIGFITGREARML------GYTNVYAPILDVGRDQRWGR 225

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
             E  GE PY+V    I  VRG+Q      Y+        +++A  KH+AAY  +     
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGMQ------YNH-------QVAATGKHFAAYSNNKGARE 272

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
                D +++ ++++   I PF   + E  +  VM SYN  +GIP       L   +RG+
Sbjct: 273 GMSRVDPQISPREVENIHIYPFRRVIREAGLLGVMSSYNDYDGIPIQGSHYWLTTRLRGE 332

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
             F GY+VSD D+++ +   H    D KE A+ + ++AGL++ C     D +       V
Sbjct: 333 IGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AIRQSVEAGLNIRCTFRSPDSFVLPLRELV 391

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN-NICNPQHIELAAEAARQGIV 380
           ++G ++E  I+  +R +  V    G FD   Q    G +  +   ++  +A +A+R+ IV
Sbjct: 392 KEGGLSEEIINDRVRDILRVKFLTGLFDTPYQSDLAGADREVEKEENGSIALQASRESIV 451

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF---YAYSKVI 437
           LLKN+N  LPL+   +K +A+ GP+A+     + +Y        + + G     +    +
Sbjct: 452 LLKNENNMLPLDLSTVKRIAVCGPNADEKNYALTHYGPLAVEVITVLKGIQDKVSGKAEV 511

Query: 438 NYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
            Y  GC D+V  N                + I  A + A+ +D  V+V G       E K
Sbjct: 512 LYTKGC-DLVDANWPESEIINHPLTADEQAEINKAAENARQSDVAVVVLGGGQRTCGENK 570

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
            R  L LPG Q +L+  +    K PV LV+++   + +N+A  +  + +IL   YPG +G
Sbjct: 571 SRSSLDLPGRQLQLLQAIQATGK-PVILVLINGRPLSVNWA--DKYVPAILEAWYPGAKG 627

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV----- 597
           G A+ADV+FG YNPGG+L +T +     +IP+ + P +P +   G      +G +     
Sbjct: 628 GIALADVLFGDYNPGGKLTVT-FPKTVGQIPF-NFPYKPASQIDGGKNPGPEGNMSRING 685

Query: 598 -VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
            +YPFGYGLSYT F+Y                    D+  T     P   A         
Sbjct: 686 ALYPFGYGLSYTTFEYS-------------------DLEITPKVITPNEEA--------- 717

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
               T +++V N GK  G EVV +Y +       T+ K + G+ERV +  G++ +V FT+
Sbjct: 718 ----TVRLKVTNTGKRAGDEVVQLYIRDVVSSVITYEKNLAGFERVHLEPGETKEVVFTL 773

Query: 716 NACKSLKIVDNAANSLLASGAHTIL 740
              K L+++D     ++  G  TI+
Sbjct: 774 -GRKHLELLDANMQWVVEPGDFTIM 797


>gi|448410571|ref|ZP_21575276.1| beta-glucosidase [Halosimplex carlsbadense 2-9-1]
 gi|445671607|gb|ELZ24194.1| beta-glucosidase [Halosimplex carlsbadense 2-9-1]
          Length = 760

 Score =  261 bits (666), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 207/707 (29%), Positives = 332/707 (46%), Gaps = 104/707 (14%)

Query: 86  PGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGR 145
           P  T+FP  I   ++++  L   +  T+  +  A   +G A     SP ++V RD RWGR
Sbjct: 102 PEGTTFPQGIGMASTWDPDLMAAVTDTIGDQLEA---IGTA--HALSPVLDVARDLRWGR 156

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
           V ET GEDPY+V   A  YV GLQ           DS    ISA  KH+  + +    G 
Sbjct: 157 VEETYGEDPYLVAEMATAYVDGLQ----------GDSPADGISATLKHFVGHAV-GAGGK 205

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
           +R   D  V+ + ++E  + PFE  + EG+  SVM +Y+ ++G+P   D  LL   +RG+
Sbjct: 206 NRSSVD--VSRRTLREVHMFPFEAAIQEGNAESVMNAYHDIDGVPCAKDEWLLTDVLRGE 263

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL-----DCGDYYTNFTMGA 320
           W F G +VSD  S+  + E H      +E AV+ V +AG+D+     DC +Y       A
Sbjct: 264 WGFDGTVVSDYFSVDFLKEEHGVAATQQEAAVSAV-EAGVDVELPNTDCYEYLAE----A 318

Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
           V+ G +AE  +D S+R +       G F+      +   +   +   + LA EAAR  +V
Sbjct: 319 VRDGDLAEESLDESVRRVLRAKFEKGLFEEYTVDVDAATDPYEDEAAVGLAREAARDSLV 378

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV---- 436
           +LKN++  LPL+  +  ++A+VGP A+  K M+G+Y      Y      F A + +    
Sbjct: 379 VLKNESDLLPLDDAD--SVAVVGPKADDKKGMLGDY-AYAAHYPEEEYEFEADTPLSAIE 435

Query: 437 ------INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE------------ 478
                 +NYA GC      +   I  A++AA+NAD  +   G   +V+            
Sbjct: 436 NRVGADVNYAQGCT-ATGNSTDKIGRAVEAAENADVALAFVGARSAVDFSDADGVKAEQP 494

Query: 479 -----AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
                 EG D  DL LPG Q EL+ +V +    PV +V++S     I   + +    +++
Sbjct: 495 MVPTSGEGCDVTDLGLPGVQNELVAQV-EETDTPVVIVLVSGKPHAI--PEIDAGADAVV 551

Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA-NYVKIPYTSMPLRPVNNFPGRTYKF 592
               PGEE G AI DV+F  ++ GG LP++  ++   + + Y+  P     N     Y +
Sbjct: 552 QAWLPGEEAGNAIVDVVFEGHDSGGHLPVSMPKSVGQLPVHYSRKP-----NTYSEDYVY 606

Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
            D   VYPFG+GLSY +F+Y                          GT            
Sbjct: 607 DDAQPVYPFGHGLSYAEFEYSDLDLSDV-------------DVDPSGT------------ 641

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVFIAAGQSAK 710
                  F+  + VEN  + DGS+VV +Y  ++ P +A   +++++G+ RV + AG+S +
Sbjct: 642 -------FSASVTVENTAERDGSDVVQLYVSAENPDLA-RPVQELVGFRRVELDAGESTE 693

Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLN 757
           + F + A + L   D  AN  + +G + + VG     ++   +L++ 
Sbjct: 694 ITFDLAASQ-LAYHDRNANLAVEAGDYELRVGHSSEEIAESARLSVT 739


>gi|390167927|ref|ZP_10219905.1| beta-glucosidase, partial [Sphingobium indicum B90A]
 gi|389589522|gb|EIM67539.1| beta-glucosidase, partial [Sphingobium indicum B90A]
          Length = 771

 Score =  261 bits (666), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 226/770 (29%), Positives = 353/770 (45%), Gaps = 123/770 (15%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEK--------VQQMGDLAYGVPRLGLPLYEWWSEAL 63
           FP+   +   P  AK       +P +        V  +   A    RLG+P+  +  E L
Sbjct: 71  FPHGMGQFTRPSDAKGAFSPREIPGRNPRQTVALVNALQRWATTQTRLGIPIL-FHEEGL 129

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HG + +G                 ATSFP  I   +S++  L +++   ++ E R+    
Sbjct: 130 HGYAAVG-----------------ATSFPQSIAMASSWDPDLLREVNAVIAREIRSR--- 169

Query: 124 GNAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
              G++   SP +++ RDPRWGR+ ET GEDPY+VG   +  V GLQ        R    
Sbjct: 170 ---GVSLVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQG-----KGRSRLL 221

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
            P K+ A  KH   +       N      + V+E++++E F  PFE  V    + +VM S
Sbjct: 222 PPGKVFATLKHLTGHGQPESGTN---VGPAPVSERELRENFFPPFEQVVKRTGIEAVMAS 278

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           YN ++G+P+ A+  LL   +RG+W F G +VSD  ++  ++  H    D  E A  R L 
Sbjct: 279 YNEIDGVPSHANRWLLRDVLRGEWGFRGAVVSDYSAVDQLMNIHHVAADL-EQAAGRALD 337

Query: 303 AGLDLDCGDYYTNFTMG-AVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN 361
           AG+D D  D  +  T+G  V++GKI EA +D ++R +  +  R G F+ +P         
Sbjct: 338 AGVDADLPDGLSYATLGRQVREGKIGEALVDRAVRHMLELKFRAGLFE-NPYADAAASEK 396

Query: 362 ICNPQHIELAAEAARQ-GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
           I N       A  A Q  I+LLKND G LPL      ++A++GP  +A  A +G Y G P
Sbjct: 397 ITNDGRARALALKAAQRSIILLKND-GMLPLKPEG--SIAVIGP--SAAVARLGGYYGQP 451

Query: 421 CRYTSPMDGFYA----YSKVINYAPGC---------ADIV-----CQNNSMIPAAIDAAK 462
               S ++G  A     +K++ +A G          AD V      +N  +I  A++AA+
Sbjct: 452 PHSVSILEGIRAKVGNRAKIV-FAQGVRITENDDWWADKVTRSDPAENRRLIAQAVEAAR 510

Query: 463 NADATVIVAGLDLSVEAEG------KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAG 516
           + D  V+  G       EG       DR  L L G Q EL + +    K P+ +V+++  
Sbjct: 511 HVDRIVLTLGDTEQSSREGWADNHLGDRPSLDLMGEQQELFDALKALGK-PIAVVLINGR 569

Query: 517 AVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS 576
               +  K + +  +IL   Y GE+GG A+ADV+FG  NPGG+LP+T        IP ++
Sbjct: 570 PA--STVKVSEQADAILEGWYLGEQGGHAVADVLFGDVNPGGKLPVT--------IPRSA 619

Query: 577 MPLRPVNNF---PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRD 633
             L    N      R Y F     +YPFG+GLSYT F     S+P+    K+      R 
Sbjct: 620 GQLPMFYNVKPSARRGYLFDTTDPLYPFGFGLSYTSFDL---SAPRLSAAKIGVGGTTR- 675

Query: 634 INYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHI 692
                                         ++V N G+ +G EVV +Y +   G     I
Sbjct: 676 ----------------------------VSVDVRNSGRREGDEVVQLYVRDKVGSVTRPI 707

Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           K++ G++RV +  G+   V FT+   ++L++ ++  + ++  G   I+ G
Sbjct: 708 KELKGFQRVTLKPGEVRTVTFTV-GPEALQMWNDHMDRVVEPGDFEIMTG 756


>gi|358342292|dbj|GAA27551.2| probable beta-D-xylosidase 7 [Clonorchis sinensis]
          Length = 826

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 220/828 (26%), Positives = 362/828 (43%), Gaps = 145/828 (17%)

Query: 11  DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGD--------LAYGVPRLGLPLYEWWSEA 62
           + P+ +  LP   R  DL+ R+T  E +QQ+ +         A G+ RL +  Y+W    
Sbjct: 26  EHPFRNPSLPANFRVDDLLARLTNEELIQQVSNGGAGPQHGPAPGIARLNISAYQW---- 81

Query: 63  LHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
                    RTN  PG   D  +   T FP  +   A+F+     ++ +    E RA +N
Sbjct: 82  ---------RTN--PG---DGRI---TPFPQPVNLGATFDVHTVYRVARATGLEMRARWN 124

Query: 123 LGNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
              A        G+  ++P +N++R P WGR  ET GEDP+++G+ A  +VRGL   +  
Sbjct: 125 RAKAKKTYRDGNGIHLFAPVVNLLRHPLWGRNQETFGEDPFMIGKLARTFVRGLGGWKNA 184

Query: 175 EYH----RDSDSRP--LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFE 228
           E      ++  S+P  L + A CKH+A +         R  F++ VT+ D+ +T++  F 
Sbjct: 185 EPQSLDEQNLSSQPDVLLVGANCKHFAVHTGPEDFPVSRLSFEANVTDVDLWQTYLPAFR 244

Query: 229 MCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKF 288
            C+  G VS VMC+Y+ +NG P C +  LL + +R  W F G++V+DC ++Q ++  H+ 
Sbjct: 245 ACLEAGAVS-VMCAYSGINGTPDCINHWLLTELLRQKWKFKGFVVTDCGALQFVIWKHQI 303

Query: 289 LNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQ----GKIAEADIDTSLRFLYIVLMR 344
            N   E A+A V +AG++L+    Y       +      G ++   +    R L++  + 
Sbjct: 304 FNHYNETAMAAV-RAGVNLENSVVYATEVFSTLPHLLASGSLSRDQLIEMARPLFLTRLM 362

Query: 345 LGYFDGSPQ--YKNLG-KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNT------GN 395
            G F+      Y+ L  +  I N  H  +A     + IVLL+N +  LPL        G 
Sbjct: 363 QGEFNPVEMDPYRLLAPEEAILNEDHRRVALATTARSIVLLQNRDRFLPLKNNMSDSGGP 422

Query: 396 IKTLALVGPHANATKAMIGNYEGTP-CRYTSPMD-GFYAYSKVINYAPGCAD---IVCQN 450
           ++ +A+VGP A +   + G+Y   P      P+  G    S+ ++ +  C D       N
Sbjct: 423 LRHIAIVGPFATSVTELYGHYRTAPEPEIEVPLSKGLSQLSRRMHASDICTDGGRCSSLN 482

Query: 451 NSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG---- 506
           +  + + +    + D  V+  G    VE E  DR ++ LPG Q EL+ +    + G    
Sbjct: 483 DDALHSTL-GYDDLDLIVLSLGTGSEVEGENVDRQNITLPGKQPELLEETLKLSSGLGNS 541

Query: 507 -------PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGK------ 553
                  P+ L++ SAG ++I+ A  N  +K+I W G+PG   G A+  ++ G       
Sbjct: 542 GLSKRTVPIILLVFSAGPINISRAVENENVKAIFWCGFPGPLVGDAMRHLLLGSSGELFG 601

Query: 554 ---------------------------YNPGGRLPITWYEA--NYVKIPYTSMPLRPVNN 584
                                      + P  RLP TWYE+      I    M  +    
Sbjct: 602 PSKPISVGFHSFQEAYRWDVTPDDGYWWIPAARLPFTWYESIDQLANITVYEMTNQTYRY 661

Query: 585 FPGRTYKFFDG---PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTN 641
            P + +   +    PV+YPFGYGLSY  F    AS     D+                  
Sbjct: 662 LPTQCHMSSEDCKIPVLYPFGYGLSY-NFNLSGASGFVYSDL------------------ 702

Query: 642 KPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH------IKQV 695
             P +AV        + +  F + V+N G +   EVV VY+K              + Q+
Sbjct: 703 IAPSSAV------SSNQRIVFYVTVQNEGPIACEEVVQVYTKWLNRTENDNSRNGPLIQL 756

Query: 696 IGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
            G+ERV +  G+  ++ FT+   + L +   + N+++   G   I VG
Sbjct: 757 AGFERVRLDVGEYKQLKFTLIPSEHLAVWSLSENTMIPGRGVLQISVG 804


>gi|317477153|ref|ZP_07936394.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316906696|gb|EFV28409.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 863

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 158/466 (33%), Positives = 241/466 (51%), Gaps = 38/466 (8%)

Query: 16  DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
           D   P   R ++++ +MTL EKV Q+ + +  +PRL LP Y +W+E LHGV+  G     
Sbjct: 51  DLSQPISVRIENIIRQMTLEEKVAQLSNESDSIPRLNLPSYNYWNECLHGVARAGE---- 106

Query: 76  PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
                        T FP  I   ++++  L K+I   +STEAR  Y     GLT+W+P I
Sbjct: 107 ------------VTVFPQAINLASTWDTLLVKRIASAISTEARLKYLDIGKGLTYWAPTI 154

Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
           N+ RDPRWGR  ET GEDPY+  R  + +V+GLQ               LK  A  KH+ 
Sbjct: 155 NMARDPRWGRNEETYGEDPYLTSRLGVAFVKGLQ---------GDHPNYLKTVATVKHFV 205

Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
           A    N + NDRF   S++  + + E +   +E CV E +V S+M +YN  NGIP     
Sbjct: 206 A----NNQENDRFSSSSQIPTKQLYEYYFPAYEACVKEANVQSIMTAYNAFNGIPPSGST 261

Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
            LL   +R +W F G++VSDC +I  +   H+ +N + E+A A  + +G DL+CG  Y  
Sbjct: 262 WLLEDVLRKEWGFDGFVVSDCGAIGVMNWQHRIVN-SLEEAAALGINSGCDLECGGTYRE 320

Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELAAE 373
             + AVQ+G ++E  ID +L  +  +  +LG FD      Y +  K  +   Q   LA E
Sbjct: 321 NLVAAVQRGLVSEYAIDRALTRVLTMRFKLGEFDPIELVPYNHYDKKLLAGEQFRRLAYE 380

Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG---F 430
           AA + I+LLKN++  LP++  +++++A+VGP A+     +G Y G P    S + G    
Sbjct: 381 AAVKSIILLKNEDNFLPIDKKDVRSIAIVGPFADNN--YLGGYSGKPVHNISLLQGVKKM 438

Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
                 I+Y  G + +   ++S + A+ D   N      + G DL+
Sbjct: 439 VGEEVEISYIEGTSVVSPVDSSYLLAS-DGVNNGLTADYIDGHDLN 483



 Score =  100 bits (248), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 81/285 (28%), Positives = 136/285 (47%), Gaps = 49/285 (17%)

Query: 464 ADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFA 523
           AD  ++  G D  +  E +D   + LP  Q EL+ K        + L++ +   +   +A
Sbjct: 609 ADLVLVALGNDGKLARENRDLPSIYLPMTQ-ELLLKEIYKVNPRIALILQTGNPLTSQWA 667

Query: 524 KNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP-LRPV 582
             +  + SIL   YPG+EGG A+A ++FG  NP G+LP+T YE+         +P +   
Sbjct: 668 AEH--VPSILQAWYPGQEGGAALAGILFGLENPSGKLPMTIYESE------QQLPNILDY 719

Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
           + + GRTY++     +Y FG+GLSY+ F+Y               D QC D+ +  GT  
Sbjct: 720 DIWKGRTYQYLSSKPLYGFGHGLSYSNFEY--------------ADLQCNDVVHVDGT-- 763

Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY---SKPPGIAGTHIKQVIGYE 699
                     ++C        I+V+N+  + G EV+ VY    K P +    +K++I + 
Sbjct: 764 ----------LQC-------SIKVKNISDVVGEEVIQVYVSREKTP-VYTFPLKKLIAFA 805

Query: 700 RVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
           RV +   +S  V FT+   + L +  +    +L SG +++ VG G
Sbjct: 806 RVNLKPNESKTVTFTITP-RQLSVWQDGEWKML-SGKYSLFVGGG 848


>gi|423342899|ref|ZP_17320613.1| hypothetical protein HMPREF1077_02043 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409217154|gb|EKN10133.1| hypothetical protein HMPREF1077_02043 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 955

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 219/807 (27%), Positives = 360/807 (44%), Gaps = 138/807 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
           Y D   P   R +DL+ +M + EK  QM  L YG  R+    LP  +W    W + +  +
Sbjct: 61  YEDPTAPIDARVEDLLSQMNVEEKTCQMVTL-YGYKRVLKDDLPTPDWKNQLWKDGMGAI 119

Query: 67  S----------------------------------FIGRRTNSPPGTHFDSE-VPG---- 87
                                              F    T     T F +E + G    
Sbjct: 120 DEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 179

Query: 88  -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
            AT+FPT +    ++N  L  K+G     E R +      G T  ++P ++V RD RWGR
Sbjct: 180 IATNFPTQLGLGHTWNRDLVHKVGYITGREGRLL------GYTNVYAPILDVGRDQRWGR 233

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
             E  GE PY+V    +   +G+Q     +Y         +++A  KHY AY  +     
Sbjct: 234 YEEVYGESPYLVAELGVEMAKGMQ----TDY---------QVAATSKHYIAYSNNKGGRE 280

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
                D +++ ++++   + P++  + E  +  VM SYN  +G P  +    L   +RG+
Sbjct: 281 GMARVDPQMSPREVEMLHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGE 340

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
           + F GY+VSD D+++ +   H    D KE  +  VL AGL++ C     D Y       +
Sbjct: 341 FGFRGYVVSDSDAVEYLFSKHGTAADMKESVLQSVL-AGLNIRCTFRSPDSYVLPLRELI 399

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYK-NLGKNNICNPQHIELAAEAARQGIV 380
            +G +  + ID  +R +  V   +G FD   Q         + + ++ ++A +A+++ +V
Sbjct: 400 AEGALPMSTIDDRVRDILRVKFLVGLFDQPYQIDLKQADKEVNSAENQQVALQASKESLV 459

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK---VI 437
           LLKN +  LPL+   I  +A+ GP+A+     + +Y       T+ ++G     K    +
Sbjct: 460 LLKNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIQNKVKPGTEV 519

Query: 438 NYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
            +  GC D+V                +  S I  A++ AK +D  V+V G       E K
Sbjct: 520 LFTKGC-DLVDANWPESELIRYPLTSEEQSEIDKAVENAKKSDVAVVVLGGSNRTCGENK 578

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
            R  L LPG Q +L+  V    K PV LV+++   + IN+A  +  + +IL   YPG +G
Sbjct: 579 SRSSLELPGRQLDLLQAVVATGK-PVVLVLINGRPISINWA--DKYVPAILEAWYPGSQG 635

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV----- 597
           G AIAD +FG YNPGG+L +T +     +IP+ + P +P     G   K  DG +     
Sbjct: 636 GTAIADALFGDYNPGGKLTVT-FPKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRVNG 693

Query: 598 -VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
            +YPFGYGLSYT F+Y   S                 I   + T   P        V+CK
Sbjct: 694 PLYPFGYGLSYTTFEYSDIS-----------------IQPAIVTQVQPVT------VRCK 730

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
                    V N GK  G EVV +Y +       T+ K ++G++R+ +  G++ ++ FT+
Sbjct: 731 ---------VTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFTI 781

Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
              + L+++++  + ++  G   ++VG
Sbjct: 782 EP-RDLQLLNSDNHWVVEPGDFKVMVG 807


>gi|404404031|ref|ZP_10995615.1| glycoside hydrolase family protein [Alistipes sp. JC136]
          Length = 740

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 193/625 (30%), Positives = 309/625 (49%), Gaps = 75/625 (12%)

Query: 111 QTVSTEAR-AMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGL 168
           +T+   AR A      AGL + ++P +++ RDPRWGRV+E  GEDPY+    A   VRG 
Sbjct: 133 ETIEASARMAAVEASAAGLQWTFAPMVDIARDPRWGRVMEGAGEDPYLGSHIARARVRGF 192

Query: 169 QDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFE 228
           Q         D  S P  I AC KH+A Y      G D    D  +++Q ++E ++ PF+
Sbjct: 193 QG--------DDLSAPNTILACAKHFAGYGASEG-GRDYNTVD--ISDQRLRELYLPPFK 241

Query: 229 MCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKF 288
              +    ++ M S+N ++G+P   +  L+ Q +R +W + G IVSD  S+  ++  H  
Sbjct: 242 AAADA-GAATFMNSFNELSGVPATGNRFLVKQILRNEWGWDGVIVSDWGSVAEMI-PHGI 299

Query: 289 LNDTKEDAVARVLKAGLDLDC-GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
             D K+ A+  V K   D+D  G+ Y +     V++GK++E +ID S+R +  +   LG 
Sbjct: 300 AEDKKQAALLAV-KNECDIDMEGNCYPSSLEELVKEGKVSEKEIDRSVRRILRLKYELGL 358

Query: 348 FDGSPQY--KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPH 405
           FD   +Y  +   K    +  H E A + AR+ IVLL+N    LPL  G  +++A+VGP 
Sbjct: 359 FDDPYRYCDEQREKEVTLSAAHREAARDMARKSIVLLENRKSVLPL--GKPRSIAVVGPL 416

Query: 406 ANATKAMIGNY--EGTPCRYTSPMDGFYAYSKV---INYAPGCADIVCQNNSMIPAAIDA 460
           A++   M+G +  +G P    + + G    +     + +A GC D+   + S    A+ A
Sbjct: 417 ADSPVDMLGEWRAKGDPKEVVTILRGIEKTAGAGTRVTHAKGC-DVTGSDRSGFAEAVRA 475

Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
           A++AD  +   G    +  EG  R +L LPG Q EL+ ++    K P+ L++ +   + +
Sbjct: 476 ARSADVVIACLGESADMSGEGYCRSELGLPGVQQELLKELKKTGK-PIVLLLSNGRPLTL 534

Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP------Y 574
            + K N  I++I+   + G E G A+ADV+FGKYNP G+L ++ +  N  +IP      +
Sbjct: 535 AWEKEN--IETIVETWFLGTEAGNAVADVLFGKYNPSGKLVMS-FPYNVGQIPVYYNHKH 591

Query: 575 TSMPLRPVNNFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCR 632
           T  P  P   +      + D PV  +YPFGYGLSYT+F+Y                    
Sbjct: 592 TGRPFEPNQRY---VMHYIDAPVDALYPFGYGLSYTRFEY-------------------- 628

Query: 633 DINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH- 691
                    +P     L  D        T  ++V N G  DG EVV +Y +      T  
Sbjct: 629 --------GEP----TLSSDRMAAGDTITATVKVTNAGDYDGEEVVQLYIRDLKAQITRP 676

Query: 692 IKQVIGYERVFIAAGQSAKVGFTMN 716
           +K++ G+ ++F+  G+SA V F + 
Sbjct: 677 VKELKGFRKIFLKKGESADVTFDIT 701


>gi|261405721|ref|YP_003241962.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
 gi|261282184|gb|ACX64155.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
           Y412MC10]
          Length = 765

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 195/694 (28%), Positives = 324/694 (46%), Gaps = 98/694 (14%)

Query: 87  GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
           G T FP  +   +++N  L++ + + V+ E R+       G   +SP ++VVRDPRWGR 
Sbjct: 122 GGTVFPVPLSIGSTWNVDLYRDMCRAVALETRS-----QGGAVTYSPVLDVVRDPRWGRT 176

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN- 205
            E  GEDPY++  YA+  V GLQ         +S   P  ++A  KH+  Y       N 
Sbjct: 177 EECFGEDPYLISEYAVASVEGLQG--------ESLDSPSSVAATLKHFVGYGSSEGGRNA 228

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
              H  +R    ++ E  +LPF+  V  G  +S+M +YN ++G+P   + +LL+  +R +
Sbjct: 229 GPVHMGTR----ELMEVDMLPFKKAVEAG-AASIMPAYNEIDGVPCTVNTELLDGILRKE 283

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQG 324
           W F G +++DC +I  +   H    D   DA  + ++AG+D++  G+ +      AV+  
Sbjct: 284 WGFDGMVITDCGAIDMLASGHDTAEDGM-DAAVQAIRAGIDMEMSGEMFGKHLQKAVESN 342

Query: 325 KIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKN 384
           K+  + +D ++R +  +  +LG F+         +N I + QH+ LA + A +GIVLLKN
Sbjct: 343 KLEVSVLDEAVRRVLTLKFKLGLFENPYVDPQTAENVIGSEQHVGLARQLAAEGIVLLKN 402

Query: 385 DNGALPLNTGNIKTLALVGPHANATKAMIGNYEG--TPCRYTSPMDGFYAY----SKVIN 438
           +  ALPL+      +A++GP+A+     +G+Y     P   T+ + G  A     ++ + 
Sbjct: 403 EAKALPLSKEG-GVIAVIGPNADQGYNQLGDYTSPQPPAAVTTVLGGIRAKLGEEAQRVL 461

Query: 439 YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG-----------LDLSVEA-------- 479
           YAPGC  I   +      A+  A+ AD  V+V G           +DL   A        
Sbjct: 462 YAPGCR-IKDDSREGFEFALTCAEQADTVVMVLGGSSARDFGEGTIDLRTGASKVTDDAL 520

Query: 480 ------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
                 EG DR+ L L G Q EL+ ++    K    ++++      I     +    +IL
Sbjct: 521 SDMDCGEGIDRMTLQLSGVQLELVQEIHKLGK---RMIVVYINGRPIAEPWIDEHADAIL 577

Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKF 592
              YPG+EGG A+AD++FG  NP G+L ++       + + Y     R      G+ Y  
Sbjct: 578 EAWYPGQEGGHAVADILFGDVNPSGKLTMSIPKHVGQLPVYYNGKRSR------GKRYLE 631

Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
            D    YPFGYGLSYT+F Y         DI++  +         +GT            
Sbjct: 632 EDSQPRYPFGYGLSYTEFSYS--------DIQMTPE--------VIGT------------ 663

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKV 711
               D      + V N G  +GSEVV +Y        T   +++ G++++F+  G+  KV
Sbjct: 664 ----DGTAVVSVNVTNSGDCEGSEVVQLYVSDAASKYTRPARELKGFQKIFLQPGERRKV 719

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
            FT+   + L+ +      ++  G   +++G  V
Sbjct: 720 EFTIGP-EQLQYIGQDYRQVVEPGLFRVMLGRHV 752


>gi|380696432|ref|ZP_09861291.1| beta-glucosidase [Bacteroides faecis MAJ27]
          Length = 954

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 224/766 (29%), Positives = 361/766 (47%), Gaps = 123/766 (16%)

Query: 6   KVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSE 61
           K +++D  Y D  LP  ER + L+  MT PE   ++    +G+P  G+P LY       E
Sbjct: 162 KGEVTDRRYMDVSLPVEERVESLLAVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVE 218

Query: 62  ALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY 121
           A+HG S+         G+       GAT FP  +   A++N+ L +++   +  E  A  
Sbjct: 219 AVHGFSY---------GS-------GATIFPQALAMGATWNKKLTEEVAMVIGDETVAA- 261

Query: 122 NLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
           N   A    WSP ++V +D RWGR  ET GEDP +V +    +++G Q            
Sbjct: 262 NTKQA----WSPVLDVAQDARWGRCEETFGEDPVLVSQMGGAWIKGYQ------------ 305

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
           SR L  +   KH+  +         R   D  ++E++M+E  ++PF   +   D  S+M 
Sbjct: 306 SRGLFTTP--KHFGGHGAPL---GGRDSHDIGLSEREMREIHLVPFRHAIRNYDCQSLMM 360

Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
           +Y+   G+P     +LL Q +R +W F+G+IVSDC +I  +     +    K +A  + L
Sbjct: 361 AYSDYMGVPVAKSKELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQAL 420

Query: 302 KAGLDLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP----QYKN 356
            AG+  +CGD Y N   + A + G+I   D+D   R +   + R   F+ +P     +K 
Sbjct: 421 AAGIATNCGDTYNNKEVIQAAKDGRINMEDLDNVCRTMLSTMFRNELFEKNPCKPLDWKK 480

Query: 357 L--GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
           +  G N   +  H E+A +AAR+ IV+L+N    LPL+   ++T+A+VGP A+  +   G
Sbjct: 481 IYPGWN---SDSHKEMARQAARESIVMLENKENLLPLSK-TLRTIAVVGPGADDLQP--G 534

Query: 415 NY--EGTPCRYTSPMDGFYA----YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
           +Y  +  P +  S + G  +     +KV+ Y  GC D    + + IP A+  A  +D  +
Sbjct: 535 DYTPKLLPGQLKSVLTGIKSAVGKQTKVL-YEQGC-DFTNPDATNIPKAVKTASQSDVVI 592

Query: 469 IVAGLDLSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
           +V G   + EA         E  D   L+LPG Q EL+  V    K PV L++ +    D
Sbjct: 593 MVLGDCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYD 651

Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
           I   K +   K+IL    PG+EGG A+ADV+FG YNP GRLP+T+            +PL
Sbjct: 652 I--LKASEMCKAILVNWLPGQEGGPAMADVLFGDYNPAGRLPMTFPRH------VGQLPL 703

Query: 580 RPVNNFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
                  GR Y++ D     +Y FG+GLSYT F+Y         ++K+   Q+  + N  
Sbjct: 704 YYNFKTSGRRYEYVDMEYYPLYRFGFGLSYTSFEYS--------NLKI---QEKANGNVE 752

Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVI 696
           V                        Q  V+N+G   G EV  +Y +       T + ++ 
Sbjct: 753 V------------------------QATVKNVGSCAGDEVAQLYVTDMYASVKTRVMELK 788

Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            + R+ +  G+S  V F M     + ++++  + ++  G   I++G
Sbjct: 789 DFTRIHLQPGESKTVSFEMTPY-DISLLNDRMDRVVEKGEFKIMIG 833


>gi|189464219|ref|ZP_03013004.1| hypothetical protein BACINT_00556 [Bacteroides intestinalis DSM
           17393]
 gi|189438009|gb|EDV06994.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 865

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 152/412 (36%), Positives = 218/412 (52%), Gaps = 34/412 (8%)

Query: 20  PYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGT 79
           P   R ++L+ +MTL EKV Q+ +    +PRL LP Y +W+E LHGV+  G         
Sbjct: 55  PISARVENLISKMTLEEKVAQLSNETDSIPRLNLPSYNYWNECLHGVARAGE-------- 106

Query: 80  HFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVR 139
                    T FP  I   ++++  L KK+   +STEAR  Y     GLT+WSP IN+ R
Sbjct: 107 --------VTVFPQAINLASTWDTLLIKKVASAISTEARLKYLEIGKGLTYWSPTINMAR 158

Query: 140 DPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDL 199
           DPRWGR  ET GEDPY+  R  + +V+GLQ       H D     LK  A  KH+ A   
Sbjct: 159 DPRWGRNEETYGEDPYLTSRLGVAFVKGLQGD-----HPDY----LKTVATIKHFVA--- 206

Query: 200 DNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLN 259
            N + NDRF   S++  + + E +   +E CV E D  SVM +YN  NG+       LL 
Sbjct: 207 -NNQENDRFSSSSQIPTKQLYEYYFPAYEACVKEADAQSVMTAYNAFNGVAPSGSTWLLG 265

Query: 260 QTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMG 319
             +R +W F G++VSDC +I  +   H+ +N + E+A A  + +G DL+CG  Y    + 
Sbjct: 266 DVLRKEWGFDGFVVSDCGAIGVMNWQHRVVN-SLEEAAALGINSGCDLECGGTYREKLVA 324

Query: 320 AVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAEAARQ 377
           AV+ G ++E  ID +L  +     +LG FD      Y +  K  +   +  +LA EAA +
Sbjct: 325 AVKMGLVSEQAIDKALTRVLTARFKLGEFDPIELVPYNHYDKKLLAGEKFGKLAYEAAVK 384

Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG 429
            IVLLKNDN  LP++   I+++A+VGP A+     +G Y G P    S + G
Sbjct: 385 SIVLLKNDNDFLPVDKKKIRSVAIVGPFADNN--YLGGYSGKPVHNVSLLQG 434



 Score =  103 bits (256), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 85/311 (27%), Positives = 140/311 (45%), Gaps = 49/311 (15%)

Query: 450 NNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVT 509
           N+  I    +    AD  ++  G D  +  E +D   + LP  Q  L+ ++      P T
Sbjct: 595 NSDQIDKVKEFVSGADLVLVALGNDEKLARENRDLPSIYLPMTQELLLKEIYKV--NPRT 652

Query: 510 LVIMSAG-AVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN 568
            +I+  G  +   +A  N  + +IL   YPG+EGG+A+A ++FG  NP G+LP+T YE+ 
Sbjct: 653 ALILHTGNPLTSKWAAEN--VPAILQAWYPGQEGGKALAGILFGSENPSGKLPMTIYESE 710

Query: 569 YVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
                   +P +   + + GRTY++     +Y FG+GLSY+ F+Y    S          
Sbjct: 711 ------EQLPDILDYDIWKGRTYQYLSSKPLYGFGHGLSYSNFEYTHLQS---------- 754

Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY--SKPP 685
                                  DDV   D      IE++N+  + G EVV VY   +  
Sbjct: 755 -----------------------DDVVRPDGTLQCSIEIKNISDVAGEEVVQVYISRENT 791

Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
            +    +K+++ + RV +  G+S  V FT+ A + L I       +L  G +++ VG G 
Sbjct: 792 PVYTFPLKKLVAFARVDLKPGESKTVTFTI-APRQLSIWQEGIWKMLP-GKYSLFVGSGQ 849

Query: 746 GGVSFPLQLNL 756
            G+S  +  N 
Sbjct: 850 EGLSKGINRNF 860


>gi|218258058|ref|ZP_03474485.1| hypothetical protein PRABACTJOHN_00138 [Parabacteroides johnsonii
           DSM 18315]
 gi|218225777|gb|EEC98427.1| hypothetical protein PRABACTJOHN_00138 [Parabacteroides johnsonii
           DSM 18315]
          Length = 955

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 219/807 (27%), Positives = 360/807 (44%), Gaps = 138/807 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
           Y D   P   R +DL+ +M + EK  QM  L YG  R+    LP  +W    W + +  +
Sbjct: 61  YEDPTAPIDARVEDLLSQMNVEEKTCQMVTL-YGYKRVLKDDLPTPDWKNQLWKDGMGAI 119

Query: 67  S----------------------------------FIGRRTNSPPGTHFDSE-VPG---- 87
                                              F    T     T F +E + G    
Sbjct: 120 DEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 179

Query: 88  -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
            AT+FPT +    ++N  L  K+G     E R +      G T  ++P ++V RD RWGR
Sbjct: 180 IATNFPTQLGLGHTWNRDLVHKVGYITGREGRLL------GYTNVYAPILDVGRDQRWGR 233

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
             E  GE PY+V    +   +G+Q     +Y         +++A  KHY AY  +     
Sbjct: 234 YEEVYGESPYLVAELGVEMAKGMQ----TDY---------QVAATSKHYIAYSNNKGGRE 280

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
                D +++ ++++   + P++  + E  +  VM SYN  +G P  +    L   +RG+
Sbjct: 281 GMARVDPQMSPREVEMLHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGE 340

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
           + F GY+VSD D+++ +   H    D KE  +  VL AGL++ C     D Y       +
Sbjct: 341 FGFRGYVVSDSDAVEYLFSKHGTAADMKESVLQSVL-AGLNIRCTFRSPDSYVLPLRELI 399

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYK-NLGKNNICNPQHIELAAEAARQGIV 380
            +G +  + ID  +R +  V   +G FD   Q         + + ++ ++A +A+++ +V
Sbjct: 400 AEGALPMSTIDDRVRDILRVKFLVGLFDQPYQIDLKQADKEVNSAENQQVALQASKESLV 459

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK---VI 437
           LLKN +  LPL+   I  +A+ GP+A+     + +Y       T+ ++G     K    +
Sbjct: 460 LLKNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIQNKVKPGTEV 519

Query: 438 NYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
            +  GC D+V                +  S I  A++ AK +D  V+V G       E K
Sbjct: 520 LFTKGC-DLVDANWPESELIRYPLTSEEQSEINKAVENAKKSDVAVVVLGGSNRTCGENK 578

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
            R  L LPG Q +L+  V    K PV LV+++   + IN+A  +  + +IL   YPG +G
Sbjct: 579 SRSSLELPGRQLDLLQAVVATGK-PVVLVLINGRPISINWA--DKYVPAILEAWYPGSQG 635

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV----- 597
           G AIAD +FG YNPGG+L +T +     +IP+ + P +P     G   K  DG +     
Sbjct: 636 GTAIADALFGDYNPGGKLTVT-FPKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRVNG 693

Query: 598 -VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
            +YPFGYGLSYT F+Y   S                 I   + T   P        V+CK
Sbjct: 694 PLYPFGYGLSYTTFEYSDIS-----------------IQPAIVTQVQPVT------VRCK 730

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
                    V N GK  G EVV +Y +       T+ K ++G++R+ +  G++ ++ FT+
Sbjct: 731 ---------VTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFTI 781

Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
              + L+++++  + ++  G   ++VG
Sbjct: 782 EP-RDLQLLNSDNHWVVEPGDFKVMVG 807


>gi|423223721|ref|ZP_17210190.1| hypothetical protein HMPREF1062_02376 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638096|gb|EIY31949.1| hypothetical protein HMPREF1062_02376 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 954

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 226/760 (29%), Positives = 356/760 (46%), Gaps = 119/760 (15%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSEALHG 65
           +   Y D  LP  ER + L+  MT PE   ++    +G+P  G+P LY       EA+HG
Sbjct: 166 TSLRYMDPTLPVEERVESLLSVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVEAVHG 222

Query: 66  VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
            S+         G+       GAT FP  +   A++N+ L + +   V  E      L  
Sbjct: 223 FSY---------GS-------GATIFPQALAMGATWNKKLTEDVAMAVGDE-----TLAA 261

Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
             +  WSP ++V +D RWGR  ET GEDP +V +    +++G Q            S+ L
Sbjct: 262 GTMQAWSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SKGL 309

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
             +   KH+  +         R   D  ++E++M+E  ++PF   +   D  SVM +Y+ 
Sbjct: 310 FTTP--KHFGGHGAPL---GGRDSHDIGLSEREMREVHLVPFRHVIRNYDCQSVMMAYSD 364

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
             G+P     +LL+  +R +W F G+IVSDC +I  +     +    K +A  + L AG+
Sbjct: 365 YLGVPVAKSRELLHSILREEWGFDGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGI 424

Query: 306 DLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC- 363
             +CGD Y +   + A + G+I   ++D   R +  ++ R   F+ +P  K L  N I  
Sbjct: 425 ATNCGDTYNDKEVIQAAKDGRINMENLDEVCRTMLRMMFRNELFEKTPN-KPLDWNKIYP 483

Query: 364 ---NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY--EG 418
              +  H E+A +AAR+ IV+L+N +  LPL   +++T+A+VGP A+  +   G+Y  + 
Sbjct: 484 GWNSDSHKEMARQAARESIVMLENKDNILPL-AKDMRTIAVVGPGADDLQP--GDYTPKL 540

Query: 419 TPCRYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
            P +  S + G        +KV+ Y  GC D    N + IP A+ AA  +D  V+V G  
Sbjct: 541 LPGQLKSVLTGIKQAVGKQTKVV-YEQGC-DFTSSNGTDIPKAVKAASQSDVVVLVLGDC 598

Query: 475 LSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
            + E+         E  D   L+LPG Q EL+  V   A G   ++I+ AG    N +K 
Sbjct: 599 STSESTTDVYKTSGENHDYATLILPGKQQELLEAV--CATGKPVILILQAGR-PYNLSKA 655

Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
           +   K+IL    PG+EGG A ADV+FG YNP GRLP+T+    +V      +PL      
Sbjct: 656 SELCKAILVNWLPGQEGGPATADVLFGDYNPAGRLPMTF--PRHV----GQLPLYYNFKT 709

Query: 586 PGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKP 643
            GR Y++ D     +Y FGYGLSYT F+Y              K Q+  + N  +     
Sbjct: 710 SGRRYEYSDMEFYPLYYFGYGLSYTSFEYSGL-----------KIQEKDNGNVAI----- 753

Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF 702
                              Q  V+N+G+  G EVV +Y +       T I ++  + RV 
Sbjct: 754 -------------------QATVKNVGQRAGDEVVQLYITDMYASVKTRITELKDFTRVH 794

Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +   +S  V F +   + L ++++  + ++  G   ILVG
Sbjct: 795 LQPDESKIVSFELTPYE-LSLLNDRMDRVVEKGEFKILVG 833


>gi|427387416|ref|ZP_18883472.1| hypothetical protein HMPREF9447_04505 [Bacteroides oleiciplenus YIT
           12058]
 gi|425725577|gb|EKU88448.1| hypothetical protein HMPREF9447_04505 [Bacteroides oleiciplenus YIT
           12058]
          Length = 733

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 217/760 (28%), Positives = 356/760 (46%), Gaps = 92/760 (12%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLYEW 58
           Y DA  P   R KDL++RMTL EKV Q+    +G              +P  +G  +Y  
Sbjct: 25  YKDAGQPVETRVKDLLKRMTLHEKVLQLNQYTFGENDNPNNIGTEVKNLPAEIGSLIYLH 84

Query: 59  WSEALHGV----SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
               L       +    R   P    FD      T +P  +    SFN  L   + Q   
Sbjct: 85  TDPKLRNQIQRKAMEESRLGIPILFGFDVIHGLRTVYPISLAQACSFNPDL---VTQACG 141

Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
             A+    L     TF SP I+V RDPRWGR+ E  GEDPY      +N V G+  V+G 
Sbjct: 142 MAAKESV-LSGIDWTF-SPMIDVARDPRWGRISECYGEDPY------LNTVFGVASVQGY 193

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
           +  + SD  P  I+AC KHY  Y     EG   + + + ++ Q + ET++ P+E CV  G
Sbjct: 194 QGEKLSD--PYSIAACLKHYVGYGAS--EGGRDYRY-TDISPQALWETYLPPYEACVKAG 248

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
             +++M S+N ++G+P  ++  +L + ++  W   G++VSD ++I+ ++  ++ +   ++
Sbjct: 249 -AATLMSSFNDISGVPATSNHYILTEILKNKWRHDGFVVSDWNAIEQLI--YQGVAKDRK 305

Query: 295 DAVARVLKAGLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
           +A  +   AG+++D  D  Y  +    V + KI  + ID ++  +  V  RLG FD  P 
Sbjct: 306 EAAYKAFHAGVEMDMRDNIYYEYLEQLVAEKKIQMSQIDDAVARILRVKFRLGLFD-EPY 364

Query: 354 YKNLG-KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAM 412
            K L  +      + I LAA  A + +VLLKN+N  LPL++  +K +AL+GP A  +  +
Sbjct: 365 TKELTEQERYLQKEDIALAARLAEESMVLLKNENNLLPLSS-TVKRVALIGPMAKDSANL 423

Query: 413 IGNY------EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADA 466
           +G +      E     Y   M   +     ++Y  GCA +   + S   AA+  A+ +D 
Sbjct: 424 LGAWAFKGHAEDVETIYEG-MQKEFGDKVQLDYEQGCA-LDGNDESGFSAALKTAEASDV 481

Query: 467 TVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
            V+  G       E   R  + LP  Q +L+  +  A K P+ LV+ S   +++   +  
Sbjct: 482 VVVCLGESKQWSGENASRSTIALPDIQEKLLLHLKQANK-PIVLVLSSGRPLEL--IRLE 538

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP-YTSM--PLRPVN 583
           P++++I+ +  PG  GG  +A ++ G+ NP G+L +T +  +  +IP Y +M    RP +
Sbjct: 539 PQVEAIIEMWQPGVAGGTPLAGILSGRVNPSGKLSVT-FPLSTGQIPVYYNMRQSARPFD 597

Query: 584 NFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKP 643
                 Y+      +YPFG+GLSYT F Y   S  K   +K+ K+Q              
Sbjct: 598 AMG--DYQDIPTKPLYPFGHGLSYTTFVY---SDAKLSSLKIRKNQ-------------- 638

Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVF 702
                          K T ++ V N GKM+G E V+ Y   P  + +  +K++  +E+  
Sbjct: 639 ---------------KITAEVTVTNAGKMEGKETVLWYVSDPFCSISRPMKELKFFEKHS 683

Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           + AG+S    F ++  + L   D      L +G   + VG
Sbjct: 684 LNAGESRVFRFEIDPMRDLSYTDATGKRFLEAGEFIVSVG 723


>gi|375143423|ref|YP_005005864.1| Beta-glucosidase [Niastella koreensis GR20-10]
 gi|361057469|gb|AEV96460.1| Beta-glucosidase [Niastella koreensis GR20-10]
          Length = 793

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 225/801 (28%), Positives = 360/801 (44%), Gaps = 137/801 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
           Y D K     R  DL+ +MTL EK  QM  L YG  R+    LP   W    W + +   
Sbjct: 43  YEDPKQSVNARTADLLSKMTLDEKTCQMATL-YGWHRVLKDSLPTDSWKNAIWKDGIANI 101

Query: 64  --HGVSFIGRRTNSPPGTHFDSE--------------------VPG-------------- 87
             H   F G    +P     D E                    +P               
Sbjct: 102 DEHLNGFAGWGKTAPIDLVKDMEKHVWAMNETQRFFIEQTRLGIPADFTNEGIRGVEAYE 161

Query: 88  ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
           AT FPT +    ++N+ L  + G     EARA+      G T  ++P ++V RD RWGR+
Sbjct: 162 ATGFPTELNMGMTWNKELVHQEGIITGREARAL------GYTNVYAPIMDVARDQRWGRL 215

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
            E+ GEDPY+V    I   +G+Q        +D      K+++  KH+A Y  +      
Sbjct: 216 EESYGEDPYLVASMGIALAKGIQ--------QDG-----KVASTAKHFAVYSANKGAREG 262

Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
           +   D +V  ++++   + PF+  + E  +  VM SYN  +GIP       L Q +R + 
Sbjct: 263 QARTDPQVAPREVENLLLYPFKKVIKEAGIMGVMSSYNDYDGIPVSGSNYWLIQRLRVEM 322

Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL----DCGDYYTNFTMGAVQ 322
            F GY+VSD D+++ +   H    + KE AV +   AG+++       D    +    V+
Sbjct: 323 GFTGYVVSDSDALEYLATKHHVAANLKE-AVFQAFMAGMNVRTTFKAPDSIIIYLRQLVK 381

Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN--ICNPQHIELAAEAARQGIV 380
           +G+I    I+  +  +  V  RLG FD  P  ++  +    + +    ++A +A+R+ +V
Sbjct: 382 EGRIPMDTINHRVADVLRVKFRLGLFD-HPYVESAAETRKVVNSDASQQIALQASRESVV 440

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY---SKVI 437
           LLKN+N  LPL   ++  +A+VGP+A        +Y        + + G  A     KV+
Sbjct: 441 LLKNNNNILPL-VKSLDKIAVVGPNATDDDYAHTHYGPLGSPSVNVLQGIQAKLGAGKVL 499

Query: 438 NYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
            YA G  D+V +N                +M+ +A++  K A   ++V G +     E K
Sbjct: 500 -YAKGV-DLVDKNWPESEILPEPMDAGEQAMLDSAVNITKQAQMAIVVLGGNTRTAGESK 557

Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
            R DL LPG Q EL+  +    K PV +V++    + IN+   +  I  I++ GYPG +G
Sbjct: 558 SRTDLDLPGHQLELVKAIKATGK-PVVVVLLGTQPMTINWI--DKYIDGIVYAGYPGVKG 614

Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
           G A+ADV+FG YNPGG+L +TW ++   +IP  + P +P        +    G ++YPFG
Sbjct: 615 GIAVADVLFGDYNPGGKLTLTWPKS-VGQIPL-NFPSKPGAQSDEGEHAKIKG-LLYPFG 671

Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
           +GLSYT F Y                      N  + T K     V +            
Sbjct: 672 FGLSYTSFGY---------------------TNLKISTGKTAADPVAV------------ 698

Query: 663 QIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
            ++V N GK+ G EVV  Y +       T+ K + G+ERV + AG++  + FT+   + L
Sbjct: 699 TVDVTNTGKLAGDEVVQCYIRDVLSSVTTYEKLLKGFERVHLQAGETKTISFTI-PREEL 757

Query: 722 KIVDNAANSLLASGAHTILVG 742
           K+ +     +L  G  ++++G
Sbjct: 758 KLYNREMKFVLEPGEFSVMIG 778


>gi|423223874|ref|ZP_17210343.1| hypothetical protein HMPREF1062_02529 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392637823|gb|EIY31686.1| hypothetical protein HMPREF1062_02529 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 759

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 229/797 (28%), Positives = 369/797 (46%), Gaps = 139/797 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGL-------------------- 53
           Y D+  P  +R +DL++RMTL EKV QM     GV  +                      
Sbjct: 18  YKDSTAPVKDRVEDLLKRMTLEEKVGQMNQFV-GVEHIKANSAVMTEEELKNNTANAFYP 76

Query: 54  -----PLYEWWSEALHGVSFI----------------GRRTNSP-----PGTHFDSEVPG 87
                 + +W  E L G SF+                  R   P        H ++  P 
Sbjct: 77  GFTEKDIEKWTEEGLIG-SFLHVLTIEEANYLQSLAMKSRLQIPIIFGIDAIHGNANAPD 135

Query: 88  ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVL 147
            T +PT I    SF+  +  KI +  + E RAM    N   TF +PN+ V RD RWGRV 
Sbjct: 136 NTVYPTNINLACSFDTLMAYKIARQTAKEMRAM----NMHWTF-NPNVEVARDARWGRVG 190

Query: 148 ETPGEDPYVVGRYAINYVRGLQ-DVEGVEYHRDSDSRPLKISACCKHY--AAYDLDNWEG 204
           ET GEDPY+V    +  V+G Q D+ G E           + AC KH+   +  ++   G
Sbjct: 191 ETYGEDPYLVTLLGVQSVKGYQGDLNGNE----------DVLACIKHFVGGSEPINGTNG 240

Query: 205 NDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRG 264
           +      + ++E+ ++E F  PFE  V  G +S +M ++N +NGIP  ++  L+   +RG
Sbjct: 241 SP-----TDLSERTLREVFFPPFEAGVKAGAMS-LMTAHNELNGIPCHSNEWLMQDILRG 294

Query: 265 DWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC-GDYYTNFTMGAVQQ 323
           +WNF G++VSD   I+ I + H    + KE A  + +  G+D+   G ++    +  V++
Sbjct: 295 EWNFPGFVVSDWMDIEHIHDLHATAENLKE-AFYQSIMGGMDMHMHGIHWNEMVVELVRE 353

Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIVLL 382
           G+I E+ ID S+R +  +  RLG F+          K  +C+ +H   A E+AR GIVLL
Sbjct: 354 GRIPESRIDESVRRILDIKFRLGLFEQPYADEAETMKVRLCD-EHRATALESARNGIVLL 412

Query: 383 KNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG--TPCRYTSPMDGF--YAYSKVIN 438
           KND G LPL+    K + + G +A+  + ++G++         T+ ++G    A     +
Sbjct: 413 KND-GVLPLDASRYKKILVTGINAD-DQNILGDWSAPEKDENVTTILEGLKMIAPDTQFD 470

Query: 439 YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL-------SVEAEGKDRVDLLLPG 491
           +     D    +   +  A   AK+AD  ++VAG  +         + E  DR DL L G
Sbjct: 471 FVDQGWDPRNMDPKKVAEAAVRAKSADLNIVVAGEYMMRFRWNDRTDGEDTDRSDLDLVG 530

Query: 492 FQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIF 551
            Q ELI KVA + K P  L++++   + + +A  N  + +I+    PG  GG+A+A++++
Sbjct: 531 LQNELIEKVAASGK-PTILILVNGRPLGVQWAAEN--LPAIVEAWAPGMYGGQAVAEILY 587

Query: 552 GKYNPGGRLPITWYEANYVKIPYTSMPLRPV-NNFPGRTYKFFDG----PVVYPFGYGLS 606
           GK NP  +L IT        IP++   L+ + N+ P + +  +        +YPFG+GLS
Sbjct: 588 GKVNPSAKLAIT--------IPHSVGQLQMIYNHKPSQYFHPYAAGKPSTPLYPFGHGLS 639

Query: 607 YTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEV 666
           YT +KY         D+KL + +  +D    V                         ++V
Sbjct: 640 YTTYKYD--------DLKLAQKEITKDGTVDVS------------------------VKV 667

Query: 667 ENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVD 725
            N G  DG E+V +Y +    + T  +K++  + RV + AG+S  V F +   K L   D
Sbjct: 668 TNTGDRDGVEIVQLYIRDKFSSVTRPVKELKDFARVSLKAGESQVVNFKITPDK-LAFYD 726

Query: 726 NAANSLLASGAHTILVG 742
                ++  G   ++VG
Sbjct: 727 KKMKKIVEPGEFIVMVG 743


>gi|346226088|ref|ZP_08847230.1| glycoside hydrolase family 3 domain protein [Anaerophaga
           thermohalophila DSM 12881]
          Length = 749

 Score =  259 bits (663), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 215/726 (29%), Positives = 336/726 (46%), Gaps = 100/726 (13%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +P+ + +L    R  DL+ RMTL EKV  +      VPRLG+       E  HGV+  G 
Sbjct: 52  YPFQNPELDSEARIDDLLSRMTLDEKVSALSTDP-SVPRLGVKGAPH-IEGYHGVAMGGP 109

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN---LGNAGL 128
              +P G   D  VP  T+FP      A++N  L +  G+  S EAR ++    +   GL
Sbjct: 110 ANWAPKG---DEAVP-TTTFPQAYGMGATWNPELIRLAGEIESIEARYIFQNPEIAKGGL 165

Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
              +PN ++ RDPRWGR  E  GEDP++VG  A  + +GLQ           D +  + +
Sbjct: 166 VVRAPNADLGRDPRWGRTEECFGEDPFLVGTSATAFTKGLQ---------GDDDQYWRTA 216

Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
           +  KH+ A   +N   +    FD ++      E +   F     EG  ++ M +YN +NG
Sbjct: 217 SLLKHFLANSNENGRESSSSDFDMQL----YHEYYGASFRRAFIEGGSNAYMAAYNAING 272

Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
           +P      +  +     W   G   +D    Q +V  HK+ +D    A   V+KAGL+  
Sbjct: 273 VPAHVH-DMHKEITERMWGVDGIKCTDGGGYQLLVYGHKYYDDLYL-AAEGVIKAGLN-Q 329

Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICN 364
             D Y     GA+  G I EADID  LR +Y V+++LG  D  PQ    Y  +G++    
Sbjct: 330 FLDNYREGVYGALAHGYITEADIDEVLRGVYRVMIKLGQLD--PQEKVPYSAIGRDGKPA 387

Query: 365 P----QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
           P    +H + A   AR+ IVLLKN+N  LPLN   +  +A++G  A+    ++  Y G P
Sbjct: 388 PWTTQKHKDAALRMARESIVLLKNNNKTLPLNADKLNKVAVIGYLADTV--LLDWYSGLP 445

Query: 421 CRYTSPMDGFYAY----SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
               +P++G        SKV+ YAP         ++   AA++AA  AD  +++ G   +
Sbjct: 446 PYRITPLEGIREKLGNDSKVL-YAP---------DNDYNAAVEAASEADVAIVILGNYPT 495

Query: 477 VEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
             +E          G++ +D        E + K+   A      V+ S+    IN+++ N
Sbjct: 496 CNSEIWADCPDPGMGREAIDRKTLRLTDEYLVKLVMEANPNTIFVLQSSFPYAINWSQQN 555

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
             + +IL + + G+E G A+ADV+FG YNPGG+L  TW ++           +R      
Sbjct: 556 --VPAILHLTHNGQETGSALADVLFGDYNPGGKLTQTWPKSEDQLPDMMEYDIR-----K 608

Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
           G TY +F+   +YPFG+GLSYT F ++                        +  NKP  +
Sbjct: 609 GHTYMYFEDKPLYPFGHGLSYTTFAWE-----------------------DISINKPVVS 645

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAA 705
           A         D +    ++++N G + G EVV +Y S P        K + G++RV +  
Sbjct: 646 A--------DDEEVIITVKLKNTGDVKGDEVVQLYASFPESTVRRPAKALKGFKRVTLEP 697

Query: 706 GQSAKV 711
           G+  K+
Sbjct: 698 GEKKKI 703


>gi|410097652|ref|ZP_11292633.1| hypothetical protein HMPREF1076_01811 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409223742|gb|EKN16677.1| hypothetical protein HMPREF1076_01811 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 780

 Score =  259 bits (663), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 234/814 (28%), Positives = 365/814 (44%), Gaps = 157/814 (19%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQM------------------GDLAYGVPRLGLPL 55
           Y  A  P  +R KDL+ RMT+ EKV Q+                   DL Y      +P+
Sbjct: 25  YKQATAPVEDRVKDLIGRMTVEEKVGQLCCPLGWEMYTKTTNGVVASDL-YKERMKTMPI 83

Query: 56  YEWWS-------------------------EALHGVSFIGRRTNSPPGTHFDSEVP---- 86
             +W+                          AL   +    R   P    F  E P    
Sbjct: 84  GSFWAVLRADPWTQKTLETGLNPELSAKALNALQKYAVEETRLGIP--VLFAEECPHGHM 141

Query: 87  --GATSFPTVILTTASFNESLWKKIGQTVSTEARAM-YNLGNAGLTFWSPNINVVRDPRW 143
             G T FPT +   +++N  L  ++G+ ++ EAR+   N+G      + P +++ R+PRW
Sbjct: 142 AIGTTVFPTSLSQASTWNAELMHRMGEAIALEARSQGANIG------YGPVLDIAREPRW 195

Query: 144 GRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWE 203
            R+ ET GEDP +     + +++G+Q          +D + L   +  KH+AAY +    
Sbjct: 196 SRMEETFGEDPVLTTHLGVAFMKGMQG------KSQNDGKHL--YSTLKHFAAYGIPEAG 247

Query: 204 GNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIR 263
            N      + V  + +   ++ PF+  V EG V+++M SYN ++G+P  ++  LL   +R
Sbjct: 248 HNGA---RANVGMRQLFSDYLPPFKKAVEEG-VATIMTSYNTIDGVPCTSNKYLLTDVLR 303

Query: 264 GDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQ 322
             W F G++ SD  SI+ IV + +   D KE AV   LKAGLD+D G + Y      A++
Sbjct: 304 DQWGFKGFVYSDLTSIEGIVGA-RVAKDNKEAAVL-ALKAGLDMDLGGNAYGKNLQKALE 361

Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLL 382
           +G I   D++ ++  +  +  R+G F+         K  + +  H ELA E AR+GIVLL
Sbjct: 362 EGAITMDDLNRAVANVLRLKFRMGLFENPYVSPEQAKQVVRSKAHKELAREVAREGIVLL 421

Query: 383 KNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR--YTSPMDGF---YAYSKVI 437
           KN+ G LPL   NI  +A++GP+A+     +G+Y     R    + +DG     + S  +
Sbjct: 422 KNE-GVLPLKK-NIGNIAVIGPNADMMYNQLGDYTAPQEREEIVTVLDGIRKAVSPSTKV 479

Query: 438 NYAPGCA--DIVCQNNSMIPAAI------------DAAKNADATVIVAGL-DLSVEA--- 479
           NY  GCA  DI   N +    A              +A++     I  G  D+S +    
Sbjct: 480 NYVKGCAIRDITTSNITAAVEAARAADAVVLVVGGSSARDFKTKYIGTGAADVSNDGNQL 539

Query: 480 -------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
                  EG DR  L L G Q +L+  VA   K P+ ++ +    +++N A  + K +++
Sbjct: 540 LSDMDCGEGYDRSTLRLLGDQEKLLKAVAATGK-PLVVIYIQGRTLNMNLA--SEKAQAL 596

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG--RTY 590
           L   YPGE+GG AIADV+FG YNP GRLP++        +P +   L P+    G  R Y
Sbjct: 597 LTAWYPGEQGGTAIADVLFGDYNPAGRLPVS--------VPRSEGQL-PLFYSQGKQRAY 647

Query: 591 KFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
              +G  +Y FGYGLSYT+F Y      K                               
Sbjct: 648 VEEEGTPLYAFGYGLSYTKFDYSQLEMQKG------------------------------ 677

Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVFIAAGQS 708
                KD   T    V N G  DG EVV +Y   K   ++ + I  +  +ER+ +  G+S
Sbjct: 678 ---NGKDVLQTVSCTVTNTGDCDGEEVVQLYICDKVASVSQSPI-LLKAFERISLKKGES 733

Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            KV FT+   + L + +     ++  G   ++VG
Sbjct: 734 KKVTFTLGE-EELSLYNMEMKQVVEPGDFKVMVG 766


>gi|329957143|ref|ZP_08297710.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
 gi|328523411|gb|EGF50510.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
          Length = 803

 Score =  259 bits (663), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 231/813 (28%), Positives = 366/813 (45%), Gaps = 153/813 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WS------ 60
           Y D   P  +R  DL+ +M++ EK  Q+  L YG  R+    LP+  W    W       
Sbjct: 44  YEDPSQPVEKRVADLLSQMSVEEKTCQLATL-YGYGRVLKDSLPVAGWKNEIWKDGIANI 102

Query: 61  -EALHGVS--------------------------FIGRRTNSPPGTHFDSEVPG-----A 88
            E L+GV                           F+       P    +  + G     A
Sbjct: 103 DEMLNGVGKKSALVPDLLYPFSNHAEAVNTVQRWFVEETRLGIPVDFTNEGIHGLNHTKA 162

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVL 147
           T  P  I   +++N+ L ++ G     EA+A+      G T  ++P ++VVRDPRWGR L
Sbjct: 163 TPLPAPIAIGSTWNKELVRRAGVIAGQEAKAL------GYTNVYAPILDVVRDPRWGRTL 216

Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
           E  GE+P+++       V G+Q  +GV             +A  KHYA Y +     +  
Sbjct: 217 ECYGEEPFLIAALGTEMVNGIQS-QGV-------------AATLKHYAVYSVPKGGRDGH 262

Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
              D  V  +++ E F+ PF+  +       VM SYN  +G+P  A    L + +R ++ 
Sbjct: 263 CRTDPHVAPRELHELFLYPFKKVIQNSHPMGVMSSYNDWDGVPVSASYYFLTELLREEYG 322

Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGA------- 320
           F GY+VSD  +++  VES   + DT ++AV +VL+AGL++      T+FT  +       
Sbjct: 323 FDGYVVSDSQAVE-FVESKHHVADTYDEAVRQVLEAGLNV-----RTHFTPPSDFILPIR 376

Query: 321 --VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLG-KNNICNP-QHIELAAEAAR 376
             +++ KI+ A ID  +  +  V  RLG FD  P   + G  +N+    ++++   E  +
Sbjct: 377 RLLEEKKISMATIDKRVSEVLRVKFRLGLFD-RPYVTDTGAADNVGGADRNMDFVKEMQQ 435

Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK- 435
           Q +VLLKN+N  LPL+   IK + + GP A+    M   Y        + + G  AY + 
Sbjct: 436 QALVLLKNENNILPLDKQRIKKVLVTGPLADEDNFMTSRYGPNGLETVTVLAGLRAYLQG 495

Query: 436 --VINYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
              ++YA GC DIV                +    I  A+  A  +D  + V G D    
Sbjct: 496 VAEVDYAKGC-DIVDAGWPATEILPVPMNEREKRGIAEAVAKAGESDVVIAVLGEDEYRT 554

Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
            E + R  L LPG Q +L+  +    K PV LV+++   + +N+A  N  I +IL   +P
Sbjct: 555 GESRSRTSLDLPGRQQQLLEALHATGK-PVILVLINGQPLTVNWA--NAYIPAILESWFP 611

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITW------YEANYVKIP--YTSMPLRPVNNFPGRTY 590
           G +GG  IA+ +FG++NPGG+L +T+       E N+   P  + S P +   N  G T 
Sbjct: 612 GCQGGTVIAETLFGEHNPGGKLTVTFPKSVGQIELNFPFKPGSHGSQP-KSGPNGSGATR 670

Query: 591 KFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
              +   +YPFG+GLSYT F Y         D+++   +Q     YTV  N         
Sbjct: 671 VIGE---LYPFGFGLSYTTFAYS--------DLEVSPLRQRTQGEYTVKVN--------- 710

Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSA 709
                          V N GK  G EVV +Y +       T+  Q+ G+ERV +  G++ 
Sbjct: 711 ---------------VTNTGKRAGDEVVQLYVRDKVSSVITYDSQLRGFERVSLKPGETR 755

Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +V F++   + L+I+D   N  +  G   +++G
Sbjct: 756 QVTFSLKP-EDLQILDRNMNWTVEPGEFEVMIG 787


>gi|399578325|ref|ZP_10772073.1| glycoside hydrolase family 3 domain protein [Halogranum salarium
           B-1]
 gi|399236488|gb|EJN57424.1| glycoside hydrolase family 3 domain protein [Halogranum salarium
           B-1]
          Length = 778

 Score =  259 bits (663), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 226/803 (28%), Positives = 360/803 (44%), Gaps = 134/803 (16%)

Query: 24  RAKDLVERMTLPEKVQQMGD---------------------LAYGVPRL-------GLPL 55
           R  DL+ERMTL EK  Q+G                      L+ G+  L        LP 
Sbjct: 19  RVADLLERMTLAEKAAQLGSVNAEKLLTDDGTLDEDAVDEHLSAGIGHLTRIGGEGSLPP 78

Query: 56  YEWWSEALHGVSFIGRRTN-SPPGTHFDSEV-----PGATSFPTVILTTASFNESLWKKI 109
            E         +++   T    P T  +  +     P AT+FP +I   ++++  L + +
Sbjct: 79  REAAERTNELQTYLREETRLGIPATPHEECLSGYMGPEATTFPQMIGMASTWSPELLETV 138

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
             T+  +  A   +G A     SP ++V RD RWGRV ET GEDPY+V   A  YV GLQ
Sbjct: 139 TGTIREQLEA---IGTA--HALSPVLDVARDLRWGRVEETFGEDPYLVAAMACGYVGGLQ 193

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                    D D     ISA  KH+  +      G +R   +  +  ++++ET + PFE 
Sbjct: 194 G--------DGDG----ISATLKHFVGHSAGEG-GKNRSSVN--IGRRELRETHMFPFEA 238

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            +   D  SVM +Y+ V+GIP  +D  LL   +RG+W F G +VSD  S++ +   H   
Sbjct: 239 TIRTADAESVMNAYHDVDGIPCASDEWLLTDVLRGEWGFDGTVVSDYYSVEFLRSEHGVA 298

Query: 290 NDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
            D +E  VA V +AG+D++    D Y    + AV+ G ++EA +D S+R +  +    G 
Sbjct: 299 ADEQEAGVAAV-EAGIDVELPYTDCYGEHLVDAVEAGVLSEATLDESVRRVLRMKAEKGL 357

Query: 348 FDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
            D +              +  +L   AAR+ + LLKN++  LPL   +  ++A+VGP A+
Sbjct: 358 LDDATVDPETAAEPFGTEEADDLTTRAARESMTLLKNEDDLLPLVGDDTDSVAVVGPKAD 417

Query: 408 ATKAMIGNYEGTPCRY---------TSPMDGFYA----YSKVINYAPGCADIVCQNNSMI 454
             + ++G+Y   P  Y         T+P+D   A    Y   + +  GC     + +   
Sbjct: 418 DAQELMGDY-AYPAHYPEEEVEFDATTPLDALRARGEEYGFDVLHEQGCTTTGPETDGFD 476

Query: 455 PAAIDAAKNADATVIV---AGLDLS-------------VEAEGKDRVDLLLPGFQTELIN 498
            AA  A+    A   V   + +D S                EG D VDL LPG Q EL+ 
Sbjct: 477 AAAHAASDADVALAFVGARSAVDFSDSDRERVNMPSVATSGEGCDVVDLGLPGVQAELVG 536

Query: 499 KVADAAKGPVTLVIMSAGAVDI-NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG 557
           ++ +    P+ +V++S     I + A++ P +    W+  PGE GG  +A V+FG++NPG
Sbjct: 537 RLGE-TDTPLVVVVVSGKPHSIESIAESVPAVVQA-WL--PGERGGEGVASVLFGEHNPG 592

Query: 558 GRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVAS 616
           G LP++       + + Y   P     N     Y + +   +YPFG+GLSYT+F+Y    
Sbjct: 593 GHLPVSIPRSVGQLPVHYNRKP-----NTANEEYVYTESDPLYPFGHGLSYTEFEYG--- 644

Query: 617 SPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSE 676
                D+ L  ++             PP   V            T  + VEN G   G +
Sbjct: 645 -----DLTLSTEE------------LPPAGTV------------TATVTVENTGDRAGHD 675

Query: 677 VVMVYSKP--PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS 734
           VV +Y++   P  A   +++++G+ERV + AG++ +V F + A   L   D   +  +  
Sbjct: 676 VVQLYARAVNPDQA-RPVQELVGFERVRLEAGETVQVEFEV-AADQLAYHDRDMDLAVEE 733

Query: 735 GAHTILVGEGVGGVSFPLQLNLN 757
           G +   VG     ++    L + 
Sbjct: 734 GPYEFRVGHSAADITSTASLAVT 756


>gi|404405497|ref|ZP_10997081.1| glycoside hydrolase family protein [Alistipes sp. JC136]
          Length = 804

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 225/814 (27%), Positives = 349/814 (42%), Gaps = 154/814 (18%)

Query: 13  PYCDAKLPYPERAKDLVERMTLPEKVQQMGDL-AYG-VPRLGLPLYEW----WSEALHGV 66
           PY D      ER +DL+ +MTL EK  Q+  L  YG V R  LP   W    W + +  +
Sbjct: 46  PYEDPARSLDERVEDLLGQMTLEEKSCQLATLYGYGRVLRDSLPTERWKNEVWKDGIANI 105

Query: 67  ----SFIGRRTNSPPGTHFDSEVPG----------------------------------- 87
               + +G+   + P  H  S+  G                                   
Sbjct: 106 DEMLNGVGKCLRTTP--HLVSDYTGHVEAKNTIQRWFVEQTRLGIPVEFTNEGIHGLNHS 163

Query: 88  -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
            AT  P  I   +++N +L  + G+    EAR    LG   +  ++P ++V RDPRWGRV
Sbjct: 164 RATPLPAPIAIGSTWNRALVHRAGEIAGHEARV---LGYKNV--YAPILDVARDPRWGRV 218

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
           +E  GEDP+++    +  VRG+Q  +GV             ++  KHYAAY +     + 
Sbjct: 219 VECYGEDPFLIAELGVEMVRGIQS-QGV-------------ASTLKHYAAYSVPKGGRDG 264

Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
               D  +  +++ + ++ PF   + E     VM SYN  +G+P  A    L   +R ++
Sbjct: 265 NCRTDPHIAPRELHQMYLYPFRRVIRESGPMGVMSSYNDWDGVPVTASRYFLTDLLRHEY 324

Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGA------ 320
            F GY+VSD ++++ +   H  + +T EDAV +VL+AGL++      TNF+  A      
Sbjct: 325 GFDGYVVSDSEAVEYVHTKHA-VAETYEDAVRQVLEAGLNV-----RTNFSPPARFILPV 378

Query: 321 ---VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQ 377
              V++G+++   +D  +R +  V  RLG FD                +H +   +  RQ
Sbjct: 379 RKLVREGRLSMEVVDQRVREVLRVKFRLGLFDNPYNDPREAVAEAGADKHRDFVLDIQRQ 438

Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY---S 434
            +VLLKN++  LPL+      + + GP A+    MI  Y        + +DG   Y    
Sbjct: 439 SLVLLKNEDKTLPLDKKKTARVLVAGPLADEDNFMISRYGPNDLPTVTVLDGIRNYLGDG 498

Query: 435 KVINYAPGC--------------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
             + YA GC                +     + I  A+  A   D  V V G D     E
Sbjct: 499 AEVRYAKGCDVVDAGFPDSELTATPLTAAERAGINEAVKQAAGCDVIVAVLGEDDERVGE 558

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
              R  L LPG Q +L+  +  A   PV LV+++   + +N+A  N  + +IL   +P  
Sbjct: 559 SHSRTSLELPGRQQQLLEAL-HATGVPVVLVLINGQPLTVNWAAQN--VPAILEGWFPSV 615

Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEAN--------YVKIPYTSMPLRPVNNFPGRTYKF 592
           EGG AIA+ +FG YNPGG+L IT+  +         Y K  + + P +  N   G   + 
Sbjct: 616 EGGTAIAETLFGDYNPGGKLTITFPRSTGQIELNFPYKKGSHGAQPRKGPNG--GGVTRV 673

Query: 593 FDGPVVYPFGYGLSYTQFKYK---VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
                +YPFGYGLSYT F YK   +A  P                + T G+ +  C    
Sbjct: 674 LGS--IYPFGYGLSYTTFAYKNLRIAPEP----------------SRTQGSFRVSC---- 711

Query: 650 IDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQS 708
                          EV N G   G EVV +Y S       T+   + G+ERV +  G++
Sbjct: 712 ---------------EVTNTGDRRGDEVVQLYISDKFSSVVTYESVLRGFERVTLEPGET 756

Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
             V F +     L+++D+  N  +  G   I +G
Sbjct: 757 KTVSFEVTPSH-LELLDSNMNWTVEPGEFEIRIG 789


>gi|409198206|ref|ZP_11226869.1| beta-glucosidase [Marinilabilia salmonicolor JCM 21150]
          Length = 775

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 195/658 (29%), Positives = 329/658 (50%), Gaps = 82/658 (12%)

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVL 147
           T+FP  +    S++  L +K  +  + EA A      +G+ + ++P I++ RDPRWGRV+
Sbjct: 129 TTFPIPLAEACSWDLELMEKSARIAAEEATA------SGVAWNFAPMIDIGRDPRWGRVM 182

Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
           E  GED Y+  + A   V G Q   G+E + D  S+   + A  KH+  Y      G D 
Sbjct: 183 EGAGEDVYLATQVARARVIGFQ---GIEDYTDL-SQSNTMMATSKHFVGYGA-ALAGRDY 237

Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
              D  ++E+++ ETF+ PF+  V+EG V+S M ++N +NG+P   +  L  + +R  W 
Sbjct: 238 QSVD--MSERELHETFLPPFKATVDEG-VASFMTAFNDLNGVPCTGNQYLFKEILRDRWG 294

Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQGKI 326
           F G +V+D  +I  +V +H F  D K  A    + AG+D+D   + +       V++G +
Sbjct: 295 FGGMVVTDYTAIMEMV-AHGFAKDLKH-AAELAIDAGIDMDMISEAFVTHLKELVEEGDV 352

Query: 327 AEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN--ICNPQHIELAAEAARQGIVLLKN 384
           +E  ID ++  +  +   LG FD   +Y +  +    + NP+H++ A EAA++ IVLLKN
Sbjct: 353 SEEQIDVAVSRILEMKFLLGLFDDPFRYFDAERQQEVVMNPEHLKTAREAAQRSIVLLKN 412

Query: 385 DNGALPLNTGNIKTLALVGPHANATKAMIGNY--EGTPCRYTSPMDGF---YAYSKV-IN 438
           +   LPL+    K +AL+GP     +++ G +  +G   +  + ++G    Y  S+V   
Sbjct: 413 EGNVLPLDKNTSKRVALIGPFVKERESLNGEWAIKGDRNKSVTLLEGLEEKYDGSRVEFT 472

Query: 439 YAPGCA----DIVCQNNSM--------IPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
           YA G      D   Q  S+           A++ A+N+D  ++  G +     E   R D
Sbjct: 473 YAQGTTLPLIDRSTQKVSVTEVPDRRGFAEAVNVARNSDVIMVAMGENYHWSGEAASRTD 532

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           + LPG Q EL+ ++    K P+ LV+ +   +D+++ + N  + +I+   YPG   G A+
Sbjct: 533 ITLPGNQRELLKELKKTGK-PIVLVLFNGRPLDLSWEEEN--VDAIVEAWYPGMMSGHAV 589

Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIP-YTSMPL--RPVNNFPGRTYK--FFDGP--VVY 599
           AD++ G YNP  +L +T +  N  +IP + +M    RP +      Y+  + D P   ++
Sbjct: 590 ADILSGDYNPSAKLVMT-FPRNVGQIPIFYNMKNTGRPFDAEHPADYRSSYIDSPNTPLF 648

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           PFGYGLSYT F+Y  A       I  DK Q    +                         
Sbjct: 649 PFGYGLSYTTFEYANAK------ISSDKFQSGSSL------------------------- 677

Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
            T  +EV N G +DG EVV +Y +   G     +K++ G+E++ + AG++  V F+++
Sbjct: 678 -TASVEVTNTGDLDGEEVVQLYLRDRVGSVVRPVKELKGFEKIHLKAGETKTVEFSID 734


>gi|242206820|ref|XP_002469265.1| hypothetical protein POSPLDRAFT_51213 [Postia placenta Mad-698-R]
 gi|220731725|gb|EED85567.1| hypothetical protein POSPLDRAFT_51213 [Postia placenta Mad-698-R]
          Length = 312

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 138/295 (46%), Positives = 172/295 (58%), Gaps = 21/295 (7%)

Query: 15  CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
           CD      ERA  L+   TL EK+   G+ A GVPRLGLP Y+WW EALHGV+       
Sbjct: 34  CDTSATPLERATALISLFTLEEKINNTGNTAPGVPRLGLPAYQWWQEALHGVA------- 86

Query: 75  SPPGTHF--DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
             PG  F    E   ATSFP  IL  A+F+++L   +   VSTEARA  N   +G+ FW+
Sbjct: 87  ESPGVIFAPSGEYSYATSFPQPILMGAAFDDALINHVATIVSTEARAFNNANRSGIDFWT 146

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           PNIN  +DPRWGR  ETPGEDP+ +  Y  N + GLQ     EY R        I A CK
Sbjct: 147 PNINPFKDPRWGRGQETPGEDPFHLQSYVYNLITGLQGGLDPEYKR--------IVATCK 198

Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           H+AAYDL+NWEGN R+ FD+ V+ QD+ E +   F  C  + +V S MCSYN VNG+P+C
Sbjct: 199 HFAAYDLENWEGNVRYGFDALVSLQDLSEFYTRSFRTCARDANVGSFMCSYNAVNGVPSC 258

Query: 253 ADPKLLNQTIRGDW---NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
           A+  LL   +R  W   N   YI SDCD+IQ I E H +   T+ + VA  L AG
Sbjct: 259 ANSYLLQDILRDHWGWTNEDQYITSDCDAIQNIYEPH-YYTATRAETVADALNAG 312


>gi|86143269|ref|ZP_01061671.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
 gi|85830174|gb|EAQ48634.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
          Length = 873

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 158/422 (37%), Positives = 224/422 (53%), Gaps = 48/422 (11%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           FP+ + +L    R  DLV RMTL EK+ Q+   A  + RL +P Y WW+E+LHGV+  G 
Sbjct: 24  FPFQNEQLDLETRLNDLVSRMTLEEKISQLMSDAPAIERLNIPKYNWWNESLHGVARAGY 83

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG------- 124
                           AT FP  I   AS++  L +++   +S EARA ++         
Sbjct: 84  ----------------ATVFPQSISIAASWDAQLVREVATAISDEARAKHHEYLRRDQHD 127

Query: 125 -NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
              GLT WSPNIN+ RDPRWGR  ET GEDP++ G     YV+GLQ           D  
Sbjct: 128 IYQGLTMWSPNINIFRDPRWGRGHETYGEDPFLTGTLGAQYVKGLQ---------GDDPE 178

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LK+ A  KH+A +         R +FD+  +E+D+ ET++  F M V +  V SVM +Y
Sbjct: 179 YLKVVATAKHFAVHSGPE---ESRHYFDANTSERDLWETYLPAFRMLVKDAQVQSVMTAY 235

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           NR  G    ++ KLL   +R  W F GY+VSDC +I  I E HK +      A A  L+ 
Sbjct: 236 NRFRGEAASSN-KLLFDILRNKWGFDGYVVSDCGAINDIWEDHK-ITADAASASALALET 293

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNI- 362
           G DL+CG  Y +    A+  G I E  I+ ++  L+   ++LG FD     +NL    I 
Sbjct: 294 GTDLNCGATYKSLKE-AIANGLITEEKINIAIERLFRARLKLGMFDTE---ENLSYATIP 349

Query: 363 ----CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
                N  H  LA +AA++ IVLLKN+   LPL + ++K +A++GP+A+  +++ GNY G
Sbjct: 350 FSVNTNASHTALARKAAQESIVLLKNEAHMLPL-SKDLKQIAVIGPNAHNVQSLWGNYNG 408

Query: 419 TP 420
           TP
Sbjct: 409 TP 410



 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 94/302 (31%), Positives = 152/302 (50%), Gaps = 54/302 (17%)

Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAEGK----------DRVDLLLPGFQTELINKVADA 503
           +  A++ A+++D T++V GL+  +E E            DR  L LP  Q EL+  +   
Sbjct: 589 LERAVNLAEDSDVTILVLGLNERLEGEEMRIDVEGFSKGDRTALDLPLEQRELMRALVAT 648

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K P+ LV+++  A+ IN+A+ +  + +IL  GYPG+EGG AIADV+FG YNP GRLP+T
Sbjct: 649 GK-PIVLVLLNGSALAINYAQEH--VPAILSAGYPGQEGGNAIADVLFGDYNPAGRLPVT 705

Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
           +Y++         +P     +  GRTY++F+G  +YPFGYGLSYTQF Y    +   +  
Sbjct: 706 YYKS------VDDLPDFEDYSMKGRTYRYFEGEALYPFGYGLSYTQFSYDAIKTSGRL-- 757

Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
                                            D     Q+ V N G  DG EVV +Y K
Sbjct: 758 -------------------------------AADKVLNVQVTVTNSGDRDGDEVVQLYLK 786

Query: 684 PPGIAGTHIK-QVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
               + T  + Q++G++R+ +  G++  V F ++A +   ++++    ++  G  T+  G
Sbjct: 787 DEVASTTRPQVQLVGFKRIHLQKGETQTVEFRLDA-RQFSMINDQEQLVVEPGWFTLYAG 845

Query: 743 EG 744
            G
Sbjct: 846 GG 847


>gi|373952814|ref|ZP_09612774.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373889414|gb|EHQ25311.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 862

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 156/429 (36%), Positives = 228/429 (53%), Gaps = 38/429 (8%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY +  L    RAKDLV R+TL EKV  M D++  VPRLG+  + WWSEALHG +    
Sbjct: 22  LPYQNPALSSEARAKDLVTRLTLKEKVGLMKDVSEAVPRLGIKKFNWWSEALHGYA---- 77

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
             N  P           T FP  +   ASF++     +   VS EARA  N         
Sbjct: 78  --NQGP----------VTVFPEPVGMAASFDDQKLFHVFDAVSDEARAKNNEYRKQVESQ 125

Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               L+ W+PN+N+ RDPRWGR  ET GEDPY+  R  ++ V+GLQ          +D++
Sbjct: 126 RFHDLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQG--------PADAK 177

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
             K+ AC KHYA +    W  ++    D  VT +D+ ET++  F+  V + DV  VMC+Y
Sbjct: 178 YRKLLACAKHYAVHSGPEWSRHEMNVTD--VTPRDLWETYLPAFKSLVQDADVREVMCAY 235

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
            R++  P C + +LL Q +R DW F   +VSDC +I     SH   +D    A A+ + +
Sbjct: 236 QRLDDEPCCGNSRLLGQILREDWGFKYLVVSDCGAITDFYNSHHSSSDATH-ASAKAVLS 294

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNN 361
           G D++C  Y  +    AV +G I E DI+TS+  L      LG  D      +  +  + 
Sbjct: 295 GTDVECVGYAFDKIPDAVYRGLIKEKDINTSVVRLMTQRFELGEMDKDELVPWTKIPLSV 354

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           + +  H +LA + AR+ + LL+N+N  LPL + +I  LA++GP+AN ++ + GNY GTP 
Sbjct: 355 VNSEDHQKLALDMARETMTLLQNNNNILPL-SKSIGKLAVIGPNANDSQMLSGNYNGTPL 413

Query: 422 RYTSPMDGF 430
           R  + ++G 
Sbjct: 414 RTINILEGI 422



 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 86/298 (28%), Positives = 134/298 (44%), Gaps = 55/298 (18%)

Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
           I+  K+AD  V V G+   +E E          G DR D+ LP  Q   I  +  A K  
Sbjct: 593 IEKVKDADIVVFVGGISPKLEGEEMPVQLPGFKGGDRTDIELPAVQRNCIEALRKAGK-- 650

Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
             +V ++     I          +IL   Y GE GG+A+ADV+FG YNP G LP+T+Y  
Sbjct: 651 -KIVFVNCSGSAIAMVPETQNCDAILQAWYAGESGGQAVADVLFGDYNPSGHLPVTFYR- 708

Query: 568 NYVKIP-YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
           N  ++P ++   ++      GRTY++     ++PFG+GLSYT F    A        KL 
Sbjct: 709 NVQQLPDFSDYSMK------GRTYRYLKSAPLFPFGFGLSYTTFNIGEA--------KLT 754

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
           K+      N T G                       ++ V N GK DG+E++ VY +   
Sbjct: 755 KN------NITKGE------------------AIQLRVPVANAGKTDGTELLQVYIRKVD 790

Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVD-NAANSLLASGAHTILVGE 743
                 K + G++R+ ++AG++  V   +   K+ +  D   A   ++ G + +L GE
Sbjct: 791 DPDGASKTLRGFKRIPVSAGKTEMVTLDL-PPKTFEFFDPTDAVVRVSPGNYQLLYGE 847


>gi|389696043|ref|ZP_10183685.1| beta-glucosidase-like glycosyl hydrolase [Microvirga sp. WSM3557]
 gi|388584849|gb|EIM25144.1| beta-glucosidase-like glycosyl hydrolase [Microvirga sp. WSM3557]
          Length = 751

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 210/746 (28%), Positives = 349/746 (46%), Gaps = 94/746 (12%)

Query: 24  RAKDLVERMTLPEKVQQMGDLAYGVP---------RLGLPLYEWWSEALHGVSFIGRRTN 74
           R  +L+ RMTL EKV Q+  +++G P         + G  L    +E +     + R ++
Sbjct: 39  RVNELLGRMTLEEKVGQLNLVSHGPPLRWEDISEGKAGAVLNFNSAEDVARAQALVRESH 98

Query: 75  SPPGTHFDSEVPGA--TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
                 F  +V     T FP  +   A+F+  + +   +  + EA  +        TF +
Sbjct: 99  LKIPLLFGLDVLHGFRTQFPLPLGEAAAFSPRVSRLASEWAAREASYV----GVNWTF-A 153

Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
           P  ++ RD RWGR++E  GEDP +        V G               R   ++A  K
Sbjct: 154 PMADLSRDSRWGRIVEGFGEDPTLGAALTAARVEGF--------------RKGGLAAAAK 199

Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
           H+A Y         R +  + +   +M +T++ PF   V  G  +S M ++N +NG P+ 
Sbjct: 200 HFAGYGAPQ---GGRDYDTTYIPRAEMYDTYLPPFRAAVEAG-TASFMAAFNALNGEPST 255

Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC-GD 311
           A+P LL   +R  W F G++ SD   I  +V +H    D  E A   +L AG+D+D  G 
Sbjct: 256 ANPWLLTDVLRTQWGFDGFVTSDWVGIGELV-NHGIAADGAEAARKAIL-AGVDMDMMGQ 313

Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELA 371
            Y N     V+ G++ E+ ID S+R +     RLG FD      +   +   +P+  + A
Sbjct: 314 LYINHLPDEVRAGRVPESVIDESVRRVLRTKFRLGLFDRPDVDSSHLDSEFPSPESRQAA 373

Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY--EGTPCRYTSPMDG 429
            E AR+  VLL+N +  LP+ +  ++++A+VGP A+A +  +G +   G      + ++G
Sbjct: 374 REVARETFVLLQNRDDVLPIPS-KVRSIAVVGPLADAPQDQMGPHAARGHKEDSVTILEG 432

Query: 430 FYAYSK----VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
               ++     + +APGC D+ C+N   +P A++AA+ +D  + V G    +  E   R 
Sbjct: 433 IRRRAQSAGIAVRHAPGC-DLFCRNTDALPGALEAARQSDFVIAVFGEPQELSGEAASRA 491

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
           ++ L G Q E++ ++A   K PV LVIM  G           +I SIL   YPG E G A
Sbjct: 492 NMELNGKQIEVLEELAKTGK-PVALVIM--GGRPQVLGPVADRIPSILMAWYPGTEAGPA 548

Query: 546 IADVIFGKYNPGGRLPITWYEAN------YVKIPYTSMPLRPVNNFPGRTYKFFDGPV-- 597
           +ADV+FG  +P G+LP+TW  A       Y ++P T  P    N F   T  + D  +  
Sbjct: 549 VADVLFGDVSPSGKLPLTWPRATGQLPLYYNRLP-TGRPTLANNRF---TLHYIDESIAP 604

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           +YPFG+GLSYT F Y   S  +    +LD+ Q                            
Sbjct: 605 LYPFGWGLSYTHFAY---SDARIASRQLDEGQ---------------------------- 633

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMN 716
                 ++V+N G  DG EVV +Y++ P  + +  ++++  +E++ + +G++ +V   + 
Sbjct: 634 -VLEVSLDVKNTGARDGQEVVQLYTRDPVASRSRPLRELKAFEKIALKSGETKRVTLRV- 691

Query: 717 ACKSLKIVDNAANSLLASGAHTILVG 742
             +SL    +    L+ +GA  + VG
Sbjct: 692 PVESLGFHLDDGTYLVEAGAIQVFVG 717


>gi|387789382|ref|YP_006254447.1| beta-glucosidase-like glycosyl hydrolase [Solitalea canadensis DSM
           3403]
 gi|379652215|gb|AFD05271.1| beta-glucosidase-like glycosyl hydrolase [Solitalea canadensis DSM
           3403]
          Length = 771

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 211/725 (29%), Positives = 339/725 (46%), Gaps = 131/725 (18%)

Query: 35  PEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTV 94
           PEK+++  +LA    RL +P+  + S+ +HG                       T+FP  
Sbjct: 89  PEKIRKAQELAVNKSRLKIPMI-FGSDVIHG---------------------HKTTFPIP 126

Query: 95  ILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGED 153
           +   AS+N  L +K  Q  + EA A       GL + +SP ++V RDPRWGR+ E  GED
Sbjct: 127 LGLAASWNIELIEKSAQIAAKEATA------DGLNWVFSPMVDVARDPRWGRIAEGSGED 180

Query: 154 PYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSR 213
           PY+    A   V+G Q         ++ S    + AC KH+A Y      G D    D  
Sbjct: 181 PYLGSLIAKAMVKGYQG-------DNTYSSATNLMACVKHFALYGAAE-AGRDYNSVD-- 230

Query: 214 VTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIV 273
           ++ Q M E ++ P++  V  G V SVM S+N V G+P   +  LL   +R  W F+G +V
Sbjct: 231 MSRQKMYEFYLPPYKAAVEAG-VGSVMSSFNEVEGVPATGNQWLLTDLLRKQWGFNGMVV 289

Query: 274 SDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQGKIAEADID 332
           SD  S+  ++E H   N  +  A+A  +KAGLD+D  G+ Y +    ++Q+GK++E DI+
Sbjct: 290 SDYTSVNEMME-HGMGNLQEVSALA--IKAGLDMDMVGEGYLSTLQKSLQEGKVSETDIN 346

Query: 333 TSLRFLYIVLMRLGYFDGSPQYKNLGK--NNICNPQHIELAAEAARQGIVLLKNDNGALP 390
            + R +     +LG F    ++ N  +    I   Q +  + EAA +  VLLKN+   LP
Sbjct: 347 LACRRILEAKYKLGLFSDPYKFINEKRAATEILTTQSLSFSREAATRSFVLLKNEKQVLP 406

Query: 391 L-NTGNIKTLALVGPHANATKAMI------GNYEGTPCRYTSPMDGFYAYSKVINYAPGC 443
           L  TG   T+AL+GP A++ + M+      GN++ +       M+    ++KV+ YA G 
Sbjct: 407 LKKTG---TIALIGPLADSKRNMLGTWAVSGNWKTSVSVKEGLMNAVGTHAKVL-YAKGA 462

Query: 444 ------------------ADIVCQNN-SMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
                              DI  +++  ++  A+  A+ +D  ++  G    +  E   R
Sbjct: 463 NISDDSAFARRVNTFGVEIDIDKRSSKELLDEALSIAQQSDVIIVAVGEAADMSGEAASR 522

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
            D+ +P  Q EL+  +    K PV +V+ +   + +++   N  + +IL V  PG + G 
Sbjct: 523 TDINIPESQKELLKALVQTGK-PVVMVLFNGRPLTLSW--ENEHLNAILDVWAPGHQAGN 579

Query: 545 AIADVIFGKYNPGGRLPITWYEANYVKIPY------TSMPLRPVNNFPGRTYKFFDGPVV 598
           AIADV+FG YNP G++ +T +  N  ++P       T  P    N F  +     D   +
Sbjct: 580 AIADVLFGDYNPSGKITVT-FPKNVGQVPMYYNHKNTGRPYDDRNRFTSKYLDMPDNAPM 638

Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
           YPFGYGLSYT F+Y         D+ +D+D           T KP               
Sbjct: 639 YPFGYGLSYTTFQYG--------DVTIDQD-----------TIKP-------------GE 666

Query: 659 KFTFQIEVENMGKMDGSEVVMVYSK-------PPGIAGTHIKQVIGYERVFIAAGQSAKV 711
             T ++ + N G  DG E V +Y +       PP      +K + G++++ +  G+S  V
Sbjct: 667 TITAKVTITNTGNYDGVETVQLYIQDVIASVAPP------VKTLKGFKQISLKKGESKVV 720

Query: 712 GFTMN 716
            F ++
Sbjct: 721 EFVIS 725


>gi|373460605|ref|ZP_09552356.1| hypothetical protein HMPREF9944_00620 [Prevotella maculosa OT 289]
 gi|371955223|gb|EHO73027.1| hypothetical protein HMPREF9944_00620 [Prevotella maculosa OT 289]
          Length = 858

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 159/444 (35%), Positives = 232/444 (52%), Gaps = 40/444 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +PY +  L    RA+DL+ R+TL EK   M D +  +PRLG+  + WWSEALHG + +G 
Sbjct: 31  YPYQNPNLSALTRAQDLLSRLTLEEKALLMLDESPAIPRLGIKKFFWWSEALHGAANMG- 89

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--LGNAG-- 127
                            T FP  I   ASFN++L  K+    S E RA Y+  + N G  
Sbjct: 90  ---------------NVTVFPEPIAMAASFNDALLYKVFSAASDEMRAQYHHRIRNGGED 134

Query: 128 -----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
                L+ W+PN+N+ RDPRWGR  ET GEDPY+        VRGLQ  E        DS
Sbjct: 135 EKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTAVMGTAVVRGLQGPE--------DS 186

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
           +  K+ AC KHYA +    +  +      + V+ +D+ ET++  F+  V E  V  VMC+
Sbjct: 187 KYRKLWACAKHYAVHSGPEYTRHTANL--NNVSPRDLWETYLPAFKTLVEEAKVREVMCA 244

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           Y  ++  P C + +LL Q +R +W F   +VSDC ++  I ++HK  +D    A A+   
Sbjct: 245 YQALDDEPCCGNSRLLQQILRDEWGFQYLVVSDCGAVSDIWQNHKTSSDAVH-ATAKAAL 303

Query: 303 AGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGK 359
           AG D++CG  YT   +  AVQ+G I+E ++D  +  L      LG  D      +  +  
Sbjct: 304 AGTDVECGFNYTYKCIPEAVQRGLISEKEVDKHVLRLLEGRFDLGEMDDPALVPWSKIPY 363

Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
           + + +  H +L+ + ARQ IVLL+N    LPL   N + +A++GP+A+    M GNY GT
Sbjct: 364 SVMDSKAHRQLSLDMARQSIVLLQNKQNMLPLKKNN-ERIAVIGPNADNVPMMWGNYNGT 422

Query: 420 PCRYTSPMDGFYAYSKVINYAPGC 443
           P R  + +DG  A  K + Y  GC
Sbjct: 423 PNRTVTILDGIRAKHKNVKYIKGC 446



 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 89/298 (29%), Positives = 132/298 (44%), Gaps = 60/298 (20%)

Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
           I A K  +  V V G+  ++E E          G DR D+ LP  Q + I  +  A K  
Sbjct: 602 IRALKGIEKVVFVGGISPALEGEEMPVDIPGFKGGDRTDIELPRVQRDFIKALHAAGK-- 659

Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
             LV ++     I          +I+   Y G+EGG A+ADV+FG YNP G+LP+T+Y+ 
Sbjct: 660 -QLVYVNCSGSAIALEPETTACDAIVQAWYAGQEGGTAVADVLFGDYNPSGKLPVTFYK- 717

Query: 568 NYVKIP-YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
           N  ++P Y +  ++      GRTY++F  P ++ FG+GLSYT F    A   K  D    
Sbjct: 718 NSNQLPDYENYSMK------GRTYRYFSDP-LFAFGHGLSYTTFNMGTAEIIKKAD---- 766

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
                                               +I VEN+G  DG+E V++Y K   
Sbjct: 767 --------------------------------SIVVRIPVENVGSKDGTETVLLYIKNHQ 794

Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVGE 743
                IK + G+ RVF+ AG  A V   +   KS +  D   N++    G + +L G+
Sbjct: 795 DPNGPIKSLRGFSRVFVKAGHKA-VAELLLTRKSFEFFDENTNTVHFKEGNYDLLYGD 851


>gi|325299987|ref|YP_004259904.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
 gi|324319540|gb|ADY37431.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
          Length = 864

 Score =  259 bits (661), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 159/422 (37%), Positives = 221/422 (52%), Gaps = 42/422 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY + KL   ERA DLV R+TL EK   M + +  +PRLG+  Y+WW+EALHGV   G 
Sbjct: 25  LPYQNPKLTPEERANDLVGRLTLEEKASLMQNTSPAIPRLGIKAYDWWNEALHGVGRAGI 84

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
                           AT FP  I   ASF++ L  ++   VS EARA Y          
Sbjct: 85  ----------------ATVFPQTIGMAASFDDELLYQVFTAVSDEARAKYTQFRKEGDLK 128

Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
              GLTFW+PN+N+ RDPRWGR  ET GEDPY+  +  +  VRGLQ  E   Y       
Sbjct: 129 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGMAVVRGLQGPEDAPYD------ 182

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
             K+ AC KH+A +    W   +R  F++  +  +D+ ET++  F+  V +  V  VMC+
Sbjct: 183 --KLHACAKHFAVHSGPEW---NRHEFNAENIAPRDLWETYMPAFKDLVQKAHVKEVMCA 237

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES--HKFLNDTKEDAVARV 300
           YNR+ G P C + +LL   +R +W + G +VSDC +I        H+   D K  A A  
Sbjct: 238 YNRLEGEPCCGNNRLLTHILRDEWGYQGIVVSDCGAISDFWRKGDHETHPD-KAHASAGA 296

Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
           + +G DL+CG  Y +    AV+ G IAE+ +D S++ L      LG  D    +  +  +
Sbjct: 297 VLSGTDLECGSNYKSLPE-AVKAGLIAESQLDISVKRLLKARFELGEMDKDVCWDTIPYS 355

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            +    H +LA   AR+ IVLL+N N  LPL   ++K +ALVGP+AN +    GNY G P
Sbjct: 356 VVDCQAHKDLALRMARESIVLLQNRNNILPLRK-DMK-IALVGPNANDSIMHWGNYNGFP 413

Query: 421 CR 422
             
Sbjct: 414 SH 415



 Score =  116 bits (291), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 84/300 (28%), Positives = 132/300 (44%), Gaps = 53/300 (17%)

Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
           + A  D  K+AD  +   G+  ++E E          G DR  + LP  Q +L+ ++   
Sbjct: 591 LQATADKVKDADVILFAGGISPTLEGEEMPVDAEGFRGGDRTSIELPAIQRQLVGELKKL 650

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K P+  +  S  A+ +  A  +     ++   YPG+ GG AIADV+FG YNP G+LP+T
Sbjct: 651 GK-PIVFINYSGSAMGL--APESEICDGMIQAWYPGQAGGTAIADVLFGDYNPAGKLPVT 707

Query: 564 WYEANYVKIP-YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVD 622
           +Y  N  ++P +    ++      GRTY++     ++ FG+GLSYT F Y  A       
Sbjct: 708 FYR-NTEQLPDFEDYAMK------GRTYRYMTETPLFRFGHGLSYTTFDYGKAR------ 754

Query: 623 IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYS 682
                                     L  +   K    T  I V N G  DG E V VY 
Sbjct: 755 --------------------------LSQNTFSKGETLTLTIPVSNTGTRDGEETVQVYL 788

Query: 683 KPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           + PG A      +  ++RV++  G + ++ FT++    L    +  N  L SG + +L G
Sbjct: 789 RRPGDADAPSHTLRAFKRVYVPKGGTKEIKFTLSDDNFLWFDTSTNNMNLISGEYELLYG 848


>gi|336399403|ref|ZP_08580203.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
 gi|336069139|gb|EGN57773.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
          Length = 757

 Score =  259 bits (661), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 209/753 (27%), Positives = 346/753 (45%), Gaps = 98/753 (13%)

Query: 26  KDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR-------------- 71
           +DL+++MTL EK+ Q+     G    G P     S++L     +G               
Sbjct: 47  RDLIKKMTLTEKIGQLSQYVGGSLLTG-PQSGALSDSLFVRGMVGSILNVGGVESLRKLQ 105

Query: 72  -------RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
                  R   P    FD      T FPT +  + S++      +G    T   A     
Sbjct: 106 EKNMQSSRLKIPVLFAFDVIHGYKTIFPTPLAESCSWD------LGLMFETAKAAAIEAS 159

Query: 125 NAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
            +G+ + ++P +++ RDPRWGR++E  GED Y+  + A   VRG Q   G         +
Sbjct: 160 ASGIHWTFAPMVDIARDPRWGRIVEGAGEDTYLACKIAETRVRGFQWNLG---------K 210

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
           P  + AC KH+ AY      G D    D  ++   + E ++ PF+ CV+ G V + M ++
Sbjct: 211 PNSVYACAKHFVAYGAPQ-AGRDYAPVD--LSLSTLAEVYLPPFKACVDAG-VHTFMSAF 266

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N +NG+P   +  L+   +R  W FHG++VSD +++Q + ++H  + +T  DA      A
Sbjct: 267 NSLNGVPATGNRWLMTDILRNQWKFHGFVVSDWNAVQEL-KAHG-VAETDTDAALMAFDA 324

Query: 304 GLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLG--KN 360
           G+D+D  D  Y      AV +GK+    IDTS+  +      LG FD   ++ ++   + 
Sbjct: 325 GVDMDMTDGLYNRCLEKAVCEGKLDMQAIDTSVERILRAKYALGLFDDPYRFLDVKRERR 384

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE--G 418
            I +    +LA +AA   +VLLKND+  LPL+  + K +AL+GP A+    ++G+++  G
Sbjct: 385 EIRSEAVTKLARKAAASSMVLLKNDHATLPLSK-HTKRIALIGPLADNRSEVMGSWKARG 443

Query: 419 TPCRYTSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL 475
                 + +DG          + Y  GC D +  +    PAA +AAK +D  + V G   
Sbjct: 444 EESDVVTVLDGIKKKLGSDVAVTYVQGC-DFLEPSTREFPAAFEAAKQSDVVIAVVGEKA 502

Query: 476 SVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
            +  E + R  L LPG Q  L++ +  A + P+ +V+M+   + +   K + +  ++L  
Sbjct: 503 LMSGESRSRAVLRLPGQQEALLDTLQKAGR-PLVVVLMNGRPLCLQ--KVDRQADALLEA 559

Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYT---SMPLRPVNNFPGRTYKF 592
            +PG + G A+AD++FG   P  +L  T +     +IP         RP +     T + 
Sbjct: 560 WFPGTQCGNAVADILFGDAVPSAKL-TTSFPLTEGQIPNNYNYKRSGRPGDMSHSSTVRH 618

Query: 593 FDGP--VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
            D P   +YPFGYGLSYT F Y     PK  +                            
Sbjct: 619 IDVPNRNLYPFGYGLSYTTFSYGEMQCPKQFN---------------------------- 650

Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSA 709
                 D      ++V N G  DG E+V +Y      +    +K++ G+++VFI  GQ+ 
Sbjct: 651 -----ADGTLQVSVDVTNTGGYDGEEIVQLYVADKVASMVRPVKELKGFQKVFIPKGQTK 705

Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           ++ FT+NA + L   +N+   ++  G   I+VG
Sbjct: 706 RIDFTLNA-RDLGFWNNSMQYIVEPGTFEIMVG 737


>gi|375254464|ref|YP_005013631.1| glycosyl hydrolase family 3, C-terminal domain-containing protein
           [Tannerella forsythia ATCC 43037]
 gi|363407375|gb|AEW21061.1| glycosyl hydrolase family 3, C-terminal domain protein [Tannerella
           forsythia ATCC 43037]
          Length = 775

 Score =  258 bits (660), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 213/729 (29%), Positives = 347/729 (47%), Gaps = 124/729 (17%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+P++ +  E +HG   IG                  T FPT I   +++N +L +K+
Sbjct: 121 RLGIPIF-FAEECMHGHMAIG-----------------TTVFPTSIGQASTWNRTLIEKM 162

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           G  ++ E R+           + P +++ R+PRW RV ET GEDP + G     +VRGLQ
Sbjct: 163 GAAIAHETRS-----QGAHIAYGPVLDLAREPRWSRVEETFGEDPVLSGILGSAFVRGLQ 217

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
              G ++   +D R     +  KH AAY +     N R    +++  +++    +LPFEM
Sbjct: 218 ---GKDF---ADGR--HTYSTLKHLAAYGIPVGGHNGR---QAQIGARELIAEHLLPFEM 266

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            V  G   SVM SYN V+G+P  ++  +L + +RG+W+F+G++VSD  SI+ I  +H+  
Sbjct: 267 AVKAG-AQSVMTSYNAVDGVPCTSNTYILKKILRGEWDFNGFVVSDLGSIEGIATTHRVA 325

Query: 290 NDTKEDAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
            D K  A A  L AG+++D G   YT     A     I+ ++ID ++  +  +   +G F
Sbjct: 326 PDIKH-AAAMALNAGVEMDLGGVAYTRNMEQAHTDSLISMSEIDDAVSRILRLKFEMGLF 384

Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
           +      +     I + +H  LA + A + IVLLKN+   LPL+  NI ++A++GP+A+ 
Sbjct: 385 ESPYVQPSRTTEIIRSKEHNRLARKVAEESIVLLKNNANLLPLSK-NIGSIAVIGPNADN 443

Query: 409 TKAMIGNYEG-TPCRY-TSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKN 463
               +G+Y    P  +  + ++G     + + VI Y  GCA +     S I  A+ AA  
Sbjct: 444 LYNQLGDYTAPQPEEHIVTILEGIRNAVSPTTVIRYVKGCA-VRDTTQSNIDEAVRAANA 502

Query: 464 ADATVIVAG----LDLSVE----------------------AEGKDRVDLLLPGFQTELI 497
           ++A V+V G     D   +                       EG DR  L L G Q +LI
Sbjct: 503 SNAVVLVVGGSSARDFHTKYIETGAATVSSRENELIPDMESGEGYDRKSLTLLGHQEKLI 562

Query: 498 NKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG 557
             +A   K P+ +V +    +++N A  + K  ++L   YPGEEGG A+A+VIFG  NP 
Sbjct: 563 ESIAATGK-PLIMVYIQGRPLNMNLA--DKKASALLTAWYPGEEGGNAVANVIFGDVNPS 619

Query: 558 GRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVA 615
           GRLPI+        +P ++  L PV    G++  + +G    +Y FGYGLSYT F+Y   
Sbjct: 620 GRLPIS--------VPRSTGQL-PVYYSLGKSNDYVEGTSTPLYAFGYGLSYTAFEYG-- 668

Query: 616 SSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGS 675
                 ++ + ++          G N                   T    V N G  DG 
Sbjct: 669 ------NLTISRE----------GGN------------------ITVSCTVTNTGNTDGD 694

Query: 676 EVVMVYSKPPGIAGTHIKQVI--GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA 733
           EVV +Y +   +A   +  V+   + ++ +  G+SA+V F +   + L   +     ++ 
Sbjct: 695 EVVQLYLRDH-VASVSVPPVLLKDFAKISLKKGESARVNFVLTP-EQLAFFNTDLKRVVE 752

Query: 734 SGAHTILVG 742
            G  T+++G
Sbjct: 753 PGEFTVMIG 761


>gi|299141953|ref|ZP_07035087.1| beta-glucosidase [Prevotella oris C735]
 gi|298576415|gb|EFI48287.1| beta-glucosidase [Prevotella oris C735]
          Length = 858

 Score =  258 bits (660), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 159/444 (35%), Positives = 232/444 (52%), Gaps = 40/444 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           +PY +  L    RA+DL+ R+TL EK   M D +  +PRLG+  + WWSEALHG + +G 
Sbjct: 31  YPYQNPNLSALTRAQDLLSRLTLEEKALLMLDESPAIPRLGIKKFFWWSEALHGAANMG- 89

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--LGNAG-- 127
                            T FP  I   ASFN++L  K+    S E RA Y+  + N G  
Sbjct: 90  ---------------NVTVFPEPIAMAASFNDALLYKVFSAASDEMRAQYHHRIRNGGED 134

Query: 128 -----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
                L+ W+PN+N+ RDPRWGR  ET GEDPY+        VRGLQ  E        DS
Sbjct: 135 EKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTAVMGTAVVRGLQGPE--------DS 186

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
           +  K+ AC KHYA +    +  +      + V+ +D+ ET++  F+  V E  V  VMC+
Sbjct: 187 KYRKLWACAKHYAVHSGPEYTRHTANL--NNVSPRDLWETYLPAFKTLVEEAKVREVMCA 244

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           Y  ++  P C + +LL Q +R +W F   +VSDC ++  I ++HK  +D    A A+   
Sbjct: 245 YQALDDEPCCGNSRLLQQILRDEWGFQYLVVSDCGAVSDIWQNHKTSSDAVH-ATAKAAL 303

Query: 303 AGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGK 359
           AG D++CG  YT   +  AVQ+G I+E ++D  +  L      LG  D      +  +  
Sbjct: 304 AGTDVECGFNYTYKCIPEAVQRGLISEKEVDKHVLRLLEGRFDLGEMDDPALVPWSKIPY 363

Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
           + + +  H +L+ + ARQ IVLL+N    LPL   N + +A++GP+A+    M GNY GT
Sbjct: 364 SVMDSKAHRQLSLDMARQSIVLLQNKQNMLPLKKNN-ERIAVIGPNADNVPMMWGNYNGT 422

Query: 420 PCRYTSPMDGFYAYSKVINYAPGC 443
           P R  + +DG  A  K + Y  GC
Sbjct: 423 PNRTVTILDGIRAKHKNVKYIKGC 446



 Score =  114 bits (285), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 88/298 (29%), Positives = 131/298 (43%), Gaps = 60/298 (20%)

Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
           I A K  +  V V G+  ++E E          G DR D+ LP  Q + I  +  A K  
Sbjct: 602 IRALKGIEKVVFVGGISPALEGEEMPVDIPGFKGGDRTDIELPRVQRDFIKALHAAGK-- 659

Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
             LV ++     I          +I+   Y G+EGG A+ADV+FG YNP G+LP+T+Y+ 
Sbjct: 660 -QLVYVNCSGSAIALEPETTACDAIVQAWYAGQEGGTAVADVLFGDYNPSGKLPVTFYK- 717

Query: 568 NYVKIP-YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
           N  ++P Y +  ++      GRTY++F  P ++ FG+GLSYT F    A   K  D    
Sbjct: 718 NSNQLPDYENYSMK------GRTYRYFSDP-LFAFGHGLSYTTFNMGTAEIIKKAD---- 766

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
                                               +I VEN+G  DG+E V++Y K   
Sbjct: 767 --------------------------------SIVVRIPVENVGSKDGTETVLLYIKNHQ 794

Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVGE 743
                IK + G+ RVF+ AG  A     +   KS +  D   N++    G + +L G+
Sbjct: 795 DPNGPIKSLRGFSRVFVKAGHQAVAELVLTR-KSFEFFDENTNTVHFKEGNYDLLYGD 851


>gi|295135338|ref|YP_003586014.1| glycoside hydrolase [Zunongwangia profunda SM-A87]
 gi|294983353|gb|ADF53818.1| glycoside hydrolase family protein [Zunongwangia profunda SM-A87]
          Length = 764

 Score =  258 bits (660), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 222/725 (30%), Positives = 333/725 (45%), Gaps = 130/725 (17%)

Query: 35  PEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTV 94
           PEK++   D A    R+G+PL    S+ +HG                       T+FP  
Sbjct: 89  PEKIRVAQDYAVNDTRMGIPLL-IGSDVIHGYK---------------------TTFPIP 126

Query: 95  ILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGED 153
           + T AS++  + KK  +  + EA A       G+ + +SP +++ RDPRWGR+ E  GED
Sbjct: 127 LGTAASWDMEMIKKTAEIAAQEATA------DGINWNFSPMVDIARDPRWGRIAEGAGED 180

Query: 154 PYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFD-S 212
           PY+  + A   V G        Y  D  ++   + A  KH+A Y      G D    D S
Sbjct: 181 PYLGSQIAKAMVEG--------YQGDDLAKENTMIATVKHFALYGASE-AGRDYNTTDMS 231

Query: 213 RVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYI 272
           RV    M   ++ P++  ++ G   SVM S+N V+G+P   +  LL   +R  W F G++
Sbjct: 232 RVK---MFNEYLPPYKAAIDAG-AESVMSSFNDVDGVPATGNKWLLTDLLRDRWGFEGFV 287

Query: 273 VSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQGKIAEADI 331
            SD  S+  ++ +H   +     A+A  LKAGLD+D  G+ Y      ++ +GK+ EA+I
Sbjct: 288 TSDYTSLNEMI-AHGMGDLQAVSALA--LKAGLDMDMVGEGYLKTLKKSLDEGKVTEAEI 344

Query: 332 DTSLRFLYIVLMRLGYFDGSPQYKNLGK--NNICNPQHIELAAEAARQGIVLLKNDNGAL 389
            T+ R +     +LG FD   +Y +  +   +I + ++   + + A    VLLK D G  
Sbjct: 345 TTAARRILEAKYKLGLFDDPYKYLDESRPEKDILSEENRTFSRKVAAHSFVLLKKDAGVF 404

Query: 390 PLNTGNIKTLALVGPHANATKAMIGNY--EGTPCRYTSPMDGF--YAYSKVINYAPGC-- 443
           PL   N K +AL+GP AN    M+G +   G P      + G    A    + YA G   
Sbjct: 405 PLKK-NAK-IALIGPLANNKNNMLGTWAPTGNPQLSVPVLQGVKNVAPKAKVTYAQGANI 462

Query: 444 ----------------ADIVCQN-NSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
                           A+I   +   M+  A+  AK +D  V V G    +  E   R +
Sbjct: 463 TDDAQLAENINVFGPRAEISETSPEKMLEEALKVAKKSDVIVAVVGEATEMSGEAASRTN 522

Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
           LL+P  Q +LI ++A   K P+ LV+MS     +N ++ +     IL V +PG E G AI
Sbjct: 523 LLIPESQKKLIRELAKTGK-PMALVLMSGRP--LNISEESEMNIDILQVWHPGVEAGNAI 579

Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIP-YTSMPL--RP--VNNFPGRTYKFFDGP--VVY 599
           ADVIFG YNP G++  +W   N  ++P Y +M    RP  V  F     +F D P   +Y
Sbjct: 580 ADVIFGDYNPSGKITASW-PRNVGQVPVYYAMKRTGRPGEVEGFQKFKSEFLDTPNSPLY 638

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           PFGYGLSYT+F+Y         D+K   D+   D     GT                   
Sbjct: 639 PFGYGLSYTEFEYS--------DVKASADELKMD-----GT------------------- 666

Query: 660 FTFQIEVENMGKMDGSEVVMVYSK-------PPGIAGTHIKQVIGYERVFIAAGQSAKVG 712
            T    + N G  DG EVV +Y         PP      +KQ+IG+E++ +  G+S  V 
Sbjct: 667 LTLSAIITNTGDYDGEEVVQLYIHDKVRSITPP------MKQLIGFEKIMLKKGESKTVT 720

Query: 713 FTMNA 717
           F ++A
Sbjct: 721 FEISA 725


>gi|451821117|ref|YP_007457318.1| periplasmic beta-glucosidase BglX [Clostridium
           saccharoperbutylacetonicum N1-4(HMT)]
 gi|451787096|gb|AGF58064.1| periplasmic beta-glucosidase BglX [Clostridium
           saccharoperbutylacetonicum N1-4(HMT)]
          Length = 750

 Score =  258 bits (660), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 218/721 (30%), Positives = 335/721 (46%), Gaps = 96/721 (13%)

Query: 36  EKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVI 95
           EK  ++  +A    RLG+P+  +  + +HG                       T FP  +
Sbjct: 95  EKSNELQKIAVEESRLGIPIL-FGLDVIHGYR---------------------TIFPIPL 132

Query: 96  LTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGEDP 154
               SF+    K+  +  + EA A      AGL + ++P +++ RDPRWGRV E  GEDP
Sbjct: 133 AEACSFDIEKIKESARIAAKEASA------AGLHWTFAPMVDISRDPRWGRVAEGAGEDP 186

Query: 155 YVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRV 214
           Y+    A   V G Q         +S   P  I AC KH+A Y   +  G D    D  +
Sbjct: 187 YLGSVIAKARVEGFQG--------ESLDNPESILACAKHFAGYGAPDG-GRDYNTVD--M 235

Query: 215 TEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVS 274
           + Q + + ++ PF+     G V + M ++N +NGIP   +  LL   +R  + F+G++VS
Sbjct: 236 SLQTLHDVYLPPFKAAAEAG-VGTFMSAFNDLNGIPCTVNKYLLTDVLREKFGFNGFVVS 294

Query: 275 DCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD-YYTNFTMGAVQQGKIAEADIDT 333
           D +SI  +V  H +  D K  A  + L AGLD+D     Y N     V++G I E  +D 
Sbjct: 295 DANSIPEVV-VHGYAEDNKA-ASKKALNAGLDMDMSQGTYRNELPELVKEGDILEEVLDE 352

Query: 334 SLRFLYIVLMRLGYFDG--SPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPL 391
           ++R +  V   LG FD       K   K  +C  +H+E A + +R+ IVLLKN+N ALPL
Sbjct: 353 AVRRVLRVKFLLGLFDNPYRTDAKKEEKTLLCK-EHLEAARDISRRSIVLLKNENNALPL 411

Query: 392 NTGNIKTLALVGPHANATKAMIGNYE--GTPCRYTSPMDGFYAYSKV---INYAPGCADI 446
              ++K +A+VGP A     M+G +   G P    + + G  A       I YA GC  I
Sbjct: 412 KK-DLKKIAVVGPLAENAAEMLGTWSHTGNPSDVVTIISGIKAAVSTETEILYAEGC-KI 469

Query: 447 VCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG 506
             +       A+  AK +D  + V G +  +  E   R+D+ LPG Q EL+ ++    K 
Sbjct: 470 TGEECIDFEGAVRVAKESDVIIAVVGENSDMSGEAASRIDINLPGKQEELLKELRKIGK- 528

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-Y 565
           P+ +V+++   + I +   N  + +++     G + G AIADV+FG YNP G+L  T+ Y
Sbjct: 529 PLIVVLINGRPLTIPWEAEN--VDALVEAWQLGTQSGNAIADVLFGDYNPSGKLVATFPY 586

Query: 566 EANYVKIPYTS-MPLRPVNNFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVD 622
               V I Y + M  RP       T K+ DGP   +YPFG+GLSYT FKY+         
Sbjct: 587 SVGQVPIYYNNPMTGRPAGKIK-FTSKYIDGPAEPLYPFGFGLSYTTFKYE--------- 636

Query: 623 IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYS 682
                     +++     NK      + D V  K Y       V N G++ G EVV +Y 
Sbjct: 637 ----------NLSILSAENK------IGDTVAVKVY-------VTNTGEVSGEEVVQLYV 673

Query: 683 KPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
                +    +K++  +E+V +   +   + F +N  K L   D   N ++  G   + V
Sbjct: 674 SDVVASRVRPVKELKSFEKVLLQPKECKTIIFKLN-TKDLGFHDENMNYVVEPGLFKVYV 732

Query: 742 G 742
           G
Sbjct: 733 G 733


>gi|103486503|ref|YP_616064.1| glycoside hydrolase [Sphingopyxis alaskensis RB2256]
 gi|98976580|gb|ABF52731.1| glycoside hydrolase, family 3-like protein [Sphingopyxis alaskensis
           RB2256]
          Length = 772

 Score =  258 bits (660), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 217/731 (29%), Positives = 341/731 (46%), Gaps = 98/731 (13%)

Query: 27  DLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSE--------------------ALHGV 66
           DL+ +MTL EK  Q+  L       G  + + + E                     L  +
Sbjct: 59  DLMVKMTLDEKTGQLTLLTSNWESTGPTMRDSYKEDIRAGRVGAIFNAYTAKYTRELQAL 118

Query: 67  SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
           +  G R   P    +D      T FP  +   AS++    +K  +  + EA A       
Sbjct: 119 AVEGTRLKIPLLFGYDVIHGHRTIFPISLGEAASWDLQAIEKAARISAIEASA------E 172

Query: 127 GLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
           G+ + +SP +++ RDPRWGR+ E  GED Y+    A   VRG        Y     SRP 
Sbjct: 173 GIHWTFSPMVDIARDPRWGRISEGAGEDVYLGSLIAKARVRG--------YQGGDLSRPD 224

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
            I A  KH+AAY      G D    D  ++E+ M++ ++ PF+   +    ++ M ++N 
Sbjct: 225 TILATAKHFAAYGAAQ-AGRDYHTVD--ISERTMRDVYLPPFKAAADA-GAATFMTAFNE 280

Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
            +G+P      LL   +R  W F G++V+D  SI  +V  H +  D K+ A  + ++AG+
Sbjct: 281 YDGVPASGSHYLLTDVLRKKWGFKGFVVTDYTSINEMV-PHGYAKDLKQ-AGEQAMRAGV 338

Query: 306 DLDC-GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLG--KNNI 362
           D+D  G  +      +V +GK+  A ID +++ +  +  RLG FD   +Y +    K  I
Sbjct: 339 DMDMQGAVFMENLAKSVAEGKVDTARIDAAVKAILEMKYRLGLFDDPYRYADAAREKATI 398

Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
             P  +E A + AR+ IVLLKN +  LPL   + K++A++GP  N+ + MIG++     R
Sbjct: 399 YKPAFLEAARDVARKSIVLLKNKDNVLPL-AASAKSIAVIGPLGNSKEDMIGSWSAAGDR 457

Query: 423 YTSP---MDGFYAYS---KVINYAPGCA---DIVCQNNSMIPAAIDAAKNADATVIVAGL 473
            T P   ++G  A +     I YA G +   D V + +     A+  A+ +D  +   G 
Sbjct: 458 RTRPVTLLEGLQAGAPKGTTIAYAKGASYHFDDVGKTDG-FAEALALAEKSDVIIAAMGE 516

Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
             ++  E   R  L LPG Q  L+  +    K PV LV+MS     I +A  N  + +IL
Sbjct: 517 HWNMTGEAASRTSLDLPGNQQALLEALEKTGK-PVILVLMSGRPNSIEWADAN--VDAIL 573

Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPL-RPVN-NFPGRTY 590
              YPG  GG AIAD+++G+YNP G+LP+T+      V I Y      RP+    PG  Y
Sbjct: 574 EAWYPGTMGGHAIADILYGRYNPSGKLPVTFPRTVGQVPIHYDMKNTGRPIELGAPGAKY 633

Query: 591 --KFFDGP--VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
             ++ + P   +YPFGYGLSYT F Y    SP    + LD+ +        +   +P   
Sbjct: 634 VSRYLNTPNTPLYPFGYGLSYTSFTY----SP----VTLDRSK--------IRPGEP--- 674

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAA 705
                         T  + V N G  DG EVV +Y +   G     +K++ G++++ +  
Sbjct: 675 -------------LTASVTVTNSGPRDGEEVVQLYVRDLVGSVTRPVKELKGFQKIGLKK 721

Query: 706 GQSAKVGFTMN 716
           G++  V FT+ 
Sbjct: 722 GETRTVRFTLT 732


>gi|393789624|ref|ZP_10377744.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
           CL02T12C05]
 gi|392650340|gb|EIY44009.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
           CL02T12C05]
          Length = 855

 Score =  258 bits (660), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 162/439 (36%), Positives = 237/439 (53%), Gaps = 46/439 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           + D   P  ER  DL+ R+T+ EKV  + + A  +PRL +  Y   +EALHGV       
Sbjct: 29  FRDMTAPQHERILDLLNRLTVEEKVSLLVNDAREIPRLNIDKYNHGNEALHGVV------ 82

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
              PG          T FP  I   A++N +L  ++   +S EAR  +   + G      
Sbjct: 83  --RPGEF--------TVFPQAIGLAATWNPNLIFRVSTAISDEARGRWKELDYGKKQIAG 132

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFWSP +N+ RDPRWGR  ET GEDP++ GR    +V+GLQ           + R
Sbjct: 133 GSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGRIGCEFVKGLQG---------DNPR 183

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LK  +  KH+AA    N E ++R   ++R++E+D++E ++  FE C+ +G   S+M +Y
Sbjct: 184 YLKTVSTPKHFAA----NNEEHNRSSCNARMSERDLREYYLPAFERCIVDGKAQSIMMAY 239

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N VN +P   +  L+ + +RGDWNF+GYIVSDC + + +V  HK++ +  E A    LKA
Sbjct: 240 NAVNDVPCTVNIYLIKKVLRGDWNFNGYIVSDCSAPEWMVTKHKYVKNL-EAAATLALKA 298

Query: 304 GLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           GLDL+CGD  YT   + A  +  ++EA+ID++   +    M LG FD   Q  Y  +  +
Sbjct: 299 GLDLECGDRVYTAPLLKAYNEYMVSEAEIDSAAYHILRGRMLLGLFDDPSQNPYNKIEPS 358

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            I   +H ELA E ARQ +VLLKN    LPLN   I+++A+VG   +A     G+Y G P
Sbjct: 359 VIGCKEHQELALETARQSMVLLKNQKNFLPLNRKKIRSIAVVG--ISAAHCEFGDYSGNP 416

Query: 421 CRY-TSPMDGFYAYSKVIN 438
                S +DG   Y++  N
Sbjct: 417 KNTPVSVLDGIKKYAENAN 435



 Score =  140 bits (354), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 101/294 (34%), Positives = 147/294 (50%), Gaps = 52/294 (17%)

Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-VD 519
           AK  D TV V G++ S+E EG+DR  L LP  Q E I ++      P T+V++ AG+ + 
Sbjct: 600 AKECDVTVAVLGINKSIEREGQDRYSLELPIDQQEFIKELYKV--NPNTVVVLVAGSSMA 657

Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
           IN+   N  + +IL   YPGE+GG A+A+V+FG YNPGGRLP+T+Y +         +P 
Sbjct: 658 INWMDEN--VPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYYNS------LDELPA 709

Query: 580 RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
               +   RTY++F+G  +Y FGYGLSYT FKYK  S  +S D          DI + + 
Sbjct: 710 FDDYSVKNRTYQYFEGKPLYEFGYGLSYTNFKYKKKSIMQSND--------TVDITFNLS 761

Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH--IKQVIG 697
                                       N+GK DG EV  VY + P   GT+  +KQ+ G
Sbjct: 762 ----------------------------NVGKYDGDEVAQVYVRYPE-TGTYMPLKQLKG 792

Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVGEGVGGVSF 750
           + RV +  G+SA +  ++   K L+  D      +  +G +   VG     +S 
Sbjct: 793 FSRVHLKKGKSADITISIPK-KELRYWDEKTRQFVTPTGEYVFQVGGSSENISI 845


>gi|423301451|ref|ZP_17279475.1| hypothetical protein HMPREF1057_02616 [Bacteroides finegoldii
           CL09T03C10]
 gi|408472052|gb|EKJ90581.1| hypothetical protein HMPREF1057_02616 [Bacteroides finegoldii
           CL09T03C10]
          Length = 781

 Score =  258 bits (659), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 211/723 (29%), Positives = 325/723 (44%), Gaps = 115/723 (15%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+PL+    EA HG   IG                  T FPT I   A+++  L  ++
Sbjct: 129 RLGIPLF-LAEEAPHGHMAIG-----------------TTVFPTGIGMAATWSPQLINEV 170

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           G+ +  E R        G   + P +++ RDPRW RV ET GEDP + G      V GL 
Sbjct: 171 GKAIGKEIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAMVAGLG 225

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                       SRP    A  KH+ AY +     N    F      +++ E F+ PF  
Sbjct: 226 S--------GDLSRPYSTLATLKHFLAYGISESGQNGNPSFAGM---RELHENFLPPFGQ 274

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            +N G +S VM SYN ++G P  A+  LL + +R DW F G +VSD  SI+ I +SH F+
Sbjct: 275 AINAGALS-VMTSYNSMDGTPCTANHYLLTELLRDDWKFKGVVVSDLYSIEGIHQSH-FV 332

Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
             T ++A    L AG+D+D G D Y N  M AV + +I++  +D ++  +  +   +G F
Sbjct: 333 ASTMKEAAVMALSAGVDIDLGGDAYMNL-MDAVNRKEISKEILDAAVSRVLRLKFEMGLF 391

Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
           +         K  + + +++ LA + A+  I LLKN++  LPL+      +AL+GP+A+ 
Sbjct: 392 ENPYVDPGKAKKEVRSKEYVALARQVAQASITLLKNEHSLLPLDRS--MKVALIGPNADN 449

Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYA--YSKVINYAPGCADIVCQNNSMIPAAIDAAKNA 464
              M+G+Y          + +DG  A   S  + Y  GC+ I     S I  A+ AA+ +
Sbjct: 450 RYNMLGDYTAPQEEENVKTVLDGIRAKLSSSQVEYVKGCS-IRDTVTSDIEQAVAAARRS 508

Query: 465 DATVIVAGLDLSVE-----------------------AEGKDRVDLLLPGFQTELINKVA 501
           +  + V G   + +                        EG DR  L L G Q EL+ K  
Sbjct: 509 EVVIAVVGGSSARDFKTSYKETGAAIADEKTISDMECGEGFDRATLSLLGKQQELL-KAL 567

Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
            A   P+ +V +    +D N+A  N    ++L   YPG+EGG AIADV+FG++NP GRLP
Sbjct: 568 KATGKPLIVVYIEGRPLDKNWASENAD--ALLTAYYPGQEGGNAIADVLFGEFNPAGRLP 625

Query: 562 ITWYEANYVKIPYTSMPLRPVNNFP-GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS 620
            +      V      +P+      P    Y       +Y FGYGLSYT F+Y        
Sbjct: 626 FS------VPRSVGQVPVYYNKKAPQSHDYVEVSASPLYSFGYGLSYTTFEYS------- 672

Query: 621 VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
                       D++ +  T                 + F    ++ N GK DG EVV +
Sbjct: 673 ------------DLHLSALT----------------PHSFEVSCKIRNTGKYDGEEVVQL 704

Query: 681 YSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
           Y +    +    +KQ+  + R+F+  G+  KV F ++  +   +VD     ++  G   +
Sbjct: 705 YLRDEYASVVQPLKQLKHFARLFLKCGEEQKVKFILSE-EDFALVDRNLKRVVEPGTFQV 763

Query: 740 LVG 742
           ++G
Sbjct: 764 MIG 766


>gi|294673871|ref|YP_003574487.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
 gi|294474367|gb|ADE83756.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
          Length = 782

 Score =  258 bits (659), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 220/726 (30%), Positives = 331/726 (45%), Gaps = 121/726 (16%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+PL+    EA HG   IG                  T FPT     A++N +L +K 
Sbjct: 130 RLGIPLF-LAEEAPHGHMAIG-----------------TTVFPTGFGMAATWNPALIEKT 171

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
           G+ +  E R        G   + P +++ R+PRW RV ET GEDP + G      V+GL 
Sbjct: 172 GEVIGQEIRL-----QGGHISYGPVLDLAREPRWSRVEETMGEDPVLAGELGAAMVKGLG 226

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
              G+       S+P    A  KH+  Y       N           +++QE+F+ PF+ 
Sbjct: 227 G--GIL------SKPYSTIATLKHFIGYGTTEAGQNGGITI---AGARELQESFLPPFKK 275

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            +N G +S VM SYN ++GIP+     LL   +R  W F+G++VSD  SI  I  +H+ +
Sbjct: 276 AINAGALS-VMTSYNSLDGIPSTCSKALLTDLLRTQWGFNGFVVSDLYSIDGIHGTHR-V 333

Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
            +TK+ A    LKAG+D D G         AVQ+G + EA+ID +++ +  +   +G F+
Sbjct: 334 AETKQQAGVMALKAGVDADLGALAFGRLEDAVQKGMVTEAEIDVAVKRILKMKFEMGLFE 393

Query: 350 GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT 409
                    K  + +  +  +A + AR+ I LLKN N  LPL+    + + + GP+A+  
Sbjct: 394 HPYVDAAQAKQLVRSDNNKAVALQVAREIITLLKNQNHVLPLS--KTQKVLVCGPNADNV 451

Query: 410 KAMIGNY-----EGTPCRYTSPMDGFYAYSKVINYAPGCA--DIVCQNNSMI-------- 454
             M+G+Y     EG      + +      S+V  Y  GCA  D    N +          
Sbjct: 452 YNMLGDYTAPQEEGNVKTILAGIRSKLPASQV-TYVKGCAVRDTTASNIAEAVAAAKQAD 510

Query: 455 --------PAAID---AAKNADATVI----VAGLDLSVEAEGKDRVDLLLPGFQTELINK 499
                    +A D   + K   A V     ++ +D     EG DR  L   G Q +L+ K
Sbjct: 511 VVVVAVGGSSARDFKTSYKETGAAVTDSKTISDMDC---GEGFDRATLTPLGHQMQLL-K 566

Query: 500 VADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGR 559
              A   P+ +V +    +D ++A  +    ++L   YPG+EGG AIADV+FG YNP GR
Sbjct: 567 ALKAIGKPLVVVYIEGRPMDKSWAAQHAD--ALLTAYYPGQEGGTAIADVLFGDYNPAGR 624

Query: 560 LPITWYEANYVKIP--YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASS 617
           LP++   AN  +IP  Y   P  P        Y       +Y FGYGLSYT FKY     
Sbjct: 625 LPVS-VPANVGQIPVYYNKKPPMP------HDYVEMSARPLYAFGYGLSYTTFKYD---- 673

Query: 618 PKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEV 677
                          D+N              I++     +K TF   V N G MDG EV
Sbjct: 674 ---------------DLN--------------IEETGDTQFKVTFN--VTNTGDMDGDEV 702

Query: 678 VMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGA 736
           V +Y      +    + Q+  + R+FI  G++ +V FT+ A + L+IVD   N ++ +G 
Sbjct: 703 VQLYLHDEFASTAQPMMQLKKFSRIFIPKGETKQVSFTLEA-EDLEIVDQEMNHVVETGD 761

Query: 737 HTILVG 742
            T+++G
Sbjct: 762 FTVMIG 767


>gi|255690204|ref|ZP_05413879.1| xylosidase/arabinosidase [Bacteroides finegoldii DSM 17565]
 gi|260624223|gb|EEX47094.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 954

 Score =  258 bits (659), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 225/768 (29%), Positives = 354/768 (46%), Gaps = 127/768 (16%)

Query: 6   KVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSE 61
           K K++D PY DA LP  ER + L+  MT  +K++ + +  +G+P  G+P LY       E
Sbjct: 162 KGKVTDRPYMDASLPVDERVESLLAAMTPADKMELIRE-GWGIP--GIPHLYVPPITKVE 218

Query: 62  ALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY 121
           A+HG S+         G+       GAT FP  +   A++N  L +++   +  E   + 
Sbjct: 219 AVHGFSY---------GS-------GATIFPQALAMGATWNRQLTEEVAMAIGDET-VIA 261

Query: 122 NLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
           N   A    WSP ++V +D RWGR  ET GEDP +V +    +++G Q            
Sbjct: 262 NTKQA----WSPVLDVAQDARWGRCEETFGEDPVLVSQMGGAWIKGYQ------------ 305

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
           S+ L  +   KH+  +         R   D  ++E++M+E  ++PF   +   D  S+M 
Sbjct: 306 SKGLFTTP--KHFGGHGAPL---GGRDSHDIGLSEREMREVHLVPFRHVIRNYDCQSLMM 360

Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
           +Y+   GIP     +LL + +R +W F+G+IVSDC +I  +     +    K +A  + L
Sbjct: 361 AYSDYMGIPIAKSTELLQRILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQAL 420

Query: 302 KAGLDLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
            AG+  +CGD Y N   + A + G+I   ++D   R +   + R   F+ +P  K L  N
Sbjct: 421 AAGIATNCGDTYNNKEVIQAAKDGRINMENLDNVCRTMLATMFRNELFEKNP-CKPLDWN 479

Query: 361 NIC----NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
            I     +  H  +A  AA + IV+L+N +  LPL+   ++T+A++GP A+  +   G+Y
Sbjct: 480 KIYPGWNSDSHKAMAHRAACESIVMLENKDNLLPLSK-ELRTIAVLGPGADDLQP--GDY 536

Query: 417 --EGTPCRYTSPMDGFYA----YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV 470
             +  P +  S + G  A     +KV+ Y  GC D      + IP A+  A  AD  V+V
Sbjct: 537 TPKLQPGQLKSVLTGIKAAVSKQTKVL-YEKGC-DFTETGMTDIPKAVKTASQADVVVMV 594

Query: 471 AGLDLSVE----------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
            G D S+            E  D   L+LPG Q EL+  V    K PV L++ +    D+
Sbjct: 595 LG-DCSISEATKDVRKTCGENNDLATLVLPGKQQELLEAVCATGK-PVILILQAGRPYDL 652

Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR 580
              K +   K+IL    PG+EGG A ADV+FG YNPGGRLP+T+            +PL 
Sbjct: 653 --LKASEMCKAILVNWLPGQEGGPATADVLFGDYNPGGRLPMTFPRH------VGQLPLY 704

Query: 581 PVNNFPGRTYKFFDGPV--VYPFGYGLSYTQFKY---KVASSPKSVDIKLDKDQQCRDIN 635
                 GR Y++ D     +Y FGYGLSYT F+Y   KV   P                 
Sbjct: 705 YNFKTSGRRYEYVDMEYYPLYRFGYGLSYTSFEYSGLKVQEKPNG--------------- 749

Query: 636 YTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQ 694
                                    T +  V+N+G   G EV  +Y +       T + +
Sbjct: 750 -----------------------NVTVEATVKNVGGRAGDEVAQLYVTDMYASVKTRVME 786

Query: 695 VIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +  + R+ +  G+S  V F +     L ++++  + ++  G   I VG
Sbjct: 787 LKDFARIHLNPGESKTVSFELTPY-DLSLLNDHMDRVVEKGEFKICVG 833


>gi|319953334|ref|YP_004164601.1| beta-glucosidase [Cellulophaga algicola DSM 14237]
 gi|319421994|gb|ADV49103.1| Beta-glucosidase [Cellulophaga algicola DSM 14237]
          Length = 756

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 216/716 (30%), Positives = 338/716 (47%), Gaps = 116/716 (16%)

Query: 35  PEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTV 94
           PEK++   D A    RLG+PL+ + S+ +HG                       T+FP  
Sbjct: 82  PEKIKTAQDFAVKKTRLGIPLF-FGSDIIHGYK---------------------TTFPIP 119

Query: 95  ILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGED 153
           +  ++S++  L K+  Q  + EA A       G+ + +SP +++ RDPRWGR+ E  GED
Sbjct: 120 LGLSSSWDMELLKRTAQVAALEATA------DGINWNFSPMVDISRDPRWGRISEGAGED 173

Query: 154 PYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSR 213
           PY+  + A   V G Q  + +  +         + A  KH+A Y      G D    D  
Sbjct: 174 PYLGSQIAKAMVTGYQGEDLMAKNT--------MLATVKHFALYGAAE-AGRDYNSVD-- 222

Query: 214 VTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIV 273
           ++   M   ++ P++  ++ G V SVM S+N ++GIP   +  LL   +R DW F+G++V
Sbjct: 223 MSRLKMYNEYLPPYKAAIDAG-VGSVMSSFNDIDGIPASGNKWLLTDLLRDDWKFNGFVV 281

Query: 274 SDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQGKIAEADID 332
           SD  S+  ++ +H  L D +    A  LKAGLD+D  G+ +      ++ +GK+   +I 
Sbjct: 282 SDYTSVNEMI-AHG-LGDLQA-VSALSLKAGLDMDMVGEGFLTTLKKSLDEGKVTAEEIT 338

Query: 333 TSLRFLYIVLMRLGYFDGSPQY--KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALP 390
           T+ R +     +LG FD   +Y  K     +I   ++  LA EAA++  VLLKND   LP
Sbjct: 339 TACRRILEAKFKLGLFDDPYKYIDKKRPAKDILKDENRALAREAAKKSFVLLKNDTKNLP 398

Query: 391 LNTGNIKTLALVGPHANATKAMIGNY--EGTPCRYTSPMDGF--YAYSKVINYAPGC--- 443
           +N  +   +AL+G  AN+   M+G +   G P    S + GF   A +  I +A G    
Sbjct: 399 INKSS--KIALIGDLANSKDNMLGTWAPTGDPQLSVSILQGFKNVAPNAQITHAKGANIT 456

Query: 444 --ADIVCQNN--------------SMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
             A +  + N               M+  A++ AK +D  V V G       E   R D+
Sbjct: 457 DDAALAKKINVFGERVTIDKRSAEEMLNEAVELAKKSDIIVAVVGEATEFTGESSSRTDI 516

Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
            +P  Q +LI  +A   K P+ LV+MS   + +   +      SIL V +PG E G AIA
Sbjct: 517 SIPQSQKKLIRALAATGK-PLVLVLMSGRPLVLE--EELALSASILQVWFPGVEAGNAIA 573

Query: 548 DVIFGKYNPGGRLPITWYEANYVKIP-YTSMPL--RP--VNNFPGRTYKFFDGP--VVYP 600
           DV+FG YNP G+L  TW   N  +IP Y S+    RP   + F   T  + D P   + P
Sbjct: 574 DVVFGDYNPSGKLTATWPR-NVGQIPIYHSIKNTGRPQLTSEFEKFTSNYLDAPNTPLLP 632

Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
           FGYGLSYT+F+Y         ++ ++  Q        +  N+P                 
Sbjct: 633 FGYGLSYTEFEYS--------NLNVNASQ--------INQNEP----------------L 660

Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTM 715
              + V N G  DG EVV +Y +    + T  +KQ+ G+++V +  G++ +V  T+
Sbjct: 661 IVTVSVTNTGNFDGEEVVQLYLRDVVRSITQPVKQLKGFKKVMLKKGETKQVTLTL 716


>gi|332665860|ref|YP_004448648.1| beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
 gi|332334674|gb|AEE51775.1| Beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
          Length = 887

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 170/502 (33%), Positives = 245/502 (48%), Gaps = 63/502 (12%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           FP  D  L +  R KDLV R+TL EKV QM + A  +PRLG+P Y+WW+E LHGV+    
Sbjct: 40  FPMWDTNLSFEVRVKDLVSRLTLEEKVGQMLNAAPAIPRLGIPAYDWWNEVLHGVA---- 95

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
                  T F       T +P  I   A ++ +    +    + E RA++N   A     
Sbjct: 96  ------RTPFH-----VTVYPQAIGMAAGWDSTSLAMMAHYSALEGRAVFNKATALGRNN 144

Query: 127 ----GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
               GLT+W+PNIN+ RDPRWGR  ET GEDP++       +VRGLQ           D 
Sbjct: 145 ERYLGLTYWTPNINIFRDPRWGRGQETYGEDPFLTSMLGRAFVRGLQ---------GDDP 195

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
           + LK +AC KH+A +         R   +   +  D+ +T++  F+  V +  V  VMC+
Sbjct: 196 KYLKAAACAKHFAVHSGPE---PSRHSDNFSPSNYDLWDTYLPAFKELVTKAKVEGVMCA 252

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           YN  +G P C    L+N  +R  W F GY+ SDC +I    + HK   D    +V  VL 
Sbjct: 253 YNAFHGQPCCGSDVLMNDILRKQWQFKGYVTSDCWAIDDFFKFHKTHPDATSASVDAVLH 312

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
            G D++CG       +  V++G IAEA +D SL  L+    RLG FD     +Y    ++
Sbjct: 313 -GTDVECGTDVYKSLLDGVKKGMIAEAQLDISLIRLFTTRYRLGMFDPVSMVKYAQTPES 371

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            +   +H   + + A+Q IVLLKN+   LPL+  NIK +A++GP+A+    ++GNY G P
Sbjct: 372 ILETAEHKAHSLKMAQQSIVLLKNEGNTLPLSK-NIKKIAVLGPNADNRIVVLGNYNGQP 430

Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMI-PAAIDAAKNADATVIVAGLDLSVEA 479
                        S++I    G  + + Q   +I   AI+     D  +  A +      
Sbjct: 431 -------------SEIITALQGIKNKLGQEVELIYEKAINFTN--DTLLAYANVTNQYSW 475

Query: 480 EGKDRVDLLLPGFQTELINKVA 501
           EGK       PGF+ E  N VA
Sbjct: 476 EGK-------PGFKAEYYNNVA 490



 Score =  139 bits (351), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 89/276 (32%), Positives = 133/276 (48%), Gaps = 53/276 (19%)

Query: 452 SMIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVA 501
           S + A ++  K+ADA V V G+   +E E          G DR  +LLP  QTEL+  + 
Sbjct: 607 SNLSAIVNRVKDADAIVYVGGISPQLEGEEMRVDFPGFNGGDRTSILLPAVQTELLKMLK 666

Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
              K P+  V+M+  A+ + +   N  I +I+   Y G+  G AIADV+FG YNP GRLP
Sbjct: 667 GTGK-PLVFVVMTGSAIALPYEDQN--IPAIVNAWYGGQSAGTAIADVLFGDYNPAGRLP 723

Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
           +T+Y+A+      + +P     +   RTY++F G  +YPFG+GLSYT F+Y    +P  +
Sbjct: 724 VTFYKAD------SDLPDFKSYDMNNRTYRYFKGDALYPFGHGLSYTSFQYSKLKTPGKI 777

Query: 622 DIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY 681
                                                 F     + N GK DG EVV +Y
Sbjct: 778 K---------------------------------SGASFKVSATLTNTGKKDGDEVVQLY 804

Query: 682 SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMN 716
              P +AG   I+ + G+ R+ + AG+S  V FT++
Sbjct: 805 LAYPEVAGKAPIRALKGFNRIRLKAGESKTVSFTLS 840


>gi|330996730|ref|ZP_08320605.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella
           xylaniphila YIT 11841]
 gi|329572575|gb|EGG54218.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
           11841]
          Length = 725

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 161/449 (35%), Positives = 229/449 (51%), Gaps = 46/449 (10%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
            D PY +  L   ERA DL++R+TL EK+  M + + GV RLG+  Y WWSEALHGV+  
Sbjct: 21  QDEPYKNPDLSPQERADDLLKRLTLKEKISLMQNQSPGVERLGIKPYNWWSEALHGVARN 80

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA---------M 120
           G                 AT +P  +   + F++ L + I  TVS E RA          
Sbjct: 81  GL----------------ATVYPITMGMASVFDDKLIEDIYVTVSDEGRAKFHDARRHGR 124

Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
           Y  GN GLTFW+PN+N+ RDPRWGR  ET GEDPY+  R  +  V+G+Q          +
Sbjct: 125 YGRGNEGLTFWNPNVNIFRDPRWGRGQETWGEDPYLTTRMGVAVVQGMQG--------PA 176

Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTE-QDMQETFILPFEMCVNEGDVSSV 239
           D++  K  AC KHYA +         R  FD    E +D+ ET++  F+  V E DV  V
Sbjct: 177 DAKYDKTHACAKHYAVHSGPE---AKRHSFDVENLEPRDLWETYLPAFKALVQEADVKEV 233

Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
           MC+Y R  G P C   +LL Q +R +W +   +VSDC +I      ++  ++T  DA   
Sbjct: 234 MCAYQRFEGEPCCGSNRLLTQILRDEWGYKHLVVSDCGAISDFF--YQGRHETHPDAATS 291

Query: 300 VLKA---GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QY 354
              A   G DL+CG  Y +    AV++G I E  IDTSLR L      LG  D      +
Sbjct: 292 SASAVINGTDLECGVEYAHLDE-AVERGLITEHRIDTSLRRLLEARFALGEMDDDALVPW 350

Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
             +  + +    H ++A +  R+ +VLL N NG LPL+ G++  +A++GP+A  +    G
Sbjct: 351 SRISIDTVDCDMHRQMALDVTRKSMVLLHN-NGILPLDKGDVGKIAVMGPNAVDSVMQWG 409

Query: 415 NYEGTPCRYTSPMDGFYAYSKVINYAPGC 443
           NY+G P    + ++G       + Y  GC
Sbjct: 410 NYKGVPAHTYTILEGIRMEVGNVPYEKGC 438



 Score = 66.2 bits (160), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 44/130 (33%), Positives = 70/130 (53%), Gaps = 21/130 (16%)

Query: 458 IDAAKNADATVIVAGLDLSVEAE-----------GKDRVDLLLPGFQTELINKVADAAKG 506
           ++  K+A+  + V G+  ++E E           G DR  + LP  Q +++ K   AA  
Sbjct: 594 VERVKDAETIIFVGGISPNLEGEDKYFVYCPGFAGGDRTSIELPQVQRDIL-KALKAAGK 652

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKS---ILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            V  V  S  AV +      P+++S   IL   YPG+ GG A+ADV+FG +NP G+LP+T
Sbjct: 653 KVVFVNCSGSAVALV-----PELESCDAILQAWYPGQAGGLAVADVLFGDFNPSGKLPVT 707

Query: 564 WYEANYVKIP 573
           +Y+ N  ++P
Sbjct: 708 FYK-NTEQLP 716


>gi|150003144|ref|YP_001297888.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
 gi|149931568|gb|ABR38266.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Bacteroides vulgatus ATCC 8482]
          Length = 785

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 245/830 (29%), Positives = 377/830 (45%), Gaps = 173/830 (20%)

Query: 6   KVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQM------------GDLAYGVPRL-- 51
           +V    + Y  A +P   R KDL+ RMT+ EKV Q+            G     V  L  
Sbjct: 22  RVMAQQWLYKQAAVPIEYRVKDLLGRMTIEEKVGQLCCPLGWEMYTKTGKNEVTVSELYK 81

Query: 52  ----GLPLYEWWS-------------------------EALHGVSFIGRRTNSPPGTHFD 82
                 P+  +W+                          AL   +    R   P    F 
Sbjct: 82  KKMAEAPVGSFWAVLRADPWTQKTLETGLSPELSAKALNALQKYAVEETRLGIP--VLFA 139

Query: 83  SEVP------GATSFPTVILTTASFNESLWKKIGQTVSTEARAM-YNLGNAGLTFWSPNI 135
            E P      G T FPT +   +++NE L  K+G+ ++ EAR    N+G      + P +
Sbjct: 140 EECPHGHMAIGTTVFPTALSAASTWNEGLMLKMGEAIALEARLQGANIG------YGPVL 193

Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
           +V R+PRW R+ ET GEDP       +  + G+  ++G++    +D + L   A  KH+A
Sbjct: 194 DVAREPRWSRMEETFGEDP------VLTTIMGVAMMKGMQGKVQNDGKHL--YATLKHFA 245

Query: 196 AYDLDNWEGNDRFHFDSRVT--EQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
           AY +      +  H  SR     + +   ++ PF   V EG   ++M SYN ++G+P  A
Sbjct: 246 AYGVP-----ESGHNGSRANCGMRQLLSEYLPPFRKAVKEG-AGTLMTSYNAIDGVPCTA 299

Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC-GDY 312
           + +LL   +R  W F G++ SD  SI+ IV   +   D KE AV + LKAGLD+D  G+ 
Sbjct: 300 NKELLTDVLRNQWGFKGFVYSDLISIEGIV-GMRAAKDNKEAAV-KALKAGLDMDLGGNA 357

Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAA 372
           +      A ++G I  AD+D ++  +  +  ++G F+       L K  + + +H ELA 
Sbjct: 358 FGKNLKKAYEEGLITMADLDRAVGNVLRLKFQMGLFENPYVSPELAKKLVHSKEHKELAR 417

Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR--YTSPMDGF 430
           + AR+G+VLLKN+ G LPL+  +I  LA++GP+A+     +G+Y     R    + +DG 
Sbjct: 418 QVAREGVVLLKNE-GVLPLSK-HIGHLAVIGPNADEMYNQLGDYTAPQVREEVATVLDGI 475

Query: 431 YAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG--------------- 472
            A    S  + Y  GCA +     + IPAA+ AA+ ADA V+V G               
Sbjct: 476 RAAVSESTRVTYVKGCA-VRDTTATDIPAAVAAAQKADAVVLVVGGSSARDFKTKYISTG 534

Query: 473 -LDLSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINF 522
              +S +A         EG DR  L L G Q +LI+ VA   K P+ +V +    +++N 
Sbjct: 535 AATVSEDAKTLPDMDCGEGFDRSSLRLLGDQEKLISAVASTGK-PLVVVYIQGRTMNMNL 593

Query: 523 AKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPV 582
           A    K +++L   YPGE+GG  IAD++FG Y+P GRLP++        +P +   L PV
Sbjct: 594 AAE--KAQALLTAWYPGEQGGMGIADILFGDYSPAGRLPVS--------VPRSEGQL-PV 642

Query: 583 NNFPG--RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGT 640
               G  R Y    G  +Y FGYGLSYT+F Y          ++L K  +   +      
Sbjct: 643 FYSQGTQRDYVESKGTPLYAFGYGLSYTRFTYS--------GLELQKGTEMETLQ----- 689

Query: 641 NKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY--------SKPPGIAGTHI 692
                               T    V N G  DG EVV +Y        S+PP +     
Sbjct: 690 --------------------TVACTVTNTGNRDGEEVVQLYIGDKVASVSQPPLL----- 724

Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
             +  ++R+F+  G+S +V F +     L I D+  N ++  G   ++VG
Sbjct: 725 --LKAFQRIFLKKGESRQVIFHLKK-DDLGIYDSEMNYVVEPGEFKVMVG 771


>gi|431797765|ref|YP_007224669.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
           DSM 17526]
 gi|430788530|gb|AGA78659.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
           DSM 17526]
          Length = 799

 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 203/698 (29%), Positives = 336/698 (48%), Gaps = 111/698 (15%)

Query: 87  GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
           G T FPT I   +++N +L +++   ++ EAR        G   + P +++ R+PRW RV
Sbjct: 156 GTTVFPTSIGQASTWNPALIQEMAAAIALEARL-----QGGHIGYGPVLDLAREPRWSRV 210

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
            ET GEDPY+  +     V G Q         +S +    + +  KH+ AY +     N 
Sbjct: 211 EETYGEDPYINSQMGRAMVSGFQG--------ESIASGKNVISTLKHFTAYGVPEGGHNG 262

Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
                  V ++++ E+++ PF+  V EG +S VM +YN ++G+P  ++  LLN  +R DW
Sbjct: 263 T---SVSVGQRELHESYLPPFKAAVAEGALS-VMTAYNSIDGVPCTSNGHLLNDVLRDDW 318

Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY-YTNFTMGAVQQGK 325
            F+G++VSD  SI  +  SH  + +T E A    + AG+D D G Y +    + AVQ G 
Sbjct: 319 GFNGFVVSDLGSISGLRGSHH-VTETAEGAAQLAINAGVDSDLGGYGFGKNLLAAVQAGG 377

Query: 326 IAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKND 385
           +++  +D ++R +  V   +G F+      +  ++ + + +HI LA + AR+ +VLLKN+
Sbjct: 378 VSQEVLDEAVRRVLKVKFDMGLFENPYVDPSKAESLVRSAKHIALARKVARESVVLLKNE 437

Query: 386 NGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC--RYTSPMDGFYAYSKV-----IN 438
           N  LPL    + ++A++GP+A+ T   +G+Y          + ++G    +KV     +N
Sbjct: 438 NDLLPLRK-KVNSIAVIGPNADNTYNQLGDYTAPQPNENVVTVLEGI--KNKVGKDVRVN 494

Query: 439 YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE---------------- 478
           Y  GCA I     S I  A   A  +D  V+V G     D   E                
Sbjct: 495 YVKGCA-IRDTTQSEIGKAASLAARSDVAVVVLGGSSARDFDTEYEETAAAKVSEAEEGQ 553

Query: 479 -------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKS 531
                   EG DR+ L L G Q +L+  V  A   PV +V++    +++N+   +  + +
Sbjct: 554 VISDMESGEGFDRMTLDLLGDQLKLVQAV-QATGTPVVVVLIKGRPLNLNWIDEH--VPA 610

Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF--PGRT 589
           I+   YPG+EGG AIADV+FG YNP GRL I+        +P +   L    N+  P R 
Sbjct: 611 IVDAWYPGQEGGNAIADVLFGDYNPSGRLTIS--------VPRSVGQLPVFYNYRNPKR- 661

Query: 590 YKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
           + + +G    +Y FG+GLSY  F+Y       S                  G    P   
Sbjct: 662 HDYVEGSAEPLYAFGHGLSYADFEYDNLEVTAS------------------GMAGSPTVR 703

Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIG---YERVFIA 704
           V                +V N+  +DG EVV +Y +    AG+ ++ ++    +E+V + 
Sbjct: 704 V--------------HFQVSNISNVDGEEVVQLYVRDE--AGSTVRPLLELKRFEKVMVP 747

Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           AG+S+K+ F + A + L+++    N L+  G+  +LVG
Sbjct: 748 AGESSKITFMLTA-EDLQVLGQDMNWLVEPGSFQVLVG 784


>gi|371777646|ref|ZP_09483968.1| beta-glucosidase [Anaerophaga sp. HS1]
          Length = 865

 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 153/439 (34%), Positives = 233/439 (53%), Gaps = 40/439 (9%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           S+ VK    P+ +  L   ERAKDL+ R+T+ EK + + D +  +PRLG+  + WWSEAL
Sbjct: 14  SMTVKGQVLPFQNPDLSSEERAKDLISRLTVQEKARLLCDQSEAIPRLGIKKFNWWSEAL 73

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HG +            + DS     T FP  I   ASFNE L  +I   +S EARA Y+ 
Sbjct: 74  HGYA------------NNDS----VTVFPQPIGMAASFNEELVFEIFNAISDEARAKYHQ 117

Query: 124 GNA---------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
                        L+ W+PN+N+ RDPRWGR  ET GEDPY+  R  +  V+GLQ  E  
Sbjct: 118 AQRRGEENRRFLSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGPEDA 177

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
           +Y         K+ AC KHY  +    W  ++    D  V+ ++  ET++  F+  V + 
Sbjct: 178 KYR--------KLLACAKHYTVHSGPEWSRHELNIND--VSPREFYETYMPAFKALVQKA 227

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
           DV  VMC+Y+R++  P C++ ++L + +R +W +   +V+DC +I     +H  ++ T  
Sbjct: 228 DVRQVMCAYHRLDDEPCCSNTRILQRILRDEWGYEHMVVADCGAISDFYTTHG-ISSTPV 286

Query: 295 DAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP 352
            A A  L AG DL+C   +Y+      A+++  I E DID SL  +      LG  D + 
Sbjct: 287 HAAATGLLAGTDLECIWDNYHYKMLPEALEKDLITEKDIDRSLMRVLKGRFDLGEMDDNS 346

Query: 353 --QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATK 410
              +  +  + +   +H +LA + A+Q IVLL+N N  LPL+  +I  +A+VGP+A+   
Sbjct: 347 LVPWAQIPPSVLNCEKHRQLAYKMAQQSIVLLQNKNKVLPLDKSSINKIAVVGPNADDEV 406

Query: 411 AMIGNYEGTPCRYTSPMDG 429
            + GNY GTP R  + +DG
Sbjct: 407 VLWGNYNGTPIRTITVLDG 425



 Score =  102 bits (255), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 84/326 (25%), Positives = 138/326 (42%), Gaps = 59/326 (18%)

Query: 407 NATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPG-------CADIVCQNNSMIPAAID 459
           N T A   N+   P R+   ++    Y   I YA           D   + +    A I 
Sbjct: 539 NDTLASYTNWRTIPARFPLYVEAGKTYEIEIRYAQRENWEANIQFDFGREEDIDFTALIK 598

Query: 460 AAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVT 509
             +  +  + V GL   +E E          G DR ++ LP  Q   +  + +A K   T
Sbjct: 599 KLEGIETVIFVGGLSGFLEGEEMPVSYPGFKGGDRTNIELPSVQRNCLKALKEAGK---T 655

Query: 510 LVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANY 569
           ++ ++     I          +I+   Y GE GG+AIADV+FG YNP G+LP+T+Y  + 
Sbjct: 656 VIFVNCSGSAIALEPETESCDAIIQAWYGGESGGQAIADVLFGDYNPSGKLPVTFYRNSD 715

Query: 570 VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQ 629
               +    +       GRTY++ +   ++PFG+GLSYT F+   A   KS  IK D+  
Sbjct: 716 NLGDFEDYSME------GRTYRYTNNH-LFPFGFGLSYTNFEIGKARLSKST-IKADE-- 765

Query: 630 QCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG 689
                                          + +I V+N GK DG+E+V VY +      
Sbjct: 766 -----------------------------TISIKIPVKNTGKRDGTEIVQVYVRKVNDID 796

Query: 690 THIKQVIGYERVFIAAGQSAKVGFTM 715
             +K + G++R+ + AG++ +   ++
Sbjct: 797 GPLKTLKGFQRIAVPAGKTRQANISL 822


>gi|332881173|ref|ZP_08448832.1| glycosyl hydrolase family 3 protein, partial [Capnocytophaga sp.
           oral taxon 329 str. F0087]
 gi|332680887|gb|EGJ53825.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           329 str. F0087]
          Length = 675

 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 162/449 (36%), Positives = 227/449 (50%), Gaps = 46/449 (10%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
            D PY +  L   ERA DL++R+TL EK+  M + + GV RLG+  Y WWSEALHGV+  
Sbjct: 21  QDEPYKNPDLSPQERADDLLKRLTLKEKISLMQNQSPGVERLGIKPYNWWSEALHGVARN 80

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA---------M 120
           G                 AT +P  +   + F++ L + I  TVS E RA          
Sbjct: 81  GL----------------ATVYPITMGMASVFDDKLIEDIYVTVSDEGRAKFHDARRHGR 124

Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
           Y  GN GLTFW+PN+N+ RDPRWGR  ET GEDPY+  R  +  VRG+Q          +
Sbjct: 125 YGRGNEGLTFWNPNVNIFRDPRWGRGQETWGEDPYLTTRMGVAVVRGMQG--------PA 176

Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTE-QDMQETFILPFEMCVNEGDVSSV 239
           D++  K  AC KHYA +         R  FD    E +D+ ET++  F+  V E DV  V
Sbjct: 177 DAKYDKTHACAKHYAVHSGPE---AKRHSFDVENLEPRDLWETYLPAFKALVQEADVKEV 233

Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
           MC+Y R  G P C   +LL Q +R +W +   +VSDC +I      ++  ++T  DA   
Sbjct: 234 MCAYQRFEGEPCCGSNRLLTQILRDEWGYKHLVVSDCGAISDFF--YQGRHETHPDAATS 291

Query: 300 VLKA---GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QY 354
              A   G DL+CG  Y +    AV++G I E  IDTSLR L      LG  D      +
Sbjct: 292 SASAVINGTDLECGVEYAHLDE-AVERGLITEHRIDTSLRRLLEARFALGEMDDDALVPW 350

Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
             +  + +    H  +A +  R+ +VLL N NG LPL+ G+   +A++GP+A  +    G
Sbjct: 351 SRISIDTVDCGTHRRMALDVTRKSMVLLHN-NGILPLDKGDAGKIAVMGPNAVDSVMQWG 409

Query: 415 NYEGTPCRYTSPMDGFYAYSKVINYAPGC 443
           NY+G P    + ++G       + Y  GC
Sbjct: 410 NYKGVPAHTYTILEGIRGAIGNVPYEKGC 438


>gi|313205375|ref|YP_004044032.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312444691|gb|ADQ81047.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 858

 Score =  257 bits (657), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 164/454 (36%), Positives = 231/454 (50%), Gaps = 47/454 (10%)

Query: 4   SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
           ++ +     PY + KL    RA DL+ R+TL EK   M + +  +PRLG+  YEWW+EAL
Sbjct: 14  TVSLVAQQLPYQNPKLSAEVRATDLLARLTLAEKAALMQNNSPAIPRLGIKAYEWWNEAL 73

Query: 64  HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
           HGV   G                 AT FP  I   ASFN  L       VS EARA  N 
Sbjct: 74  HGVGRSGV----------------ATVFPQAIGMAASFNNGLLFDAFTAVSDEARAKSNK 117

Query: 124 GN--------AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
            +         GLT+W+PN+N+ RDPRWGR  ET GEDPY+     +  V+GLQ  +  E
Sbjct: 118 FSEQGGLKRYQGLTYWTPNVNIFRDPRWGRGQETYGEDPYLTSLMGVAVVKGLQGPDNAE 177

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEG 234
           Y         K+ AC KH+A +    W   +R  F++  +  +D+ ET++  F+  V + 
Sbjct: 178 YD--------KLHACAKHFAVHSGPEW---NRHSFNAENINPRDLWETYLPAFKALVQKA 226

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDT 292
           DV  VMC+YNR    P C   +LL Q +R DW F G +VSDC +I    +  +H    D 
Sbjct: 227 DVKEVMCAYNRFEDEPCCGSNRLLTQILRNDWKFDGLVVSDCWAISDFYKPNAHATQPDA 286

Query: 293 KEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP 352
              A   VL  G DL+CG  + N    AV+ G I E  ID SL+ L      LG  + S 
Sbjct: 287 THAAANAVLN-GTDLECGSDFRNLPE-AVKAGLIEEKRIDVSLKRLLKARFELGEMN-SD 343

Query: 353 QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAM 412
           Q   +  + + + +H  LA   A + IVLL+N+N  LPL +  +K +A++GP+AN +   
Sbjct: 344 QVWPISYSVVNSEKHQNLALRMAEESIVLLQNNNNILPL-SKKLK-IAVMGPNANDSVMQ 401

Query: 413 IGNYEGTPCRYTSPMDGF---YAYSKVINYAPGC 443
            GNY G P    + ++     +  +++I Y PGC
Sbjct: 402 WGNYNGFPAHTVTLLEAMRKSFPGAQLI-YEPGC 434



 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 77/276 (27%), Positives = 128/276 (46%), Gaps = 53/276 (19%)

Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
           + A+I   K+AD  V   G+  S+E E          G DR D+ LP  Q  L+  + DA
Sbjct: 585 LSASIAKVKDADVVVFAGGIAPSLEGEEMRVTVPGFKGGDRTDIELPAIQRRLLQALKDA 644

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K  V  V  S  A+ +         ++IL   YPG+ GG A+A+V+ G YNP GRLP+T
Sbjct: 645 GK-KVVFVNFSGSAMGL--VPETQSCEAILQAWYPGQAGGTAVANVLLGNYNPSGRLPVT 701

Query: 564 WYEANYVKIP-YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVD 622
           +Y+ N  ++P +    ++      GRTY++     ++ FGYGLSYT+F            
Sbjct: 702 FYK-NVAQLPDFEDYSMK------GRTYRYMTEKPLFSFGYGLSYTKF------------ 742

Query: 623 IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYS 682
                          +GT K   +++  ++           + V N GK+ G+EV+ VY 
Sbjct: 743 --------------VLGTAKLNKSSIKANET------LKITVPVTNAGKVAGTEVLQVYV 782

Query: 683 KPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
           +         K + G+++V I  G+++++   + + 
Sbjct: 783 RKVKDVDGPAKTLRGFKKVNIEPGKTSQISIDLTSS 818


>gi|357047866|ref|ZP_09109459.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella clara
           YIT 11840]
 gi|355529205|gb|EHG98644.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella clara
           YIT 11840]
          Length = 676

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 162/449 (36%), Positives = 227/449 (50%), Gaps = 46/449 (10%)

Query: 10  SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
            D PY +  L   ERA DL++R+TL EK+  M + + GV RLG+  Y WWSEALHGV+  
Sbjct: 21  QDEPYKNPDLSPQERADDLLKRLTLKEKISLMQNQSPGVERLGIKPYNWWSEALHGVARN 80

Query: 70  GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA---------M 120
           G                 AT +P  +   + F++ L + I  TVS E RA          
Sbjct: 81  GL----------------ATVYPITMGMASVFDDKLIEDIYVTVSDEGRAKFHDARRHGR 124

Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
           Y  GN GLTFW+PN+N+ RDPRWGR  ET GEDPY+  R  +  VRG+Q          +
Sbjct: 125 YGRGNEGLTFWNPNVNIFRDPRWGRGQETWGEDPYLTTRMGVAVVRGMQG--------PA 176

Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTE-QDMQETFILPFEMCVNEGDVSSV 239
           D++  K  AC KHYA +         R  FD    E +D+ ET++  F+  V E DV  V
Sbjct: 177 DAKYDKTHACAKHYAVHSGPE---AKRHSFDVENLEPRDLWETYLPAFKALVQEADVKEV 233

Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
           MC+Y R  G P C   +LL Q +R +W +   +VSDC +I      ++  ++T  DA   
Sbjct: 234 MCAYQRFEGEPCCGSNRLLTQILRDEWGYKHLVVSDCGAISDFF--YQGRHETHPDAATS 291

Query: 300 VLKA---GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QY 354
              A   G DL+CG  Y +    AV++G I E  IDTSLR L      LG  D      +
Sbjct: 292 SASAVINGTDLECGVEYAHLDE-AVERGLITEHRIDTSLRRLLEARFALGEMDDDALVPW 350

Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
             +  + +    H  +A +  R+ +VLL N NG LPL+ G+   +A++GP+A  +    G
Sbjct: 351 SRISIDTVDCGTHRRMALDVTRKSMVLLHN-NGILPLDKGDAGKIAVMGPNAVDSVMQWG 409

Query: 415 NYEGTPCRYTSPMDGFYAYSKVINYAPGC 443
           NY+G P    + ++G       + Y  GC
Sbjct: 410 NYKGVPAHTYTILEGIRGAIGNVPYEKGC 438


>gi|336255157|ref|YP_004598264.1| beta-glucosidase [Halopiger xanaduensis SH-6]
 gi|335339146|gb|AEH38385.1| Beta-glucosidase [Halopiger xanaduensis SH-6]
          Length = 774

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 223/821 (27%), Positives = 367/821 (44%), Gaps = 160/821 (19%)

Query: 8   KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL--------GLPLYEWW 59
           +LS   Y D       R +DL+ERMT+ EK  Q+G +     RL           + EW 
Sbjct: 4   ELSTAAYQDESESVENRVEDLLERMTVEEKAAQLGSV--NADRLLDEDGEIDWDAVDEWL 61

Query: 60  SEALHGVSFIGR-------------RTNSPPGTHFDSEV------------------PGA 88
           +   HG+    R             R  +   T+   E                   P A
Sbjct: 62  A---HGIGHFTRLGGEGSLAPSEAARVTNELQTYLREETRLGIPAIPHEECLSGYMGPEA 118

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLE 148
           T+FP ++   +S+N  L + + +T+  E       G   +   SP ++V RD RWGRV E
Sbjct: 119 TTFPQMLGMASSWNPELLQTVTETIRGELE-----GIGTVHALSPVLDVARDLRWGRVEE 173

Query: 149 TPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRF 208
           T GEDPY+V   A  YV GLQ           D R   ISA  KH+  +   +  G +R 
Sbjct: 174 TFGEDPYMVAEMARAYVSGLQ----------GDGRADGISATLKHFVGHGATDG-GKNRS 222

Query: 209 HFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNF 268
             +  V  ++++ET + P+E  ++E +  SVM +Y+ ++G+P      LL + +RG++ F
Sbjct: 223 SLN--VGPRELRETHLFPYEAVISEANAESVMNAYHDLDGVPCANSEWLLTEVLRGEFGF 280

Query: 269 HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKI 326
            G +VSD  S++ +V  H+  + TK +A  + L+AG+D++    +YY    + AV++G +
Sbjct: 281 DGTVVSDYYSVRHLVTEHETAS-TKPEAAVQALEAGIDVELPYTEYYGEHLVEAVEEGDL 339

Query: 327 AEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDN 386
           AE  ++ S+R +     R G FD      +   +     +  E+  EAARQ + LLKN++
Sbjct: 340 AEETLNESVRRILREKFRKGVFDDPAVDVDAAADAFHTDEAREVTREAARQSMTLLKNED 399

Query: 387 GALPLNTGNIKTLALVGPHANATKAMIGNY--------EGTPCRYTSPMDGFYAYSKV-I 437
                   ++  +A+VGP A+  K ++G+Y        E       +P++   A   + +
Sbjct: 400 DL---LPLDVDDVAVVGPKADNPKELMGDYAYAAHYPEEEYEADAVTPLEALEARDGLDV 456

Query: 438 NYAPGCA-----------------------DIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
            Y  GC                          V   +++  + ++A K    +V  +G  
Sbjct: 457 TYEQGCTISGPSTDGFDAAADAAADADVALAFVGARSAVDFSDVEAEKEEKPSVPTSG-- 514

Query: 475 LSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI-NFAKNNPKIKSIL 533
                EG D   L LPG Q EL+ ++ +    PV +V++S     I + A   P   +IL
Sbjct: 515 -----EGCDVTHLGLPGVQEELVAELLE-TDTPVVVVLVSGKPHAIEDIAAEAP---AIL 565

Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPV-----NNFPGR 588
           +   PG+EGG AIA+ +FG+ NP G+LP++        +P +   L PV      N   +
Sbjct: 566 YAWLPGDEGGTAIAETLFGENNPAGKLPVS--------LPKSVGQL-PVYYNRKENTANK 616

Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
            Y + D   VYPFG+G SYT+F+Y         D+ L  D               P  + 
Sbjct: 617 DYVYTDSEPVYPFGHGESYTEFEYG--------DVSLSTDSVT------------PLGS- 655

Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQ 707
                      FT  + V N+G   G E+V  Y +    +    +++++G+ERV +  G+
Sbjct: 656 -----------FTASVTVANVGDRAGDEIVQCYGRATNASQARPVQELLGFERVSLEPGE 704

Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
           S +V F ++A + L   D + N  +  G + I +G    G+
Sbjct: 705 SKRVAFDLSATQ-LAFHDLSMNLAVEEGPYEIRIGRSADGI 744


>gi|285808617|gb|ADC36136.1| glycoside hydrolase family 3 protein [uncultured bacterium 253]
          Length = 752

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 191/641 (29%), Positives = 319/641 (49%), Gaps = 69/641 (10%)

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVL 147
           T FP  +   +S++ +  ++     + EARA      AG+ + ++P +++ RDPRWGR+ 
Sbjct: 117 TIFPIPLAEASSWDPTSAERSTSIAAREARA------AGVRWTFAPMLDIARDPRWGRIT 170

Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
           E  GED ++   +A   VRG Q   G +Y     S P K+ AC KH+ AY     EG  R
Sbjct: 171 EGAGEDQFLGAAFARARVRGFQ---GTDY-----SAPDKMLACAKHWVAYGAT--EGG-R 219

Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
            +  + ++E  ++E +  PF+  V+ G V +VM  +N +NG+P  A+   L + +RG+W 
Sbjct: 220 DYNTTDMSENTLREIYFPPFKAAVDAG-VGTVMSGFNDLNGVPVSANHFTLTEVLRGEWK 278

Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQGKI 326
           F G++VSD  S++ ++       D  +DA    L AG+D++     +       +++GK+
Sbjct: 279 FDGFVVSDYTSVKELINHGLAFGD--QDAARLALNAGVDMEMVSRLFNQQGPQLLKEGKV 336

Query: 327 AEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDN 386
           + A ID ++R +  +  RLG F      +     ++   ++   A   A + +VLLKN+ 
Sbjct: 337 SPATIDEAVRRILRIKFRLGLFANPYADEARETTSLLTSENRAAARALADRSMVLLKNEG 396

Query: 387 GALPLNTGNIKTLALVGPHANATKAMIGNY--EGTPCRYTSPMDGFYAY---SKVINYAP 441
           G LPL+ G I+++A++GP A+  +A +G +  +G P    +P+ G  A    +  +NYA 
Sbjct: 397 GTLPLSKG-IRSIAVIGPLADDHRAPLGWWSGDGKPEDTVTPLMGIRAKVSPATKVNYAK 455

Query: 442 GCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVA 501
           GC D+   +   I  A+  A+ ++  ++  G    +  E   +  L L G Q +L+  V 
Sbjct: 456 GC-DVQGDSTGDIAEAVAVARESELAIVFVGESAEMVGEAASKSSLDLTGCQMDLVKAVQ 514

Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
              K P  +V+++   + + +  +N       W+G  G E G AIADV+FG  NPGG+LP
Sbjct: 515 ATGK-PTIVVLINGRPLTVGWIFDNTPAVLEAWMG--GTEAGNAIADVLFGDANPGGKLP 571

Query: 562 ITW-YEANYVKIPYTSMPL-RPVNNFPGRTYKFFDGPVV--YPFGYGLSYTQFKYKVASS 617
           +TW      V I Y  M   RP       T K+ D P    + FGYGLSYTQFK      
Sbjct: 572 VTWPRTVGQVPIYYNHMNTGRPPEANNRYTSKYLDVPWTPQFCFGYGLSYTQFKI----- 626

Query: 618 PKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEV 677
               +++L               + P  +A           K T  +EVEN+GK  G EV
Sbjct: 627 ---TNLQL---------------SAPRISAT---------GKLTASVEVENVGKRAGDEV 659

Query: 678 VMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNA 717
           V +Y      + T  +K++ G++R+ +  G+  +V F + +
Sbjct: 660 VQLYIHDVAASMTRPVKELKGFQRITLQPGEKKRVEFVLTS 700


>gi|298374091|ref|ZP_06984049.1| thermostable beta-glucosidase B [Bacteroides sp. 3_1_19]
 gi|298268459|gb|EFI10114.1| thermostable beta-glucosidase B [Bacteroides sp. 3_1_19]
          Length = 732

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 221/776 (28%), Positives = 362/776 (46%), Gaps = 142/776 (18%)

Query: 18  KLPYPERAKDLVERMTLPEKVQQM-GDLAY---GVPRLGLPLYEW-WSEALHGV-SFIGR 71
           K+   +R + L+++MTL EKV  + G+  +   GV RLG+P  EW  S+  HGV + I R
Sbjct: 28  KVQMEKRIEKLIKKMTLEEKVGLLHGNSKFYVAGVERLGIP--EWSLSDGPHGVRAEINR 85

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
              +  G   DS    A+ FPT     A++N  L  + G+ +  EAR             
Sbjct: 86  HDWAYAGWTNDS----ASYFPTGTAFAAAWNPELAYRRGEVLGEEAR-----WRKKDVLL 136

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
            P +N++R P  GR  E   EDPY+    A+ Y++GLQ        RD       ++   
Sbjct: 137 GPGVNIIRSPLCGRNFEYMSEDPYMNSVLAVAYIKGLQS-------RD-------VACSV 182

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KH+A   ++N E N R   D   +E+ ++E ++  F+  V EG   +VM +YN+  G   
Sbjct: 183 KHFA---VNNQETN-RTTIDVECSERALREIYLPAFKAAVQEGGALTVMAAYNKFRGEFC 238

Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
             +  L+ + +R +W F G  V+D  +  + V S               ++AGLDL+ G 
Sbjct: 239 AENNYLVRKILRNEWGFDGVYVTDWGAAHSTVPS---------------MEAGLDLEMGT 283

Query: 312 --------YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
                   YY N  + AV+ GK+  + +D  +  +  V+++    D  P+ K  G  ++ 
Sbjct: 284 LIDKYEDWYYANPLIEAVKSGKVPMSLVDEKVGDVLRVMIKTNVLD--PK-KRFGPGSMN 340

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG-----PHANA--TKAMIGNY 416
             +H +   +AA + IVLLKN N  LPL+  +IK+LA++G      H+N   +  +   Y
Sbjct: 341 TKEHQQATYDAAAEAIVLLKNQNNLLPLDFSSIKSLAVIGDNATRKHSNGGLSSEIKAVY 400

Query: 417 EGTP-----CRYTSPMDGFYA--YSKVINYAPGC---------ADIVCQNNSMIPAAIDA 460
           E TP      ++   +D  +A  Y K+  +  G          +    ++++++  A++ 
Sbjct: 401 EVTPLEALRAKWGDKVDIRFAQGYEKLSTFVEGSNNGQSSGTFSSKTQESDALLKEAVEV 460

Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
           A+ +D  ++V GL+   + E  DR+++ +P  Q ELI +V  A   P T+VIM AG+  +
Sbjct: 461 ARTSDVALLVCGLNHDYDTESFDRLNMDIPYGQVELIQEVVKA--NPRTIVIMIAGS-PL 517

Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR 580
           N A  +    +I+W  + G EGG A+ DV+ GK NP G++P T        +     P  
Sbjct: 518 NMAAVDICSPAIVWAWFNGMEGGNALVDVLSGKVNPSGKMPFT------TPVSLDQSPAH 571

Query: 581 PVNNFPGRT------------YKFFDG---PVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
            + NFPGR             Y++FD    PVVYPFGYGLSYT F Y            L
Sbjct: 572 ALGNFPGRDLKVNYEEDILVGYRWFDTKGLPVVYPFGYGLSYTTFNYS----------NL 621

Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
           + D++  D   T+                    + TF +   N G  +G+EV  +Y   P
Sbjct: 622 NTDKKTYDQADTI--------------------QATFTL--TNTGDREGAEVAQLYVSDP 659

Query: 686 GIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
             +    +K++ G+++VF+  G+S ++   +    SL     A +  +      IL
Sbjct: 660 VCSVMRPVKELKGFKKVFLKPGESRRITLDI-PVSSLAFYSEAQSQFVVEPGEFIL 714


>gi|225871719|ref|YP_002753173.1| glycosyl hydrolase family, 3 [Acidobacterium capsulatum ATCC 51196]
 gi|225793416|gb|ACO33506.1| glycosyl hydrolase family, 3 [Acidobacterium capsulatum ATCC 51196]
          Length = 776

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 194/661 (29%), Positives = 318/661 (48%), Gaps = 93/661 (14%)

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVL 147
           T FP  +   AS++  +  +     + EAR++      G+ + ++P +++ RDPRWGR++
Sbjct: 133 TIFPVPLAQAASWDPVMVSRDQSIAAMEARSV------GIDWAFAPMVDIARDPRWGRMV 186

Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
           E  G DPY+    A   VRG Q              P  I AC KH+A Y     EG  R
Sbjct: 187 EGAGSDPYLGAAMAAAQVRGFQGA--------YPGAPNHILACAKHFAGYGAA--EGG-R 235

Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
            +  S +++  +   ++ PF   V  G V+++M +Y  +N +P   +  LL   +R DW 
Sbjct: 236 DYDASYISDSQLWNVYLPPFHAAVKAG-VATLMSAYMDLNDVPATGNQWLLQDVLRRDWK 294

Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG---DYYTNFTMGAVQQG 324
           F GY+VSD ++++ + ++H F  D +EDA  R  KAG++++       Y +    A+QQG
Sbjct: 295 FDGYVVSDANAVRNL-QTHGFAQD-QEDAAVRAFKAGVNMEMAIGQTAYDSELSKALQQG 352

Query: 325 KIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN--ICNPQHIELAAEAARQGIVLL 382
            I    +D ++R +  + MRLG F+    Y ++ ++   + +P H   A  AA +  VLL
Sbjct: 353 VITGQQLDDAVRPILEMKMRLGLFEHP--YVDVARSQRILDDPAHRTAARIAAERSAVLL 410

Query: 383 KNDNGALPLNTGNIKTLALVGPHANATKAMIG--NYEGTPCRYTSPMDGF---YAYSKVI 437
           +N+ G LPLN      +A++GP A++ +  +G   ++       + + G    +  S  I
Sbjct: 411 RNEGGLLPLNKTRYHNIAVIGPLADSQRDTLGPWTFDENLSETVTVLQGLRNAFGASAKI 470

Query: 438 NYAPGCADIVCQNNSMIPA---------------------AIDAAKNADATVIVAGLDLS 476
            YAPG A +  +  SM  A                     AID A+ +D  V+V G   +
Sbjct: 471 TYAPG-AQMHRKFPSMFDALDRGKKPPVWTPAQARQQMQQAIDLARKSDLVVMVLGEHQN 529

Query: 477 VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
           +  E      L LPG Q +L+  VA   K P+ LV+M+   ++I +A  +  + +IL V 
Sbjct: 530 MSGEAASSDSLKLPGDQEQLLQSVAATGK-PLVLVLMNGRPLNIKWAALH--VPAILDVW 586

Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
           YPG +GG A+A+++ GK  PGG+LP  W  +   V IPY         N   R +     
Sbjct: 587 YPGSQGGNAVANLLLGKSVPGGKLPFDWPRDVGQVPIPYAHNLTHEPQNQARRYWDEAST 646

Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
           P +YPFGYGLSYT F +          +++DK    +                  +DV  
Sbjct: 647 P-LYPFGYGLSYTAFAFS--------HLQIDKSSVSKK-----------------EDVHV 680

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
                   ++V N GK+ G EV  +Y  +  G A   ++++ G+ER+ +  GQ+  + FT
Sbjct: 681 S-------VDVTNTGKLAGDEVAQLYIHQEYGNASRPVRELKGFERITLQPGQTKTLQFT 733

Query: 715 M 715
           +
Sbjct: 734 L 734


>gi|262383006|ref|ZP_06076143.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           2_1_33B]
 gi|262295884|gb|EEY83815.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           2_1_33B]
          Length = 732

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 216/747 (28%), Positives = 354/747 (47%), Gaps = 141/747 (18%)

Query: 18  KLPYPERAKDLVERMTLPEKVQQM-GDLAY---GVPRLGLPLYEW-WSEALHGV-SFIGR 71
           K+   +R + L+++MTL EKV  + G+  +   GV RLG+P  EW  S+  HGV + I R
Sbjct: 28  KVQMEKRIEKLIKKMTLEEKVGLLHGNSKFYVAGVERLGIP--EWSLSDGPHGVRAEINR 85

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
              +  G   DS    A+ FPT     A++N  L  + G+ +  EAR             
Sbjct: 86  HDWAYAGWTNDS----ASYFPTGTAFAAAWNPELAYRRGEVLGEEAR-----WRKKDVLL 136

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
            P +N++R P  GR  E   EDPY+    A+ Y++GLQ        RD       ++   
Sbjct: 137 GPGVNIIRSPLCGRNFEYMSEDPYMNSVLAVAYIKGLQS-------RD-------VACSV 182

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KH+A   ++N E N R   D   +E+ ++E ++  F+  V EG   +VM +YN+  G   
Sbjct: 183 KHFA---VNNQETN-RTTVDVECSERALREIYLPAFKAAVQEGGALTVMAAYNKFRGEFC 238

Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
             +  L+ + +R +W F G  V+D  +  + V S               ++AGLDL+ G 
Sbjct: 239 AENNYLVRKILRNEWGFDGVYVTDWGAAHSTVPS---------------MEAGLDLEMGT 283

Query: 312 --------YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
                   YY N  + AV+ GKI  + +D  +  +  V+++    D  P+ K  G  ++ 
Sbjct: 284 LIDKYEDWYYANPLIEAVKSGKIPMSLVDEKVGDVLRVMIKTNVLD--PK-KRFGPGSMN 340

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG-----PHANA--TKAMIGNY 416
             +H +   +AA + IVLLKN N  LPL+  +IK+LA++G      H+N   +  +   Y
Sbjct: 341 TKEHQQATYDAAAEAIVLLKNQNNLLPLDFSSIKSLAVIGDNATRKHSNGGLSSEIKAVY 400

Query: 417 EGTP-----CRYTSPMDGFYA--YSKVINYAPGC---------ADIVCQNNSMIPAAIDA 460
           E TP      ++   +D  +A  Y K+  +  G          +    ++++++  A++ 
Sbjct: 401 EVTPLEALRAKWGDKVDIRFAQGYEKLSTFVEGSNNGQSSGTFSSKTQESDALLKEAVEV 460

Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
           A+ +D  ++V GL+   + E  DR+++ +P  Q ELI +V  A   P T+V+M AG+  +
Sbjct: 461 ARTSDVALLVCGLNHDYDTESFDRLNMDIPYGQVELIQEVVKA--NPRTIVVMIAGS-PL 517

Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR 580
           N A  +    +I+W  + G EGG A+ DV+ GK NP G++P T        +     P  
Sbjct: 518 NMAAVDICSPAIVWAWFNGMEGGNALVDVLSGKVNPSGKMPFT------TPVSLDQSPAH 571

Query: 581 PVNNFPGRT------------YKFFDG---PVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
            + NFPGR             Y++FD    PVVYPFGYGLSYT F Y            L
Sbjct: 572 ALGNFPGRDLKVNYEEDILVGYRWFDTKGLPVVYPFGYGLSYTTFNYS----------NL 621

Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
           + D++  D   T+                    + TF +   N G  +G+EV  +Y   P
Sbjct: 622 NTDKKTYDQADTI--------------------QATFTL--TNTGDREGAEVAQLYVSDP 659

Query: 686 GIAGTH-IKQVIGYERVFIAAGQSAKV 711
             +    +K++ G+++VF+  G+S ++
Sbjct: 660 VCSVMRPVKELKGFKKVFLKPGESRRI 686


>gi|301307693|ref|ZP_07213650.1| thermostable beta-glucosidase B [Bacteroides sp. 20_3]
 gi|423337298|ref|ZP_17315042.1| hypothetical protein HMPREF1059_00967 [Parabacteroides distasonis
           CL09T03C24]
 gi|300834367|gb|EFK64980.1| thermostable beta-glucosidase B [Bacteroides sp. 20_3]
 gi|409237758|gb|EKN30554.1| hypothetical protein HMPREF1059_00967 [Parabacteroides distasonis
           CL09T03C24]
          Length = 732

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 221/776 (28%), Positives = 362/776 (46%), Gaps = 142/776 (18%)

Query: 18  KLPYPERAKDLVERMTLPEKVQQM-GDLAY---GVPRLGLPLYEW-WSEALHGV-SFIGR 71
           K+   +R + L+++MTL EKV  + G+  +   GV RLG+P  EW  S+  HGV + I R
Sbjct: 28  KVQMEKRIEKLIKKMTLEEKVGLLHGNSKFYVAGVERLGIP--EWSLSDGPHGVRAEINR 85

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
              +  G   DS    A+ FPT     A++N  L  + G+ +  EAR             
Sbjct: 86  HDWAYAGWTNDS----ASYFPTGTAFAAAWNPELAYRRGEVLGEEAR-----WRKKDVLL 136

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
            P +N++R P  GR  E   EDPY+    A+ Y++GLQ        RD       ++   
Sbjct: 137 GPGVNIIRSPLCGRNFEYMSEDPYMNSVLAVAYIKGLQS-------RD-------VACSV 182

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KH+A   ++N E N R   D   +E+ ++E ++  F+  V EG   +VM +YN+  G   
Sbjct: 183 KHFA---VNNQETN-RTTVDVECSERALREIYLPAFKAAVQEGGALTVMAAYNKFRGEFC 238

Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
             +  L+ + +R +W F G  V+D  +  + V S               ++AGLDL+ G 
Sbjct: 239 AENNYLVRKILRNEWGFDGVYVTDWGAAHSTVPS---------------MEAGLDLEMGT 283

Query: 312 --------YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
                   YY N  + AV+ GKI  + +D  +  +  V+++    D  P+ K  G  ++ 
Sbjct: 284 LIDKYEDWYYANPLIEAVKSGKIPMSLVDEKVGDVLRVMIKTNVLD--PK-KRFGPGSMN 340

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG-----PHANA--TKAMIGNY 416
             +H +   +AA + IVLLKN N  LPL+  +IK+LA++G      H+N   +  +   Y
Sbjct: 341 TKEHQQATYDAAAEAIVLLKNQNNLLPLDFSSIKSLAVIGDNATRKHSNGGLSSEIKAVY 400

Query: 417 EGTP-----CRYTSPMDGFYA--YSKVINYAPGC---------ADIVCQNNSMIPAAIDA 460
           E TP      ++   +D  +A  Y K+  +  G          +    ++++++  A++ 
Sbjct: 401 EVTPLEALRAKWGDKVDIRFAQGYEKLSTFVEGSNNGQSSGTFSSKTQESDALLKEAVEV 460

Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
           A+ +D  ++V GL+   + E  DR+++ +P  Q ELI +V  A   P T+V+M AG+  +
Sbjct: 461 ARTSDVALLVCGLNHDYDTESFDRLNMDIPYGQVELIQEVVKA--NPRTIVVMIAGS-PL 517

Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR 580
           N A  +    +I+W  + G EGG A+ DV+ GK NP G++P T        +     P  
Sbjct: 518 NMAAVDICSPAIVWAWFNGMEGGNALVDVLSGKVNPSGKMPFT------TPVSLDQSPAH 571

Query: 581 PVNNFPGRT------------YKFFDG---PVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
            + NFPGR             Y++FD    PVVYPFGYGLSYT F Y            L
Sbjct: 572 ALGNFPGRDLKVNYEEDILVGYRWFDTKGLPVVYPFGYGLSYTTFNYS----------NL 621

Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
           + D++  D   T+                    + TF +   N G  +G+EV  +Y   P
Sbjct: 622 NTDKKTYDQADTI--------------------QATFTL--TNTGDREGAEVAQLYVSDP 659

Query: 686 GIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
             +    +K++ G+++VF+  G+S ++   +    SL     A +  +      IL
Sbjct: 660 VCSVMRPVKELKGFKKVFLKPGESRRITLDI-PVSSLAFYSEAQSQFVVEPGEFIL 714


>gi|51507369|emb|CAH18932.1| beta-xylosidase [Pyrus communis]
          Length = 238

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 126/239 (52%), Positives = 171/239 (71%), Gaps = 4/239 (1%)

Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQHIELAAEAARQ 377
           ++ G++ E DI+ +L     V MRLG FDG P   +Y NLG  ++C P   ELA EAARQ
Sbjct: 1   MRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLGLADVCKPSSNELALEAARQ 60

Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVI 437
           GIVLL+N   +LPL+T   +T+A++GP+++ T+ MIGNY G  C YT+P+ G   Y++ I
Sbjct: 61  GIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAGVACGYTTPLQGIARYTRTI 120

Query: 438 NYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELI 497
           + A GC D+ C  N +I AA  AA+ ADATV+V GLD S+EAE +DR +LLLPG Q EL+
Sbjct: 121 HQA-GCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEAEFRDRTNLLLPGHQQELV 179

Query: 498 NKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNP 556
           ++VA A++GP  LVIMS G +D+ FAKN+P+I +I+WVGYPG+ GG AIADV+FG  NP
Sbjct: 180 SRVARASRGPTILVIMSGGPIDVMFAKNDPRIGAIIWVGYPGQAGGTAIADVLFGTTNP 238


>gi|365122063|ref|ZP_09338970.1| hypothetical protein HMPREF1033_02316 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363643257|gb|EHL82578.1| hypothetical protein HMPREF1033_02316 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 819

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 223/804 (27%), Positives = 360/804 (44%), Gaps = 135/804 (16%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WS------ 60
           + + K P  +R +DL+ +M L EK  Q+  L YG  R+    LP  EW    W       
Sbjct: 53  FENPKQPIEKRVQDLLSQMNLDEKTCQLATL-YGYKRVMSDSLPTPEWKNKIWKDGIANI 111

Query: 61  -EALHGVS--------------------------FIGRRTNSPPGTHFDSEVPG-----A 88
            E L+GV                           FI       P    +  + G     A
Sbjct: 112 DEQLNGVGRGAKIAQDLIYPFSKHAEAINKTQKWFIEETRLGIPVDFSNETIHGLNHTKA 171

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVL 147
           T  P  I   +++N  L  K G     EA+A+      G T  ++P +++ RDPRWGRVL
Sbjct: 172 TPLPAPIGIGSTWNAPLVYKAGSIAGKEAKAL------GYTNIYAPILDLARDPRWGRVL 225

Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
           E  GEDP++V       V+G+Q+ +GV             +A  KH+A Y +     +  
Sbjct: 226 ECYGEDPFLVATLGTQMVKGIQE-QGV-------------AATLKHFAVYSVPKGGRDGS 271

Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
              D  V  ++M +  + PF+  + +     VM SYN  +G+P  A    L Q +R ++ 
Sbjct: 272 VRTDPHVAPREMHQMHLYPFKKVIQDAHPMGVMSSYNDWDGVPVTASYYFLTQLLRQEFG 331

Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAVQQ 323
           F GY+VSD D+++ +   H  + +T E+AV  VL+AGL++       D +       V++
Sbjct: 332 FDGYVVSDSDAVEYVYNKH-HVAETYEEAVRMVLEAGLNVRTTFAAPDIFILPARKLVKE 390

Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNP-QHIELAAEAARQGIVLL 382
           G+++   ID  +  +  V  RLG FD          + I    ++ +   +  RQ +VLL
Sbjct: 391 GRLSMKVIDERVADVLRVKFRLGLFDQPFVADPKAADKIVGADKNKDFVLDIQRQSLVLL 450

Query: 383 KNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKV-INY 439
           KN+N  LPL+   +  + + GP A     M+  Y        +  +G   Y  +KV ++Y
Sbjct: 451 KNENNLLPLDKNKLSRILITGPLAKEENYMVSRYGPQELENITVYEGIKNYLGNKVAVDY 510

Query: 440 APGC--------------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
           A GC              + +  +    I  A++ AK +D  + V G D     E K R 
Sbjct: 511 ALGCKVKDAKWPESEIIHSPLTTEEQQEIQNAVEKAKLSDIVIAVLGEDEESTGESKSRS 570

Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
            L LPG Q +L+  +    K PV LV+++   + IN+A  +  I +IL   +PG+ GG A
Sbjct: 571 GLDLPGRQQQLLEALYATGK-PVVLVLINGQPLTINWA--DRYIPAILEAWFPGQMGGTA 627

Query: 546 IADVIFGKYNPGGRLPITW------YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
           IA+ +FG YNPGG+LP+T+       E N+   P  S   +P     G      +G  +Y
Sbjct: 628 IAETLFGDYNPGGKLPVTFPKTLGQIELNFPFKP-ASQSKQPEAGPNGYGKTRVNG-ALY 685

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           PFG+GLSYT F+Y         ++K+  ++Q                          D +
Sbjct: 686 PFGFGLSYTTFEYS--------NLKVSPERQGPK----------------------GDIQ 715

Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNAC 718
            +F  ++ N GK  G E+V +Y K    +    + ++ G+ERV +  G++  + FT++  
Sbjct: 716 VSF--DITNTGKRAGDEIVQLYVKDKVSSVISYESLLRGFERVSLQPGETKNIQFTLHP- 772

Query: 719 KSLKIVDNAANSLLASGAHTILVG 742
           + L+I+D   N  +  G   + +G
Sbjct: 773 EDLEILDINMNWNVEPGEFEVRIG 796


>gi|333377833|ref|ZP_08469566.1| hypothetical protein HMPREF9456_01161 [Dysgonomonas mossii DSM
           22836]
 gi|332883853|gb|EGK04133.1| hypothetical protein HMPREF9456_01161 [Dysgonomonas mossii DSM
           22836]
          Length = 780

 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 203/702 (28%), Positives = 329/702 (46%), Gaps = 97/702 (13%)

Query: 87  GATSFPTVILTTASFNESLWKKIGQTVSTEARAM-YNLGNAGLTFWSPNINVVRDPRWGR 145
           G T FPT I   A++N +L +++   +S EAR+   ++G      + P +++ R+ RW R
Sbjct: 144 GTTVFPTAIGQAATWNPNLIQQMSAVISKEARSQGSHIG------YGPVLDLAREARWSR 197

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
           V ET GEDP ++ +    +V G        +     S+P  + +  KH+ AY + +   N
Sbjct: 198 VEETYGEDPVLISKMGEAFVTG--------FGSGDLSKPYSLISTLKHFVAYGIPDGGHN 249

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
                 + V  +D++E ++ PFE  V  G +S VM +YN V+GIP  ++  LL   +  D
Sbjct: 250 GN---SNSVGMRDLKENYLPPFEKAVKAGALS-VMTAYNSVDGIPCTSNEYLLKDVLCKD 305

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGK 325
           W F G+ VSD  SI+ +  SH  ++  +E A+   L +GLD D G         AV++G 
Sbjct: 306 WGFKGFTVSDLGSIEGLKGSHYVVSTIQEAAILS-LTSGLDCDLGGNAFFTLSDAVKKGM 364

Query: 326 IAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKND 385
           + E  ID+++  +  +   +G F+     +N  +  +   ++I LA + AR+ IVLL+N 
Sbjct: 365 VGETQIDSAVYKILKLKFDMGLFENPYVDENNARQVVRTQENIVLARQVARESIVLLENK 424

Query: 386 NGALPLNTGNIKTLALVGPHANATKAMIGNYEG--TPCRYTSPMDGFYAYSK--VINYAP 441
           N  LPLN   IK +A++GP+A+     +G+Y          + +DG  +  K   I Y  
Sbjct: 425 NNVLPLNKSKIKKIAVIGPNADNVYNQLGDYTAPQDDSNVKTVLDGIRSKLKQSQIEYVK 484

Query: 442 GCADIVCQNNSMIPAAIDAAKNADATV---------------IVAGLDLSVE-------- 478
           GCA I    N+ I  A+ AA  +D  V               I  G  ++ E        
Sbjct: 485 GCA-IRDTLNTDIDKAVQAALRSDVAVVVVGGSSARDFKTKYIETGAAVADEHSISDMES 543

Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
            EG DRV L L G Q EL+  +    K PV +V +    +++N+A  N    ++L   YP
Sbjct: 544 GEGFDRVSLDLMGKQLELLKAIKATGK-PVVVVYIQGRPLNMNWASENA--DALLSAWYP 600

Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG-RTYKFFDGPV 597
           G+EGG AIADV+FG+YNP GRLP++      V      +P+   +  P    Y       
Sbjct: 601 GQEGGNAIADVLFGEYNPAGRLPMS------VAKSVGQLPVYYNHRNPASHDYVEMTSKP 654

Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
           +Y FGYGLS+T F+Y         ++K++K     ++                       
Sbjct: 655 LYSFGYGLSFTSFEYS--------NLKINKSNSGVEVT---------------------- 684

Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMN 716
                 +E+ N G  DG EVV +Y +    +    I Q+  +ERV +  G++  +   + 
Sbjct: 685 ------VELRNSGNFDGDEVVQLYLRNNRASVVQPIMQLKAFERVNLKKGETKTIKLLLT 738

Query: 717 ACKSLKIVDNAANSLLA-SGAHTILVGEGVGGVSFPLQLNLN 757
                 I+D   N ++  +G  T +VG     +    ++ LN
Sbjct: 739 K-DDFSIIDKKMNRVVEPNGDFTFMVGSASDNIKLREKMMLN 779


>gi|409197254|ref|ZP_11225917.1| glycoside hydrolase 3 [Marinilabilia salmonicolor JCM 21150]
          Length = 734

 Score =  256 bits (655), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 209/734 (28%), Positives = 344/734 (46%), Gaps = 102/734 (13%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPR--------LGLPLYEWWSEALHGVSFIG---R 71
           ER + L+  MTL EK+ QM  ++ G           +G  L E   E ++ +  I     
Sbjct: 22  ERVEQLLGEMTLDEKIGQMCQVSGGQGNEESIRQGMIGSILNEVDPENINRLQKIAVEES 81

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF- 130
           R   P     D      T FP  +   A++N  L +K  +  ++EA       + G+ + 
Sbjct: 82  RLGIPIIVARDVIHGFKTVFPIPLGQAATWNPELVQKGSRIAASEA------ASTGVRWT 135

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           ++P I++ RD RWGR+ E+ GEDPY+        V G Q         DS +    I+AC
Sbjct: 136 FAPMIDISRDARWGRIAESLGEDPYLTSVLGAAMVTGFQG--------DSLNGETSIAAC 187

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
            KH+A Y     EG   ++  S +  +++++ ++ PF+  V+ G V + M  +N V+G+P
Sbjct: 188 AKHFAGYGAA--EGGRDYNTTS-IPPRELRDIYLPPFKAAVDAG-VRTFMSGFNEVDGVP 243

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
             A+  LL   +R +W F G++VSD  S   ++ +H F  D KE A  R +K G+D++  
Sbjct: 244 ATANKYLLTDVLRNEWQFDGFVVSDWASTWEMI-NHGFAADEKE-AAHRAIKVGVDMEMA 301

Query: 311 DY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIE 369
              Y +     +++G +   DI+ ++R +  V   LG FD +P      +N    P+++E
Sbjct: 302 TTTYRDNIAALLKEGALNIEDINQAVRNILRVKFELGLFD-NPYIAEEKQNQFARPEYLE 360

Query: 370 LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGN--YEGTPCRYTSPM 427
            A  AA Q +VLLKN+   LP+N+ +   +AL+GP A+     +G   ++G      +P+
Sbjct: 361 AANLAATQSMVLLKNEQKTLPINSSS--KIALIGPMADQPYEQLGTWIFDGDTTLTVTPL 418

Query: 428 DGF---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
             F   +    V+ +A G      ++      AI+ AKN+D  V   G +  +  E   R
Sbjct: 419 QAFNKTFGQENVL-FAEGMPISRTRHQKGFRKAIEQAKNSDVIVFCGGEESILSGEAHSR 477

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
            ++ LPG Q ELI ++    K P+ LV+M+   + I   + +    ++++  +PG  GG 
Sbjct: 478 ANIDLPGVQNELIKELKKTGK-PLVLVVMAGRPLTI--GEISEHADAVVYAWHPGTMGGA 534

Query: 545 AIADVIFGKYNPGGRLPITWYE-ANYVKIPYTSMPL-RPVNNFPGRTYKFFDGPV----- 597
           A+AD++ GK NP G+LP+T+ +    + I Y      RP N  P    + +D PV     
Sbjct: 535 ALADIVSGKANPSGKLPVTFPKVVGQIPIYYNHKNTGRPAN--PDSWTQMYDIPVKAPQT 592

Query: 598 ---------------VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
                          +YPFGYGLSYT F+Y         D+ LDK+   RD         
Sbjct: 593 SLGNESHYIDAGFIPLYPFGYGLSYTSFEYS--------DLSLDKEVYARD--------- 635

Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERV 701
                      +  + +FT      N G+  G EV  VY +   G     +K++  +ER+
Sbjct: 636 -----------ETIEVRFTLS----NTGEFAGEEVAQVYVRDLVGNVTRPVKELKAFERI 680

Query: 702 FIAAGQSAKVGFTM 715
            +  G+S  V  T+
Sbjct: 681 DLQKGESKTVTLTI 694


>gi|224537403|ref|ZP_03677942.1| hypothetical protein BACCELL_02281 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224520981|gb|EEF90086.1| hypothetical protein BACCELL_02281 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 750

 Score =  256 bits (655), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 205/742 (27%), Positives = 347/742 (46%), Gaps = 109/742 (14%)

Query: 29  VERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGA 88
           V  +T P    ++  +A    RLG+PL     + +HG   I                   
Sbjct: 75  VMSITDPNIFNEVQRIAVEDSRLGIPLINA-RDVIHGFKTI------------------- 114

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVL 147
             FP  +   ASFN  + +   +  +TEA A      AG+ + ++P I++  DPRWGR+ 
Sbjct: 115 --FPIPLGQAASFNPEIAETGARIAATEASA------AGIRWTFAPMIDITHDPRWGRIA 166

Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
           E  GEDP +V +  +  ++G Q          S + P  I+AC KH+A Y     EG  R
Sbjct: 167 EGFGEDPLLVSQMGVAAIKGFQG--------SSLNHPTSIAACAKHFAGYGAS--EGG-R 215

Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
            +  + +TE+  +  ++ PFE  VN G  +++M ++N  +GIP+ A+P LL   +R +WN
Sbjct: 216 DYNSTYITERQFRNLYLRPFEAAVNAG-AATLMTAFNDNDGIPSSANPFLLKDVLRNEWN 274

Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQGKI 326
           + G +VSD  S+  ++  H F  D KE A+ +   AG D++   + Y  +    +++GK+
Sbjct: 275 YRGTVVSDWASVSEMIR-HGFCEDEKEAAL-KATNAGTDIEMVSETYIKYLPQLIKEGKV 332

Query: 327 AEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDN 386
           +   ID ++R +  +  RLG F+  P   +  K     P  +E A  AA Q  VLLKN+ 
Sbjct: 333 SMETIDNAVRNILRLKFRLGLFE-HPYIADQRKETFYRPDFLEAAQTAAEQSAVLLKNER 391

Query: 387 GALPLNTGNIKTLALVGPHANATKAMIGN--YEGTPCRYTSPMDGFYAYS----KVINYA 440
           G LP+ + NIKT+ + GP A+A    +G   ++G      +P+      S    KV+ YA
Sbjct: 392 GTLPIQS-NIKTILVTGPLADAPHEQLGTWVFDGDASYSQTPLQALRRTSGDSIKVL-YA 449

Query: 441 PGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKV 500
           PG         S     ++ A+ AD  +   G +  +  E     +L L G Q+ L++++
Sbjct: 450 PGLNYSRDTATSQFNKVVELAREADLILAFVGEEAILSGEAHCLANLNLQGAQSRLLHRL 509

Query: 501 ADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRL 560
           ++  K P+  V+M+   + I    N     ++L+  +PG  GG A+A+++FGK  P G+L
Sbjct: 510 SETGK-PLVTVVMAGRPLTIGREVNIS--DALLYAFHPGTMGGPALANLLFGKVVPSGKL 566

Query: 561 PITW-YEANYVKIPYT----------------SMPLRPVNNFPGRTYKFFDGPV--VYPF 601
           P+T+  E   + I Y                 ++P+       G T  + D     ++PF
Sbjct: 567 PVTFPKETGQIPIYYNHTSTGRPASGSEKNIFTIPVGAEQTSLGNTSFYLDAGKDPLFPF 626

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           GYGLSYT F Y         +++L   Q  R+              V+I          T
Sbjct: 627 GYGLSYTTFAYS--------NLQLSSTQYTRN-------------EVII---------IT 656

Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKS 720
           F  ++ N GK DG+E+  +Y +    + T  +K++  +ER+ + AG++  +   +   K 
Sbjct: 657 F--DLTNTGKTDGTEIAQLYFRDLAASVTRPVKELAAFERIHLKAGETRHIRMEL-PVKQ 713

Query: 721 LKIVDNAANSLLASGAHTILVG 742
           L   + A +  +  G   + +G
Sbjct: 714 LSFWNYAMDYCVEPGKFDLWIG 735


>gi|393784338|ref|ZP_10372503.1| hypothetical protein HMPREF1071_03371 [Bacteroides salyersiae
           CL02T12C01]
 gi|392666114|gb|EIY59631.1| hypothetical protein HMPREF1071_03371 [Bacteroides salyersiae
           CL02T12C01]
          Length = 857

 Score =  256 bits (655), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 218/807 (27%), Positives = 360/807 (44%), Gaps = 162/807 (20%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQM-----------------------GDLAYGV 48
             Y  + LP  ER  DL+ RMTL EK+ Q+                       G +++G 
Sbjct: 26  LSYRQSSLPISERVDDLLGRMTLEEKIAQIRHIHSWNVFNGQDLDMEKLGKFTGGVSWGF 85

Query: 49  -------------------------PRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
                                     RLG+P++   +E+LHG                 S
Sbjct: 86  VEGFPLTGVNCKKNMQLIQKFMVENTRLGIPVFTV-AESLHG-----------------S 127

Query: 84  EVPGATSFPTVILTTASFNESLWKKIGQTVSTE--ARAMYNLGNAGLTFWSPNINVVRDP 141
              G+T +P  I   ++F   L  +    ++ +  A+ M+ +        +P I+VVRD 
Sbjct: 128 VHEGSTIYPQNIAMGSTFRPELAYRKAAMITKDLHAQGMHQV-------LAPCIDVVRDL 180

Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
           RWGRV E+ GEDP + G + I  V+G  D                IS   KHY  +    
Sbjct: 181 RWGRVEESFGEDPVLCGLFGIAEVKGYMDN--------------GISPMLKHYGPH---- 222

Query: 202 WEGNDRFHFDSRVTE---QDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLL 258
             GN     +    E   +D+ E ++ PFEM +    V +VM +YN  N +P  A   LL
Sbjct: 223 --GNPLSGLNLASVECGLRDLHEVYLKPFEMVIRNTPVLAVMSTYNSWNHVPNSASHYLL 280

Query: 259 NQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTM 318
            + +RG + F GY+ SD  +I+ +   H+  +++ E+A  +   AGLD++          
Sbjct: 281 TEVLRGQFGFKGYVYSDWGAIEMLKTLHRVAHNS-EEAAMQAFTAGLDVEASSNCYPLLA 339

Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQG 378
           G +Q+GK+ E  ++ S+R +     ++G F+  P  +    + +   + I L+ E A + 
Sbjct: 340 GLIQKGKLDEEVLNESVRRVLYAKFKMGLFE-DPYGEQYSHSEMHGAESIRLSKEIADES 398

Query: 379 IVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY--TSPMDG---FYAY 433
           +VLLKN+NG LPLN   +K++A++GP  NA +   G+Y  +       +P++G       
Sbjct: 399 VVLLKNENGLLPLNADKLKSVAVIGP--NADQVQFGDYTWSRNNKDGVTPLEGIRRLLGG 456

Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG---------LDLSVEAEGKDR 484
              + YA GC D+V  N   I  A++AA+ ++  ++  G            S   EG D 
Sbjct: 457 KATVRYAKGC-DLVSLNAGGIKEAVEAARKSEVAILFCGSASAALARDYKSSTCGEGFDL 515

Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
            DL L G Q +LI +V +    PV LV+++     I++ K +  I +IL   Y GE+ G 
Sbjct: 516 NDLNLTGVQGQLIKEVYETGT-PVVLVLVTGKPFAISWEKKH--IPAILTQWYAGEQAGN 572

Query: 545 AIADVIFGKYNPGGRLPITWYEAN-YVKIPYTSMPLRP-------VNNFPGRTYKFFDGP 596
           +IAD++FG  +P GRL  ++ +   ++ + Y  +P              PGR Y F    
Sbjct: 573 SIADILFGSISPSGRLTFSYPQTTGHLPVYYNYLPSDKGFYKNPGSYESPGRDYVFSSPD 632

Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
            ++ FG+GL+YT F YK           L  D++   +N T          + ID     
Sbjct: 633 ALWAFGHGLTYTSFVYK----------NLRTDKEHYGLNDT----------IYID----- 667

Query: 657 DYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
                  ++++N GK +G EVV +Y +       T +KQ+  +++V + AG++  V   +
Sbjct: 668 -------VDIKNTGKREGKEVVQLYVNDKVSTVVTPVKQLRDFKKVDVEAGKTETVKLKV 720

Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
            A   L IV+     ++  G   + VG
Sbjct: 721 -AVNDLYIVNAGNKRVVEPGEFELQVG 746


>gi|365121645|ref|ZP_09338561.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363645135|gb|EHL84409.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 868

 Score =  256 bits (654), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 164/460 (35%), Positives = 238/460 (51%), Gaps = 51/460 (11%)

Query: 2   FESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSE 61
           F +   +  + PY + +L   ERA DL+ RMTL EK  QM +   G+ RLG+  Y+WW+E
Sbjct: 14  FSAFSFRAENPPYKNPELSPDERALDLLNRMTLKEKFAQMHNNTGGIERLGVRPYDWWNE 73

Query: 62  ALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA-- 119
           ALHG++  G+                AT FP  I   A+F+++   ++   VS E RA  
Sbjct: 74  ALHGIARAGK----------------ATVFPQAIGLAATFDDTAVYEMFDMVSDEGRAKY 117

Query: 120 -------MYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVE 172
                  MYN G  GLTFW+PNIN+ RDPRWGR +ET GEDP++  +  +  V+GLQ   
Sbjct: 118 HDFQRKGMYN-GYKGLTFWTPNINIFRDPRWGRGMETYGEDPFLTTKMGLAVVKGLQG-- 174

Query: 173 GVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVN 232
                 D   +  K  AC KHYA +    W  N   +    ++ +D++ET++  F+  V 
Sbjct: 175 ------DGTQKYDKAHACAKHYAVHSGPEW--NRHSYNAENISIRDLRETYLPAFKALVT 226

Query: 233 EGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV-----ESHK 287
           EG V  VMC+YNR  G P C++  LL   ++ +W F   IVSDC +I         E+H 
Sbjct: 227 EGKVKEVMCAYNRFEGEPCCSNKTLLINILKDEWGFDDVIVSDCGAIADFYTKGRHETHA 286

Query: 288 FLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
              D   DAV     +G DL+CG  Y      A+++G I E  I+ S+  L      LG 
Sbjct: 287 SAADASADAVI----SGTDLECGGSYWALDE-ALEKGLITETKINESVFRLLRARFELGM 341

Query: 348 FDGSP--QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPH 405
           FD      + ++  + +C  +H   A E AR+ +VLL N N  LPL+  +IK +A++GP+
Sbjct: 342 FDDDSLVSWSSIPYSVVCCDKHKAKALEMARKSMVLLSNKNNTLPLSK-SIKKVAVMGPN 400

Query: 406 ANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVINYAPGC 443
           AN +  +  NY GTP R  + ++G  A      + Y  GC
Sbjct: 401 ANDSVMLWANYNGTPDRSVTILEGIKAKLPEGSVIYEKGC 440



 Score =  120 bits (301), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 94/303 (31%), Positives = 141/303 (46%), Gaps = 60/303 (19%)

Query: 456 AAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAK 505
           A  +  K+ADA + V G+  S+E E            DR ++ LP  Q  ++  + +  K
Sbjct: 594 AVAEKVKDADAIIFVGGISSSLEGEEMGVKYPGFRNGDRTNIDLPQVQKNMMKALKETGK 653

Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
            PV  V+ S   + +++   N  + +IL   YPG+EGG A+ADV+FG YNP GRLP+T+Y
Sbjct: 654 -PVIFVLCSGSTMALSWEDKN--MDAILQAWYPGQEGGTAVADVLFGDYNPAGRLPLTFY 710

Query: 566 EANYVKIPYTSMPLRPVNNF-----PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS 620
                    +S  L    N+      GRTY++F G  +YPFG+GLSYT F Y  A     
Sbjct: 711 A--------SSDDLPDFENYNMSEGQGRTYRYFKGKPLYPFGHGLSYTGFSYSKA----- 757

Query: 621 VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
              KL+K  +   +N +V                         + ++N G  DG EVV V
Sbjct: 758 ---KLNK--KSMSVNDSV----------------------FLSLNLKNTGLRDGDEVVQV 790

Query: 681 YSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTI 739
           Y +         K + GY+RV + AGQ+  V   + A  S +  +     + +  G + I
Sbjct: 791 YIRNLQDPEGPSKSLRGYKRVSVKAGQTVPVKIDLPAS-SFEFFNPVTEKMEVRPGKYEI 849

Query: 740 LVG 742
           L G
Sbjct: 850 LYG 852


>gi|423279982|ref|ZP_17258895.1| hypothetical protein HMPREF1203_03112 [Bacteroides fragilis HMW
           610]
 gi|404584318|gb|EKA88983.1| hypothetical protein HMPREF1203_03112 [Bacteroides fragilis HMW
           610]
          Length = 812

 Score =  256 bits (654), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 237/833 (28%), Positives = 361/833 (43%), Gaps = 161/833 (19%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
           Y +   P  ER + L+ +MTL EKV QM      +  LG P+YE                
Sbjct: 47  YENPSAPVEERVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEEIRLTARLEKEI 100

Query: 58  ----------------WWSEALH-GV--SFIGRRTNSPPG---THFDSEVP--------- 86
                           W    LH G+  S   R +N        H    +P         
Sbjct: 101 SEYHIGALWGFMRADPWTQRTLHTGLNPSLAARASNRLQAFVMEHSRLGIPLFLAEECPH 160

Query: 87  -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
                G T FPT I   +++N  L +++G+ ++ EA A           + P +++ RDP
Sbjct: 161 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDP 215

Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
           RW RV ET GEDPY+ G      VRG Q         D+      + A  KH+A+Y    
Sbjct: 216 RWSRVEETYGEDPYLNGVMGAALVRGFQG--------DTLRGRKSVIATLKHFASY---G 264

Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
           W         + + E++++E    PF   V  G +S VM SYN ++G P      LL   
Sbjct: 265 WTEGGHNGGTAHLGERELEEAIFPPFREAVGAGALS-VMSSYNEIDGNPCTGSRYLLTDI 323

Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
           +   W F G++VSD  +I  + E H       E AV + + AG+D D G + Y    + A
Sbjct: 324 LEDRWLFKGFVVSDLYAIGGLRE-HGVAGSDYEAAV-KAVNAGVDSDLGTNVYAEQLVAA 381

Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
           V++G +A   +D ++R +  +   +G FD            + +P+HI LA E ARQ IV
Sbjct: 382 VRKGDVAMETVDKAVRRILSLKFHMGLFDAPFVDDKRPAQLVASPEHIGLAREVARQSIV 441

Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
           LLKN++  LPL   +I+TLA++GP+A+    M+G+Y     +G+       +    +   
Sbjct: 442 LLKNEDKLLPLKK-DIRTLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIRQKVSKDT 500

Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
            + YA GCA +   + +    AI++A++AD  V+V G     D S E             
Sbjct: 501 RVFYAKGCA-VRDSSRTGFADAIESARSADVVVMVVGGSSARDFSSEYEETGAAKVSANR 559

Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
                  EG DR  L L G Q EL+ +V    K P+ LV++    + +       +  +I
Sbjct: 560 VSDMESGEGYDRATLHLMGRQLELLEEVRKLGK-PMVLVLIKGRPLLMEGVIQ--EADAI 616

Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
           L   YPG +GG A+ADV+FG YNP GRL ++      V      +P+       G   ++
Sbjct: 617 LDAWYPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTKRKGNRSRY 670

Query: 593 FD--GPVVYPFGYGLSYTQFKY---KVASSPKSVDIKLDKDQQCR-DINYTVGTNKPPCA 646
            +  G   YPFGYGLSYT F Y   KV  S +S          CR D++ T         
Sbjct: 671 IEEAGTPRYPFGYGLSYTTFSYTGMKVRVSEES--------NHCRVDVSVT--------- 713

Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAA 705
                              V N G +DG EVV +Y +   G   T  +Q+  + RV + A
Sbjct: 714 -------------------VRNQGTVDGDEVVQLYLRDEVGSFTTPDRQLRAFRRVRLKA 754

Query: 706 GQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLNH 758
           G++ ++ FT++  KSL +        +  G  T++ G     ++   +  +N 
Sbjct: 755 GETWEITFTLDK-KSLALYMRDGEWAVEPGRFTVMAGGSSEDIACQQEFEINR 806


>gi|294675359|ref|YP_003575975.1| 1,4-beta-xylosidase [Prevotella ruminicola 23]
 gi|225016052|gb|ACN78955.1| xylosidase/arabinofuranosidase [Prevotella ruminicola]
 gi|294472720|gb|ADE82109.1| putative 1,4-beta-xylosidase [Prevotella ruminicola 23]
          Length = 861

 Score =  256 bits (654), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 157/445 (35%), Positives = 225/445 (50%), Gaps = 41/445 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY +  L   ERA DL  R+TL EK   M D +  +PRLG+  + WWSEALHG + +G 
Sbjct: 22  LPYQNPNLSAKERAVDLCSRLTLEEKAMLMLDESPAIPRLGIKKFFWWSEALHGAANMGN 81

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------- 122
            TN                FP  +   ASFN  L  K+    STE RA YN         
Sbjct: 82  VTN----------------FPEPVGMAASFNPHLLFKVFDIASTEFRAQYNHRMYDLNGE 125

Query: 123 -LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
            +    L+ W+PN+N+ RDPRWGR  ET GEDPY+     +  V+GLQ  E        D
Sbjct: 126 DMKMRSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGVQVVKGLQGPE--------D 177

Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
           +R  K+ AC KHYA +    +  +     D  V+ +D  ET++  F+  V +  V  VMC
Sbjct: 178 ARYRKLWACAKHYAVHSGPEYTRHTANLTD--VSARDFWETYMPAFKTLVKDAKVREVMC 235

Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
           +Y R++  P C   +LL Q +R +W F   +VSDC ++    E+HK  +D        VL
Sbjct: 236 AYQRLDDDPCCGSTRLLQQILRDEWGFEYLVVSDCGAVSDFYENHKSSSDAVHGTSKAVL 295

Query: 302 KAGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLG 358
            AG D++CG  Y   ++  AV++G ++E ++D  +  L      LG  D     ++  + 
Sbjct: 296 -AGTDVECGFNYAYKSLPEAVRKGLLSEKEVDKHVIRLLEGRFDLGEMDDPSLVEWSKIP 354

Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
            + +       +A + ARQ IVLL+N N  LPL   N + +A++GP+A+    M GNY G
Sbjct: 355 YSAMSTKASANVALDMARQTIVLLQNKNNILPLKK-NAEKIAIIGPNAHNEPMMWGNYNG 413

Query: 419 TPCRYTSPMDGFYAYSKVINYAPGC 443
           TP    + +DG  A  K + Y PGC
Sbjct: 414 TPNHTVTILDGVKAKQKKLVYIPGC 438



 Score =  109 bits (272), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 84/309 (27%), Positives = 132/309 (42%), Gaps = 56/309 (18%)

Query: 445 DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQT 494
           D+  + N      I   K  +  +   G+  S+E E          G DR  + LP  Q 
Sbjct: 583 DVARELNIDYQETIAQLKGINKVIFCGGIAPSLEGEEMPVNIEGFKGGDRTSIELPKVQR 642

Query: 495 ELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKY 554
           E +  +  A K    ++ ++     I          +I+   YPG+EGG A+ADV+FG Y
Sbjct: 643 EFLKALKAAGK---QVIYVNCSGSAIALQPETESCDAIVQAWYPGQEGGTAVADVLFGDY 699

Query: 555 NPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKV 614
           NPGG+L +T+Y+ +     Y    ++      GRTY++FD   ++PFGYGLSYT F+   
Sbjct: 700 NPGGKLSVTFYKNDQQLPDYEDYSMK------GRTYRYFDD-ALFPFGYGLSYTTFEVGE 752

Query: 615 ASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDG 674
           A    + D  L                                  +  QI V N G  +G
Sbjct: 753 AKVEAATDGAL----------------------------------YNVQIPVTNTGTKNG 778

Query: 675 SEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS 734
           SE + +Y +        +K + G+ER+ I AG++A     +   +SL+  D   N++   
Sbjct: 779 SETIQLYIRNLQDPDGPLKSLRGFERLDIKAGKTATANLKLTK-ESLEFWDAETNTMRTK 837

Query: 735 -GAHTILVG 742
            G + IL G
Sbjct: 838 PGKYEILYG 846


>gi|150009689|ref|YP_001304432.1| glycoside hydrolase family protein [Parabacteroides distasonis ATCC
           8503]
 gi|149938113|gb|ABR44810.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Parabacteroides distasonis ATCC 8503]
          Length = 732

 Score =  256 bits (654), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 220/776 (28%), Positives = 362/776 (46%), Gaps = 142/776 (18%)

Query: 18  KLPYPERAKDLVERMTLPEKVQQM-GDLAY---GVPRLGLPLYEW-WSEALHGV-SFIGR 71
           K+   +R + L+++MTL EKV  + G+  +   GV RLG+P  EW  S+  HGV + I R
Sbjct: 28  KVQMEKRIEKLIKKMTLEEKVGLLHGNSKFYVAGVERLGIP--EWSLSDGPHGVRAEINR 85

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
              +  G   DS    A+ FPT     A++N  L  + G+ +  EAR             
Sbjct: 86  HDWAYAGWTNDS----ASYFPTGTAFAAAWNPELAYRRGEVLGEEAR-----WRKKDVLL 136

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
            P +N++R P  GR  E   EDPY+    A+ Y++GLQ        RD       ++   
Sbjct: 137 GPGVNIIRSPLCGRNFEYMSEDPYMNSVLAVAYIKGLQS-------RD-------VACSV 182

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KH+A   ++N E N R   D   +E+ ++E ++  F+  V EG   +VM +YN+  G   
Sbjct: 183 KHFA---VNNQETN-RTTVDVECSERALREIYLPAFKAAVQEGGALTVMAAYNKFRGEFC 238

Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
             +  L+ + +R +W F G  V+D  +  + V S               ++AGLDL+ G 
Sbjct: 239 AENNYLVRKILRNEWGFDGVYVTDWGAAHSTVPS---------------MEAGLDLEMGT 283

Query: 312 --------YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
                   YY N  + AV+ GK+  + +D  +  +  V+++    D  P+ K  G  ++ 
Sbjct: 284 LIDKYEDWYYANPLIEAVKSGKVPMSLVDEKVGDVLRVMIKTNVLD--PK-KRFGPGSMN 340

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG-----PHANA--TKAMIGNY 416
             +H +   +AA + IVLLKN N  LPL+  +IK+LA++G      H+N   +  +   Y
Sbjct: 341 TKEHQQATYDAAAEAIVLLKNQNNLLPLDFSSIKSLAVIGDNATRKHSNGGLSSEIKAVY 400

Query: 417 EGTP-----CRYTSPMDGFYA--YSKVINYAPGC---------ADIVCQNNSMIPAAIDA 460
           E TP      ++   +D  +A  Y K+  +  G          +    ++++++  A++ 
Sbjct: 401 EVTPLGALRAKWGDKVDIRFAQGYEKLSTFVEGSNNGQSSGTFSSKTQESDALLKEAVEV 460

Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
           A+ +D  ++V GL+   + E  DR+++ +P  Q ELI +V  A   P T+V+M AG+  +
Sbjct: 461 ARTSDVALLVCGLNHDYDTESFDRLNMDIPYGQVELIQEVVKA--NPRTIVVMIAGS-PL 517

Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR 580
           N A  +    +I+W  + G EGG A+ DV+ GK NP G++P T        +     P  
Sbjct: 518 NMAAVDICSPAIVWAWFNGMEGGNALVDVLSGKVNPSGKMPFT------TPVSLDQSPAH 571

Query: 581 PVNNFPGRT------------YKFFDG---PVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
            + NFPGR             Y++FD    PVVYPFGYGLSYT F Y            L
Sbjct: 572 ALGNFPGRDLKVNYEEDILVGYRWFDTKGLPVVYPFGYGLSYTTFDYS----------NL 621

Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
           + D++  D   T+                    + TF +   N G  +G+EV  +Y   P
Sbjct: 622 NTDKETYDQADTI--------------------QATFTL--TNTGDREGAEVAQLYVSDP 659

Query: 686 GIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
             +    +K++ G+++VF+  G+S ++   +    SL     A +  +      IL
Sbjct: 660 VCSVMRPVKELKGFKKVFLKPGESRRITLDI-PVSSLAFYSEAQSQFVVEPGEFIL 714


>gi|347536214|ref|YP_004843639.1| glycoside hydrolase family protein [Flavobacterium branchiophilum
           FL-15]
 gi|345529372|emb|CCB69402.1| Glycoside hydrolase precursor, family 3 [Flavobacterium
           branchiophilum FL-15]
          Length = 740

 Score =  256 bits (654), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 200/673 (29%), Positives = 327/673 (48%), Gaps = 78/673 (11%)

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVL 147
           T+FP  +   AS++    +K  +  +TEA       ++G+ + ++P +++ RDPRWGRV+
Sbjct: 111 TTFPIPLAEAASWDVEAIEKSARVAATEA------ASSGIHWTFAPMVDISRDPRWGRVM 164

Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
           E  GED Y+  + A   V+G Q   G + H         + AC KH+AAY      G D 
Sbjct: 165 EGAGEDTYLGSKIAFARVKGFQANLG-DVH--------SVMACVKHFAAYGA-AVGGRDY 214

Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
              D  ++E+ + ET++ PF+  ++ G  ++ M ++N +NGIP  A+  +    ++G W 
Sbjct: 215 NSVD--ISERMLWETYLPPFKAALDAG-AATFMNAFNDINGIPATANKHIQRDILKGKWQ 271

Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC-GDYYTNFTMGAVQQGKI 326
           F G++VSD  SI  +V +H +  D K+ A  + L AG D+D     Y       V++ K+
Sbjct: 272 FQGFVVSDWGSIGEMV-AHGYAKDYKQ-AAEKALLAGSDMDMESSAYIGHLATLVKENKV 329

Query: 327 AEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN--ICNPQHIELAAEAARQGIVLLKN 384
             A ID ++R +    M LG F+   ++ N  + N  + NP+H ++A E A + IVLLKN
Sbjct: 330 PIALIDDAVRRILRKKMELGLFEDPFKFCNPERQNKALNNPEHTKIAREVAAKSIVLLKN 389

Query: 385 DNGALPLNTGNIKTLALVGPHANATKAMIG----NYEGTPCRY-TSPMDGF---YAYSKV 436
           D   LPL+  ++KT+A +GP   + +   G    + +     Y  S  +G       +  
Sbjct: 390 DKQVLPLSK-DLKTIAFIGPMVQSKRDNHGFWAVDLKDVDSTYIVSQWEGLQRKVGKNTK 448

Query: 437 INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTEL 496
           + YA GC D++  N S    AI  A  AD  V+  G   ++  E K R  L LPG Q +L
Sbjct: 449 LLYAKGC-DVLSTNKSGFEEAIAVAHQADVVVVSVGEKHNMSGEAKSRSSLQLPGVQEDL 507

Query: 497 INKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNP 556
           I ++    K P+ ++I +   +  N+  +N  + +IL+  + G E G AIADV+FG YNP
Sbjct: 508 IMELQKTGK-PIVVLINAGRPLIFNWTADN--MPTILYTWWLGSEAGNAIADVLFGDYNP 564

Query: 557 GGRLPITWYEAN-YVKIPYTSMPL-RPVNNFPGRTYKF----FDGPVVYPFGYGLSYTQF 610
             +LPIT+  +   V I Y      RP  +   + YK           +PFGYGLSYT F
Sbjct: 565 SAKLPITFPRSEGQVPIYYNHFSTGRPAKSDDDKIYKSAYIDLQNSPKFPFGYGLSYTTF 624

Query: 611 KYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMG 670
           +Y         D+KL   +        + TN                 +   Q  ++N G
Sbjct: 625 EYS--------DLKLSTQK--------ITTND----------------RIMVQATIKNTG 652

Query: 671 KMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAAN 729
           K  G+E+V +Y K   G     + ++  ++++ + AG S  + F ++  K L   +    
Sbjct: 653 KYAGTEIVQLYIKDQFGSVVRPVLELKDFQKITLEAGASKTISFVIDKEK-LSFYNADLQ 711

Query: 730 SLLASGAHTILVG 742
            +   G   I++G
Sbjct: 712 YVAEPGTFEIMIG 724


>gi|256838635|ref|ZP_05544145.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
           D13]
 gi|256739554|gb|EEU52878.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
           D13]
          Length = 732

 Score =  256 bits (653), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 220/776 (28%), Positives = 362/776 (46%), Gaps = 142/776 (18%)

Query: 18  KLPYPERAKDLVERMTLPEKVQQM-GDLAY---GVPRLGLPLYEW-WSEALHGV-SFIGR 71
           K+   +R + L+++MTL EKV  + G+  +   GV RLG+P  EW  S+  HGV + I R
Sbjct: 28  KVQMEKRIEKLIKKMTLEEKVGLLHGNSKFYVAGVERLGIP--EWSLSDGPHGVRAEINR 85

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
              +  G   DS    A+ FPT     A++N  L  + G+ +  EAR             
Sbjct: 86  HDWAYAGWTNDS----ASYFPTGTAFAAAWNPELAYRRGEVLGEEAR-----WRKKDVLL 136

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
            P +N++R P  GR  E   EDPY+    A+ Y++GLQ        RD       ++   
Sbjct: 137 GPGVNIIRSPLCGRNFEYMSEDPYMNSVLAVAYIKGLQS-------RD-------VACSV 182

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KH+A   ++N E N R   D   +E+ ++E ++  F+  V EG   +VM +YN+  G   
Sbjct: 183 KHFA---VNNQETN-RTTVDVECSERALREIYLPAFKAAVQEGGALTVMAAYNKFRGEFC 238

Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
             +  L+ + +R +W F G  V+D  +  + V S               ++AGLDL+ G 
Sbjct: 239 AENNYLVRKILRNEWGFDGVYVTDWGAAHSTVPS---------------MEAGLDLEMGT 283

Query: 312 --------YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
                   YY N  + AV+ GK+  + +D  +  +  V+++    D  P+ K  G  ++ 
Sbjct: 284 LIDKYEDWYYANPLIEAVKSGKVPMSLVDEKVGDVLRVMIKTNVLD--PK-KRFGPGSMN 340

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG-----PHANA--TKAMIGNY 416
             +H +   +AA + IVLLKN N  LPL+  +IK+LA++G      H+N   +  +   Y
Sbjct: 341 TKEHQQATYDAAAEAIVLLKNQNNLLPLDFSSIKSLAVIGDNATRKHSNGGLSSEIKAVY 400

Query: 417 EGTP-----CRYTSPMDGFYA--YSKVINYAPGC---------ADIVCQNNSMIPAAIDA 460
           E TP      ++   +D  +A  Y K+  +  G          +    ++++++  A++ 
Sbjct: 401 EVTPLGALRAKWGDKVDIRFAQGYEKLSTFVEGSNNGQSSGTFSSKTQESDALLKEAVEV 460

Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
           A+ +D  ++V GL+   + E  DR+++ +P  Q ELI +V  A   P T+V+M AG+  +
Sbjct: 461 ARTSDVALLVCGLNHDYDTESFDRLNMDIPYGQVELIQEVVKA--NPRTIVVMIAGS-PL 517

Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR 580
           N A  +    +I+W  + G EGG A+ DV+ GK NP G++P T        +     P  
Sbjct: 518 NMAAVDICSPAIVWAWFNGMEGGNALVDVLSGKVNPSGKMPFT------TPVSLDQSPAH 571

Query: 581 PVNNFPGRT------------YKFFDG---PVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
            + NFPGR             Y++FD    PVVYPFGYGLSYT F Y            L
Sbjct: 572 ALGNFPGRDLKVNYEEDILVGYRWFDTKGLPVVYPFGYGLSYTTFNYS----------NL 621

Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
           + D++  D   T+                    + TF +   N G  +G+EV  +Y   P
Sbjct: 622 NTDKKTYDQADTI--------------------QATFTL--TNTGDREGAEVAQLYVSDP 659

Query: 686 GIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
             +    +K++ G+++VF+  G+S ++   +    SL     A +  +      IL
Sbjct: 660 VCSVMRPVKELKGFKKVFLKPGESRRITLDI-PVSSLAFYSEAQSQFVVEPGEFIL 714


>gi|298374050|ref|ZP_06984008.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_19]
 gi|298268418|gb|EFI10073.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_19]
          Length = 758

 Score =  256 bits (653), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 185/603 (30%), Positives = 299/603 (49%), Gaps = 63/603 (10%)

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           ++P +++ RD RWGRV+E  GEDPY+    A   V G Q   G ++   +D     + AC
Sbjct: 162 FAPMVDISRDARWGRVMEGAGEDPYLGSLIAKARVEGFQG--GNDWRSLADVN--TVLAC 217

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH+AAY      G D   +++    Q+    + +P  +   E  V++ M S+N +NG+P
Sbjct: 218 CKHFAAYGAAE-AGRD---YNTSELSQNTLMNYYMPPYLAAKEAGVATFMASFNEINGVP 273

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-C 309
           +  +  L+   +R DW F+G++V+D   I  +V +H  + + KE A      AG+D+D  
Sbjct: 274 STGNKWLMTDLLRKDWGFNGFVVTDYTGINEMV-AHSIVRNDKE-AGELAANAGIDMDMT 331

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQY--KNLGKNNICNPQH 367
           G  Y+ + + +V++GK++E +ID ++  +  +   LG FD   +Y      KN I  P+ 
Sbjct: 332 GGIYSQYLVQSVKEGKVSEENIDRAVASILEMKFLLGLFDDPYRYLDNEREKNTIMKPEF 391

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGP------HANATKAMIGNYEGTPC 421
           ++ A E + + IVLLKNDN   P++     T+AL+GP      + N   A  G  E +  
Sbjct: 392 LQEARETSARSIVLLKNDNNFFPISKDKNITVALIGPMVKDKINQNGEWAGRGEREESIS 451

Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
            +    + +   +    YA GC D++  ++S    AI  A+ AD  +   G D +   E 
Sbjct: 452 LFEGLTEKYAGTNVKFIYAEGC-DLLTDDSSKFAEAIATARRADIVLAAMGEDFNWSGEA 510

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
             R DL LPG Q  L+ ++    K P+ L++++   +D+++   +  +  IL   Y G  
Sbjct: 511 ACRTDLKLPGAQQALLKELKKTGK-PLGLILVNGRPLDLSW--EDQHVDGILEAWYLGTM 567

Query: 542 GGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPL-RPV-NNFPGRTYK--FFDGP 596
            G  +ADVI G YNP  RL +++      + + Y   P  RPV    P   YK  + D P
Sbjct: 568 AGHGMADVISGDYNPSARLTMSFPRTVGQLPLYYNQKPTGRPVPPEAPDTDYKSRYMDVP 627

Query: 597 --VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
              +YPFGYGLSYT F            +KLD++      ++T G               
Sbjct: 628 NTPLYPFGYGLSYTTFAVN--------SMKLDQN------SFTKGG-------------- 659

Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
               K T   EVEN GK+DG  VV +Y +   G     +K++ G+E+V + AG+  +V F
Sbjct: 660 ----KITVMAEVENTGKVDGETVVQMYIRDLAGSVTRPVKELKGFEKVTLKAGEKKQVSF 715

Query: 714 TMN 716
           T++
Sbjct: 716 TID 718


>gi|427386425|ref|ZP_18882622.1| hypothetical protein HMPREF9447_03655 [Bacteroides oleiciplenus YIT
           12058]
 gi|425726465|gb|EKU89330.1| hypothetical protein HMPREF9447_03655 [Bacteroides oleiciplenus YIT
           12058]
          Length = 864

 Score =  256 bits (653), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 151/428 (35%), Positives = 222/428 (51%), Gaps = 38/428 (8%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
           + + +  LP  ER  DLV  +TL EK+ QM + A  + RLG+P Y WW+E LHGV+    
Sbjct: 24  YKFQNPDLPVEERVNDLVGHLTLEEKISQMMNNAPAIERLGIPAYNWWNECLHGVA---- 79

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
           R+  P            TSFP  I   A+++     ++ +  S E RA+Y+         
Sbjct: 80  RSPYP-----------VTSFPQAIAMAATWDTKSVYQMAEYASDEGRAIYHDAARKGTPG 128

Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
              GLT+WSPNIN+ RDPRWGR  ET GEDPY+     + +V+GLQ           D  
Sbjct: 129 IFRGLTYWSPNINIFRDPRWGRGQETYGEDPYLTAAIGVAFVKGLQ---------GDDPV 179

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LK SAC KHYA +    W   +R  +++ V+  D+ +T++  F   V +  V+ VMC+Y
Sbjct: 180 YLKSSACAKHYAVHSGPEW---NRHTYNAEVSNHDLWDTYLPAFRELVVDAKVTGVMCAY 236

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N     P C +  L+   +R  W F GY+ SDC +I+    +H    D  E +   VL  
Sbjct: 237 NSFFEQPCCGNDLLMMDILRNQWKFDGYVTSDCGAIEDFYNTHNTHEDAAEASADAVLH- 295

Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNN 361
           G D +CG+        A+ +G I E  +D SL+ L+ +  RLG FD   +  Y ++  + 
Sbjct: 296 GTDCECGNGAYRALADAIVRGLITEEQVDVSLKKLFEIRFRLGMFDPDDRVPYSDIPISV 355

Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
           +    H   A + ARQ IVLLKN+   LPL+   IK +A+VGP+A+    ++ NY G P 
Sbjct: 356 LECDAHKAHALKMARQSIVLLKNEKQLLPLDMNKIKKIAVVGPNADDKSVLLANYYGYPS 415

Query: 422 RYTSPMDG 429
             T+ ++G
Sbjct: 416 CVTTVLEG 423



 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 94/297 (31%), Positives = 138/297 (46%), Gaps = 58/297 (19%)

Query: 460 AAKNADATVIVAGLDLSVEAEGK----------DRVDLLLPGFQTELINKVADAAKGPVT 509
           + K+AD  V V GL   VE E            DR  + +P  Q  L+ ++    K PV 
Sbjct: 595 SVKDADVVVFVGGLSAKVEGEEMKVEIDGFKRGDRTSISIPVVQQNLLKELYATGK-PVI 653

Query: 510 LVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANY 569
            ++M+  AV + +   +  + +IL   Y G+ GG+AIADV+FG YNP GRLP+T+Y+   
Sbjct: 654 FILMTGSAVGLEWESEH--LPAILNAWYGGQAGGQAIADVLFGDYNPSGRLPLTFYKN-- 709

Query: 570 VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQ 629
                  +P     +   RTY++F G  VYPFGYGLSYT F+Y       S+D       
Sbjct: 710 ----VNDLPDFEDYSMKNRTYRYFTGIPVYPFGYGLSYTDFQYNTIKVQPSLD------- 758

Query: 630 QCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI--EVENMGKMDGSEVVMVYSKPPGI 687
                                        K + ++  EV N+GK +G EVV +Y   P  
Sbjct: 759 -----------------------------KLSVKVTAEVSNVGKYEGEEVVQLYVSNPRD 789

Query: 688 AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
             T I+ + G+ R+ +  G+S  V F + + K L +VD A N +   G   I +G G
Sbjct: 790 FVTPIRALKGFRRINLKPGESQMVEFVLTS-KELSVVDVAGNFVPMKGEVQISLGGG 845


>gi|329922637|ref|ZP_08278189.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
 gi|328941979|gb|EGG38262.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
          Length = 765

 Score =  256 bits (653), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 196/694 (28%), Positives = 322/694 (46%), Gaps = 98/694 (14%)

Query: 87  GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
           G T FP  +   +++N  L++ + + V+ E R+       G   +SP ++VVRDPRWGR 
Sbjct: 122 GGTVFPVPLSIGSTWNLDLYRDMCRAVALETRS-----QGGAVTYSPVLDVVRDPRWGRT 176

Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN- 205
            E  GEDPY++  YA+  V GLQ         +S   P  ++A  KH+  Y       N 
Sbjct: 177 EECFGEDPYLISEYAVASVEGLQG--------ESLDSPSSVAATLKHFVGYGSSEGGRNA 228

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
              H  +R    ++ E  +LPF+  V  G  +S+M +YN ++G+P   + +LL+  +R +
Sbjct: 229 GPVHMGTR----ELMEVDMLPFKKAVEAG-AASIMPAYNEIDGVPCTVNTELLDGILRKE 283

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQG 324
           W F G +++DC +I  +   H    D   DA  + ++AG+DL+  G+ +      AV+  
Sbjct: 284 WGFDGMVITDCGAIDMLASGHDTAEDGM-DAAVQAIRAGIDLEMSGEMFGKHLQKAVESN 342

Query: 325 KIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKN 384
           K+  + +D ++R +  +  +LG F+         +N I + QHI LA + A +GIVLLKN
Sbjct: 343 KLEVSVLDEAVRRVLTLKFKLGLFENPYVDPQTAENVIGSGQHIGLARQLAAEGIVLLKN 402

Query: 385 DNGALPLNTGNIKTLALVGPHANATKAMIGNYEG--TPCRYTSPMDGFYAY----SKVIN 438
           +  ALPL+      +A++GP+A+     +G+Y     P   T+ + G  A     ++ + 
Sbjct: 403 EAKALPLSKEG-GVIAVIGPNADQGYNQLGDYTSPQPPAAVTTVLGGIRAKLGEEAQRVL 461

Query: 439 YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG-----------LDLSVEA-------- 479
           YAPGC  I   +      A+  A+ AD  V+V G           +DL   A        
Sbjct: 462 YAPGCR-IKDDSREGFEFALSCAEQADTVVMVLGGSSARDFGEGTIDLRTGASKVTDDAL 520

Query: 480 ------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
                 EG DR+ L L G Q +L  ++    K    ++++      I     +    +IL
Sbjct: 521 SDMDCGEGIDRMTLQLSGVQLDLAQEIHKLGK---RMIVVYINGRPIAEPWIDEHADAIL 577

Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKF 592
              YPG+EGG AIAD++FG  NP G+L ++       + + Y     R      G+ Y  
Sbjct: 578 EAWYPGQEGGHAIADILFGDVNPSGKLTMSIPKHVGQLPVYYNGKRSR------GKRYLE 631

Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
            D    YPFGYGLSYT+F Y         DI++  +         +GT            
Sbjct: 632 EDSQPRYPFGYGLSYTEFSYS--------DIQMTPE--------VIGT------------ 663

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKV 711
               D      + V N G  +GSEVV +Y        T   +++ G++++ +  G+  KV
Sbjct: 664 ----DGTAVVSVNVTNSGDCEGSEVVQLYVSDAASKYTRPARELKGFQKISLQPGERRKV 719

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
            FT+   + L+ +      ++  G   +++G  V
Sbjct: 720 EFTIGP-EQLQYIGQDYRQVVEPGLFRVMLGRHV 752


>gi|423222970|ref|ZP_17209439.1| hypothetical protein HMPREF1062_01625 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392640546|gb|EIY34345.1| hypothetical protein HMPREF1062_01625 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 862

 Score =  256 bits (653), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 166/463 (35%), Positives = 243/463 (52%), Gaps = 47/463 (10%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY + +L   ERAKDLV+R+TL EK   M D +  +PRLG+  + WWSEALHGV+  G 
Sbjct: 21  LPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQG- 79

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------- 122
                            T FP  +   ASFN+ L  +I   VS E RA +N         
Sbjct: 80  ---------------NVTVFPEPVGMAASFNDKLVFEIFNAVSDEMRAKHNERVRNGLED 124

Query: 123 LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
           +    L+ W+PN+N+ RDPRWGR  ET GEDPY+  +  I  V+GLQ  E  +Y      
Sbjct: 125 VRFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPENEKYR----- 179

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
              K+ AC KHYA +    W  +      + V+ +D+ ET++  F+  V + DV  VMC+
Sbjct: 180 ---KLLACAKHYAVHSGPEWSRHTANL--NNVSPRDLWETYLPAFKALVQKADVREVMCA 234

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           Y R++  P C + +LL Q +R +W F   +VSDC +I     SHK  +D    AV   + 
Sbjct: 235 YQRLDDDPCCGNTRLLQQILRDEWGFKYLVVSDCGAIADFWTSHKSSSDAVHAAVKGTM- 293

Query: 303 AGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK-- 359
           AG D++CG  Y    +  AV +G I E ++D  +  L      LG  D  P   N  K  
Sbjct: 294 AGTDVECGYGYAYQKLPEAVSRGLITEEEVDKHVLRLMEGRFELGEMD-DPSLVNWTKIP 352

Query: 360 NNICNPQ-HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
            ++ N + H +L+   +RQ + LL+N N  LPL + +I+ +A++GP+A+    + GNY G
Sbjct: 353 MSVVNCKAHKDLSLNMSRQTMTLLQNKNNVLPL-SKSIRKIAVIGPNADDKPMLWGNYNG 411

Query: 419 TPCRYTSPMDGFYAYSK--VINYAPGCADIVCQNNSMIPAAID 459
           TP +  + +DGF +  K   I Y  GC D+V  N+  + + +D
Sbjct: 412 TPNQTITILDGFKSKLKKNQIVYMKGC-DLV--NDQTLESYLD 451



 Score =  110 bits (276), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 79/264 (29%), Positives = 119/264 (45%), Gaps = 44/264 (16%)

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
           +G DR D+ LP  Q   +  + DA K    +V ++     +          +IL   Y G
Sbjct: 626 KGGDRTDIELPAVQRNFLKALKDAGK---QVVFVNCSGSSMALLPETESCDAILQAWYGG 682

Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
           E GG A+ADV+FG YNP G+LP+T+Y++      Y    ++      GRTY++   P ++
Sbjct: 683 ELGGYAVADVLFGDYNPSGKLPVTFYKSTKQLPDYEDYSMK------GRTYRYMSDP-LF 735

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           PFG+GLSYT F    AS  K+         Q R                        D  
Sbjct: 736 PFGFGLSYTDFAVGTASCNKT---------QLR-----------------------TDES 763

Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
            T  + V N GK  G+EVV VY +    A   +K +  Y RV +AAG    V   + + +
Sbjct: 764 LTLTVPVSNTGKRSGTEVVQVYIRKTDDADGPLKSLKAYARVELAAGAKQDVKIELPS-E 822

Query: 720 SLKIVDNAANSL-LASGAHTILVG 742
           S +  D + N++ +A G + +  G
Sbjct: 823 SFECFDPSTNTMRVAPGKYELFYG 846


>gi|224535195|ref|ZP_03675734.1| hypothetical protein BACCELL_00056 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224523186|gb|EEF92291.1| hypothetical protein BACCELL_00056 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 733

 Score =  255 bits (652), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 214/760 (28%), Positives = 352/760 (46%), Gaps = 92/760 (12%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLYEW 58
           Y DA  P   R KDL+ RMTL EKV Q+    +G              +P  +G  +Y  
Sbjct: 25  YKDAGQPVETRVKDLLNRMTLHEKVLQLNQYTFGENDNPNNIGTEVKNLPAEIGSLIYLH 84

Query: 59  WSEALHG----VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
               L       +    R   P    FD      T +P  +    SFN  L   + Q   
Sbjct: 85  TDPKLRNRIQRKAMEESRLGIPILFGFDVIHGLRTVYPISLAQACSFNPDL---VTQACG 141

Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
             A+    L     TF SP I+V RDPRWGR+ E  GEDPY      +N V G+  V+G 
Sbjct: 142 MAAKESV-LSGIDWTF-SPMIDVARDPRWGRISECYGEDPY------LNTVFGVASVKGY 193

Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
           +  + SD  P  I+AC KHY  Y +   EG   + + + ++ Q + ET++ P+E CV  G
Sbjct: 194 QGEKLSD--PYSIAACLKHYVGYGVS--EGGRDYRY-TDISPQALWETYLPPYEACVKAG 248

Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
             +++M S+N ++G+P  ++  +L + ++  W   G++VSD ++I+ ++  ++ +   ++
Sbjct: 249 -AATLMSSFNDISGVPATSNHYILTEILKNKWRHDGFVVSDWNAIEQLI--YQGVAKNRK 305

Query: 295 DAVARVLKAGLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
           +A  +   AG+++D  D  Y  +    V + KI  + ID ++  +  V  RLG FD  P 
Sbjct: 306 EAAYKAFHAGVEMDMRDNVYYEYLEQLVAEKKIEISQIDDAVARILRVKFRLGLFD-EPY 364

Query: 354 YKNLG-KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAM 412
            K L  +      + I LAA  A + +VLLKN+   LPL++  +K +AL+GP       +
Sbjct: 365 TKELTEQERYLQKEDIALAARLAEESMVLLKNEKNLLPLSS-TVKRVALIGPMVKDRSDL 423

Query: 413 IGNY------EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADA 466
           +G +      E     Y   M   +     ++Y  GCA +   + S   AA+  A+ +D 
Sbjct: 424 LGAWAFKGQAEDVETIYEG-MQKEFGDKVRLDYEQGCA-LDGNDESGFSAALKTAEASDV 481

Query: 467 TVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
            V+  G       E   R  + LP  Q +L+  +  A K P+ LV+ S   +++   +  
Sbjct: 482 VVVCLGESKQWSGENASRSTIALPDIQEKLLLHLKQANK-PIVLVLSSGRPLEL--IRLE 538

Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP-YTSM--PLRPVN 583
           P++++I+ +  PG  GG  +A ++ G+ NP G+L +T +  +  +IP Y +M    RP +
Sbjct: 539 PQVEAIIEMWQPGVAGGTPLAGILSGRVNPSGKLSVT-FPLSTGQIPVYYNMRQSARPFD 597

Query: 584 NFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKP 643
                 Y+      +YPFGYGLSYT F Y   S  K   +K+ K+Q              
Sbjct: 598 AMG--DYQDIPTEPLYPFGYGLSYTTFTY---SDAKLSSLKIKKNQ-------------- 638

Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVF 702
                          K T ++ V N GK++G E V+ Y   P  + +  +K++  +E+  
Sbjct: 639 ---------------KITAEVTVTNAGKVEGKETVLWYVSDPFCSISRPMKELKFFEKQS 683

Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
           +  G+S    F ++  + L   D      L +G   + VG
Sbjct: 684 LKVGESRVFRFEIDPMRDLSYTDATGKRFLEAGEFIVSVG 723


>gi|423226659|ref|ZP_17213124.1| hypothetical protein HMPREF1062_05310 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392628186|gb|EIY22220.1| hypothetical protein HMPREF1062_05310 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 750

 Score =  255 bits (652), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 205/742 (27%), Positives = 346/742 (46%), Gaps = 109/742 (14%)

Query: 29  VERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGA 88
           V  +T P    ++  +A    RLG+PL     + +HG   I                   
Sbjct: 75  VMSITDPNIFNEVQRIAVEDSRLGIPLINA-RDVIHGFKTI------------------- 114

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVL 147
             FP  +   ASFN  + +   +  +TEA A      AG+ + ++P I++  DPRWGR+ 
Sbjct: 115 --FPIPLGQAASFNPEIAETGARIAATEASA------AGIRWTFAPMIDITHDPRWGRIA 166

Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
           E  GEDP +V +  +  ++G Q          S + P  I+AC KH+A Y     EG  R
Sbjct: 167 EGFGEDPLLVSQMGVAAIKGFQG--------SSLNHPTSIAACAKHFAGYGAS--EGG-R 215

Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
            +  + +TE+  +  ++ PFE  VN G  +++M ++N  +GIP+ A+P LL   +R +WN
Sbjct: 216 DYNSTYITERQFRNLYLRPFEAAVNAG-AATLMTAFNDNDGIPSSANPFLLKDVLRNEWN 274

Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQGKI 326
           + G +VSD  S+  ++  H F  D KE A+ +   AG D++   + Y       +++GK+
Sbjct: 275 YRGTVVSDWASVSEMIR-HGFCEDEKEAAL-KATNAGTDIEMVSETYIKHLPQLIKEGKV 332

Query: 327 AEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDN 386
           +   ID ++R +  +  RLG F+  P   +  K     P  +E A  AA Q  VLLKN+ 
Sbjct: 333 SMETIDNAVRNILRLKFRLGLFE-HPYIADQRKETFYRPDFLEAAQTAAEQSAVLLKNER 391

Query: 387 GALPLNTGNIKTLALVGPHANATKAMIGN--YEGTPCRYTSPMDGFYAYS----KVINYA 440
           G LP+ + NIKT+ + GP A+A    +G   ++G      +P+      S    KV+ YA
Sbjct: 392 GTLPIQS-NIKTILVTGPLADAPHEQLGTWVFDGDASYSQTPLQALRRISGDSIKVL-YA 449

Query: 441 PGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKV 500
           PG         S     ++ A+ AD  +   G +  +  E     +L L G Q+ L++++
Sbjct: 450 PGLNYSRDTATSQFNKVVELAREADLILAFVGEEAILSGEAHCLANLNLQGAQSRLLHRL 509

Query: 501 ADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRL 560
           ++  K P+  V+M+   + I    N     ++L+  +PG  GG A+A+++FGK  P G+L
Sbjct: 510 SETGK-PLVTVVMAGRPLTIGREVNIS--DALLYAFHPGTMGGPALANLLFGKVVPSGKL 566

Query: 561 PITW-YEANYVKIPYT----------------SMPLRPVNNFPGRTYKFFDGPV--VYPF 601
           P+T+  E   + I Y                 ++P+       G T  + D     ++PF
Sbjct: 567 PVTFPKETGQIPIYYNHTSTGRPASGSEKNIFTIPVGAEQTSLGNTSFYLDAGKDPLFPF 626

Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
           GYGLSYT F Y         +++L   Q  R+              V+I          T
Sbjct: 627 GYGLSYTTFAYS--------NLQLSSTQYTRN-------------EVII---------IT 656

Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKS 720
           F  ++ N GK DG+E+  +Y +    + T  +K++  +ER+ + AG++  +   +   K 
Sbjct: 657 F--DLTNTGKTDGTEIAQLYFRDLAASVTRPVKELAAFERIHLKAGETRHIRMEL-PVKQ 713

Query: 721 LKIVDNAANSLLASGAHTILVG 742
           L   + A +  +  G   + +G
Sbjct: 714 LSFWNYAMDYCVEPGKFDLWIG 735


>gi|256838673|ref|ZP_05544183.1| glycoside hydrolase, family 3 [Parabacteroides sp. D13]
 gi|256739592|gb|EEU52916.1| glycoside hydrolase, family 3 [Parabacteroides sp. D13]
          Length = 758

 Score =  255 bits (652), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 185/603 (30%), Positives = 298/603 (49%), Gaps = 63/603 (10%)

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           ++P +++ RD RWGRV+E  GEDPY+    A   V G Q   G ++   +D     + AC
Sbjct: 162 FAPMVDISRDARWGRVMEGAGEDPYLGSLIAKARVEGFQG--GNDWRSLADVN--TVLAC 217

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH+AAY      G D   +++    Q+    + +P  +   E  V++ M S+N +NG+P
Sbjct: 218 CKHFAAYGAAE-AGRD---YNTSELSQNTLMNYYMPPYLAAKEAGVATFMASFNEINGVP 273

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-C 309
           +  +  L+   +R DW F G++V+D   I  +V +H  + + KE A      AG+D+D  
Sbjct: 274 STGNKWLMTDLLRKDWGFKGFVVTDYTGINEMV-AHSIVRNDKE-AGELAANAGIDMDMT 331

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQY--KNLGKNNICNPQH 367
           G  Y+ + + +V++GK++E +ID ++  +  +   LG FD   +Y      KN I  P+ 
Sbjct: 332 GGIYSQYLVQSVKEGKVSEENIDRAVASILEMKFLLGLFDDPYRYLDNEREKNTIMKPEF 391

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGP------HANATKAMIGNYEGTPC 421
           ++ A E + + IVLLKNDN   P++     T+AL+GP      + N   A  G  E +  
Sbjct: 392 LQEARETSARSIVLLKNDNNFFPISKDKHITVALIGPMVKDKINQNGEWAGRGEREESIS 451

Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
            +    + +   +    YA GC D++  ++S    AI  A+ AD  +   G D +   E 
Sbjct: 452 LFEGLTEKYAGTNVKFIYAEGC-DLLTDDSSKFAEAIATARRADIVLAAMGEDFNWSGEA 510

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
             R DL LPG Q  L+ ++    K P+ L++++   +D+++   +  +  IL   Y G  
Sbjct: 511 ACRTDLKLPGAQQALLKELKKTGK-PLGLILVNGRPLDLSW--EDQHVDGILEAWYLGTM 567

Query: 542 GGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPL-RPV-NNFPGRTYK--FFDGP 596
            G  +ADVI G YNP  RL +++      + + Y   P  RPV    P   YK  + D P
Sbjct: 568 AGHGMADVISGDYNPSARLTMSFPRTVGQLPLYYNQKPTGRPVPPEAPDTDYKSRYMDVP 627

Query: 597 --VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
              +YPFGYGLSYT F            +KLD++      ++T G               
Sbjct: 628 NTPLYPFGYGLSYTTFAVN--------SMKLDQN------SFTKGG-------------- 659

Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
               K T   EVEN GK+DG  VV +Y +   G     +K++ G+E+V + AG+  +V F
Sbjct: 660 ----KITVTAEVENTGKVDGETVVQMYIRDLAGSVTRPVKELKGFEKVALKAGEKKQVSF 715

Query: 714 TMN 716
           T++
Sbjct: 716 TID 718


>gi|423333878|ref|ZP_17311659.1| hypothetical protein HMPREF1075_03310 [Parabacteroides distasonis
           CL03T12C09]
 gi|409226713|gb|EKN19619.1| hypothetical protein HMPREF1075_03310 [Parabacteroides distasonis
           CL03T12C09]
          Length = 732

 Score =  255 bits (652), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 220/776 (28%), Positives = 362/776 (46%), Gaps = 142/776 (18%)

Query: 18  KLPYPERAKDLVERMTLPEKVQQM-GDLAY---GVPRLGLPLYEW-WSEALHGV-SFIGR 71
           K+   +R + L+++MTL EKV  + G+  +   GV RLG+P  EW  S+  HGV + I R
Sbjct: 28  KVQMEKRIEKLIKKMTLEEKVGLLHGNSKFYVAGVERLGIP--EWSLSDGPHGVRAEINR 85

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
              +  G   DS    A+ FPT     A++N  L  + G+ +  EAR             
Sbjct: 86  HDWAYAGWTNDS----ASYFPTGTAFAAAWNPELAYRRGEVLGEEAR-----WRKKDVLL 136

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
            P +N++R P  GR  E   EDPY+    A+ Y++GLQ        RD       ++   
Sbjct: 137 GPGVNIIRSPLCGRNFEYMSEDPYMNSVLAVAYIKGLQS-------RD-------VACSV 182

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KH+A   ++N E N R   D   +E+ ++E ++  F+  V EG   +VM +YN+  G   
Sbjct: 183 KHFA---VNNQETN-RTTVDVECSERALREIYLPAFKAAVQEGGALTVMAAYNKFRGEFC 238

Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
             +  L+ + +R +W F G  V+D  +  + + S               ++AGLDL+ G 
Sbjct: 239 AENNYLVCKILRNEWGFDGVYVTDWGAAHSTIPS---------------MEAGLDLEMGT 283

Query: 312 --------YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
                   YY N  + AV+ GKI  + +D  +  +  V+++    D  P+ K  G  ++ 
Sbjct: 284 LIDKYEDWYYANPLIEAVKSGKIPMSLVDEKVGDVLRVMIKTNVLD--PK-KRFGPGSMN 340

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG-----PHANA--TKAMIGNY 416
             +H +   +AA + IVLLKN N  LPL+  +IK+LA++G      H+N   +  +   Y
Sbjct: 341 TKEHQQATYDAAAEAIVLLKNQNNLLPLDFSSIKSLAVIGDNATRKHSNGGLSSEIKAVY 400

Query: 417 EGTP-----CRYTSPMDGFYA--YSKVINYAPGC---------ADIVCQNNSMIPAAIDA 460
           E TP      ++   +D  +A  Y K+  +  G          +    ++++++  A++ 
Sbjct: 401 EVTPLEALRAKWGDKVDIRFAQGYEKLSTFVEGSNNGQSSGTFSSKTQESDALLKEAVEV 460

Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
           A+ +D  ++V GL+   + E  DR+++ +P  Q ELI +V  A   P T+V+M AG+  +
Sbjct: 461 ARTSDVALLVCGLNHDYDTESFDRLNMDIPYGQVELIQEVVKA--NPRTIVVMIAGS-PL 517

Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR 580
           N A  +    +I+W  + G EGG A+ DV+ GK NP G++P T        +     P  
Sbjct: 518 NMAAVDICSPAIVWAWFNGMEGGNALVDVLSGKVNPSGKMPFT------TPVSLDQSPAH 571

Query: 581 PVNNFPGRT------------YKFFDG---PVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
            + NFPGR             Y++FD    PVVYPFGYGLSYT F Y            L
Sbjct: 572 ALGNFPGRDLKVNYEEDILVGYRWFDTKGLPVVYPFGYGLSYTTFNYS----------NL 621

Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
           + D++  D   T+                    + TF +   N G  +G+EV  +Y   P
Sbjct: 622 NTDKKTYDQADTI--------------------QATFTL--TNTGDREGAEVAQLYVSDP 659

Query: 686 GIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
             +    +K++ G+++VF+  G+S ++   +    SL     A +  +      IL
Sbjct: 660 VCSVMRPVKELKGFKKVFLKPGESRRITLDI-PVSSLAFYSEAQSQFVVEPGEFIL 714


>gi|224536538|ref|ZP_03677077.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521794|gb|EEF90899.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 863

 Score =  255 bits (651), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 158/430 (36%), Positives = 224/430 (52%), Gaps = 45/430 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y DA L    RA+ LV+ +TL EK   M D +  V RLG+  Y WW+EALHGV+  G   
Sbjct: 23  YKDASLSPERRAELLVKELTLEEKAHLMMDGSRSVERLGIKPYNWWNEALHGVARAGL-- 80

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   ASFN  +  ++   VS EARA      +       
Sbjct: 81  --------------ATVFPQPIGMAASFNPEMVYEVFNAVSDEARAKNTYYASQDSRERY 126

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT W+P +N+ RDPRWGR +ET GEDPY+  R  +  V+GLQ          +D +  
Sbjct: 127 QGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQG--------PADGKYD 178

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
           K+ AC KH+A +    W   +R  F++  +  +D+ ET++ PFE  V EG V  VMC+YN
Sbjct: 179 KLHACAKHFAVHSGPEW---NRHSFNAENIKPRDLYETYLPPFEALVKEGKVEEVMCAYN 235

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
           R  G P C   +LL Q +RG+W F G +VSDC +I        H    D +  + A V+ 
Sbjct: 236 RFEGDPCCGSDRLLMQILRGEWGFDGIVVSDCGAIADFYNDRGHHTHPDAESASAAAVI- 294

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK--- 359
           +G DL+CG  Y    + +V++G I+E  +DTS++ L      LG  D  P+  +  K   
Sbjct: 295 SGTDLECGSSYKAL-IESVKKGLISEETVDTSVKRLMKARFALGEMD-EPEKVSWTKIPF 352

Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
           + + +  H  LA   AR+ + LL N +  LPL  G + T+A++GP+AN +    GNY G 
Sbjct: 353 SVVASAAHDSLALNMARESMTLLMNKDNFLPLKRGGL-TVAVMGPNANDSVMQWGNYNGM 411

Query: 420 PCRYTSPMDG 429
           P    + +DG
Sbjct: 412 PAHTVTILDG 421



 Score =  133 bits (334), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 95/309 (30%), Positives = 152/309 (49%), Gaps = 53/309 (17%)

Query: 445 DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQT 494
           D+  + +  I  +++  K+AD  +  +G+  S+E E            DR D+ LP  Q 
Sbjct: 581 DLGFKKDVDIRKSVERVKDADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQR 640

Query: 495 ELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKY 554
           ELI+ +  A K    +++++     I       K ++IL   YPG++GG+A+A+V+FG Y
Sbjct: 641 ELIDALHRAGK---KIILVNCSGSPIGLEPETQKCEAILQAWYPGQQGGKAVAEVLFGDY 697

Query: 555 NPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKV 614
           NP G+LP+T+Y         + +P     N  GRTY++     ++PFGYGLSYT F Y  
Sbjct: 698 NPAGKLPVTFYRN------VSQLPDFEDYNMTGRTYRYMQDVPLFPFGYGLSYTTFGYG- 750

Query: 615 ASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDG 674
               K+V   LDK++       T G +                      + V N GK +G
Sbjct: 751 ----KTV---LDKNE------LTAGQS------------------LKLTVPVTNTGKRNG 779

Query: 675 SEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LA 733
            EVV VY +  G A   IK +  ++RV I AG++  V F +   K L+  D+ +N++ + 
Sbjct: 780 EEVVQVYLRKQGDAEGPIKTLRAFKRVSIPAGKTVNVEFDLKD-KELEWWDDQSNTVRVC 838

Query: 734 SGAHTILVG 742
            G + I+VG
Sbjct: 839 PGNYDIMVG 847


>gi|410096731|ref|ZP_11291716.1| hypothetical protein HMPREF1076_00894 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409225348|gb|EKN18267.1| hypothetical protein HMPREF1076_00894 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 746

 Score =  255 bits (651), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 202/712 (28%), Positives = 330/712 (46%), Gaps = 109/712 (15%)

Query: 32  MTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSF 91
           +T PE V +   +A    RLG+PL     + +HG   I                     F
Sbjct: 76  LTDPELVNKAQRIAVEESRLGIPLL-MSRDVIHGYKTI---------------------F 113

Query: 92  PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETP 150
           P  +   A+FN  L +   +  + EA A       G+ + ++P I++ RDPRWGR+ E+ 
Sbjct: 114 PIPLGQAATFNPQLVEDGARVAAVEASA------DGIRWTFAPMIDISRDPRWGRIAESC 167

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+     +  V+G Q         DS + P  ++AC KH+  Y     EG  R + 
Sbjct: 168 GEDPYLSSVMGVAMVKGFQG--------DSLNNPTAVAACAKHFVGYGAS--EGG-RDYN 216

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
            + + E+ ++  +  PFE     G  ++ M S+N  +GIP+  +  +L   +RG+WN+ G
Sbjct: 217 STFIPERQLRNVYFPPFEAAAKAG-CATFMTSFNDNDGIPSTGNSFILKDVLRGEWNYDG 275

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQGKIAE 328
            +V+D  S   ++ SH F  D KE A+  V  AG++++   G +  N     V++ K++E
Sbjct: 276 LVVTDWASSAEMI-SHGFCKDEKEAAMKSV-NAGINMEMVSGTFIRNLEE-LVKEKKVSE 332

Query: 329 ADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGA 388
           A ID ++R +  +  RLG FD    Y +  +     P H+  A EAA Q ++LLKND   
Sbjct: 333 AAIDEAVRNILRLKFRLGLFDNP--YTDTDQQVKYAPTHLAKAKEAAEQSVILLKNDRET 390

Query: 389 LPLNTGNIKTLALVGPHANATKAMIGN--YEGTPCRYTSPMDGF---YAYSKVINYAPGC 443
           LP  T  I+TLA++GP A+A    +G   ++G      + +      Y     I Y PG 
Sbjct: 391 LPF-TDKIRTLAVIGPLADAAHDQMGTWVFDGEKAHTQTVLTALKEMYGDKVRIIYEPGL 449

Query: 444 ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADA 503
                ++ + I  A++AA +ADA ++ AG +  +  E     DL L G Q+ELI  +A  
Sbjct: 450 GYSRDKHTAGIAKAVNAAMHADAVLVCAGEESILSGEAHSLADLHLQGAQSELIAALAKT 509

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K P+  V+M+   + I   +   +  ++L+  +PG  GG A+AD++FGK  P G+ P+T
Sbjct: 510 GK-PLVTVVMAGRPLTI--GQEVEQSDAVLYAFHPGTMGGPALADLLFGKAVPSGKTPVT 566

Query: 564 ----------WYEANYVKIPYT-------SMPLRPVNNFPGRTYKFFDGPV--VYPFGYG 604
                     +Y  N    P +        +P        G T  + D     ++PFGYG
Sbjct: 567 FPKMVGQIPVYYAHNNTGRPASRQETLIDDIPQEAGQTSLGCTSFYMDAGFDPLFPFGYG 626

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
           LSYT F Y                      N  + TN+              D       
Sbjct: 627 LSYTTFGYD---------------------NLQLATNQLAV-----------DGTLEISF 654

Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTM 715
           ++ N GK +G+E+V +Y +    + T  +K++ G+ R+ +  G++  V F++
Sbjct: 655 DLTNTGKYEGTEIVQLYIQDKAGSITRPVKELKGFRRIPLKQGETKTVSFSL 706


>gi|363583088|ref|ZP_09315898.1| b-glucosidase [Flavobacteriaceae bacterium HQM9]
          Length = 779

 Score =  255 bits (651), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 191/679 (28%), Positives = 317/679 (46%), Gaps = 84/679 (12%)

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVL 147
           T FP  +   AS++    K   +  + EA +       G+ + ++P +++ +D RWGR+ 
Sbjct: 146 TIFPIPLGLAASWDAETAKAAARVSAIEASSY------GIRWTFAPMLDITQDSRWGRIA 199

Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
           E+PGEDPY+    A  YV G QD        +  S+   ++AC KH+  Y         R
Sbjct: 200 ESPGEDPYLASVLAKAYVEGFQD--------NDLSKSTSLAACAKHFIGYGA---AIGGR 248

Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
            +  + + E  ++ T++ PFE  ++ G  ++VM S+N +NG+P   +  LLN+ +R +  
Sbjct: 249 DYNTAIIHEPLLRNTYLKPFEAAIDAG-AATVMTSFNELNGVPASGNKWLLNEVLRKELG 307

Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQGKI 326
           FHG++VSD +SI  ++ +H +  + K  A A  + AGLD++     Y N+    +++ KI
Sbjct: 308 FHGFVVSDWNSITEMI-AHSYAENEKH-AAALGINAGLDMEMTSKSYENYIKQLLKEKKI 365

Query: 327 AEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDN 386
            E  +D  +  +  V  RL  F+   + K     N  + +H++LA  AA +  VLLKN+ 
Sbjct: 366 TETQLDFLVSNILRVKFRLNLFEKPYRLKK-HTGNFYSQEHMDLAKNAAIRSSVLLKNNQ 424

Query: 387 GALPLNTGNIKTLALVGPHANATKAMIG--NYEGTPCRYTSPMDGFYAYSKVINYAPGCA 444
           G LPLN   +  +A++GP ANA    +G   ++G      +P+  F       N+A    
Sbjct: 425 GLLPLN--KLTKVAVIGPLANAPHEQLGTWTFDGDQAYSVTPLQAFKNNKVNFNFAETLT 482

Query: 445 DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAA 504
               Q+      A+  A+++D  +   G +  +  E   R  + LPG Q  LI  +A   
Sbjct: 483 YSRDQSTKAFDKALRTAQSSDVILFFGGEEAILSGEAHSRAHINLPGQQEALIKALAKTG 542

Query: 505 KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW 564
           K P+  VIM+     I   K   ++ +IL   +PG  GG AI ++++GK  PGGRLPITW
Sbjct: 543 K-PIVFVIMAGRP--ITLTKVIDQVDAILMTWHPGTMGGEAIYEMLWGKNEPGGRLPITW 599

Query: 565 YEAN----------------YVK--IPYTSMPLRPVNNFPGRTYKFFDGPVV--YPFGYG 604
            + +                 +K  +   S+P+    +  G T  + D      +PFGYG
Sbjct: 600 PKTSGQLPLFYNHKNTGRPPSIKSFVQMDSIPVGAWQSSLGNTSHYLDVGFTPQFPFGYG 659

Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
           L YT FKY         D+K+      ++ +  V                         +
Sbjct: 660 LGYTTFKYS--------DVKISTTSITKNESLEVS------------------------V 687

Query: 665 EVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKI 723
            + N G   G+E+V +Y +   G     +K++ G++ + +  G S  V FT+NA   L  
Sbjct: 688 TLTNTGDRAGAELVQLYVQDVVGSLTRPVKELKGFKHIHLDKGASTIVKFTLNA-NDLMF 746

Query: 724 VDNAANSLLASGAHTILVG 742
           V+N    +L  G   I VG
Sbjct: 747 VNNTLKPVLEKGEFNIFVG 765


>gi|255013061|ref|ZP_05285187.1| beta-glucosidase [Bacteroides sp. 2_1_7]
 gi|410102523|ref|ZP_11297449.1| hypothetical protein HMPREF0999_01221 [Parabacteroides sp. D25]
 gi|409238595|gb|EKN31386.1| hypothetical protein HMPREF0999_01221 [Parabacteroides sp. D25]
          Length = 758

 Score =  255 bits (651), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 184/603 (30%), Positives = 299/603 (49%), Gaps = 63/603 (10%)

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           ++P +++ RD RWGRV+E  GEDPY+    A   V G Q   G ++   +D     + AC
Sbjct: 162 FAPMVDISRDARWGRVMEGAGEDPYLGSLIAKARVEGFQG--GNDWRSLADVN--TVLAC 217

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH+AAY      G D   +++    Q+    + +P  +   E  V++ M S+N +NG+P
Sbjct: 218 CKHFAAYGAAE-AGRD---YNTSELSQNTLMNYYMPPYLAAKEAGVATFMASFNEINGVP 273

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-C 309
           +  +  L+   +R DW F+G++V+D   I  +V +H  + + KE A      AG+D+D  
Sbjct: 274 STGNKWLMTDLLREDWGFNGFVVTDYTGINEMV-AHSIVRNDKE-AGELAANAGIDMDMT 331

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQY--KNLGKNNICNPQH 367
           G  Y+ + + +V++GK++E +ID ++  +  +   LG FD   +Y      KN I  P+ 
Sbjct: 332 GGIYSQYLVQSVKEGKVSEENIDRAVASILEMKFLLGLFDDPYRYLDNEREKNTIMKPEF 391

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGP------HANATKAMIGNYEGTPC 421
           ++ A E + + IVLLKNDN   P++     T+AL+GP      + N   A  G  E +  
Sbjct: 392 LQEARETSARSIVLLKNDNNFFPISKDKHITVALIGPMVKDKINQNGEWAGRGEREESIS 451

Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
            +    + +   +    YA GC D++  ++S    AI  A+ AD  +   G D +   E 
Sbjct: 452 LFEGLTEKYAGTNVKFIYAEGC-DLLTDDSSKFAEAIATARRADIVLAAMGEDFNWSGEA 510

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
             R DL LPG Q  L+ ++    K P+ L++++   +D+++   +  +  IL   Y G  
Sbjct: 511 ACRTDLKLPGAQQALLKELKKTGK-PLGLILVNGRPLDLSW--EDQHVDGILEAWYLGTM 567

Query: 542 GGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPL-RPV-NNFPGRTYK--FFDGP 596
            G  +ADVI G YNP  RL +++      + + Y   P  RPV    P   YK  + D P
Sbjct: 568 AGHGMADVISGDYNPSARLTMSFPRTVGQLPLYYNQKPTGRPVPPEAPDTDYKSRYMDVP 627

Query: 597 --VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
              +YPFGYGLSYT F            +KLD++      ++T G               
Sbjct: 628 NTPLYPFGYGLSYTTFAVN--------SMKLDQN------SFTKGG-------------- 659

Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
               K T   EVEN GK+DG  V+ +Y +   G     +K++ G+E+V + AG+  +V F
Sbjct: 660 ----KITVTAEVENTGKVDGETVIQMYIRDLAGSVTRPVKELKGFEKVTLKAGEKKQVSF 715

Query: 714 TMN 716
           T++
Sbjct: 716 TID 718


>gi|423223593|ref|ZP_17210062.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638218|gb|EIY32065.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 863

 Score =  255 bits (651), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 158/430 (36%), Positives = 224/430 (52%), Gaps = 45/430 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           Y DA L    RA+ LV+ +TL EK   M D +  V RLG+  Y WW+EALHGV+  G   
Sbjct: 23  YKDASLSPERRAELLVKELTLEEKAHLMMDGSRSVERLGIKPYNWWNEALHGVARAGL-- 80

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
                         AT FP  I   ASFN  +  ++   VS EARA      +       
Sbjct: 81  --------------ATVFPQPIGMAASFNPEMVYEVFNAVSDEARAKNTYYASQDSRERY 126

Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
            GLT W+P +N+ RDPRWGR +ET GEDPY+  R  +  V+GLQ          +D +  
Sbjct: 127 QGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQG--------PADGKYD 178

Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
           K+ AC KH+A +    W   +R  F++  +  +D+ ET++ PFE  V EG V  VMC+YN
Sbjct: 179 KLHACAKHFAVHSGPEW---NRHSFNAENIKPRDLYETYLPPFEALVKEGKVEEVMCAYN 235

Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
           R  G P C   +LL Q +RG+W F G +VSDC +I        H    D +  + A V+ 
Sbjct: 236 RFEGDPCCGSDRLLMQILRGEWGFDGIVVSDCGAIADFYNDRGHHTHPDAESASAAAVI- 294

Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK--- 359
           +G DL+CG  Y    + +V++G I+E  +DTS++ L      LG  D  P+  +  K   
Sbjct: 295 SGTDLECGSSYKAL-IESVKKGLISEETVDTSVKRLMKARFALGEMD-EPEKVSWTKIPF 352

Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
           + + +  H  LA   AR+ + LL N +  LPL  G + T+A++GP+AN +    GNY G 
Sbjct: 353 SVVASAAHDSLALNMARESMTLLMNKDNFLPLKRGGL-TVAVMGPNANDSVMQWGNYNGM 411

Query: 420 PCRYTSPMDG 429
           P    + +DG
Sbjct: 412 PAHTVTILDG 421



 Score =  133 bits (334), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 95/309 (30%), Positives = 152/309 (49%), Gaps = 53/309 (17%)

Query: 445 DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQT 494
           D+  + +  I  +++  K+AD  +  +G+  S+E E            DR D+ LP  Q 
Sbjct: 581 DLGFKKDVDIRKSVERVKDADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQR 640

Query: 495 ELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKY 554
           ELI+ +  A K    +++++     I       K ++IL   YPG++GG+A+A+V+FG Y
Sbjct: 641 ELIDALHRAGK---KIILVNCSGSPIGLEPETQKCEAILQAWYPGQQGGKAVAEVLFGDY 697

Query: 555 NPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKV 614
           NP G+LP+T+Y         + +P     N  GRTY++     ++PFGYGLSYT F Y  
Sbjct: 698 NPAGKLPVTFYRN------VSQLPDFEDYNMTGRTYRYMQDVPLFPFGYGLSYTTFGYG- 750

Query: 615 ASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDG 674
               K+V   LDK++       T G +                      + V N GK +G
Sbjct: 751 ----KTV---LDKNE------LTAGQS------------------LKLTVPVTNTGKRNG 779

Query: 675 SEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LA 733
            EVV VY +  G A   IK +  ++RV I AG++  V F +   K L+  D+ +N++ + 
Sbjct: 780 EEVVQVYLRKQGDAEGPIKTLRAFKRVSIPAGKTVNVEFDLKD-KELEWWDDQSNTVRVC 838

Query: 734 SGAHTILVG 742
            G + I+VG
Sbjct: 839 PGNYDIMVG 847


>gi|427384989|ref|ZP_18881494.1| hypothetical protein HMPREF9447_02527 [Bacteroides oleiciplenus YIT
           12058]
 gi|425728250|gb|EKU91109.1| hypothetical protein HMPREF9447_02527 [Bacteroides oleiciplenus YIT
           12058]
          Length = 862

 Score =  255 bits (651), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 163/462 (35%), Positives = 239/462 (51%), Gaps = 45/462 (9%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY + +L   ERAKDLV R+TL EK   M D +  +PRLG+  + WWSEALHGV+  G 
Sbjct: 21  LPYQNPELSPAERAKDLVSRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQG- 79

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--LGNA--- 126
                            T FP  +   ASFNE L  +I    S E RA +N  + N    
Sbjct: 80  ---------------NVTVFPEPVGMAASFNEKLVFEIFNATSDEMRAKHNERVRNGLED 124

Query: 127 ----GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
                L+ W+PN+N+ RDPRWGR  ET GEDPY+  +  I  V+GLQ  E  +Y      
Sbjct: 125 TRFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPEDEKYR----- 179

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
              K+ AC KHYA +    W  +      + ++ +D+ ET++  F+  V + DV  VMC+
Sbjct: 180 ---KLLACAKHYAVHSGPEWSRHSANL--NNISPRDLWETYLPAFKALVQKADVREVMCA 234

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           Y R++  P C   +LL Q +R +W F   +VSDC +I     SHK  +D    AV   + 
Sbjct: 235 YQRLDDDPCCGSTRLLQQILRDEWGFKYLVVSDCGAIADFWTSHKSSSDAVHAAVKGTM- 293

Query: 303 AGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGK 359
           AG D++CG  Y    +  AV +G I E +ID  +  L      LG  D     ++  +  
Sbjct: 294 AGTDVECGYGYAYQKLPEAVSRGLITEEEIDKHVLRLLEGRFELGEMDDPSLVKWSQIPM 353

Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
           + + +  H +L+   +RQ + LL+N N  LPL + +I+ +A++GP+A+    + GNY GT
Sbjct: 354 SVVNSKAHKDLSLNMSRQTMTLLQNKNNVLPL-SKSIRKIAVIGPNADDKPMLWGNYNGT 412

Query: 420 PCRYTSPMDGFYAYSK--VINYAPGCADIVCQNNSMIPAAID 459
           P +  + +DGF    K   I Y  GC D+V  N+  + + +D
Sbjct: 413 PNQTITILDGFKTKLKKNQIIYMKGC-DLV--NDKTLESYLD 451



 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 82/297 (27%), Positives = 133/297 (44%), Gaps = 54/297 (18%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           +I   KN D  V V G+   +E E            DR D+ LP  Q   +  + +A+K 
Sbjct: 593 SISKLKNVDMVVFVGGISPQLEGEEMPLNLPGFKNGDRTDIELPAVQRNFLKALKEASK- 651

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
              +V ++     +          +IL   Y GE GG+A+ADV+FG YNP G+LP+T+Y+
Sbjct: 652 --QVVFVNCSGSSMALLPETESCDAILQAWYGGELGGQAVADVLFGDYNPSGKLPVTFYK 709

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
           +      Y    ++      GRTY++   P+ +PFG+GLSYT F                
Sbjct: 710 STKQLPDYEDYSMK------GRTYRYMSDPL-FPFGFGLSYTDF---------------- 746

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
                     TVGT +  C+   +      +   T  + + N GK  G+EV+ VY +   
Sbjct: 747 ----------TVGTAQ--CSKTQLR----TEEALTLTVPISNTGKRSGTEVIQVYIRKTD 790

Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
             G  +K +  Y R  +AAG +  +   + A +S +  D + N++ +A G + +  G
Sbjct: 791 DTGGPLKSLKAYARAELAAGATQDIEIQLPA-ESFECFDPSTNTMRVAPGEYELFYG 846


>gi|224537265|ref|ZP_03677804.1| hypothetical protein BACCELL_02142 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521119|gb|EEF90224.1| hypothetical protein BACCELL_02142 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 885

 Score =  255 bits (651), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 166/463 (35%), Positives = 242/463 (52%), Gaps = 47/463 (10%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY + +L   ERAKDLV+R+TL EK   M D +  +PRLG+  + WWSEALHGV+  G 
Sbjct: 21  LPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQGN 80

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------- 122
                            T FP  +   ASFN+ L   I   VS E RA +N         
Sbjct: 81  ----------------VTVFPEPVGMAASFNDKLVFDIFNAVSDEMRAKHNERVRNGLED 124

Query: 123 LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
           +    L+ W+PN+N+ RDPRWGR  ET GEDPY+  +  I  V+GLQ  E  +Y      
Sbjct: 125 VRFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPENEKYR----- 179

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
              K+ AC KHYA +    W  +      + V+ +D+ ET++  F+  V + DV  VMC+
Sbjct: 180 ---KLLACAKHYAVHSGPEWSRHTANL--NNVSPRDLWETYLPAFKALVQKADVREVMCA 234

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           Y R++  P C + +LL Q +R +W F   +VSDC +I     SHK  +D    AV   + 
Sbjct: 235 YQRLDDDPCCGNTRLLQQILRDEWGFKYLVVSDCGAIADFWTSHKSSSDAVHAAVKGTM- 293

Query: 303 AGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK-- 359
           AG D++CG  Y    +  AV +G I E ++D  +  L      LG  D  P   N  K  
Sbjct: 294 AGTDVECGYGYAYQKLPEAVSKGLITEEEVDKHVLRLMEGRFELGEMD-DPSLVNWTKIP 352

Query: 360 NNICNPQ-HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
            ++ N + H +L+   +RQ + LL+N N  LPL + +I+ +A++GP+A+    + GNY G
Sbjct: 353 MSVVNCKAHKDLSLNMSRQTMTLLQNKNNVLPL-SKSIRKIAVIGPNADDKPMLWGNYNG 411

Query: 419 TPCRYTSPMDGFYAYSK--VINYAPGCADIVCQNNSMIPAAID 459
           TP +  + +DGF +  K   I Y  GC D+V  N+  + + +D
Sbjct: 412 TPNQTITILDGFKSKLKKNQIVYMKGC-DLV--NDQTLESYLD 451



 Score =  115 bits (287), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 86/297 (28%), Positives = 130/297 (43%), Gaps = 54/297 (18%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           +I   K  D  V V G+   +E E          G DR D+ LP  Q   +  + DA K 
Sbjct: 593 SISKLKGIDVVVFVGGISPQLEGEEMPVNIPGFKGGDRTDIELPAVQRNFLKALKDAGK- 651

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
              +V ++     +          +IL   Y GE GG A+ADV+FG YNP G+LP+T+Y+
Sbjct: 652 --QVVFVNCSGSSMALLPETESCDAILQAWYGGELGGYAVADVLFGDYNPSGKLPVTFYK 709

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
           +      Y    ++      GRTY++   P ++PFG+GLSYT F    AS  K+   +L 
Sbjct: 710 STKQLPDYEDYSMK------GRTYRYMSDP-LFPFGFGLSYTDFAVGTASCNKT---QLH 759

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
            D+                               T  + V N GK  G+EVV VY +   
Sbjct: 760 TDES-----------------------------LTLTVPVSNTGKRSGTEVVQVYIRKTD 790

Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
            A   +K +  Y RV +AAG    V   + + +S +  D + N++ +A G + +  G
Sbjct: 791 DADGPLKSLKAYARVELAAGAKQDVKIELPS-ESFECFDPSTNTMRVAPGEYELFYG 846


>gi|374596264|ref|ZP_09669268.1| glycoside hydrolase family 3 domain protein [Gillisia limnaea DSM
           15749]
 gi|373870903|gb|EHQ02901.1| glycoside hydrolase family 3 domain protein [Gillisia limnaea DSM
           15749]
          Length = 758

 Score =  255 bits (651), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 196/658 (29%), Positives = 318/658 (48%), Gaps = 91/658 (13%)

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLE 148
           T FP  +  TAS++    ++  +  + E+ A         TF SP I++ RD RWGR++E
Sbjct: 122 TIFPVPLGETASWDLEAMEESARIAALESAAH----GVNWTF-SPMIDISRDARWGRIME 176

Query: 149 TPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRF 208
             GEDPY+  + A+  ++G        Y  +  +    I+A  KH+A Y         R 
Sbjct: 177 GSGEDPYLTSKVAVAKIKG--------YQGNDLADANTIAATAKHFAGYGFGE---AGRD 225

Query: 209 HFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNF 268
           +    + E ++  T + PF+     G V++ M ++N ++G P      L    ++GDWN+
Sbjct: 226 YNTVHIGENELHNTILPPFKAAAEAG-VATFMNAFNDIDGTPATGHKILQRDILKGDWNW 284

Query: 269 HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC-GDYYTNFTMGAVQQGKIA 327
           +G+IVSD  SI  ++  H F  D K+ A    +KAG D+D  G  Y N     V+ G+I 
Sbjct: 285 NGFIVSDWASIPEMI-YHGFARD-KKHAAEIAVKAGSDMDMEGGAYENHLEDLVKSGEID 342

Query: 328 EADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK-NNICNPQHIELAAEAARQGIVLLKNDN 386
           E  +D S+R +  V  +LG FD   +Y N     NI   +H++ A + A + IVLLKN+ 
Sbjct: 343 EELLDDSVRRILRVKFKLGLFDDPYKYSNPEMLKNISFEEHLKTARDIASKSIVLLKNEG 402

Query: 387 GALPLNTGNIKTLALVGPHANATKAMIGNY--EGTPCRYTSPMDGF---------YAYSK 435
             LPL   ++K +A++GP A+   + IGN+  +G      S ++G            Y+K
Sbjct: 403 ELLPLKP-SVKNIAVIGPLADDKNSPIGNWRAQGEENSAVSVLEGIKNAVGNNVRVTYAK 461

Query: 436 VINYAPGCADIVC------QNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLL 489
             ++  G  + +        + S    AI+ AKNA+  ++V G D     EG+ +V++ L
Sbjct: 462 GADHGTGVKNFLLPLEINETDKSGFAEAIEVAKNAEVVLMVLGEDAFQTGEGRSQVEIGL 521

Query: 490 PGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADV 549
            G Q EL+ +V    K  + LV+++   ++I++A  N  I +I+   + G E G AIADV
Sbjct: 522 MGVQQELLEEVYKVNKN-IVLVLINGRPLEISWAAEN--IPAIVEAWHLGSESGNAIADV 578

Query: 550 IFGKYNPGGRLPIT----------WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
           +FGKYNP G+LP++          +Y       PY++  +     + G  Y   +   +Y
Sbjct: 579 LFGKYNPSGKLPVSFPRNVGQEPLYYNQKNTGRPYSAEHV----TYSG--YTDVEKDALY 632

Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
           PFGYGLSYT FKY V   P+    KL                              ++  
Sbjct: 633 PFGYGLSYTTFKYGV---PQLTSKKL-----------------------------TQEGS 660

Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMN 716
            T  + V N GK+ G EVV +Y +    + T  +K++  +E V +A G++  V F ++
Sbjct: 661 ITVTVPVTNTGKLKGKEVVQLYIRDLVASTTRPVKELKAFEMVELAPGETRDVQFEID 718


>gi|167645796|ref|YP_001683459.1| glycoside hydrolase family 3 [Caulobacter sp. K31]
 gi|167348226|gb|ABZ70961.1| glycoside hydrolase family 3 domain protein [Caulobacter sp. K31]
          Length = 808

 Score =  254 bits (650), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 209/721 (28%), Positives = 326/721 (45%), Gaps = 107/721 (14%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+P+     EALHG  ++ R                ATSFP  I   ++F+  + +K+
Sbjct: 152 RLGVPML-MHDEALHG--YVAR---------------DATSFPQAIALASTFDTEMTEKV 193

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
               + E RA  +  N  L   +P ++V RDPRWGR+ ET GEDP++     +  +RG Q
Sbjct: 194 FAVAAREMRARGS--NIAL---APVVDVARDPRWGRIEETYGEDPHLCAEIGLAAIRGFQ 248

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
                   +     P K+    KH   +       N      +++ E+ ++E F  PFE 
Sbjct: 249 G-------KTLPLAPDKVFVTLKHMTGHGQPE---NGTNVGPAQIAERTLRENFFPPFER 298

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            V E  V SVM SYN ++G+P+ A+  LL   +R +W + G + SD  +I+ ++  HK  
Sbjct: 299 AVKELPVRSVMPSYNEIDGVPSHANRWLLTDILRKEWGYKGSVQSDYFAIKELMGRHKLT 358

Query: 290 NDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
           +D  E AV   + AG+D++   G+ Y       V+ G+I +A +D ++  +  +    G 
Sbjct: 359 DDLGETAVM-AMNAGVDVELPDGEAYALLPQ-LVKVGRIPQAAVDQAVERVLTMKFEGGL 416

Query: 348 FDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
           F+     +         P  I LA EAAR+ +VLLKND G LPLN    K LAL+G HA 
Sbjct: 417 FENPYADEKTADAKTATPDAIALAREAARKAVVLLKNDKGVLPLNPSKFKRLALLGTHAK 476

Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAYSK----VINYAPGC----ADIVCQ---------- 449
            T   IG Y  TP    S  +G  A +K     ++YA       A I  Q          
Sbjct: 477 DTP--IGGYSDTPRHVVSIYEGLQAEAKKSGFTLDYAEAVRITEARIWAQDEVKLVDPAV 534

Query: 450 NNSMIPAAIDAAKNADATVIVAGLDLSVEAEG------KDRVDLLLPGFQTELINKVADA 503
           N  +I  A++ AK AD  V+V G +     E        DR  L L G Q +L   + D 
Sbjct: 535 NAKLIAEAVEVAKQADVIVMVLGDNEQTSREAWADNHLGDRDSLDLIGQQNDLARAIFDL 594

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K P  + +++   + IN      +  +++   Y G+E G A AD++FG+ NPGG+LP++
Sbjct: 595 GK-PTVVFLLNGRPLSINLLAQ--RADAVIEGWYLGQETGNAAADILFGRANPGGKLPVS 651

Query: 564 -WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVD 622
              +   + I Y   P         R Y   D   +YPFG+GLSYT F     S+P+   
Sbjct: 652 IARDVGQLPIYYNRKPT------ARRGYLLGDTSPLYPFGFGLSYTTFDI---SAPRPAK 702

Query: 623 IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYS 682
            ++  ++  +                              +I+V N GK+ G EVV +Y 
Sbjct: 703 AEIGANESVK-----------------------------VEIDVINTGKVAGDEVVQLYI 733

Query: 683 KPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
                + T  + ++  ++RV +A G    V F ++    L + +     ++  G  T+L 
Sbjct: 734 HDEAASVTRPVLELKHFKRVTLAPGAKQTVTFEVSPL-DLSLWNLEMKRVVEPGKFTLLS 792

Query: 742 G 742
           G
Sbjct: 793 G 793


>gi|424661946|ref|ZP_18098983.1| hypothetical protein HMPREF1205_02332 [Bacteroides fragilis HMW
           616]
 gi|404578257|gb|EKA82992.1| hypothetical protein HMPREF1205_02332 [Bacteroides fragilis HMW
           616]
          Length = 814

 Score =  254 bits (650), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 235/808 (29%), Positives = 354/808 (43%), Gaps = 161/808 (19%)

Query: 23  ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE------------------------- 57
           ER + L+ +MTL EKV QM      +  LG P+YE                         
Sbjct: 58  ERVEYLLSQMTLEEKVGQM------LTSLGWPMYERVGEEIRLTARLEKEISEYHIGALW 111

Query: 58  -------WWSEALH-GV--SFIGRRTNSPPG---THFDSEVP--------------GATS 90
                  W    LH G+  S   R +N        H    +P              G T 
Sbjct: 112 GFMRADPWTQRTLHTGLNPSLAARASNRLQAFVMEHSRLGIPLFLAEECPHGHMAIGTTV 171

Query: 91  FPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETP 150
           FPT I   +++N  L +++G+ ++ EA A           + P +++ RDPRW RV ET 
Sbjct: 172 FPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDPRWSRVEETY 226

Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
           GEDPY+ G      VRG Q         D+      + A  KH+A+Y    W        
Sbjct: 227 GEDPYLNGVMGAALVRGFQG--------DTLRGRKSVIATLKHFASY---GWTEGGHNGG 275

Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
            + + E++++E    PF   V  G +S VM SYN ++G P      LL   ++  W F G
Sbjct: 276 TAHLGERELEEAIFPPFREAVGAGALS-VMSSYNEIDGNPCTGSRYLLTDILKDRWLFKG 334

Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEA 329
           ++VSD  +I  + E H       E AV + + AG+D D G + Y    + AV++G +A  
Sbjct: 335 FVVSDLYAIGGLRE-HGVAGSDYEAAV-KAVNAGVDSDLGTNVYAEQLVAAVRKGDVAME 392

Query: 330 DIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGAL 389
            +D ++R +  +   +G FD            + +P+HI LA E ARQ IVLLKN++  L
Sbjct: 393 TVDKAVRRILSLKFHMGLFDAPFVDDKRPAQLVASPEHIGLAREVARQSIVLLKNEDKLL 452

Query: 390 PLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSKVINYAPGCA 444
           PL   +I+TLA++GP+A+    M+G+Y     +G+       +    +    + YA GCA
Sbjct: 453 PLKK-DIRTLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIRQKVSKDTRVLYAKGCA 511

Query: 445 DIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE-------------------AEG 481
            +   + +    AI+AA++AD  V+V G     D S E                    EG
Sbjct: 512 -VRDSSRTGFADAIEAARSADVVVMVVGGSSARDFSSEYEETGAAKVSANRVSDMESGEG 570

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
            DR  L L G Q EL+ +V    K P+ LV++    + +       +  +IL   YPG +
Sbjct: 571 YDRATLHLMGRQLELLEEVRKLGK-PMVLVLIKGRPLLMEGVIQ--EADAILDAWYPGMQ 627

Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD--GPVVY 599
           GG A+ADV+FG YNP GRL ++      V      +P+       G   ++ +  G   Y
Sbjct: 628 GGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTKRKGNRSRYIEEAGTPRY 681

Query: 600 PFGYGLSYTQFKY---KVASSPKSVDIKLDKDQQCR-DINYTVGTNKPPCAAVLIDDVKC 655
           PFGYGLSYT F Y   KV  S +S          CR D++ T                  
Sbjct: 682 PFGYGLSYTMFSYTGMKVRVSEES--------NHCRVDVSVT------------------ 715

Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
                     V N G +DG EVV +Y +   G   T  +Q+  + RV + AG++ ++ FT
Sbjct: 716 ----------VRNQGTVDGDEVVQLYLRDEVGSFTTPDRQLRAFSRVRLKAGETREITFT 765

Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
           ++  KSL +        +  G  T++ G
Sbjct: 766 LDK-KSLALYMRDGEWAVEPGRFTVMAG 792


>gi|300726322|ref|ZP_07059774.1| beta-xylosidase B [Prevotella bryantii B14]
 gi|291292284|gb|ADD92014.1| Xyl3A [Prevotella bryantii B14]
 gi|299776347|gb|EFI72905.1| beta-xylosidase B [Prevotella bryantii B14]
          Length = 885

 Score =  254 bits (650), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 156/444 (35%), Positives = 226/444 (50%), Gaps = 39/444 (8%)

Query: 12  FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
            PY +  L   ERA DL  R+TL EK   M D +  +PRLG+  + WWSEALHG + +G 
Sbjct: 48  LPYQNPNLSAYERAIDLCHRLTLEEKALLMQDESPAIPRLGIKKFFWWSEALHGAANMGN 107

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--LGNAG-- 127
            TN                FP  I   +SFN +L K +    S E RA Y+  + N G  
Sbjct: 108 VTN----------------FPEPIAMASSFNPTLLKSVFSAASDEMRAQYHHRMDNGGED 151

Query: 128 -----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
                L+ W+PN+N+ RDPRWGR  ET GEDPY+        V GLQ  E  +Y      
Sbjct: 152 EKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGCAVVEGLQGPESSKYR----- 206

Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
              K+ AC KH+A +     E        + ++ +D+ ET++  F+  V +G V  VMC+
Sbjct: 207 ---KLWACAKHFAVHS--GPESTRHTANLNNISPRDLYETYLPAFQSTVQDGHVREVMCA 261

Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
           Y R++  P C++ +LL Q +R +W F   +VSDC ++  I +SHK  +D    +    L 
Sbjct: 262 YQRLDDEPCCSNNRLLQQILREEWGFKYLVVSDCGAVSDIWQSHKTSSDAVHASRQATL- 320

Query: 303 AGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGK 359
           AG D++CG  YT   +  AV++G + E +ID  +  L      LG  D S   ++  +  
Sbjct: 321 AGTDVECGYGYTYAKIPEAVKRGLLTEEEIDKHVIRLLEGRFDLGEMDDSKLVEWSKIPY 380

Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
           + +    H +LA + ARQ IVLL+N    LPL     + +A++GP+A+    M GNY GT
Sbjct: 381 SIMSCKAHAQLALDMARQSIVLLQNKGNILPLQLKKNERIAVIGPNADNKPMMWGNYNGT 440

Query: 420 PCRYTSPMDGFYAYSKVINYAPGC 443
           P    S ++G     K + Y P C
Sbjct: 441 PNHTVSILEGIRKQYKNVVYLPAC 464



 Score =  110 bits (274), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 82/298 (27%), Positives = 135/298 (45%), Gaps = 57/298 (19%)

Query: 456 AAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAK 505
           A I   K  D  + V G+  S+E E          G DR D+ +P  Q + I  +A+A K
Sbjct: 618 ANIAQLKGIDKVIFVGGIAPSLEGEEMPVNIPGFKGGDRTDIEMPQVQRDFIKALAEAGK 677

Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
               +++++     I       + ++I+   YPG+EGG A+AD++ GK NP G+LP+T+Y
Sbjct: 678 ---QIILVNCSGSAIALTPEAQRCQAIIQAWYPGQEGGTAVADILMGKVNPMGKLPVTFY 734

Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
           ++         +P     +   RTY++F+   +YPFGYGLSYT F+              
Sbjct: 735 KST------QQLPDFEDYSMKNRTYRYFED-ALYPFGYGLSYTSFE-------------- 773

Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
                       +GT K      L ++        T QI V N GK +G+E+V VY +  
Sbjct: 774 ------------IGTAK---LQTLTNN------SITLQIPVTNTGKREGTELVQVYLRRD 812

Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
                  K +  +  + + AG++ K    +N     +  D + N++ +  G +TI  G
Sbjct: 813 DDVEGPSKTLRSFAHITLKAGETKKAILKLNR-NQFECWDASTNTMRVIPGKYTIFYG 869


>gi|393786524|ref|ZP_10374660.1| hypothetical protein HMPREF1068_00940 [Bacteroides nordii
           CL02T12C05]
 gi|392660153|gb|EIY53770.1| hypothetical protein HMPREF1068_00940 [Bacteroides nordii
           CL02T12C05]
          Length = 841

 Score =  254 bits (650), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 227/811 (27%), Positives = 351/811 (43%), Gaps = 149/811 (18%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WS------ 60
           + D   P  +R KDL+ +MT+ EK  Q+  L YG  R+    LP   W    W       
Sbjct: 82  FEDPSQPVEKRVKDLLSQMTIEEKSCQLATL-YGFGRVLKDSLPTPAWKEAIWKDGIANI 140

Query: 61  -EALHGVSFIGRRTNS---PPGTH----------------------FDSE-VPG-----A 88
            E L+GV    +R      P   H                      F +E + G     A
Sbjct: 141 DEQLNGVGRGAKRVPHLIVPFSNHVKAINETQRWFIEETRLGIPVDFSNEGIHGLNHTKA 200

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVL 147
           T  P  I   +++N  L ++ G+ V  EAR +      G T  ++P ++VVRDPRWGR L
Sbjct: 201 TPLPAPIAIGSTWNTELVREAGEIVGKEARVL------GYTNVYAPILDVVRDPRWGRTL 254

Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
           E  GEDPY++G   +  V G+Q  +GV             +A  KH+A Y       +  
Sbjct: 255 ECYGEDPYLIGELGVQMVDGIQS-QGV-------------AATLKHFAVYSSPKGGRDGN 300

Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
              D  VT +++ E ++ PF+  + +     VM SYN  NG P  +    L + +R ++ 
Sbjct: 301 CRTDPHVTPRELHEIYLYPFKHVIQQSHPMGVMSSYNDWNGEPVTSSYYFLTKLLREEYG 360

Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGA------- 320
           F GY+VSD  +++ +   H+   D  E AV +VL+AGL++      T+FT  A       
Sbjct: 361 FDGYVVSDSQAVEFVHTKHQVAEDYDE-AVRQVLEAGLNVR-----THFTPPADFILPIR 414

Query: 321 --VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNP-QHIELAAEAARQ 377
             + + KI+ A ID  +  +  V  RLG FD   +      + +    +H E   E  RQ
Sbjct: 415 RLLAENKISMATIDKRVSEVLAVKFRLGLFDAPYRDNPKEADEVAGADKHSEFVKEMQRQ 474

Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK-- 435
            +VLLKND   LPLN   IK + + GP A+    MI  Y        + + G   Y K  
Sbjct: 475 SLVLLKNDGQLLPLNKKEIKKVLVTGPLADEDNFMISRYGPNGLPTITVLQGIKDYLKGD 534

Query: 436 -VINYAPGC--------------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
             + Y+ GC              A +  +  + +  A+  A++AD  + V G D     E
Sbjct: 535 VEVVYSKGCNIIDKEWPASEVLPAVLTAEEVADMDKAVSEAQSADVIIAVMGEDEYRVGE 594

Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
            + R  L LPG Q EL+  +    K PV LV+++   + IN+   N  + +IL   +P  
Sbjct: 595 SRSRTSLELPGRQRELLQALHATGK-PVVLVLINGQPLTINWEDQN--LPAILEAWFPSF 651

Query: 541 EGGRAIADVIFGKYNPGGRLPITW------YEANYVKIPYT--SMPLRPVNNFPGRTYKF 592
           +GG+ IA+ +FG YNPGG+L +T+       E N+   P+   S   +P +   G     
Sbjct: 652 QGGKIIAETLFGDYNPGGKLTVTFPKSVGQIELNF---PFKKGSHGTQPSSGPNGSGSTR 708

Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
             G  +YPFGYGLSYT F Y                         +    P         
Sbjct: 709 VLG-ALYPFGYGLSYTTFAYS-----------------------NLEVTAP--------- 735

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKV 711
            K    +     ++ N GK  G EV  +Y +       T+  ++ G++RV +   ++ ++
Sbjct: 736 AKGTQGEVQISFDITNTGKYAGEEVAQLYVRDLVSSVVTYDSRLRGFQRVLLQPNETKRM 795

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
            FT+     L+++D      + SG   + VG
Sbjct: 796 HFTLKPA-DLELLDRNMEWTVESGTFEVRVG 825


>gi|255013016|ref|ZP_05285142.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. 2_1_7]
 gi|410102476|ref|ZP_11297402.1| hypothetical protein HMPREF0999_01174 [Parabacteroides sp. D25]
 gi|409238548|gb|EKN31339.1| hypothetical protein HMPREF0999_01174 [Parabacteroides sp. D25]
          Length = 732

 Score =  254 bits (650), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 219/776 (28%), Positives = 361/776 (46%), Gaps = 142/776 (18%)

Query: 18  KLPYPERAKDLVERMTLPEKVQQM-GDLAY---GVPRLGLPLYEW-WSEALHGV-SFIGR 71
           K+   +R + L+++MTL EKV  + G+  +   GV RLG+P  EW  S+  HGV + I R
Sbjct: 28  KVQMEKRIEKLIKKMTLEEKVGLLHGNSKFYVAGVERLGIP--EWSLSDGPHGVRAEINR 85

Query: 72  RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
              +  G   DS    A+ FPT     A++N  L  + G+ +  EAR             
Sbjct: 86  HDWAYAGWTNDS----ASYFPTGTAFAAAWNPELAYRRGEVLGEEAR-----WRKKDVLL 136

Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
            P +N++R P  GR  E   EDPY+    A+ Y++GLQ        RD       ++   
Sbjct: 137 GPGVNIIRSPLCGRNFEYMSEDPYMNSVLAVAYIKGLQS-------RD-------VACSV 182

Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
           KH+A   ++N E N R   D   +E+ ++E ++  F+  V EG   +VM +YN+  G   
Sbjct: 183 KHFA---VNNQETN-RTTVDVECSERALREIYLPAFKAAVQEGGALTVMAAYNKFRGEFC 238

Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
             +  L+ + +R +W F G  V+D  +  + V S               ++AGLDL+ G 
Sbjct: 239 AENNYLVRKILRNEWGFDGVYVTDWGAAHSTVPS---------------MEAGLDLEMGT 283

Query: 312 --------YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
                   YY N  + AV+ GK+  + +D  +  +  V+++    D  P+ K  G  ++ 
Sbjct: 284 LIDKYEDWYYANPLIDAVKSGKVPMSLVDEKVGDVLRVMIKTNVLD--PK-KRFGPGSMN 340

Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG-----PHANA--TKAMIGNY 416
             +H +   +AA + IVLLKN N  LPL+  +IK+LA++G      H+N   +  +   Y
Sbjct: 341 TKEHQQATYDAAAEAIVLLKNQNNLLPLDFSSIKSLAVIGDNATRKHSNGGLSSEIKAVY 400

Query: 417 EGTP-----CRYTSPMDGFYA--YSKVINYAPGC---------ADIVCQNNSMIPAAIDA 460
           E TP      ++   +D  +A  Y K+  +  G          +    ++++++  A++ 
Sbjct: 401 EVTPLGALRAKWGDKVDIRFAQGYEKLSTFVEGSNNGQSSGTFSSKTQESDALLKEAVEV 460

Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
           A+ +D  ++V GL+   + E  DR+++ +P  Q ELI +V  A   P T+V+M AG+  +
Sbjct: 461 ARTSDVALLVCGLNHDYDTESFDRLNMDIPYGQVELIQEVVKA--NPRTIVVMIAGS-PL 517

Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR 580
           N A  +    +I+W  + G EGG  + DV+ GK NP G++P T        +     P  
Sbjct: 518 NMAAVDICSPAIVWAWFNGMEGGNVLVDVLSGKVNPSGKMPFT------TPVSLDQSPAH 571

Query: 581 PVNNFPGRT------------YKFFDG---PVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
            + NFPGR             Y++FD    PVVYPFGYGLSYT F Y            L
Sbjct: 572 ALGNFPGRDLKVNYEEDILVGYRWFDTKGLPVVYPFGYGLSYTTFDYS----------NL 621

Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
           + D++  D   T+                    + TF +   N G  +G+EV  +Y   P
Sbjct: 622 NTDKETYDQADTI--------------------QATFTL--TNTGDREGAEVAQLYVSDP 659

Query: 686 GIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
             +    +K++ G+++VF+  G+S ++   +    SL     A +  +      IL
Sbjct: 660 VCSVMRPVKELKGFKKVFLKPGESRRITLDI-PVSSLAFYSEAQSQFVVEPGEFIL 714


>gi|120437787|ref|YP_863473.1| beta-glucosidase [Gramella forsetii KT0803]
 gi|117579937|emb|CAL68406.1| beta-glucosidase [Gramella forsetii KT0803]
          Length = 757

 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 202/689 (29%), Positives = 327/689 (47%), Gaps = 84/689 (12%)

Query: 89  TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLE 148
           T FP  +  TAS++    ++  +  + E+ A         TF SP I++ RD RWGR++E
Sbjct: 122 TIFPVPLAETASWDMEAAEESARIAALESVAE----GVNWTF-SPMIDISRDARWGRIME 176

Query: 149 TPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRF 208
             GEDPY+  + A+  V+G        Y  +  S P  I+A  KH+A Y     EG   +
Sbjct: 177 GSGEDPYLTSKVAVAKVKG--------YQGEDLSNPKTIAATAKHFAGYGFA--EGGKDY 226

Query: 209 HFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNF 268
           +    + E ++    + PF+   + G V++ M S+N ++GIP      L  + ++GDW++
Sbjct: 227 N-TVNIGENELHNVILPPFKAAADAG-VATFMNSFNTIDGIPATGSESLQREILKGDWDW 284

Query: 269 HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC-GDYYTNFTMGAVQQGKIA 327
            G++VSD  SI  ++  H F  D K  A    +KAG D+D  G  Y       V  GK+ 
Sbjct: 285 TGFMVSDWGSIAEMI-PHGFAKD-KIHAAEIAVKAGSDMDMEGGAYEAGLEKLVAAGKVE 342

Query: 328 EADIDTSLRFLYIVLMRLGYFDGSPQYKNL-GKNNICNPQHIELAAEAARQGIVLLKNDN 386
           EA ID +++ +  V  ++G FD   +Y N   K N+   +H+  A + A++ IVLLKN+N
Sbjct: 343 EALIDDAVKRILRVKFKMGLFDDPYRYINSETKKNVPYKEHMSTARDIAKKSIVLLKNEN 402

Query: 387 GALPLNTGNIKTLALVGPHANATKAMIGNY--EGTPCRYTSPMDGFY--AYSKVINYAPG 442
             LP+ T ++K +A++GP A+     IGN+  +G      S ++G         I YA G
Sbjct: 403 DLLPIKT-SVKKIAVIGPLADDKDTPIGNWRAQGEENSAVSVLEGLKNANLDAQITYAQG 461

Query: 443 CADIVCQNNSMIP------------AAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
               + + + ++P             A+  AKNA+  V+V G D     EG+ +  + L 
Sbjct: 462 IKLGMGERSFLMPLKINKTDTTGMGEAVRNAKNAELVVMVLGEDAFQSGEGRSQAKIGLA 521

Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
           G Q EL+  V    K  + LV+++   ++++++  N  I +I+     G E G AIADV+
Sbjct: 522 GLQMELLKAVHKVNKN-IVLVLINGRPLELSWSSEN--IPTIVEAWQLGSESGNAIADVL 578

Query: 551 FGKYNPGGRLPITW-----YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGL 605
            GKYNP G+LP+++      E  Y     T  P    +          +GP +YPFGYGL
Sbjct: 579 LGKYNPSGKLPVSFPRAVGQEPLYYNHKNTGRPFSAEHVTYAHYTDIENGP-LYPFGYGL 637

Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIE 665
           SYTQF Y                             +P  +   ++ +K ++ K T  + 
Sbjct: 638 SYTQFDYA----------------------------RPELS---VESIKSRE-KATLSVA 665

Query: 666 VENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIV 724
           V N G   G EVV +Y +         +K++ G+E + +  G++  V F +N  + LK  
Sbjct: 666 VTNSGDRKGKEVVQLYLRDLVATTARPVKELKGFEMIELEPGETKTVEFIINE-EMLKFY 724

Query: 725 DNAANSLLASGAHTILVG---EGVGGVSF 750
           + +       G   ++VG   E V  V F
Sbjct: 725 NASEKWEAEEGEFQLMVGGNSEDVQSVKF 753


>gi|295690896|ref|YP_003594589.1| glycosyl hydrolase family protein [Caulobacter segnis ATCC 21756]
 gi|295432799|gb|ADG11971.1| glycoside hydrolase family 3 domain protein [Caulobacter segnis
           ATCC 21756]
          Length = 806

 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 212/722 (29%), Positives = 324/722 (44%), Gaps = 109/722 (15%)

Query: 50  RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
           RLG+P+     EALHG  ++ R                ATSFP  I   ++F+  L +KI
Sbjct: 151 RLGIPML-MHDEALHG--YVAR---------------DATSFPQAIALASTFDTELTEKI 192

Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
               + E RA  +  N  L   +P ++V RDPRWGR+ ET GEDP+V     +  +RG Q
Sbjct: 193 FAVAAREMRARGS--NLAL---APVVDVARDPRWGRIEETYGEDPHVCAEIGLAAIRGFQ 247

Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
              G       D    K+    KH   +       N      ++++E+ ++E F  PFE 
Sbjct: 248 ---GTTLPLAKD----KVFVTLKHMTGHGQPE---NGTNVGPAQISERVLRENFFPPFER 297

Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
            V E  V +VM SYN ++G+P+     LL + +R +W + G + SD  +I+ ++  HK  
Sbjct: 298 AVTELPVRAVMPSYNEIDGVPSHGSRWLLTKILREEWGYKGSVQSDYFAIKEMISRHKLT 357

Query: 290 NDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
            D  E AV R + AG+D++   G+ Y       V+ G+I + +ID ++  +  +    G 
Sbjct: 358 TDLGETAV-RAMHAGVDVELPDGEAYA-LIPELVKAGRIPQFEIDAAVARVLTMKFEGGL 415

Query: 348 FDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
           F+     +         P  + LA EAAR+ +VLLKND G LPL+   IK LAL+G HA 
Sbjct: 416 FENPYCDEKTADAKTATPDAVALAREAARKAVVLLKNDKGVLPLDGKKIKRLALLGTHAK 475

Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIV------------------CQ 449
            T   IG Y   P    S  +G  A +K   +A   A+ V                    
Sbjct: 476 DTP--IGGYSDVPRHVVSIYEGLTAEAKAQGFALDYAEAVRITEQRIWAQDQVNFTDPAV 533

Query: 450 NNSMIPAAIDAAKNADATVIVAGLDLSVEAEG------KDRVDLLLPGFQTELINKVADA 503
           N  +I  A++ AK AD  V+V G +     E        DR  L L G Q +L   + D 
Sbjct: 534 NAKLIAEAVEVAKKADVVVMVLGDNEQTSREAWADNHLGDRESLDLIGQQNDLAKAIFDL 593

Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
            K P  + +++   + IN      +  +I+   Y G+E G A ADV+FG+ NPGG+LP++
Sbjct: 594 GK-PTVVFLLNGRPLSINLLAE--RADAIIEGWYLGQETGNAAADVLFGRANPGGKLPVS 650

Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSV 621
               N  ++P         N  P     +  G V  +YPFG+GLSYT F     S+P+  
Sbjct: 651 -IARNVGQLPIY------YNRKPTARRGYLGGDVTPLYPFGFGLSYTSFDI---SAPRLA 700

Query: 622 DIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY 681
             K+ + +  +                              +++V N GK+ G EVV +Y
Sbjct: 701 KAKIGQGETVK-----------------------------VEVDVANTGKVAGDEVVQLY 731

Query: 682 SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
                   T  + ++  ++RV +A G    V F +     L + D     ++  G  +IL
Sbjct: 732 IHDETATVTRPVLELKHFKRVTLAPGAKTTVTFEIKPS-DLWMWDLDMKRVVEPGDFSIL 790

Query: 741 VG 742
           VG
Sbjct: 791 VG 792


>gi|261880507|ref|ZP_06006934.1| xylosidase [Prevotella bergensis DSM 17361]
 gi|270332847|gb|EFA43633.1| xylosidase [Prevotella bergensis DSM 17361]
          Length = 948

 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 221/810 (27%), Positives = 365/810 (45%), Gaps = 144/810 (17%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---------------------- 51
           Y DA  P  +R  DL+E+MT+ EK  QM  L YG  R+                      
Sbjct: 61  YEDASAPLNDRINDLLEQMTIEEKTNQMVTL-YGYKRVLEDDLPNAGWKQKLWKDGIGAI 119

Query: 52  ----------GLPLYE--W-WSEALHGVS-------FIGRRTNSPPGTHFDSEVPG---- 87
                     GLP  +  W W  + H  +       F+       P    +  + G    
Sbjct: 120 DEHLNGFVQWGLPPSDNPWVWPASKHAWAINEVQRFFVEETRLGIPVDFTNEGIRGIESY 179

Query: 88  -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
            AT+FPT +    ++N  L +++G     EAR +      G T  ++P ++V RD RWGR
Sbjct: 180 KATNFPTQLGLGTTWNRQLIRQVGYITGREARLL------GYTNVYAPILDVGRDQRWGR 233

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
             E  GE P++V    I   RGLQ                ++++  KH+AAY  +     
Sbjct: 234 YEEIYGESPFLVAELGIQMTRGLQT-------------DFQVASTAKHFAAYSNNKGGRE 280

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
                D ++  ++++   + P+E  V E  +   M SYN  +GIP       L + +R  
Sbjct: 281 GMSRVDPQMPPREVENIHLYPWERVVQEAGLLGAMSSYNDYDGIPIQGSYHWLTEVLRHR 340

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
           + F GYIVSD D+++ +   H    D KE AV + + AGL++ C     D +       +
Sbjct: 341 FGFRGYIVSDSDALEYLFSKHHTAADMKE-AVYQAVMAGLNVRCTFRSPDSFVLPLRELI 399

Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL--GKNNICNPQHIELAAEAARQGI 379
           ++G+I  + ID  +  +  V    G FD +P   NL      + + ++  +A +A+RQ I
Sbjct: 400 REGRIPMSVIDRLVGDILRVKFITGIFD-NPYQMNLKAADQEVNSERNQAVALQASRQSI 458

Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV--- 436
           VLLKN +  LPL+   ++ + + GP+A+     + +Y       T+ ++G     KV   
Sbjct: 459 VLLKNQDRLLPLDRSKLRRILVCGPNADDASYALTHYGPLAVDVTTVLEGI--RDKVENN 516

Query: 437 --INYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
             ++YA GC D+V                Q    I  A+  AK +D  ++V G +     
Sbjct: 517 IEVSYAKGC-DVVDPHWPESEIIGYPMTSQEQQDIDHAVALAKESDVAIVVLGGNSRTCG 575

Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
           E K R  L LPG Q +L+  V    K PV LV+++   + +N+A  +  I +I+   YPG
Sbjct: 576 ENKSRSSLDLPGRQLDLLKAVQATGK-PVVLVLINGRPLSVNWA--DRFIPAIVEAWYPG 632

Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP--- 596
            +GG A+ADV+FG YNPGG+L +T +  +  +IP+ + P +P +   G       G    
Sbjct: 633 SQGGTAVADVLFGDYNPGGKLTVT-FPKSVGQIPF-NFPSKPASQVDGGNKLGLQGNASR 690

Query: 597 ---VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
               +Y FG+GLSYT FKY         +++L K+     +N ++             ++
Sbjct: 691 INGALYSFGHGLSYTTFKYS--------NLRLSKETMT--LNDSI-------------NI 727

Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVG 712
            C         +V N G  +G EVV +Y +       T+ K + G++R+ +  G++  + 
Sbjct: 728 SC---------DVSNTGDREGDEVVQLYIRDVISSVTTYEKNLRGFDRIHLKPGETKTLT 778

Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVG 742
           FT+   + LK+V+     ++  G   I++G
Sbjct: 779 FTIKP-EHLKLVNKDFEKVVEPGEFKIMIG 807


>gi|257051950|ref|YP_003129783.1| glycoside hydrolase family 3 domain protein [Halorhabdus utahensis
           DSM 12940]
 gi|256690713|gb|ACV11050.1| glycoside hydrolase family 3 domain protein [Halorhabdus utahensis
           DSM 12940]
          Length = 783

 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 199/697 (28%), Positives = 326/697 (46%), Gaps = 99/697 (14%)

Query: 86  PGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGR 145
           PG T FP  I   ++++ +L + I  ++     A+       +   SP ++V RD RWGR
Sbjct: 121 PGGTIFPQSIGLASTWSPALVESITDSIRKRLAAV-----GAVQALSPVLDVSRDMRWGR 175

Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
           V ET GEDP +VG     YV GLQ+        D D     I A  KH+AA+   + EG 
Sbjct: 176 VEETYGEDPQLVGALGAAYVSGLQN--------DGDG----IDATLKHFAAHG--SGEGG 221

Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
            +     ++ E++++E  + PFE+ + E D  +VM +Y+ ++G+P  +   LL   +RG+
Sbjct: 222 -KNRSSVQIGERELREVHLYPFEVAIREADARAVMNAYHDIDGVPCASSEWLLTDVLRGE 280

Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQ 323
           W F G++V+D  S+  +   H  + DT+ +A    L+AGLD++    D Y    + AV+ 
Sbjct: 281 WGFDGHVVADYFSVDLLKTEHG-IADTQREAGVAALEAGLDIELPATDCYGENLLKAVED 339

Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLK 383
           G+++EA +DT++R +    +  G FD                +  ELAA AAR+ + LL+
Sbjct: 340 GELSEATVDTAVRRVLRAKIESGVFDDPYVDPEAASEPFDTDEQTELAARAARESMTLLE 399

Query: 384 NDNGALPLNTGNIKTLALVGPHANATKAMIGNY---------EGTPCRYTSPMDGFYAYS 434
           ND+  LPL   ++ ++ALVGP A+  +A +G+Y         E       +P D   A  
Sbjct: 400 NDD-LLPLAGEDLDSVALVGPQADDGRAQVGDYTHAARFDTEEDGDFECVTPRDALEAKG 458

Query: 435 KV----INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL----------------D 474
           +     + Y  G A +   +     AA +   +AD  V   G                 D
Sbjct: 459 ETAGFDVEYVEG-ATMTGPSTEEFDAAEETVADADVAVACVGARSDIDFADRENPSELPD 517

Query: 475 LSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI-NFAKNNPKIKSIL 533
           +    E  D  DL LPG Q ELI+++A+    P+ +V +S     I   A+  P   ++L
Sbjct: 518 VPTSGENCDVTDLELPGVQAELIDRLAE-TDTPLVVVQVSGKPHAIPEIAETVP---ALL 573

Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA-NYVKIPYTSMPLRPVNNFPGRTYKF 592
               PG+ GG AIADV+FG+YNP G LP++  ++     + Y+  P     N     + +
Sbjct: 574 HAWLPGQAGGTAIADVLFGEYNPSGHLPVSIPKSVGQQPVYYSRKP-----NSANEEHVY 628

Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
            DG  +Y FG+GLSYT F+Y      +                    T +P  +      
Sbjct: 629 MDGEPLYSFGHGLSYTDFEYGELELEEG-------------------TVEPMGS------ 663

Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKV 711
                   +  + V N G+  G +VV +Y      +    +++++G+ERV +  G+S +V
Sbjct: 664 -------LSASVTVTNAGERAGDDVVQLYQHAENPSQARPVQELLGFERVHLEPGESKRV 716

Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
            FT +A + L   D   +  +  G + + VGE    +
Sbjct: 717 TFTFDATQ-LAYYDLNMHLAVEEGPYELRVGESAAEI 752


>gi|262383061|ref|ZP_06076198.1| glycoside hydrolase family 3 [Bacteroides sp. 2_1_33B]
 gi|262295939|gb|EEY83870.1| glycoside hydrolase family 3 [Bacteroides sp. 2_1_33B]
          Length = 758

 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 184/603 (30%), Positives = 299/603 (49%), Gaps = 63/603 (10%)

Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
           ++P +++ RD RWGRV+E  GEDPY+    A   V G Q   G ++   +D     + AC
Sbjct: 162 FAPMVDISRDARWGRVMEGAGEDPYLGSLIAKARVEGFQG--GNDWRSLADVN--TVLAC 217

Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
           CKH+AAY      G D   +++    Q+    + +P  +   E  V++ M S+N +NG+P
Sbjct: 218 CKHFAAYGAAE-AGRD---YNTSELSQNTLMNYYMPPYLAAKEAGVATFMASFNEINGVP 273

Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-C 309
           +  +  L+   +R DW F+G++V+D   I  +V +H  + + KE A      AG+D+D  
Sbjct: 274 STGNKWLMTDLLRKDWGFNGFVVTDYTGINEMV-AHSIVRNDKE-AGELAANAGIDMDMT 331

Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQY--KNLGKNNICNPQH 367
           G  Y+ + + +V++GK++E +I+ ++  +  +   LG FD   +Y      KN I  P+ 
Sbjct: 332 GGIYSQYLVQSVKEGKVSEENINRAVASILEMKFLLGLFDDPYRYLDNEREKNTIMKPEF 391

Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGP------HANATKAMIGNYEGTPC 421
           ++ A E + + IVLLKNDN   P++     T+AL+GP      + N   A  G  E +  
Sbjct: 392 LQEARETSARSIVLLKNDNNFFPISKDKHITVALIGPMVKDKINQNGEWAGRGEREESIS 451

Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
            +    + +   +    YA GC D++  ++S    AI  A+ AD  +   G D +   E 
Sbjct: 452 LFEGLTEKYAGTNVKFIYAEGC-DLLTDDSSKFAEAIATARRADIVLAAMGEDFNWSGEA 510

Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
             R +L LPG Q  L+ ++    K P+ L++++   +D+++   N  +  IL   Y G  
Sbjct: 511 ACRTNLKLPGAQQALLKELKKTGK-PLGLILVNGRPLDLSW--ENQHVDGILEAWYLGTM 567

Query: 542 GGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPL-RPV-NNFPGRTYK--FFDGP 596
            G  +ADVI G YNP  RL +++      + + Y   P  RPV    P   YK  + D P
Sbjct: 568 AGHGMADVISGDYNPSARLTMSFPRTVGQLPLYYNQKPTGRPVPPEAPDTDYKSRYMDVP 627

Query: 597 --VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
              +YPFGYGLSYT F            +KLD++      ++T G               
Sbjct: 628 NTPLYPFGYGLSYTTFAVN--------SMKLDQN------SFTKGG-------------- 659

Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
               K T   EVEN GK+DG  VV +Y +   G     +K++ G+E+V + AG+  +V F
Sbjct: 660 ----KITVTAEVENTGKVDGETVVQMYIRDLAGSVTRPVKELKGFEKVTLKAGEKKQVSF 715

Query: 714 TMN 716
           T++
Sbjct: 716 TID 718


>gi|336399370|ref|ZP_08580170.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
 gi|336069106|gb|EGN57740.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
          Length = 862

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 158/459 (34%), Positives = 232/459 (50%), Gaps = 43/459 (9%)

Query: 5   IKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALH 64
           + V     PY D  L +  RAKDL  R+TL EK   M D++  +PRLG+  + WWSEALH
Sbjct: 16  VGVNAQQSPYQDPGLSFEARAKDLCSRLTLEEKASLMCDVSPAIPRLGIKPFNWWSEALH 75

Query: 65  GVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
           G +  G                  T FP  I   ASFN ++  ++    S EAR  YN  
Sbjct: 76  GYANNG----------------DVTVFPEPIGMAASFNPTMVYQVFTATSDEARGKYNQS 119

Query: 125 NA---------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
            A          L+ W+PN+N+ RDPRWGR  ET GEDPY+     +  V+GLQ  E  +
Sbjct: 120 MAEGKEDTRFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGVEVVKGLQGPESTK 179

Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
           Y         K+ AC KH+A +    +  +     D  ++ +D+ ET++  F+  V +  
Sbjct: 180 YR--------KLYACAKHFAVHSGPEYTRHTANLAD--ISPRDLWETYLPAFKATVQQAG 229

Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKED 295
           V  VMC+Y R++  P C + +LL Q +R +W F   +VSDC +I     +H   +D    
Sbjct: 230 VREVMCAYQRLDDEPCCGNSRLLQQILRDEWGFRHMVVSDCGAIADFYTNHHVSSDAVH- 288

Query: 296 AVARVLKAGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP-- 352
           A A+   AG D++CG  Y    +  AV++G ++EA++D  +  L      LG  D     
Sbjct: 289 AAAKGTLAGTDVECGFGYAYMKLPEAVRRGLVSEAEVDKHVIRLLKGRFELGVMDDPKLV 348

Query: 353 QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAM 412
            +  +    + +  H +LA   ARQ + LL+N N  LPL  G  + +A+VGP+A     +
Sbjct: 349 SWTKISPKVVDSDAHRQLALNMARQTMTLLQNRNNVLPLAKG--EKIAVVGPNAADGPML 406

Query: 413 IGNYEGTPCRYTSPMDGFYAYS-KVINYAPGCADIVCQN 450
            GNY GTP R T+ ++G  A + K I Y  GC D+V +N
Sbjct: 407 WGNYNGTPSRTTTILEGIRAKAGKDIPYLQGC-DLVNKN 444



 Score =  122 bits (305), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 86/285 (30%), Positives = 129/285 (45%), Gaps = 53/285 (18%)

Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
           AI   +     V V G+   +E E          G DR  + LP  Q + +  +  A K 
Sbjct: 593 AIRQLRGVRTVVFVGGISSKLEGEEMPVHVEGFKGGDRTSIELPAVQRDFLKALKAAGK- 651

Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
             T+V ++     I          +IL   Y GEEGGRA+ADV++G YNPGG+LP+T+Y 
Sbjct: 652 --TVVFVNCSGSAIALTPEVESCDAILQAWYAGEEGGRAVADVLYGDYNPGGKLPVTFYR 709

Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
           +       T +P     +  GRTY++F    ++PFGYGLSYT+F                
Sbjct: 710 ST------TQLPAFDDYSMKGRTYRYFSD-ALFPFGYGLSYTRF---------------- 746

Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
                      +G       A+  D       K T  + V N+GK  G EVV VY +   
Sbjct: 747 ----------AIGKGSLSAPAMKADG------KVTLTVPVSNVGKRTGDEVVQVYVRDVN 790

Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL 731
            A   +K +  + RV + AG+S KV   + A ++  + D+A+N++
Sbjct: 791 DADGPLKSLKAFRRVSLKAGESRKVTIPLTA-ETFSLFDSASNTV 834


>gi|330996729|ref|ZP_08320604.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
           11841]
 gi|329572574|gb|EGG54217.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
           11841]
          Length = 852

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 161/456 (35%), Positives = 238/456 (52%), Gaps = 46/456 (10%)

Query: 14  YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
           + D K P  ER  DL+ R+T+ EK+  + + A  + RLG+  Y   +EALHGV       
Sbjct: 29  FRDMKAPQHERIMDLLSRLTVEEKISLLVNDAPAIGRLGIDKYNHGNEALHGVV------ 82

Query: 74  NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
              PG          T FP  I   A +N  L  +I   +S EAR  +     G      
Sbjct: 83  --RPGDF--------TVFPQAIGMAAMWNPELLYRISSAISDEARGRWKELEYGKKQIAG 132

Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
               LTFWSP +N+ RDPRWGR  ET GEDPY+ G   + +V+GLQ          +  R
Sbjct: 133 ASDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGVLGVAFVKGLQG---------NHPR 183

Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
            LK  +  KH+A     N E ++R   +++V+E+D++E ++  FE C+ EG   S+M +Y
Sbjct: 184 YLKTVSTPKHFAV----NNEEHNRSSCNAKVSERDLREYYLPSFERCITEGKAQSIMMAY 239

Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
           N VN +P   +  L+   +RGDW F+GYIVSDC + + ++  H ++  T+E A    +KA
Sbjct: 240 NAVNDVPCTVNTYLIKNVLRGDWGFNGYIVSDCSAPEWMITKHHYVK-TREAAATLAVKA 298

Query: 304 GLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
           GLDL+CG+  Y    + A +Q  ++EADID++   +    M LG FD   Q  Y  +  +
Sbjct: 299 GLDLECGNQVYGEGLLKAYRQYMVSEADIDSAAYRILRGRMMLGLFDDPSQNPYNQIEPS 358

Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
            +    H +LA EAARQ +VLLKN +  LPLN   +K++A+VG   +A     G+Y GTP
Sbjct: 359 VVGCKAHQDLALEAARQSMVLLKNKDNFLPLNPQKVKSIAVVG--ISAGHCEFGDYSGTP 416

Query: 421 CRY-TSPMDGFYAYSKVINYAPGCADIVCQNNSMIP 455
                + +DG   Y++   +    A  V  +    P
Sbjct: 417 KNEPVTILDGIKQYAEEYGFKVAYAPWVSASEDFEP 452



 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 106/294 (36%), Positives = 152/294 (51%), Gaps = 52/294 (17%)

Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-VD 519
           A   D TV V G++ S+E EG+DR  L LP  Q E I ++      P T+V++ AG+ + 
Sbjct: 601 AAECDVTVAVLGINKSIEREGQDRFTLELPIDQQEFIKELYKV--NPNTVVVLVAGSSLA 658

Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
           +N+   N  + +IL   YPGE+GG A+A+V+FG YNPGGRLP+T+Y +         +P 
Sbjct: 659 VNWMDEN--VPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYYNS------LDEIPA 710

Query: 580 RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
               +  GRTY++F+G  +Y FGYGLSYT+F+YK     K V +  D          TV 
Sbjct: 711 FDNYSVKGRTYQYFEGQPLYEFGYGLSYTKFRYK----SKGVSVARD----------TV- 755

Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH--IKQVIG 697
                              K +F  EV N GK DG EV  VY K P   GT+  +KQ+ G
Sbjct: 756 -------------------KVSF--EVSNTGKYDGDEVAQVYVKYPE-TGTYMPLKQLHG 793

Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVGEGVGGVSF 750
           ++RV I  G+++KV   +   K L+  D      +   G +T +VG     + F
Sbjct: 794 FKRVHIKKGKTSKVTVGVPK-KDLRYWDEQERKFVTPKGEYTFMVGASSEDIKF 846


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.137    0.419 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,620,819,410
Number of Sequences: 23463169
Number of extensions: 566271712
Number of successful extensions: 1242051
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6145
Number of HSP's successfully gapped in prelim test: 1531
Number of HSP's that attempted gapping in prelim test: 1188429
Number of HSP's gapped (non-prelim): 17569
length of query: 758
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 607
effective length of database: 8,816,256,848
effective search space: 5351467906736
effective search space used: 5351467906736
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 81 (35.8 bits)